[00:44] does anyone here mirror all the repos on github? or have the disk space for them? [00:44] many of the unpopular ones vanish into the ether [00:46] I've considered starting a mirror (I wrote a github-backup that can find related clones, etc) [00:46] someone is running a scrape of the API that gets all newly created repos and publishes the list [00:46] yeah, I saw that a while ago [00:47] are you sharing objects between related repos? [00:47] this one? https://github.com/joeyh/github-backup [00:51] it used to be easier to find all of the repos but github broke navigating past page 10 a few months ago [00:51] I reported it and they got rid of the page > 10 links [00:51] hahahah typical [00:52] :[ [00:56] Internet Archive is giving me "Upload Error: 417" today in Chrome with the flash uploader. I can use IE to upload files though. Anyone know anything about this error? I have using IE... [00:56] * I hate using IE [02:12] ivan`: it makes clones into git remotes, so they're all in the same repo and share objects [02:13] the github api still supports pagination afaik.. dunno how deep, as the interface library I'm using does not do pagination yet [02:36] Swizzle: Try http://archive.org/upload [02:36] Beta html5 uploader [02:36] Chrome only [02:38] Yea... I found that new uploader but didn't like that I can only do 1 file at a time and then need to add the next files through the edit files page [02:41] ah [03:21] joepie91: fanfiction net recently started enforcing the M rating limits, and pulling anything that goes above them, or is too explicit, arbitrarily, as of june 23 [06:26] Hiiiii [06:34] 'ello! [06:35] ...huh, is there a name length for this server? [06:35] Anyway, just popping in to say that you guys are awesome! [06:40] *scratches head* [06:41] drive by ello's [06:41] it's a step up from drive-by spamming [06:49] lol [06:57] Just updated the wiki about the file formats things [06:57] It's getting hysterical, ALL sorts of people coming out of the woodwork, with raised fists in solidarity or open fingers in skepticism [06:57] Just like I like it. [06:59] yep, it's perfect [07:50] SketchCow: You sure know how to give the ol' pot a good stir [09:45] Hm, now I'm getting Upload Error 417 in IE as well as Chrome. Guess it was time for a break anyways [11:05] ivan`, do you have a link to list of new github projects? [11:14] nope, sorry [11:15] per-language I used to get them like this https://github.com/languages/Java/created [14:04] Had a great nap [14:09] http://davetaz-blog.blogspot.com/2012/07/jason-scott-and-archive-team.html [14:09] Changed a mind [14:29] awesome nap if you changed his mind with that [14:30] zzzzzzzz AND NOW DO YOU SEE [14:30] YES [14:30] I SEE IT! [14:31] EXCELLENT zzzz [14:31] now I lost it, stop waking it up [14:35] Here we go, 815gb of fan fiction going into archive.org [14:36] That should take about a day or so. Maybe less, depending on things. [14:56] if only I had a blog so I could paint SketchCow in a god-like light [15:23] BlueMax: because a blog is hard to get right? ;) [15:23] "BlueMaxima's Blog, Post 1: SketchCow is my god" [15:23] "End blog" [15:25] :) [15:30] Google is doing some spring cleaning in the middle of summer, announcing it will shutdown five more services, including iGoogle. Fans of Google's widget-based homepage have a little over year to find a replacement. [15:31] http://www.webmonkey.com/2012/07/google-shuts-down-igoogle/ [15:46] Oh, not SO bad.... the Fan Fiction is up to #5, and that's after an hour. [15:46] So figure another couple hours, we'll have it all up there. [15:54] wow, 815 GB? [15:54] huh [15:54] oh wait, right [15:54] I set up the tracker to only count uncooked data [15:55] SketchCow: if that's going to be public, I'd like to add some text about what the cooked WARCs are and why they exist [16:09] I see that Google Video is going away again, http://youtube-global.blogspot.com/2012/07/google-video-content-moving-to-youtube.html [16:17] http://archive.org/details/archiveteam-fanfiction-00000000 [16:24] Next, PICPLZ [16:30] IA's about to piss off a ton of ficcers [16:30] oh well :D [16:30] ? [16:31] Oh, because of that? [16:31] So wait. Is fan fiction still up? [16:31] yes [16:31] Oh, well fuck, bussy. [16:31] buddy [16:31] in my experience, people in fandoms tend to be strangely sensitive to redistribution [16:31] Didn't know that. I will set them dark after they finish ingesting. [16:31] no problem [16:31] Well, yes, understandably. [16:31] So let me finish uploading them, and then we'll tuck those little bastards away. [16:32] it's still up but they did a censorship purge after we archived it [16:32] There's a LOT of projects in the pipeline - sometimes I miss which are pre-emptive. [16:32] I'll make them all un-indexed, then. [16:32] that's fine [16:32] Available, but not responding to search engines. [16:33] later on, we can diff what we have vs. what's available on fanfiction.net now and go from there [16:33] or something [16:33] Yes, analysis is the big project for the summer [16:33] We need to clean up the archiveteam group on archive.org [16:33] It's a tad of a mess [16:33] Which is fine. [16:33] we have computers for that! [16:33] Actually, this reminds me [16:33] I've set up a channel. #nowwhat [16:34] oh good [16:34] It's the back-end of all this, where we'll code in and discuss cleanup and analysis [16:34] concidentally I just got an idea for an analysis tool [16:50] fileplanet is so-so at the moment, i got contacts into the mothership now but after a first enthusiastic contact i now am talking to someone else and he's like yeah well poop do what you can, we dont really help. not sure what will happen [16:51] we got ~7 of ~9 terabytes done [16:51] the last 2 are "hidden" and i am trying to get them to tell us the URLs if we cant get ftp or rsync instead [16:51] end of status psa [17:23] the s3 api at archive.org doesn't support torrents like the real s3 does it? [17:25] e.g. slapping ?torrent on the end of something like http://s3.us.archive.org/marc_loc_updates/v35.i04.records.xml?torrent [17:50] No [17:50] They go after it a different way. [21:12] with regards to the tars of the fanfiction archives, are the id numbers stories or profiles? [23:25] Anyone heard of Wakoopa? http://blog.wakoopa.com/post/24878499948 [23:25] + https://twitter.com/nevolution/status/221021365921263616 [23:32] never heard of them [23:44] Is it safe to delete data/uploaded (memac project)? [23:58] inky: I just checked out Wakoopa Social. It's an interesting concept.