#archiveteam 2012-07-05,Thu

↑back Search

Time Nickname Message
00:44 🔗 ivan` does anyone here mirror all the repos on github? or have the disk space for them?
00:44 🔗 ivan` many of the unpopular ones vanish into the ether
00:46 🔗 closure_ I've considered starting a mirror (I wrote a github-backup that can find related clones, etc)
00:46 🔗 closure_ someone is running a scrape of the API that gets all newly created repos and publishes the list
00:46 🔗 ivan` yeah, I saw that a while ago
00:47 🔗 ivan` are you sharing objects between related repos?
00:47 🔗 ivan` this one? https://github.com/joeyh/github-backup
00:51 🔗 ivan` it used to be easier to find all of the repos but github broke navigating past page 10 a few months ago
00:51 🔗 ivan` I reported it and they got rid of the page > 10 links
00:51 🔗 chronomex hahahah typical
00:52 🔗 balrog :[
00:56 🔗 Swizzle Internet Archive is giving me "Upload Error: 417" today in Chrome with the flash uploader. I can use IE to upload files though. Anyone know anything about this error? I have using IE...
00:56 🔗 Swizzle * I hate using IE
02:12 🔗 closure_ ivan`: it makes clones into git remotes, so they're all in the same repo and share objects
02:13 🔗 closure_ the github api still supports pagination afaik.. dunno how deep, as the interface library I'm using does not do pagination yet
02:36 🔗 underscor Swizzle: Try http://archive.org/upload
02:36 🔗 underscor Beta html5 uploader
02:36 🔗 underscor Chrome only
02:38 🔗 Swizzle Yea... I found that new uploader but didn't like that I can only do 1 file at a time and then need to add the next files through the edit files page
02:41 🔗 underscor ah
03:21 🔗 bsmith094 joepie91: fanfiction net recently started enforcing the M rating limits, and pulling anything that goes above them, or is too explicit, arbitrarily, as of june 23
06:26 🔗 SketchCow Hiiiii
06:34 🔗 Mister_Ar 'ello!
06:35 🔗 Mister_Ar ...huh, is there a name length for this server?
06:35 🔗 Mister_Ag Anyway, just popping in to say that you guys are awesome!
06:40 🔗 instence *scratches head*
06:41 🔗 instence drive by ello's
06:41 🔗 yipdw it's a step up from drive-by spamming
06:49 🔗 chronomex lol
06:57 🔗 SketchCow Just updated the wiki about the file formats things
06:57 🔗 SketchCow It's getting hysterical, ALL sorts of people coming out of the woodwork, with raised fists in solidarity or open fingers in skepticism
06:57 🔗 SketchCow Just like I like it.
06:59 🔗 chronomex yep, it's perfect
07:50 🔗 ersi SketchCow: You sure know how to give the ol' pot a good stir
09:45 🔗 Swizzle Hm, now I'm getting Upload Error 417 in IE as well as Chrome. Guess it was time for a break anyways
11:05 🔗 omf_ ivan`, do you have a link to list of new github projects?
11:14 🔗 ivan` nope, sorry
11:15 🔗 ivan` per-language I used to get them like this https://github.com/languages/Java/created
14:04 🔗 SketchCow Had a great nap
14:09 🔗 SketchCow http://davetaz-blog.blogspot.com/2012/07/jason-scott-and-archive-team.html
14:09 🔗 SketchCow Changed a mind
14:29 🔗 ersi awesome nap if you changed his mind with that
14:30 🔗 SketchCow zzzzzzzz AND NOW DO YOU SEE
14:30 🔗 ersi YES
14:30 🔗 ersi I SEE IT!
14:31 🔗 SketchCow EXCELLENT zzzz
14:31 🔗 ersi now I lost it, stop waking it up
14:35 🔗 SketchCow Here we go, 815gb of fan fiction going into archive.org
14:36 🔗 SketchCow That should take about a day or so. Maybe less, depending on things.
14:56 🔗 BlueMax if only I had a blog so I could paint SketchCow in a god-like light
15:23 🔗 SmileyG BlueMax: because a blog is hard to get right? ;)
15:23 🔗 BlueMax "BlueMaxima's Blog, Post 1: SketchCow is my god"
15:23 🔗 BlueMax "End blog"
15:25 🔗 SmileyG :)
15:30 🔗 BlueMax Google is doing some spring cleaning in the middle of summer, announcing it will shutdown five more services, including iGoogle. Fans of Google's widget-based homepage have a little over year to find a replacement.
15:31 🔗 BlueMax http://www.webmonkey.com/2012/07/google-shuts-down-igoogle/
15:46 🔗 SketchCow Oh, not SO bad.... the Fan Fiction is up to #5, and that's after an hour.
15:46 🔗 SketchCow So figure another couple hours, we'll have it all up there.
15:54 🔗 yipdw wow, 815 GB?
15:54 🔗 yipdw huh
15:54 🔗 yipdw oh wait, right
15:54 🔗 yipdw I set up the tracker to only count uncooked data
15:55 🔗 yipdw SketchCow: if that's going to be public, I'd like to add some text about what the cooked WARCs are and why they exist
16:09 🔗 jetlag_ I see that Google Video is going away again, http://youtube-global.blogspot.com/2012/07/google-video-content-moving-to-youtube.html
16:17 🔗 SketchCow http://archive.org/details/archiveteam-fanfiction-00000000
16:24 🔗 SketchCow Next, PICPLZ
16:30 🔗 yipdw IA's about to piss off a ton of ficcers
16:30 🔗 yipdw oh well :D
16:30 🔗 SketchCow ?
16:31 🔗 SketchCow Oh, because of that?
16:31 🔗 SketchCow So wait. Is fan fiction still up?
16:31 🔗 yipdw yes
16:31 🔗 SketchCow Oh, well fuck, bussy.
16:31 🔗 SketchCow buddy
16:31 🔗 yipdw in my experience, people in fandoms tend to be strangely sensitive to redistribution
16:31 🔗 SketchCow Didn't know that. I will set them dark after they finish ingesting.
16:31 🔗 yipdw no problem
16:31 🔗 SketchCow Well, yes, understandably.
16:31 🔗 SketchCow So let me finish uploading them, and then we'll tuck those little bastards away.
16:32 🔗 DFJustin it's still up but they did a censorship purge after we archived it
16:32 🔗 SketchCow There's a LOT of projects in the pipeline - sometimes I miss which are pre-emptive.
16:32 🔗 SketchCow I'll make them all un-indexed, then.
16:32 🔗 yipdw that's fine
16:32 🔗 SketchCow Available, but not responding to search engines.
16:33 🔗 yipdw later on, we can diff what we have vs. what's available on fanfiction.net now and go from there
16:33 🔗 yipdw or something
16:33 🔗 SketchCow Yes, analysis is the big project for the summer
16:33 🔗 SketchCow We need to clean up the archiveteam group on archive.org
16:33 🔗 SketchCow It's a tad of a mess
16:33 🔗 SketchCow Which is fine.
16:33 🔗 yipdw we have computers for that!
16:33 🔗 SketchCow Actually, this reminds me
16:33 🔗 SketchCow I've set up a channel. #nowwhat
16:34 🔗 yipdw oh good
16:34 🔗 SketchCow It's the back-end of all this, where we'll code in and discuss cleanup and analysis
16:34 🔗 yipdw concidentally I just got an idea for an analysis tool
16:50 🔗 Schbirid fileplanet is so-so at the moment, i got contacts into the mothership now but after a first enthusiastic contact i now am talking to someone else and he's like yeah well poop do what you can, we dont really help. not sure what will happen
16:51 🔗 Schbirid we got ~7 of ~9 terabytes done
16:51 🔗 Schbirid the last 2 are "hidden" and i am trying to get them to tell us the URLs if we cant get ftp or rsync instead
16:51 🔗 Schbirid end of status psa
17:23 🔗 edsu the s3 api at archive.org doesn't support torrents like the real s3 does it?
17:25 🔗 edsu e.g. slapping ?torrent on the end of something like http://s3.us.archive.org/marc_loc_updates/v35.i04.records.xml?torrent
17:50 🔗 SketchCow No
17:50 🔗 SketchCow They go after it a different way.
21:12 🔗 bsmith094 with regards to the tars of the fanfiction archives, are the id numbers stories or profiles?
23:25 🔗 inky Anyone heard of Wakoopa? http://blog.wakoopa.com/post/24878499948
23:25 🔗 inky + https://twitter.com/nevolution/status/221021365921263616
23:32 🔗 omf_ never heard of them
23:44 🔗 balrog Is it safe to delete data/uploaded (memac project)?
23:58 🔗 arkhive inky: I just checked out Wakoopa Social. It's an interesting concept.

irclogger-viewer