[00:31] Fact:
[00:33] Doing a 10tb download with rtorrent
[00:33] ....it works
[00:39] I asked Brewster about us beginning the MegaWARC conversion of MobileMe Data into MegaWARC format.
[00:39] It's time to shove that 274tb of horseshit into Usability Village
[00:39] Awwwww yeaaaaaaaaaaah
[00:43] got the Goahead
[00:43] We'll do it over the next week or two.
[01:13] Can I be of assistance with the posterous stuff?
[01:14] with my army :)
[01:14] #preposterous is the channel for it
[01:15] * #preposterus
[01:15] thanks
[01:19] jk[SVP]: You can bring it back up to super crazy
[01:19] kennethre: Come to the Internet Archive Lunch, Friday
[01:19] -----------------------------
[01:20] IF YOU'RE IN SF, COME TO THE INTERNET ARCHIVE LUNCH, FRIDAY, NOON
[01:20] I AM GOING TO BE THERE
[01:20] FREE MAKEOUT SESSION IN THE MACHINE ROOM
[01:20] -----------------------------
[01:20] if only I was in SF.
[01:20] SketchCow: sounds like a plan :)
[01:20] I am speaking for 30 minutes at your little con.
[01:20] Oh man I wish I was in SF.
[01:20] SketchCow: working on autoscaling groups
[01:21] SketchCow: are you going to the github afterparty?
[01:22] jk[SVP]: No problem, just wanted to give the all clear.
[01:22] Turns out that the systems just retry like crazyfuck until the machine backs back.
[01:22] kennethre: I assume so, you guys bought Thursday
[01:22] excellent :)
[01:25] test autoscaling seems to be working. now dialing it up to "super crazy"
[02:02] http://archive.org/search.php?query=collection%3Asecretservicemagazine&sort=-publicdate
[02:10] http://blog.teara.govt.nz/wp-content/uploads/2013/02/the-clown.jpg
[02:16] * SketchCow watches GLaDOS and jk[SVP] turning both Posterous downloads to full and battle it out: http://www.slipups.com/images/items/469.full.jpg
[03:19] are there any working programs for downloading all of a flickr user's photos?
[03:21] https://github.com/linpc/Evelyn-Flickr-Downloader appears to be working
[03:21] I'm using site:github.com more and more
[03:23] github rocks
[03:24] just throwing that out there :(
[03:26] may i ask a silly question. archiving the net is a lare task. how do you decide what gets targeted and when?
[03:27] I believe that Geocities gave their own end of life date
[03:27] hahaha
[03:27] as did some of the other larger projects
[03:27] fireangle, never forget
[03:27] big corporate changes put sites in the high-risk category like opera.com right now
[03:28] angelfire
[03:28] haha
[03:28] I assume people just back up what they like most of the time, though
[03:28] yes, people backing up what they like is one thing..
[03:28] but a systematic wget/raping is another
[03:29] assuming the detect rescource hogs and blacklist.. it could get difficult to meet a given deadline
[03:29] im just curious is all..
[03:29] send a NotGooglebot UA ;)
[03:30] googlebot/hideyokids-hideyowife
[03:30] theys downloadin everythin around here
[03:30] hahaha
[03:31] i am making the assumption this channel has some sort of organization around its goals, once they are determined.
[03:32] it's rather ad-hoc
[03:32] but yes
[03:32] Sorry, I was missing out.
[03:32] What's all up in this?
[03:33] Oh, here we go.
[03:33] OK.
[03:33] In list of priority.
[03:33] 1. Sites announcing they're going down (Dying)
[03:33] 2. Sites that look "ill", unkept, but have been around a long time, with user content involved in a big way (Life Support)
[03:33] 3. Sites that are relatively easy to snag a little backup, because if they went the world would be sad (Mental Ward)
[03:34] 4. Things that cropped up and look like they're going to be ripped down or collapse under the weight of attention
[03:35] 5. Jennifer Lawrence
[03:35] So is that clear enough?
[03:35] I was with you until #5
[03:35] She's low priority, dude
[03:36] well, ok.
[03:36] also she makes me feel old :|
[03:36] Did you see the Nicholson GIF? (-bs)
[03:54] grrrrrrr
[03:54] can't rar this raw avi capture
[03:54] last one was fine but this one causes a shit fit
[04:04] was searching for old dwarf fortress versions removed from bay12games.com/dwarves/ (which blocks iarchiver due to an old bug which was fixed years ago yet toady doesn't want to remove the block :( )
[04:04] and i found someone's bash_history on github
[04:06] VonGuard: you can just give my the avi videos without zipping or raring them
[04:07] well
[04:07] won't UG delete the torrent?
[04:07] if it's not rar'd?
[04:07] i don't think they delete torrents that are not rar
[04:07] i have video torrents on there that have not being zip/rar
[04:08] ok
[04:08] ok, then i will set it up.
[04:08] same for pdfs lot of the time
[04:08] gimme about 10 minutes
[04:09] ?
[04:09] you watch em on youtube yet/>
[04:09] one's all about Sega spending millions on arcades and theme park arcades
[04:09] oy
[04:09] i have not
[04:10] trying to use all bandwidth to grab g4tv.com
[04:10] i found tons of missing videos
[04:10] it was not missing cause my xml dump
[04:10] just there was another xml dump that has videos in images.g4tv.com/videoDB
[04:13] gj!
[04:23] i may just download all flv7 videos in this xml dump i'm doing
[04:24] just download all of and post it with a g4tv.com-video#id-flv7
[04:24] this is also my plan with flvhd vidoes
[04:56] i think the raw video file is corrupted in some way
[04:56] everything chokes on it
[04:56] but i exported and converted it fine
[05:10] ok
[06:33] good news on the g4tv.com videos
[06:34] the flv7 will help just find the missing flv.flv
[06:35] so you may get the sd res of the originals flv
[07:02] Can someone create a sub-collection for closedsolaris so we can start uploading files
[07:03] or can we just stick them in Archive Team and they get moved later?
[07:07] As I get time, I put stuff in places.
[07:07] saved missing video: https://archive.org/details/g4tv.com-video23112
[09:22] So I got 400 Posterous sites downloaded on the Warrior before they blocked me - how do I stop them blocking me?
[09:25] You don't, unfortunally. The only way around is to use another IP or wait a long time.
[09:26] Bugger, ok thanks!
[09:26] Yeah, it sucks :-(
[09:27] A few fellows have sat up some machines on Amazon Web Services and switches IPs there - though that does cost money and so