[00:07] What's the user-agent set to – "Jason/Scott/1.0"? [00:16] It's a default Safari UA I believe [01:06] http://www.slate.com/articles/technology/map_of_the_week/2013/03/google_reader_joins_graveyard_of_dead_google_products.html [02:00] * BlueMaxim looks at all these dead Google products and starts crying [02:01] hey guys, i'm back [02:05] any idea why my warrior for yahoo messages hasn't been working? it either says "Starting reaper", "no tasks available", or "Service blocked us 1 times, backing off for 5 seconds".. [03:08] SketchCow: pls do post the url of the HuffPo interview when it's available [04:47] any activity for the phoenix? [04:56] https://archive.org/details/archiveteam_blog_thephoenix [05:02] SketchCow, id go out and get a hdd..or just donate $$ for you for a hdd for punchfork. i just dont want to be responsible for holding the only single copy on 1 hdd for the rest of enterinity and if something happens to it..well shit id feel really bad and it would suck [05:26] any news on the formspring project (particularly if anyone has heard back from tim yet)? [05:27] #firespring is dead as a doorknob [05:43] http://www.techdirt.com/articles/20130317/16534822353/drm-strikes-again-digital-comics-distributor-jmanga-closing-down-deleting-everyones-purchases.shtml [05:43] wp494 i dont think anyone has..i guess we need to look at scraping to at least get started [05:44] what does one yahoo messages "item" correspond to? [05:45] S[h]O[r]T: lol'd [05:51] hdevalenc: perhaps a topic/thread [05:53] @Sh0rt: someone should get on it ASAP in the event of worst case scenario where tim can't get a userlist [06:00] SketchCow, I now have the space for punchfork if you still want another copy out there [07:32] No, wehave a plan. [07:32] They try and make it go crazy [07:32] We just spirit it out [07:35] Oh man, megaupload [07:41] Yahoo! Korea now completely uploaded - thanks everyone. [07:41] yahoo.kp [07:47] Now just verifying Punchfork uploading. [07:47] These 50gb blocks sometimes clog in the system. [07:48] shocked [08:05] Hooray, I've gotten one of the FOS drives to 10 percent. [08:11] 210458.2 / 583096.0 MB Rate: 23787.6 / 3276.1 KB Uploaded: 759839.0 MB [36%] 1d 8:21 [ R: 3.61] [08:11] InternetCensus2012 [08:11] 210gb [08:12] And dude.... 759gb uploaded. What a good citizen. [08:13] like a boss [08:17] Punchfork is so active, we have 39gb of what, as far as I can tell, is stuff stuck on the side of the tubes. [08:17] I'll overlook it for now. [08:18] run little warrior! [08:20] Yeah, I'm happy - we're at the good point now on that drive. It's about 9tb free now. That's a lot of space. [08:21] The other has 7.8tb drives. [08:24] SketchCow: how are dumps usually processed and made accessible in IA? [08:24] Which dumps. [08:24] like of posterous [08:25] Dumps where we've created, say, hundreds of gigabytes of grabs, we split them into 50gb chunks, and upload those into the archive in the form of a "MegaWARC", which is a WARC of WARCS. [08:25] to rephrase the question: what does happen with the archive teams uploads and how do they get integrated into the IA's searchable data stuff [08:26] By calling it a web object, and including the index we do, a process stops by and ingests the data into the Wayback. [08:26] There used to be an embargo between the addition of data and Wayback availability. Current version doesn't have that. [08:26] I'm trying to think of an example. [08:27] I see. So modern web-archeologists can then either download the WARCs or use the wayback machine [08:27] Yes. [08:27] We have conversion utilities for taking these WARCs and making them not be WARCs. To be zips, or browsable. [08:28] I also have seen some local webserver thingy that serves html pages out of a warc somewhere on github [08:29] But that's the current situation. [08:29] We're going to be more aggressive with putting out tools. [08:29] And other "archive the archiving archivers" stuff. [08:29] do you have a list of things you need help with? [08:30] * or want [08:30] Like, projects? Or things? [08:30] whatever I potentially could do from the other side of your world [08:32] I have boring things that I think change the nature of the world, but they're not 100% archive team projects. [08:32] Archive Team's biggest issue is Wiki updating. [08:38] I'd love to do stuff to the wiki, but I have no idea where to start [08:40] chronomex? Any suggestions? [08:41] hi [08:41] what about [08:41] borign thigns to change the world? [08:41] or at wiki janitor stuff? [08:41] just where to start editing/what needs to be done [08:41] hm [08:42] a bunch of the project-specific pages are stale w.r.t. status, disposition, etc [08:42] BORING THINGS TO CHANGE THE WORLD would be a great slogan at the bottom of a crest. [08:42] yes [08:43] the archiveteam wiki is full of stale info, mostly [08:43] Yes, I'd like it saved off into a way that we can still get to it, but it's marked as completed. [08:44] right [09:06] If the wiki prompts you to fill a captcha for anti-spam there is also a PHP error on the page [09:06] Warning: file_get_contents(/home/archivet/public_html/extensions/SpamBlacklist/wikimedia_blacklist) [function.file-get-contents]: failed to open stream: No such file or directory in /home/archivet/public_html/extensions/SpamBlacklist/SpamBlacklist_body.php on line 123 [09:07] and that was poorly worded, the wiki prompted me to fill out a captcha becuase I included external links and it threw that error on the page [09:10] our wiki? [09:18] http://www.filedropper.com/plutonomyandalgorithms | Professional research [11:24] I wrote the following last night, for the phoenix : http://pastebin.com/Gsm5Qhg2 [11:24] doesn't grab photos.... code isn't the greatest, actually rather poor, but it's a work in progress. [12:50] In WTF news, a hacker turned FBI informant from 30 years ago has sent me a box of Apple II Floppies [12:50] Hilarious Tools Galore [12:53] Also, good news, I have a machine I can WGET 1.14 on [12:53] So more mirrors going in [12:53] Neat. [12:56] SketchCow: this is something you will put on archive.org? The Apple II Floppies/Tools? [12:58] Ultimately, I expect so [12:58] So, since I've partnered with TOSEC, I may begin putting images up on archive.org or another location, to ensure they're publically accessible [12:58] And they can officially enter the pantheon [13:01] so IA is doing maintenance again [13:03] No, there's a problem and we're dealing with it. [13:08] ok [13:09] Has anyone really been far even as decided to use even go want to do look more like? [13:10] Erm, crap, wrong button. [13:10] Has anyone else attempted to grab the Saints Row 3 community items, etc. on http://saintsrow.com? [13:11] GLaDOS, is a cyclon [13:11] the nlp filter failed [13:11] shh [13:19] But yeah, community creations on the Saints Row website [13:19] There is a way to get a search to list them all, but their search function injects the results after pageload [13:23] yeah :( [13:23] I'm trying to figure out the wget for the forums, however those community items are a pita. [15:35] Also Has anyone really been far even as decided to use even go want to do look more like? [15:40] ou've got to be kidding me. I've been further even more decided to use even go need to do look more as anyone can. Can you really be far even as decided half as much to use go wish for that? [16:50] Some here are the last thoughts for them and here are so look like! [16:50] http://www.flickr.com/photos/textfiles/8572533274/in/photostream/lightbox/ [16:59] Since pingmag.jp came back I did a full site grab which has all their old content from 2005-2008 http://archive.org/details/PingmagArtDesignLife [17:27] * SmileyG looks at what RedType and SketchCow said and is so confused. [17:40] SmileyG: it's a meme [17:40] he acked, i syned [17:41] ah ok [17:44] http://knowyourmeme.com/memes/has-anyone-really-been-far-even-as-decided-to-use-even-go-want-to-do-look-more-like [17:47] 4chandata is going again. I am getting 366,720 more images [17:47] :O [17:47] and then the database dump [17:47] i'm currently headbutting against GLaDOS's dedi he is letting me play with [17:47] tim@MushaV3 ~ $ scp ./cookies.txt -v root@37.56.60.160:~ [17:48] not sure why it's not doing anyhting [17:48] Oh, it's a 9, not 6 ¬_¬ [17:49] typos are the mind killer [17:49] yes, yes they are. [17:49] shit, wrong wget on there :S [17:49] time to go to -bs [19:12] ESET just detected a virus from the virtualbox http://i1140.photobucket.com/albums/n567/rand4505/issue.jpg [19:15] ha ha [19:15] Yeah don't browse that posterous site. Viruses ahoy [22:25] freenode -- | RichiH (~richih@freenode/staff/richih): [Global Notice] Hi all. PDPC, freenode’s parent organisation, has been dissolved. Details can be found at http://blog.freenode.net/ with a static copy at http://planet.freenode.net/ [22:25] freenode -- | mrmist (~mrmist@freenode/staff/mrmist): [Global Notice] As a P.S. to the last global, no, we're not dying or going away. If the blog is down, you can read details at https://plus.google.com/b/104326727082310562426/104326727082310562426/posts/CMW4Gst657v thanks for flying freenode! [22:25] huh