[00:14] http://www.engadget.com/2012/06/02/picplz-shutting-down-permanently-on-july-3rd-all-photos-to-be-d/ [00:17] three [00:17] BlueMax: #piczzz [00:18] * Coderjoe laughs heartilly while lightning flashes and thunder rolls in the background [00:19] i bet it gets annoying, but hey we're paying attention right? :) [00:19] sorry, didn't know if it had been posted already or not [00:19] no prob [00:20] I just am playing the roll of The Count today [00:20] One, Two, THREE observant IRCer's, mua-ha-ha [00:21] it is better that several people mention it (even if it gets repetitive) than nobody notice it [00:21] yup, as said guy who mentioned it also, i agree [00:24] alard: is there a channel for the AT universal tracker? [06:50] WELL THAT IS 400 MILES OF TRAVEL I AM NOT GETTING BACK [06:51] hey SketchCow what happened [06:51] Oh, good interview. [06:52] But San Jose --> SF --> Sacramento --> SJ in 12 hours was pushing it [06:58] Don't op him again until he apologizes. [07:17] http://statusboard.archive.org/ is down [09:02] blooper [09:03] a guy on the real news from blaze called some shitburger [09:24] i wonder if there is a way to import wikipedia dump in to like git? [09:24] yes, there is [09:24] take a mediawiki xml dump, run it through a tool I made, and then convert from cvs to git [09:25] where is this tool? [09:25] 1sec [09:26] https://github.com/chronomex/wikiscraper [09:27] here's the instructions for step 3: http://xrtc.net/f/projects/rcs-to-git.shtml [09:36] my code breaks on pages that contain the / character [09:36] so watch out for that [09:36] er, page names [10:29] https://plus.google.com/u/0/118372134229901336648/posts/CdKUrgKipfF [10:29] Dunno if any of you guys are interested... [13:24] Any other critical things? Going on plane, may or may not pay for internet. [13:25] DFJustin: Statusboard down known issue, will be fixed soon. [16:18] SketchCow: should i just dcc the 500mb file? [17:41] What? No. [17:47] i figured not. [18:02] TARing of of Tabblo has begun. [18:02] don't forget to feather them too. *highfive?* [18:03] * LordNlptp rolls eyes [18:06] Looks like Tabblo is just 450gb [18:07] Adorable. [18:37] who besides alard did run crawls to find URLs of me.com(/fortunecity/etc)? [18:39] SketchCow: should i just burn the .tgz to a dvd and send it snail mail? [18:39] what is this I don't even [18:39] print out punch cards, won't you [18:42] What are you talking about? [18:42] What .tgz is this? [18:42] * SketchCow has been a bit distracted. [18:46] HELLO I AM IN THE MOST ERGONOMICALLY UNPLEASANT SITUATION [18:48] SketchCow: http://evansheline.com/wp-content/uploads/2012/02/ergonomics-close-enough.png [18:49] SketchCow: the tar.gz or .tgz of the 518mb of geocities stuff [18:52] Sent you credentials. [20:02] awesome, there is some game (asset and source) preservation project in the works. and that is all i can say about that. [20:05] nice. i wish someone would yell at EA to do that. [20:06] lot of old games (like the origin stuff and possibly the bullfrog stuff) the source is lost for [20:06] same with old RARE stuf [20:06] f [20:52] https://picplz.com/login/?next=/yourphotos/ [20:52] On July 3, 2012, picplz will shut down permanently. [20:52] seen? [20:52] #piczzz yup [21:39] Hi. Anyone wants to help by testing this: https://github.com/alard/warc-proxy ? [21:39] It's a first attempt to make WARC files more usable. [21:42] Any suggestions, ideas are welcome, of course. [21:43] ooh, awesome effort mate [21:47] alard: Works awesomely for me [21:47] I'm able to browse around that warc just fine [21:48] and I'm running python 2.6.5 on Ubuntu 10.04 :] [21:50] alard: sweet. [21:51] if you're up to speed with warc (I'm not ...), an idea I've been kicking around is a proxy that dumps everything to .warc [21:51] that's a damned cool idea [21:51] "what WAS that website?" [21:51] you could combine the two, such that you cruise a .warc file and missing resources get loaded live and dumped into another warc [21:51] yeah, why not save everything! [21:54] i wish google left their 15th anniversary search thing up [21:54] that was really handy [21:58] What was that? Historical search? [21:58] yeah, they had a search engine that used a very stale index [21:59] Buth 15th anniversary? [21:59] *But [21:59] er maybe 10th anniversary, i forget [21:59] it was a few years ago [21:59] and was really useful [22:00] 10th, that's more like it [22:00] it searched an 7 or 8 year old copy of the google db and showed sites linked to archive.org nearest the index date [22:00] was really REALLY handy [22:00] was it this thing? http://www.google.com/search2001.html [22:00] yep [22:00] yes that thing [22:01] that was SO handy [22:01] i used it to find a lot of emulation and gameboy stuff that had vanished from the net [22:01] on archive.org [22:01] well, it's not nearly complete but here? http://blogoscoped.com/archive/2008-11-01-n41.html [22:01] #archiveteam-bs [22:02] well in 2013 it will be the 15th anniversary, maybe they'll bring it back permanently [22:33] pidgin keeps giving me connection refused? and appeaantly xchat had to cycle through almost every efnet server before it found one that going away that works [22:40] #efnet-blues [23:40] who (besides alard) did crawl for urls with ip6?