[01:42] pl r8 [01:47] pl r8? [01:51] arrith: you wont, you MIGHT get 60 or so [02:26] i'm interested in running the Posterous project. I'm not a Posterous user, so being banned isn't a problem for me. [03:08] im alittle tea pot [03:39] Hey y'all. I just spun up a warrior for the first time. The posterous project says to ask in this channel before running anything. What's up with that? [03:45] hey s4y, its just to make sure you know that you might get banned from posterous, besides that it is safe to run [03:45] checkout the FAQ http://archiveteam.org/index.php?title=Posterous [03:45] doh i see you are in #preposterous now :p [03:46] :D [03:46] Sweet. Thanks! [03:46] I'm down with that. [03:51] WiK: well i might have to rely on others to do the url supplying then. i was hoping opml files would build up a decent list [03:55] arrith, i saw something about *.blogspot.com is there something more specific to specify its google reader [03:56] because isnt it just a lot of random blogs too? [03:57] S[h]O[r]T: yeah it is a lot of random blogs [03:58] all of it being behind a login makes crawling particularly tricky [03:58] but is that really what we are looking for, just anything *.blogspot.com and *.tumblr.com ? [03:59] S[h]O[r]T: yeah ideally to get it all, but *at least* to get the popular stuff [04:06] yeah scraping google/bing is going to net the largest amount. doesnt seem to be much i can pull from passive dns data [06:11] An escaped felon is working for dispatch. heriffs bought this fat stuff to do a job for concord and them to remove anyone they hTE AND TO CHEATR. THIS BITCH WHO HAS CREAMATED CHILDREN IN AFTER HOJURS AT THE CEMETARY MURDERED MROE. white plains escaped mike vicient murders DISAPTCH SECRETARY. victimizing if ou want mroe children murdrede with cocaine and molesgted then do nothing. notify [06:11] newyorkand whiet planes pd thanks she killed animals and pets adn familhy [06:13] Weird. [06:13] OK, 3tb of XANGA is now up at archive.org. [06:13] Go team [06:18] woot [06:19] man, cdroms are slow [06:29] Tell me about it. [06:29] I have to start capturing them soon. [06:29] I basically need to set up a station here. [06:29] While doing other things. [06:30] why does it take 7 minutes, it's only half a gig! [06:32] Any faster and the plastic would shatter. [06:32] That's why. [06:33] Yeah I put a cdrom on a drill and spun it until it broke as a diy way to show yourself what happens [06:33] now there are videos on youtube [06:33] take a picture of the cd and scan it, although i wonder if you did that with some kind of microscope if it would work [06:34] yeah Ik now [06:34] it's still frustrating [06:34] http://www.youtube.com/watch?v=-i6yC6rI2Fw [06:34] At 0:56 it is already no good for reading data. [06:34] You hit an event horizon. [06:36] pwhuumph [06:36] nice sound [10:56] Okay I got a login for that 4chandata. [10:57] An actual database dump is being made as well [11:43] :) [11:45] ^ living up to his name right there [11:51] It is a slow download and he has the ftp server setup wrong. It is not serving the full 400k files in the dir [11:52] but we are still getting it. (⌐■_■) [11:57] I am looking at the images... There are amazing landscapes and breath taking animals, then there are the dregs of humanity [11:57] justin bieber in london? [12:31] he was a few days ago, but why are we discussing that here. [14:20] Government of Canada paring down 1500 websites to 1. http://freegovinfo.info/node/3893 [14:21] wow they are trolling all their citizens [14:22] Archiving Google Reader: http://archiveteam.org/index.php?title=Google_Reader [14:54] hmm, Google Reader itself archives all posts in all feeds that anyone is subscribed to [14:55] I wonder if any online feedreaders can replace *that* functionality :( [15:01] does Google Takeout include archived copies of the blog posts? [15:03] nope, since it's not really the user's data... it only includes shared/starred/liked posts (as JSON), in addition to the OPML file [15:03] ok :( [15:23] hm, the "email" button sends the full text of the article. [15:23] wonder if a chrome extension could just click all the email buttons lol [17:42] woot, 4tb and 2tb arrived at my door today [17:49] insert pr0n joke here [17:50] which by itself can be interpreted as a pr0n joke, heh [18:37] Well, Wizards of the Coast just got two publications darked. [18:37] I mean, I knew they WOULD, but Nemo_bis was on fire when he was uploading a pile of periodicals and I figured, ah, why nitpick. [18:37] So, lawyer wrote in, rawr rawr, magazines down. Dragon Magazine and Polyhedron Newsletter [18:42] Yep, it was quite obvious. :) [18:42] I did it for the sake of archiving, just the other day I was surprised that they had let them get some 500-1000 views per item. ^^ [18:43] I know, right? [18:43] Anyway, still active, still engaged enough to tell people to knock it off. [18:44] I'm curious to see how long it takes for Tex to be DMCA'ed [18:46] I didn't add a lot fo legal/technical overhead with those additions, I hope. [18:47] *of [18:47] No. [18:48] Letter comes in, office manager calls me, we chit-chat a while, and 15 seconds later, gone [18:52] :) [18:53] https://www.wizards.com/dnd/tool.aspx?x=dnd/4new/tool/dragonmagazine [18:54] It lives again!!!! [18:57] Now, I'm going to go shoot 2 terabytes of Yahoo Video into the archive. [19:16] :D [19:16] SketchCow: gone but not lost <£ [19:16] errr <3 [19:17] <# [19:48] * philpem continues playing around with DiscFerret code which may recover some unreadable disks [19:48] Ohhh the Kryoflux guys are going to be eating humble pie after this one ^_^ "Impossible" my ass :D [19:49] haha <3 [19:49] Found a bunch of Amstrad CPC floppies a while ago - the machine they came from had the usual fault for those, a slipping drive belt in the floppy drive. [19:49] So that causes the data rate to shift as the belt shuffles around [19:49] hmm [19:50] Pkdev hit nearly 40%, way over what a PLL can handle. [19:50] clock recover sounds simple enough, then every minute you think about it the problem gets harder [19:50] But with careful filtering, you can remove the data and recover the motor speed signal. [19:50] awesome [19:50] Then you run a moving average to get rid of the last of the data and use it to repair the timing [19:51] lots of lovely floating point, anything less than a Pentagram.. er.. Pentium Pro will hate it XD [19:51] reminds me of how VCRs use a PLL on the colorburst to get physical tape stability to nanosecond resolution - if you're off by 20nsec, the colors are wrong [19:51] * philpem <3 his i5 [19:51] computers ftw [19:52] this is a great example of how throwing DSP at a problem can fix the problem well enough that you can do something that ten years ago would have been unthinkable [19:57] My favorite angry guy! [19:57] Gotta head south to NYC shortly. [19:57] Movie showing in a few hours. [19:58] Your favourite angry guy? Me? [19:58] I'm not angry tonight, I got to laugh at the insanity of work *and* I've got an interview for Something Much Better lined up. It's a good day! [19:59] Sunlight through the clouds! [19:59] I have a lot on my plate currently. [20:00] Did a little bridging bit for DEFCON. [20:00] Neat. [20:01] I really need to get off my ass and do the discferret write tests.... It should work, but who knows? I haven't written the API code yet, lol [20:01] It works on the testbench, but on silicon? Who knows?! [21:11] philpem: there's nothing like using advanced maths to fix a stretched rubber band... [21:14] ats_, especially when you can't fix the stretched rubber band without a DeLorean, flux capacitor and 1.21 jiggawatts of pure, unadulterated energy [21:27] SketchCow: I thought about your CDROM dumping situation (not being able to dump them fast enough), and you could do what libraries did in the 90s to use multi-CD catalogs: have a machine with 6 or 7 CD drives, and one big hard drive [22:05] http://archive.org/details/archiveteam_xanga_index [22:22] Storylane finished, http://tracker.archiveteam.org/storylane/ (unless someone finds more usernames) [22:29] I love a story with a happy ending [22:40] teaser -- this is the amstrad disk before: https://www.dropbox.com/s/35mnkvikd353eoa/ISVFix.png [22:40] blue points are flux transitions [22:40] red line is the speed profile [22:40] and after: https://www.dropbox.com/s/bpu9al18hrzm8pz/ISVFixed.png [23:43] somebody had a link to a blog about this new archiving tool for macs? combining wget and warc proxy into a custom thing, could i get that link?