[00:17] awesome :) [00:21] good watch while i'm attacking a pile of fresh HP docs [01:06] http://www.youtube.com/watch?v=-2ZTmuX3cog [02:07] luh EM X RM X SYNDROME XXX ADDRESS BIT XXD CHIP XXX [02:07] It may not seem like much, but that is a string extracted from a Cray hard drive image [02:15] Ymgve: awesome [02:15] Right now it's just "try MFM with all parameters and see what pops up" [02:16] heh [02:16] I suspect you know the right encoding parameters now :) [02:17] at least some of them [02:17] fairly unlikely that other parts of the hard drive use different parameters [02:18] well, the thing is that I just do a global MFM decode without aligning on sector boundaries [02:19] finding the exact sector alignment, checking for and potentially recovering errors and outputting stuff to a usable format is the hard part [02:19] ah [02:38] Ymgve: so how are you planning on finding the sector boundaries? [02:38] identifying what the system would look for to find a sector [02:39] I have found the headers, but there's no sync marks [02:39] just hoping it's repeated enough? [02:40] I also hope someone somewhere knows what checksums are used, if any [02:40] to verify data integrity [05:13] Back [05:13] Hey, the derive queue is down to normal. [05:32] that won't do [05:38] Just watched Jason's talk at Defcon 19. [05:38] Great stuff! [05:40] ARCHIVETEAM [05:40] AR CHIVE TEAM [05:41] I do love that thing. [05:41] I knew it was out of the park, I am sure it'll be on reddit in the week. [05:41] Nothing in there I'm not proud of. [05:41] There's one mental pause I was annoyed by, but it's good. [05:50] The talk kept moving right along. It's nice to know that others share the same passions. [05:56] Go look up my Two Billion Dollars talk [05:57] http://www.youtube.com/watch?v=Gq70QKa7588 [05:58] Also, I really love this icon-speaker-slide thing [05:59] yeah, it's good [06:00] icon is useless, but I don't know what else you would put there [06:00] It's not [06:00] It's good for branding it without watermarking [06:00] okay, not 100% useless [06:00] sure [06:00] And the thing compresses crazy [06:00] watermarking-- [06:00] yeah [06:01] The derive queue nightmare is over [06:01] I can start adding items again [06:12] Eight! Bit! Boosters!! [06:12] http://www.archive.org/details/eight-bit-boosters [06:13] SketchCow: you ever seen this poster before? http://catsonkeyboards.blogspot.com/2011/09/motorola-68000-art.html [06:14] Not that I recall. [06:14] I think it's time for me to throw another 100 issues of something into the queue, don't you. [06:14] absolutely [06:15] SketchCow: could you get around to making me a collection sometime soon? [06:18] Oh yay, this sed allows me to use \0 [06:29] There we go [06:30] 71 Issues of "Your Computer Magazine" [07:01] Now adding MicroHobby magazine, Spanish computer magazine. [07:02] 217 issues. [07:06] Hm. Do I need an IA account to ingest/add stuff? [07:06] pretty sure the answer is yes [07:06] wanted to put up http://www.jodyculkin.com/comics-2/introduction-to-arduino [07:07] yep, it's not hard to get though [07:08] quite easy in fact [07:09] http://www.archive.org/account/login.createaccount.php [07:09] it really is that easy [07:11] Hah, "library card". How cute :) [07:16] Well, we are a library. [07:18] * ersi nods [07:21] http://www.archive.org/details/tpug-newsletter [07:29] SketchCow: how do you handle backups @ archive.org ? [07:29] doyou have raid disks or ..? [07:30] raid != backup [07:31] what's the difference? [07:34] raid is just a lifeline keeping the system up [07:35] backup is at least another copy, on another machine or storage type [07:43] ersi: yeah, man, that's like your opinion [07:43] I will add an item called softside-magazine-44. [07:43] I will give it the title of SoftSide Magazine Issue 44 (Dungeons of the Gods). [07:43] I will say this dates to 1982-07. [07:43] In the collection named softside-magazine... [07:44] hey everyone [07:45] k [07:45] k [07:47] Yeah, it's my opinion and a lot of random sysadmins [07:50] hahaha [07:50] I just broke 4100 uploaded items on archive.org. [07:50] Congrats :] [07:57] crikey [08:00] wejhhh [08:03] Yeah, it was 2500 or something last Tuesday. [08:03] 1,600 in one week [08:03] Scripts hooooooooooo [08:03] I am going to get yelled at [08:03] BEST CARPET CALL-UP EVER [08:04] http://www.archive.org/stream/your-computer-magazine-1981-06/YourComputer_1981_06#page/n0/mode/2up [08:07] heh [08:07] it's not streamable yet .. [08:27] It's bouncing back and forth. [08:27] Give it another few minutes. [08:28] how do you guys make your boingboing dump work like a normal website [08:32] I wouldn't use that thing to re-make boingboing. [08:32] Are you trying to remake boingboing? [08:32] i was think thats what a dump should do [08:33] Yes [08:33] But it is much better for analysis [08:33] How many times does Cory say something wrong, how self-absorbed is Xeni, does Mark F. even give a shit [08:33] You know, graphs [08:35] heh [08:35] what tools did you use to make the boingboing dump? [09:28] godane: looks like the boingboing dump wasn't wasn't created with the intention that anyone would be able to recreate the site from it [09:29] godane: but it would be straight forward enough to use it to populate whatever database is used by whatever cms they boingboing uses [09:29] and thus get a browseable site [09:43] i fear thats how geocities dump is now [09:54] We save shit from burning buildings [09:54] there's not always time to make perfect [09:54] i know [09:55] i just hope i can view as it originally was [09:56] there are mirrors online [09:56] AFAIK you should be able to view it nicely [09:56] and there's people who mirror geocities since.. long [10:47] \sb goto 3:01 [10:48] \sb goto 3:01/sb goto 3:01 [10:48] bleh. note to self: don't try to catch up on overnight IRC'ing before finishing coffee. [10:54] Meh, multitask! [13:04] just found archive time defcon 19 video: http://www.youtube.com/watch?v=-2ZTmuX3cog [13:04] godane: http://ascii.textfiles.com/archives/3278 [13:04] No wonder ;p [13:14] we need to start backing up twit.tv videos [13:14] only say that cause alot of older videos are not on the site anymore [13:15] Get crackalackin' [13:15] just started [13:15] slowly [13:15] doing just the 256kb video files though [13:16] neat :) [13:17] there is 500kb upto 2000kb [13:20] 256kb one is good enough for backing up tnt [13:20] i can get about 40 episodes on to 1 layer dvd at 256kb [13:21] there is over 320 episodes now [13:22] Um, are you referring to the bitrate of the video or the total size of the video file? [13:22] seems a bit low in both cases imo [13:22] 100mb [13:22] around 100mb [13:22] Um.. okay. [13:23] there is also 64kb audio [13:44] inv: raid is not a backup. (google it. it isn't just my and ersi's opinion). last I knew, archive.org handles backups by having each item on at least two different nodes, all of which are available. [13:44] inv: raid also tends to not scale well to the volumes of storage that they deal with. [13:45] it also likes to fuuuuuuuckkkkkk shit uppp [13:48] twit.tv is closing? [13:50] emijrp: it's never wrong to do things prematurely [13:50] ok, archiving http://wiki.twit.tv/wiki/Main_Page [13:50] on the thread of backing up videos because they sometimes disappear from the host: at one time I had the crazy idea of pulling videos that show up on the recently uploaded rss feed at youtube... [13:52] I heard you are crazy. Indeed. [13:52] that wiki has some spam trouble... [13:53] Most wikis have. AT wiki too. [13:54] Man, I'm reading since some days ago about destroyed libraries. [13:55] Thousands libraries were burnt during World War II. Sarajevo library was burnt. Iraq National Library was too during 2003. [13:55] :( [13:55] I'm unfortunally not suprised [13:55] The 2004 Pacific Ocean tsunami flooded libraries and archives in several countries. [13:56] The Chilean earthquake destroyed a lot of monuments. [13:57] any in london during the blitz? [13:57] LOL, that guys wanted Ipads, no books. [14:03] emjrp: twit.tv is not closing [14:03] You can read more here http://www.unesco.org/webworld/mdm/administ/pdf/LOSTMEMO.PDF [14:03] emjrp: Just older episodes video episodes are hard to find [14:08] emijrp: I'm a bit crazy too [14:08] i'm making a archive of linux that can do a full compile offline [14:10] aside from getting the sources and compiler bits, what part of compiling a linux kernel needs to be offline [14:10] not just linux kernel [14:10] a full os [14:11] i'm making a dvd iso [17:38] -------------------------- [17:38] http://techcrunch.com/2011/09/06/the-end/ [17:38] Could someone please Heretrix/wget Techcrunch?" [18:01] SketchCow: running. [18:01] THank you. [18:01] (Let's hope it doesn't block as fast as Google.) [18:01] I have these domains, any more? [18:01] http://crunchboard.com/ [18:01] http://techcrunch.com/ [18:01] http://www.crunchboard.com/ [18:01] http://www.techcrunch.com/ [18:01] http://disrupt.techcrunch.com/ [18:01] http://eu.techcrunch.com/ [18:01] http://fr.techcrunch.com/ [18:01] http://jp.techcrunch.com/ [18:01] http://techcrunch.tv/ [18:01] http://www.techcrunch.tv/ [18:03] I'd focus on the main one and the crunchboard. [18:08] Hmm. What about the comments? [18:09] They're on facebook. [18:16] Are they? Fuck. [18:16] We need those. [18:19] Facebook has a scraping TOS, which I didn't accept so don't know about. [18:19] :) [19:30] SketchCow: The comments will have to be in json format, I'm afraid, outside Heritrix. [20:14] That's fine. [20:14] Hey, I uploaded the rest of the data you gave me [20:14] Let's compare what ended up there with what you expect. [20:58] Great. I checked the sha1 checksums, everything is there. [20:59] i really hate this trend of offloading comments to shithouses like disqus and facebook [21:01] Thanks, alard. [21:02] It's strange: did they have comments on Techcrunch before March 2011? [21:03] They did [21:03] Then where are they? [21:03] I'll bet they're gone [21:03] We'll see what can be done [21:03] btw, anyone notice TC has "Deadpool" as one of its main categories? teh irony [21:03] I think so: all there is is the Facebook comment thing, which they started using in march. [21:05] alard: if it is anything like boingboing and disqus, all comments posted through the old comment system vanished when they switched over [21:12] That's friendly!