#archiveteam 2011-09-06,Tue

↑back Search

Time Nickname Message
00:17 🔗 Soojin awesome :)
00:21 🔗 lowtekk good watch while i'm attacking a pile of fresh HP docs
01:06 🔗 SketchCow http://www.youtube.com/watch?v=-2ZTmuX3cog
02:07 🔗 Ymgve luh EM X RM X SYNDROME XXX ADDRESS BIT XXD CHIP XXX
02:07 🔗 Ymgve It may not seem like much, but that is a string extracted from a Cray hard drive image
02:15 🔗 db48x Ymgve: awesome
02:15 🔗 Ymgve Right now it's just "try MFM with all parameters and see what pops up"
02:16 🔗 db48x heh
02:16 🔗 db48x I suspect you know the right encoding parameters now :)
02:17 🔗 Ymgve at least some of them
02:17 🔗 db48x fairly unlikely that other parts of the hard drive use different parameters
02:18 🔗 Ymgve well, the thing is that I just do a global MFM decode without aligning on sector boundaries
02:19 🔗 Ymgve finding the exact sector alignment, checking for and potentially recovering errors and outputting stuff to a usable format is the hard part
02:19 🔗 db48x ah
02:38 🔗 db48x Ymgve: so how are you planning on finding the sector boundaries?
02:38 🔗 Ymgve identifying what the system would look for to find a sector
02:39 🔗 Ymgve I have found the headers, but there's no sync marks
02:39 🔗 db48x just hoping it's repeated enough?
02:40 🔗 Ymgve I also hope someone somewhere knows what checksums are used, if any
02:40 🔗 Ymgve to verify data integrity
05:13 🔗 SketchCow Back
05:13 🔗 SketchCow Hey, the derive queue is down to normal.
05:32 🔗 DFJustin that won't do
05:38 🔗 TheSIMM Just watched Jason's talk at Defcon 19.
05:38 🔗 TheSIMM Great stuff!
05:40 🔗 chronomex ARCHIVETEAM
05:40 🔗 chronomex AR CHIVE TEAM
05:41 🔗 SketchCow I do love that thing.
05:41 🔗 SketchCow I knew it was out of the park, I am sure it'll be on reddit in the week.
05:41 🔗 SketchCow Nothing in there I'm not proud of.
05:41 🔗 SketchCow There's one mental pause I was annoyed by, but it's good.
05:50 🔗 TheSIMM The talk kept moving right along. It's nice to know that others share the same passions.
05:56 🔗 SketchCow Go look up my Two Billion Dollars talk
05:57 🔗 SketchCow http://www.youtube.com/watch?v=Gq70QKa7588
05:58 🔗 SketchCow Also, I really love this icon-speaker-slide thing
05:59 🔗 chronomex yeah, it's good
06:00 🔗 chronomex icon is useless, but I don't know what else you would put there
06:00 🔗 SketchCow It's not
06:00 🔗 SketchCow It's good for branding it without watermarking
06:00 🔗 chronomex okay, not 100% useless
06:00 🔗 chronomex sure
06:00 🔗 SketchCow And the thing compresses crazy
06:00 🔗 chronomex watermarking--
06:00 🔗 chronomex yeah
06:01 🔗 SketchCow The derive queue nightmare is over
06:01 🔗 SketchCow I can start adding items again
06:12 🔗 SketchCow Eight! Bit! Boosters!!
06:12 🔗 SketchCow http://www.archive.org/details/eight-bit-boosters
06:13 🔗 chronomex SketchCow: you ever seen this poster before? http://catsonkeyboards.blogspot.com/2011/09/motorola-68000-art.html
06:14 🔗 SketchCow Not that I recall.
06:14 🔗 SketchCow I think it's time for me to throw another 100 issues of something into the queue, don't you.
06:14 🔗 chronomex absolutely
06:15 🔗 chronomex SketchCow: could you get around to making me a collection sometime soon?
06:18 🔗 SketchCow Oh yay, this sed allows me to use \0
06:29 🔗 SketchCow There we go
06:30 🔗 SketchCow 71 Issues of "Your Computer Magazine"
07:01 🔗 SketchCow Now adding MicroHobby magazine, Spanish computer magazine.
07:02 🔗 SketchCow 217 issues.
07:06 🔗 ersi Hm. Do I need an IA account to ingest/add stuff?
07:06 🔗 Coderjoe pretty sure the answer is yes
07:06 🔗 ersi wanted to put up http://www.jodyculkin.com/comics-2/introduction-to-arduino
07:07 🔗 chronomex yep, it's not hard to get though
07:08 🔗 chronomex quite easy in fact
07:09 🔗 Coderjoe http://www.archive.org/account/login.createaccount.php
07:09 🔗 Coderjoe it really is that easy
07:11 🔗 ersi Hah, "library card". How cute :)
07:16 🔗 SketchCow Well, we are a library.
07:18 🔗 * ersi nods
07:21 🔗 SketchCow http://www.archive.org/details/tpug-newsletter
07:29 🔗 inv SketchCow: how do you handle backups @ archive.org ?
07:29 🔗 inv doyou have raid disks or ..?
07:30 🔗 ersi raid != backup
07:31 🔗 inv what's the difference?
07:34 🔗 ersi raid is just a lifeline keeping the system up
07:35 🔗 ersi backup is at least another copy, on another machine or storage type
07:43 🔗 inv ersi: yeah, man, that's like your opinion
07:43 🔗 SketchCow I will add an item called softside-magazine-44.
07:43 🔗 SketchCow I will give it the title of SoftSide Magazine Issue 44 (Dungeons of the Gods).
07:43 🔗 SketchCow I will say this dates to 1982-07.
07:43 🔗 SketchCow In the collection named softside-magazine...
07:44 🔗 godane hey everyone
07:45 🔗 faye k
07:45 🔗 faye k
07:47 🔗 ersi Yeah, it's my opinion and a lot of random sysadmins
07:50 🔗 SketchCow hahaha
07:50 🔗 SketchCow I just broke 4100 uploaded items on archive.org.
07:50 🔗 ersi Congrats :]
07:57 🔗 chronomex crikey
08:00 🔗 faye wejhhh
08:03 🔗 SketchCow Yeah, it was 2500 or something last Tuesday.
08:03 🔗 SketchCow 1,600 in one week
08:03 🔗 SketchCow Scripts hooooooooooo
08:03 🔗 SketchCow I am going to get yelled at
08:03 🔗 SketchCow BEST CARPET CALL-UP EVER
08:04 🔗 SketchCow http://www.archive.org/stream/your-computer-magazine-1981-06/YourComputer_1981_06#page/n0/mode/2up
08:07 🔗 chronomex heh
08:07 🔗 chronomex it's not streamable yet ..
08:27 🔗 SketchCow It's bouncing back and forth.
08:27 🔗 SketchCow Give it another few minutes.
08:28 🔗 godane how do you guys make your boingboing dump work like a normal website
08:32 🔗 SketchCow I wouldn't use that thing to re-make boingboing.
08:32 🔗 SketchCow Are you trying to remake boingboing?
08:32 🔗 godane i was think thats what a dump should do
08:33 🔗 SketchCow Yes
08:33 🔗 SketchCow But it is much better for analysis
08:33 🔗 SketchCow How many times does Cory say something wrong, how self-absorbed is Xeni, does Mark F. even give a shit
08:33 🔗 SketchCow You know, graphs
08:35 🔗 db48x heh
08:35 🔗 godane what tools did you use to make the boingboing dump?
09:28 🔗 db48x godane: looks like the boingboing dump wasn't wasn't created with the intention that anyone would be able to recreate the site from it
09:29 🔗 db48x godane: but it would be straight forward enough to use it to populate whatever database is used by whatever cms they boingboing uses
09:29 🔗 db48x and thus get a browseable site
09:43 🔗 godane i fear thats how geocities dump is now
09:54 🔗 ersi We save shit from burning buildings
09:54 🔗 ersi there's not always time to make perfect
09:54 🔗 godane i know
09:55 🔗 godane i just hope i can view as it originally was
09:56 🔗 db48x there are mirrors online
09:56 🔗 ersi AFAIK you should be able to view it nicely
09:56 🔗 ersi and there's people who mirror geocities since.. long
10:47 🔗 DoubleJ \sb goto 3:01
10:48 🔗 DoubleJ \sb goto 3:01/sb goto 3:01
10:48 🔗 DoubleJ bleh. note to self: don't try to catch up on overnight IRC'ing before finishing coffee.
10:54 🔗 ersi Meh, multitask!
13:04 🔗 godane just found archive time defcon 19 video: http://www.youtube.com/watch?v=-2ZTmuX3cog
13:04 🔗 ersi godane: http://ascii.textfiles.com/archives/3278
13:04 🔗 ersi No wonder ;p
13:14 🔗 godane we need to start backing up twit.tv videos
13:14 🔗 godane only say that cause alot of older videos are not on the site anymore
13:15 🔗 ersi Get crackalackin'
13:15 🔗 godane just started
13:15 🔗 godane slowly
13:15 🔗 godane doing just the 256kb video files though
13:16 🔗 ersi neat :)
13:17 🔗 godane there is 500kb upto 2000kb
13:20 🔗 godane 256kb one is good enough for backing up tnt
13:20 🔗 godane i can get about 40 episodes on to 1 layer dvd at 256kb
13:21 🔗 godane there is over 320 episodes now
13:22 🔗 ersi Um, are you referring to the bitrate of the video or the total size of the video file?
13:22 🔗 ersi seems a bit low in both cases imo
13:22 🔗 godane 100mb
13:22 🔗 godane around 100mb
13:22 🔗 ersi Um.. okay.
13:23 🔗 godane there is also 64kb audio
13:44 🔗 Coderjoe inv: raid is not a backup. (google it. it isn't just my and ersi's opinion). last I knew, archive.org handles backups by having each item on at least two different nodes, all of which are available.
13:44 🔗 Coderjoe inv: raid also tends to not scale well to the volumes of storage that they deal with.
13:45 🔗 ersi it also likes to fuuuuuuuckkkkkk shit uppp
13:48 🔗 emijrp twit.tv is closing?
13:50 🔗 ersi emijrp: it's never wrong to do things prematurely
13:50 🔗 emijrp ok, archiving http://wiki.twit.tv/wiki/Main_Page
13:50 🔗 Coderjoe on the thread of backing up videos because they sometimes disappear from the host: at one time I had the crazy idea of pulling videos that show up on the recently uploaded rss feed at youtube...
13:52 🔗 emijrp I heard you are crazy. Indeed.
13:52 🔗 Coderjoe that wiki has some spam trouble...
13:53 🔗 emijrp Most wikis have. AT wiki too.
13:54 🔗 emijrp Man, I'm reading since some days ago about destroyed libraries.
13:55 🔗 emijrp Thousands libraries were burnt during World War II. Sarajevo library was burnt. Iraq National Library was too during 2003.
13:55 🔗 Coderjoe :(
13:55 🔗 ersi I'm unfortunally not suprised
13:55 🔗 emijrp The 2004 Pacific Ocean tsunami flooded libraries and archives in several countries.
13:56 🔗 emijrp The Chilean earthquake destroyed a lot of monuments.
13:57 🔗 Coderjoe any in london during the blitz?
13:57 🔗 emijrp LOL, that guys wanted Ipads, no books.
14:03 🔗 godane emjrp: twit.tv is not closing
14:03 🔗 emijrp You can read more here http://www.unesco.org/webworld/mdm/administ/pdf/LOSTMEMO.PDF
14:03 🔗 godane emjrp: Just older episodes video episodes are hard to find
14:08 🔗 godane emijrp: I'm a bit crazy too
14:08 🔗 godane i'm making a archive of linux that can do a full compile offline
14:10 🔗 Coderjoe aside from getting the sources and compiler bits, what part of compiling a linux kernel needs to be offline
14:10 🔗 godane not just linux kernel
14:10 🔗 godane a full os
14:11 🔗 godane i'm making a dvd iso
17:38 🔗 SketchCow --------------------------
17:38 🔗 SketchCow http://techcrunch.com/2011/09/06/the-end/
17:38 🔗 SketchCow Could someone please Heretrix/wget Techcrunch?"
18:01 🔗 alard SketchCow: running.
18:01 🔗 SketchCow THank you.
18:01 🔗 alard (Let's hope it doesn't block as fast as Google.)
18:01 🔗 alard I have these domains, any more?
18:01 🔗 alard http://crunchboard.com/
18:01 🔗 alard http://techcrunch.com/
18:01 🔗 alard http://www.crunchboard.com/
18:01 🔗 alard http://www.techcrunch.com/
18:01 🔗 alard http://disrupt.techcrunch.com/
18:01 🔗 alard http://eu.techcrunch.com/
18:01 🔗 alard http://fr.techcrunch.com/
18:01 🔗 alard http://jp.techcrunch.com/
18:01 🔗 alard http://techcrunch.tv/
18:01 🔗 alard http://www.techcrunch.tv/
18:03 🔗 SketchCow I'd focus on the main one and the crunchboard.
18:08 🔗 alard Hmm. What about the comments?
18:09 🔗 alard They're on facebook.
18:16 🔗 SketchCow Are they? Fuck.
18:16 🔗 SketchCow We need those.
18:19 🔗 alard Facebook has a scraping TOS, which I didn't accept so don't know about.
18:19 🔗 alard :)
19:30 🔗 alard SketchCow: The comments will have to be in json format, I'm afraid, outside Heritrix.
20:14 🔗 SketchCow That's fine.
20:14 🔗 SketchCow Hey, I uploaded the rest of the data you gave me
20:14 🔗 SketchCow Let's compare what ended up there with what you expect.
20:58 🔗 alard Great. I checked the sha1 checksums, everything is there.
20:59 🔗 Coderjoe i really hate this trend of offloading comments to shithouses like disqus and facebook
21:01 🔗 SketchCow Thanks, alard.
21:02 🔗 alard It's strange: did they have comments on Techcrunch before March 2011?
21:03 🔗 SketchCow They did
21:03 🔗 alard Then where are they?
21:03 🔗 SketchCow I'll bet they're gone
21:03 🔗 SketchCow We'll see what can be done
21:03 🔗 closure btw, anyone notice TC has "Deadpool" as one of its main categories? teh irony
21:03 🔗 alard I think so: all there is is the Facebook comment thing, which they started using in march.
21:05 🔗 Coderjoe alard: if it is anything like boingboing and disqus, all comments posted through the old comment system vanished when they switched over
21:12 🔗 alard That's friendly!

irclogger-viewer