#archiveteam 2011-08-15,Mon

Time Nickname Message
01:50 πŸ”— Wyatt|wor SketchCow: So yeah, got an upload slot for a friendster tarball?
01:54 πŸ”— chronomex PepsiMax: you should put it on archive.org.
01:54 πŸ”— chronomex it's really easy, just requires that you sit down for a few minutes and sort out the metadata
02:24 πŸ”— Wyatt|wor Oh wow, I just realised...I should get an eSATA connector for this box. I'm limited by the USB interface.
02:30 πŸ”— chronomex esata apparently grows hair on your balls
02:30 πŸ”— chronomex good for you
02:30 πŸ”— chronomex makes you a man
02:30 πŸ”— chronomex etc
02:31 πŸ”— chronomex man, hairy woman, whatever
02:45 πŸ”— Wyatt|wor ....right. Well anyway, I can reasonably get away with...oh wait, I just realised: I'm limited by my network interface too. This thing only has a 10/100 card. :/
02:46 πŸ”— Wyatt|wor So yeah, we're under pretty light load on the weekends, so _in theory_ I can get away with 200-250mbit.
02:53 πŸ”— Wyatt|wor How much is a gigabit PCI card these days?
02:55 πŸ”— chronomex dozen bucks?
02:57 πŸ”— Wyatt|wor Oh, well that's not too bad. And I think this thing already has a spare SATA header...I need to find more stuff to upload.
03:01 πŸ”— dashcloud hi guys, is there supposed to be a difference between dd and ddrescue floppy images? 7zip seems to be able to open the dd images, but not the ddrescue images- anyone else seeing this?
04:35 πŸ”— SketchCow This week, we start getting upload slots again.
04:43 πŸ”— Wyatt|wor Ahh, none right now? All right, I'll bide my time. Thanks.
04:44 πŸ”— Wyatt_of_ Huh, limited to nine characters?
04:44 πŸ”— chronomex yep, welcome to efnet
04:45 πŸ”— Wyatt|Wor This is the first I've needed that many. ~_~;;
04:45 πŸ”— Wyatt|Wor Well, "needed" in a very loose sense.
04:45 πŸ”— chronomex wanted?
04:46 πŸ”— Wyatt|Wor Close enough.
04:46 πŸ”— Wyatt|Wor :P
04:51 πŸ”— undersco2 :D
04:51 πŸ”— undersco2 t SketchCow Give me root and I'll set up rsync modules!
05:12 πŸ”— db48xOthe heh
05:43 πŸ”— undersco2 I'm trying to get him to give in by badgering him with all sorts of package installation requests
05:46 πŸ”— chronomex took it to vegas and back
05:48 πŸ”— undersco2 lol
06:07 πŸ”— willbradl hey all. anyone know W4r3zh4ck?
06:08 πŸ”— willbradl it appears he ripped archiveteam.org or at least referenced the rip, created or referenced a rip of ED, and started a blog called THE ARCHiVERS
06:09 πŸ”— willbradl not sure if spam
06:10 πŸ”— chronomex hahahahahaha
06:10 πŸ”— chronomex link?
06:10 πŸ”— willbradl http://archiveteam.org/index.php?title=Special:Contributions/W4r3zh4ck
06:12 πŸ”— zyphlar i come in occasionally to prune/admin the wiki, he's the only one i'm wondering about
06:13 πŸ”— chronomex hmmm.
06:13 πŸ”— chronomex let it stand. but don't let the ED page say "online": http://archiveteam.org/index.php?title=Encyclopedia_Dramatica&diff=prev&oldid=6263
06:14 πŸ”— zyphlar indeed
06:14 πŸ”— zyphlar was it an official project? is the archiving still in progress?
06:15 πŸ”— * chronomex shrugs
06:15 πŸ”— chronomex ED is dead, man
06:15 πŸ”— zyphlar indeed
06:15 πŸ”— zyphlar wondering how legit the .ch mirror is
06:17 πŸ”— chronomex I don't think he understands the difference between "mirror" and "backup". http://archiveteam.org/index.php?title=Frequently_Asked_Questions&diff=prev&oldid=6240
06:21 πŸ”— zyphlar good catch
06:22 πŸ”— chronomex I say leave him alone unless he does something monumentally stupid
06:22 πŸ”— zyphlar cool yeah his links are weird but don't appear malicious
06:24 πŸ”— Wyatt|Wor A while back, Jumpline bought Christian Web Host. Most of their sites are for churches, but sometimes you get a jewel like this: apostolic-sceptre.org
06:26 πŸ”— Cameron_D marquee, hit counter, tables \o/
06:26 πŸ”— zyphlar splash page
06:26 πŸ”— Cameron_D <META content="MSHTML 6.00.2600.0" name=GENERATOR>
06:26 πŸ”— zyphlar it's almost as good as north korea's website
06:26 πŸ”— Wyatt|Wor (BTW, if you notice something broken there, please let me know; I'm attempting to migrate that site to a new server.)
06:27 πŸ”— zyphlar wait, apostolic-sceptre? insert easter eggs.
06:27 πŸ”— Wyatt|Wor I think I saw a whole directory about hexes while I was tarring it up...
06:27 πŸ”— Wyatt|Wor I rather like my job, thanks.
06:30 πŸ”— Wyatt|Wor Though Plesk can die in more fires than the sun would know what to do with.
06:31 πŸ”— zyphlar CMSes and control panels: so people who don't know how to manage websites can screw it up and then pay someone to do it for them anyway
06:36 πŸ”— Wyatt|Wor It's interesting as a software artifact.
06:36 πŸ”— Wyatt|Wor And the places they use perl, it's a lot cleaner than cPanel ever could hope to be.
06:37 πŸ”— zyphlar I wish people would write management systems according to the way the thing that's being managed wants to be managed
06:37 πŸ”— zyphlar instead of making creative new folder/config/scripting schemes
06:37 πŸ”— Wyatt|Wor I'm with you. Sphera users are generally pretty happy and that's just a chroot with a mess of hardlinks.
06:37 πŸ”— zyphlar if your config files say ## DO NOT EDIT BY HAND ## you've failed a bit
06:39 πŸ”— Wyatt|Wor I think I'd rather see that than <?xml version="1.0"?>.
06:40 πŸ”— zyphlar ooh
06:40 πŸ”— chronomex ## DO NOT EDIT BY HAND ## THIS IS NOT ACTUALLY A CONFIG FILE ## CHANGES WILL BE IGNORED
06:40 πŸ”— zyphlar ## I DON'T ACTUALLY KNOW WHAT THIS FILE DOES BUT DON'T TOUCH IT JUST IN CASE ##
06:41 πŸ”— chronomex hahahhaa
06:41 πŸ”— Cameron_D my coding is a bit like that
06:54 πŸ”— Wyatt|Wor I saw something on stackexchange, I think it was, recently asking about the "best comments you've seen". There was one in there too, to the effect of //This code has been automatically generated. Changes will be ignored.
15:56 πŸ”— underaway http://i.imgur.com/XZ6GY.jpg OFFICIAL ARCHIVETEAM SUSTENANCE
15:57 πŸ”— underaway You should have seen SketchCow when we went by there
15:57 πŸ”— underaway He had to have like 50
15:59 πŸ”— Schbirid i hope you made photos!
16:10 πŸ”— yipdw hmm, boo -- my me.com username scraper stopped at 525 usernames
16:11 πŸ”— yipdw today, I also learned that Google's search interface will give you a maximum of 100 pages of results
16:32 πŸ”— Jofo yipdw: does this apply when using an API key? Furthermore, those things still work right?
16:32 πŸ”— yipdw Jofo: haven't tried with the API key; my scraper just screen-scraped google.com/search?q=<blah> HTML
16:32 πŸ”— yipdw I'll give it another shot after work
16:34 πŸ”— yipdw I may have to use alard's method of grabbing an IPv6 /112 and hitting Google round-robin style
16:42 πŸ”— SketchCow Back
16:42 πŸ”— db48x deep-fried cheesecake does sound pretty good
16:42 πŸ”— SketchCow W4r3zh4ck is real.
16:42 πŸ”— SketchCow By the way.
16:43 πŸ”— SketchCow Slavishly copy-bot, but still real.
16:51 πŸ”— underaway SketchCow: Tell them about your deep fried cheesecake addiction
16:51 πŸ”— SketchCow I'm working through it
16:51 πŸ”— SketchCow Down to 6 a day
16:51 πŸ”— SketchCow Only when I can't wait
16:51 πŸ”— underaway hahaha
16:52 πŸ”— underaway The archive's sarcasticness level has fallen sharply since you left
16:52 πŸ”— underaway I'm going to have to compensate
16:55 πŸ”— yipdw deep-fried cheesecake sounds like an excellent way to die painfully
16:58 πŸ”— SketchCow Or deliciously
16:59 πŸ”— Jofo SketchCow: hey, I was told to ping you about this: any interest in me scanning ~2 years worth of New Scientist mags from the early 2000's? Their content is available behind a paywall online, so I figured it was a no
16:59 πŸ”— Jofo but I figured I'd ask before I recycled it
16:59 πŸ”— SketchCow I'd like to see those issues contributed somewhere.
17:00 πŸ”— SketchCow I can give you a mailing address.
17:00 πŸ”— SketchCow But scanning, no.
17:00 πŸ”— Jofo contributed works, I suppose
17:00 πŸ”— Jofo assuming the USPS doesn't raep
17:00 πŸ”— SketchCow Media Mail
17:01 πŸ”— Jofo k cool, great preso at defcon btw
17:01 πŸ”— Jofo almost had me in tears :3
17:01 πŸ”— SketchCow I had you so rapt you wanted to scan New Scientist
17:02 πŸ”— Jofo ALMOST BUT NOT FOR REAL *flexes muscles*
17:02 πŸ”— Jofo haha
17:02 πŸ”— SketchCow An activity slightly more interesting than watching someone watching paint dry
17:02 πŸ”— Jofo well, I've got a sheet fed scansnap thang, so it would be easy enough to do
17:02 πŸ”— SketchCow 1. Those don't really work
17:02 πŸ”— Jofo slice binding, toss in scanner, press go
17:02 πŸ”— SketchCow 2. That means you'd destroy the new scientists
17:02 πŸ”— SketchCow I fucking HATE destroying original material to scan
17:02 πŸ”— Jofo yeah, well, so would recycling :)
17:02 πŸ”— Jofo which was my original plan
17:03 πŸ”— SketchCow Yes, that's good.
17:03 πŸ”— SketchCow It's good you decided to rape the girl, not just shoot her in the head
17:03 πŸ”— Jofo gettin' some.
17:03 πŸ”— SketchCow Mail address at the ready when you look up media mail costs, which will be trivial.
17:03 πŸ”— Jofo sure, pm?
17:04 πŸ”— SketchCow E-mail me.
17:04 πŸ”— SketchCow jason@textfiles.com
17:04 πŸ”— Jofo sure
17:04 πŸ”— Jofo done
17:15 πŸ”— Jofo SketchCow: oh, yeah. It'll be like three bones for media mail
17:22 πŸ”— SketchCow http://www.youtube.com/watch?v=yzC4hFK5P3g
17:27 πŸ”— Paradoks Out of curiosity, is it worth the effort to non-destructively scan issues of Boardwatch? I keep hoping that someone else has already done it, so I don't have to.
17:28 πŸ”— Paradoks And by "scan", I mean "setup a tripod with a camera on it, and take pictures, page by page.".
17:28 πŸ”— SketchCow Here's the deal.
17:28 πŸ”— SketchCow I have someone working on low-cost scanners, that will do a great job.
17:28 πŸ”— SketchCow And so at this point, I'm willing to just take the items in.
17:31 πŸ”— Jofo I remember seeing a google video about a low-cost, non-destructive scanner that was intended for scanning textbooks
17:31 πŸ”— Jofo it was something like a couple pieces of glass (or plastic) as the "bed", and then it used two inexpensive digital cameras, each pointing at a different page
17:34 πŸ”— Paradoks SketchCow: Cool. Thanks for the update.
17:44 πŸ”— chronomex Jofo: yeah, the open-source bookscanner. guy named Dan made it.
17:44 πŸ”— chronomex now he works for archive.org.
17:45 πŸ”— SketchCow Where he's working on the next one.
17:45 πŸ”— Jofo oic!
17:46 πŸ”— Jofo I figured trying to drop some archive knowledge in here was going to get me one-upped :)
17:48 πŸ”— db48x heh
17:50 πŸ”— SketchCow I should go to more movie themed restaurants, then I at least get a meal even if the movie sucks
17:56 πŸ”— db48x are there that many?
17:56 πŸ”— db48x I'd never heard of a movie-themed restaurant before
17:57 πŸ”— chronomex me neither
18:03 πŸ”— SketchCow Someone wants archiveteam to produce a backup of his livejournal he lost from 2008.
18:03 πŸ”— SketchCow I am skeptical.
18:08 πŸ”— Jofo I'm still sad I can't find my old geocities page. I wonder if it wasn't archived or if it was purged for disuse
18:13 πŸ”— SketchCow Checked Reocities?
18:16 πŸ”— chronomex SketchCow: lost, eh?
18:20 πŸ”— Jofo SketchCow: yeah. Tried a google site: search on the area51/vault url for my name, which I'm pretty sure was on the page. At one point I even think I went through 'em manually. No go :(
18:27 πŸ”— SketchCow chronomex: He ran some backup thing and he lost it all
18:34 πŸ”— alard The backup thing actually deleted the information? (That would be nice.)
18:35 πŸ”— db48x more likely he lost the backup
18:35 πŸ”— alard Heh, yeah. (Less interesting, though.)
18:36 πŸ”— db48x true :)
18:37 πŸ”— SketchCow No, it apparently didn't back up something it did
18:38 πŸ”— SketchCow http://www.facebook.com/permalink.php?story_fbid=119726521459178&id=100002654938831
18:41 πŸ”— alard SketchCow: I have a Heritrix/WARC backup of Akoha.com (+/- 1GB), but it's probably easier to wait for the new upload slots?
18:41 πŸ”— db48x yea, let me know when you've got slots handy
18:41 πŸ”— db48x I've got a warc of the Google Friends Newsletter
18:42 πŸ”— alard Ah, you managed to get it? Good.
18:43 πŸ”— db48x yea
18:43 πŸ”— db48x it was dumb
18:43 πŸ”— db48x I was forgetting to set a user agent
18:45 πŸ”— alard By the way: I tried to turn wget into a WARC extractor like you suggested.
18:46 πŸ”— alard I got part of the way, that is: it successfully extracted files and rewrote some of the urls.
18:46 πŸ”— db48x ooh, cool
18:46 πŸ”— alard The difficulty is in the different alternative urls: with and without index.html, trailing slashes, 302 redirects.
18:49 πŸ”— alard Plus it's not really what wget is supposed to do, and the implementation is quite hackish. I gave it up, for the moment, but I can upload the code if you're interested in having a go at it.
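The alternative-URL problem alard describes (with and without index.html, trailing slashes, 302 redirects) can be illustrated with a minimal Python sketch. This is not code from wget-warc: the function name `canonical_url` and the `redirects` mapping are hypothetical, and folding `/dir` and `/dir/` onto one key is an assumption that holds for many static mirrors but not for every server.

```python
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url, redirects=None):
    """Map the alternative spellings of one page onto a single lookup key.

    `redirects` is a hypothetical dict of {source URL: target URL}, for
    example built from the 302 responses recorded in a WARC file.
    """
    redirects = redirects or {}
    seen = set()
    # Follow recorded redirects, stopping if we ever revisit a URL (loop guard).
    while url in redirects and url not in seen:
        seen.add(url)
        url = redirects[url]

    parts = urlsplit(url)
    path = parts.path or "/"
    # "/dir/index.html" and "/dir/" usually name the same document.
    if path.endswith("/index.html"):
        path = path[: -len("index.html")]
    # Assumption: "/dir/" and "/dir" are the same resource; keep "/" for the root.
    if len(path) > 1 and path.endswith("/"):
        path = path.rstrip("/")
    # Scheme and host are case-insensitive; the fragment never reaches the server.
    return urlunsplit((parts.scheme.lower(), parts.netloc.lower(), path, parts.query, ""))
```

An extractor could use the returned key when deciding which stored response a rewritten link should point to, so that all spellings of a page resolve to one file on disk.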
18:49 πŸ”— db48x make it a git branch
18:50 πŸ”— db48x I want to change the way it handles --timestamp when warcs are enabled
18:51 πŸ”— db48x I want it to download all the files and put them in the warc, but to only overwrite the mirror on disk if the file is newer
18:51 πŸ”— db48x instead of ignoring the --timestamp option entirely
18:51 πŸ”— db48x my filesystem has copy-on-write semantics
18:52 πŸ”— db48x so if wget overwrites the file, the filesystem stores that as a change from the last snapshot
18:53 πŸ”— db48x should be a good project for this weekend
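The behaviour db48x is asking for can be written down as a small decision table. The Python sketch below only illustrates that wish; it is not wget's actual implementation, and the function name `plan_download` and its parameters are hypothetical. Modification times are assumed to be comparable numbers such as Unix timestamps, with `None` meaning no local copy exists.

```python
def plan_download(remote_mtime, local_mtime, warc_enabled):
    """Return (fetch, write_warc, overwrite_mirror) for one file.

    Without WARC output, a file is fetched only when the remote copy is newer.
    With WARC output, every file is fetched and recorded in the WARC, but the
    on-disk mirror is overwritten only when the remote copy is newer, so a
    copy-on-write filesystem does not record a spurious change.
    """
    newer = local_mtime is None or remote_mtime > local_mtime
    if warc_enabled:
        return (True, True, newer)
    return (newer, False, newer)
```

For example, `plan_download(100, 200, True)` returns `(True, True, False)`: the response still goes into the WARC, while the newer local file is left untouched.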
18:53 πŸ”— emijrp I heard wget.
18:54 πŸ”— db48x emijrp: have you seen alard's wget-warc project?
18:54 πŸ”— emijrp no
18:54 πŸ”— db48x http://archiveteam.org/index.php?title=Wget_with_WARC_output
18:56 πŸ”— emijrp example of headers?
18:56 πŸ”— db48x sure, one moment
18:58 πŸ”— emijrp lol, compare this http://en.wikipedia.org/wiki/List_of_bulletin_board_systems with this http://bbslist.textfiles.com/usbbs.html
18:59 πŸ”— emijrp ALL HUMAN KNOWLEDGE.
18:59 πŸ”— emijrp but only notable one.
19:01 πŸ”— alard db48x: https://github.com/alard/wget-warc/tree/warcextract
19:01 πŸ”— db48x cool
19:02 πŸ”— chronomex emijrp: well, they don't collect trivia or "in popular culture"
19:03 πŸ”— emijrp oh really?
19:03 πŸ”— db48x http://pastebin.com/vv6mK7f1
19:03 πŸ”— chronomex emijrp: there's a war on "In popular culture" sections.
19:04 πŸ”— emijrp im inclusionist
19:05 πŸ”— chronomex I'm hateionist
19:05 πŸ”— chronomex motivated by hate, awww yeah
19:05 πŸ”— emijrp did you mean sysop?
19:05 πŸ”— db48x oh, that reminds me
19:06 πŸ”— db48x alard: I don't see my custom warc header in the warc file
19:07 πŸ”— SketchCow alard: I mentioned the WARC/WGET thing to the archive.org meeting to great love and admiration
19:07 πŸ”— db48x (other than in the resource record where it echos the wget command line)
19:07 πŸ”— SketchCow I just need to get the new slots going today.
19:07 πŸ”— SketchCow Actually, alard
19:07 πŸ”— SketchCow you already have a slot on the old one.
19:08 πŸ”— SketchCow Just pump it there, while I get my ass together over here
19:08 πŸ”— SketchCow Another few gb aren't going to hurt.
19:09 πŸ”— SketchCow cache# du -sh .
19:09 πŸ”— SketchCow 104G .
19:09 πŸ”— SketchCow FAN TASTIC
19:12 πŸ”— alard SketchCow: WARC/WGET in meeting: cool. Upload to old one: will do.
19:12 πŸ”— alard db48x: Your header is there, it's the last line in the warcinfo record.
19:13 πŸ”— alard (If you look between these awkward hex codes.)
19:13 πŸ”— Schbirid http://www.nytimes.com/2011/08/16/arts/music/springsteen-and-others-soon-eligible-to-recover-song-rights.html?_r=2&pagewanted=all <- yeehaw
19:13 πŸ”— alard It's not a header that appears in every record, just in the warcinfo. It's more of a warc-field, really.
19:13 πŸ”— db48x oooh, in the content of the warcinfo record
19:14 πŸ”— db48x tricky
19:16 πŸ”— SketchCow OK, will be back, mailing out a ton of GET LAMPS I owe
19:29 πŸ”— emijrp Schbirid: did you hear about money, lawyers, money, lawyers, money, lawyers and money?
19:29 πŸ”— emijrp Extending that 35 years is easy.
19:30 πŸ”— emijrp By the way. Fuck that. I use Jamendo.
19:31 πŸ”— Schbirid yeah
19:31 πŸ”— Schbirid but fuck jamendo
19:31 πŸ”— Schbirid so much
19:31 πŸ”— Schbirid gah
19:31 πŸ”— Schbirid vorbis is broken for weeks
19:32 πŸ”— Schbirid they do not post why things were deleted
19:32 πŸ”— Schbirid etc etc etc
19:32 πŸ”— Schbirid i love them
19:32 πŸ”— Schbirid but it is hard
19:32 πŸ”— emijrp ogg broken? torrent you mean?
19:32 πŸ”— Schbirid no, streaming is broken now too
19:33 πŸ”— Schbirid and the full albums are not created anymore either afaik
19:33 πŸ”— emijrp ah streaming, you can download ogg albums using a trick
19:33 πŸ”— Schbirid i know
19:33 πŸ”— Schbirid i have most of them locally ;)
19:33 πŸ”— Schbirid http://www.jamendo.com/en/user/The%20Chilling%20Spirit <-
19:33 πŸ”— Schbirid me
19:33 πŸ”— emijrp ah spirit
19:33 πŸ”— Schbirid yeah
19:34 πŸ”— Schbirid people never get the schbirid :)
19:48 πŸ”— db48x hrm
19:48 πŸ”— db48x I can't remember my archiveteam.org password
19:48 πŸ”— Wyatt Hmm, "The material sent must be educational media. It can't contain advertising, video games, computer drives, or digital drives of any kind." That's kind of unfortunate.
19:49 πŸ”— db48x for media mail? yea
19:49 πŸ”— Wyatt They don't have the chart for weights over 20lbs up anymore, either.
19:49 πŸ”— db48x so much for digital media
19:49 πŸ”— chronomex I didn't know that ...
19:49 πŸ”— Wyatt And archiving games or non-educational magazines(I think?)...
19:50 πŸ”— Wyatt Is oregon trail a video game?
19:50 πŸ”— db48x but on the other hand, it's not like they actually open the mail to see what's in it
19:51 πŸ”— chronomex http://pe.usps.com/text/dmm300/173.htm
19:51 πŸ”— Wyatt Uuuuh.
19:51 πŸ”— Wyatt Bullet two: Media Mail can be examined by postal staff to determine if the right price has been paid. If the package is wrapped in a way that makes it impossible to examine, it will be charged the First-Class rate.
19:51 πŸ”— chronomex check out 4.1 I): Computer-readable media containing prerecorded information and guides or scripts prepared solely for use with such media.
19:51 πŸ”— chronomex the DDM, quoted here, is the final arbiter of mailability
19:52 πŸ”— chronomex s/DDM/DMM/
19:53 πŸ”— chronomex Wyatt: where are you reading this? because it contravenes the DMM.
19:53 πŸ”— Wyatt Huh, so a different part of usps.com has different rules. Interesting.
19:53 πŸ”— Wyatt https://www.usps.com/send/media-mail.htm
19:53 πŸ”— Wyatt Under the Rules and Restrictions tab
19:53 πŸ”— chronomex yeah
19:54 πŸ”— chronomex well. that is wrong.
19:54 πŸ”— chronomex if the postman questions it, cite the DMM at him.
19:54 πŸ”— Wyatt Sweet.
19:54 πŸ”— db48x Wyatt: so what're you mailing?
19:55 πŸ”— db48x http://archiveteam.org/index.php?title=Car_Loans_For_Bad_Credit_Issues_12
19:55 πŸ”— Wyatt At the moment, nothing. I just hadn't heard about media mail and Jason mentioned it.
19:55 πŸ”— db48x probably spam
19:55 πŸ”— Wyatt Sounded handy, so I googled.
19:55 πŸ”— db48x Wyatt: ahh
19:55 πŸ”— chronomex yup
19:55 πŸ”— Wyatt Though I do have a couple boxes of magazines for him once I head back for a visit to my folks.
19:57 πŸ”— Wyatt (Though it's tempting to just drive to New York and visit Strong Museum as a mini vacation sometime.)
20:55 πŸ”— SketchCow MEDIA MAIL
20:56 πŸ”— SketchCow MEDIA MAAAIILLLLLLLLLLLLLLLLLLLLLLLLLL
20:56 πŸ”— SketchCow YOU'LL NEVER FAIL / WITH MEDIA MAIL / IT MOVES LIKE A SNAIL / BUT SO CHEAP SO SALE
20:57 πŸ”— Schbirid watch out for the sketchcow
20:57 πŸ”— Schbirid or he will raise his left brow
20:57 πŸ”— Schbirid stare you down with evil sight
20:57 πŸ”— Schbirid "hope you got your backups right"
20:58 πŸ”— SketchCow http://www.archive.org/details/playboybraile00nlsu by the way
20:58 πŸ”— SketchCow I'm writing a blog entry on it, and when I do, I want reddit juice from you bastards
20:58 πŸ”— Schbirid haha
21:02 πŸ”— * underaway juices SketchCow's reddit
21:05 πŸ”— * alard thinks it would be a fun image processing exercise to write a braille OCR tool.
21:05 πŸ”— Schbirid definitely
21:08 πŸ”— SketchCow http://www.hitechcrimesolutions.com/?p=58688 discusses archiveteam, right at the end.
21:11 πŸ”— Schbirid good night
22:09 πŸ”— Coderjoe http://i.imgur.com/TkwQ9.gif
22:18 πŸ”— SketchCow Win
22:26 πŸ”— chronomex awww
22:33 πŸ”— SketchCow CurateCamp Hangout! https://talkgadget.google.com/hangouts/3cf9fc8494ac1c6454f1104c38d5e4827e1fdfc5?authuser=0&hl=en
22:33 πŸ”— SketchCow Hop in
22:33 πŸ”— SketchCow Don't be loud, just watch the show
22:52 πŸ”— underscor fuck google
22:52 πŸ”— underscor just fuck google
22:52 πŸ”— underscor with a rusty chainsaw
22:54 πŸ”— SketchCow On it
23:07 πŸ”— SketchCow https://talkgadget.google.com/hangouts/b239ac58e9dd5b06df6b5176a7bb1e78685a52d4?authuser=0&hl=en-US#