#archiveteam 2012-02-06,Mon

โ†‘back Search

Time Nickname Message
04:34 ๐Ÿ”— arrith woah
04:34 ๐Ÿ”— arrith http://btjunkie.org/goodbye.html
04:34 ๐Ÿ”— arrith 2005 - 2012
04:34 ๐Ÿ”— arrith This is the end of the line my friends. The decision does not come easy, but we've decided to voluntarily shut down. We've been fighting for years for your right to communicate, but it's time to move on. It's been an experience of a lifetime, we wish you all the best!
04:58 ๐Ÿ”— arrith http://tech.pnosker.com/2012/02/05/btjunkie-shuts-down-voluntarily/
04:58 ๐Ÿ”— SketchCow Wow.
05:00 ๐Ÿ”— arrith this megaupload thing is having a crazy ripple effect
05:00 ๐Ÿ”— SketchCow As it would be expected.
05:07 ๐Ÿ”— arrith i was likening this megaupload stuff to the piratebay trial, but i guess it's quite different since the piratebay trial was at least all in sweden, but this is basically extraordinary rendition to the US. some world copyright police level stuff
05:15 ๐Ÿ”— SketchCow Right.
05:16 ๐Ÿ”— chronomex US is world police
05:16 ๐Ÿ”— chronomex ;)
05:29 ๐Ÿ”— yipdw America, fuck yeah
05:53 ๐Ÿ”— SketchCow Taxes are gonna hurrrrrrrrrrrrrrrt this year.
05:53 ๐Ÿ”— SketchCow Everyone find me couhes
05:53 ๐Ÿ”— SketchCow couches
05:53 ๐Ÿ”— chronomex oh I bet they will :|
05:53 ๐Ÿ”— SketchCow Also, the advantages of this netbook are somewhat mitigated by how unpleasant it is to deal with.
05:53 ๐Ÿ”— SketchCow Tiny keyboard, screen, and gmail makes it go HURRRRRRRRRRRRRRRRRR for seconds at a time.
05:54 ๐Ÿ”— chronomex wrrrrr
06:17 ๐Ÿ”— SketchCow http://t.co/hRTpgpwx - 270,000 images from an imageboard that went down in 2009.
06:17 ๐Ÿ”— SketchCow Saved! Thanks, DFJustin
06:20 ๐Ÿ”— underscor http://tracker.archive.org/
06:20 ๐Ÿ”— underscor Wheee
06:21 ๐Ÿ”— DFJustin there's actually a fair amount more where that came from, I now have like 70gb worth of konachan.com, 60gb of minitokyo.net, etc
06:23 ๐Ÿ”— DFJustin but that's the only one that's actually straight up gone
06:23 ๐Ÿ”— underscor DFJustin spends all his time on chans
06:23 ๐Ÿ”— underscor 8D
06:24 ๐Ÿ”— DFJustin (รฏยพยŸรขยˆย€รฏยพยŸ)
06:24 ๐Ÿ”— Aranje hahaha
06:24 ๐Ÿ”— Aranje but it's okay because he's `achiving` it all
06:25 ๐Ÿ”— underscor u18chan should be archived
06:25 ๐Ÿ”— underscor lolololol
06:25 ๐Ÿ”— SketchCow I'm up for all sorts of chan archiving.
06:25 ๐Ÿ”— DFJustin someone not in canada can do that shit :P
06:26 ๐Ÿ”— underscor u18chan is nsfw, btw, as a forewarning
06:26 ๐Ÿ”— underscor SketchCow: What is the archive's position on archiving porn chans?
06:27 ๐Ÿ”— SketchCow ha ha position
06:27 ๐Ÿ”— SketchCow reverse cowgirl
06:27 ๐Ÿ”— underscor best position
06:27 ๐Ÿ”— underscor lol
06:28 ๐Ÿ”— DFJustin the konachan rips were all safely uploaded on...megaupload, luckily the torrent is still active
06:34 ๐Ÿ”— SketchCow Poor Megaupload.
06:34 ๐Ÿ”— SketchCow I'd like to know how the megaupload recovery site is working out.
06:36 ๐Ÿ”— SketchCow DFJustin: CPM CD-ROMs going in
06:37 ๐Ÿ”— DFJustin yay
06:37 ๐Ÿ”— DFJustin the cd that doom guy uploaded at http://www.archive.org/details/D1000 seems to still be non-public
06:38 ๐Ÿ”— SketchCow http://www.archive.org/details/cdrom-1994-11-walnutcreek-cpm&reCache=1
06:39 ๐Ÿ”— SketchCow http://www.archive.org/details/D1000
06:41 ๐Ÿ”— godane i got some linux format dvds
06:41 ๐Ÿ”— godane i found some of the linux-format pdfs
06:43 ๐Ÿ”— arrith wow that walnutcreek disk is wild. always read about those guys.
06:45 ๐Ÿ”— Aranje underscor: how long should those tests take? It's failing to change to `works!` text on chrome 18 dev
06:46 ๐Ÿ”— underscor Aranje: Like 2 seconds max
06:46 ๐Ÿ”— Aranje hmm
06:46 ๐Ÿ”— Aranje def givin me shit
06:46 ๐Ÿ”— SketchCow Which is, coincidentally, underscor's nickname in bed
06:47 ๐Ÿ”— Aranje I'd call that convenient more than anything else
06:47 ๐Ÿ”— underscor SketchCow: Fuck you :D
06:47 ๐Ÿ”— * SketchCow drives around town with the girl you love
06:48 ๐Ÿ”— SketchCow http://www.archive.org/details/cdrom-oakcpm-1999-cdrom by the way, DFJustin
06:48 ๐Ÿ”— SketchCow So both those are in
06:48 ๐Ÿ”— SketchCow Just working back the backlog
06:51 ๐Ÿ”— SketchCow Also, this week we're going to begin moving off batcave to the new machine.
06:52 ๐Ÿ”— DFJustin the fortress of solitude?
06:53 ๐Ÿ”— SketchCow I was thinking jokerslair
06:56 ๐Ÿ”— underscor nick
06:56 ๐Ÿ”— underscor nice*
07:00 ๐Ÿ”— SketchCow 2 seconds max
07:12 ๐Ÿ”— kin37ik whats the best set of commands to issue to Wget to crawl something like fortunecity on a windows box?
07:12 ๐Ÿ”— kin37ik crazy idea i know.
07:13 ๐Ÿ”— SketchCow Stick with doing it on Linux. or a unix variant.
07:14 ๐Ÿ”— arrith kin37ik: so you just want a page list, not to do anything?
07:14 ๐Ÿ”— arrith or rather, keep dled stuff
07:16 ๐Ÿ”— kin37ik i want it to keep DL'ed stuff
07:17 ๐Ÿ”— kin37ik @sketch: i would but my linux box is fried, and im waiting on a new board
07:20 ๐Ÿ”— arrith kin37ik: that would be more than crawling then
07:20 ๐Ÿ”— kin37ik arrith: okay
07:20 ๐Ÿ”— arrith kin37ik: could setup a linux dualboot and/or linux vm
07:20 ๐Ÿ”— arrith you can boot a linux partiton that you also dualboot to
07:21 ๐Ÿ”— arrith soemthing like wgat-warc -r something something
07:22 ๐Ÿ”— kin37ik arrith: i have thought about dual booting a couple of times, but i thought if i have a dedicated linux box, then there wasnt much of a point, the board should be here this week
07:23 ๐Ÿ”— arrith kin37ik: ah yeah, just depends on how soon you want to get started
07:23 ๐Ÿ”— SketchCow Going "I'm all out of water bottles, so I'd like to drink donkey urine" is just not a question worth answering.
07:23 ๐Ÿ”— SketchCow Just wait until the board is back, we'll wait.
07:24 ๐Ÿ”— yipdw also, uncontrolled wget -r on something like fortunecity is a bad idea
07:24 ๐Ÿ”— yipdw it is likely that you will end up with (1) a ton of stuff and (2) nothing that you want
07:24 ๐Ÿ”— kin37ik SketchCow: fair call
07:24 ๐Ÿ”— yipdw controlling wget by pointing it at specific URLs and using more controlled forms of recursive retrieval, like --page-requisites, is much better
07:27 ๐Ÿ”— kin37ik yipdw: which is what i intend to be doing first off, as ive found out alot of old pages from back when fortunecity first started are still on the servers untouched for a long time
07:28 ๐Ÿ”— SketchCow I'd just work with other team members to find these pages.
07:28 ๐Ÿ”— SketchCow Remember also we want WARC formats, too.
07:28 ๐Ÿ”— kin37ik SketchCow: WARC?
07:29 ๐Ÿ”— arrith kin37ik: google wget-warc
07:30 ๐Ÿ”— yipdw kin37ik: Web ARChive; it's a way to record not only response bodies, but also the headers associated with that body, as well as request bodies
07:30 ๐Ÿ”— yipdw and headers
07:31 ๐Ÿ”— yipdw kin37ik: http://bibnum.bnf.fr/WARC/warc_ISO_DIS_28500.pdf
07:31 ๐Ÿ”— yipdw WARC can also store information about the retrieving tool, retriever, etc -- all information that you want when building an archive
07:31 ๐Ÿ”— kin37ik yipdw: aaaahhh i see, nifty
07:32 ๐Ÿ”— yipdw more immediately useful, though, is that WARC is standard and there exist tools to read and present it
07:32 ๐Ÿ”— yipdw e.g. Internet Archive's Wayback Machine, the stuff tef builds for Hanzo Archives
07:33 ๐Ÿ”— kin37ik hmmm
07:33 ๐Ÿ”— kin37ik interesting
08:04 ๐Ÿ”— kin37ik woah, just been a 4 car pile up just around the corner
09:07 ๐Ÿ”— LordNlptp eek
11:21 ๐Ÿ”— godane i think i have a local mirror of defcon.org
11:21 ๐Ÿ”— godane its only 1.8gb
11:21 ๐Ÿ”— godane i think
11:46 ๐Ÿ”— Nemo_bis sigh, just deleted 30 GiB of incomplete mobileme profiles
12:13 ๐Ÿ”— godane got to love defcon-6 website
12:13 ๐Ÿ”— godane all pictures are gone and was not hosted on defcon.org
12:53 ๐Ÿ”— SketchCow HEY SO NERD RESEARCH AND QUESTION
12:53 ๐Ÿ”— SketchCow I was told about this: http://git-annex.branchable.com/
12:53 ๐Ÿ”— SketchCow Anyone want to look at it? It's making waves.
13:07 ๐Ÿ”— SketchCow In theory, reading up on it, we could create an archiveteam GIT hub that spans ALL of archive.org's holding of archiveteam stuff, our other collections, you name it.
13:28 ๐Ÿ”— emijrp imagine a torrent with all the btjunky torrents
13:32 ๐Ÿ”— emijrp tiem to archive isohunt, the pirate bay and friends ?
13:47 ๐Ÿ”— SketchCow Yes
13:47 ๐Ÿ”— SketchCow Yes, it was a while ago.
13:47 ๐Ÿ”— SketchCow I assumed someone was on that already
13:52 ๐Ÿ”— Nemo_bis Was http://www.publicbt.com/ archived, to start with?
14:03 ๐Ÿ”— Ymgve Nemo_bis: there's a link right there to download their database
14:03 ๐Ÿ”— Nemo_bis Ymgve, that's what i'm saying :)
14:04 ๐Ÿ”— emijrp ut infohash != magnet
14:04 ๐Ÿ”— Nemo_bis I don't think that's the whole DB though
14:05 ๐Ÿ”— Nemo_bis emijrp, do you want magnet links?
14:05 ๐Ÿ”— Nemo_bis did btjunkie have them? I don't remember
14:06 ๐Ÿ”— emijrp i mean, whatis the point of that publicbt database? it doesnt contain magnets nor torrents
14:06 ๐Ÿ”— Ymgve umm
14:07 ๐Ÿ”— Nemo_bis and what's a value of a torrents database? it doesn't containg magnets nor seeders
14:07 ๐Ÿ”— Ymgve the hash is basically all you need for magnet
14:08 ๐Ÿ”— emijrp Ymgve: ok
14:08 ๐Ÿ”— Nemo_bis starting from that list of hashes you should be able to produce/download everything
14:08 ๐Ÿ”— Nemo_bis that's what btjunkie itself did
14:08 ๐Ÿ”— Ymgve just prepend magnet:?xt=urn:btih: to your infohash and you got a magnet URL
14:15 ๐Ÿ”— Nemo_bis $ wc -l all.txt
14:15 ๐Ÿ”— Nemo_bis 2907061 all.txt
14:15 ๐Ÿ”— Nemo_bis isohunt claims 8,427,266 torrents
14:17 ๐Ÿ”— Nemo_bis and TPB only 4.297.583
14:33 ๐Ÿ”— emijrp ONLY.
14:34 ๐Ÿ”— Ymgve crowdsource the download of every torrent ever
14:50 ๐Ÿ”— emijrp Think about the day seeding all those torrents is like sharing .txt in TEXTFILES.
15:00 ๐Ÿ”— ersi Yay, thousands of dead torrents
15:00 ๐Ÿ”— ersi that'll be awesome
15:16 ๐Ÿ”— closure SketchCow: I know something about git-annex :)
15:16 ๐Ÿ”— closure (since I wrote it)
15:17 ๐Ÿ”— closure SketchCow: [08:07:26] In theory, reading up on it, we could create an archiveteam GIT hub that spans ALL of archive.org's holding of archiveteam stuff, our other collections, you name it.
15:17 ๐Ÿ”— closure yep, it's doable
16:05 ๐Ÿ”— SketchCow You know nothing
16:05 ๐Ÿ”— SketchCow Get out of the way while the experts work on it
16:05 ๐Ÿ”— SketchCow Actually, it's kind of a strange idea.
16:06 ๐Ÿ”— SketchCow What's it use to verify? Not MD5 hashes, right?
16:06 ๐Ÿ”— SketchCow Also, word's come down. Torrents. Let's get all of them. ALL.
16:19 ๐Ÿ”— emijrp I came.
16:20 ๐Ÿ”— nitro2k01 take down *all* the torrents
16:20 ๐Ÿ”— nitro2k01 (insert meme image)
16:21 ๐Ÿ”— nitro2k01 http://knowyourmeme.com/memes/x-all-the-y
16:57 ๐Ÿ”— Ymgve bittorent uses SHA1, I think
17:00 ๐Ÿ”— emijrp I made a script to download TPB. DO WANT?
17:04 ๐Ÿ”— Ymgve nah, got my own
18:06 ๐Ÿ”— closure SketchCow: git-annex uses sha512 hashes by default, but can use any of the decent hashes
18:06 ๐Ÿ”— closure er, sha256 actually
18:18 ๐Ÿ”— closure I keep all my data in a git annex repo that spans many drives etc. I can run stats like this on my netbook:
18:18 ๐Ÿ”— closure local annex keys: 9
18:18 ๐Ÿ”— closure known annex keys: 41578
18:18 ๐Ÿ”— closure known annex size: 7 terabytes
18:18 ๐Ÿ”— closure local annex size: 952 megabytes
21:31 ๐Ÿ”— godane i have a full backup of defcon website
21:31 ๐Ÿ”— godane just the defcon.org part
21:32 ๐Ÿ”— godane but thats about 3gb and most images from 1-18 are there
21:32 ๐Ÿ”— godane also i found out that the audio for defcon 12 doesn't exist anymore

irclogger-viewer