#archiveteam 2013-03-29,Fri

↑back Search

Time Nickname Message
00:13 🔗 watttttt Does anyone know if the punchfork site archive is available somewhere?
00:13 🔗 watttttt the status page on the wiki says that there are 0 todo items remaining, so I'm curious how i can get the data
00:47 🔗 DoubleJ watttttt: Typically, once everything is uploaded, there are some checks run to make sure the data is in good shape, then it's placed into the Archive Team collection on IA. Later on, it may also be added to the Wayback Machine but that can take a while.
00:48 🔗 DoubleJ For now, I don't think it's publically accessible yet.
00:49 🔗 dashcloud so, can someone review this: http://www.archiveteam.org/index.php?title=Talk:Rescuing_Floppy_Disks and if the DOS/Windows floppy dumping instructions are good enough, can you put it on the main page there?
01:09 🔗 InitHello dashcloud: looks good to me. Might want to note that progressively scanning a floppy like that can cause further corruption, due to how flimsy the media is.
01:11 🔗 dashcloud I just had 3.5'' disks- not 5.25'' if that makes a difference
01:12 🔗 InitHello 3.5" were probably a bit more robust, what with the hard cover, but I still wouldn't bet my life on them
01:12 🔗 InitHello those instructions would also work for 5.25" floppies, anyway
01:12 🔗 dashcloud feel free to add to that page or edit as needed
01:13 🔗 * InitHello remembers turning single-sided 5.25" disks into double-sided with the notch trick
01:13 🔗 InitHello and single-density 700k into 1.44mb floppies with a similar but unrelated notch trick
01:17 🔗 InitHello ... a bucket of meowing creatures is totally kittens, you silly spam checker :<
01:49 🔗 DFJustin watttttt: https://archive.org/details/archiveteam_punchfork
01:49 🔗 DFJustin that's just the raw data, most likely it will get imported into the wayback machine at some point
01:50 🔗 DFJustin oh there's even a fancy search already http://archive.org/download/archiveteam_punchfork_index/
04:04 🔗 wp494 holy shit, I think my warrior actually managed to DoS myself
04:04 🔗 wp494 I couldn't even access my router's management page, lol
04:23 🔗 namespace wp494: LOL.
04:24 🔗 namespace Is there a way to pay to have archive.org content shipped to me on a hard drive?
07:07 🔗 SketchCow Back
11:11 🔗 Nemo_bis Argh, now how will I finish downloading my magazines to archive http://bugs.winehq.org/show_bug.cgi?id=32617#c5
20:05 🔗 omf_ anyone grabbed the http://www.yelp.com/dataset_challenge/dataset data?
20:08 🔗 omf_ speaking of that do we have grabs of the netflix challenge data and the aol research data from years ago?
20:58 🔗 Vito`` omf_: AOL pulled theirs after the privacy issues popped up. I have a copy in an S3 bucket somewhere.
21:00 🔗 Vito`` omf_: I don't think I have netflix
21:00 🔗 omf_ Yeah I had heard these things had copies everywhere
21:01 🔗 Vito`` hard to verify some random IRC person's supposed good copy
21:01 🔗 omf_ Why would someone in here fake a large dataset? I cannot figure out what it would earn them.
21:02 🔗 Vito`` shrug
21:02 🔗 Vito`` that said
21:02 🔗 InitHello sometimes people do strange things for certain reasons
21:02 🔗 Vito`` last dump of audioscrobbler data: http://old.vi.to/mavra/audioscrobbler_profiledata_06-May-2005.tar.gz
21:03 🔗 Vito`` 129.5MB
21:06 🔗 Vito`` AOL data, 439MB: https://s3.amazonaws.com/qumbler-upload/AOL-data.11549945932358662194121828512141411016.tgz
21:10 🔗 omf_ thanks Vito``
21:11 🔗 grmngrl88 http://xeroticmomentsx.blogspot.com/2013/03/amateurgallery.html
21:14 🔗 Vito`` I'm going to guess, just from the URL, that that's not a) something I should visit at work, and b) archival-related
21:15 🔗 omf_ I checked it
21:15 🔗 omf_ it is gifs of porn
21:15 🔗 InitHello shocking
21:15 🔗 omf_ I know. Who ever thought there was pron on the internet
21:16 🔗 InitHello especially from someone with a female-sounding name ending in two digits
22:12 🔗 namespace ...
22:13 🔗 namespace Yeah, spam is weird.
22:18 🔗 Smiley The blog that you are about to view may contain content only suitable for adults. In general, Google does not review nor do we endorse the content of this or any blog. For more information about our content policies, please visit the Blogger Terms of Service.
22:18 🔗 Smiley lol
22:36 🔗 omf_ Yeah first class covering your ass
22:43 🔗 omf_ glitch is up to 644mb and game tome is 513mb
22:51 🔗 omf_ any more small projects
22:52 🔗 omf_ I have been trying out the different butts to help add more data to the wiki and grab more sites

irclogger-viewer