[00:13] <watttttt> Does anyone know if the punchfork site archive is available somewhere?
[00:13] <watttttt> the status page on the wiki says that there are 0 todo items remaining, so I'm curious how i can get the data
[00:47] <DoubleJ> watttttt: Typically, once everything is uploaded, there are some checks run to make sure the data is in good shape, then it's placed into the Archive Team collection on IA. Later on, it may also be added to the Wayback Machine but that can take a while.
[00:48] <DoubleJ> For now, I don't think it's publically accessible yet.
[00:49] <dashcloud> so, can someone review this: http://www.archiveteam.org/index.php?title=Talk:Rescuing_Floppy_Disks and if the DOS/Windows floppy dumping instructions are good enough, can you put it on the main page there?
[01:09] <InitHello> dashcloud: looks good to me. Might want to note that progressively scanning a floppy like that can cause further corruption, due to how flimsy the media is.
[01:11] <dashcloud> I just had 3.5'' disks- not 5.25'' if that makes a difference
[01:12] <InitHello> 3.5" were probably a bit more robust, what with the hard cover, but I still wouldn't bet my life on them
[01:12] <InitHello> those instructions would also work for 5.25" floppies, anyway
[01:12] <dashcloud> feel free to add to that page or edit as needed
[01:13] * InitHello remembers turning single-sided 5.25" disks into double-sided with the notch trick
[01:13] <InitHello> and single-density 700k into 1.44mb floppies with a similar but unrelated notch trick
[01:17] <InitHello> ... a bucket of meowing creatures is totally kittens, you silly spam checker :<
[01:49] <DFJustin> watttttt: https://archive.org/details/archiveteam_punchfork
[01:49] <DFJustin> that's just the raw data, most likely it will get imported into the wayback machine at some point
[01:50] <DFJustin> oh there's even a fancy search already http://archive.org/download/archiveteam_punchfork_index/
[04:04] <wp494> holy shit, I think my warrior actually managed to DoS myself
[04:04] <wp494> I couldn't even access my router's management page, lol
[04:23] <namespace> wp494: LOL.
[04:24] <namespace> Is there a way to pay to have archive.org content shipped to me on a hard drive?
[07:07] <SketchCow> Back
[11:11] <Nemo_bis> Argh, now how will I finish downloading my magazines to archive http://bugs.winehq.org/show_bug.cgi?id=32617#c5
[20:05] <omf_> anyone grabbed the http://www.yelp.com/dataset_challenge/dataset data?
[20:08] <omf_> speaking of that do we have grabs of the netflix challenge data and the aol research data from years ago?
[20:58] <Vito``> omf_: AOL pulled theirs after the privacy issues popped up.  I have a copy in an S3 bucket somewhere.
[21:00] <Vito``> omf_: I don't think I have netflix
[21:00] <omf_> Yeah I had heard these things had copies everywhere
[21:01] <Vito``> hard to verify some random IRC person's supposed good copy
[21:01] <omf_> Why would someone in here fake a large dataset? I cannot figure out what it would earn them.
[21:02] <Vito``> shrug
[21:02] <Vito``> that said
[21:02] <InitHello> sometimes people do strange things for certain reasons
[21:02] <Vito``> last dump of audioscrobbler data: http://old.vi.to/mavra/audioscrobbler_profiledata_06-May-2005.tar.gz
[21:03] <Vito``> 129.5MB
[21:06] <Vito``> AOL data, 439MB: https://s3.amazonaws.com/qumbler-upload/AOL-data.11549945932358662194121828512141411016.tgz
[21:10] <omf_> thanks Vito``
[21:11] <grmngrl88> http://xeroticmomentsx.blogspot.com/2013/03/amateurgallery.html
[21:14] <Vito``> I'm going to guess, just from the URL, that that's not a) something I should visit at work, and b) archival-related
[21:15] <omf_> I checked it
[21:15] <omf_> it is gifs of porn
[21:15] <InitHello> shocking
[21:15] <omf_> I know. Who ever thought there was pron on the internet
[21:16] <InitHello> especially from someone with a female-sounding name ending in two digits
[22:12] <namespace> ...
[22:13] <namespace> Yeah, spam is weird.
[22:18] <Smiley> The blog that you are about to view may contain content only suitable for adults. In general, Google does not review nor do we endorse the content of this or any blog. For more information about our content policies, please visit the Blogger Terms of Service.
[22:18] <Smiley> lol
[22:36] <omf_> Yeah first class covering your ass
[22:43] <omf_> glitch is up to 644mb and game tome is 513mb
[22:51] <omf_> any more small projects
[22:52] <omf_> I have been trying out the different butts to help add more data to the wiki and grab more sites