#archiveteam 2013-01-28,Mon

Time Nickname Message
00:09 🔗 SketchCow 508
00:09 🔗 SketchCow root@teamarchive-1:/2/CDDOWN# find . -name \*.rar -size +200M | wc -l
00:09 🔗 SketchCow 508!
00:16 🔗 chronomex five hundred and eight!
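For readers without a shell handy, the count SketchCow ran with `find` could be approximated in Python roughly as follows. This is only a sketch: the function name `count_large_files` and its parameters are made up for illustration, and it assumes "200M" means 200 MiB, as GNU find treats it.

```python
import os


def count_large_files(root, suffix=".rar", min_bytes=200 * 1024 * 1024):
    """Count files under root whose name ends with suffix and whose size exceeds min_bytes.

    Hypothetical helper, not part of any Archive Team tooling.
    """
    count = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if not name.endswith(suffix):
                continue
            path = os.path.join(dirpath, name)
            try:
                if os.path.getsize(path) > min_bytes:
                    count += 1
            except OSError:
                # File vanished or is unreadable; skip it rather than abort the walk.
                continue
    return count
```

Calling `count_large_files(".")` from inside the download directory would correspond to the command in the log, with the difference that this version only looks at file entries and never at directories.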
00:20 🔗 SketchCow It went down to 238k/sec but it went back up to 2.4M/s again, finally.
00:23 🔗 ex-parrot completely idle thought, but would the software library we grabbed during http://www.applefritter.com/aol be of interest to anyone?
00:23 🔗 SketchCow Yes
00:24 🔗 ex-parrot point me at where to upload it and I can organise something I suspect. we've got a bit of metadata to go with each file too
00:24 🔗 SketchCow How big is it?
00:24 🔗 ex-parrot I honestly don't remember how big it got to in the end, probably at least 5 gigs though, maybe 10
00:26 🔗 balrog_ ex-parrot: yes, but where can I find it
00:26 🔗 balrog_ I don't know if it was by any means complete
00:26 🔗 ex-parrot just trying to work that out again... it's been so long since I touched this project (I wrote the script to import it all in to drupal)
00:26 🔗 balrog_ aah.
00:26 🔗 balrog_ is any of the AOL stuff still up?
00:26 🔗 balrog_ I checked like two years back and some was.
00:27 🔗 balrog_ (on AOL)
00:28 🔗 ex-parrot I think probably the best thing to do is to hit up Tom next time he's online and we'll get back to y'all :)
00:28 🔗 balrog_ I've tried to email him three or four times
00:28 🔗 balrog_ never got anything
00:28 🔗 ex-parrot damn ok, I can definitely get in touch with him
00:28 🔗 ex-parrot it would be a shame for the files to just sit festering in drupal forever on this mac mini
00:29 🔗 balrog_ can you do a get info and see how big it is?
00:29 🔗 ex-parrot I think we have a few other moderately sized FTP dumps of shareware and things
00:29 🔗 balrog_ and is what you have all of it, or does tom have more?
00:29 🔗 ex-parrot I don't have the files myself, I just know they're on this server somewhere
00:29 🔗 ex-parrot I'll find out :)
00:29 🔗 balrog_ ah :|
00:29 🔗 balrog_ ok
00:30 🔗 ex-parrot I was hoping I could just find it easily in the drupal admin console but it's been so long since I fiddled with it and the UI makes basically no sense, I think I'll have to wait until Tom wakes up
00:30 🔗 balrog_ do you have ssh access to the server?
00:30 🔗 ex-parrot not currently
00:30 🔗 balrog_ ah. :|
00:31 🔗 balrog_ is the server on the web?
00:31 🔗 ex-parrot yeah, I think at least some of the stuff is just on the main http://www.applefritter.com/ server. it used to be exposed in the drupal interface somewhere along with some metadata
00:31 🔗 balrog_ ahh.
00:32 🔗 balrog_ I'm wondering if anyone's tried to log in with aol for os9 lately
00:32 🔗 ex-parrot ah here we go, http://www.applefritter.com/taxonomy/term/279 are the files we grabbed from info-mac around the same time
00:32 🔗 ex-parrot I will have to see about where the AOL files went
00:32 🔗 ex-parrot I should be able to tar up those info-mac files too
00:33 🔗 balrog_ I doubt that's as rare
00:33 🔗 balrog_ info-mac was widely mirrored
00:33 🔗 ex-parrot yeah, I don't think it's as interesting
00:33 🔗 ex-parrot the metadata might not have been mirrored as widely
00:34 🔗 * balrog_ installs aol 5.0 in sheepshaver
00:34 🔗 ex-parrot I'd be interested to know what happens when you fire that up
00:34 🔗 SketchCow DONE.
00:36 🔗 ex-parrot it's possible we never got as far as even uploading the AOL files to the site. I will have a chat with Tom and find out. I am sure we still have a tarball anyway at least :)
00:58 🔗 dashcloud wow- doing some metadata, I saw an article for a Koala Pad Touch Tablet digitizer- I wasn't aware that kind of device existed that long ago
01:04 🔗 DFJustin if any other AOL file areas are still up I would be very, very interested in getting the contents
01:05 🔗 ex-parrot DFJustin: balrog_ mentioned in -bs that some stuff seems still to be up on the actual AOL service itself if you have the client
01:05 🔗 balrog_ yup
01:05 🔗 balrog_ many seem to be up
01:05 🔗 balrog_ however keywords are non-working
01:49 🔗 DFJustin I used the file areas a lot back in 1996-7 ish and there was a LOT of stuff which is not necessarily available elsewhere, shareware but also digital artwork etc.
02:05 🔗 Lord_Nigh i remember downloading slam.mid from the file area in 1996-7ish and afaik it never showed up elsewhere on the internet ever; the hard disk which contained it had an ic explode and i don't think is recoverable
02:05 🔗 balrog_ you're sure that hdd died? :(
02:05 🔗 balrog_ oh
02:05 🔗 balrog_ hmm
02:05 🔗 balrog_ you know which file area?
02:06 🔗 Lord_Nigh its dead dead. as in chip exploded dead
02:06 🔗 Lord_Nigh was a 500mb old sucker too
02:06 🔗 balrog_ Lord_Nigh: we're talking about this in #archiveteam-bs if you want to rejoin
02:48 🔗 turnkit hdd dead with chip explosion isn't necessarily dead is it? just controller board is dead (swap with exact same model -- some data recoverable -?)
02:49 🔗 turnkit sorry -- to -bs
03:53 🔗 SketchCow http://fos.textfiles.com/CDDOWN/ in case anyone wants to walk it before I start uploading in earnest.
04:28 🔗 turnkit SketchCow: from - http://www.kultcds.com/index.php?lang=en ?
04:28 🔗 SketchCow Yes
04:28 🔗 SketchCow All grabbed!
04:28 🔗 SketchCow Now writing scripts for uploads.
04:29 🔗 DFJustin ah is this that hallfiry guy's collection?
04:29 🔗 DFJustin answer: yes
04:30 🔗 DFJustin sweeeeet
04:33 🔗 lemonkey http://www.jwz.org/blog/2013/01/shes-a-flight-risk-2/
04:48 🔗 turnkit He has a nice frontend. (that sounds weird.) I just found I have a few netpower issues he doesn't have. Going to try to contact him to contribute those. Do you have a way to do an update sync? I mean will his archive show up complete in Wayback or is it a standalone "snapshot" on archive.org. I'm still not up to speed with most of what goes on here.
05:14 🔗 SketchCow Hallfiry guy's been contacting me
05:14 🔗 SketchCow They're deleting all the CD-ROM images
05:22 🔗 balrog_ why, no room, or complaints?
05:45 🔗 SketchCow No idea.
05:47 🔗 BlueMax Have they been grabbed?
05:53 🔗 turnkit UI / delivery / discoverability is just as important as "having" archives. The UI *is* the metadata in a sense. hallfiry's interface is minimalistic but excellent. Hope your archive includes his interface & goes to Wayback where it's maintained. -?
05:53 🔗 turnkit So, SketchCow, you're saying I should archive the few netpower CD's he's missing just like the MacAddicts CD ISOs, to fos? Is that best?
05:57 🔗 SketchCow I have been grabbing ALL his shizzle
06:28 🔗 SketchCow And while I edit the film, it's all getting uploaded now.
06:32 🔗 godane cool
06:32 🔗 godane also i think if i pull off backing up g4tv.com you will almost have to talk about that at some point
06:33 🔗 SketchCow What
06:34 🔗 godane i think the g4tv grab will have to be talked about in one of your speeches
06:35 🔗 godane videos like this: https://archive.org/details/g4tv.com-video3902
06:36 🔗 godane i don't know if you will find that coverage anywhere else
06:37 🔗 godane also g4tv.com is in the 35k+ of videos
06:38 🔗 balrog_ this aol thing... so far all I've found reaffirms the fact that aol is a massive, massive clusterfuck
06:38 🔗 balrog_ and archiving any portion of it will be very, very painful
06:46 🔗 balrog_ it seems they stopped caring in 2003.
07:25 🔗 * SketchCow is shoving in 100 MacAddict CD-ROMs.
07:25 🔗 SketchCow Naturally the ISOs are completely incompatible with the archive.org trickery.
07:26 🔗 SketchCow http://archive.org/details/macaddict_coverdiscs&reCache=1
07:45 🔗 SketchCow I keep talking in here like Turnkit isn't here.
07:45 🔗 SketchCow Sorry, I think of my e-mail buddies as different than IRC buddies.
07:45 🔗 SketchCow I have a LOT of people who mail me, but never use the IRCs
07:58 🔗 turnkit I'm not here. I keep going in the other room to try to pay my bills, and find myself hovering over the keyboard wondering about stuff. FYI I think MacAddict .iso's #1 - #88 are contiguous, after that I've currently only sporadic issues. Bidding on eBay for a batch between 89-125 but it doesn't close for a week.
07:58 🔗 turnkit (by "pay bills" I mean get work done that is supposed to pay me)
08:00 🔗 turnkit godane -- https://archive.org/details/g4tv.com-video3902 is good historical footage... valuable
08:00 🔗 godane there is more at video3901
08:01 🔗 godane so its about 16 to 18mins if i remember all together
08:10 🔗 SketchCow I've uploaded them all.
08:19 🔗 godane turnkit: even more historical footage: https://archive.org/details/g4tv.com-video4352
08:19 🔗 godane its about the global jukebox
08:22 🔗 turnkit Google maps should have Global Jukebox features. lol. BTW there are tell-tale single line (field) tape hits in that footage which reveal what tape format it was stored on at one point.
08:23 🔗 turnkit I think it's BetaSP but it might have been uMatic (egads)
09:11 🔗 godane SketchCow: any chance you get to uploading the bbs interviews this year?
09:15 🔗 SketchCow It's likely.
09:33 🔗 godane thats good
10:38 🔗 godane SketchCow: i would have liked these g4 videos to be in a more g4video-web collection
10:39 🔗 godane this is cause g4video is for complete videos
10:39 🔗 godane i'm only complaining cause i care
10:41 🔗 godane also know that the g4tv.com videos will add over 30k+
17:21 🔗 db48x SketchCow: my account on the File Formats wiki still doesn't work. could you verify my username and email address?
17:21 🔗 db48x a password reset email doesn't reach me, so I suspect the address is wrong
17:34 🔗 xk_id what is a polite rate at which to crawl ~4mil pages of a website in a distributed fashion?
18:01 🔗 chronomex I would answer that but xk_id is no longer with us
18:01 🔗 chronomex lurk moar
18:06 🔗 SketchCow He's adorable!!!!!!
18:23 🔗 db48x is there a polite rate?
18:37 🔗 underscor alard, or anyone who might know, any ideas why I get this when trying to run xanga-grab?
18:37 🔗 underscor http://p.defau.lt/?NDgR01YNIsWOuvuARy1Slg
18:39 🔗 db48x you don't have a module named util
18:39 🔗 underscor Well, yeah
18:39 🔗 underscor But I mean why is it trying to import it if it doesn't provide it
18:45 🔗 db48x underscor: it does provide it: https://github.com/ArchiveTeam/seesaw-kit/blob/master/seesaw/util.py
18:46 🔗 underscor Oh. So my seesaw installation is borked, then
18:46 🔗 underscor Thanks :)
18:46 🔗 db48x glad to help :)
18:48 🔗 DrainLbry what's the methodology for asking Wayback to crawl something immediately again?
18:49 🔗 DrainLbry in the event of deaths, bankruptcies, etc.
18:52 🔗 db48x it's probably in the faq
18:52 🔗 db48x bbl
18:59 🔗 SketchCow http://twitpic.com/byq4ry
19:00 🔗 SketchCow db48x: Start a new account, and then we'll merge and rename it
19:05 🔗 ersi DrainLbry: http://liveweb.archive.org/http://site.to.archive/some_thing/page.html
19:05 🔗 ersi It doesn't support https sites though
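The pattern ersi gives is just the liveweb prefix followed by the full target URL. A minimal sketch of building such a request URL, assuming the service behaves as described in the log (the helper name `liveweb_url` is hypothetical, and whether the endpoint is still available is not confirmed here):

```python
LIVEWEB_PREFIX = "http://liveweb.archive.org/"


def liveweb_url(target):
    """Build a liveweb request URL for a plain-http target, per the pattern in the log.

    Hypothetical helper; it only formats a string and makes no network request.
    """
    lowered = target.lower()
    if lowered.startswith("https://"):
        # The log says https sites are not supported, so refuse early.
        raise ValueError("https targets are reported as unsupported: %s" % target)
    if not lowered.startswith("http://"):
        raise ValueError("expected a full http:// URL, got: %s" % target)
    return LIVEWEB_PREFIX + target
```

Fetching the returned URL in a browser or with any HTTP client would then be the step that triggers the capture, going by ersi's description.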
19:10 🔗 DrainLbry thanks
19:14 🔗 ersi No, thank you. :)
19:16 🔗 godane i have over 100gb of video now
19:17 🔗 godane SketchCow: will you put my g4 web videos in a different collection than g4video
19:18 🔗 godane i want g4video to be for full videos
19:18 🔗 godane g4tv.com has more clips of things
19:21 🔗 DrainLbry I figured you know, the Iranians sending a monkey into space deserved a crawl of the Islamic Republic of Iran Iranian Space Agency website (yeah, it's a thing, isa.ir)
19:21 🔗 SketchCow sdfsfsdfdf
19:21 🔗 SketchCow Yeah, fine
19:21 🔗 SketchCow What would you like it called?
19:21 🔗 godane g4video-web
19:35 🔗 SketchCow http://archive.org/details/g4video-web
19:37 🔗 godane thanks
19:39 🔗 alard underscor: You need version 0.0.12 of the seesaw-kit. I've now placed the version check before the import util.
19:40 🔗 underscor alard: I installed 0.0.10 using the "old" way (as a dev package) and now when I do pip install -U seesaw it "upgrades" but seesaw.__version__ is still 0.0.10
19:40 🔗 underscor is there a way to purge/uninstall the old one?
19:41 🔗 alard I don't know. pip uninstall ?
19:43 🔗 alard Also (SketchCow) the current Xanga estimate is 35TB.
19:44 🔗 underscor http://p.defau.lt/?4K1x_PYuh_8lQ7YKc4kHRQ
19:44 🔗 underscor grrr
19:44 🔗 db48x underscor: heh
19:45 🔗 db48x do you have a seesaw directory for it to find?
19:47 🔗 underscor aha, yup
19:47 🔗 underscor there was one earlier in the path that it found
19:47 🔗 underscor womp. :(
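The problem underscor hit (an older copy earlier on the path shadowing the upgraded package) can be checked for with something like the sketch below. It assumes Python 3, since `importlib.util.find_spec` is not available in the Python 2 of the log's era; `locate_module` and `find_copies` are hypothetical helper names, not part of seesaw or pip.

```python
import importlib.util
import os
import sys


def locate_module(name):
    """Return the file Python would import `name` from, or None if it cannot be found."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return None
    return spec.origin


def find_copies(name, search_path=None):
    """List search-path entries that contain a package directory or module file called `name`.

    More than one hit suggests that an earlier entry is shadowing a later one.
    """
    entries = sys.path if search_path is None else search_path
    hits = []
    for entry in entries:
        base = entry or os.getcwd()  # an empty entry means the current directory
        package_dir = os.path.join(base, name)
        module_file = package_dir + ".py"
        if os.path.isdir(package_dir) or os.path.isfile(module_file):
            hits.append(base)
    return hits
```

Running `locate_module("seesaw")` and `find_copies("seesaw")` would show which copy wins and where any stale ones live; removing or renaming the stale directory is then a manual step.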
20:08 🔗 SketchCow alard: Thanks
20:19 🔗 godane i found a french magazine called TILT microloisirs
20:19 🔗 godane it ran from 1982 to 1994
20:33 🔗 db48x cool. what's a loisir?
20:33 🔗 godane no idea
20:33 🔗 db48x heh
20:39 🔗 alard Loisir means leisure, so it's probably a games magazine?
20:39 🔗 alard I see that underscor is starting to climb the Xanga leaderboard.
20:39 🔗 underscor :D
21:11 🔗 turnkit SketchCow - I can see the MacAddict's on archive.org but the 'super pak 3' disc is not there. Also, is there a way I can help, over time, by adding descriptions on each title, as well as normalize the MacAddict naming in that collection? - i.e. from previous content, one title is "Mac Addict" while the rest are "MacAddict" (space)
21:14 🔗 turnkit naming on older versions slightly different - e.g. "MacAddict 51 November 2000" vs. newer "MacAddict #051" -- http://archive.org/details/Macaddict51November2000 -- http://archive.org/details/macaddict-cd-051
21:15 🔗 turnkit I'd like to fix/add metadata as I have time too. Is there a way for me to do that in an xml file before they get posted so that it's easy for you? Or is it possible I can be allowed to edit the metadata on those directly?
21:15 🔗 turnkit (going off to nap)
21:15 🔗 ersi Have a good nap
21:27 🔗 SketchCow There's no easy way for you to edit them.
21:27 🔗 SketchCow But mail me a list of changes and I can paste them in.
21:38 🔗 turnkit okay... when I get there... will do.
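A list of title changes like the one turnkit describes could be drafted with a small script. The sketch below assumes the newer "MacAddict #051" form is the target style, which the log mentions but does not settle; `normalize_title` is a hypothetical helper, and it drops any month or year text after the issue number.

```python
import re

# Matches "MacAddict" or "Mac Addict", an optional "#", then the issue number.
_TITLE_RE = re.compile(r"^\s*Mac\s*Addict\s*#?\s*(\d+)\b", re.IGNORECASE)


def normalize_title(title):
    """Rewrite a MacAddict issue title as 'MacAddict #NNN'; leave anything else unchanged."""
    match = _TITLE_RE.match(title)
    if match is None:
        return title
    return "MacAddict #%03d" % int(match.group(1))
```

Applied to the two examples in the log, `normalize_title("MacAddict 51 November 2000")` and `normalize_title("MacAddict #051")` both give the same string, so the old and new items would sort together. Titles with no issue number, such as the super pak disc, pass through untouched.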
22:43 🔗 ersi http://www.snaposit.com/shutdown/
22:48 🔗 db48x why did they think that would work out?
22:50 🔗 ersi Dunno, works fine for the linked service (Unlimited storage, 5$/mo)
22:54 🔗 db48x right, so how did they think that they could charge $9 and only store photos?
22:58 🔗 dashcloud backblaze does the $5 backup storage idea
23:00 🔗 db48x right. which is half the cost of snaposit, and snaposit only stores photos
23:00 🔗 db48x how could they have expected that to work out?
23:07 🔗 ersi I dunno, write a freggin blog post about it or something man
23:07 🔗 ersi online diary
23:15 🔗 alard http://tghw.com/blog/well-that-sucks-what-else-you-got
23:16 🔗 Lord_Nigh speaking of mac stuff is the MacAdvocate II cd archived? i have a copy here though the cd isn't in the greatest shape it does read
23:16 🔗 Lord_Nigh if i rip stuff for archiveteam should i follow the redump.org cd dumping guidelines, i.e. correct audio offsets and pregaps and stuff
23:17 🔗 Lord_Nigh that's only relevant for audio and mixed-audio-data cds, but seems that was popular in the early 90s
23:19 🔗 * ersi nods in alard's direction
23:31 🔗 DFJustin don't see macadvocate on ia so unless sketchcow has one keistered we probably don't have it
23:32 🔗 DFJustin if you're set up to do the redump method then go for it but personally I don't bother for stuff like this
23:57 🔗 SketchCow Nope, don't recall that