[00:09] 508 [00:09] root@teamarchive-1:/2/CDDOWN# find . -name \*.rar -size +200M | wc -l [00:09] 508! [00:16] five hundred and eight! [00:20] It went down to 238k/sec but it went back up to 2.4M/s again, finally. [00:23] completely idle thought, but would the software library we grabbed during http://www.applefritter.com/aol be of interest to anyone? [00:23] Yes [00:24] point me at where to upload it and I can organise something I suspect. we've got a bit of metadata to go with each file too [00:24] How big is it? [00:24] I honestly don't remember how big it got to in the end, probably at least 5 gigs though, maybe 10 [00:26] ex-parrot: yes, but where can I find it [00:26] I don't know if it was by any means complete [00:26] just trying to work that out again... it's been so long since I touched this project (I wrote the script to import it all in to drupal) [00:26] aah. [00:26] is any of the AOL stuff still up? [00:26] I checked like two years back and some was. [00:27] (on AOL) [00:28] I think probably the best thing to do is to hit up Tom next time he's online and we'll get back to y'all :) [00:28] I've tried to email him three or four times [00:28] never got anything [00:28] damn ok, I can definitely get in touch with him [00:28] it would be a shame for the files to just sit festering in drupal forever on this mac mini [00:29] can you do a get info and see how big it is? [00:29] I think we have a few other moderately sized FTP dumps of shareware and things [00:29] and is what you have all of it, or does tom have more? [00:29] I don't have the files myself, I just know they're on this server somewhere [00:29] I'll find out :) [00:29] ah :| [00:29] ok [00:30] I was hoping I could just find it easily in the drupal admin console but it's been so long since I fiddled with it and the UI makes basically no sense, I think I'll have to wait until Tom wakes up [00:30] do you have ssh access to the server? [00:30] not currently [00:30] ah. :| [00:31] is the server on the web? [00:31] yeah, I think at least some of the stuff is just on the main http://www.applefritter.com/ server. it used to be exposed in the drupal interface somewhere along with some metadata [00:31] ahh. [00:32] I'm wondering if anyone's tried to log in with aol for os9 lately [00:32] ah here we go, http://www.applefritter.com/taxonomy/term/279 are the files we grabbed from info-mac around the same time [00:32] I will have to see about where the AOL files went [00:32] I should be able to tar up those info-mac files too [00:33] I doubt that's as rare [00:33] info-mac was widely mirrored [00:33] yeah, I don't think it's as interesting [00:33] the metadata might not have been mirrored as widely [00:34] * balrog_ installs aol 5.0 in sheepshaver [00:34] I'd be interested to know what happens when you fire that up [00:34] DONE. [00:36] it's possible we never got as far as even uploading the AOL files to the site. I will have a chat with Tom and find out. I am sure we still have a tarball anyway at least :) [00:58] wow- doing some metadata, I saw an article for a Koala Pad Touch Tablet digitizer- I wasn't aware that kind of device existed that long ago [01:04] if any other AOL file areas are still up I would be very, very interested in getting the contents [01:05] DFJustin: balrog_ mentioned in -bs that some stuff seems still to be up on the actual AOL service itself if you have the client [01:05] yup [01:05] many seem to be up [01:05] however keywords are non-working [01:49] I used the file areas a lot back in 1996-7 ish and there was a LOT of stuff which is not necessarily available elsewhere, shareware but also digital artwork etc. [02:05] i remember downloading slam.mid from the file area in 1996-7ish and afaik it never showed up elsewhere on the internet ever; the hard disk which contained it had an ic explode and i don't think is recoverable [02:05] you're sure that hdd died? :( [02:05] oh [02:05] hmm [02:05] you know which file area? [02:06] its dead dead. as in chip exploded dead [02:06] was a 500mb old sucker too [02:06] Lord_Nigh: we're talking about this in #archiveteam-bs if you want to rejoin [02:48] hdd dead with chip explosion isn't necessarily dead is it? just controller board is dead (swap with exact same model -- some data recoverable -?) [02:49] sorry -- to -bs [03:53] http://fos.textfiles.com/CDDOWN/ in case anyone wants to walk it before I start uploading in earnest. [04:28] SketchCow: from - http://www.kultcds.com/index.php?lang=en ? [04:28] Yes [04:28] All grabbed! [04:28] Now writing scripts for uploads. [04:29] ah is this that hallfiry guy's collection? [04:29] answer: yes [04:30] sweeeeet [04:33] http://www.jwz.org/blog/2013/01/shes-a-flight-risk-2/ [04:48] He has a nice frontend. (that sounds weird.) I just found I have a few netpower issues he doesn't have. Going to try to contact him to contribute those. Do you have a way to do an update sync? I mean will his archive show up complete in Wayback or is it a standalone "snapshot" on archive.org. I'm still not up to speed with most of what goes on here. [05:14] Hallifry guy's been contacting me [05:14] They're deleting all the CD-ROM images [05:22] why, no room, or complaints? [05:45] No idea. [05:47] Have they been grabbed? [05:53] UI / delivery / discoverability is just important as "having" archives. The UI *is* the metadata in a sense. hallfiry's interface is minimalistic but excellent. Hope your archive includes his interface & goes to Wayback where's it's maintained. -? [05:53] So, SketchCow, you're saying I should archive the few netpower CD's he's missing just like the MacAddicts CD ISOs, to fos? Is that best? [05:57] I have been grabbing ALL his shizzle [06:28] And while I edit the film, it's all getting uploaded now. [06:32] cool [06:32] also i think if i pull of backing up g4tv.com you all most will have talk about that at some point [06:33] What [06:34] i think the g4tv grab will have to be talked about in one of your speckes [06:35] videos like this: https://archive.org/details/g4tv.com-video3902 [06:36] i don't know if you will fan that coverage anywhere else [06:37] also g4tv.com is in the 35k+ of videos [06:38] this aol thing... so far all I've found reaffirms the fact that aol is a massive, massive clusterfuck [06:38] and archiving any portion of it will be very, very painful [06:46] it seems they stopped caring in 2003. [07:25] * SketchCow is shoving in 100 MacAddict CD-ROMs. [07:25] Naturally the ISOs are completely incompatible with the archive.org trickery. [07:26] http://archive.org/details/macaddict_coverdiscs&reCache=1 [07:45] I keep talking in here like Turnkit isn't here. [07:45] Sorry, I think of my e-mail buddies as different than IRC buddies. [07:45] I have a LOT of people who mail me, but never use the IRCs [07:58] I'm not here. I keep going in the other room to try to pay my bills, and find myself hovering over the keyboard wondering about stuff. FYI I think MacAddict .iso's #1 - #88 are contiguous, after that I've currenly only sporadic issues. Bidding on eBay for a batch between 89-125 but it doesn't close for a week. [07:58] (by "pay bills' I mean get work done that is supposed to pay me) [08:00] godane -- https://archive.org/details/g4tv.com-video3902 is good historical footage... valuable [08:00] there is more at video3901 [08:01] so its about 16 to 18mins if i remember all together [08:10] I've uploaded them all. [08:19] turnkit: even more historical footage: https://archive.org/details/g4tv.com-video4352 [08:19] its about the global jukebox [08:22] Google maps should have Global Jukebox features. lol. BTW there are tell-tale single line (field) tape hits in that footage which reveal what tape format it was stored on at one point. [08:23] I think it's BetaSP but it might have been uMatic (egads) [09:11] SketchCow: any chance you get to uploading the bbs interviews this year? [09:15] It's likely. [09:33] thats good [10:38] SketchCow: i would have liked these g4 videos to be in a more g4video-web collection [10:39] this is cause g4video is for complete videos [10:39] i'm only complaining cause i care [10:41] also know that the g4tv.com videos will add over 30k+ [17:21] SketchCow: my account on the File Formats wiki still doesn't work. could you verify my username and email address? [17:21] a password reset email doesn't reach me, so I suspect the address is wrong [17:34] what is a polite rate at which to distributely crawl ~4mil pages of a website? [18:01] I would answer that but xk_id is no longer with us [18:01] lurk moar [18:06] He's adorable!!!!!! [18:23] is there a polite rate? [18:37] alard, or anyone who might know, any ideas why I get this when trying to run xanga-grab? [18:37] http://p.defau.lt/?NDgR01YNIsWOuvuARy1Slg [18:39] you don't have a module named util [18:39] Well, yeah [18:39] But I mean why is it trying to import it if it doesn't provide it [18:45] underscor: it does provide it: https://github.com/ArchiveTeam/seesaw-kit/blob/master/seesaw/util.py [18:46] Oh. So my seesaw installation is borked, then [18:46] Thanks :) [18:46] glad to help :) [18:48] what's the methodology for asking Wayback to crawl something immediately again? [18:49] in the event of deaths, bankruptcies, etc. [18:52] it's probably in the faq [18:52] bbl [18:59] http://twitpic.com/byq4ry [19:00] db48x: Start a new account, and then we'll merge and rename it [19:05] DrainLbry: http://liveweb.archive.org/http://site.to.archive/some_thing/page.html [19:05] It doesn't support https sites though [19:10] thanks [19:14] No, thank you. :) [19:16] i have over 100gb of video now [19:17] SketchCow: will you put my g4 web videos in a different collection then g4video [19:18] i want g4video to be for full videos [19:18] g4tv.com has more clips of things [19:21] I figured you know, the Iranian's sending a monkey into space deserved a crawl of the Islamic Republic of Iran Iranian Space Agency website (yeah, it's a thing, isa.ir) [19:21] sdfsfsdfdf [19:21] Yeah, fine [19:21] What would you like it called? [19:21] g4video-web [19:35] http://archive.org/details/g4video-web [19:37] thanks [19:39] underscor: You need version 0.0.12 of the seesaw-kit. I've now placed the version check before the import util. [19:40] alard: I installed 0.0.10 using the "old" way (as a dev package) and now when I do pip install -U seesaw it "upgrades" but seesaw.__version__ is still 0.0.10 [19:40] is there a way to purge/uninstall the old one? [19:41] I don't know. pip uninstall ? [19:43] Also (SketchCow) the current Xanga estimate is 35TB. [19:44] http://p.defau.lt/?4K1x_PYuh_8lQ7YKc4kHRQ [19:44] grrr [19:44] underscor: heh [19:45] do you have a seesaw directory for it to find? [19:47] aha, yup [19:47] there was one earlier in the path that it found [19:47] womp. :( [20:08] alard: Thanks [20:19] i found a french magazine called TILT microloisirs [20:19] it ran from 1982 to 1994 [20:33] cool. what's a loisir? [20:33] no idea [20:33] heh [20:39] Loisir means leisure, so it's probably a games magazine? [20:39] I see that underscor is starting to climb the Xanga leaderboard. [20:39] :D [21:11] SketchCow - I can see the MacAddict's on archive.org but the 'super pak 3' disc is not there. Also, is there a way I can help, over time, by adding descriptions on each title, as well as normalize the MacAddict naming in that collection? - i.e. from previous content, one title is "Mac Addict" while the rest are "MacAddict" (space) [21:14] naming on older versions slighltly different - e.g. "MacAddict 51 November 2000" vs. newer "MacAddict #051" -- http://archive.org/details/Macaddict51November2000 -- http://archive.org/details/macaddict-cd-051 [21:15] I'd like to fix/add metadata as I have time too. Is there a way for me to do that in an xml file before they get posted so that it's easy for you? Or is it possible I can be allowed to edit the metadata on those directly? [21:15] (going off to nap) [21:15] Have a good nap [21:27] There's no easy way for you to edit them. [21:27] But mail me a list of changes and I can paste them in. [21:38] okay... when I get there... will do. [22:43] http://www.snaposit.com/shutdown/ [22:48] why did they think that would work out? [22:50] Dunno, works fine for the linked service (Unlimited storage, 5$/mo) [22:54] right, so how did they thing that they could charge $9 and only store photos? [22:58] backblaze does the $5 backup storage idea [23:00] right. which is half the cost of snaposit, and snaposit only stores photos [23:00] how could they have expected that to work out? [23:07] I dunno, write a freggin blog post about it or something man [23:07] online diary [23:15] http://tghw.com/blog/well-that-sucks-what-else-you-got [23:16] speaking of mac stuff is the MacAdvocate II cd archived? i have a copy here though the cd isn't in the greatest shape it does read [23:16] if i rip stuff for archiveteam should i follow the redump.org cd dumping guidelines, i.e. correct audio offsets and pregaps and stuff [23:17] that's only relevant for audio and mixed-audio-data cds, but seems that was popular in the early 90s [23:19] * ersi nods in alard's direction [23:31] don't see macadvocate on ia so unless sketchcow has one keistered we probably don't have it [23:32] if you're set up to do the redump method then go for it but personally I don't bother for stuff like this [23:57] Nope, don't recall that