| Time |
Nickname |
Message |
|
00:09
🔗
|
SketchCow |
508 |
|
00:09
🔗
|
SketchCow |
root@teamarchive-1:/2/CDDOWN# find . -name \*.rar -size +200M | wc -l |
|
00:09
🔗
|
SketchCow |
508! |
|
00:16
🔗
|
chronomex |
five hundred and eight! |
|
00:20
🔗
|
SketchCow |
It went down to 238k/sec but it went back up to 2.4M/s again, finally. |
|
00:23
🔗
|
ex-parrot |
completely idle thought, but would the software library we grabbed during http://www.applefritter.com/aol be of interest to anyone? |
|
00:23
🔗
|
SketchCow |
Yes |
|
00:24
🔗
|
ex-parrot |
point me at where to upload it and I can organise something I suspect. we've got a bit of metadata to go with each file too |
|
00:24
🔗
|
SketchCow |
How big is it? |
|
00:24
🔗
|
ex-parrot |
I honestly don't remember how big it got to in the end, probably at least 5 gigs though, maybe 10 |
|
00:26
🔗
|
balrog_ |
ex-parrot: yes, but where can I find it |
|
00:26
🔗
|
balrog_ |
I don't know if it was by any means complete |
|
00:26
🔗
|
ex-parrot |
just trying to work that out again... it's been so long since I touched this project (I wrote the script to import it all in to drupal) |
|
00:26
🔗
|
balrog_ |
aah. |
|
00:26
🔗
|
balrog_ |
is any of the AOL stuff still up? |
|
00:26
🔗
|
balrog_ |
I checked like two years back and some was. |
|
00:27
🔗
|
balrog_ |
(on AOL) |
|
00:28
🔗
|
ex-parrot |
I think probably the best thing to do is to hit up Tom next time he's online and we'll get back to y'all :) |
|
00:28
🔗
|
balrog_ |
I've tried to email him three or four times |
|
00:28
🔗
|
balrog_ |
never got anything |
|
00:28
🔗
|
ex-parrot |
damn ok, I can definitely get in touch with him |
|
00:28
🔗
|
ex-parrot |
it would be a shame for the files to just sit festering in drupal forever on this mac mini |
|
00:29
🔗
|
balrog_ |
can you do a get info and see how big it is? |
|
00:29
🔗
|
ex-parrot |
I think we have a few other moderately sized FTP dumps of shareware and things |
|
00:29
🔗
|
balrog_ |
and is what you have all of it, or does tom have more? |
|
00:29
🔗
|
ex-parrot |
I don't have the files myself, I just know they're on this server somewhere |
|
00:29
🔗
|
ex-parrot |
I'll find out :) |
|
00:29
🔗
|
balrog_ |
ah :| |
|
00:29
🔗
|
balrog_ |
ok |
|
00:30
🔗
|
ex-parrot |
I was hoping I could just find it easily in the drupal admin console but it's been so long since I fiddled with it and the UI makes basically no sense, I think I'll have to wait until Tom wakes up |
|
00:30
🔗
|
balrog_ |
do you have ssh access to the server? |
|
00:30
🔗
|
ex-parrot |
not currently |
|
00:30
🔗
|
balrog_ |
ah. :| |
|
00:31
🔗
|
balrog_ |
is the server on the web? |
|
00:31
🔗
|
ex-parrot |
yeah, I think at least some of the stuff is just on the main http://www.applefritter.com/ server. it used to be exposed in the drupal interface somewhere along with some metadata |
|
00:31
🔗
|
balrog_ |
ahh. |
|
00:32
🔗
|
balrog_ |
I'm wondering if anyone's tried to log in with aol for os9 lately |
|
00:32
🔗
|
ex-parrot |
ah here we go, http://www.applefritter.com/taxonomy/term/279 are the files we grabbed from info-mac around the same time |
|
00:32
🔗
|
ex-parrot |
I will have to see about where the AOL files went |
|
00:32
🔗
|
ex-parrot |
I should be able to tar up those info-mac files too |
|
00:33
🔗
|
balrog_ |
I doubt that's as rare |
|
00:33
🔗
|
balrog_ |
info-mac was widely mirrored |
|
00:33
🔗
|
ex-parrot |
yeah, I don't think it's as interesting |
|
00:33
🔗
|
ex-parrot |
the metadata might not have been mirrored as widely |
|
00:34
🔗
|
* |
balrog_ installs aol 5.0 in sheepshaver |
|
00:34
🔗
|
ex-parrot |
I'd be interested to know what happens when you fire that up |
|
00:34
🔗
|
SketchCow |
DONE. |
|
00:36
🔗
|
ex-parrot |
it's possible we never got as far as even uploading the AOL files to the site. I will have a chat with Tom and find out. I am sure we still have a tarball anyway at least :) |
|
00:58
🔗
|
dashcloud |
wow- doing some metadata, I saw an article for a Koala Pad Touch Tablet digitizer- I wasn't aware that kind of device existed that long ago |
|
01:04
🔗
|
DFJustin |
if any other AOL file areas are still up I would be very, very interested in getting the contents |
|
01:05
🔗
|
ex-parrot |
DFJustin: balrog_ mentioned in -bs that some stuff seems still to be up on the actual AOL service itself if you have the client |
|
01:05
🔗
|
balrog_ |
yup |
|
01:05
🔗
|
balrog_ |
many seem to be up |
|
01:05
🔗
|
balrog_ |
however keywords are non-working |
|
01:49
🔗
|
DFJustin |
I used the file areas a lot back in 1996-7 ish and there was a LOT of stuff which is not necessarily available elsewhere, shareware but also digital artwork etc. |
|
02:05
🔗
|
Lord_Nigh |
i remember downloading slam.mid from the file area in 1996-7ish and afaik it never showed up elsewhere on the internet ever; the hard disk which contained it had an ic explode and i don't think is recoverable |
|
02:05
🔗
|
balrog_ |
you're sure that hdd died? :( |
|
02:05
🔗
|
balrog_ |
oh |
|
02:05
🔗
|
balrog_ |
hmm |
|
02:05
🔗
|
balrog_ |
you know which file area? |
|
02:06
🔗
|
Lord_Nigh |
its dead dead. as in chip exploded dead |
|
02:06
🔗
|
Lord_Nigh |
was a 500mb old sucker too |
|
02:06
🔗
|
balrog_ |
Lord_Nigh: we're talking about this in #archiveteam-bs if you want to rejoin |
|
02:48
🔗
|
turnkit |
hdd dead with chip explosion isn't necessarily dead is it? just controller board is dead (swap with exact same model -- some data recoverable -?) |
|
02:49
🔗
|
turnkit |
sorry -- to -bs |
|
03:53
🔗
|
SketchCow |
http://fos.textfiles.com/CDDOWN/ in case anyone wants to walk it before I start uploading in earnest. |
|
04:28
🔗
|
turnkit |
SketchCow: from - http://www.kultcds.com/index.php?lang=en ? |
|
04:28
🔗
|
SketchCow |
Yes |
|
04:28
🔗
|
SketchCow |
All grabbed! |
|
04:28
🔗
|
SketchCow |
Now writing scripts for uploads. |
|
04:29
🔗
|
DFJustin |
ah is this that hallfiry guy's collection? |
|
04:29
🔗
|
DFJustin |
answer: yes |
|
04:30
🔗
|
DFJustin |
sweeeeet |
|
04:33
🔗
|
lemonkey |
http://www.jwz.org/blog/2013/01/shes-a-flight-risk-2/ |
|
04:48
🔗
|
turnkit |
He has a nice frontend. (that sounds weird.) I just found I have a few netpower issues he doesn't have. Going to try to contact him to contribute those. Do you have a way to do an update sync? I mean will his archive show up complete in Wayback or is it a standalone "snapshot" on archive.org. I'm still not up to speed with most of what goes on here. |
|
05:14
🔗
|
SketchCow |
Hallifry guy's been contacting me |
|
05:14
🔗
|
SketchCow |
They're deleting all the CD-ROM images |
|
05:22
🔗
|
balrog_ |
why, no room, or complaints? |
|
05:45
🔗
|
SketchCow |
No idea. |
|
05:47
🔗
|
BlueMax |
Have they been grabbed? |
|
05:53
🔗
|
turnkit |
UI / delivery / discoverability is just important as "having" archives. The UI *is* the metadata in a sense. hallfiry's interface is minimalistic but excellent. Hope your archive includes his interface & goes to Wayback where's it's maintained. -? |
|
05:53
🔗
|
turnkit |
So, SketchCow, you're saying I should archive the few netpower CD's he's missing just like the MacAddicts CD ISOs, to fos? Is that best? |
|
05:57
🔗
|
SketchCow |
I have been grabbing ALL his shizzle |
|
06:28
🔗
|
SketchCow |
And while I edit the film, it's all getting uploaded now. |
|
06:32
🔗
|
godane |
cool |
|
06:32
🔗
|
godane |
also i think if i pull of backing up g4tv.com you all most will have talk about that at some point |
|
06:33
🔗
|
SketchCow |
What |
|
06:34
🔗
|
godane |
i think the g4tv grab will have to be talked about in one of your speckes |
|
06:35
🔗
|
godane |
videos like this: https://archive.org/details/g4tv.com-video3902 |
|
06:36
🔗
|
godane |
i don't know if you will fan that coverage anywhere else |
|
06:37
🔗
|
godane |
also g4tv.com is in the 35k+ of videos |
|
06:38
🔗
|
balrog_ |
this aol thing... so far all I've found reaffirms the fact that aol is a massive, massive clusterfuck |
|
06:38
🔗
|
balrog_ |
and archiving any portion of it will be very, very painful |
|
06:46
🔗
|
balrog_ |
it seems they stopped caring in 2003. |
|
07:25
🔗
|
* |
SketchCow is shoving in 100 MacAddict CD-ROMs. |
|
07:25
🔗
|
SketchCow |
Naturally the ISOs are completely incompatible with the archive.org trickery. |
|
07:26
🔗
|
SketchCow |
http://archive.org/details/macaddict_coverdiscs&reCache=1 |
|
07:45
🔗
|
SketchCow |
I keep talking in here like Turnkit isn't here. |
|
07:45
🔗
|
SketchCow |
Sorry, I think of my e-mail buddies as different than IRC buddies. |
|
07:45
🔗
|
SketchCow |
I have a LOT of people who mail me, but never use the IRCs |
|
07:58
🔗
|
turnkit |
I'm not here. I keep going in the other room to try to pay my bills, and find myself hovering over the keyboard wondering about stuff. FYI I think MacAddict .iso's #1 - #88 are contiguous, after that I've currenly only sporadic issues. Bidding on eBay for a batch between 89-125 but it doesn't close for a week. |
|
07:58
🔗
|
turnkit |
(by "pay bills' I mean get work done that is supposed to pay me) |
|
08:00
🔗
|
turnkit |
godane -- https://archive.org/details/g4tv.com-video3902 is good historical footage... valuable |
|
08:00
🔗
|
godane |
there is more at video3901 |
|
08:01
🔗
|
godane |
so its about 16 to 18mins if i remember all together |
|
08:10
🔗
|
SketchCow |
I've uploaded them all. |
|
08:19
🔗
|
godane |
turnkit: even more historical footage: https://archive.org/details/g4tv.com-video4352 |
|
08:19
🔗
|
godane |
its about the global jukebox |
|
08:22
🔗
|
turnkit |
Google maps should have Global Jukebox features. lol. BTW there are tell-tale single line (field) tape hits in that footage which reveal what tape format it was stored on at one point. |
|
08:23
🔗
|
turnkit |
I think it's BetaSP but it might have been uMatic (egads) |
|
09:11
🔗
|
godane |
SketchCow: any chance you get to uploading the bbs interviews this year? |
|
09:15
🔗
|
SketchCow |
It's likely. |
|
09:33
🔗
|
godane |
thats good |
|
10:38
🔗
|
godane |
SketchCow: i would have liked these g4 videos to be in a more g4video-web collection |
|
10:39
🔗
|
godane |
this is cause g4video is for complete videos |
|
10:39
🔗
|
godane |
i'm only complaining cause i care |
|
10:41
🔗
|
godane |
also know that the g4tv.com videos will add over 30k+ |
|
17:21
🔗
|
db48x |
SketchCow: my account on the File Formats wiki still doesn't work. could you verify my username and email address? |
|
17:21
🔗
|
db48x |
a password reset email doesn't reach me, so I suspect the address is wrong |
|
17:34
🔗
|
xk_id |
what is a polite rate at which to distributely crawl ~4mil pages of a website? |
|
18:01
🔗
|
chronomex |
I would answer that but xk_id is no longer with us |
|
18:01
🔗
|
chronomex |
lurk moar |
|
18:06
🔗
|
SketchCow |
He's adorable!!!!!! |
|
18:23
🔗
|
db48x |
is there a polite rate? |
|
18:37
🔗
|
underscor |
alard, or anyone who might know, any ideas why I get this when trying to run xanga-grab? |
|
18:37
🔗
|
underscor |
http://p.defau.lt/?NDgR01YNIsWOuvuARy1Slg |
|
18:39
🔗
|
db48x |
you don't have a module named util |
|
18:39
🔗
|
underscor |
Well, yeah |
|
18:39
🔗
|
underscor |
But I mean why is it trying to import it if it doesn't provide it |
|
18:45
🔗
|
db48x |
underscor: it does provide it: https://github.com/ArchiveTeam/seesaw-kit/blob/master/seesaw/util.py |
|
18:46
🔗
|
underscor |
Oh. So my seesaw installation is borked, then |
|
18:46
🔗
|
underscor |
Thanks :) |
|
18:46
🔗
|
db48x |
glad to help :) |
|
18:48
🔗
|
DrainLbry |
what's the methodology for asking Wayback to crawl something immediately again? |
|
18:49
🔗
|
DrainLbry |
in the event of deaths, bankruptcies, etc. |
|
18:52
🔗
|
db48x |
it's probably in the faq |
|
18:52
🔗
|
db48x |
bbl |
|
18:59
🔗
|
SketchCow |
http://twitpic.com/byq4ry |
|
19:00
🔗
|
SketchCow |
db48x: Start a new account, and then we'll merge and rename it |
|
19:05
🔗
|
ersi |
DrainLbry: http://liveweb.archive.org/http://site.to.archive/some_thing/page.html |
|
19:05
🔗
|
ersi |
It doesn't support https sites though |
|
19:10
🔗
|
DrainLbry |
thanks |
|
19:14
🔗
|
ersi |
No, thank you. :) |
|
19:16
🔗
|
godane |
i have over 100gb of video now |
|
19:17
🔗
|
godane |
SketchCow: will you put my g4 web videos in a different collection then g4video |
|
19:18
🔗
|
godane |
i want g4video to be for full videos |
|
19:18
🔗
|
godane |
g4tv.com has more clips of things |
|
19:21
🔗
|
DrainLbry |
I figured you know, the Iranian's sending a monkey into space deserved a crawl of the Islamic Republic of Iran Iranian Space Agency website (yeah, it's a thing, isa.ir) |
|
19:21
🔗
|
SketchCow |
sdfsfsdfdf |
|
19:21
🔗
|
SketchCow |
Yeah, fine |
|
19:21
🔗
|
SketchCow |
What would you like it called? |
|
19:21
🔗
|
godane |
g4video-web |
|
19:35
🔗
|
SketchCow |
http://archive.org/details/g4video-web |
|
19:37
🔗
|
godane |
thanks |
|
19:39
🔗
|
alard |
underscor: You need version 0.0.12 of the seesaw-kit. I've now placed the version check before the import util. |
|
19:40
🔗
|
underscor |
alard: I installed 0.0.10 using the "old" way (as a dev package) and now when I do pip install -U seesaw it "upgrades" but seesaw.__version__ is still 0.0.10 |
|
19:40
🔗
|
underscor |
is there a way to purge/uninstall the old one? |
|
19:41
🔗
|
alard |
I don't know. pip uninstall ? |
|
19:43
🔗
|
alard |
Also (SketchCow) the current Xanga estimate is 35TB. |
|
19:44
🔗
|
underscor |
http://p.defau.lt/?4K1x_PYuh_8lQ7YKc4kHRQ |
|
19:44
🔗
|
underscor |
grrr |
|
19:44
🔗
|
db48x |
underscor: heh |
|
19:45
🔗
|
db48x |
do you have a seesaw directory for it to find? |
|
19:47
🔗
|
underscor |
aha, yup |
|
19:47
🔗
|
underscor |
there was one earlier in the path that it found |
|
19:47
🔗
|
underscor |
womp. :( |
|
20:08
🔗
|
SketchCow |
alard: Thanks |
|
20:19
🔗
|
godane |
i found a french magazine called TILT microloisirs |
|
20:19
🔗
|
godane |
it ran from 1982 to 1994 |
|
20:33
🔗
|
db48x |
cool. what's a loisir? |
|
20:33
🔗
|
godane |
no idea |
|
20:33
🔗
|
db48x |
heh |
|
20:39
🔗
|
alard |
Loisir means leisure, so it's probably a games magazine? |
|
20:39
🔗
|
alard |
I see that underscor is starting to climb the Xanga leaderboard. |
|
20:39
🔗
|
underscor |
:D |
|
21:11
🔗
|
turnkit |
SketchCow - I can see the MacAddict's on archive.org but the 'super pak 3' disc is not there. Also, is there a way I can help, over time, by adding descriptions on each title, as well as normalize the MacAddict naming in that collection? - i.e. from previous content, one title is "Mac Addict" while the rest are "MacAddict" (space) |
|
21:14
🔗
|
turnkit |
naming on older versions slighltly different - e.g. "MacAddict 51 November 2000" vs. newer "MacAddict #051" -- http://archive.org/details/Macaddict51November2000 -- http://archive.org/details/macaddict-cd-051 |
|
21:15
🔗
|
turnkit |
I'd like to fix/add metadata as I have time too. Is there a way for me to do that in an xml file before they get posted so that it's easy for you? Or is it possible I can be allowed to edit the metadata on those directly? |
|
21:15
🔗
|
turnkit |
(going off to nap) |
|
21:15
🔗
|
ersi |
Have a good nap |
|
21:27
🔗
|
SketchCow |
There's no easy way for you to edit them. |
|
21:27
🔗
|
SketchCow |
But mail me a list of changes and I can paste them in. |
|
21:38
🔗
|
turnkit |
okay... when I get there... will do. |
|
22:43
🔗
|
ersi |
http://www.snaposit.com/shutdown/ |
|
22:48
🔗
|
db48x |
why did they think that would work out? |
|
22:50
🔗
|
ersi |
Dunno, works fine for the linked service (Unlimited storage, 5$/mo) |
|
22:54
🔗
|
db48x |
right, so how did they thing that they could charge $9 and only store photos? |
|
22:58
🔗
|
dashcloud |
backblaze does the $5 backup storage idea |
|
23:00
🔗
|
db48x |
right. which is half the cost of snaposit, and snaposit only stores photos |
|
23:00
🔗
|
db48x |
how could they have expected that to work out? |
|
23:07
🔗
|
ersi |
I dunno, write a freggin blog post about it or something man |
|
23:07
🔗
|
ersi |
online diary |
|
23:15
🔗
|
alard |
http://tghw.com/blog/well-that-sucks-what-else-you-got |
|
23:16
🔗
|
Lord_Nigh |
speaking of mac stuff is the MacAdvocate II cd archived? i have a copy here though the cd isn't in the greatest shape it does read |
|
23:16
🔗
|
Lord_Nigh |
if i rip stuff for archiveteam should i follow the redump.org cd dumping guidelines, i.e. correct audio offsets and pregaps and stuff |
|
23:17
🔗
|
Lord_Nigh |
that's only relevant for audio and mixed-audio-data cds, but seems that was popular in the early 90s |
|
23:19
🔗
|
* |
ersi nods in alard's direction |
|
23:31
🔗
|
DFJustin |
don't see macadvocate on ia so unless sketchcow has one keistered we probably don't have it |
|
23:32
🔗
|
DFJustin |
if you're set up to do the redump method then go for it but personally I don't bother for stuff like this |
|
23:57
🔗
|
SketchCow |
Nope, don't recall that |