[01:34] alard: Thanks. [03:59] Bing. [04:02] * Jiro looks for live people. [04:02] SketchCo one [04:04] *^^&^& connection problems... [04:04] Yes, I'm one. [04:05] Do you need something? [04:05] Oh no, zero! [04:05] * Jiro was mostly curious if Usenet was ever archived. I know about Utzoo and Google groups... [04:06] I recall that making the Usenet archive involved several different sources, such as old CD-ROM Usenet distributions. [04:06] Theoretically, one would hope other people could get ahold of those though they're pretty rare. [04:06] We grabbed a copy that was one of the main sources. [04:07] http://archive.org/details/utzoo-wiseman-usenet-archive [04:07] From that, olduse.net was made [04:07] -----^ "I know about Utzoo" [04:07] I see. [04:07] "I was mostly curious if anything is sweet. I know about candy bars and sugar." [04:08] The ones I was referring to came after that )overlapping it I believe) and cert6ainly are not covered by it. [04:08] Ah. [04:08] Well, good luck finding them. [04:08] Let us know when you have them, we'll make a copy. [04:08] * Jiro figured as much. They need archiving. I could suppose we can hope Google will give archivists copies if they ever delete them... [04:09] Also, Google Groups brings up a message "The old Google Groups will be going away soon. [04:09] Switch to the new Google Groups." [04:10] Ah, yes. [04:10] Well, we grabbed a copy of Google Groups' files and webpages. [04:10] But not the lists, yet. [04:10] It's been like that for a while, actually. [04:10] Yeah, thanks. [04:10] Please, please sit here and tell me this. [04:10] All night. [04:11] I'm not trying to tell you anything, I'm wondering what's going on. [04:11] And I told you. [04:11] And now you have a mission. [04:11] Good luck, here's a candy bar. [04:11] (They're sweet) [04:11] Well, I was trying to tell you that there were Usenet distributions on CD but you didnb't seem to know that. [04:11] I did. [04:11] * nitro2k01 grabs popcorn [04:11] Thanks, I guess. [04:12] You are confusing "didn't know" with "might not have yet" [04:12] http://archive.org/search.php?query=usenet%20AND%20collection%3Acdbbsarchive [04:12] I assume you mean things like this [04:12] He's gone. (I DON'T SAY!) [04:12] Is he? [04:12] I stopped tracking quits and joins. [04:12] -!- Jiro [~arromdee@208.65.89.134] has quit [Remote host closed the connection] [04:12] Ah. [04:13] Before You are confusing... [04:13] "Archive Team are a bunch of jerks. I told them they were missing stuff and they showed me they are aware of it and yet they didn't thank me for pointing it out." [04:14] Cnogratulations on finding the most ungrateful job in the universe, Jason [04:14] This is nowhere near that. [04:14] Jizzmopper. [04:14] That one's tough. [04:14] Jizmopper Intern [04:14] also awful [04:14] Then again you might be turned on by jizz [04:14] Unpaid jizzmopper intern [04:14] "for experience" [04:15] In the burgegoniong jizzmopping industry [04:15] (Any interest in the Dell edocs library, BTW? I have about 80% of it) [04:15] I take everything. [04:15] Hey, so two issues. [04:15] 1. I am back home, yay [04:15] 2. This means I downloaded FEZ for the Xbox [04:16] the two forces may fight [04:16] Tell me where to drop it; once it's done I'll start sending it on [04:17] I've heard Fez is excellent; you're in for a good time [04:17] How big is it? [04:17] Dunno offhand; 10-15GB tops? [04:18] About 90% of it is redundant foreign language manusl [04:20] I could clean it if you'd like [04:23] So, wait, it's you and nitro awake right now? [04:23] I have a project, I'd love to fling someone at it. [04:23] I'm not awake much longer; what's up? [04:23] Someone comfortable with bash or perl or whatever. [04:23] No, this is longer term. I'll bring it up tomorrow. [04:23] Alright. I gotta get up at 4am for something [04:23] Sweden. 06:24 AM. Been up all night. [04:25] No worries. [04:25] This is not time critical. [04:26] At all. [04:26] FEZ NOW [04:27] Now I'm curious what that project may be, though [04:50] anothe one for the "websites that time forgot" list http://www.completelyfreesoftware.com/ [04:53] Hey. HEY! http://www.completelyfreesoftware.com/we1_w31.html [04:53] They have WinBar [04:53] I used this when I was 11 years old or so and thought it was the coolest thing [04:54] We also do not currently support any 64-bit operating system as these are generally designed for commercial use. [04:55] * chronomex nods from his amd64 laptop [04:55] primarily used for porn, irc, and personal email [04:56] Jesus said: .Go into all the world and preach the good news to all creation. Whoever believes and is baptised will be saved, but whoever does not believe will be condemned.. [04:56] [Mark 16:15-16 NIV] [04:56] Could this key save your life? [04:56] Click on it to find out. [05:03] SketchC0w I have Child of Eden and it's really cool, how's Fez [05:06] do you have a mouse? click here to find out! [05:07] I remember some of those joke apps with a moving ok button (Win3.1 era) [05:07] Tab, space [05:08] Nothing happened [05:08] I was disappoint [05:08] I wanted a cookie [05:08] did you happen to check your cookies afterward? [05:09] I think you may be too young... [05:09] i remember some of that bs [05:10] how far back has that been going on? [05:10] Point is, no internet at home, no cookies to check [05:10] (I wanted a cookie referred to a symbolic chocolate chip cookie) [05:10] anyway... SketchC0w , i'm not awesome at bash, but would also love to hear [05:11] oh, my bad, skimmed over the part where this wasn't a hilarious website joke [05:11] (Win3.1 era) [05:11] apps [05:11] win3.1 could internet [05:11] apps is what i overlooked [05:12] I'm glad we've come to agreement [05:12] * nitro2k01 touches his nose [05:28] Started my FEZ journey. [05:28] Good stuff. [05:28] Need to get some stuff in gear on Fortress, our remaining box for the moment. [05:29] SketchC0w, FEZ is best played while high [05:29] Thanks. [05:29] I'll mark that down under the line item "for if you ever get high" [05:30] I assume that's the only item on the list [05:31] oh but of course [05:32] I just agreed to be a wedding photographer. [05:37] When the bride says yes, scream "YOU DON'T SAY!" [05:40] nah, he should do a Peter Griffin [05:40] http://www.youtube.com/watch?v=RWgpFhA9Ve8 [07:10] Yay [07:12] What's up [07:15] rather, what's down? [07:16] What's left [07:17] ↑↑↓↓←→←→BA [07:17] 30 lives [07:17] I was just playing Contra today since I got my new SNES USB controller today [07:27] I love how Contra never seems to stray too far from its roots. [07:27] Well...mostly. [07:29] www.youtube.com/watch?v=FGq4kVcVz9U I made this video a few minutes ago [07:37] hmmmm [07:37] wow that was nice, 2 mobileme sites iwth like 3 pages each :D [07:37] and hten - "898 files" [08:14] knol closes tomorro [08:14] w [08:14] i [08:14] k [08:14] i [08:14] e [08:14] p [08:14] d [08:14] i [08:14] a [08:14] wins [08:14] http://www.archiveteam.org/index.php?title=Knol [08:14] 700,000 knol urls http://db.tt/GNrEh61y [08:19] we didn't even bother to archive it? [08:19] there's a knol project and project channel [08:20] ersi, what's the channel? [08:20] and how long will knol be online still [08:20] It's closing tomorrow for what is known [08:21] #klol [08:21] ah yeah, was greppin' my logs for the channel name [08:22] SORT of down to 959gb of friendster. [08:28] knolwikip, is the metadata linked from the wiki page on archive.org? [08:29] no [08:29] just dropbox [08:30] knolwikip, why don't you put it on archive.og, sounds better [08:31] do it you [08:32] aww [08:43] http://vimeo.com/26629985 [08:44] He does the opening act for the archiveteam conference [08:51] I thought the opening act was beer [08:51] And the second act was hookers [08:51] And the final act was beer again [08:51] you're forgetting something [08:51] hookers and blow [08:52] blow blow blow blow [08:52] archivist fantasies [08:53] no, archivists fantasize about floppy disks coming back [08:54] between Ada Lovelace's boobs [08:55] who? [09:00] SketchC0w: Every mobileme uploader is now uploading directly to s3, no longer using batcave. (It proxied over 120 TB.) [09:01] wait on [09:01] wait no [09:01] I'm shoving my shit in at this very moment [09:01] to batcave or fos or wherever the github upload-finished.sh pointed at six hours ago [09:01] But not using seesaw-s3, right? [09:01] no [09:01] ./upload-finished.sh [09:02] This is all about the s3 uploader. upload-finished and normal seesaw upload to fos. The s3 uploader used batcave as a proxy, but now it doesn't. [09:02] I had it sitting on my box for a few months I guess [09:02] ok [09:02] I think the rsync on batcave has been down for quite some time, so if you're uploading you're uploading to fos. [09:02] ok [09:02] fos it is then [10:10] Thanks. [10:26] I think statusboard.archive.org still lives on batcave. [10:26] It appears to be - that user account keeps updating. [10:27] cool stuff [10:27] Shenzhen keeps busy [17:17] WOOOOOOO [17:17] Really? No activity in 6 hours? [17:19] there were lots of bots running on fos [17:40] Someone's uploading hundreds of punk/underground zines to archive.org. [17:40] I'm going after byte as a treat after I get a few more things done. [17:44] hi jason [17:44] Hey. [17:45] At some point I need to deal with your screenshot uploads. [17:45] They need to be a little more organized, it'll make getting them into the collection a little easier. [17:45] yes, maybe [17:46] i'm working away this week, as in i'm leaving in three quarters of an hour, so you might want to put that off until i get back if you need me [17:46] and after that, fortunecity screenshots :) [17:46] I'm not hurting at the moment, but I go into your directory and it's a little butt-y [17:46] (i've got them, just need to copy them somewhere) [17:47] SketchCow: as in the .headers and .info files? [17:47] It helps me a LOT if you either upload them as a .tar.gz or a .zip or set each sequence in a directory so I can do something with them. [17:47] Each 'set' should be in a directory. [17:47] I will take a directory and make it a .zip, or you will, depending. [17:47] That's what should go into archive.org. [17:47] gotcha [17:48] I don't mind FOS doing this work - but if it can be avoided, the machine isn't chugging on it. [17:48] so group them up into batches of a hundred or something? [17:48] Right now, it's chuggling like crazy on mobileme stuff. [17:48] I don't think of hundreds. [17:48] I think of megs/gig.s [17:48] ah, gotcha [17:48] So, say, 500mb pieces. [17:48] I can put all these .zips into an item. [17:49] Only people who would want these are going to go to one item. [17:49] 500mb would only be two pieces [17:49] uh three [17:57] Well, are you going to make more beyond that? [17:57] If so, 500 [17:57] If not, 100 [17:59] SketchCow: i can't say what will happen in future, i didn't expect to be grabbing 1.5-ish gb of fortunecity screenshots but i did [17:59] but okay, i will use my judgment [18:00] Go with 100mb. [18:01] and bundle the .headers and .info files with them? [18:02] Yes please. [18:03] k :) [18:03] i'll sort it out once i am back home [18:04] ha ha, I've been deleting a directory from FOS for 2 hours. [18:05] good grief [18:05] mv x /dev/null ? [18:05] might be faster for all i know [18:06] (althought that sounds dangerous) [18:09] .....and THERE we go. [18:09] It just finished. It's just a lot of files on a drive being hammered. [18:09] I'm trying to .tar up a lot of files that are in this case, friendster. [18:10] ah :) [18:10] This was easily, 1,000,000 files. [18:10] that's a whole lotta friendster [18:11] But it's NOT. [18:11] That's what so CRAZY [18:12] it's a million things! [18:13] i mean just pretend each one of those things is worth $1 [18:19] super, 13 minutes before i leave is a real good time to not be able to find my glasses [18:20] winr4r: They're over there on the thing. [18:20] haha, hey Wyatt [18:28] i'm out now, catch you all later some time :) [18:40] I'm trying to write a splinder-combiner. [18:42] mm [18:42] the s3 endpoint seems to have broken on me [18:47] Operations will happen on us/u... [18:47] root@teamarchive-1:/2/BATCAVE2/SPLINDER# sh chokeabitch splinder-11 us/u [18:47] Already a ug in /splinder/us/u. We have to go DEEPER. [18:47] Surprising! There is no /splinder/us/u/ud [18:47] Surprising! There is no /splinder/us/u/ue [18:47] Already a uh in /splinder/us/u. We have to go DEEPER. [18:47] Why call it "chokeabitch"? I honestly don't know. [18:47] make sure you don't miss any spaces in your paths [18:48] Oh, I always quote out to shit. [18:48] or rather, don't introduce any that shouldn't be there [18:48] Chokeabitch is basically looking at this massive set of 20+ directories from people. [18:49] And then I am basically going through, saying "anything not on this one, move over there" [18:49] It's just helping me. I can't do some sort of major thing because one mistake wipes shit out. [18:50] hm [18:50] I wish I had several dozen TB to play with a replicable database of ArchiveTeam panic grabs [18:50] throw shit in CouchDB and see what happens [18:50] no idea why that would be good or what it would do [18:51] but throwing massive amounts of shit around is always hilarious, or at least monkeys seem to think that [18:51] if [ ! -d "/2/BATCAVE2/SPLINDER/splinder/$CHECKA" ] then echo "Woah, nelly. There isn't a $CHECKA in the main file. Go up a level." exit 1 [18:51] fi [18:51] That's the line - if there's no main directory to put it TO, then stop. [18:52] So if I shove us/u/uk/uka into the main one, and the main one only has /us/u and no /us/u/uk, it'll stop. [18:53] Woah, nelly. There isn't a us/u/ui in the main file. Go up a level. [18:53] root@teamarchive-1:/2/BATCAVE2/SPLINDER# sh chokeabitch splinder-11 us/u/ui [18:53] see? [18:53] here is what I have learned the hard way 100 times [18:53] Never put a cd .. in a bash script [18:54] If you do, you are a wearing a big clown suit and you are jumping into a fire [18:54] You will be a burning clown. [18:54] That sounds like an adequate assessment [18:56] here we go. [18:56] Already a ug in /splinder/us/u. We have to go DEEPER. [18:56] Moving ud to /splinder/us/u/ud [18:56] Moving ue to /splinder/us/u/ue [18:56] Operations will happen on us/u... [18:56] root@teamarchive-1:/2/BATCAVE2/SPLINDER# sh chokeabitch splinder-11 us/u [18:56] Already a uh in /splinder/us/u. We have to go DEEPER. [18:56] Second run. [18:56] Already a ug in /splinder/us/u. We have to go DEEPER. [18:56] Already a uh in /splinder/us/u. We have to go DEEPER. [18:56] Operations will happen on us/u... [18:56] root@teamarchive-1:/2/BATCAVE2/SPLINDER# sh chokeabitch splinder-11 us/u [18:56] And there we have it. [18:59] So, here is an AWESOME WAY TO WRITE DOCS [19:00] It's a very subtle one. [19:00] After I move the directory's contents, I should do a "what the fuck" and remove the parent IF I've moved all the directories in that parent away [19:00] i.e. so you move f/file1 and f/file2, what the fuck, do a rmdir on f, see if that thing's done. [19:00] But like all sane people, I hate 1,000 "directory not empty" errors. [19:01] Turns out, there's a feature in gnu for that. [19:01] --ignore-fail-on-non-empty [19:01] BUT, here's the description. [19:01] ignore each failure that is solely because a directory is non-empty [19:01] SO. [19:01] Does this mean "Just don't print the failure you couldn't remove it?" [19:02] or does it mean "Don't let the fact it's not empty stop you from removing it?" [19:02] Now, I just ran some tests. It means the first. [19:02] But holy crap would you have been sad if it was the second. [19:11] 11:54:19 <@SketchCow> Never put a cd .. in a bash script [19:11] how about pushd? [19:12] WITCH [19:12] * chronomex burns quietly [19:12] Put on the clown suit [19:12] OK, running a script now. [19:13] It's now doing, oh, a week of work in an hour. [19:15] So that's good. [19:20] work /= 40; [19:21] :D [19:29] Moving sak to /splinder/it/s/sa/sak [19:29] Moving sao to /splinder/it/s/sa/sao [19:29] Moving sap to /splinder/it/s/sa/sap [19:29] Moving sau to /splinder/it/s/sa/sau [19:29] Moving say to /splinder/it/s/sa/say [19:29] Six billion of those. [19:39] S3 just broke [19:39] I don't know what that does to our uploads. [19:39] Do we have error checking for it? [19:41] Doesn't curl do that? [19:42] but does our script work properly with s3 down? [19:43] I would expect it to fail to upload anything and go to the retry loop, but I'm probably missing some nuance of how S3 plays into the equation. [20:09] the history of computers is "I would expect it to .... aaaaaaaa" [20:11] Wyatt, I seriously doubt it does so. [20:12] When pload fails, it throws a completely useless HTML (!) error to the log. [20:13] so, when i build an URL like this http://liveweb.archive.org/http://coolsite.com I'm adding the site to the WayBack machine? how cool is it? [20:22] Page cannot be crawled or displayed due to robots.txt. [20:22] "or displayed" [20:23] is that a trick by IA to archive and not show? [20:23] lol [20:23] well [20:23] if they have saved stuff and you change your robots.txt to say "don't crawl this" [20:23] they'll stop displaying it retroactively, but they won't delete what they have [20:24] so it's actually laziness, so they don't have to look and see if they have anything for it [21:52] SketchCow: what's this bash project you were talking about yesterday? [22:00] Not yours [22:01] SketchCow: I've got plenty of time now; whatcha need? [22:28] OK. [22:28] Here's it in basic. [22:29] Right now, we have these .ISO files. [22:29] In the CDBBS section. [22:29] What I would love is to make an HTML file. [22:29] This file would be generated from the ISO files. [22:29] If a HTML file has a files.bbs paradigm, generate an HTML that can be clicked on [22:29] This would then go to the download link for the exact files. [22:30] this needs more explanation. [22:32] An HTML placed as description of the item? OR as embedded file? [22:37] Ideally, there will be a file in the item. [22:37] You click on it and it gives you a listing. Let me find an example. [22:38] http://archive.org/details/BestOfMegaGamesForDOS [22:38] Poor example. One moment. [22:40] So HTML ToCs for the ISOs [22:40] OK, I need a moment. [22:40] More than that. [22:40] There are, often, FILES.BBS files on the disk. [22:40] So descriptive listings could be had. [22:40] And then they link back to the permanent URLs. [22:41] Are there permanent URLs for internal files? I'm just seeing links to the .iso [22:41] Wow, I didn't realize it'd get crazy. [22:41] yes there are. [22:41] http://archive.org/download/cdrom-pcsig12/pc-siglibrary12thedition1993.iso/RUN%2FFASTYPE%2FREADME.TXT [22:42] Tah day [22:44] Ah, clever [22:44] Yes, it's nice. [22:44] And underutilized. [22:44] But wow, I didn't realize we really have that huge shitpile there. [22:45] I mean, it's GREAT [22:45] but discovery is awful [22:45] I have to think more.