[00:36] WHY HELLO [00:37] ahoy maytee [00:46] ohai [00:47] I wish I could have AT members hanging here [00:47] I'm speaking on archive team stuff, they put me on a suite [00:47] Is this Ann Arbor? [00:59] SketchCow: whens the talk going up? [01:28] Oh who knows [01:45] The fuck, they just dropped off a fruit and cheese plate [01:46] I just walked downstairs and toured the ballroom I'm speaking at tomorrow. [01:49] Were there mints on the pillows? [01:50] Nah, it's marriott. [01:53] Fruit and cheese, still classier than any Marriott I've been in [01:53] Yeah, it's a weird marriott. [01:53] And no porn, not that I'm checking. [01:54] Because? two guesses. [01:54] (guessing as to why there will be no porn) [01:54] What city is this? [01:54] hmmmm [01:54] Ann Arbor/Ypsilanti [01:54] It's owned by evangelicals? [01:54] Eagle Creek Resort, which is just a marriott with a conference center. [01:55] No. [01:55] I mean it is, but that's not the reason. [01:55] corporate purchasing card? [01:55] No, this is at all marriotts. [01:55] They're losing millions [01:55] Million Mom March or some similar bullshit? [01:55] disputes over people disagreeing that they watched it? [01:56] Cleaning issues? [01:56] Nope! [01:56] Much more insidious! [01:56] MUCH more insidious [01:56] CONGRESS [01:56] Janitors were watching? [01:56] hd porn costs too much for too little return? [01:56] No, and no and no. [01:56] Here we go! [01:56] Romney is running for president [01:56] ahhhhhh [01:56] He's on the board of marriott [01:56] ahahhaa [01:56] That'll do it [01:56] They've yanked it while he's runnning [01:56] that'd do it it [01:57] sorry, I'm drinking at work [01:57] kind of intoxicated [01:57] Bravo [01:58] my coworker's homebrew is excellent [01:59] beer or liquor? [01:59] beer [01:59] speaking of, I should go stock up on liquor before they privatize all the liquor stores [01:59] Pennsylvania? [01:59] washington [02:00] state stores close down at the end of the month [02:00] and I'm not sure if I trust grocery stores to carry applejack [02:00] Ah, yeah - it'd take time for liquor stores to open up [02:00] sure as hell they won't sell it for cheaper [02:11] hahaha, I just drunkenly called the grocery store to ask whether they'll be carrying a particular liquor [02:12] Do they? [02:12] the person who answered didn't know and said to call back tomorrow before 5 to catch the wine steward [02:13] D: [02:19] Your supermarkets have somolliers? Classy [02:19] hardly [02:19] more like "pimply 18 year old in charge of making sure hobos don't steal the wine" [02:20] Rofl - a wine steward that can't drink [02:20] idk, I've never asked [02:20] they're probably old enough [02:53] s/insidious/stupid/ [02:54] SketchCow in the stizzate [03:29] Coderjoe: It is, but negligibly so. [03:29] It's just 7z l, so it's not too expensive [03:38] eh? [03:38] oh [03:39] make me connect your seemingly-random utterance to something i said 6 hours ago, without saying anything else [03:40] 7z l on a tar file still requires parsing through the entire tar file. granted, it isn't too bad if not compressed. [03:42] but on a 100+GB tar file with a few thousand files, that can still take awhile [03:42] sorry [03:43] results are cached for 24h after last request too though [03:43] so it's not as expensive if it gets popular [03:43] * Coderjoe wanders off to do stuff like pack and try to get the company laptop ready [03:44] Ah, phew. Makes me feel like less of a jerk for using it so much lately :P [03:44] lol [03:44] nah, it's fine [03:44] we want it to be a first class citizen! [03:44] It's one of my projects. [03:44] one of the ones I can talk about anyway [03:44] lol [03:47] It's fantastic - it'd be impossible to item-level index without it [03:47] (at least, on this collection) [03:47] http://i.canvasugc.com/ugc/p/canvas_rytff.png [03:50] nitro2k01: nice! [03:51] haha [06:52] SketchCow: I just noticed the GoNintendo archive is not up to date [06:54] like it stop at episode 46 on archive.org but its currently at 353 [06:55] the worse part is the list only goes down to 329 [06:55] but the path a least works for episodes 210 and up i think [08:43] godane: good ridance [08:43] ugh [08:43] today is going to suck so bad [08:44] I didn't get any sleep, because i was tossing and turning and worrying about waking up on time [08:45] and I have a three hour drive to the other side of the state this morning, working over there for who knows how many hours, then possibly a break, then SketchCow's aadl presentation, and at some point a three hour drive home, arriving home no earlier than 2am [14:33] HI GANG [14:33] Working on the effort [14:33] The slides. [14:42] -!- the effort is now known as The Slides [15:20] Hi all, Tabblo needs your help! [15:20] There's a seesaw script if you want to help: http://archiveteam.org/index.php?title=Tabblo#How_to_help_archiving [15:21] For those on Windows (or unwilling to install things), try the ArchiveTeam Warrior appliance: http://archive.org/details/archiveteam-warrior [15:21] Tracker: http://tabb.heroku.com/ [15:24] alard: how does that stuff work? is it easy to setup? [15:25] i mean the tracker and automatic assignment of task [15:25] Easy for whom? [15:25] ah poop, maybe we should have used wget-warc for fileplanet [15:25] me! :) [15:25] Ah, it's relatively simple, if you know what to do. [15:26] https://github.com/ArchiveTeam/universal-tracker [15:27] cheers [15:53] Hi everyone, [15:53] I heard back from Lisa, and unfortunately Connecticut is not a popular location for SIGS. Our event, and the other events that day have very low registrations. We currently have 4 non-speaker registrants, one event was cancelled and the third event also has low numbers. [15:53] hahahah [15:54] So this person has been wanting me to come in and speak for, like, months [15:54] They've had google voice chats to discuss how they will do each thing [15:54] this, that the other thing [15:54] Lot of work [15:54] P.S. for free [15:54] Anyway, as the note says, this conference currently has 4 attendees [15:55] :'( [15:55] Not the one I'm AT [15:55] The one that wants me to talk in June [15:56] I may have myself a free day [15:56] ah ... yeah [15:59] ------------------------------------------ [15:59] THE NEW TABBLO PROJECT IS IN EFFECT YOU PEOPLE [15:59] GET THE CLIENTS RUNNING - WE ONLY HAVE A FEW DAYS [15:59] ----------------------------------------- [15:59] http://archiveteam.org/index.php?title=Tabblo#How_to_help_archiving [16:01] Need to get on that soon as I get home. [16:04] Dear Mac OS X users, if you exist - brew tap mistydemeo/archiveteam; brew install wget-lua [16:14] alard: something funky with the detection of lua on ubuntu [16:16] pberry: Oh? [16:16] maybe I'm just doing it wrong [16:17] but even after I was sure I had 5.1 it kept saying I needed Lua [16:17] I'm trying the script with the exit on the check commented out [16:18] Do you also have the liblua5.1-0.dev ? [16:19] aha [16:19] that will probably fix it [16:20] \o/ [16:20] alard: thanks [16:28] wow, check out all the new tabblos, all about the site going down [16:29] OK, disappearing for a little - giving keynote! [16:29] one of them is 100% archiveteam http://www.tabblo.com/studio/stories/view/1850589/?nextnav=recent [16:30] Hah! [16:30] I like how Tabblo still has the "Want to make your own?" etc. header, like it's not about to disappear. [16:32] wget-warc needs lua now? [16:33] how's the fortress looking? [16:38] hmm, I should be able to get quite close to tabblo.com on the network and zip thru the zip downloads [16:39] * closure powers up a machine.. [16:40] closure: wget-warc-lua does. [16:41] what's the lua used for, out of curiousity? [16:43] That's where the logic is. This is very simple: https://github.com/ArchiveTeam/tabblo-grab/blob/master/dld-tabblo-user.sh [16:43] but this Lua script is doing the work: https://github.com/ArchiveTeam/tabblo-grab/blob/master/tabblo.lua [16:44] oh ok, filtering etc [16:45] btw, I hope that the embedded S3 creds are not abusable.. [16:46] Shhhhhh. [16:48] ;) [16:49] is the lua support going into wget upstream? [16:56] every tabblo user has been 520k thus far... is this normal or an issue? [16:57] I've got one that was 3m, and one that was 9m [16:57] * Aranje fires up another downloader on another machine [16:58] ooh, there goes a bigger one. nm. [17:01] is the downloader sensible about not wiping out hd space? [17:01] which is to say, will it stop just before it fills a drive? [17:02] aww, wrong datacenter [17:03] darn, I *used* to be in tabblo's datacenter. [17:03] evicted [17:12] The VirtualBox appliance seems to be working nicely [17:21] it is pretty cool to start a download and get this 10 minutes later http://archive.org/details/archiveteam-tabblo-10 [17:24] Can you run multiple warc's on the same box? [17:24] Or does that... piss off tabblo? [17:26] oh, this range stuff is confusing [17:27] so confusing [17:27] Sui, http://archiveteam.org/index.php?title=Tabblo#How_to_help_archiving If you have extra other boxes [17:27] * closure redoes from start [17:28] yeah i was reading that [17:28] i'm wondering if i xenserver has a NAT network [17:28] closure: Do you know if we can run multiple seesaw's on the same box? Will tabblo you if that happens? [17:28] s/tabblo you/tabblo ban you/ [17:29] haven't tried [17:31] SketchCow: hahaha [17:31] damnit [17:32] Aranje, I'm running 5 seesaws on one machine right now, no issues thus far but it hasn't been even an hour yet. [17:32] * Aranje nods [17:32] can you just run them on top of eachother? [17:32] same dir? [17:33] yes. [17:33] awesome [17:33] Aranje: try running ten [17:33] we have the bandwidth [17:33] better [17:34] lemme just spawn 10 new screen tabs [17:34] SketchCow — saws == seesaws? [17:35] okay that's... [17:35] I dunno how many [17:35] Enough that I've got 0-9 [17:35] and maybe more [17:36] btw, who admins the wiki? Registration seems to be kind of broken. [17:36] Aranje: look at the tracker [17:36] has been for a while, mistym, i believe it was disabled because of spam [17:36] aww fucking yeeeee [17:37] good evening, archive team [17:37] winr4r: It doesn't seem to be outright disabled, so much as it throws a php error [17:37] mistym: yes [17:39] Two questions: 1) What's lua? 2) Which version of it do I need to make the tabblo downloader work? [17:40] It's a programming language, and 5.1 is working just grand for me [17:40] lua is a embedded scripting language [17:40] So I need to download a language interpreter to make wget work? Do I even want to know? [17:42] DoubleJ: You need a language interpreter to run the wget scripts. [17:44] DoubleJ: it's just another library [17:44] you don't need a Lua development environment to run wget [17:44] http://tabb.heroku.com/ Leaderboard style tracker for the tabblo project [17:45] codinghorror? [17:45] Jeff Atwood is doing AT now? :P [17:46] :D [17:46] lol [17:47] not that I'd mind, it'd be pretty cool if that actually was him [17:48] Archiving with the Stars [17:49] next up, Joel Spolsky [17:54] my two servers duking it out [17:55] lmao [17:55] lmao [17:55] Aranje: your server is stuck on four instances, so it's going extra slow [17:57] do I have to upload stuff later or is this inline [17:57] upload later [17:57] just making sure [17:58] Sui/Aranje: The Tabblo seesaw script uploads as you go. [17:58] Oh, it does?! [17:58] Awesome :D [17:58] Yes, it's download - rsync - delete - repeat. [18:22] me and beard are duking it out while aranje is like "I'm on the moon!" [18:28] hello [18:28] is there any way i can ask archive.org to make a snapshot of a website/ [18:29] l-fy: Which site? [18:30] http://www.crimelecomunismului.ro [18:31] we will need that because the website will be changed, and for political reasons is important to have a full archive of the website today [18:32] I don't know if it's possible to get a specific Wayback request, but we can grab it and put it up [18:32] shaqfu: that will be cool [18:33] You said it'll be changed in a few hours? [18:34] maybe a day [18:34] and it's important to keep a recorded copy from a international organisation so that the backup will be believed [18:35] I can't get it - no way I can grab the site in time on my line - but hold on... [18:36] =================================== [18:36] IF ANYONE WANTS TO BE A POLITICAL HERO [18:36] i can grab it [18:36] lol [18:36] WE NEED A BACKUP OF http://www.crimelecomunismului.ro/ [18:36] i already started wget --mirror [18:37] FUTURE ROMANIANS WILL THANK YOU [18:37] but it has to be backup by an internation archive [18:37] ======================== [18:37] that should be the right flag? [18:37] damn [18:37] Sui: I think [18:37] right on [18:37] i'll let you know when it finishes [18:37] Sui: Using wget-warc? [18:37] no, should i? [18:37] Yeah [18:38] i'm new to all this [18:38] It'll grab headers necessary for wayback, and it'll give added legitimacy [18:38] Which seems to be an issue here [18:38] oh [18:38] oops [18:38] well, should i stop it? [18:38] Yeah [18:39] We're time-limited here; gotta get it right the first time [18:40] shaqfu: how can i grab it to be ok? [18:40] will wget-warc-lua work? [18:40] i can make a copy in Romania and transfer latter [18:40] Sui: Hm, dunno; never used that one [18:40] l-fy: Is it a different page based on location? [18:41] it's working [18:41] no shaqfu [18:41] l-fy: Should be fine either way, then [18:41] and much faster on a 100mbit line than my house [18:41] yes, i have a 100mbit connection in .ro [18:41] and usually that should be faster than .us [18:41] yeah [18:42] grab from both, tar and diff? [18:43] Hm? [18:43] Sui: how can i grab the website? [18:43] l-fy: wget-warc [18:43] Install it, point it at the site, set up options, go make a sandwich [18:44] Does -m include --convert-links? [18:44] And --page-requisites? [18:44] Total wall clock time: 3m 36s [18:44] Downloaded: 1466 files, 603M in 1m 53s (5.32 MB/s) [18:45] there seems to be just a git [18:46] that site seems to have a bunch of flash links, you may want to follow those manually and make sure they get grabbed [18:47] someone else needs to scrub, i'm terrible at noticing small links [18:47] :( [18:47] the flash items on the front page all seem to point to different hostnames. are those relevant, too? [18:48] which ones? [18:49] for example their historical photo archive: http://fototeca.iiccr.ro [18:49] there is just one to www.fenomenulpitesti.ro [18:49] yes [18:49] that has to be backup [18:49] is a part of iiccr.ro [18:49] one moment [18:50] damn fototeca.iiccr.ro is one of the most important stuff :( [18:50] i didn't knew that is on a different website [18:50] i have a better link [18:51] iiccr.ro with any subdomain [18:51] www.crimelecomunismului.ro is the same as www.iiccr.ro [18:52] what else to mirror then [18:53] anything that has a link from there and is in iiccr.ro domain [18:53] someone else find a use for this http://108.170.13.180/crimelecomunismului.tar.bz2 [18:55] that site also generates its menu via javascript (see http://cdn.iiccr.ro/menu/ieiiccr_menu_ro.js) [18:55] Sui: i'm getting the backup from you [18:55] ok [18:55] i'm also grabbing fototeca [18:56] superb [18:56] thank you [18:57] i'm probably gonna get in trouble for this, thank goodness i have a company in front of me [18:57] note: my company [18:57] shall i grab fenomenu [18:57] l-fy: So what exactly is happening that puts the site at risk? Regime change? [18:58] shaqfu: yes [18:58] we got back a communist goverment [18:58] seems like they're merged with some other institute? [18:58] and they've just removed the president of the institution behind this website [18:59] and that happened today [18:59] Gotcha; hence the rush [18:59] Sui: Once you get everything, upload to archive.org [18:59] patrickg: that announcemnt doesn't have a date [18:59] however, since the most important job of this institution was the website :) [18:59] and promoting the website :) [19:00] oh god i'm mirroring the .ro cia [19:00] if i didn't love the internet any more than this [19:00] l-fy: Just letting you know, AT isn't affiliated with archive.org, so if that's a major issue, let us know [19:00] Sui: what? [19:00] no [19:01] So if it absolutely must be an international org, you might want to contact elsewhere - but I don't think they'll work fast enough [19:01] this is an institution that was studied how communism destroyed Romania, and provide documents and reports [19:01] ok [19:01] shaqfu: i was thinking that archive.org can do a great job [19:01] since they've mirrored some of the website [19:03] l-fy: They'll do a fantastic job hosting it, but if legitimacy's an issue, we're not archive.org [19:03] right now i'm concerned just to not lose anything [19:04] it's still important to have a mirror [19:05] Sui: does it help if I provide you the links of that js-menu as entry points to your crawler? [19:05] i'm just using wget-warc, but sure [19:05] Sui: sorry, I don't know how smart that tool is [19:06] ok, so Sui i'm downloading your archive [19:07] for my personal archive [19:07] now, how can i get archive.org to archive this? [19:07] Sui: http://pastebin.com/ZrG79GvB - extracted from the english and romanian menu-js. the menus might point to places that aren't accessible otherwise (no idea) [19:08] l-fy: Upload it [19:08] shaqfu: how can i do that? [19:08] oh [19:08] patrickg: i got those dirs [19:08] filled with pdfs [19:09] l-fy: Register here: http://archive.org/account/login.createaccount.php [19:10] Then, in the top right, upload button [19:10] l-fy: what about http://www.youtube.com/user/iiccmer/? [19:11] o jeez [19:11] i will ask about backuping that [19:13] https://github.com/rg3/youtube-dl [19:14] damn, how do i get a git? [19:14] i only know svn and cvs [19:15] and google doesn't work because my badwidth is full [19:15] l-fy: git clone git://path (or whatever protocol you use) [19:15] l-fy: You can download the source as a tarball from here if you don't want to clone w/ git: https://github.com/rg3/youtube-dl/downloads [19:15] http://108.170.13.180/fenomenulpitesti.tar.bz2 [19:16] tiny 22M website [19:16] Sui: thank you dude [19:16] aha [19:16] clone [19:16] Sui: did you got fonoteca? [19:16] i can tell you right now, fototeca is humongous [19:16] it's still going [19:16] :( [19:16] o shit [19:19] there's actually an archive of it from 2010 on archive.org [19:19] Which may mean there's a more recent crawl that hasn't gone public yet. [19:19] should i keep crawling or should i save these people bandwidth [19:20] Sui: Keep going; might as well get the most recent possible version of the site [19:20] 361M fototeca.iiccr.ro/ [19:20] Yeah, there might be a gap between whatever the last version the wayback machine has and what you're getting now. [19:20] current status [19:20] also [19:20] we just rolled under 200k on tabblo [19:21] me and Aranje launched seesaw on my two servers [19:23] thank you, thank you guys [19:24] * l-fy apreciate a lot all the effort [19:24] l-fy: It's really awesome you're doing this. [19:24] mistym: no, just that i have a hard time to forget my first 10 years of life [19:24] I understand. [19:25] l-fy: did you do something horrible then? [19:26] chronomex: no, just that .ro was under the communist regime [19:26] ahh [19:26] l-fy: I mean, it's very good to get this done while there is time. [19:27] mistym: someone has to do it [19:28] and now i don't live there anymore [19:29] and i live in SF [19:29] i actually been yesterday at internet archive for a meeting [19:29] l-fy: The PDA one? [19:29] neato [19:29] no [19:29] something with burning man [19:30] Hm, is this the first time AT's dealt with regime change? [19:30] burning man it stuff meeting thing [19:31] actually I think the answer is no, shaqfu [19:31] AT's? [19:31] we did some archiving of egypt [19:31] l-fy: archiveteam [19:31] chronomex: Ah, neat [19:33] is something different [19:33] because .ro is part of the UE [19:33] but [19:34] this traces may disapear [19:35] 570M fototeca.iiccr.ro/ [19:35] crap [19:36] ok, i will backup that latter [19:36] i have to go to work [19:37] we're in the same timezone, so i shouldn't miss you coming back [19:37] Sui: can you give me the link please? [19:37] foto is still running [19:38] it's currently mirroring the letter M [19:38] it looks like it's going in alphabetical order [19:39] Sui: Do you know what to do to upload? [19:40] And it might be best to wait for l-fy to get back to write a desc, since he's familiar with it [19:40] i'll pm him the link [19:41] Awesome [19:41] Sui: i'm a she :) [19:41] thank you [19:42] no problem [19:42] the latest backup from archive.org for iiccr.ro is april 2011 [19:45] wow, we should be done with tabblo within the day [19:45] we went from 200k to 198k in 24 minutes [19:46] well, 199 [19:48] Nice! [19:49] You guys may have this done before I can even start my computer on it when I get back from work. :V [19:49] if we keep going at this speed, it should be done in 5 hours by my horrid math [19:49] wait no [19:50] erase that from the internet [19:50] 24 minutes times 200 = 80 hours [19:50] ^ [19:50] pardon my brain being fried [19:50] getting hakin9 magazine [19:51] good [19:51] i've grabbed the big one [19:51] you should steel yourself for the image dump [19:51] it's gonna be at least 1.5gb [19:52] this has 500MB [19:52] 799M fototeca.iiccr.ro/ currently [19:54] oh, I should have said five days, not five hours [19:55] Sui, how do I get my screen back [19:55] you're attached still [19:55] -Udr [19:55] tks [19:56] i have to go now [19:56] alard: with dld-tabblo-zip.sh, can I delete data/ after running and rerunning? I assume it's been uploaded [19:56] i'll be back latter [19:56] i'll get you that file linked [19:58] hmm [19:58] I want to know what makes them slow to a crawl after a while [19:58] bye [19:58] restarting the scripts fixes it, but it just stops downloading stuff after a bit [19:58] it stops on certain people [19:58] hmm [19:59] i've got 20 right now [19:59] I just restarted my 10 [19:59] they do it from the beginning too [19:59] closure: Yes, you can delete it. If you're running more than one instance you should check which directory you remove. [19:59] i hope whoever owns that rsync server is ok [20:01] I wonder if there's a way to add in a bit of logic to have it move on to a new user if it sits there for more than x seconds [20:01] we're currently downloading at 1k per 20 minutes [20:01] Total wall clock time: 1h 6m 13s [20:01] Downloaded: 49167 files, 775M in 7m 31s (1.72 MB/s) [20:01] fototeca mirrored [20:01] 198x20 is? [20:01] 4000? [20:03] at this speed, 66.66 hours [20:03] so 10.5 days or so [20:03] err [20:03] 5.5 [20:03] I can't math lol [20:03] which is under the 10 we have, hopes [20:03] don't worry, i said five hours earlier [20:03] off by a zero! [20:03] :D [20:03] if we had more machines we could drop that a bit [20:04] *more screen sessions [20:04] should I launch another? [20:04] :> [20:04] i've got two full right now [20:05] if you look at the graph, you can tell right where i launched the next ten about eleven minutes ago [20:07] Check it [20:07] whoa [20:07] look at the count [20:08] I know [20:08] 1k in 10 minutes [20:08] Yeah I'm not even going to talk about how many screen windows are open [20:08] it's glorious [20:08] mainly because I have no fucking idea [20:10] Aranje: look now [20:11] I see that :D [20:11] We gonna have a war to see who can launch more seesaws? [20:11] :D [20:11] just don't break the server [20:12] two per second [20:14] so yeah, if you guys have the resources, you can run as many of these as you want [20:14] Are you sure you're getting real responses and not just error pages? [20:14] We've broken services before. [20:14] This might be useful for someone else: [20:14] Hey, the Tabblo version of wget-warc, with Lua, has a dependency that's not in the default Ubuntu install: liblua5.1-0-dev [20:15] So, install it? :P [20:15] what version of ubunu [20:15] *ubuntu [20:15] Let me check. [20:16] Pronoiac: do 'lsb_release -a' [20:16] I just looked at /etc/issue. [20:16] It's running 10.10. [20:17] that's why [20:17] too old [20:17] that's almost two years old, mate [20:17] Yup. [20:17] I hadn't noticed, but yup. [20:18] My bad; I'd thought it might help someone else. [20:19] Heh :) [20:19] tabblo speed is now 6 minutes per 1k users [20:20] less than a day left, 19 hours [20:20] Has anyone estimated a completion date for the MobileMe mirror lately? [20:25] Yeah, 30 June. [20:25] Hello, just started helping with the tabblo archival - Just to confirm, I can run multiple instances of seesaw.sh in the same directory without things breaking? [20:26] Yes. [20:26] well that is just excellent, thanks [20:28] i've got something like 60 [20:28] i lost count [20:33] i think we're at tabblo's limit [20:34] Well, we'd like to have useful data, not errors. [20:36] welp, I tried to bid on this http://www.rrauction.com/bidtracker_detail.cfm?IN=309 for release to the public domain or at least archive.org, but I've been pushed out by The Old Money :/ [20:37] I know it's not digital but I thought it would be of interest [20:45] xarph: Surprised that's not at SI/NARA [20:50] And it's tempting - hell of a treasure at $2000 [20:50] be my guest, I bow out of an auction when it reaches how much I pay for rent [20:50] except I don't know if you can register now, I believe the cutoff was a while ago [20:52] He's not in the JSC oral history archives, hrm [20:54] And neither is it in NASA's oral history holdings... [20:54] well this was dictation for a for-profit book, so I doubt it fell under any rules for government archiving [20:54] I wasn't sure if he donated it afterwards or not [20:55] Ick, requires references? [20:56] ? [20:57] Eh, too much trouble, esp. since it's about to close [21:07] Not sure who set up the awesome virtual machine image, but I just downloaded it today and when I tried to do the tabblo, I'm getting errors like "./wget-warc-lua: error while loading shared libraries: liblua5.1.so.0: cannot open shared object file: No such file or directory\nERROR (127)" [21:08] I'm running linux so I just installed the tabblo stuff outside the VM and it's working just fine, but thought I would report that something is weird with the warrior VM's tabblo setup [21:08] Thanks for tellin' [21:09] mbeckler: Thanks for the report. It's supposed to download and install liblua5.1.so, but apparently it doesn't in your case. [21:09] that's what I figured, but didn't know much about how to diagnose it [21:12] mbeckler: Can you run apt-get? oh. [21:20] Does anyone that watches have any idea how to grab video from justin.tv? [21:25] okie doke, we're back in the game on the aldrin tapes auction (http://www.rrauction.com/bidtracker_detail.cfm?IN=309). it's currently at $1840. My max proxy bid is currently at $2244. Bids after that are $2716, $3287, $3978, and then it gets ridiculous. I'll keep the channel advised of the status. If we win, hopefully you can find me some good people in silicon valley that will be able to digitize these tapes correctly. :> [21:26] ersi what OS are you on? [21:26] Then would comes the fun part which is getting clearances to put then back into the public domain where they belong. [21:31] xarph: I know archives in Philly/NYC that can handle them and want to, but that's the wrong side of the nation [21:32] S[h]O[r]T: Linux, Ubuntu mostly [21:48] there is rtmpdump, ive tried it before with some sucess but it can be complicated. also not sure if the streams are protected [21:48] there is streamtransport for windows which works great for flash stuff if the stream isnt protected [21:48] hi [21:48] http://code.google.com/p/get-flash-videos/ that too apparently [21:54] i'm uploading welcome to the scene videos [21:54] only version2.0 or season 2.0 right now [21:55] item id will be welcometothescene_version2.0_xvid [21:55] i also added the theme song Maylynne Catch Me [21:56] I am now high bidder at $2716 [22:04] i remember that series hehee [22:16] only three episodes are on archive.org [22:17] so i putting up the full season 2 as on item to make things easier for me [22:17] l-fy: Would you like to write up a desc of the .ro site/ [22:17] shaqfu: sure [22:17] where can i do it? [22:18] l-fy: Do you have everything you want to upload yet? [22:18] no [22:18] i don't have fototeca and fenomentul pitesti [22:19] Once you start uploading stuff, you can add it to the desc section of the upload screen [22:19] Or, write something collection-level [22:20] i also want to get hak5 season 1-3 uploaded to archive.org [22:21] since finding those are going to get hard has time goes by [22:57] now bidding $3287 [23:00] l-fy: did you get the links in your email? [23:00] yes Sui [23:00] thank you [23:01] no problem, you said you didn't get them earlier [23:01] the photo site was smaller than i expected [23:03] more than 700MB? [23:03] 770 [23:03] if compressed a bit better, it'd fit perfectly on a cd-r [23:05] now at $3978 with a max at $5826. If it gets above that… I can't keep bidding. [23:05] <3 [23:07] xarph: what you want to buy? [23:07] http://www.rrauction.com/bidtracker_detail.cfm?CFGRIDKEY=309 [23:08] if I win (and it can be legally cleared), the contents will be made public domain [23:08] but that's step… 40. [23:08] currently on step 1 [23:08] not 40, probably more like 6 [23:08] well you guys are the experts on these things. :> [23:09] I'm just a space memorabilia nut [23:09] acquire, digitize, publish ... 3 steps, I'm probably missing something. [23:10] make sure the tape contents aren't covered by the copyright for the published memoir they were used for, get permission from aldrin in writing, clear it legally in case there are any libel issues... [23:10] well, as long as you get the tapes, nothing else really matters [23:10] the 2nd is the only one I'd really worry about [23:10] they can be digitized, and then shoved into a dark archive if need be [23:11] yah, but like I said…. step one :> [23:11] please, for the sake of history, even if you can't get clearance, please do shove into archive.org and work with them, they can keep it dark for a long time [23:11] so no one will see them until the end of time, but they'll be well cared for [23:11] but you probably already know this [23:11] yep. [23:13] Looks like it's getting out of range :( [23:13] We'll see [23:15] What time does it end? [23:15] right now, 19 minutes away. If there's another bid, the clock is reset to 30 minutes. [23:16] i often get stuck at the acquire step [23:16] lot of stuff here needs digitizing and/or publishing [23:16] I'll check back in 20 minutes, then [23:17] If it goes through, you'll have the $ tonight [23:18] And we'll start tracking down people in the SF/NY area to handle these [23:18] another bid, $4814. One more bid and I'll have to drop out unless archive team can raise some more. [23:19] I could completely empty my savings to chase this but... [23:19] $5826 [23:19] that's me [23:19] It's extraordinary but...I'm done after $1k :( [23:19] Ah, okay [23:19] next bid will end me :/ [23:21] somewhere else, another archivist is like damn big money [23:21] ahahaha [23:21] watch jason scott come in here LOOK WHAT I WON [23:21] >.> [23:21] Kinda funny - he's away all day, and it's been a big day for AT [23:22] Between this and the .ro site [23:22] that was actually kinda fun [23:22] While you were out: SAVED HISTORY, TWICE [23:22] i enjoyed watching my screen scroll [23:23] also how tabblo is going at hyperspeed [23:25] we need archive.org version at home [23:25] i hope we get 60tb disks in five years [23:26] Sui: If you have a lot of bandwidth/storage, #fireplanet could use you \o/ [23:26] shaqfu: I have all magazines of hakin9 [23:26] i'm starting a hosting company, so any extra bandwidth and storage are open for archival use [23:26] just under 500mb too [23:27] godane: Awesome [23:27] Aranje is in on it, he guided me here [23:27] Sui: #fireplanet, then :) [23:28] oh, fileplanet [23:28] i thought that got a reprieve [23:29] Not that I'm aware of [23:30] when we're done with tabblo i'll get cracking on that [23:30] i'm currently maxing out my hdd i/os [23:30] Take your time - it's not super-critical [23:32] We've been at it for almost a month now [23:32] xarph: :( [23:34] That's it, then :( [23:35] At least we had a shot at it. [23:35] And we know it exists. [23:35] Yep [23:35] Anyone else want in on this? Last chance [23:35] The future will write sordid love letters to you [23:36] I just hope it's going to SI/NASA/NARA/Purdue/APS/etc [23:36] the next bid is $7050, plus there's an 18% commission. I'd need at least $2500 more to have a fighting chance. But who knows... [23:36] Yeah no kidding [23:37] If it's for someone to put into a vault or listen to a few times, well, RIP tapes [23:37] Esp. if they listen to it w/o digitizing [23:37] got season 2 of welcome to the scene up: http://archive.org/details/welcometothescene_version2.0_xvid [23:38] need to do filenames like 00, 01, 02 I think [23:38] also i'm getting season 1 of that [23:43] season 1 episodes are like going at 40kbytes a sec [23:44] i'm more likely call the item welcometothescene_version1.0_xvid