[00:10] Here is why we can't have nice things. [00:10] I have 400 videotapes in my house, from GDC (game developers conference) that I've been digitizing. [00:11] Now, what to do when they're done. Don't want to throw them out, don't want to return them because they have no space. They'd throw them out. [00:11] So I suggested Stamford University, which has a games archive and which I have worked with extensively. [00:11] So that was going on [00:11] Now it is not. [00:12] Why? Because Stamford wants GDC to sign a contract saying "We are fine giving you these tapes." [00:12] goddamnit [00:12] GDC legal says "We never got authorization from these people to give away these tapes" [00:12] So now they go "Can you supply it to another archive?" [00:12] And I'm going "well, I can call them, but every legit archive wants SOMETHING saying 'thanks for the tapes'" [00:12] Anyway, so that's where we are. [00:13] goddamnit [00:13] Regardless, I'm digitizing all the fucking tapes and they're all going into archive.org [00:13] So fuck everybody [00:13] yep [00:13] fuck em all, let god sort them out [00:13] erm [00:13] yeah [00:13] Fuck them all, let God type in the metadata [00:14] dictation, motherfuckers [00:14] TOTALLY METADATA GRADE [00:14] lol [00:15] SketchCow: set up your own physical archive :D [00:15] Wayyyy ahead of you [00:15] But my archive wants to give them away [00:15] Ha ha, I could totally.... [00:15] hahaha [00:15] I could sign a contract [00:15] Then turn around and give them to stamford [00:15] hahahaha [00:15] and sign the contract [00:15] cross-archive donation [00:16] I like this [00:16] No, it means I take on the burden [00:16] OH NO [00:16] These things in my house stay in my house [00:16] fuck everybody [00:17] fuck em all, let god sort them out [00:17] God uses RDF, he's fucked [00:18] at least it's not xml-encoded asn.1 [00:28] Just so you can see what these videos look like: [00:28] http://archive.org/details/2004-gdc-deferred-shading-on-dx9-hardware-xbox [00:28] I'm uploading these very quickly. [00:29] time to shove off an email for JSTP I guess [00:35] Tabblo has gone 100% into Wayback [00:35] Take that, bitches [00:35] What about Webshots? :D [00:35] Webshots is partially in [00:35] But some previous ones have to be handled. [00:36] Snd I'm focusing on other stuff right now, stuff no longer up. [00:36] sorry, that was meant to be a joke. [00:36] is there a url for this file format project you posted about earlier? [00:37] http://www.archiveteam.org/index.php?title=Just_Solve_the_Problem_2012 [00:47] SketchCow, question for you: I assume the results of Just Solve The Problem will be laid out in a seperate wiki (correct me if I'm wrong) - do we have a particular layout for each page yet? [00:57] No [00:57] That will happen very shortly [00:57] wiki is about to be set up this weekend. [01:01] good to know SketchCow [01:09] 14.6 gb avi fuck yeah [02:04] OK SELF-DIRECTED PROJECT [02:04] http://www.pummelvision.com/ [02:04] If you can figure out how to save it, let's save it. [02:06] What's this pummelvision supposed to be? [02:06] This video is just a bunch of what appears to be Facebook pictures... [02:07] Yeah [02:07] It's not impressive. [02:07] Someone wrote me and said "could you save it!!!!" [02:08] And it's like............. [02:08] .................no [02:08] heh [02:09] http://techcrunch.com/2010/12/23/pummelvision/ [02:10] I would assume that it is unsavable if we don't have access to the source code...? [02:10] I'm not sure what there is to save in the first place [02:11] it used external sources [02:13] eh, it looks reproducable easily. I don't see if there's a reason to save it. [02:19] SketchCow: i grabbed www.apdl.co.uk today [02:20] there is tons of demo ware and pd ware for risc os in these warc [02:27] has oldversion.com ever been archived [02:28] not really [02:29] the way back machine has last snapshot from 2009 [02:30] okay, so [02:30] I'd like to archive it [02:30] but the fuckers [02:30] use javascript for the downloads [02:30] so I need to figure out how to script wget-lua :P [03:09] seriously? SERIOUSLY? [03:09] these oldversion guys [03:09] for fucks sake [03:09] they really REALLY try to discourage crawling/archiving [03:10] alard, SketchCow, whenever either of you gets here, is there a way to create warcs in python? [03:11] joepie91: how are they doing so? [03:11] oh, js... [03:12] joepie91: there's a trick [03:12] http://www.oldversion.com/main_download.php?sid=N [03:12] and you get the file [03:12] N seems to be sequential :D [03:15] yeah, no [03:15] 302s to the main page [03:16] unless you've gone through the whole sequence of download pages [03:16] :| [03:16] @ balrog_ [03:16] and I have nfi how to script that in lua [03:17] can't you use regular expressions or bash or python? [03:17] problem is [03:17] can't use python in wget [03:17] don't know how to make warcs in python [03:17] :P [03:17] can you see my issue? [03:17] and regular expressions don't do much if you have to make certain page requests to be able to download the file in the first place [03:17] ah, a dl timer [03:17] bleh [03:18] well no, not a timer per se [03:18] yeah you may not be able to use warc here [03:18] what's the format of a warc like? [03:18] in simple terms [03:18] you may have to hack up something involving jdownloader/slimrat/plowshare :| [03:18] oh, I can write my own downloader, the warc thing is the only problem :P [03:18] what I'm thinking of... [03:18] is just writing a download script specifically for the downloads [03:18] then wget-warcing the main site [03:18] and afterwards modifying the warc to point to the files directly [03:19] and adding the files [03:19] but I don't know how modifiable a warc file is [03:24] anyway, time to sleep [03:24] balrog_: thanks for the slimrat/plowshare stuff btw [03:24] wasn't aware of its existence [03:24] goodnight :P [03:58] ugh I hate this - have to sleep., but not tired :( [04:42] joepie91: been there [07:31] oh wonderful, it's getting a habit http://www.us.archive.org/log_show.php?task_id=128637767 [07:38] joepie91: To write warcs in Python, you have http://code.hanzoarchives.com/warc-tools (I've only used that for reading warcs, though). [07:40] joepie91: There is no Wget-Lua documentation yet. You could look at examples, https://github.com/alard/wget-lua/tree/lua/lua-example and the recent *-grab projects, and in the Wget side of the Lua hooks: https://github.com/alard/wget-lua/blob/lua/src/luahooks.c . [07:42] (And just ask if you have a question; most of the documentation is still in my head. You may be the first who writes a Wget-Lua script.) [09:34] http://www.dailydot.com/news/livejournal-shut-down-us-office/ [09:35] signs of decay? [14:15] alard: will have a look, thanks [14:16] hey, um, SketchCow, brainfart: have several people across the world accept old magazines/manuals/CDs/whatever and collectively digitize it [14:17] several people across the world == lower shipping costs [14:18] main problem would be volunteers for this joepie91 [14:18] ofc, but I can imagine that there are at least a few people that have a few spare hours of time where they're bored out of their skull [14:19] so they may as well scan and categorize stuff :P [14:19] (includes me) [14:21] fair enough, you may want to get a few more people to make it worthwhile [19:53] http://archive.org/details/archiveteam-umich-save [19:53] As you can see, now a pile of "WARC" versions, all of which will get into the wayback. [19:56] i'm home today [19:57] i uploaded to linux format isos early today [19:57] now uploading a 3rd [20:14] http://archive.org/details/atariforcecomics-205 [22:29] I just proposed the "hand it to jason, jason will hand it to Stamford" approach [22:29] Artifact laundering. 21st century. [22:30] All you need is some form of cash involved [22:36] is Stamford like Harfurd http://dilbert.com/strips/comic/1994-03-15/ [22:42] laundering++ [22:53] SketchCow: do you have a second? [22:54] preferably several :P [22:57] I was actually wondering when the JSTP Wiki was gonna get underway [23:17] I have occasional seconds. [23:19] brb 3rds of cookie [23:20] DFJustin: please allow me to introduce you to /fast/: http://dilbert.com/fast/1994-03-15/ [23:28] SketchCow: whoop, missed your response - anyway, did you see my brainfart last night? regarding the accepting old materials by snail mail and digitizing them [23:28] seems you already have some experience with that judging from the presentation about the lawsuit