Time |
Nickname |
Message |
00:10
🔗
|
SketchCow |
Here is why we can't have nice things. |
00:10
🔗
|
SketchCow |
I have 400 videotapes in my house, from GDC (game developers conference) that I've been digitizing. |
00:11
🔗
|
SketchCow |
Now, what to do when they're done. Don't want to throw them out, don't want to return them because they have no space. They'd throw them out. |
00:11
🔗
|
SketchCow |
So I suggested Stamford University, which has a games archive and which I have worked with extensively. |
00:11
🔗
|
SketchCow |
So that was going on |
00:11
🔗
|
SketchCow |
Now it is not. |
00:12
🔗
|
SketchCow |
Why? Because Stamford wants GDC to sign a contract saying "We are fine giving you these tapes." |
00:12
🔗
|
chronomex |
goddamnit |
00:12
🔗
|
SketchCow |
GDC legal says "We never got authorization from these people to give away these tapes" |
00:12
🔗
|
SketchCow |
So now they go "Can you supply it to another archive?" |
00:12
🔗
|
SketchCow |
And I'm going "well, I can call them, but every legit archive wants SOMETHING saying 'thanks for the tapes'" |
00:12
🔗
|
SketchCow |
Anyway, so that's where we are. |
00:13
🔗
|
chronomex |
goddamnit |
00:13
🔗
|
SketchCow |
Regardless, I'm digitizing all the fucking tapes and they're all going into archive.org |
00:13
🔗
|
SketchCow |
So fuck everybody |
00:13
🔗
|
chronomex |
yep |
00:13
🔗
|
chronomex |
fuck em all, let god sort them out |
00:13
🔗
|
chronomex |
erm |
00:13
🔗
|
chronomex |
yeah |
00:13
🔗
|
SketchCow |
Fuck them all, let God type in the metadata |
00:14
🔗
|
chronomex |
dictation, motherfuckers |
00:14
🔗
|
chronomex |
TOTALLY METADATA GRADE |
00:14
🔗
|
joepie91 |
lol |
00:15
🔗
|
joepie91 |
SketchCow: set up your own physical archive :D |
00:15
🔗
|
SketchCow |
Wayyyy ahead of you |
00:15
🔗
|
SketchCow |
But my archive wants to give them away |
00:15
🔗
|
SketchCow |
Ha ha, I could totally.... |
00:15
🔗
|
SketchCow |
hahaha |
00:15
🔗
|
SketchCow |
I could sign a contract |
00:15
🔗
|
SketchCow |
Then turn around and give them to stamford |
00:15
🔗
|
chronomex |
hahahaha |
00:15
🔗
|
SketchCow |
and sign the contract |
00:15
🔗
|
chronomex |
cross-archive donation |
00:16
🔗
|
chronomex |
I like this |
00:16
🔗
|
SketchCow |
No, it means I take on the burden |
00:16
🔗
|
SketchCow |
OH NO |
00:16
🔗
|
SketchCow |
These things in my house stay in my house |
00:16
🔗
|
SketchCow |
fuck everybody |
00:17
🔗
|
chronomex |
fuck em all, let god sort them out |
00:17
🔗
|
SketchCow |
God uses RDF, he's fucked |
00:18
🔗
|
chronomex |
at least it's not xml-encoded asn.1 |
00:28
🔗
|
SketchCow |
Just so you can see what these videos look like: |
00:28
🔗
|
SketchCow |
http://archive.org/details/2004-gdc-deferred-shading-on-dx9-hardware-xbox |
00:28
🔗
|
SketchCow |
I'm uploading these very quickly. |
00:29
🔗
|
BlueMax |
time to shove off an email for JSTP I guess |
00:35
🔗
|
SketchCow |
Tabblo has gone 100% into Wayback |
00:35
🔗
|
SketchCow |
Take that, bitches |
00:35
🔗
|
BlueMax |
What about Webshots? :D |
00:35
🔗
|
SketchCow |
Webshots is partially in |
00:35
🔗
|
SketchCow |
But some previous ones have to be handled. |
00:36
🔗
|
SketchCow |
And I'm focusing on other stuff right now, stuff no longer up. |
00:36
🔗
|
BlueMax |
sorry, that was meant to be a joke. |
00:36
🔗
|
no2pencil |
is there a url for this file format project you posted about earlier? |
00:37
🔗
|
SketchCow |
http://www.archiveteam.org/index.php?title=Just_Solve_the_Problem_2012 |
00:47
🔗
|
BlueMax |
SketchCow, question for you: I assume the results of Just Solve The Problem will be laid out in a separate wiki (correct me if I'm wrong) - do we have a particular layout for each page yet? |
00:57
🔗
|
SketchCow |
No |
00:57
🔗
|
SketchCow |
That will happen very shortly |
00:57
🔗
|
SketchCow |
wiki is about to be set up this weekend. |
01:01
🔗
|
BlueMax |
good to know SketchCow |
01:09
🔗
|
DFJustin |
14.6 gb avi fuck yeah |
02:04
🔗
|
SketchCow |
OK SELF-DIRECTED PROJECT |
02:04
🔗
|
SketchCow |
http://www.pummelvision.com/ |
02:04
🔗
|
SketchCow |
If you can figure out how to save it, let's save it. |
02:06
🔗
|
creativec |
What's this pummelvision supposed to be? |
02:06
🔗
|
creativec |
This video is just a bunch of what appears to be Facebook pictures... |
02:07
🔗
|
SketchCow |
Yeah |
02:07
🔗
|
SketchCow |
It's not impressive. |
02:07
🔗
|
SketchCow |
Someone wrote me and said "could you save it!!!!" |
02:08
🔗
|
SketchCow |
And it's like............. |
02:08
🔗
|
SketchCow |
.................no |
02:08
🔗
|
creativec |
heh |
02:09
🔗
|
joepie91 |
http://techcrunch.com/2010/12/23/pummelvision/ |
02:10
🔗
|
creativec |
I would assume that it is unsavable if we don't have access to the source code...? |
02:10
🔗
|
joepie91 |
I'm not sure what there is to save in the first place |
02:11
🔗
|
joepie91 |
it used external sources |
02:13
🔗
|
creativec |
eh, it looks easily reproducible. I don't see a reason to save it. |
02:19
🔗
|
godane |
SketchCow: i grabbed www.apdl.co.uk today |
02:20
🔗
|
godane |
there is tons of demo ware and pd ware for risc os in these warc |
02:27
🔗
|
joepie91 |
has oldversion.com ever been archived |
02:28
🔗
|
godane |
not really |
02:29
🔗
|
godane |
the wayback machine's last snapshot is from 2009 |
02:30
🔗
|
joepie91 |
okay, so |
02:30
🔗
|
joepie91 |
I'd like to archive it |
02:30
🔗
|
joepie91 |
but the fuckers |
02:30
🔗
|
joepie91 |
use javascript for the downloads |
02:30
🔗
|
joepie91 |
so I need to figure out how to script wget-lua :P |
03:09
🔗
|
joepie91 |
seriously? SERIOUSLY? |
03:09
🔗
|
joepie91 |
these oldversion guys |
03:09
🔗
|
joepie91 |
for fucks sake |
03:09
🔗
|
joepie91 |
they really REALLY try to discourage crawling/archiving |
03:10
🔗
|
joepie91 |
alard, SketchCow, whenever either of you gets here, is there a way to create warcs in python? |
03:11
🔗
|
balrog_ |
joepie91: how are they doing so? |
03:11
🔗
|
balrog_ |
oh, js... |
03:12
🔗
|
balrog_ |
joepie91: there's a trick |
03:12
🔗
|
balrog_ |
http://www.oldversion.com/main_download.php?sid=N |
03:12
🔗
|
balrog_ |
and you get the file |
03:12
🔗
|
balrog_ |
N seems to be sequential :D |
03:15
🔗
|
joepie91 |
yeah, no |
03:15
🔗
|
joepie91 |
302s to the main page |
03:16
🔗
|
joepie91 |
unless you've gone through the whole sequence of download pages |
03:16
🔗
|
joepie91 |
:| |
03:16
🔗
|
joepie91 |
@ balrog_ |
03:16
🔗
|
joepie91 |
and I have nfi how to script that in lua |
03:17
🔗
|
balrog_ |
can't you use regular expressions or bash or python? |
03:17
🔗
|
joepie91 |
problem is |
03:17
🔗
|
joepie91 |
can't use python in wget |
03:17
🔗
|
joepie91 |
don't know how to make warcs in python |
03:17
🔗
|
joepie91 |
:P |
03:17
🔗
|
joepie91 |
can you see my issue? |
03:17
🔗
|
joepie91 |
and regular expressions don't do much if you have to make certain page requests to be able to download the file in the first place |
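A minimal sketch of the problem joepie91 describes: the file URL only works after the preceding download pages have been requested in order. It assumes the site tracks that sequence with cookies, which the log does not confirm (it could equally be a Referer check or a token in the page). `make_session`, `download_via_steps` and the step URLs are hypothetical names for illustration; only the `main_download.php?sid=N` pattern comes from the conversation.

```python
import http.cookiejar
import urllib.request


def make_session():
    """Return an opener that keeps cookies between requests (assumed mechanism)."""
    jar = http.cookiejar.CookieJar()
    return urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))


def download_via_steps(open_url, step_urls, file_url):
    """Request each intermediate page in order, then fetch the file itself.

    open_url is any callable that takes a URL and returns a readable,
    closable response, for example make_session().open.
    """
    for url in step_urls:
        with open_url(url) as resp:
            resp.read()  # body is discarded; the request itself is what matters
    with open_url(file_url) as resp:
        return resp.read()
```

Usage would look roughly like `download_via_steps(make_session().open, [page1, page2], "http://www.oldversion.com/main_download.php?sid=N")`, where `page1` and `page2` stand for whatever pages the site actually requires. Passing the opener in as a callable keeps the walk-the-pages logic separate from the HTTP details, so it can be swapped if the site turns out to check something other than cookies.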
03:17
🔗
|
balrog_ |
ah, a dl timer |
03:17
🔗
|
balrog_ |
bleh |
03:18
🔗
|
joepie91 |
well no, not a timer per se |
03:18
🔗
|
balrog_ |
yeah you may not be able to use warc here |
03:18
🔗
|
joepie91 |
what's the format of a warc like? |
03:18
🔗
|
joepie91 |
in simple terms |
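In simple terms, a WARC file is a series of records, each made of a version line, a block of `Name: value` headers, a blank line, the content, and two trailing CRLFs. The sketch below builds one `resource` record by hand using only the standard library. `build_warc_record` is a hypothetical helper for illustration, not part of wget or warc-tools, and a real crawl would normally also store request and response records.

```python
import uuid
from datetime import datetime, timezone


def build_warc_record(target_uri, payload, content_type="application/octet-stream"):
    """Serialize one WARC/1.0 'resource' record as bytes."""
    headers = [
        ("WARC-Type", "resource"),
        ("WARC-Record-ID", f"<urn:uuid:{uuid.uuid4()}>"),
        ("WARC-Date", datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("WARC-Target-URI", target_uri),
        ("Content-Type", content_type),
        ("Content-Length", str(len(payload))),  # length of the content block only
    ]
    head = "WARC/1.0\r\n" + "".join(f"{name}: {value}\r\n" for name, value in headers)
    # header block, blank line, content block, then two CRLFs end the record
    return head.encode("utf-8") + b"\r\n" + payload + b"\r\n\r\n"
```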
03:18
🔗
|
balrog_ |
you may have to hack up something involving jdownloader/slimrat/plowshare :| |
03:18
🔗
|
joepie91 |
oh, I can write my own downloader, the warc thing is the only problem :P |
03:18
🔗
|
joepie91 |
what I'm thinking of... |
03:18
🔗
|
joepie91 |
is just writing a download script specifically for the downloads |
03:18
🔗
|
joepie91 |
then wget-warcing the main site |
03:18
🔗
|
joepie91 |
and afterwards modifying the warc to point to the files directly |
03:19
🔗
|
joepie91 |
and adding the files |
03:19
🔗
|
joepie91 |
but I don't know how modifiable a warc file is |
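On the "adding the files" half of that plan: because a WARC is just records written one after another, and a `.warc.gz` is conventionally one gzip member per record, extra records can be appended without touching what is already there. Rewriting existing records in place is a different and harder job. A minimal sketch, assuming the record bytes are already serialized; `append_record` and `read_all` are hypothetical helper names.

```python
import gzip


def append_record(warc_gz_path, record):
    """Append one serialized WARC record as its own gzip member."""
    with open(warc_gz_path, "ab") as f:
        f.write(gzip.compress(record))


def read_all(warc_gz_path):
    """Read the whole file back; gzip decodes concatenated members in order."""
    with gzip.open(warc_gz_path, "rb") as f:
        return f.read()
```

This only covers appending. Whether the Wayback Machine or another replay tool would then serve the added files at the URLs the site's pages link to depends on the target URIs recorded in those records, which this sketch does not address.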
03:24
🔗
|
joepie91 |
anyway, time to sleep |
03:24
🔗
|
joepie91 |
balrog_: thanks for the slimrat/plowshare stuff btw |
03:24
🔗
|
joepie91 |
wasn't aware of its existence |
03:24
🔗
|
joepie91 |
goodnight :P |
03:58
🔗
|
joepie91 |
ugh I hate this - have to sleep, but not tired :( |
04:42
🔗
|
bsmith094 |
joepie91: been there |
07:31
🔗
|
Nemo_bis |
oh wonderful, it's getting a habit http://www.us.archive.org/log_show.php?task_id=128637767 |
07:38
🔗
|
alard |
joepie91: To write warcs in Python, you have http://code.hanzoarchives.com/warc-tools (I've only used that for reading warcs, though). |
07:40
🔗
|
alard |
joepie91: There is no Wget-Lua documentation yet. You could look at examples, https://github.com/alard/wget-lua/tree/lua/lua-example and the recent *-grab projects, and in the Wget side of the Lua hooks: https://github.com/alard/wget-lua/blob/lua/src/luahooks.c . |
07:42
🔗
|
alard |
(And just ask if you have a question; most of the documentation is still in my head. You may be the first who writes a Wget-Lua script.) |
09:34
🔗
|
SketchCow |
http://www.dailydot.com/news/livejournal-shut-down-us-office/ |
09:35
🔗
|
C-Keen |
signs of decay? |
14:15
🔗
|
joepie91 |
alard: will have a look, thanks |
14:16
🔗
|
joepie91 |
hey, um, SketchCow, brainfart: have several people across the world accept old magazines/manuals/CDs/whatever and collectively digitize it |
14:17
🔗
|
joepie91 |
several people across the world == lower shipping costs |
14:18
🔗
|
BlueMax |
main problem would be volunteers for this joepie91 |
14:18
🔗
|
joepie91 |
ofc, but I can imagine that there are at least a few people that have a few spare hours of time where they're bored out of their skull |
14:19
🔗
|
joepie91 |
so they may as well scan and categorize stuff :P |
14:19
🔗
|
joepie91 |
(includes me) |
14:21
🔗
|
BlueMax |
fair enough, you may want to get a few more people to make it worthwhile |
19:53
🔗
|
SketchCow |
http://archive.org/details/archiveteam-umich-save |
19:53
🔗
|
SketchCow |
As you can see, now a pile of "WARC" versions, all of which will get into the wayback. |
19:56
🔗
|
godane |
i'm home today |
19:57
🔗
|
godane |
i uploaded two linux format isos earlier today |
19:57
🔗
|
godane |
now uploading a 3rd |
20:14
🔗
|
SketchCow |
http://archive.org/details/atariforcecomics-205 |
22:29
🔗
|
SketchCow |
I just proposed the "hand it to jason, jason will hand it to Stamford" approach |
22:29
🔗
|
SketchCow |
Artifact laundering. 21st century. |
22:30
🔗
|
BlueMax |
All you need is some form of cash involved |
22:36
🔗
|
DFJustin |
is Stamford like Harfurd http://dilbert.com/strips/comic/1994-03-15/ |
22:42
🔗
|
chronomex |
laundering++ |
22:53
🔗
|
joepie91 |
SketchCow: do you have a second? |
22:54
🔗
|
joepie91 |
preferably several :P |
22:57
🔗
|
BlueMax |
I was actually wondering when the JSTP Wiki was gonna get underway |
23:17
🔗
|
SketchCow |
I have occasional seconds. |
23:19
🔗
|
chronomex |
brb 3rds of cookie |
23:20
🔗
|
chronomex |
DFJustin: please allow me to introduce you to /fast/: http://dilbert.com/fast/1994-03-15/ |
23:28
🔗
|
joepie91 |
SketchCow: whoop, missed your response - anyway, did you see my brainfart last night? regarding the accepting old materials by snail mail and digitizing them |
23:28
🔗
|
joepie91 |
seems you already have some experience with that judging from the presentation about the lawsuit |