Time |
Nickname |
Message |
00:01
🔗
|
Nemo_bis |
They're working on mirroring text dumps. They're already something like 10 TB https://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps |
00:01
🔗
|
Nemo_bis |
A mirror has been added recently, an rsync to a second server is happening right now, a new cluster has been ordered yesterday. |
00:01
🔗
|
Nemo_bis |
So, things are moving a bit. |
00:15
🔗
|
soultcer |
qls |
00:15
🔗
|
soultcer |
Whoops, wrong tab |
02:51
🔗
|
bsmith094 |
any ideas for how to scrape fanfiction.net |
02:53
🔗
|
PatC |
bsmith094, didn't you get that already? |
02:53
🔗
|
bsmith094 |
got the stoey id numbers, not the stories |
05:56
🔗
|
yipdw |
heh, wow, www.naenara.com.kp uses client-side imagemaps |
05:56
🔗
|
yipdw |
I haven't seen those in a long time |
13:04
🔗
|
SketchCow |
The article is up |
13:04
🔗
|
SketchCow |
(Tech Review) |
13:04
🔗
|
SketchCow |
You can read it and decide if my concerns were accurate. |
13:10
🔗
|
SketchCow |
http://www.technologyreview.com/article/39317/ |
13:12
🔗
|
PatC |
nice comic :) |
13:34
🔗
|
ersi |
Hoh! Awesome |
14:14
🔗
|
ersi |
haha, awesome quotes |
14:23
🔗
|
Frigolit |
:] |
14:25
🔗
|
ersi |
I did somehow expect Schwartz to totally go nuts on AT though |
14:26
🔗
|
ersi |
dunno why, might have something to do with his personal site |
15:34
🔗
|
Schbirid |
the images rock |
15:49
🔗
|
chronomex |
shmmmm. |
15:59
🔗
|
chronomex |
good morning fellas |
17:17
🔗
|
yipdw |
huh |
17:17
🔗
|
yipdw |
I wonder how they figured out some of the handles in this channel |
17:17
🔗
|
yipdw |
most likely by looking at archiveteam.org and inferring |
17:17
🔗
|
yipdw |
OR PERHAPS SOMEONE IN HERE IS A MOLE |
17:18
🔗
|
yipdw |
also, I've got a WARC of naenara.com.kp; what's the easiest way to get that to IA? register and upload? |
17:19
🔗
|
chronomex |
yep |
17:19
🔗
|
chronomex |
is it in .warc? |
17:20
🔗
|
yipdw |
yes |
17:20
🔗
|
yipdw |
it's only 1.6 GB, gzipped |
17:20
🔗
|
yipdw |
I've got a significant part of the North Korean internet on a USB pen drive |
17:20
🔗
|
yipdw |
that's an awesome thought |
17:21
🔗
|
chronomex |
rad. |
17:23
🔗
|
chronomex |
I'd say upload it, then let info@archive.org know. |
17:24
🔗
|
yipdw |
using http://www.archive.org/create/, I guess? |
17:24
🔗
|
yipdw |
or is there a specialized upload point for WARCs? |
17:24
🔗
|
yipdw |
that link was just the first I found |
17:27
🔗
|
chronomex |
yes, that |
17:30
🔗
|
yipdw |
"You appear to be using the Firefox browser. |
17:30
🔗
|
yipdw |
The browser will only upload files of 2GB or less." |
17:30
🔗
|
yipdw |
that's right |
17:30
🔗
|
yipdw |
good thing that fits within those limits |
17:32
🔗
|
yipdw |
heh |
17:33
🔗
|
yipdw |
I understand how a browser can parse that, but I'm simultaneously amazed that it works |
17:34
🔗
|
winr4r |
yipdw: that is beautiful |
17:35
🔗
|
yipdw |
browsers must have some of the best backwards compatibility ever |
17:57
🔗
|
yipdw |
alright, uploaded and notified |
18:14
🔗
|
Coderjoe |
mm quirks mode |
18:15
🔗
|
Coderjoe |
you can also create items through the s3 interface. each item is a bucket and the first file uploaded to the bucket creates it. |
20:05
🔗
|
soultcer |
I like the TR article |
20:18
🔗
|
bsmith094 |
which article? |
20:23
🔗
|
Nemo_bis |
bsmith094, http://www.technologyreview.com/article/39317/ |
21:18
🔗
|
SketchCow |
Tjamls sp ,icj. yipdw |
21:18
🔗
|
SketchCow |
Tjat |
21:18
🔗
|
SketchCow |
Thanks so much yipdw |
21:18
🔗
|
SketchCow |
That's the golden stuff. |
21:20
🔗
|
yipdw |
np |
21:20
🔗
|
yipdw |
I'll see what else I can get before they officially replace Dear Leader |
21:32
🔗
|
SketchCow |
Excellent. |
21:32
🔗
|
SketchCow |
Actually, I think the grandson is already dear leader. |
21:35
🔗
|
yipdw |
oh, good point |
21:35
🔗
|
yipdw |
ha, naenara updaetd |
21:35
🔗
|
yipdw |
updated |
21:35
🔗
|
yipdw |
might as well run the mirror again |
21:38
🔗
|
SketchCow |
Obviously get the sets. |
21:38
🔗
|
SketchCow |
Want a place to FTP? |
21:40
🔗
|
yipdw |
I'm uploading to archive.org right now via their HTTP interface |
21:40
🔗
|
yipdw |
FTP would probably be nicer, though |
21:40
🔗
|
yipdw |
though I guess I can also use the S3-alike |
21:43
🔗
|
SketchCow |
Get a free account |
21:43
🔗
|
SketchCow |
and you can upload via FTP |
21:43
🔗
|
SketchCow |
I can help with that. |
21:46
🔗
|
yipdw |
SketchCow: got the account |
21:46
🔗
|
yipdw |
er, I mean, I have an account |
21:46
🔗
|
yipdw |
brb |
22:52
🔗
|
chronomex |
hm. I still hate magtape. |
22:58
🔗
|
chronomex |
those fuckers in the 70s |
22:59
🔗
|
chronomex |
the oxide is glued to the tape with a urethane compound, which gets gummy over the course of about 10 years |
22:59
🔗
|
Coderjoe |
whee |
22:59
🔗
|
chronomex |
you can fix it by baking the thing at 135-140F |
22:59
🔗
|
Coderjoe |
and warp the base |
23:00
🔗
|
chronomex |
less than ~135 won't do anything, more than 140 will cause printthrough |
23:00
🔗
|
SketchCow |
I am helping negotiate the possible transfer of something like 135,000 tapes |
23:00
🔗
|
chronomex |
the base of my carts is a 2mm aluminum slab |
23:00
🔗
|
SketchCow |
Isn't that exciting. |
23:00
🔗
|
chronomex |
ooh, what of? |
23:00
🔗
|
chronomex |
and what type of tapes? |
23:00
🔗
|
RedType |
http://www.sfgate.com/cgi-bin/article.cgi?f=/n/a/2011/12/20/national/a065958S85.DTL |
23:00
🔗
|
SketchCow |
It might be 35,000, someone might have typod. |
23:01
🔗
|
SketchCow |
Reel to reel of some sort |
23:01
🔗
|
RedType |
They talked about Base64, a program that compresses digital documents for speedy transmission by removing all the spaces and punctuation marks. |
23:01
🔗
|
RedType |
:| |
23:01
🔗
|
SketchCow |
Everything the Christian Science Monitor recorded for radio, ever |
23:01
🔗
|
chronomex |
ah. i wish these were reel tapes, that would solve some problems :| |
23:01
🔗
|
chronomex |
wow |
23:01
🔗
|
Coderjoe |
mmm... so one side of the tape gets extra crispy while the other stays original recipie |
23:01
🔗
|
chronomex |
Coderjoe: ? |
23:01
🔗
|
chronomex |
oh |
23:01
🔗
|
SketchCow |
They're being digitized, we're just discussing having the original tapes. |
23:02
🔗
|
chronomex |
cool |
23:02
🔗
|
chronomex |
I hate tape *cartridges*. |
23:02
🔗
|
Coderjoe |
chronomex: the 2mm aluminum slab. it will cook one side of the tape more than the other |
23:03
🔗
|
chronomex |
almost to the point where i'm going to pay someone to do this for me |
23:03
🔗
|
chronomex |
Coderjoe: Ah. I see. No. |
23:03
🔗
|
chronomex |
tape is at right angle to the base |
23:04
🔗
|
Coderjoe |
duh |
23:04
🔗
|
chronomex |
well maybe trks 0 and 1 vs 2 and 3 |
23:04
🔗
|
Coderjoe |
side being EDGE not flat |
23:05
🔗
|
chronomex |
who ever talks about datatape that way :P |
23:06
🔗
|
SketchCow |
Anyone feel like typing in Compute! tables of contents? |
23:08
🔗
|
bsmith095 |
archiveteam retruns this site may be compromised on google |
23:09
🔗
|
SketchCow |
Gotta fixz that. |
23:13
🔗
|
bsmith095 |
anything reasonably sized that needs downloading?, not mobileme, too big, i couldnt put a start of a dent in that |
23:15
🔗
|
yipdw |
goddamnit, I sent ^C to the wrong terminal |
23:15
🔗
|
yipdw |
I hate when I kill a process that's been running for an hour or so |
23:15
🔗
|
yipdw |
at least it's idempotent |
23:21
🔗
|
chronomex |
worse still when it's 80% done with a two-week job. |
23:22
🔗
|
yipdw |
I haven't done that before |
23:22
🔗
|
chronomex |
it sucks hard |
23:23
🔗
|
Coderjoe |
even worse is when things get OOM-killed |
23:23
🔗
|
SketchCow |
I'm still, STILL adding Jamendo. |
23:33
🔗
|
chronomex |
yeah. oom is the pits. |
23:45
🔗
|
bsmith095 |
did thingiverse ever finish up, or is thta still open? |
23:48
🔗
|
SketchCow |
We did a full round |
23:48
🔗
|
SketchCow |
We'll do another round at some point. |