Time |
Nickname |
Message |
01:52
🔗
|
Coderjoe |
uploading video #200 |
01:52
🔗
|
Coderjoe |
yay |
01:58
🔗
|
winr4r |
excellent |
02:07
🔗
|
dashcloud |
holy crap- this must be the finest job posting ever: http://blogs.valvesoftware.com/abrash/ |
02:08
🔗
|
underscor |
just got nfs access to the imagedump server for wikimedia foundation |
02:08
🔗
|
underscor |
those will be going up soon on a.o |
02:08
🔗
|
dashcloud |
congrats! |
02:09
🔗
|
underscor |
thanks :) |
02:09
🔗
|
Wyatt|Wor |
Holy carp, is THAT where Abrash wound up? |
02:10
🔗
|
dashcloud |
after 14 years he came back there it looks like |
02:12
🔗
|
underscor |
204.9.55.82:/z/public/pub/wikimedia/dumps 157T 31T 126T 20% /mnt/dumps |
02:12
🔗
|
underscor |
204.9.55.82:/z/public/pub/wikimedia/images 143T 16T 126T 12% /mnt/images |
02:12
🔗
|
underscor |
wheeee |
02:18
🔗
|
winr4r |
underscor: excellent! |
02:19
🔗
|
winr4r |
i've got hundreds of photos there |
02:19
🔗
|
underscor |
:) |
02:19
🔗
|
winr4r |
i was hoping they'd outlive wikimedia, and it seems it will |
02:19
🔗
|
underscor |
next step it to write ingestion logic to get it all into archive.org |
02:19
🔗
|
underscor |
s/it/is/ |
02:19
🔗
|
winr4r |
(hundrds on the commons, that is) |
02:19
🔗
|
winr4r |
underscor: good luck, and well done on scoring those |
02:20
🔗
|
underscor |
thanks |
02:20
🔗
|
Wyatt|Wor |
Wow, that's...a big array. |
02:21
🔗
|
Wyatt|Wor |
Two of them. |
02:21
🔗
|
underscor |
yeah |
02:21
🔗
|
underscor |
the box has something like 480TB on it |
02:22
🔗
|
Wyatt|Wor |
Where box == rack, I'd imagine. |
02:23
🔗
|
underscor |
it's all on one "machine" |
02:23
🔗
|
underscor |
connected over fibrechannel disk enclosures |
02:24
🔗
|
Wyatt|Wor |
So something like a Ceph cluster? Okay, makes sense. |
02:59
🔗
|
shaqfu |
Well, my archives con today reinforced how awesome AT is |
02:59
🔗
|
shaqfu |
So, go you guys o/ |
03:01
🔗
|
mistym |
shaqfu: marac? |
03:01
🔗
|
shaqfu |
mistym: ...how'd you know? |
03:02
🔗
|
mistym |
shaqfu: Seen a bunch of people I follow twittering it up today. Wasn't there myself. |
03:02
🔗
|
shaqfu |
mistym: Yep, was there today/tomorrow |
03:02
🔗
|
shaqfu |
mistym: Didn't know we had another traditional archivist in the room |
03:04
🔗
|
mistym |
shaqfu: Yep! For a given value of "traditional" anyway, but yeah, did my masters in a traditional archives program 'n all. |
03:04
🔗
|
shaqfu |
mistym: Spiffy; just got mine |
03:05
🔗
|
mistym |
Congrats! |
03:05
🔗
|
shaqfu |
Thanks :) |
03:05
🔗
|
shaqfu |
Are you with a place now? |
03:05
🔗
|
mistym |
Yeah, I work at a museum in Manitoba. |
03:06
🔗
|
shaqfu |
Gotcha; bit distant for MARAC, then |
03:07
🔗
|
mistym |
Yeah, not exactly in the area. |
03:07
🔗
|
shaqfu |
Is there a regional one for central Canada? |
03:08
🔗
|
mistym |
Not in my province, at least. Alberta's archivists are pretty active though. |
03:08
🔗
|
shaqfu |
Oh, wow; that's a hike |
03:11
🔗
|
shaqfu |
The digital object seminar was cool; the job one, harrowing; the DH one, dull |
03:12
🔗
|
mistym |
I saw anarchivist tweeting about the job one. It sounded brutal. |
03:12
🔗
|
shaqfu |
Yep |
03:13
🔗
|
shaqfu |
Lots of "the system is totally fucking broken and we can't fix it" |
03:14
🔗
|
shaqfu |
And more of the usual student/professional divide, but nobody discussed it :( |
03:14
🔗
|
mistym |
:( |
03:15
🔗
|
shaqfu |
For those of us stuck between them, we're SOL |
03:18
🔗
|
mistym |
Hm, is there a url for the wayback machine to load the latest version of a page, rather than a list or specific revision? |
03:24
🔗
|
shaqfu |
But yeah, after listening to a bunch of "real" archivist talk digital records for a day, I appreciate AT that much more |
03:26
🔗
|
mistym |
shaqfu: I know, right? It's unfortunate that it's hard to have a constructive discussion in that environment. |
03:27
🔗
|
shaqfu |
mistym: Yeah, lots of "we need to be doing something!" and nothing getting done |
03:27
🔗
|
mistym |
Yeah... |
03:27
🔗
|
Wyatt|Wor |
Aah, I was just about to ask about that... |
03:28
🔗
|
mistym |
shaqfu: Giving a talk at this year's ACA that I'm hoping to balance with a little "we can do things! let's get things done!" |
03:28
🔗
|
shaqfu |
mistym: C being Canadian or Certified? |
03:28
🔗
|
mistym |
Canadian. |
03:28
🔗
|
mistym |
("Canuck") |
03:29
🔗
|
mistym |
Wyatt|Wor: re: wayback machine? |
03:29
🔗
|
shaqfu |
If a bunch of malcontents online can move mountains, imagine how much big institutions could do... |
03:29
🔗
|
Wyatt|Wor |
mistym: Re: "listening to a bunch of "real" archivist talk digital records for a day" |
03:29
🔗
|
mistym |
Wyatt|Wor: Ahh. |
03:29
🔗
|
Wyatt|Wor |
Whoops, those quotes escaped. |
03:30
🔗
|
mistym |
Need to be more careful escaping. |
03:31
🔗
|
mistym |
shaqfu: I dunno, the longer I'm in big institutions, the more I worry institutional glacial workflows can't be made to work at the speed that's useful. |
03:31
🔗
|
winr4r |
that archive team was necessary shows that there's a real problem with "real" archivists |
03:31
🔗
|
mistym |
I'm being overly pessimistic there, but there is major reorientation that institutions, in the big-institution sense, are going to have to do. |
03:31
🔗
|
winr4r |
as much as many of them do great work, a lot of them have a lot of great ideas about digital preservation and very little wget |
03:32
🔗
|
Wyatt|Wor |
winr4r: I think there are more problems than simply archivists. |
03:32
🔗
|
winr4r |
Wyatt|Wor: oh of course |
03:32
🔗
|
winr4r |
i don't doubt that |
03:32
🔗
|
shaqfu |
mistym: Yeah, it's hard to respond to "you have 30 days before we delete everything" at the speed of bureaucracy |
03:32
🔗
|
mistym |
Mhm... |
03:33
🔗
|
winr4r |
yes |
03:33
🔗
|
Wyatt|Wor |
It's also hard to change the mentality of people who put their data up without a second thought. |
03:33
🔗
|
shaqfu |
Yeah :( |
03:33
🔗
|
Wyatt|Wor |
And harder still to change the businesses that will bean-count something into oblivion without so much as a half-hearted apology. |
03:33
🔗
|
shaqfu |
And yeah, there needs to be a serious realignment if we're going to realistically deal with these records |
03:34
🔗
|
shaqfu |
Christ Almighty, Geocities alone is bigger than most university library systems |
03:34
🔗
|
shaqfu |
Good fucking luck doing it the old-fashioned way; I'll see you at the heat death of the universe |
03:34
🔗
|
Wyatt|Wor |
And yes, the metadata problem is monstrous. |
03:34
🔗
|
shaqfu |
Wyatt|Wor: It's solvable - you can work miracles with machine language processing |
03:35
🔗
|
shaqfu |
Which can at least deal with text stuff |
03:35
🔗
|
winr4r |
give it a decade or so |
03:35
🔗
|
Wyatt|Wor |
shaqfu: Haha, you got me. I was just thinking about how to hack away at it with NLP. |
03:35
🔗
|
shaqfu |
Wyatt|Wor: One panel today was about using topic modeling on newspapers; I'm sure, given time, it'll apply to messier collections |
03:36
🔗
|
shaqfu |
But yeah, get a lot of processing power together, point it at Geocities, and you'll have something at least usable |
03:36
🔗
|
dashcloud |
maybe the only way to handle the new workflows is to have a totally separate group inside that's connected in name only, so they can respond in the timeframes required |
03:37
🔗
|
mistym |
shaqfu: I see Yahoo as being a pretty good analogue. Not data deletion Yahoo, but the old Yahoo web directory. |
03:37
🔗
|
shaqfu |
Ah, yeah |
03:37
🔗
|
mistym |
I've heard (unconfirmed) that they were among the biggest employers of library-school graduates at one point in time! |
03:38
🔗
|
shaqfu |
Yeah, like DMOZ, except people used it |
03:38
🔗
|
mistym |
Exactly. |
03:38
🔗
|
mistym |
There was a point where people thought of the internet as a thing you could index by hand, with meticulous metadata. |
03:38
🔗
|
mistym |
Then the dot-com boom came, the internet *exploded*, and that was never possible again. |
03:38
🔗
|
shaqfu |
Yep |
03:39
🔗
|
winr4r |
mistym: i still think there's a place for it |
03:39
🔗
|
shaqfu |
winr4r: For hand curation? |
03:39
🔗
|
winr4r |
shaqfu: yes |
03:39
🔗
|
winr4r |
actually we have it already: it's called twitter |
03:39
🔗
|
shaqfu |
winr4r: Possible, but things move too fast for that |
03:39
🔗
|
winr4r |
you just distribute the task |
03:40
🔗
|
mistym |
winr4r: For subject-specific stuff, etc. What I'm saying is that there will never again be a time where all the Internet that's fit to print is hand-curated to someone's professional standards. |
03:40
🔗
|
shaqfu |
winr4r: That requires an established network |
03:40
🔗
|
winr4r |
shaqfu: yes |
03:40
🔗
|
dashcloud |
I think a better example is any of the social bookmarking sites (or any bookmarking site that has bookmarks open to the public) |
03:40
🔗
|
dashcloud |
like delicious and pinboard |
03:41
🔗
|
Wyatt|Wor |
dashcloud: Not enough. There's simply too much data to rely on that. |
03:41
🔗
|
winr4r |
mistym: you might be right |
03:41
🔗
|
winr4r |
on the other hand, is there actually more good stuff on the internet than there was in 1999? :P |
03:41
🔗
|
dashcloud |
hell yes |
03:42
🔗
|
dashcloud |
in quantity yes, percentage wise maybe maybenot |
03:42
🔗
|
SketchCow |
HiiiiIiIiiIIiiiI |
03:42
🔗
|
SketchCow |
I saw "Detention" |
03:42
🔗
|
SketchCow |
you must see detention. |
03:43
🔗
|
mistym |
Detention? |
03:43
🔗
|
aggro |
But I've done nothing wrong! |
03:43
🔗
|
winr4r |
hi jason |
03:43
🔗
|
shaqfu |
Movie about chewing gum in high school? |
03:43
🔗
|
Wyatt|Wor |
Evening to you. |
03:45
🔗
|
shaqfu |
SketchCow: Has there been talk at IA about using NLP to handle metadata for these McLargeHuge collections? |
03:47
🔗
|
zgrant |
What does NLP = ? |
03:48
🔗
|
zgrant |
Quick search shows Neuro Linguistic Programming, but that doesn't seem right. |
03:48
🔗
|
aggro |
That's probably it. |
03:48
🔗
|
winr4r |
natural language processing |
03:48
🔗
|
aggro |
If you're trying to data mine metadata for relevant info to humans |
03:48
🔗
|
zgrant |
winr4r: Thanks |
03:51
🔗
|
shaqfu |
Letting machines figure out word associations, more or less |
03:51
🔗
|
zgrant |
Interesting. |
03:52
🔗
|
zgrant |
I'm reading the The Stanford Natural Language Processing Group web page. Who knew? Well I guess you did. :) |
03:53
🔗
|
SketchCow |
shaqfu: Absolutely no |
03:54
🔗
|
shaqfu |
SketchCow: Really? Admittedly, I'm surprised |
03:55
🔗
|
shaqfu |
Seems like the only reasonable solution - barring some miracle, I don't see there being enough humans to mark everything up |
03:57
🔗
|
SketchCow |
Don't be the latest in a hundred people I've dealt with surprised that archive.org doesn't have much manpower. |
03:58
🔗
|
shaqfu |
SketchCow: I'm not surprised IA has minimal staff; I knew that already :P I'm surprise there's been no talk about letting machines do the heavy markup lifting |
03:59
🔗
|
mistym |
It's kind of easy to assume that archive.org is some all-powerful automoton. Then you stand in a room WITH THE INTERNET and suddenly you realize that it's not actually powered by elder gods or smth. |
03:59
🔗
|
mistym |
(even if it *is* in a church) |
04:01
🔗
|
SketchCow |
Again |
04:01
🔗
|
SketchCow |
THERE'S NOBODY TO TALK |
04:01
🔗
|
winr4r |
i'd say let someone in 20 or 30 years deal with it when they're widely recognised for being as important as they are |
04:01
🔗
|
shaqfu |
SketchCow: Gotcha this time |
04:01
🔗
|
winr4r |
you could worry about NLP now to find all those cat photos, or you could 'grep -ri "cat\.*photo" /geocities' a thousand times as fast in 20 years' time |
04:02
🔗
|
shaqfu |
winr4r: For the amount of processing power it'd take, and how NLP isn't really at the point you'd need yet, yeah, may as well wait |
04:03
🔗
|
Wyatt|Wor |
I don't think there's only one "right" approach. |
04:03
🔗
|
shaqfu |
There rarely is |
04:03
🔗
|
Wyatt|Wor |
Especially since the data is hierarchical that actually could help quite a bit. |
04:04
🔗
|
Wyatt|Wor |
Well, up to a point. |
04:05
🔗
|
Wyatt|Wor |
(I'm not familiar enough with the data set to know how what proportion of neighbourhoods are just numbered with random stuff shoved in) |
04:05
🔗
|
shaqfu |
Didn't they stop that system after a point? |
04:05
🔗
|
winr4r |
shaqfu: yes, after around 1999 i believe |
04:06
🔗
|
shaqfu |
So it's hard to regard that hierarchy for any serious use, unless you're limiting your work to 199x-1999 |
04:06
🔗
|
Wyatt|Wor |
One harpoon. |
04:06
🔗
|
shaqfu |
Hm? |
04:07
🔗
|
mistym |
Hooray, scraping script running. Hopefully will be successful! |
04:07
🔗
|
winr4r |
mistym: what are you scraping? |
04:07
🔗
|
mistym |
winr4r: digiplay.info |
04:07
🔗
|
winr4r |
mistym: excellent |
04:08
🔗
|
mistym |
Even though the data is kind of messy, it's not too much work to extract it into structured json. |
04:08
🔗
|
SketchCow |
This is also why I keep bringing in assholes from outside to dump open-source solutions and leverage archive.org against it |
04:09
🔗
|
SketchCow |
There's no dev space inside the company |
04:09
🔗
|
chronomex |
leveraged synergies |
04:09
🔗
|
chronomex |
wtf |
04:09
🔗
|
chronomex |
SketchCow is talking about leveraged synergies |
04:09
🔗
|
shaqfu |
Hunh; I knew it ran lean, but didn't expect it to run *that* lean |
04:10
🔗
|
Wyatt|Wor |
SketchCow Clicker? |
04:10
🔗
|
SketchCow |
The press says things like 200-300 employees |
04:10
🔗
|
SketchCow |
But the vast vast vast majority of those people are scanners. Book scanners. |
04:10
🔗
|
shaqfu |
Isn't it something like 20-30 core? |
04:10
🔗
|
SketchCow |
I'd say, maybe, MAYBE, my observaton is 20-30. |
04:10
🔗
|
SketchCow |
Yes, 20, 30. |
04:11
🔗
|
SketchCow |
Now, work that out. |
04:11
🔗
|
SketchCow |
We have 5 people overseeing the book scanning centers. |
04:11
🔗
|
SketchCow |
Boom, now we're 20-25% down |
04:11
🔗
|
SketchCow |
etc |
04:12
🔗
|
SketchCow |
I'm like hiring six new employees in terms of stuff and publicity and the rest |
04:12
🔗
|
SketchCow |
But I can only do things that are being brought in, there's no way to make those poor devs do MORE work |
04:12
🔗
|
chronomex |
15 coder-librians is not enough |
04:12
🔗
|
SketchCow |
And there we are. |
04:12
🔗
|
Wyatt|Wor |
And those people are also jointly responsible for running all the servers and such? |
04:12
🔗
|
SketchCow |
So if we do some sort of NLP smart tagging smartiness, great. Get on it. |
04:12
🔗
|
SketchCow |
Free tour. |
04:12
🔗
|
SketchCow |
Yes, there's a team of 5-10 dev/admin/network people |
04:13
🔗
|
SketchCow |
OH LOOK AT ALL YOUR EYES GO WIDE |
04:13
🔗
|
SketchCow |
Anyway, so yeah, get on it. |
04:13
🔗
|
winr4r |
and there's got to be like six billion servers there |
04:13
🔗
|
SketchCow |
I'll just use my universal access to ensure you get stuff to help you. |
04:14
🔗
|
Ymgve |
you should get a rifle, some tranq darts, then go hang out outside one of google's datacenters |
04:14
🔗
|
shaqfu |
Pity 'bout the 5-10 years math ed it'd take to do it; it'd be a badass project |
04:14
🔗
|
SketchCow |
Go rape a gaduate program is my suggestion |
04:15
🔗
|
SketchCow |
Anyway, unrelated, I need to go to bed now. |
04:15
🔗
|
Wyatt|Wor |
Yeah, the perfect admin abduction is hard to pull off. |
04:15
🔗
|
Wyatt|Wor |
Good night. |
04:15
🔗
|
winr4r |
night jason |
04:15
🔗
|
shaqfu |
G'nite |
04:15
🔗
|
SketchCow |
Let's keep making amazing shit |
04:17
🔗
|
SketchCow |
Ooo, one of batcave's two remaining mounted drive sets has been emptied out |
04:17
🔗
|
SketchCow |
We're now down to one. 9gb. |
04:17
🔗
|
winr4r |
in any case, i think there's a risk of over-complicating the "saving shit" strategy (which provably works very well) and turning it into a discussion about "how do we make sure that every single thing is categorised as well as books are in a library" and thereby getting very little done |
04:17
🔗
|
winr4r |
SketchCow: s/g/t/ ? |
04:17
🔗
|
winr4r |
i can't imagine you having only 9gb of *anything* |
04:17
🔗
|
SketchCow |
ha ha |
04:18
🔗
|
shaqfu |
winr4r: Yep, that's what happened on this end; it turned into an issue of description |
04:18
🔗
|
SketchCow |
Did I write 9gb? |
04:18
🔗
|
SketchCow |
I DO need a rest |
04:18
🔗
|
SketchCow |
9tb |
04:18
🔗
|
shaqfu |
Which, really, people only care 'bout good-enough |
04:18
🔗
|
winr4r |
SketchCow: did you actually sleep at all last night? |
04:18
🔗
|
shaqfu |
Barring special cases - obviously shit like presidential letters need Awesome |
04:19
🔗
|
chronomex |
meh, president is just another sack of meat |
04:19
🔗
|
winr4r |
shaqfu: good enough and actually existing beats immaculately described archives that do not |
04:19
🔗
|
shaqfu |
winr4r: You got it |
04:20
🔗
|
mistym |
"less process more product" etc |
04:20
🔗
|
shaqfu |
mistym: wrought grand - okay item-level description |
04:21
🔗
|
mistym |
~500 pages of 5000. This may take awhile. |
04:24
🔗
|
winr4r |
good luck |
04:30
🔗
|
Wyatt|Wor |
So as a baseline, any thoughts on what metadata should be given priority? Dublin Core and a mostly-flat ontology of descriptive tags? |
04:34
🔗
|
SketchCow |
http://archive.org/details/stage6 |
04:35
🔗
|
Wyatt|Wor |
He even archives in his sleep. ;) |
04:36
🔗
|
SketchCow |
zzzzzzreclassifyfuckdublincorzzzzzzmmzzzzz |
04:39
🔗
|
winr4r |
haha |
04:42
🔗
|
Wyatt|Wor |
That's fine. I'm not a particularly huge fan of DCMI, even though I live a stone's throw from Dublin. |
04:42
🔗
|
Coderjoe |
mmm |
04:42
🔗
|
Coderjoe |
i has a collection |
04:43
🔗
|
mistym |
What has you a collection of? |
04:43
🔗
|
Coderjoe |
i'm uploading the stage6 items |
04:43
🔗
|
winr4r |
Coderjoe: how did you get them? |
04:44
🔗
|
mistym |
Ahh. |
04:44
🔗
|
Coderjoe |
winr4r: I downloaded them between the closing announcement and the shutdown |
04:44
🔗
|
Coderjoe |
with metadata and everythign |
04:44
🔗
|
Coderjoe |
http://wegetsignal.org/stage6/ |
04:44
🔗
|
Coderjoe |
will probably be a little slow |
04:45
🔗
|
winr4r |
25 terabytes? |
04:45
🔗
|
chronomex |
<3 |
04:45
🔗
|
Coderjoe |
winr4r: I don't have 25 TB of videos |
04:45
🔗
|
Coderjoe |
only 290-ish GB |
04:45
🔗
|
winr4r |
oh, nm, saw the percentage |
04:45
🔗
|
winr4r |
but still, good work :) |
04:46
🔗
|
Wyatt|Wor |
Good one. |
04:46
🔗
|
Wyatt|Wor |
Is that 25TB before or after deriving? |
04:46
🔗
|
Coderjoe |
that was the projected size of what was up on the stage6 servers |
04:47
🔗
|
Wyatt|Wor |
Ah... :/ |
04:48
🔗
|
winr4r |
on an unrelated note, is there a big list of fortunecity sites that you guys have been using? |
04:48
🔗
|
Coderjoe |
and this was just me with three network connections (home, work, and a server in california) |
04:48
🔗
|
winr4r |
if i'm well within my bandwidth cap towards the end of the month, i will set my screenshot bot loose again |
04:48
🔗
|
Coderjoe |
winr4r: I think it came from google results |
04:49
🔗
|
winr4r |
http://archive.org/details/geocities-screengrabs-collection in case you didn't know |
04:49
🔗
|
winr4r |
4000+ from geocities |
04:50
🔗
|
Wyatt|Wor |
winr4r: Does it run as a normal user? I give you an account on my VPS, if you'd like. |
04:51
🔗
|
winr4r |
Wyatt|Wor: yes, though it does need an xvfb to run on |
04:51
🔗
|
winr4r |
and a bunch of dependencies that aren't normally on a server |
04:51
🔗
|
Wyatt|Wor |
That doesn't necessarily mean it's not doable. |
04:52
🔗
|
Wyatt|Wor |
(Though I've never messed with xvfb on a headless machine) |
04:52
🔗
|
winr4r |
Wyatt|Wor: me neither |
04:55
🔗
|
Coderjoe |
xvncserver wouldn't work? |
04:55
🔗
|
Coderjoe |
(you should be able to take a shot of the desktop or the like, I would think) |
04:55
🔗
|
winr4r |
Coderjoe: i'd expect it would |
04:55
🔗
|
winr4r |
i just know for sure that Xvfb does |
04:56
🔗
|
Coderjoe |
i mean, yes, it isn't a vfb, but still |
04:56
🔗
|
Coderjoe |
hmm. 6 items i need to redo |
04:57
🔗
|
Coderjoe |
i'm sure it will increase |
04:57
🔗
|
Wyatt|Wor |
Okay, looks like xvfb will work on a headless box. That's what Google says. |
04:58
🔗
|
Coderjoe |
btw, that "videos listed" stat is just the videos I had pulled into my database with my importer. the "total video count" at the bottom is my estimated total video count that stage6 hosted |
04:58
🔗
|
winr4r |
Wyatt|Wor: splendid! |
06:57
🔗
|
Nemo_bis |
underscor, what dump server? |
06:59
🔗
|
Nemo_bis |
ah, your.org |
09:15
🔗
|
Wyatt|Wor |
Curious, since this thing is _still_ grepping that file, the webdav-feed.json and .xml...what role do they serve, exactly? |
09:15
🔗
|
winr4r |
how big is the file? |
09:16
🔗
|
winr4r |
and how can it take that long to grep anything? |
09:16
🔗
|
chronomex |
fgrep is much faster, for fixed strings |
09:17
🔗
|
winr4r |
yeah but i grepped a 1.9gb file in seconds, earlier today |
09:17
🔗
|
chronomex |
it was probably all sitting in ram |
09:17
🔗
|
winr4r |
(getting a list of fortunecity sites from the ODP) |
09:17
🔗
|
winr4r |
chronomex: nope, fresh from the disk |
09:17
🔗
|
chronomex |
hm, ok |
09:19
🔗
|
Wyatt|Wor |
It's the json is 35MB. The incantation is grep http://gallery.me.com/[^"<]+ data/p/pe/per/pertormod1/gallery.me.com/webdav-feed.json and it's accumulated 3766 CPU _Minutes_ |
09:19
🔗
|
winr4r |
okay, i would call that a bug |
09:20
🔗
|
chronomex |
indeed |
09:20
🔗
|
Deewiant |
Run it interactively and see what it outputs, if anything? |
09:20
🔗
|
emijrp |
use [^"<]+? and grep -E |
09:20
🔗
|
Wyatt|Wor |
Even assuming the worst case of grep's iconv locale performance, I'm inclined to agree. |
09:20
🔗
|
Wyatt|Wor |
Sorry, there's an -oE in there I missed |
09:21
🔗
|
Wyatt|Wor |
(It's the seesaw-s3.sh) |
09:21
🔗
|
winr4r |
just timed a regex again on a copy of said 19gb file, 19.4 seconds |
09:21
🔗
|
chronomex |
winr4r: 1.9 or 19? |
09:21
🔗
|
winr4r |
1.9gb* |
09:21
🔗
|
emijrp |
+? |
09:21
🔗
|
chronomex |
ah |
09:21
🔗
|
emijrp |
+? |
09:21
🔗
|
winr4r |
so yeah 3766 minutes for a 35mb file is a LITTLE excessive |
09:22
🔗
|
chronomex |
Wyatt|Wor: hmmmm. I would try some cut | grep(not -E) action. |
09:22
🔗
|
Deewiant |
That comes out to about 162 bytes per second (and dropping, if it's still going) |
09:22
🔗
|
chronomex |
drooping |
09:23
🔗
|
Wyatt|Wor |
That's my perspective too. I think the emulated ARM processor that booted Linux on the 8-bit MC had a better data rate about a thousand times that. |
09:25
🔗
|
alard |
winr4r: I see you're asking about a list of FortuneCity sites. I can send you the list from which we've been downloading, if that helps. |
09:25
🔗
|
Deewiant |
Hey, it's faster than some 600 baud modems, according to Wikipedia. |
09:25
🔗
|
chronomex |
some? |
09:25
🔗
|
winr4r |
alard: i'd appreciate it, we can compare notes too |
09:25
🔗
|
Deewiant |
https://en.wikipedia.org/wiki/List_of_device_bandwidths#Modems_.E2.80.93_narrow_and_broadband |
09:26
🔗
|
winr4r |
alard: i grabbed a list from ODP (hence greeping a 1.9gb file), did you guys try that? |
09:26
🔗
|
Deewiant |
There are two 600 baud ones that're 1.2 kbit/s and one that's 2.4 kbit/s. |
09:26
🔗
|
chronomex |
ah |
09:26
🔗
|
alard |
winr4r: What's ODP? |
09:26
🔗
|
winr4r |
alard: open directory project |
09:26
🔗
|
alard |
Ah, I see. No, I just googled. |
09:26
🔗
|
winr4r |
okay, one sec |
09:27
🔗
|
winr4r |
http://dl.dropbox.com/u/57276499/sitelist.txt |
09:28
🔗
|
winr4r |
is what i got from ODP |
09:28
🔗
|
Wyatt|Wor |
I actually don't understand this regex, even. o is --only-matching -E is extended regex... how does this work? [^"<]+ |
09:29
🔗
|
chronomex |
I don't think you should need -E for that |
09:29
🔗
|
alard |
win4r: Okay, got it. I'm currently making my list. |
09:29
🔗
|
alard |
Wyatt|Wor: The regex matches anything until " or < |
09:29
🔗
|
alard |
I believe that's matching urls in the webdav file. |
09:30
🔗
|
alard |
So it will match from http:// until the tag ends. |
09:30
🔗
|
Wyatt|Wor |
alard: Ah, I thought the caret was an anchor to the beginning? |
09:31
🔗
|
winr4r |
Wyatt|Wor: not within []s |
09:31
🔗
|
alard |
Between [] it's a negation. So 'anything but " and < ' |
09:31
🔗
|
winr4r |
alard: thanks :) |
09:31
🔗
|
Wyatt|Wor |
ooooooh, I see. Hmm, need to put more skill points in RegEx. And the +? |
09:31
🔗
|
chronomex |
one or more instances of the preceding object |
09:32
🔗
|
alard |
* means zero or more. |
09:32
🔗
|
chronomex |
er, matching element, which in this case is the whole [] expression |
09:32
🔗
|
Wyatt|Wor |
Ah, so [] create a single semantic unit. I see. |
09:32
🔗
|
chronomex |
indeed |
09:33
🔗
|
chronomex |
it matches exactly one character |
09:33
🔗
|
alard |
winr4r: http://db.tt/PjVwK1A2 (a 3.7MB .txt.bzip2) |
09:34
🔗
|
chronomex |
is that the file we're working on? |
09:34
🔗
|
alard |
No, that's the list of all fortunecity sites. |
09:34
🔗
|
winr4r |
alard: thanks! |
09:35
🔗
|
alard |
winr4r: You'll have to expand the streets yourself, we've basically archived anything from number 0 to 2600. |
09:37
🔗
|
winr4r |
alard: so com/campus/athena = campus.fortunecity.com/athena/<numbers here> ? |
09:41
🔗
|
Wyatt|Wor |
Hm, so it's definitely finding things, though it seems awfully slow... |
09:42
🔗
|
emijrp |
paste the entire command line |
09:43
🔗
|
Wyatt|Wor |
grep -oE 'http://gallery.me.com/[^"<]+' data/p/pe/per/pertormod1/gallery.me.com/webdav-feed.json # Pretty much verbatim from seesaw-s3.sh |
09:44
🔗
|
Wyatt|Wor |
Ah, I think I've got the problem. Who do I bug about a patch? |
09:45
🔗
|
emijrp |
a bug in grep? |
09:45
🔗
|
Wyatt|Wor |
Well, yes, to an extent. But it's a bug I think we can safely work around. export LANG=C And it's about three orders of magnitude faster |
09:46
🔗
|
Wyatt|Wor |
I could have sworn the iconv bug was fixed though. :/ |
09:46
🔗
|
emijrp |
where is the file to parse? i want to make some grep tests |
09:46
🔗
|
Wyatt|Wor |
emijrp: Let me stick it somewhere. |
09:48
🔗
|
Wyatt|Wor |
Come to think of it , DCC would have been faster... |
09:48
🔗
|
Wyatt|Wor |
radiusic.com/bigfeet.json |
09:48
🔗
|
Wyatt|Wor |
"d" and "t" are totally right next to each other. |
09:50
🔗
|
emijrp |
downlaiding |
09:52
🔗
|
Wyatt|Wor |
Okay yeah, it hit me because my grep is old. It's apparently fixed in grep 2.9 |
09:53
🔗
|
Wyatt|Wor |
(Didn't realise I was still using grep 2.5.4) |
09:53
🔗
|
emijrp |
what do you want, the entire url o just the domain + username?= |
09:53
🔗
|
chronomex |
grep 2.old |
09:53
🔗
|
Wyatt|Wor |
The problem is, in this case, most distros in production are probably using old grep. |
09:54
🔗
|
Wyatt|Wor |
CentOS 6 has grep 2.6 |
09:54
🔗
|
winr4r |
i'm on 2.5.4 too |
09:55
🔗
|
winr4r |
but piping it to a file, it takes a few seconds |
09:56
🔗
|
winr4r |
8 seconds, to be precise |
09:56
🔗
|
winr4r |
time grep -oE 'http://gallery.me.com/[^"<]+' bigfeet.json > what |
09:56
🔗
|
winr4r |
is what i am using |
09:57
🔗
|
Wyatt|Wor |
emijrp: The problem isn't that it doesn't work. The problem is that when you're using many versions of grep in the wild with LANG=en_US.utf8 (or any unicode, locale for that matter), it's fantastically slow. |
09:59
🔗
|
Nemo_bis |
unicode comparisons are always very slow |
09:59
🔗
|
Wyatt|Wor |
The good thing is, we can patch our scripts by explicitly setting LANG=C and LC_CTYPE=C and that should be safe. |
10:00
🔗
|
winr4r |
i am en_GB.utf-8 and that grep still takes seconds rather than hours |
10:00
🔗
|
Wyatt|Wor |
(Or just unset LC_CTYPE) |
10:01
🔗
|
Nemo_bis |
In what format is the file you're grepping save in? Wouldn't this matter? |
10:01
🔗
|
Nemo_bis |
I suppose grep should determine what charset it's being used, but will do so only from the headers... |
10:02
🔗
|
Wyatt|Wor |
Nemo_bis: It's just a JSON file from mobileme |
10:03
🔗
|
alard |
winr4r: "so com/campus/athena = campus.fortunecity.com/athena/<numbers here> ?" Yes, or www.fortunecity.com/campus/athena/<number>. (The subdomain approach doesn't work with co.uk/it/es/se, I think.) |
10:03
🔗
|
winr4r |
alard: gotcha |
10:48
🔗
|
emijrp |
netsplit |
11:08
🔗
|
Wyatt|Wor |
Okay, updated grep and things are much speedier. I'll try to take a look at the memac scripts when I get home and figure out where to add that env. |
11:09
🔗
|
winr4r |
it's weird though |
11:10
🔗
|
winr4r |
that i can be running the same version also with a UTF-8 LANG and do in nineteen seconds what your grep didn't finish in hours |
11:10
🔗
|
* |
winr4r isn't exactly on a speed-demon computer |
11:14
🔗
|
Wyatt|Wor |
winr4r: What distro/version? |
11:15
🔗
|
winr4r |
Wyatt|Wor: ubuntu 10.04 |
11:15
🔗
|
Wyatt|Wor |
Distro-specific patches will do that. Yeah, Debian patched it a while back. |
11:15
🔗
|
winr4r |
ah :) |
11:15
🔗
|
Wyatt|Wor |
Gentoo just stabled a newer version instead. |
11:15
🔗
|
winr4r |
that explains that |
11:16
🔗
|
Wyatt|Wor |
(But this is my work computer, so I don't exactly bother updating often) |
11:17
🔗
|
* |
winr4r nods |
11:17
🔗
|
winr4r |
1507 screenshots ;D |
11:18
🔗
|
Wyatt|Wor |
Ooh, going pretty fast. |
11:27
🔗
|
oli |
06:25:13 up 13:46, 6 users, load average: 54.70, 54.97, 58.34 |
11:27
🔗
|
oli |
hmm i think i started too many threads |
11:27
🔗
|
Wyatt|Wor |
What, that's it? |
11:27
🔗
|
oli |
hahaha |
11:27
🔗
|
oli |
yeah that's it |
11:28
🔗
|
Wyatt|Wor |
;) |
11:30
🔗
|
oli |
i got a box from softlayer and its not going over 100mbit :( |
11:31
🔗
|
Wyatt|Wor |
Time to go home. Later. |
11:32
🔗
|
winr4r |
bye Wyatt|Wor! |
13:33
🔗
|
SketchCow |
See? I sleep like everyone else. Here I am, back up again. |
13:34
🔗
|
undersco2 |
lies |
13:40
🔗
|
winr4r |
haha |
13:41
🔗
|
winr4r |
hey cow, i'll have fortunecity screenshots for you soon |
13:41
🔗
|
winr4r |
i'll email you when i'm done, it's not urgent |
13:44
🔗
|
SketchCow |
Sounds fun |
13:49
🔗
|
winr4r |
Wyatt|Wor is letting me use his VPS for it |
13:49
🔗
|
winr4r |
i'm at nearly 2000 now |
13:49
🔗
|
winr4r |
how are you? :) |
13:49
🔗
|
SketchCow |
Just blew another bulk of mobileme off batcave. |
13:49
🔗
|
SketchCow |
The machine is now down to 8.8tb of data. |
13:50
🔗
|
SketchCow |
Which is good, it's down from rough 28tb |
13:52
🔗
|
SketchCow |
Mostly, I'm stunned, I'm finding additional pieces of friendster |
13:52
🔗
|
SketchCow |
And everything else. |
13:52
🔗
|
SketchCow |
Also, our Berlios grab |
13:53
🔗
|
winr4r |
that's the archive team equivalent of finding loose change in your sofa? |
13:53
🔗
|
SketchCow |
Yeah |
13:54
🔗
|
Nemo_bis |
What about splinder? |
13:54
🔗
|
Nemo_bis |
SketchCow, I think chronomex needed a place where to upload his last pieces of Splinder. |
13:55
🔗
|
SketchCow |
Next, I need to start shoving splinder into archive.org proper. |
14:01
🔗
|
SketchCow |
Hey there, I'm James. I'm from Australia. |
14:01
🔗
|
SketchCow |
I have the site bookmarked again, and will read a few files when I get the time. |
14:01
🔗
|
SketchCow |
I'm nearly seventeen, and I remember coming across textfiles at 12 or 13.. |
14:01
🔗
|
SketchCow |
You are fucking straight up, and I respect it.. |
14:01
🔗
|
SketchCow |
Thanks for doing what you do! |
14:02
🔗
|
winr4r |
:) |
14:02
🔗
|
winr4r |
doesn't that sort of thing just make your day? |
14:04
🔗
|
SketchCow |
Well, I get a lot of them. |
14:04
🔗
|
SketchCow |
But I do appreciate them. |
14:04
🔗
|
winr4r |
mhm |
14:29
🔗
|
SketchCow |
2.2T mobileme-03 |
14:29
🔗
|
SketchCow |
2.8G mobileme-05 |
14:29
🔗
|
SketchCow |
413G mobileme-06 |
14:29
🔗
|
SketchCow |
7.5G mobileme-04 |
14:29
🔗
|
SketchCow |
See just kind of lying around there |
14:30
🔗
|
winr4r |
2.2 terabytes sounds like a small figure then you see "413G" and then it's like "oh, that is actually a big number" |
14:35
🔗
|
oli |
haha |
14:35
🔗
|
oli |
are there any other projects apart from mobileme i can be helping with? i have bandwidth to spare |
14:39
🔗
|
SketchCow |
Check the wiki? |
14:40
🔗
|
SketchCow |
I don't actually know offhand which need bandwidth OTHER than mobileme |
14:41
🔗
|
SketchCow |
I'm about to dump a pile of Polish shareware CDs onto the cdbbscollection. |
14:42
🔗
|
oli |
yeah i looked, theres not really anything else to do from what i can see :( |
14:43
🔗
|
Deewiant |
Can't you simply use more bandwidth on mobileme, or is mobileme at its limit or something? |
14:44
🔗
|
SketchCow |
Mobileme is a cancer eating all our attention - after I finish with batcave's decomission I will start regarding other things we can do. |
14:45
🔗
|
oli |
i cant seem to get more than about 100mbit out of mobileme from my box at softlayer even though its on a gige connection |
14:45
🔗
|
oli |
and im running a lot of threads, running more just bogs the system down and doesnt get anything downloading faster |
14:50
🔗
|
SketchCow |
Someone has sent me 4gb (or thereabouts) of mid 1990s Spanish demoscene stuff. |
14:52
🔗
|
undersco2 |
SketchCow: that message from james is really cool |
14:53
🔗
|
SketchCow |
Yeah, and he's still a young nubile 17 year old and not some busted old mare like you |
14:53
🔗
|
* |
SketchCow turns undersco2 |
14:53
🔗
|
SketchCow |
Come back when you've earned three fiddy |
14:53
🔗
|
oli |
i cant resolve textfiels.com :/ |
14:53
🔗
|
oli |
textfiles.com rather |
14:53
🔗
|
SketchCow |
Record expires on 07-Oct-2021. |
14:53
🔗
|
undersco2 |
SketchCow: <3 |
14:53
🔗
|
undersco2 |
hahaha |
14:53
🔗
|
SketchCow |
It ain't that! |
14:53
🔗
|
SketchCow |
2021! |
14:53
🔗
|
winr4r |
same here |
14:53
🔗
|
SketchCow |
Bitches! |
14:54
🔗
|
undersco2 |
;; ANSWER SECTION: |
14:54
🔗
|
undersco2 |
textfiles.com. 3600 IN A 208.86.224.90 |
14:54
🔗
|
undersco2 |
fine here |
14:54
🔗
|
SketchCow |
Well, I'm ON textfiles.com, so it's not the machine. |
14:54
🔗
|
oli |
cant get it from my box in australia or here in budapest |
14:54
🔗
|
SketchCow |
Likely, someone is assfucking the apache. |
14:54
🔗
|
SketchCow |
One moment. |
14:54
🔗
|
undersco2 |
|
14:54
🔗
|
undersco2 |
The connection was reset |
14:54
🔗
|
undersco2 |
|
14:54
🔗
|
undersco2 |
|
14:54
🔗
|
undersco2 |
The connection to the server was reset while the page was loading. |
14:54
🔗
|
oli |
yep same as undersco2 |
14:54
🔗
|
undersco2 |
yep |
14:54
🔗
|
undersco2 |
oh fuck |
14:55
🔗
|
undersco2 |
that's a lot of returns |
14:55
🔗
|
undersco2 |
sorry |
14:55
🔗
|
SketchCow |
Ah, here we are. |
14:55
🔗
|
SketchCow |
Someone has 480 simultaneous connections to the machine. |
14:55
🔗
|
SketchCow |
That might be a factor. |
14:55
🔗
|
winr4r |
jesus |
14:55
🔗
|
oli |
anyone know a way/system for a redundant multi node filesystem i can run between many computers? |
14:55
🔗
|
oli |
w./ linux |
14:55
🔗
|
SketchCow |
ha ha. |
14:56
🔗
|
SketchCow |
Someone's about to meet my old friend mister soft firewall |
14:56
🔗
|
undersco2 |
hahaha |
14:56
🔗
|
undersco2 |
oli: ceph |
14:56
🔗
|
winr4r |
can't you put in a rewrite rule for his IP so he downloads 480 goatses every time? |
14:56
🔗
|
undersco2 |
^ |
14:56
🔗
|
undersco2 |
hahahahaha |
14:57
🔗
|
oli |
undersco2: thx will look into it |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42364 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42365 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42591 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33080 208.86.224.90.80 189.19.142.212.42506 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33080 208.86.224.90.80 189.19.142.212.42566 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33080 208.86.224.90.80 189.19.142.212.42328 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33079 208.86.224.90.80 189.19.142.212.42257 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42238 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42239 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42129 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33080 208.86.224.90.80 189.19.142.212.42126 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33077 208.86.224.90.80 189.19.142.212.42127 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33078 208.86.224.90.80 189.19.142.212.42128 LAST_ACK |
14:59
🔗
|
SketchCow |
tcp4 0 33079 208.86.224.90.80 189.19.142.212.42080 LAST_ACK |
14:59
🔗
|
SketchCow |
It's like that all the way down. |
14:59
🔗
|
SketchCow |
Just blocked him AND turned off the website for a moment |
14:59
🔗
|
SketchCow |
I blocked his subnet, because it feels good man |
15:00
🔗
|
winr4r |
haha |
15:01
🔗
|
SketchCow |
It's getting there. |
15:01
🔗
|
SketchCow |
Another 3-4 minutes, it'll be down to normal, then I'll restart. |
15:01
🔗
|
SketchCow |
I love these toolbags. |
15:01
🔗
|
oli |
thanks |
15:02
🔗
|
SketchCow |
WOAH SHIT THIS WEBSITE IS MIRRORED IN 15 LOCATIONS AND HAS BEEN ON THE NET FOR 14 YEARS I BETTER OPEN THREE BILLION CONNECTIONS AND SUCK IT DOWN NOW |
15:02
🔗
|
SketchCow |
AAAAAHHHH COULD GO ANY MORE |
15:02
🔗
|
SketchCow |
ANY SECOND NOW IT MIGHT DIE |
15:02
🔗
|
SketchCow |
AIIREEEEEEE I ATE SUGER FROSTED SUGAR THIS MORNING WHILE DRINKING QUIK |
15:03
🔗
|
winr4r |
^ this is the exact same conversation being had in fortunecity's secret IRC channel |
15:03
🔗
|
SketchCow |
AAAARRRGGGGIGIGIGIGIGIGG |
15:04
🔗
|
winr4r |
haha. |
15:05
🔗
|
SketchCow |
Well, they're still at 200 connections, but bringing textfiles.com back. |
15:06
🔗
|
BlueMax |
whoops, was that me? |
15:07
🔗
|
SketchCow |
Hooray, my methamphetamine textfile is up |
15:07
🔗
|
SketchCow |
http://www.textfiles.com/drugs/himet1.txt |
15:07
🔗
|
SketchCow |
INTERNET SAVED |
15:07
🔗
|
BlueMax |
lol |
15:07
🔗
|
BlueMax |
I can't help but wonder how many textfiles we missed. |
15:10
🔗
|
SketchCow |
In terms of what. |
15:10
🔗
|
SketchCow |
When you say we missed, do you mean me? |
15:10
🔗
|
mistym_ |
Erk, looks like scraping errors in my digiplay.info data. At least I have the html cached now. |
15:10
🔗
|
SketchCow |
Because as far as I can tell, believe it or not, I got most of them, ultimately. |
15:10
🔗
|
BlueMax |
No, I miss you whenever I sleep. |
15:11
🔗
|
BlueMax |
Most of them? |
15:11
🔗
|
SketchCow |
Well, nearly all that were passed from BBS to BBS. |
15:11
🔗
|
BlueMax |
I wonder if the BBSes that are up today still have any we don't. |
15:11
🔗
|
BlueMax |
Or you don't. |
15:12
🔗
|
BlueMax |
However you want to say it. |
15:12
🔗
|
Wyatt |
So if someone is willfully trying to immortalise their community/content on IA, how best to go about that? |
15:12
🔗
|
winr4r |
hiya Wyatt :) |
15:13
🔗
|
winr4r |
Wyatt: going strong btw, over 2000 now! |
15:14
🔗
|
Wyatt |
Feel free to hammer on it all week if you want. Start a couple in parallel, even. |
15:14
🔗
|
winr4r |
:D |
15:15
🔗
|
SketchCow |
SCREENSHOT ALL THE THINGS |
15:15
🔗
|
SketchCow |
Ha ha, this isn't spanish demo scene. |
15:15
🔗
|
SketchCow |
This is Spanish ATARI demo scene |
15:15
🔗
|
Wyatt |
Oh hawt |
15:16
🔗
|
BlueMax |
mmm, tilt that joystick |
15:22
🔗
|
SketchCow |
http://archive.org/details/spanish-demoscene-collection |
15:23
🔗
|
emijrp |
Finally some Spanish content. |
15:24
🔗
|
SketchCow |
Por último, todo el mundo puede ser un idiota! |
15:25
🔗
|
BlueMax |
Everyone can be an idiot! |
15:25
🔗
|
emijrp |
ú |
15:25
🔗
|
emijrp |
lólólólól |
15:25
🔗
|
emijrp |
TACO. |
15:31
🔗
|
emijrp |
A weird thing of IA items is that don't show who is the uploader, so, you can't search for similar stuff using the uploader contributions list. |
15:33
🔗
|
SketchCow |
Agreed |
15:35
🔗
|
SketchCow |
root@teamarchive-0:/2/FRIENDSTER# du -sh . |
15:35
🔗
|
SketchCow |
1.7T . |
15:36
🔗
|
winr4r |
:D |
15:40
🔗
|
emijrp |
15 years later |
15:40
🔗
|
emijrp |
17:35:42 <SketchCow> 1.7P . |
15:40
🔗
|
emijrp |
17:35:42 <SketchCow> root@teamarchive-0:/2/FACEBOOK# du -sh . |
15:40
🔗
|
winr4r |
yes |
15:40
🔗
|
Wyatt |
Only 1.7? |
15:40
🔗
|
BlueMax |
Onl- DAMNIT |
15:40
🔗
|
nitro2k01 |
Haha |
15:41
🔗
|
winr4r |
https://www.google.co.uk/search?hl=en&site=webhp&q=define+gigabyte |
15:41
🔗
|
winr4r |
also |
15:41
🔗
|
Wyatt |
Facebook has got to be past 100PB by now, right? |
15:41
🔗
|
nitro2k01 |
All of this is nothing compared to when YouTube goes down |
15:41
🔗
|
winr4r |
will someone click on the audio icon there and tell me that google is just trolling us |
15:41
🔗
|
BlueMax |
YouTube Collection |
15:42
🔗
|
winr4r |
JYGABYTE |
15:42
🔗
|
winr4r |
nitro2k01: let's not even think about that :< |
15:44
🔗
|
nitro2k01 |
I wonder if there's a real risk that Flickr goes down. I think not, since it seems to be one of the Yahoo services that are actually profitable. |
15:44
🔗
|
nitro2k01 |
Or, I would imagine so |
15:44
🔗
|
Wyatt |
Didn't they lay off most of the people working on it? |
15:45
🔗
|
nitro2k01 |
I don't know, just speaking of the data retention here |
15:45
🔗
|
nitro2k01 |
Seems unlikely they would just kill it like Geocities |
15:45
🔗
|
nitro2k01 |
Since quite a few people actually have those pro badges that means they pay up every year |
15:48
🔗
|
Wyatt |
I'd say Delicious is definitely likely to be amputated first among their remaining high-profile sites. |
15:49
🔗
|
winr4r |
Wyatt: it already was |
15:49
🔗
|
winr4r |
delicious is not owned by yahoo anymore |
15:49
🔗
|
Wyatt |
...what, someone actually bought it?! |
15:49
🔗
|
nitro2k01 |
And myspace... Just think of all the flashy designs that were trashed overnight |
15:50
🔗
|
emijrp |
http://longbets.org |
15:50
🔗
|
winr4r |
Wyatt: yup, i believe it was the founder of youtube |
15:50
🔗
|
Wyatt |
I thought that was a joke. I...am rather surprised. |
15:50
🔗
|
Wyatt |
Somehow I missed the reality of it. |
15:50
🔗
|
SketchCow |
You did. |
15:50
🔗
|
SketchCow |
Hard. |
15:51
🔗
|
SketchCow |
And it was two of the founders of youtube |
15:51
🔗
|
SketchCow |
In April. |
15:51
🔗
|
SketchCow |
Of 2011. |
15:51
🔗
|
winr4r |
yeah, just looked that up and found that |
15:51
🔗
|
winr4r |
delicious was actually one of the best services that i have seen |
15:52
🔗
|
winr4r |
all together now: "fuck yahoo!" |
15:53
🔗
|
emijrp |
fuck the internet |
15:53
🔗
|
nitro2k01 |
Fuck everything |
15:53
🔗
|
* |
nitro2k01 omniphile |
15:53
🔗
|
SmileyG |
http://i.imgur.com/PHzN9.jpg |
15:54
🔗
|
SmileyG |
FUCK IT WITHOUT DEPS! |
15:54
🔗
|
nitro2k01 |
Heh |
15:54
🔗
|
winr4r |
haha |
15:54
🔗
|
Wyatt |
Right, now I have to google for what Yahoo even owns anymore. |
16:01
🔗
|
SketchCow |
shhh, we're fucking |
16:02
🔗
|
undersco2 |
lol |
16:03
🔗
|
undersco2 |
SketchCow: textfiles still down? |
16:03
🔗
|
undersco2 |
curl textfiles.com |
16:03
🔗
|
undersco2 |
curl: (7) couldn't connect to host |
16:03
🔗
|
Wyatt |
If it helps I just described AT as bearers of a "titanic black strap-on archival dildocannon"... |
16:04
🔗
|
ersi |
nitro2k01: so, what says they won't delete all of the free users content? :P |
16:04
🔗
|
undersco2 |
...bahaha |
16:04
🔗
|
ersi |
nitro2k01: I mean, don't give Yahoooooo too much credit |
16:05
🔗
|
nitro2k01 |
They're stupid if they shut down a service that brings in the green stuff (this includes even free accounts) |
16:05
🔗
|
nitro2k01 |
Hey something is actually making money in our empire. LET'S SHUT IT DOWN! MOAHAHAHAHA! |
16:05
🔗
|
ersi |
They've demonstrated exactly how they work over and over again |
16:05
🔗
|
ersi |
Just because they get income on a project, doesn't mean they'll let it be |
16:05
🔗
|
ersi |
We do have #flickrfckr you know, just not grabbing everything continously ;p |
16:06
🔗
|
nitro2k01 |
Worst case scenario, it'll branch off, or Yahoo will go bankrupt and be split |
16:06
🔗
|
ersi |
You're now on the lulz list |
16:06
🔗
|
nitro2k01 |
Come back in 5 years when you've discovered I was right :p |
16:07
🔗
|
SketchCow |
textfiles.com is down. |
16:07
🔗
|
SketchCow |
I put the firewall block in the wrong place |
16:07
🔗
|
SketchCow |
LOVE the nerds all backseat humping me on how I should run my website |
16:07
🔗
|
SketchCow |
LOVE LOVE LOOOOOOOOOOOOOOVE IT |
16:07
🔗
|
SketchCow |
Love that. |
16:07
🔗
|
ersi |
put it in the cloud maaan |
16:07
🔗
|
undersco2 |
I'm not trying to tell you how to do it |
16:07
🔗
|
undersco2 |
I just wanted to read the meth textfile |
16:07
🔗
|
Wyatt |
Put it on the moooon! |
16:08
🔗
|
SketchCow |
Can I say that a lot? Can I say it in a way that just echoes in the back of your mind for days and days? Aspy nerds guffawing and saying what I should do a and b and c and d as and why it's better and so on? |
16:08
🔗
|
SketchCow |
Love it |
16:08
🔗
|
SketchCow |
I want to fuck it and make 10 of it and fuck those and make 100 of it |
16:08
🔗
|
nitro2k01 |
Sometimes aspie nerds are right |
16:08
🔗
|
nitro2k01 |
SOMETIMES |
16:08
🔗
|
ersi |
sometimes they're just loud fucking assholes though |
16:08
🔗
|
SketchCow |
A broken clock is right twice a day and also doesn't flip out when you move its juice box |
16:09
🔗
|
nitro2k01 |
And sometimes even both |
16:09
🔗
|
nitro2k01 |
in the same time |
16:09
🔗
|
undersco2 |
I think SketchCow just likes to fuck |
16:09
🔗
|
undersco2 |
regardless of what sentiment it is |
16:09
🔗
|
ersi |
SketchCow: :D |
16:10
🔗
|
nitro2k01 |
SketchCow likes to fuck because he's a dick <3 |
16:11
🔗
|
ersi |
point is I don't give a fuck if you're right in five years or not |
16:12
🔗
|
ersi |
because it doesn't matter |
16:12
🔗
|
nitro2k01 |
Right. What matters is say something positive about Yahoo -> lulz list |
16:12
🔗
|
nitro2k01 |
Or even just neutral common sense |
16:13
🔗
|
nitro2k01 |
Yahoo must be bashed |
16:13
🔗
|
nitro2k01 |
It's the rite of passage |
16:14
🔗
|
ersi |
right, totally |
16:14
🔗
|
ersi |
We'll leave it at that |
16:15
🔗
|
nitro2k01 |
inb4 someone highlights me in two hours and goes like "Well you see the real point is..." |
16:17
🔗
|
winr4r |
SketchCow: holy shit that was brilliant |
16:19
🔗
|
SketchCow |
Me: Blue hair, silver tube top, fishnets, Knee high black biker boots. |
16:19
🔗
|
SketchCow |
You: Red mohawk, black pentagram gauges, viper piercings. |
16:19
🔗
|
SketchCow |
I was grinding on you in the pit, then we went to the bathroom, and got f***ed up. You had a nice c**k and I was wasted so I let [you] raw dog it in the stall. You were really good and you had to gag me so I would make too much noise. |
16:19
🔗
|
SketchCow |
Anyway I'm pregnant. It's yours. contact me if you want to be part of your child's life. |
16:20
🔗
|
SketchCow |
What's brilliant. |
16:20
🔗
|
nitro2k01 |
I came and farted. |
16:24
🔗
|
undersco2 |
SketchCow: hot |
16:29
🔗
|
SketchCow |
Oh here we go |
16:29
🔗
|
SketchCow |
Gigabytes of polish cd-roms |
16:30
🔗
|
SketchCow |
First one in! |
16:30
🔗
|
SketchCow |
http://archive.org/details/chip-cds will get them as they go |
16:31
🔗
|
BlueMax |
Good luck with that :P |
16:31
🔗
|
SketchCow |
http://archive.org/details/chip-cds-1997-0 added |
16:31
🔗
|
undersco2 |
yay |
17:04
🔗
|
SketchCow |
http://archive.org/details/chip-cds |
17:04
🔗
|
SketchCow |
awwww yeah |
17:05
🔗
|
emijrp |
language attribute is wrong |
17:06
🔗
|
SketchCow |
Yes |
17:06
🔗
|
SketchCow |
That's the ingestor. |
17:06
🔗
|
SketchCow |
After it's done, I'll fix them like THAT |
17:15
🔗
|
* |
SmileyG is so confused |
17:15
🔗
|
SmileyG |
so you run a website SketchCow / |
17:16
🔗
|
SmileyG |
or were you quoting someone? :D |
17:19
🔗
|
SketchCow |
I run a website |
17:23
🔗
|
SketchCow |
Almost done with the CDs! |
17:23
🔗
|
SmileyG |
aye |
17:23
🔗
|
SmileyG |
Wyatt is updating me elsewhere ;) |
17:23
🔗
|
SketchCow |
More and more things I can put to bed on batcave. |
17:23
🔗
|
SmileyG |
Nice video of you at Defcon. |
17:26
🔗
|
SketchCow |
Whoops, forgot 1998, fixing. |
17:28
🔗
|
SmileyG |
Have to say, your very good at presenting |
17:35
🔗
|
Nemo_bis |
For a moment I thought SketchCow had already ripped all my discs. |
17:35
🔗
|
emijrp |
This is sad, but I don't understand most of English spoken talks. |
17:35
🔗
|
emijrp |
That includes SketchCow presentations. |
17:36
🔗
|
emijrp |
Language is a fucking barrier. |
17:40
🔗
|
emijrp |
https://www.universalsubtitles.org/es/videos/NE0VZdfk5yzP/info/archive-team-a-distributed-preservation-of-service-attack/ |
17:41
🔗
|
emijrp |
#subtitlesteam spread the word about backups writing subtitles in any language |
17:48
🔗
|
emijrp |
who can help? |
17:49
🔗
|
SmileyG |
WTF IS UP WITH THAT |
17:50
🔗
|
SmileyG |
Bulgey McFishhat? |
17:50
🔗
|
chronomex |
<3 |
17:51
🔗
|
SmileyG |
emijrp: have you tried googles auto translate+cc stuff? |
17:51
🔗
|
emijrp |
SmileyG: sucks |
17:51
🔗
|
SmileyG |
Ah :S |
17:55
🔗
|
SmileyG |
Hnnnm |
17:55
🔗
|
SmileyG |
you guys got room to grab megaupload? :/ |
17:56
🔗
|
Wyatt |
Can't be "got" in its current state. |
17:56
🔗
|
Wyatt |
(Last I heard, at least) |
17:56
🔗
|
Wyatt |
EFF is fighting that fight. |
18:00
🔗
|
SmileyG |
yeah |
18:01
🔗
|
SmileyG |
Just, if someone turned up and went "Ok, you don't wanna pay for it, we can store it. Hand it over"..... once the legal issues are done, I think the hosting company would jump at the chance by the sound of things... |
18:01
🔗
|
SmileyG |
(funny how they were happy to take the money up until then ;)) |
18:02
🔗
|
SmileyG |
heh |
18:02
🔗
|
SmileyG |
one day, the archive will end up larger than the entire worlds current info :/ |
18:06
🔗
|
SmileyG |
I think that'll be a proud moment |
18:14
🔗
|
SmileyG |
hmm |
18:14
🔗
|
SmileyG |
I think I have a new found respect, and a bit of a man crush on SketchCow :O |
18:16
🔗
|
winr4r |
SmileyG: HANDS OFF HE'S MINE |
18:16
🔗
|
SmileyG |
hehe |
18:16
🔗
|
SmileyG |
i wish I had the..... well, money he musth ave :D |
18:16
🔗
|
SmileyG |
I havel ike £4 spare a month :( |
18:18
🔗
|
emijrp |
gayteam |
18:18
🔗
|
SmileyG |
no9t much I can do with £4 heh :D |
18:21
🔗
|
ersi |
£4/mo? That's not much |
18:21
🔗
|
ersi |
but it's something |
18:22
🔗
|
mistym |
Ergh. digigame.info uses completely different html encoding for its different content types, looks like I'll have to special case a bunch of stuff manually. Oh well. |
18:23
🔗
|
winr4r |
mistym: wonderful! |
18:23
🔗
|
winr4r |
i love that so much <3 |
18:23
🔗
|
winr4r |
random content encoding <3 |
18:23
🔗
|
winr4r |
p.s. if i see another UnicodeDecodeError i will actually burn an orphanage |
18:23
🔗
|
mistym |
e.g. journal articles (at least that I've seen so far) use a bunch of unambiguously named divs. Whereas proceedings articles use tables with unnamed elements. |
18:24
🔗
|
mistym |
winr4r: Oh yeah, that's the other thing I love - random unicode fails. |
18:24
🔗
|
winr4r |
mistym: oh sorry i thought you meant content encoding |
18:24
🔗
|
mistym |
winr4r: I was ambiguous, my fault |
18:24
🔗
|
mistym |
My absolute FAVOURITE case is Excel for Mac. |
18:25
🔗
|
mistym |
It can export CSV that IT ITSELF cannot read because it uses some crazy text encoding. |
18:25
🔗
|
winr4r |
haha! |
18:25
🔗
|
winr4r |
that's beautiful |
18:26
🔗
|
mistym |
I thought I was doing something wrong when I couldn't figure out what encoding to give it in Ruby's CSV.parse. But no, Excel itself couldn't open the data it produced. Brilliant. |
18:27
🔗
|
chronomex |
not relevant: http://youtube.com/watch?v=7odAbL3Ygts |
18:27
🔗
|
winr4r |
mistym: it's an achievement of sorts |
18:27
🔗
|
SmileyG |
one way encoding \o/ |
18:40
🔗
|
shaqfu |
Well, that was encouraging |
18:41
🔗
|
shaqfu |
"We haven't seen names named, but the literature mentions companies working to provide DRM-free software for long-term preservation" |
18:41
🔗
|
winr4r |
i'll believe that when i see it |
18:41
🔗
|
winr4r |
i.e. never |
18:41
🔗
|
shaqfu |
winr4r: "encouraging" not "fucking awesome" |
18:42
🔗
|
winr4r |
shaqfu: hi, btw :) |
18:42
🔗
|
shaqfu |
winr4r: ohai o/ |
18:43
🔗
|
SmileyG |
http://www.flickr.com/photos/djsmiley2k/4548258767/ |
18:43
🔗
|
SmileyG |
:O |
18:43
🔗
|
SmileyG |
my cat |
18:43
🔗
|
SmileyG |
is like |
18:43
🔗
|
SmileyG |
his cat |
18:43
🔗
|
SmileyG |
:O |
18:43
🔗
|
SmileyG |
http://www.flickr.com/photos/djsmiley2k/4488022986/ |
18:44
🔗
|
shaqfu |
You store your soap on the roof?! |
18:44
🔗
|
mistym |
SmileyG: Aww, your cat's a cutey |
18:44
🔗
|
SmileyG |
i got 4 |
18:44
🔗
|
SmileyG |
:O |
18:44
🔗
|
SmileyG |
he talks |
18:44
🔗
|
SmileyG |
:D |
18:44
🔗
|
shaqfu |
Same litter? |
18:44
🔗
|
SmileyG |
or at least tries to. He thanks you if you open the door for him. |
18:44
🔗
|
SmileyG |
lol Is Jason from Norwich, UK? |
18:45
🔗
|
ersi |
getting creepy |
18:45
🔗
|
winr4r |
SmileyG: you're about six million miles out |
18:45
🔗
|
winr4r |
SmileyG: are you from norwich? |
18:46
🔗
|
SmileyG |
No, but the cat was :D |
18:46
🔗
|
winr4r |
SmileyG: oh! |
18:46
🔗
|
SmileyG |
It was originally my.... wifes brothers girlfriends grans |
18:46
🔗
|
SmileyG |
she couldn't look after it, she couldn't look after it, we lived with him and my wifes parents, and so he came with us |
18:46
🔗
|
winr4r |
oh my god, sockington has figured out self-replication |
18:46
🔗
|
SmileyG |
:D |
18:46
🔗
|
emijrp |
You talk some hours ago about the closing of YouTube. That closing has been happening since ages. Using a sample of 6500 videos about SpanishRevolution, 3.59% of them were deleted (or accounts closed) after 6 months. You can extrapolate to the YouTube age and the million videos are uploaded. |
18:46
🔗
|
* |
winr4r just saw the photo |
18:47
🔗
|
SmileyG |
winr4r: Its freaky aint it? |
18:47
🔗
|
SmileyG |
their face is slightly different |
18:47
🔗
|
winr4r |
SmileyG: it really is! |
18:47
🔗
|
SmileyG |
and apollo has smaller eyes |
18:47
🔗
|
SmileyG |
But the markings, wow. |
18:47
🔗
|
SmileyG |
Sorry, larger eyes, smaller iris |
18:47
🔗
|
winr4r |
SmileyG: i wondered about norwich, i'm from king's lynn |
18:47
🔗
|
SmileyG |
winr4r: :D |
18:47
🔗
|
winr4r |
norfolk best county in world |
18:47
🔗
|
SmileyG |
I have a old school friend who moved to king's lynn |
18:48
🔗
|
winr4r |
SmileyG: on purpose?! |
18:48
🔗
|
SmileyG |
his family moved when I was..... 13? |
18:48
🔗
|
SmileyG |
moved to hunstantington? |
18:48
🔗
|
winr4r |
ah |
18:48
🔗
|
SmileyG |
(I've spelt that wrong). |
18:48
🔗
|
winr4r |
hunstanton |
18:48
🔗
|
SmileyG |
Yah. |
18:48
🔗
|
shaqfu |
mistym: In other news, Archivematica is really damn cool |
18:48
🔗
|
SmileyG |
Wild cats there... heh |
18:48
🔗
|
winr4r |
hunstanton is nice, king's lynn is a massive shithole |
18:49
🔗
|
mistym |
shaqfu: Isn't it? Those guys are awesome. |
18:49
🔗
|
mistym |
shaqfu: They have an IRC channel over on Freenode, though it's not usually too busy. |
18:49
🔗
|
mistym |
Wait, no. Not Freenode, it was some other server. |
18:49
🔗
|
shaqfu |
mistym: I hadn't heard of it before this weekend, but it came up at nearly every talk this weekend |
18:49
🔗
|
winr4r |
SmileyG: fortunately i'm about 6 miles south of it |
18:49
🔗
|
SmileyG |
who will log all of irc :S |
18:50
🔗
|
mistym |
SmileyG: Who will bug every public space ;o |
18:50
🔗
|
shaqfu |
Anyway, AFK, lunch |
18:50
🔗
|
SmileyG |
winr4r: ah |
18:50
🔗
|
SmileyG |
I'm |
18:50
🔗
|
SmileyG |
I'm in coventry... |
18:50
🔗
|
SmileyG |
don't suppose you also heard the sonic booms? |
18:50
🔗
|
winr4r |
SmileyG: things will get better |
18:50
🔗
|
winr4r |
and nope! |
18:51
🔗
|
SmileyG |
I really quite like cov :d |
18:51
🔗
|
SmileyG |
D: |
18:52
🔗
|
* |
winr4r adores apollo |
18:52
🔗
|
mistym |
Sigh. Twitter is making me jealous. Not only are there tons of #marac tweets, but now Capy games are showing off the ridiculous 25-foot screen installation of Super TIME Force in LA. |
18:53
🔗
|
SmileyG |
winr4r: hehe |
18:53
🔗
|
SmileyG |
theres some pics on there of my other cats too |
18:54
🔗
|
SmileyG |
I don't think i've actually done a "cats" set hto ¬_¬ failure by me there |
18:54
🔗
|
SmileyG |
anyway, dads birthday meal tonight :/ |
18:54
🔗
|
SmileyG |
laters |
18:54
🔗
|
winr4r |
bye! |
19:05
🔗
|
shaqfu |
mistym: I'll refrain from posting beach pics, then |
19:06
🔗
|
mistym |
;o |
19:06
🔗
|
shaqfu |
Nothing like capping a conference with a beach trip |
19:08
🔗
|
winr4r |
OH GOD |
19:08
🔗
|
winr4r |
IS JASON GOING TO GO CLEAN SHAVED AGAIN |
19:08
🔗
|
winr4r |
https://twitter.com/#!/textfiles/status/191239290158710785/photo/1 |
19:09
🔗
|
winr4r |
*suspense* |
19:09
🔗
|
winr4r |
(talking of twitter) |
19:13
🔗
|
shaqfu |
It feels surreal seeing him hatless |
19:14
🔗
|
winr4r |
haha |
19:16
🔗
|
chronomex |
that will pass |
19:17
🔗
|
emijrp |
the hat is inside his hair, you will see it after the cut |
19:17
🔗
|
winr4r |
emijrp: hahaha |
19:17
🔗
|
shaqfu |
Rofl |
19:18
🔗
|
chronomex |
yes, I think jason will be bald in 1 hour |
19:18
🔗
|
winr4r |
no way |
19:19
🔗
|
winr4r |
i'm going with clean-shaven |
19:19
🔗
|
winr4r |
no beard |
19:19
🔗
|
chronomex |
no hair |
19:19
🔗
|
winr4r |
it was important enough to announce on twitter, so i'm guessing it's the beard |
19:26
🔗
|
winr4r |
oh, i was wrong! |
19:27
🔗
|
shaqfu |
Phew; balance of nature not disturbed |
19:29
🔗
|
chronomex |
I was right |
20:10
🔗
|
SketchCow |
Oh good |
20:10
🔗
|
SketchCow |
hairblogging |
20:15
🔗
|
SketchCow |
Looking good |
20:23
🔗
|
winr4r |
SketchCow: yes you do! :D |
20:24
🔗
|
DFJustin |
who wants some cp/m http://archive.org/download/cdrom-rlee-peters-cpm-archive/rlee_peters_cpm_archive.zip/ |
20:27
🔗
|
SketchCow |
Whoops, deleted two cds by mistake |
20:28
🔗
|
SketchCow |
Shiiiiiit happens |
20:28
🔗
|
SketchCow |
Ironically shoving it into archive.org |
20:28
🔗
|
SketchCow |
and I killed it |
20:28
🔗
|
SketchCow |
I make mistakes too! |
20:28
🔗
|
chronomex |
wat no |
20:28
🔗
|
SketchCow |
Pretty commercial CDs, no worries, they'l show again. |
20:30
🔗
|
SketchCow |
OK, all those CDs have Polish as the language now |
20:30
🔗
|
winr4r |
excellent! |
20:57
🔗
|
SketchCow |
I havent done bald recently |
21:03
🔗
|
winr4r |
you haven't done no-beard in a while either |
21:03
🔗
|
winr4r |
(you shouldn't, you're a whole lot less scary and Jason Scott without one) |
21:05
🔗
|
SketchCow |
ha ha |
21:05
🔗
|
SketchCow |
thanks for the fashion advice |
21:06
🔗
|
emijrp |
be careful, here are more gays than archivists |
21:08
🔗
|
winr4r |
i'm not gay! |
21:09
🔗
|
winr4r |
the whole public bathroom thing was a misunderstanding |
21:11
🔗
|
SketchCow |
Whoops, fucked up AGAIN |
21:11
🔗
|
SketchCow |
Where's my hug |
21:11
🔗
|
* |
winr4r hugs SketchCow! |
21:24
🔗
|
alard |
winr4r: There were about ten FortuneCity sites on your list that we didn't have, but I have now downloaded those too. |
21:25
🔗
|
winr4r |
alard: yay! |
21:28
🔗
|
winr4r |
http://members.fortunecity.com/aaronsmom/ |
21:29
🔗
|
winr4r |
:/ |
21:29
🔗
|
winr4r |
found that while flicking through screenshots earlier |
21:32
🔗
|
emijrp |
nice |
21:34
🔗
|
winr4r |
it's a tribute for someone, by people who loved them, done as well as they could in the late 90s |
21:37
🔗
|
emijrp |
curious, first image fail http://web.archive.org/web/20090203061353/http://members.fortunecity.com/aaronsmom/ |
21:38
🔗
|
DFJustin |
for a while fortunecity was doing referer blocking such that the wayback machine got their placeholder image for everything |
21:38
🔗
|
winr4r |
DFJustin: ah |
21:39
🔗
|
SketchCow |
aaronsmom has got it going on |
21:39
🔗
|
winr4r |
well among other things, that's where i hope the screenshot collection will be useful |
21:40
🔗
|
emijrp |
this guy http://awt.ancestry.com/cgi-bin/igm.cgi?op=GET&db=lockard-park&id=I52598&ti=5541 |
21:41
🔗
|
winr4r |
the downside: i had to disable javascript in the script i'm using, because their ads had a hilarious "slide up out of nowhere and cover up all the content" thing going on |
21:41
🔗
|
SketchCow |
Well fuck, I did it FUCKING AGAIN |
21:41
🔗
|
winr4r |
emijrp: yes |
21:41
🔗
|
SketchCow |
Well, of 80 CD-ROMs, I murdered 4 in their beds |
21:41
🔗
|
winr4r |
SketchCow: hey remember that thing you did three times? |
21:41
🔗
|
SketchCow |
Made a few choices with the scripting I shouldn't have. |
21:41
🔗
|
winr4r |
i think it'd be a good idea to not do that |
21:42
🔗
|
winr4r |
seriously though, what happened? |
21:42
🔗
|
SketchCow |
Pressing control-c during a zip-up makes it go "OK, stop running the zip, but keep running the script that calls it." |
21:42
🔗
|
SketchCow |
booooo |
21:43
🔗
|
winr4r |
oh shit :< |
21:43
🔗
|
SketchCow |
Again, I'm not too worried |
21:43
🔗
|
winr4r |
and "keep running" means "rm -rf"? |
21:43
🔗
|
SketchCow |
I can get these |
21:43
🔗
|
SketchCow |
Well keep running means rm that thing being zipped, yes |
21:43
🔗
|
SketchCow |
Normally I don't do that, got lazy, made mistake. |
21:43
🔗
|
winr4r |
ah |
21:43
🔗
|
winr4r |
so you didn't actually lose anything |
21:43
🔗
|
SketchCow |
Anyway, I'll just tell that guy we need to re-upload. |
21:43
🔗
|
SketchCow |
No, I definitely lost stuff that was at arm's reach |
21:43
🔗
|
SketchCow |
Dude must re-send |
21:44
🔗
|
winr4r |
bummer |
21:44
🔗
|
SketchCow |
It's OK, we have a billion of these things going |
21:46
🔗
|
winr4r |
somewhere, in my loft or piled under other piles of shit i have a couple of magazine cover CDs from the late 1990s |
21:46
🔗
|
SketchCow |
I just moved the shareware cd collection to the title bar of archive.org's software section. |
21:46
🔗
|
SketchCow |
It was time to do it. |
21:46
🔗
|
SketchCow |
Oh my god, I have so many cds, I am considering having someone come over or who is local to me to do it. |
21:46
🔗
|
winr4r |
from the late 1990s where it's like "hey you don't have the INTERNET but here is SOME OF IT" |
21:46
🔗
|
winr4r |
i need to get those to you some time |
21:47
🔗
|
winr4r |
SketchCow: i can imagine, i have a *few*, at most like 5, but i think it was an interesting time |
21:47
🔗
|
emijrp |
man, read this http://www.chron.com/CDA/archives/archive.mpl/1998_3052078/man-jailed-after-friend-shot.html |
21:48
🔗
|
winr4r |
emijrp: moral of the story: don't be friends with stupid people |
21:50
🔗
|
winr4r |
emijrp: WAIT HOLD ON AARON |
21:50
🔗
|
mistym |
Also: if a friend says "Hey, watch this!" and pulls out a gun, don't stick around. |
21:50
🔗
|
SketchCow |
Wow, one of the CD-ROMs has been downloaded 1,1442 |
21:50
🔗
|
SketchCow |
1,442 |
21:51
🔗
|
winr4r |
holy shit |
21:53
🔗
|
winr4r |
how do archive.org do backups anyway? |
21:53
🔗
|
winr4r |
i mean that is just an unbelievably huge amount of shit |
21:55
🔗
|
chronomex |
they duplicate off-site |
21:59
🔗
|
Coderjoe |
winr4r: they have the data on two nodes locally, and try to duplicate it off-site (like in alexandria) |
22:00
🔗
|
emijrp |
And the question is, have they lost data? |
22:00
🔗
|
Coderjoe |
i have to wonder if they regularly scrub items |
22:01
🔗
|
Coderjoe |
(verify hashes against those in the files.xml file) |
22:02
🔗
|
emijrp |
TOP SECRET. |
22:02
🔗
|
SketchCow |
They do things. |
22:03
🔗
|
Coderjoe |
oh boy. some of these items are crap like full anime episodes |
22:06
🔗
|
BlueMax |
I just imagine a giant-arse RAID array |
22:06
🔗
|
BlueMax |
dunno why |
22:06
🔗
|
Coderjoe |
ugh |
22:06
🔗
|
Coderjoe |
no |
22:09
🔗
|
BlueMax |
has this been shared here yet? http://www.masswerk.at/googleBBS/ |
22:11
🔗
|
winr4r |
"ERROR: Quota Exceeded. Please see http://code.google.com/apis/websearch" :< |
22:11
🔗
|
emijrp |
Typical BBS error. |
22:12
🔗
|
BlueMax |
lol |
22:21
🔗
|
SketchCow |
Poor google |
22:21
🔗
|
SketchCow |
Getting DDOS |
22:21
🔗
|
SketchCow |
Coderjoe: You realize a lot of this is likely to go dark. |
22:25
🔗
|
SketchCow |
OK, CDs done |
22:25
🔗
|
SketchCow |
Going out to see the Comic-Con Documentary... with Morgan Spurlock presenting! And Q&A. |
22:25
🔗
|
SketchCow |
http://archive.org/search.php?query=collection%3Achip-cds&sort=-publicdate |
22:26
🔗
|
Coderjoe |
SketchCow: yes. unfortunately |
23:30
🔗
|
Wyatt|Wor |
This is just morbid curiosity at this point, but that grep is still going. |