Time |
Nickname |
Message |
00:04
🔗
|
underscor |
This wget has been hanging for +12 hours now |
00:04
🔗
|
underscor |
and the last few lines in the log are |
00:04
🔗
|
underscor |
2011-11-09 08:24:47 URL:http://web.me.com/bobfarmer/20110726Web%20Cards/ps01/ps01_446.htm [2326/2326] -> "data/b/bo/bob/bobfarmer/web.me.com/files/web.me.com/bobfarmer/20110726Web Cards/ps01/ps01_446.htm" [1] |
00:04
🔗
|
underscor |
2011-11-09 12:54:27 ERROR 404: Not Found. |
00:04
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps20/: |
00:04
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps20/feed.xml: |
00:04
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/: |
00:04
🔗
|
underscor |
2011-11-11 12:10:48 ERROR 404: Not Found. |
00:04
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/feed.xml: |
00:04
🔗
|
underscor |
2011-11-11 12:10:49 ERROR 404: Not Found. |
00:04
🔗
|
underscor |
It is now 00:04 server time |
00:04
🔗
|
underscor |
so it's been nearly 12 hours since the last update |
00:04
🔗
|
underscor |
alard: Do you think it's dead or something? |
00:13
🔗
|
alard |
underscor: Maybe, maybe it's trying to download a very big file. Have you looked at the url list? |
00:16
🔗
|
underscor |
alard: There'd be an entry for every file, right? |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21 |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/feed.xml |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/ps21_001.htm |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/ps21_002.htm |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/ps21_003.htm |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/ps21_004.htm |
00:16
🔗
|
underscor |
http://web.me.com/bobfarmer/20110726Web%20Cards/ps21/ps21_005.htm |
00:17
🔗
|
underscor |
None of those are particularly large |
00:17
🔗
|
alard |
No, so it's probably hanging. |
00:17
🔗
|
underscor |
k |
00:17
🔗
|
underscor |
ctrl-c, dld-single? |
00:17
🔗
|
alard |
Yes. |
00:18
🔗
|
underscor |
k :) |
00:26
🔗
|
DFJustin |
I can't read chinese no |
01:45
🔗
|
DrFaustus |
hola |
01:46
🔗
|
DrFaustus |
there is an old friend of mine that has been running a mailing list for a long time. I'm not sure how much longer he's going to do it. |
01:47
🔗
|
DrFaustus |
He just got the server back up, but it's pretty old |
01:47
🔗
|
DrFaustus |
http://www.team.net/archive/ |
01:47
🔗
|
DrFaustus |
he may have even older email archives from before the "new" mailer |
01:48
🔗
|
DrFaustus |
mjb@autox.team.net |
01:48
🔗
|
DrFaustus |
can anyone help grab that stuff before mark peters out? |
01:53
🔗
|
db48x |
DrFaustus: very probably |
01:53
🔗
|
db48x |
I believe someone here already has a mailman archiver |
01:55
🔗
|
db48x |
ah, it's even set up to let us download mbox files |
01:55
🔗
|
DrFaustus |
indeed |
01:56
🔗
|
DrFaustus |
you'd have to contact mark to see if he still has even older archive files around |
01:56
🔗
|
DrFaustus |
he's an old fart of a unix admin from university of utah |
01:56
🔗
|
DrFaustus |
so it'd likely be in an easy format |
01:57
🔗
|
db48x |
well, mbox is never as easy as it should be |
01:57
🔗
|
db48x |
but it's easy enough :) |
01:58
🔗
|
DrFaustus |
fair enough |
01:59
🔗
|
DrFaustus |
those lists hold about fifteen years of technical discussions on just about every classic british sports car made |
02:31
🔗
|
DrFaustus |
thanks guys |
03:47
🔗
|
yipdw |
whoa, when did the splinder grab go from 990,000 users to 1.3 million? |
03:47
🔗
|
yipdw |
oh, splinder.us |
03:47
🔗
|
yipdw |
I see |
04:05
🔗
|
db48x |
where are the splinder scripts? github? |
04:41
🔗
|
yipdw |
db48x: https://github.com/ArchiveTeam/splinder-grab |
05:52
🔗
|
underscor |
Wow, the splinder todo went way up! |
05:53
🔗
|
underscor |
Luckily, splinder looks to only be about a terabyte and a half |
06:53
🔗
|
yipdw |
heh |
06:53
🔗
|
yipdw |
http://memac.heroku.com/ is reporting 666 MB/user |
06:53
🔗
|
yipdw |
I always knew Mac users were satanic |
06:54
🔗
|
underscor |
hahaha |
09:55
🔗
|
chronomex |
damn. wget-warc is eating all my ram. |
10:10
🔗
|
chronomex |
shit, that fucker ran for a day and a half before it got OOM killed |
11:55
🔗
|
db48x |
alard: I notice that when I visit memac.heroku.com, it's getting log messages about splinder :) |
11:56
🔗
|
alard |
db48x: It's the same source. (But it's not showing them, I hope?) |
12:11
🔗
|
Wyatt|Wor |
So I just realised, I have a sizable ball of google groups to upload still. |
12:11
🔗
|
Wyatt|Wor |
Also a few chunks of berlios |
12:25
🔗
|
Wyatt|Wor |
I'm off to a Doc appointment now, but feel free to /msg me a place to rsync to; I'll get to it tonight. Sorry to be so late about this. |
13:12
🔗
|
db48x |
alard: yea, it's not showing them |
13:13
🔗
|
db48x |
alard: you should set up separate streams for them |
13:13
🔗
|
alard |
Why? It works? |
13:15
🔗
|
db48x |
it's just extra work every time a message comes in |
13:16
🔗
|
db48x |
anyway, the real reason I'm looking is that the splinder tracker kills my browser when it's open |
13:29
🔗
|
db48x |
hmm. updating the chart is super expensive |
13:31
🔗
|
alard |
Yes, I'm looking at a way to have fewer points in the graph, that should help somewhat. |
13:35
🔗
|
db48x |
it could just update less frequently |
14:21
🔗
|
alard |
db48x: Should be faster now. |
14:29
🔗
|
db48x |
that it is |
15:24
🔗
|
db48x |
alard: are you also alart? :) |
15:46
🔗
|
alard |
Yes, typo. :) |
15:46
🔗
|
closure |
Wyatt|Wor: you need to ask SketchCow for an rsync on batcave |
19:03
🔗
|
Schbirid |
i think i will start logging http://store.steampowered.com/stats/ |
19:03
🔗
|
Schbirid |
might be interesting to make a 365 day graph |
19:10
🔗
|
dnova |
Recipient: Bovine Ignition Systems |
19:10
🔗
|
dnova |
Amount: $100.00 |
19:10
🔗
|
dnova |
lol |
19:30
🔗
|
underscor |
haha |
20:19
🔗
|
yipdw |
huh, this is kind of weird |
20:19
🔗
|
yipdw |
https://gist.github.com/7723903aa5ff2c0fbeb3 |
20:20
🔗
|
Paradoks |
I got an error on the malacarne profile, when it attempted to download the blog from porno.splinder.com. |
20:21
🔗
|
yipdw |
oh, the docum profile has been made unavailable |
20:21
🔗
|
yipdw |
ok |
20:32
🔗
|
underscor |
Paradoks: haha, nice name |
21:04
🔗
|
yipdw |
so it looks like we're downloading about 21.8k Splinder users/day |
21:04
🔗
|
yipdw |
somehow, that doesn't seem fast enough |
21:05
🔗
|
yipdw |
to the Amazon |
21:05
🔗
|
yipdw |
'course, that's moot if we're maxing their pipe already :P |
21:06
🔗
|
underscor |
when do we have til again? |
21:11
🔗
|
db48x |
it was 14 days, right? |
21:11
🔗
|
db48x |
that's only 300k |
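The estimate above can be checked with a small shell sketch. It assumes the two rough figures quoted in the chat (about 21,800 users/day and 14 days left); the function name `projected_users` is made up for illustration.

```shell
# Project how many users a fixed daily rate covers before a deadline.
# Both inputs are the rough figures quoted in the chat, not measured values.
projected_users() {
  local users_per_day="$1"
  local days_left="$2"
  echo $(( users_per_day * days_left ))
}

projected_users 21800 14   # prints 305200
```

At that rate the grab covers roughly 300k users in the remaining time, which matches the figure given in the channel.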
21:12
🔗
|
underscor |
uh oh |
21:12
🔗
|
Schbirid |
how much data/bandwidth is it roughly if i joined as downloader? |
21:12
🔗
|
db48x |
Schbirid: tiny |
21:12
🔗
|
underscor |
Yeah |
21:12
🔗
|
underscor |
Like 0.8 mb/user |
21:13
🔗
|
db48x |
http://splinder.heroku.com/ |
21:13
🔗
|
underscor |
db48x: Download moar! |
21:13
🔗
|
db48x |
I'm limited by iops |
21:13
🔗
|
db48x |
I guess I can leave off sorting poetry for a while |
21:13
🔗
|
underscor |
aww :( |
21:13
🔗
|
Schbirid |
:D |
21:14
🔗
|
underscor |
I'm pulling 4mbps right now |
21:15
🔗
|
* |
db48x cancels three other tasks |
21:15
🔗
|
Schbirid |
ok, how do i join in? |
21:16
🔗
|
db48x |
pull from the git repository |
21:16
🔗
|
underscor |
I'm running 96 clients right now |
21:16
🔗
|
underscor |
<3 |
21:17
🔗
|
db48x |
Schbirid: https://github.com/ArchiveTeam/splinder-grab |
21:17
🔗
|
underscor |
18-25% iowait, so that's probably just about perfectly balanced |
21:18
🔗
|
underscor |
RX bytes:10762759777147 (10.7 TB) TX bytes:12421364281615 (12.4 TB) |
21:20
🔗
|
Schbirid |
db48x: done! how interruptable is it? |
21:20
🔗
|
Schbirid |
i switch off my pc at night |
21:20
🔗
|
db48x |
touch STOP and it'll stop cleanly |
21:21
🔗
|
Schbirid |
nice |
21:21
🔗
|
underscor |
Schbirid: BLASPHEMY |
21:21
🔗
|
underscor |
NO SHUTTING OFF IN HERE |
21:21
🔗
|
underscor |
21:21:52 up 18 days, 14:46, 1 user, |
21:21
🔗
|
underscor |
16:22:03 up 7 days, 23:35, 1 user, |
21:21
🔗
|
underscor |
13:22:22 up 26 days, 14:59, 2 users |
21:22
🔗
|
Schbirid |
:) |
21:22
🔗
|
underscor |
15:24:01 up 18 days, 2:17, 2 users, |
21:22
🔗
|
yipdw |
21:22:55 up 41 days, 14:43, 1 user, load average: 0.32, 0.13, 0.25 |
21:22
🔗
|
yipdw |
me@avatar:~$ uptime |
21:22
🔗
|
underscor |
yipdw: :( |
21:22
🔗
|
Schbirid |
ouch, seems to want python2 or something? |
21:23
🔗
|
Schbirid |
db48x: http://pastebin.com/vV9fu51i |
21:23
🔗
|
yipdw |
of course, what that really means is "41 days since last kernel upgrade" |
21:23
🔗
|
underscor |
haha |
21:23
🔗
|
yipdw |
since who the hell uses ksplice etc |
21:23
🔗
|
underscor |
ofc |
21:23
🔗
|
Schbirid |
my python is 3.2.2 by default, 2 would be python2 |
21:23
🔗
|
underscor |
Yeah, it uses 2.[5-7].x, iirc |
21:24
🔗
|
underscor |
uses/needs |
21:24
🔗
|
Schbirid |
do i just add |
21:25
🔗
|
Schbirid |
#!/usr/bin/python2 |
21:25
🔗
|
Schbirid |
to the soup py file? |
21:25
🔗
|
underscor |
Oh, I see, you have it installed already |
21:25
🔗
|
underscor |
Yeah, change it to wherever python 2.x lives |
21:25
🔗
|
yipdw |
Schbirid: substitute python2 for python at dld-profile.sh:88 |
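Instead of hard-coding the interpreter name, a script could probe for it. This is only a sketch: the helper name `pick_interpreter` and the candidate list are assumptions, and nothing in the log says the splinder-grab scripts do this.

```shell
# Print the first interpreter name from the arguments that exists on PATH.
# Returns non-zero if none of the candidates is found.
pick_interpreter() {
  local candidate
  for candidate in "$@"; do
    if command -v "$candidate" >/dev/null 2>&1; then
      echo "$candidate"
      return 0
    fi
  done
  return 1
}

# Example: prefer an explicit Python 2 binary, fall back to plain "python".
# PYTHON2=$(pick_interpreter python2 python2.7 python) || echo "no Python found" >&2
```

On a system where `python` is Python 3, the fallback to plain `python` would still pick the wrong version, so checking the reported version may also be needed.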
21:26
🔗
|
underscor |
0 1:24PM:abuie@teamarchive-0:/2/TBAG/mobileme-grab 3944 Ï du -sh data |
21:26
🔗
|
underscor |
1.3T data |
21:26
🔗
|
Schbirid |
totally missed that, cheers |
21:26
🔗
|
yipdw |
ha |
21:26
🔗
|
Schbirid |
underscor: good for you, my python is bigger though |
21:26
🔗
|
underscor |
lol |
21:26
🔗
|
underscor |
Just a *little* mobileme data |
21:27
🔗
|
Schbirid |
mobileme is a name i never heard anyone call IT |
21:27
🔗
|
Schbirid |
okok, i will stop ;D |
21:27
🔗
|
underscor |
:P |
21:27
🔗
|
yipdw |
first time I've ever used the EU West EC2 region |
21:27
🔗
|
underscor |
yipdw: Work well? |
21:27
🔗
|
yipdw |
dunno yet |
21:27
🔗
|
yipdw |
we'll see |
21:28
🔗
|
yipdw |
I wonder if a micro will be good enough |
21:28
🔗
|
yipdw |
yeah, probably |
21:29
🔗
|
Schbirid |
working well now, thanks |
21:29
🔗
|
underscor |
102 hour tar? |
21:29
🔗
|
underscor |
:(:(:(:(:(:(:(:(:(:(:(:(:(:(:( |
21:32
🔗
|
yipdw |
underscor: what do you use to manage downloader instances? GNU parallel or something? |
21:32
🔗
|
yipdw |
I figure if I'm going to get raped by Amazon EC2, I might as well deserve it |
21:33
🔗
|
underscor |
yipdw: tmux panes |
21:33
🔗
|
underscor |
Lemme take a screenshot |
21:33
🔗
|
yipdw |
oh |
21:34
🔗
|
yipdw |
hmm |
21:34
🔗
|
yipdw |
https://gist.github.com/3018d5389a62de4d2caa |
21:34
🔗
|
yipdw |
could be worse, I guess |
21:34
🔗
|
underscor |
http://i.imgur.com/MpNcW.png |
21:35
🔗
|
yipdw |
yikes |
21:36
🔗
|
underscor |
:D |
21:36
🔗
|
underscor |
I like that I can still keep an eye on them though |
21:36
🔗
|
yipdw |
I guess |
21:36
🔗
|
yipdw |
I'm not likely to invest that much effort though :P |
21:36
🔗
|
yipdw |
hmm |
21:36
🔗
|
yipdw |
I guess I could have monit monitor them for me |
21:36
🔗
|
underscor |
haha |
21:37
🔗
|
yipdw |
and periodically run dld-single on failed ones |
21:37
🔗
|
yipdw |
ABSTRACTION SOLVES LAZINESS |
21:38
🔗
|
underscor |
Is monit good for this? |
21:38
🔗
|
yipdw |
it's overkill |
21:38
🔗
|
yipdw |
IMO |
21:38
🔗
|
underscor |
I've never used it, but heard of it before |
21:39
🔗
|
yipdw |
I just want something to automatically restart clients that stop due to errors |
21:39
🔗
|
underscor |
oh |
21:39
🔗
|
yipdw |
but a loop in bash does that just as well |
21:39
🔗
|
underscor |
:P |
21:39
🔗
|
underscor |
while true; do ./dld-client.sh yipdw; done |
21:39
🔗
|
underscor |
Yeah |
21:39
🔗
|
underscor |
hahah |
21:39
🔗
|
yipdw |
yeah, more or less |
21:39
🔗
|
yipdw |
it'll just screw up badly when we're done |
21:39
🔗
|
yipdw |
or, more precisely, when the tracker has nothing left |
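A slightly more careful version of the restart loop discussed above might look like the sketch below. It assumes the client is `./dld-client.sh` taking a nickname as its only argument and that a `STOP` file in the current directory means "shut down", as mentioned elsewhere in the channel. The function name `run_client_loop`, the pause length, and how the client behaves once the tracker is empty are assumptions.

```shell
#!/bin/bash
# Restart a downloader whenever it exits, until a STOP file appears.
# Assumptions (not confirmed by the log): the client script path, its
# single nickname argument, and STOP living in the current directory.

run_client_loop() {
  local nick="$1"
  local client="${2:-./dld-client.sh}"
  local pause="${3:-30}"   # seconds to wait between restarts

  while [ ! -e STOP ]; do
    "$client" "$nick" || echo "client exited with status $?" >&2
    # Pause so that a client exiting immediately (for example when the
    # tracker has nothing left) does not spin in a tight loop.
    [ -e STOP ] || sleep "$pause"
  done
}

# Usage: run_client_loop yipdw
```

The pause only limits how hard the loop hammers the tracker once work runs out; it still needs someone to `touch STOP` to end it.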
21:42
🔗
|
underscor |
yeah |
21:42
🔗
|
underscor |
but hopefully you'll be around when we get closeish |
21:43
🔗
|
underscor |
:D |
21:43
🔗
|
Schbirid |
that heroku page totally needs a users/timeunit per participant :) |
21:43
🔗
|
yipdw |
yeah |
21:43
🔗
|
* |
underscor winds |
21:43
🔗
|
underscor |
wins* |
21:43
🔗
|
underscor |
hahah |
21:45
🔗
|
yipdw |
"Quadruple Extra Large Hi-Memory On-Demand Instance" |
21:45
🔗
|
yipdw |
jeez |
21:45
🔗
|
underscor |
ha |
21:45
🔗
|
yipdw |
just call it "Super Size Bigass Instance With Extra Fries" |
21:45
🔗
|
yipdw |
"Now With More Molecules" |
21:45
🔗
|
underscor |
hahahaha, the akamai edge servers I'm downloading from are 2 hops away |
21:45
🔗
|
underscor |
It's basically "peering point"-> |
21:46
🔗
|
yipdw |
Splinder uses Akamai? |
21:46
🔗
|
underscor |
"akamai's router" |
21:46
🔗
|
underscor |
No, mobile-me does |
21:46
🔗
|
yipdw |
oh |
21:46
🔗
|
yipdw |
I was like "damn, I've been hitting the wrong thing" |
21:46
🔗
|
underscor |
haha |
21:47
🔗
|
underscor |
http://tracker.archive.org/tracker.png |
21:47
🔗
|
underscor |
You can see where I stopped splinder overnight, haha |
21:47
🔗
|
underscor |
And then just started up mobile me |
21:47
🔗
|
yipdw |
good job sir |
21:47
🔗
|
yipdw |
wow, 40 MB of Splinder data for henrymusica |
21:47
🔗
|
underscor |
http://tracker.archive.org/batcave.png |
21:47
🔗
|
yipdw |
that's the biggest I've seen yet |
21:47
🔗
|
underscor |
Mobileme's data goes straight to batcave |
21:48
🔗
|
underscor |
Nice little peak where it's started |
21:48
🔗
|
underscor |
Schbirid: I DON'T SEE YOU ON THE TRACKER YET.........;.......... |
21:49
🔗
|
Schbirid |
i see me and i am just passing db48x |
21:49
🔗
|
underscor |
Oh, are you spirit? |
21:49
🔗
|
Schbirid |
haha |
21:49
🔗
|
Schbirid |
yes |
21:49
🔗
|
underscor |
oh |
21:49
🔗
|
underscor |
grr :P |
21:49
🔗
|
Schbirid |
=( |
21:49
🔗
|
underscor |
Use your irc nick! |
21:49
🔗
|
underscor |
hehe |
21:50
🔗
|
Schbirid |
only germans get it :\ |
21:50
🔗
|
Schbirid |
no idea why |
21:50
🔗
|
underscor |
Google translate says nothing |
21:51
🔗
|
yipdw |
that is a more profound statement than you know |
21:51
🔗
|
underscor |
:D |
21:51
🔗
|
Schbirid |
its just spirit pronounshed like shad |
21:51
🔗
|
underscor |
I like how the "Users downloaded" line on splinder pretty much follows my line at the beginning |
21:52
🔗
|
underscor |
A global "users/hour" counter would be nice |
21:52
🔗
|
* |
underscor loves making all these feature requests for alard_ |
21:54
🔗
|
underscor |
!!!!!! |
21:54
🔗
|
underscor |
I officially have the highest bandwidth-used port at IA |
21:56
🔗
|
db48x |
heh |
21:57
🔗
|
Schbirid |
for splinder, how many parallel instances should i run? bandwidth is tiny but maybe saturation is elsewhere? |
21:58
🔗
|
underscor |
Saturation is disk io |
21:59
🔗
|
underscor |
Run ~10 and see if your iowait shoots up |
22:15
🔗
|
yipdw |
hmm |
22:15
🔗
|
yipdw |
[ec2-user@ip-10-227-178-174 it]$ sudo iostat |
22:15
🔗
|
yipdw |
Linux 2.6.35.14-97.44.amzn1.x86_64 (ip-10-227-178-174) 11/12/2011 _x86_64_ (1 CPU) |
22:15
🔗
|
yipdw |
avg-cpu: %user %nice %system %iowait %steal %idle |
22:15
🔗
|
yipdw |
2.63 0.00 2.91 2.19 19.40 72.87 |
22:16
🔗
|
yipdw |
that's with 6 dld-clients on a t1.micro |
22:16
🔗
|
yipdw |
I guess I can double that |
22:19
🔗
|
alard_ |
underscor: http://splinder.heroku.com/ |
22:20
🔗
|
underscor |
alard_: I love you |
22:20
🔗
|
underscor |
Remind me to buy you a beer when I turn 21 |
22:20
🔗
|
chronomex |
underscor: I think you can do alcohol mail-order, the only person who needs to be over 21 is the recipient iiuc |
22:21
🔗
|
underscor |
haha |
22:21
🔗
|
yipdw |
wow, we're only pulling 500 kB/s? |
22:21
🔗
|
underscor |
I don't know how well international alcohol mail-order would go over |
22:21
🔗
|
chronomex |
yipdw: it uses a linear interpolation of reported data |
22:22
🔗
|
yipdw |
ahh, ok |
22:22
🔗
|
chronomex |
f.e. I've been downloading this one user for 4 days |
22:22
🔗
|
yipdw |
so I guess down clients will |
22:22
🔗
|
yipdw |
yeah, and that |
22:22
🔗
|
yipdw |
jeez |
22:22
🔗
|
alard |
yipdw: And I'm not even sure it's completely correct, so it may be helpful to check the numbers. |
22:22
🔗
|
yipdw |
how big is the WARC for that user? |
22:22
🔗
|
chronomex |
yipdw: huge. wget-warc died the first time around thanks to my OOM killer. |
22:23
🔗
|
yipdw |
damn |
22:23
🔗
|
chronomex |
18G and growing |
22:23
🔗
|
yipdw |
one of splinder's top users |
22:23
🔗
|
chronomex |
web.me.com is "at least 23879 files" |
22:23
🔗
|
yipdw |
oh |
22:23
🔗
|
yipdw |
mobileme |
22:23
🔗
|
chronomex |
yeah, mobileme |
22:23
🔗
|
yipdw |
I was looking at the splinder dash |
22:24
🔗
|
* |
chronomex not doing splinder |
22:27
🔗
|
yipdw |
bwahaha |
22:27
🔗
|
yipdw |
underscor's monopoly on the splinder board is broken |
22:28
🔗
|
yipdw |
well, was |
22:28
🔗
|
underscor |
What happened? |
22:28
🔗
|
yipdw |
a bunch of other download clients finished |
22:28
🔗
|
underscor |
oic |
22:30
🔗
|
chronomex |
heh |
23:06
🔗
|
yipdw |
heh, oops |
23:06
🔗
|
yipdw |
just realized this about the micro EC2 instance I was running for splinder: |
23:06
🔗
|
yipdw |
Mem: 611252k total, 537012k used, 74240k free, 27696k buffers |
23:06
🔗
|
yipdw |
Swap: 0k total, 0k used, 0k free, 423180k cached |
23:35
🔗
|
alard |
Good news for anyone not underscor: you can click a name to hide that downloader from the graph, so you can see yourself a little better. http://splinder.heroku.com/ |
23:35
🔗
|
Wyatt|Wor |
Oh yeah, I haven't offered my congratulations yet. alard, great work on getting a patch accepted to wget! |
23:35
🔗
|
alard |
Thanks! |
23:35
🔗
|
underscor |
alard: That's awesome |
23:35
🔗
|
underscor |
Feels good to remove everyone else |
23:35
🔗
|
underscor |
;D |
23:36
🔗
|
alard |
underscor: I thought you already did? |
23:36
🔗
|
underscor |
huh? |
23:36
🔗
|
Wyatt|Wor |
Wait, what are we graphing here? Is there some large-scale fetch task I missed in the hurlyburly of moving? |
23:37
🔗
|
underscor |
's funny how the graph changes when I remove myself |
23:37
🔗
|
underscor |
Wyatt|Wor: splinder.com is shutting down in like 13 days |
23:42
🔗
|
Wyatt|Wor |
Oh, okay. The wiki page hasn't been updated. I take it I have to make an account first, then point these github scripts at my account and let it run? |
23:44
🔗
|
underscor |
Don't need an account |
23:44
🔗
|
underscor |
Clone the repo, ./get-wget-warc.sh, ./dld-client.sh Wyatt |
23:44
🔗
|
underscor |
(run a few of the clients if your io can take it) |
23:48
🔗
|
Wyatt|Wor |
Understood; I'll get on that then. |
23:51
🔗
|
alard |
underscor: Maybe it's time for some ops? |
23:52
🔗
|
underscor |
:) |
23:55
🔗
|
ndurner1 |
is there a reason why I am not a member of github.com/archiveteam anymore? |