[14:46] So, it's time to nail some of this problem down. alard, if you're around.
[14:58] SketchCow: What's the problem?
[15:23] Rsync issues.
[15:24] I'm seeing a lot fail, would like to determine what it might be.
[15:33] SketchCow: Hi.
[15:45] SketchCow: What do you mean by rsync issues?
[15:47] or, rather, what's their failure mode?
[15:49] There's at least one problem I can think of: a combination of the rsync speed-limit option (closes the connection after 15 minutes at less than 1B/s) and the upload redirector (you're assigned a random upload server, which doesn't work too well with partial uploads).
[16:31] Yes
[16:31] The failing is happening constantly.
[16:31] Not sure what to do here.
[16:31] I can do more uploading
[16:31] more slots
[16:33] May 20 16:33:05 teamarchive0.fnf.archive.org rsyncd[10162]: rsync error: error in rsync protocol data stream (code 12) at io.c(605) [Receiver=3.0.9]
[16:33] May 20 16:33:05 teamarchive0.fnf.archive.org rsyncd[10166]: rsync: connection unexpectedly closed (24 bytes received so far) [Receiver]
[16:33] May 20 16:33:05 teamarchive0.fnf.archive.org rsyncd[10166]: rsync error: error in rsync protocol data stream (code 12) at io.c(605) [Receiver=3.0.9]
[16:33] May 20 16:33:05 teamarchive0.fnf.archive.org rsyncd[10161]: rsync: connection unexpectedly closed (24 bytes received so far) [Receiver]
[16:33] May 20 16:33:05 teamarchive0.fnf.archive.org rsyncd[10161]: rsync error: error in rsync protocol data stream (code 12) at io.c(605) [Receiver=3.0.9]
[16:33] A lot of those
[16:39] Hmm, I'm not getting any of those on my three upload servers
[16:39] Drive's not full, you don't have any weird firewall rules?
[17:04] top - 17:03:04 up 3 days, 17:19, 2 users, load average: 599.88, 577.53, 525.71
[17:04] he heeeeee
[17:06] lol
[17:07] Tasks: 1379 total, 1 running, 1378 sleeping, 0 stopped, 0 zombie
[17:09] It's weird your load is so high
[17:09] but I guess the server is doing a lot of other things at the same time?
[17:10] (weird because the bt-download box, which is handling >400 rsyncs atm, has a load of ~6-10)
[17:10] Let's check the mrtg
[17:11] Won't push out
[17:11] Weird, it's doing barely anything network-wise
[17:11] Fair bit of disk IO though
[17:11] disk IO can cause massive load due to IOWait
[17:12] basically every cycle that your thread is waiting on another thread's IO causes load+1
[17:12] So 200 threads, all waiting for 1 thread's IO = +200 load
[17:12] Maybe that's part of it, then
[17:12] took me a long time to understand that
[17:12] As most people think it's just CPU.
[17:15] Here's the thing.
[17:15] So, I can keep the links small
[17:15] But we basically are at the behest of slow uploads
[17:19] SketchCow: At the moment, no new people are getting redirected to fos, fyi
[17:21] Really.
[17:21] You snagged all that?
[17:21] Works for me, FOS is only so powerful
[17:26] mhm, I brought up 3 machines that were idle due to waiting on some other project stuff to go through and get the go-ahead
[17:27] and they seem to be handling it okay
[17:27] They aren't doing anything else like statusboard, et al. so that probably really helps
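
Note on the IOWait point above: on Linux, the load average counts tasks that are runnable (state R) plus tasks in uninterruptible sleep (state D, usually waiting on disk), which is why a box with an idle network and busy disks can show a load in the hundreds. The sketch below is one rough way to check that on a Linux host by reading /proc. The function names are made up for illustration, the sample loadavg string used in the test is assembled from the top output quoted in the log rather than captured from the server, and the script counts processes rather than individual threads, so it can undercount.

```python
import glob
from collections import Counter


def parse_loadavg(text):
    """Parse /proc/loadavg contents into (1min, 5min, 15min, running, total)."""
    fields = text.split()
    one, five, fifteen = (float(x) for x in fields[:3])
    running, total = (int(x) for x in fields[3].split("/"))
    return one, five, fifteen, running, total


def task_state(stat_line):
    """Return the one-letter state from a /proc/<pid>/stat line.

    The command name sits in parentheses and may itself contain spaces or
    parentheses, so the state is the first field after the last ')'.
    """
    return stat_line[stat_line.rfind(")") + 1:].split()[0]


def count_states(stat_lines):
    """Count tasks per state letter, skipping empty reads."""
    return Counter(task_state(line) for line in stat_lines if line.strip())


def load_contributors(states):
    """Tasks that count toward Linux load: runnable (R) + uninterruptible (D)."""
    return states["R"] + states["D"]


def read_stat_lines():
    """Read /proc/<pid>/stat for every process (processes only, not threads)."""
    lines = []
    for path in glob.glob("/proc/[0-9]*/stat"):
        try:
            with open(path, errors="replace") as f:
                lines.append(f.read())
        except OSError:
            continue  # process exited between listing and reading
    return lines


if __name__ == "__main__":
    try:
        with open("/proc/loadavg") as f:
            print("loadavg:", parse_loadavg(f.read()))
    except OSError:
        print("no /proc/loadavg here (not a Linux host?)")
    states = count_states(read_stat_lines())
    print("states:", dict(states))
    print("R + D tasks right now:", load_contributors(states))
```

If the D count is in the same range as the load average while R stays small, the load is coming from tasks stuck behind disk IO rather than from CPU, which would match the "200 threads waiting on 1 thread's IO" description.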