#internetarchive.bak 2016-11-28,Mon

↑back Search

Time Nickname Message
01:31 🔗 Start has quit IRC (west.us.hub irc.mzima.net)
01:39 🔗 Start has joined #internetarchive.bak
02:50 🔗 yipdw Shard 13, the lesser-known sequel to District 13
02:52 🔗 VADemon has quit IRC (Read error: Operation timed out)
04:15 🔗 db48x Kaz: I'd be happy to show you how to create a shard
04:16 🔗 db48x also to try to figure out why your iabak isn't downloading any more stuff
04:16 🔗 db48x do you have a NOMORE file lying around?
04:16 🔗 db48x that status page is great
04:27 🔗 kyan has quit IRC (Quit: Leaving)
06:03 🔗 db48x Kaz: I love how simple the html for your status page is :)
06:06 🔗 db48x Kaz: but you should check in a script to create it
06:18 🔗 Start has quit IRC (Remote host closed the connection)
06:22 🔗 Start has joined #internetarchive.bak
06:54 🔗 Start has quit IRC (Quit: Disconnected.)
07:00 🔗 Kaz db48x: there's a script to create it, but I'm trying to work out a way to have it work out active shards automatically, and keep them in order
07:01 🔗 Kaz as for downloads, I don't have a NOMORE file, it just runs through each of the active shards and then eventually says "we've run out of things for you to download" etc, don't have a log file to hand atm
09:23 🔗 sevs has joined #internetarchive.bak
10:47 🔗 Jon trying to download my second shard is not going well. I need to re-arrange my partitions it seems
11:38 🔗 iabak-reg 03registrar 05master 60d8023 06other 10SHARD5/pubkeys registration of milenko on SHARD5
12:44 🔗 iabak-reg 03registrar 05master 8ceaa41 06other 10SHARD18/pubkeys registration of fusl on SHARD18
13:30 🔗 kurt has joined #internetarchive.bak
13:40 🔗 iabak-reg 03registrar 05master d84e93b 06other 10SHARD11/pubkeys registration of fusl on SHARD11
13:47 🔗 iabak-reg 03registrar 05master c55efc5 06other 10SHARD6/pubkeys registration of Kaz on SHARD6
13:54 🔗 iabak-reg 03registrar 05master 46ab4da 06other 10SHARD11/pubkeys registration of fusl on SHARD11
14:03 🔗 VADemon has joined #internetarchive.bak
14:17 🔗 VADemon has quit IRC (Read error: Operation timed out)
14:30 🔗 sep332_ has joined #internetarchive.bak
15:02 🔗 Start has joined #internetarchive.bak
15:21 🔗 iabak-reg 03registrar 05master 60daf86 06other 10SHARD18/pubkeys registration of fusl on SHARD18
15:49 🔗 Start has quit IRC (Quit: Disconnected.)
15:50 🔗 iabak-reg 03registrar 05master 2396904 06other 10SHARD17/pubkeys registration of fusl on SHARD17
15:51 🔗 atomotic has joined #internetarchive.bak
16:02 🔗 Jon what I've probably done wrong is, I put 1T at ~iabak, and ran that to get my first shard (3); then, on re-run determined i was picking up shard4, and put another 1T mount at ./shard4
16:03 🔗 Jon so ~iabak is pretty much full, and ./iabak is not happy about that, but it mangaes to fetch a few 10-20GB each time I run it before giving up
16:03 🔗 Jon into iabak4 that is
16:03 🔗 Jon I should probably move ~iabak -> ~iabak/shard3 and put a small-ish LV at ~iabak for transient stuff
16:16 🔗 iabak-reg 03registrar 05master d735526 06other 10SHARD10/pubkeys registration of rtucker-iabak on SHARD10
17:06 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
17:36 🔗 HCross I can get 10.5TB of storage for 42.65£, or get 6TB for £23 (the 10TB is an i7, and the 6TB is an ARM). (All pricing is monthly) What do people suggest? Spend more for non ARM
17:46 🔗 iabak-reg 03registrar 05master 3db70fa 06other 10SHARD5/pubkeys registration of deewiant+ia.bak on SHARD5
17:46 🔗 Kaz depends how long you plan to keep it for
17:47 🔗 HCross im using the ARM atm, and its gotten to 600GB and given up
17:47 🔗 HCross as it cant checksum fast
17:47 🔗 Kaz �43/mo adds up very quickly compared to building a half-decent machine and buying your own drives
17:48 🔗 HCross yea
17:48 🔗 Kaz 12+ drives to a case, replace/expand as and when you want etc
17:52 🔗 HCross Kaz, I want to build something, but dont really know where to get started
17:53 🔗 Kaz depends what your budget is to start out really. then you've got building your own vs buying decom'd servers off ebay etc
18:35 🔗 VADemon has joined #internetarchive.bak
21:15 🔗 kyan has joined #internetarchive.bak
21:21 🔗 iabak-reg 03registrar 05master 1bc4740 06other 10SHARD10/pubkeys registration of Kaz on SHARD10
21:25 🔗 Kaz db48x: just for some context as I'm going through testing things. SHARD19 is my testbed for now
21:26 🔗 Kaz 'git annex get --auto -J50' caps cpu at 100% for ~30 seconds, then exits
21:27 🔗 Kaz 'git annex get -J50' is now downloading normally (Though I assume this is just going to grab every file it can get)
21:35 🔗 iabak-reg 03registrar 05master 11cedde 06other 10SHARD19/pubkeys registration of Kaz on SHARD19
22:25 🔗 iabak-reg 03registrar 05master 58ace3c 06other 10SHARD6/pubkeys registration of me on SHARD6
22:40 🔗 iabak-reg 03registrar 05master a78b3da 06other 10SHARD5/pubkeys registration of me on SHARD5
23:06 🔗 db48x Kaz: indeed
23:07 🔗 db48x git-annex git --auto tells it to use the stored preferences for what files it should get, but we don't use that git-annex feature
23:07 🔗 db48x see git-annex help prefferred-content
23:08 🔗 db48x err, preferred
23:12 🔗 db48x the reason why iabak doesn't use preferred-content settings is that git-annex doesn't randomize the order in which it scans the repository, so everyone would tend to download the same files when they all jumped into a brand new shard
23:13 🔗 Start has joined #internetarchive.bak
23:14 🔗 db48x iabak uses git-annex find --not --copies 4 to select files to download, shuffles that list, and then feeds it back to git-annex get
23:16 🔗 db48x we also process it slightly to change it from a list of files into a list of items first, so that we tell git-annex to download the whole item
23:16 🔗 db48x that helps the user when they go to view the files because they'll have complete items rather than one file from each one
23:23 🔗 iabak-reg 03registrar 05master 33a35c1 06other 10SHARD6/pubkeys registration of milenko on SHARD6
23:25 🔗 db48x Kaz: btw, the new status page needs a fast-forward button, so that we can skip over the boring parts of waiting for the clients to download things
23:25 🔗 SketchCow Is there a prototype of a new status page?
23:26 🔗 db48x SketchCow: http://iabak.archiveteam.org/status.html
23:27 🔗 SketchCow https://s-media-cache-ak0.pinimg.com/236x/1b/ea/58/1bea585a849b1de39c4122fcb1dbcf62.jpg can replace the missing one
23:27 🔗 db48x nice
23:28 🔗 Kaz Not too sure I understand what you mean?
23:29 🔗 db48x Kaz: it currently takes _days_ to see any changes in the graphs on that page
23:30 🔗 Kaz right
23:31 🔗 Kaz will have a look into things, not too sure on the best way to go about that really
23:31 🔗 db48x so if you put in a fast-forward button that speeds up the changes to the graphs so that they go up more quickly, that'd be great
23:31 🔗 db48x I'll leave the implementation details up to you

irclogger-viewer