#internetarchive.bak 2015-05-23,Sat

↑back Search

Time Nickname Message
01:05 🔗 ryang uhh
01:05 🔗 ryang I haven't done shit
01:05 🔗 ryang somebody is maybe giving me credit for good work?
01:06 🔗 mntasauri has quit IRC (Max SendQ exceeded)
01:06 🔗 ryang but at some point I do want to help the effort
01:06 🔗 ryang I think it's a good cause
01:06 🔗 mntasauri has joined #internetarchive.bak
01:09 🔗 closure ryang: this seems to be yours: 2207633498105: 8e7f6693-32d1-45f3-9995-b9596aa783de -- ubuntu@ia:/mnt/ia/IA.BAK/shard6
01:10 🔗 ryang closure: can't be. I don't even have 2TB of storage in my house right now
01:10 🔗 ryang Unless my buddy hooked up my old NAS, deleted my data, and started it for me
01:10 🔗 closure ryang: atwgpc?
01:11 🔗 ryang I don't even know what you mean by that.
01:11 🔗 closure oic. than ryan.g != ryang
01:11 🔗 ryang must not be
01:14 🔗 closure btw, ryang.g has pulled another 250 gb since I mentioned him a few hours ago
01:14 🔗 ryang my hero. :P
01:14 🔗 ryang I wish I could help out
01:14 🔗 ryang But my shit has been pretty ruined ever since the fbi ruined my shit.
01:15 🔗 pikhq ???
01:15 🔗 pikhq There's a story in there.
01:19 🔗 wp494_ is now known as wp494
01:56 🔗 closure hey, someone want to write something that can, given an IA collection name, return some numeric ranking for the collection?
01:57 🔗 closure number of downloads, or something
01:57 🔗 closure some shards will have a lot of small collections, and it would be nice to sort by popularity for display
02:06 🔗 Start !ao https://ghostbin.com/paste/9hr4r
02:17 🔗 primus104 has quit IRC (Leaving.)
02:33 🔗 antomatic has joined #internetarchive.bak
02:37 🔗 antomati_ has quit IRC (Ping timeout: 370 seconds)
02:50 🔗 closure found a collection that breaks the shard creator.
02:50 🔗 closure amusingly, it's from the CCC
02:50 🔗 * closure fixes
02:56 🔗 VADemon_ has quit IRC (Quit: left4dead)
02:57 🔗 tpw_rules okay we are back on line
02:57 🔗 tpw_rules sorry for all the registration spam
02:58 🔗 tpw_rules closure: git annex seems to like leaving around ssh processes after being ctrl+Ced
02:59 🔗 tpw_rules maybe it's iabak's fault, but it doesn't start ssh directly
02:59 🔗 closure there's a background ssh that's used for connection caching
02:59 🔗 closure git config annex.sshcaching false will disable that
02:59 🔗 tpw_rules i have that disabled.
03:11 🔗 closure I guess it's just the background iabak-hourlysync
03:19 🔗 closure 8.8 tb shard. so that just happened
03:43 🔗 tpw_rules closure: oh dear. i guess it's time to add new disks
03:49 🔗 closure look on the bright side, we're getting close to 1 million files, which is 1/271th of all the files
03:50 🔗 * closure notices that the iabak server only has 15 million free inodes.. will need to do something about that one of these months
03:56 🔗 tpw_rules store each shard as a disk image?
03:56 🔗 tpw_rules and loop mount it
03:57 🔗 chazchaz has quit IRC (Ping timeout: 369 seconds)
03:58 🔗 closure /dev/simfs 80G 35G 46G 43% /
03:58 🔗 closure will run out of disk 1st, but it may be expandable
04:00 🔗 chazchaz has joined #internetarchive.bak
04:03 🔗 * tpw_rules is currently downloading at around 10 floppies per second, or 1.3 CDs per minute
04:05 🔗 closure satas per fortnight
04:05 🔗 tpw_rules i calculate 1.3TB/day
04:06 🔗 tpw_rules 15MB/sec
04:08 🔗 tpw_rules what directories will be created in .git/annex that might contain data other than objects/ and bad/
04:15 🔗 closure tmp/
04:15 🔗 tpw_rules that's it?
04:15 🔗 closure yes
04:15 🔗 tpw_rules k, qool
04:15 🔗 closure if you mean large object data
04:15 🔗 tpw_rules yes
04:16 🔗 tpw_rules could there be a setting to just delete bad data instead of letting it hang around?
04:17 🔗 closure guess there could be..
04:18 🔗 closure rm -rf will work find on that bad/ dir
04:57 🔗 closure oh btw, with today's git-annex release, you should be able to parallelize downloads better
04:57 🔗 closure try something like git annex get -j10
04:58 🔗 closure er, -J10
05:02 🔗 tpw_rules are you at all related to the iabak script itself?
05:07 🔗 tpw_rules will ./iabak auto-download the latest version of git-annex?
05:07 🔗 tpw_rules or do i have to remove that directory
05:09 🔗 closure it should auto-upgrade
05:09 🔗 iabak-reg 03registrar 05master 955c037 06other 10SHARD1/pubkeys registration of twatson52 on SHARD1
05:10 🔗 iabak-reg 03registrar 05master 7b1b8a8 06other 10SHARD5/pubkeys registration of twatson52 on SHARD5
05:10 🔗 iabak-reg 03registrar 05master e2a6e52 06other 10SHARD6/pubkeys registration of twatson52 on SHARD6
05:10 🔗 tpw_rules what about automatically setting annex.sshcaching false? i think i've been missing registration because the ./checkoutshard script died
05:14 🔗 * tpw_rules thinks he is the whiniest member of this channel
05:43 🔗 closure woah, top of leaderboard maneuvering! :)
06:32 🔗 ryang pikhq: I joined an anon chat room back in 2010 when they were doing their ddos's and bs'ed with them. FBI decided that made me a conspirator.
07:41 🔗 Senji closure: how do I tell *which* of my shards is unregistered? :)
08:03 🔗 Senji I like -J, but I miss seeing wget doing its thing :(
08:33 🔗 primus104 has joined #internetarchive.bak
08:46 🔗 ppiixx woo -J is much faster than my hacky old multiple instances of iabak setup
10:05 🔗 Kenshin has quit IRC (Quit: ZNC - http://znc.in)
12:11 🔗 tpw_rules Senji: exactly :(
12:31 🔗 tpw_rules bah that figure for me is a lie. i don't have that much storage attached
12:36 🔗 tpw_rules Senji: slight mitigation: install iftop
13:14 🔗 primus104 has quit IRC (Leaving.)
14:32 🔗 VADemon has joined #internetarchive.bak
15:02 🔗 Kenshin has joined #internetarchive.bak
15:03 🔗 svchfoo2 sets mode: +o Kenshin
15:04 🔗 Atluxity has joined #internetarchive.bak
15:40 🔗 primus104 has joined #internetarchive.bak
16:42 🔗 primus104 has quit IRC (Leaving.)
17:24 🔗 VADemon_ has joined #internetarchive.bak
17:25 🔗 svchfoo3 has quit IRC (Read error: Connection reset by peer)
17:30 🔗 VADemon has quit IRC (west.us.hub irc.eversible.com)
18:13 🔗 primus104 has joined #internetarchive.bak
18:32 🔗 svchfoo3 has joined #internetarchive.bak
18:32 🔗 svchfoo2 sets mode: +o svchfoo3
18:34 🔗 pikhq ryang: Oh "fun".
18:42 🔗 lhobas_ has joined #internetarchive.bak
18:56 🔗 primus104 has quit IRC (Leaving.)
19:11 🔗 kyan has joined #internetarchive.bak
19:32 🔗 primus104 has joined #internetarchive.bak
19:46 🔗 kyan has quit IRC (Quit: This computer has gone to sleep)
21:05 🔗 S[h]O[r]T closure ryan.g is me :p
21:06 🔗 S[h]O[r]T you guys for some reason wanted an email reg and not a nickname like every other project ;)
21:08 🔗 S[h]O[r]T my box is now temp down since it lives in the cloud and my friend who operates that cloud is having some openstack issues. should be back by monday hopefully
21:28 🔗 pikhq The point of the email reg is to have a way to contact someone who has stuff expiring.

irclogger-viewer