[00:27] *** Start has joined #internetarchive.bak [00:28] 03registrar 05master cf23978 06other 10SHARD6/pubkeys registration of infinity on SHARD6 [00:52] 03registrar 05master 038c73d 06other 10SHARD7/pubkeys registration of kurtmclester on SHARD7 [00:55] shard5 done [00:59] https://soundcloud.com/renjith-vijay/fast-and-furious-7-get-low-ringtone [00:59] That plays every time we move shards. [01:00] i've got 6TB down so far in general. how can i check progress on a specific shard? [01:00] closure: ^ [01:01] tpw_rules: http://iabak.archiveteam.org/SHARD5.html [01:01] can substitute SHARD% for the shard you want to view [01:01] i mean me specifically [01:01] SHARD5* [01:15] yeah, we could make individual pages for each registered user. already have the graphs and data. [01:19] what about just have it as part of git annex? [01:20] tpw_rules: just run git-annex info [01:23] oh duh [01:24] why are some "unknown size"? [01:24] xml files with dummy size 0 in the IA survey [01:24] is the working tree the ones on your server or on disk size [01:25] that's the total files, local annex is on your disk [01:25] ok [01:26] okay so i have about a third of shard6 [01:26] gonna need to add more disk soon [01:30] tpw_rules: what's the UUID of your copy of the shard? [01:32] cd shard5; git config --get annex.uuid [01:32] *** primus104 has quit IRC (Leaving.) [01:34] *** BotOfWar has quit IRC (Quit: left4dead) [01:35] shard5: ada4868b-f969-40ba-ae7e-d58a20f1c477 shard6: a0c7d62a-53ec-4641-8ecf-af122b2cd5da shard1: 6ca85911-2e94-49b6-bbdc-cb1be080e300 [01:35] i'm talking about shard6 [01:35] *** VADemon has joined #internetarchive.bak [01:35] can you remove all the ones that claim to be twatson52 that aren't those? i made a bunch of ones that got blown up in various ways [01:36] they all have the same email [01:36] http://iabak.archiveteam.org:8080/render/?width=1060&height=733&_salt=1432604162.838&target=iabak.shardstats.leaderboard.a0c7d62a-53ec-4641-8ecf-af122b2cd5da.shard6 [01:37] http://iabak.archiveteam.org:8080/render/?width=1060&height=733&_salt=1432604200.538&target=iabak.shardstats.leaderboard.a0c7d62a-53ec-4641-8ecf-af122b2cd5da.shard6&from=-1weeks [01:37] pretty nice slope there [01:38] you can use iabak.archiveteam.org:8080 to look at all the different stats we collect [01:38] gives you a nice browser and graph editor [01:38] is there any reason the line is all dashy? [01:39] we only check in once per hour [01:39] you can use the graph editor to make it join up the dots [01:39] but shouldn't it connect the dots? i get the stairstep but it looks like it was done with a dashed line [01:39] ah [01:40] it doesn't know a priori what a gap in the data means [01:40] in this case it means a missed sample; it could just as easily mean a zero [02:08] has anybody else fiddled around with union filesystems for storing all the data? [02:09] I use ZFS [02:09] what's the goal of using a union fs? [02:09] i'm coming at it from the point of having piles of hard drives that could be used [02:09] basically be able to have a failure of one drive not affect all the data stored [02:09] yea [02:10] ZFS allows striping/mirroring/raid across disks [02:10] and, more importantly, a failure of one drive not make the other data unusable like it would in a RAID 0 [02:11] true [02:11] I prefer raid though, so that the occasional error can be corrected [02:11] but otoh i'm thinking about the idea of writing a disk manager that would create a new repo on each disk and manage filling them up and fscking as plugged in [02:11] so i don't need 45 drive enclosures and usb ports [02:11] :) [02:12] we should have a way for the iabak client to help that out [02:12] but then i have to actually swap drives around and i'm lazy [02:13] well how in the hands of the masses do you want this to be? i know several people who would be happy to help if it were just something they kept running on windows in the tray [02:14] and i really don't think making it all shell scripts is condusive to that ideal [02:15] very true [02:16] With multiple disks you have to make sure that they're not all getting the same data [02:17] also note that if i say i have an idea, you can say "shut up and make it" because i could. but here goes. have a config file of archive destinations. for each, give it a name and directory. in each directory, put a file with the name and a uuid [02:17] No good having 3 copies of a file sitting next to each other in one drawer [02:17] this way you can have the directory be a mount point (and the same for multiple desinations) and the file can be used to determine if a disk is mounted/which one is there [02:17] sep332: oh yeah, that's important [02:18] sep332: yea, git annex can handle that if we tell it to [02:18] though i get the approach of building this from common tools so it can be rebuilt in the event of thermonuclear meltdown, i think it limits options a lot [02:19] but this program could also read the config and determine which ones need to be checked and ask for the appropriate one to show up at the directory [02:19] so it could easily support multiple drives that are swapped out or multiple locations on one computer [02:20] also perhaps the option to download locally so it can fsck one drive and later copy files to another when it comes back [02:22] we can simplify that more, even [02:23] git annex repositories already have their uuid [02:23] can you scan a directory for a git repo easily? [02:23] yes [02:23] recursively? [02:23] but yeah. just read the .git/whatever in the pointed to directory to make sure [02:23] so if the iabak script listened for devices being added (udev, on linux) and found the repositories on them, it could fsck them [02:24] ignore my previous two questions [02:24] yep [02:24] does git exist for windows without cygwin? [02:24] and could git annex? [02:24] no [02:25] is it possible to bundle cygwin into an installer? [02:25] sure [02:25] git already does that [02:25] i have some experience with windows gui programming (python/pyqt so it could even be cross-platform) and bundle all that stuff together [02:25] excellent :) [02:27] now this is where it gets shaky: how do we tie multiple repos on multiple computers to one name? that would be necessary to prevent duplication. i have a friend with a stack of six or so laptops; they should all know the files the others have [02:27] yes, each repository knows which other repsotories have each file [02:28] Yeah, this is git-annex's job [02:28] so we would have some association of the sets of repositories to an account and git annex can query those in the set? [02:28] mostly [02:29] i mean we don't want two copies of a file three feet from each other, but having two copies three states from each other is good [02:29] tpw_rules: yes [02:29] *** VADemon has quit IRC (Quit: left4dead) [02:29] I suspect we're going to run into a LITTLE of that no matter what, because people will want to "help" [02:29] theres a couple of ways to do it [02:30] there's a git-annex feature coming down the line at some point which we could rely on [02:30] SketchCow: sure, but i wasn't sure how we automatically prevented the first and allowed the second [02:30] or iabak could simply say git annex get --not --copies 4 --not in otherrepo1 --not --in otherrepo2... [02:31] that list would only update when a sync happened though [02:31] might have to tune that in practice [02:31] the git annex feature is called balanced preferred content: http://git-annex.branchable.com/design/balanced_preferred_content/ [02:32] if the disk is offline, then the last sync for it has the must up-to-date information already :) [02:32] well this could also support n drives simultaneously too [02:33] simplest to put a single repository on each of them [02:33] i was thinking more across multiple computers. but the balanced thing would sovle that problem [02:34] both ways of doing it will work for n computers and n drives, and n drives on each of m computers ;) [02:34] but which will be the most perfect [02:35] :) [02:36] i'd be happy to write a gui but i don't know enough about git annex and bash to do the script. i feel we should move it to python or something [02:36] or haskell :P [02:36] haskell would be fun [02:36] I don't know haskell very well yet; it'd be fun to learn [02:36] i need to learn haskell [02:37] i have worked with functionaly programming languages, but nothing truly functional [02:44] closure: Seems like I need to start giving you more collections [02:44] And we are getting to the point where we need to do a sanity check to make sure a collection isn't already being backed up [02:46] I have that sanity check in place actually [02:46] http://iabak.archiveteam.org/client/f9601d3062715f39f6290547fbaf14b3e6c2b4fb.html [02:47] Great [02:50] 03registrar 05master da43505 06other 10SHARD6/pubkeys registration of kurtmclester on SHARD6 [02:56] Shard6 is filling in nicely. [04:03] maybe we don't need all commits in the channel :P [04:53] Yes. We. Do. [04:55] okay okay okay [04:57] /media/iabak/disk2/IA.BAK;/media/iabak/disk3/IA.BAK;/media/iabak/disk4/IA.BAK 8.1T 5.9T 1.8T 77% /home/thomas/iabak/IA.BAK [04:57] This is what we PLAYYYY FORRRRR [04:57] fourtunately i have 3x 3TB disks that i'm not using right now [04:58] anyway, it is time to say good night and let the datums flow in [05:03] 03registrar 05master a2a9eb2 06other 10SHARD6/pubkeys registration of kevin on SHARD6 [06:00] *** zottelbey has joined #internetarchive.bak [06:37] 03registrar 05master 9e6e2ff 06other 10SHARD6/pubkeys registration of bas+at on SHARD6 [07:48] tpw: http://iabak.archiveteam.org/stats/SHARD5.leaderboard-raw seems to think you have 8T 8T 8T 1.5T [07:51] Err, no, divite all those numbers by 10, I can'tmath :) [07:51] I'll just go tback to bed, clearly I'm not awake yet [07:53] new stats pages per user are really nice :) [07:55] just noticed the cleanup function in the iabak script doesn't work on OS X (more stupid Mac-only glitches I assume) - https://github.com/ArchiveTeam/IA.BAK/blob/a420ad/iabak-helper#L282 throws "No such file or directory" (pid does exist, statement should eval to true) [07:56] *** ivan` has joined #internetarchive.bak [07:56] #L285 in that file seems off to me, think the file is supposed to be rm'ed, not the pid-number right? [07:56] *** primus104 has joined #internetarchive.bak [08:03] https://news.ycombinator.com/item?id=9602868 [08:16] I would like to nominate https://archive.org/details/archiveteam_greader for distributed archival because it's got 8TB of compressed text, a lot from dead blogs that are nowhere else [08:16] the Directory and Stats are unimportant and omitting them saves ~800GB [08:16] I was planning on dumping it into my Google Drive or onto external drives but never got around to either but will maybe try later [08:19] 03registrar 05master 37605fb 06other 10SHARD6/pubkeys registration of cyrus on SHARD6 [08:21] 03registrar 05master 5463225 06other 10SHARD6/pubkeys registration of peter on SHARD6 [08:26] *** Start has quit IRC (Read error: Connection reset by peer) [08:26] *** Start_ has joined #internetarchive.bak [08:28] *** Cyrus has joined #internetarchive.bak [08:28] Emcy: I'd say "2 sheets" rather than "4 or more" :) [08:28] Bah, mischat :) [08:40] 03registrar 05master d422268 06other 10SHARD6/pubkeys registration of antoine on SHARD6 [09:17] This project just got mentioned on hackernews. [09:17] Might cause a run on clients. [09:17] Or whiners. [09:17] Or whiny clients [09:17] Or client whiners [09:19] 03registrar 05master c394c48 06other 10SHARD6/pubkeys registration of koos303 on SHARD6 [09:21] *** atomotic has joined #internetarchive.bak [09:23] All four plus extra. [09:25] *** atomotic has quit IRC (Client Quit) [09:26] We are probably going to start getting into the light realm of bad actors. [09:26] We'll see how we handle it. [09:28] 03registrar 05master f586bb7 06other 10SHARD6/pubkeys registration of info on SHARD6 [09:36] *** Start_ has quit IRC (Read error: Connection reset by peer) [09:36] *** lufix has joined #internetarchive.bak [09:37] *** Start has joined #internetarchive.bak [10:05] 03registrar 05master 40f2c21 06other 10SHARD6/pubkeys registration of bdupray on SHARD6 [11:03] *** lufix has quit IRC (Ping timeout: 240 seconds) [11:03] 03registrar 05master 9ebe0e8 06other 10SHARD6/pubkeys registration of alex_online78532 on SHARD6 [11:10] *** lufix has joined #internetarchive.bak [11:10] *** hendi has joined #internetarchive.bak [11:27] is there a way to set a nicer name for my account? currently I'm named "info"? [11:28] I believe there is an intention to allow nicknames. Currently it's using the bit before the @ in your email address [11:28] alright, thanks; I'll keep an eye out for that functionality then [11:30] 03registrar 05master 744b699 06other 10SHARD6/pubkeys registration of andrei.zbikowski on SHARD6 [11:57] *** atomotic has joined #internetarchive.bak [12:34] 03registrar 05master d785a5d 06other 10SHARD6/pubkeys registration of olliejudge on SHARD6 [12:39] *** beardicus has joined #internetarchive.bak [12:40] hello, meat popsicles. [12:40] i think i need to register. [12:41] finally got a chance to update my scripts this morning... should be fscking shard1 right now. [12:41] but i am "out of touch" [12:41] beardicus: i think you just need to run change-email in the iabak dir [12:42] hmm. i now see `register-helper.pl`... let's see what that does. [12:44] 03registrar 05master a378f40 06other 10SHARD1/pubkeys registration of brian on SHARD1 [12:44] 03registrar 05master dde25de 06other 10SHARD2/pubkeys registration of brian on SHARD2 [12:45] That seems to have worked [12:45] that was change-email that did it. thanks ppiixx [12:46] noting that prompt-email did nothing. [12:46] also noting that my system has neither systemd nor cron, so the script is a little complainy and i'll have to figure that out. [12:47] No cron! [12:47] it's a synology nas. [12:47] there's a gui thingy for running tasks though, i think. [12:48] "Task Scheduler" yay. [12:49] assuming the corrrect periodic command to blip is iabak-cronjob? daily? [12:50] yep [12:50] *** sankin has joined #internetarchive.bak [12:51] 03registrar 05master b95aab9 06other 10SHARD6/pubkeys registration of olliejudge on SHARD6 [12:56] beardicus: My synology nas has cron, I believe? [12:57] hmm. crond does exist. no crontab though. [12:59] 03registrar 05master 69747f3 06other 10SHARD6/pubkeys registration of atrus6 on SHARD6 [13:30] beardicus: Ah, I see :) http://www.multigesture.net/articles/how-to-use-cron-on-a-synology-nas/ [13:30] Might help [13:31] yeah. apparently you need to use tabs between fields too. [13:48] 03registrar 05master e1e0acf 06other 10SHARD6/pubkeys registration of jon archive.org on SHARD6 [13:50] 03registrar 05master 629d09d 06other 10SHARD6/pubkeys registration of mail on SHARD6 [13:55] *** Start has quit IRC (Disconnected.) [14:18] 03registrar 05master f8bf3c8 06other 10SHARD6/pubkeys registration of chiploaded on SHARD6 [14:34] *** zottelbey has quit IRC (Ping timeout: 512 seconds) [14:41] *** Start has joined #internetarchive.bak [14:46] *** ohhdemgir has joined #internetarchive.bak [14:52] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [14:55] 03registrar 05master 09b57ac 06other 10SHARD6/pubkeys registration of moritz.steiner on SHARD6 [15:17] *** zottelbey has joined #internetarchive.bak [15:19] 03registrar 05master 3988cd2 06other 10SHARD6/pubkeys registration of mariusz on SHARD6 [15:20] *** beardicus has quit IRC (Sleep.) [15:20] *** primus104 has quit IRC (Leaving.) [15:28] 03registrar 05master e5f4b99 06other 10SHARD6/pubkeys registration of iabackup on SHARD6 [15:45] I second ivan`'s nomination of the google reader archive, even though the files are huge [15:50] *** Start has quit IRC (Disconnected.) [15:57] *** Start has joined #internetarchive.bak [16:00] *** scatman has joined #internetarchive.bak [16:01] *** Start has quit IRC (Client Quit) [16:12] *** mariusz has joined #internetarchive.bak [16:13] *** Zero_Dogg has joined #internetarchive.bak [16:14] 03registrar 05master 91c2b2a 06other 10SHARD6/pubkeys registration of archive.org on SHARD6 [16:14] Hi. How can I register myself? :) [16:16] Your install-git-annex is a bit stupid, always defaulting to i386. There are standalone tarballs for arm too, that works on raspberry pi (which I'd be using if I set it up) [16:16] *** Start has joined #internetarchive.bak [16:18] https://github.com/zerodogg/scriptbucket/blob/master/gitannex-install#L52-L60 is an example of the logic needed [16:23] do submit patches, we all run on i386/x86-64 and therefore haven't had a need to generalize [16:25] I will :) [16:26] Got some spare space that I might be able to use for this, but it's on a raspi server [16:26] Zero_Dogg: So you have criticism and can't donate space! [16:26] You... you came from Hackernews [16:27] does it require much cpu after the whole thing is downloaded (ie. does it git annex fsck, much)? [16:27] SketchCow: lol [16:28] SketchCow: I came from your blog, actually :p [16:28] Oh, THAT dump [16:28] hah [16:30] http://upload.wikimedia.org/wikipedia/commons/5/55/Creature_from_the_Black_Lagoon_poster.jpg [16:39] I will say, the Dogg is right in one regard - we should come up with some FAQ/information on how system intensive the ongoing holding of the data is. [16:43] See? All nice and constructive, complete with pull request. Not hackernewsy at all [16:45] *** Start has quit IRC (Disconnected.) [16:46] That's the way we like it. [16:58] *** Lord has joined #internetarchive.bak [16:58] hello [16:59] i'm quite interested in this project (backink up the web backup :-) ) [17:06] i launched iabak, it downloaded some files and it failed [17:06] i created a user without home so the script failed [17:06] (maybe this info interest you) [17:08] i think i'll face another problem : the script is downloading gitannex i386 but my gentoo doesn't have multilib support [17:08] *** Beardface has joined #internetarchive.bak [17:09] 03registrar 05master 256141c 06other 10SHARD6/pubkeys registration of lord-ia on SHARD6 [17:10] here i am :-) [17:10] it works [17:10] (with lots of setlocale errors) [17:14] can git-annex be configured to use a set amount of space? like 2TB? [17:19] sep332: git config annex.diskreserve 2000GB [17:20] that's not how much space it *won't* use? [17:23] *** primus104 has joined #internetarchive.bak [17:26] sep332: you're right:) [17:26] sep332: sorry [17:26] np. it's a cool idea just not what i'm looking for [17:29] The new clients are cuasing some partying. [17:30] 03registrar 05master 3de6df9 06other 10SHARD6/pubkeys registration of dylan.barlett on SHARD6 [17:33] is that a problem? i'll bring pizza [17:33] i'm getting an error trying to sync to shard 6: error: Ref refs/heads/synced/git-annex is at 8c3a1a32ad19cc72f8429d7078dce8e9bc7e9e67 but expected e606f7f0f586c8f1504bd49cccb48d81dfa0a873 [17:35] No, it's not causing a problem at all. Just watching the activity. [17:35] We also are getting dilletantes, which is good, because that's a worthwhile experiment. [17:36] (People joining to fuck around and see what it does, then going "well that was fun" and disappearing, likely already, but certainly within the 2 week/4 week period) [17:38] My theory is this will just cause a bunch of 0.00 clients, since people are unlikely to go "let's see what it does.... DOWNLOAD A TERABYTE" [17:39] 03registrar 05master b29843a 06other 10SHARD6/pubkeys registration of frozenbeardme on SHARD6 [17:42] 03registrar 05master 5e39162 06other 10SHARD6/pubkeys registration of ryan on SHARD6 [17:46] it works! [17:48] That's what we hope! [17:48] How much space you got, Beardface! [17:48] * SketchCow rubs hands like Mr. Burns [17:48] ~1T so your last comment is kind of relevant, heh [17:49] I currently have 1.5TB and think about adding some more [17:50] should I do RAID1, or go without RAID, and just redownload when a drive fails? [17:50] *** Start has joined #internetarchive.bak [17:50] it checks for systemd to install a service, if not found it exits.. intentional? (when you start it again it installs a cron instead) [17:52] *** primus104 has quit IRC (Leaving.) [17:53] hendi: there is already redundancy in the iabak software so that multiple people will get the same file, so local RAID is unnecessary [17:54] agree with DFJustin - it's wasted space, unless you personally have an interest in a collection in a way you are making it available elsewhere. [17:58] great, thank you [17:58] expect at least 15TB from me, then [18:01] Fantastic. [18:01] That'll help a lot. [18:02] I plan on running iabak on more than one computer. Is the software smart enough to send me a different set of data to each one? [18:02] not yet [18:02] hendi: i've used mhddfs to attach a bunch of drives into one filesystem [18:02] the advantage being that if one drive breaks, it doesn't take everything [18:03] mariusz: It's a worthwhile feature going forward for closure and db48x to consider, where machines are called buddies and they're treated as one machine. [18:04] Yeah,tThat would be great.. [18:05] This project keeps coming up with new feature-adds [18:06] another question. I have about 50 older drives ranging from 80GB to 1TB that I could manually plug-in once every few weeks. Is "cold storage" supported? If yes, any info how to go about this? [18:06] Somebody in 4 years is going to go "Man, this closure guy thought of EVERYTHING...." [18:06] mariusz: Not yet, in any meaningful way. [18:06] I should say it's supported in git-annex, but we're being simple... for now. [18:06] mariusz: i solved that problem by getting a bunch of extremely cheap usb enclosures and attaching them as one [18:06] Because of aforementioned discovery of "buddy" feature and similar features. [18:07] oh i had another idea: be able to set up an IPC socket that git annex will request downloads from so we can use something other than wget. i was thinking a pretty gui [18:08] or even just a download command [18:09] tpw_rules, thanks for the hint, I'll have a look at mhddfs! [18:09] get 0.1.38 btw, the later version is a bit crashy. it's a union fs, but it supports writes too [18:13] I think it does support cold storage as long as you check in at least once a month [18:13] another idea - sneakerneting the data. i.e get a beer and copy your shard :) [18:14] It's a thought down the line. [18:15] There's a second/third/fourth wave of approach as we hit the upper limits of just scooping people out and into the project. [18:15] But it's still holding up for people going "Oh, yeah, got 10tb lying around." [18:15] The main critical thing is to make sure we have chosen collections that aren't wasteful. [18:18] I'm a little suspicious about some of the recent ones like wikipediadumps [18:20] (why is that plural? i thought one dump contained the entire history of everything) [18:21] well there is more than one wikipedia (language) [18:25] oh, true [18:35] I think wikipediadumps is on the edge. [18:35] On the other hand, our collection of dumps goes WAY back farther than anyone. [18:35] I did some of those, with lots of skeletons [18:36] Like Erik Moller pro-childporn arguments that were quietly expunged when he became Wikipedia org dude [18:36] Or Jimbo Wales getting into an argument with someone, and having a db admin remove the thing he said, and then going "I never said that." [18:36] And who knows what else, down there. [18:36] But year. [18:36] Yeah. [18:36] Maybe we need a nominations page. [18:37] Yes. [18:37] We do. [18:37] OK, one moment. [18:37] *** Start has quit IRC (Read error: Connection reset by peer) [18:46] http://archiveteam.org/index.php?title=INTERNETARCHIVE.BAK/nominations [18:49] are the shars a set size? [18:49] shards* [18:49] The shards are a set number of files/items [18:50] So imagine it's... 20 [18:50] 20 1m files mean tiny shard [18:50] 20 1g files means fatty shard [18:50] Once things are going into the upper reaches/echelons, we'll see cases where shards 100-1000 are all 1mb or some such chicanery [18:51] ahh [18:53] 03registrar 05master a1cadb4 06other 10SHARD6/pubkeys registration of mmein301+i on SHARD6 [18:53] Like, 80% of the work going on now is to find and deal with use cases and bugs/contingencies they reveal. [18:53] 10% is improving the UI and interaction [18:54] 10% is filler, primarily melted hooves and horns [19:00] 03registrar 05master 95ae19f 06other 10SHARD6/pubkeys registration of nico+iabak on SHARD6 [19:01] btw. does any one knows why git config annex.web-options=--limit-rate=200k returns "invalid key" error? [19:01] If I want to run iabak on multiple machines, should I copy the private key over for accounting and stuff, or use a new one on each machine? [19:08] 03registrar 05master b6d2ef7 06other 10SHARD6/pubkeys registration of cyberjacob+IA on SHARD6 [19:08] mariusz: I think git config doesn't want the = there [19:08] hendi: use a new one [19:12] closure: that worked. thanks. probably would be good [19:13] to change the README [19:13] ;) [19:13] Glad to see I'm not the only person using a foo+bar address :) [19:14] 03registrar 05master 1d18796 06other 10SHARD7/pubkeys registration of mail on SHARD7 [19:15] error: Ref refs/heads/synced/git-annex is at 38179c6b1a70682556e88bf6d5c94187cdaabaac but expected ffeac6bf96c796c6117981d2ee64fc642edbaa01 [19:16] i am still getting sync problems like that [19:16] i don't have more than one iabak script running [19:18] well, that can happen if someone else pushed a change at the same time. It should normally clear up the next time, unless you're unludky [19:24] 03registrar 05master f760a1f 06other 10SHARD7/pubkeys registration of cyberjacob+IA on SHARD7 [19:28] *** Start has joined #internetarchive.bak [19:30] *** Start has quit IRC (Client Quit) [19:31] 03registrar 05master c7d5ea9 06other 10SHARD6/pubkeys registration of eskild on SHARD6 [19:48] *** beardicus has joined #internetarchive.bak [19:50] 03registrar 05master 40e077c 06other 10SHARD7/pubkeys registration of moritz.steiner on SHARD7 [19:50] *** primus104 has joined #internetarchive.bak [19:51] /win 20 [19:52] bah [19:53] where are the authorized_keys files again? I think I'm missing from shard1 [19:54] *** atomotic has joined #internetarchive.bak [19:57] .git/annex/id_rsa and id_rsa.pub [19:58] *** CyberJaco has joined #internetarchive.bak [19:58] Hi [19:59] i have id_rsa but I'm getting Permission denied (publickey) when i sync [19:59] hi CyberJaco [19:59] 03registrar 05master dd3be51 06other 10SHARD6/pubkeys registration of hannson on SHARD6 [20:00] that's weird, why is the last letter of my name mising... [20:00] sep332: sounds like the wrong key, we have separate sets of keys for each shard, so shard1 may not have the pubkey you're using for other shards [20:00] looks like an 8-char limit? [20:01] closure: can i register a new one? [20:01] sep332: manually, yes.. [20:01] ./register-helper.pl "$SHARD" "$uuid" "$registrationemail" "$(cat id_rsa.pub)" [20:02] full in the bits, that will give an url you can hit to register [20:05] *** mariusz has quit IRC (Read error: Operation timed out) [20:05] sep332: or, I can manually add it [20:07] 03registrar 05master f36f056 06other 10SHARD1/pubkeys registration of sean.palmer on SHARD1 [20:08] 03registrar 05master fd09178 06other 10SHARD6/pubkeys registration of brian on SHARD6 [20:09] closure: is it the [annex] uuid or the [remote "origin"] annex-uuid? [20:10] 03registrar 05master 84567e0 06other 10SHARD6/pubkeys registration of steven.m.reed on SHARD6 [20:11] SHARD6 does the climb [20:11] sep332: the annex.uuid [20:17] closure: that's what i put in, same error [20:17] i'm using the same key for all shards [20:18] check perms of your id_rsa file [20:19] -rw------- [20:24] sep332: check if shard1's git config has remote.origin.annex-ssh-options set [20:27] nope. i'll just copy it from another shard [20:28] well, not the whole config, just that setting [20:28] yeah [20:29] alright it's working. thanks closure [20:42] *** CyberJaco is now known as zz_CyberJ [20:44] 03registrar 05master 4de5003 06other 10SHARD6/pubkeys registration of mariusz on SHARD6 [20:46] *** sankin has quit IRC (Leaving.) [20:50] awww yes here comes mariusz [20:56] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [21:01] *** xhdr has joined #internetarchive.bak [21:10] closure: i think the sync error may have killed the iabak-hourly process. it's been a couple hours since that error and it hasn't tried again [21:21] indeed, that could happen [21:22] *** laxity has joined #internetarchive.bak [21:29] closure: noticed that the cleanup function in the iabak script doesn't work on OS X (more stupid Mac-only glitches I assume) - https://github.com/ArchiveTeam/IA.BAK/blob/a420ad/iabak-helper#L282 throws "No such file or directory" (pid does exist, statement should eval to true) [21:29] any clue what could cause that? [21:29] (#L285 in that file seems off to me, think the file is supposed to be rm'ed, not the pid-number right?) [21:34] 03registrar 05master e16b5ee 06other 10SHARD6/pubkeys registration of matt on SHARD6 [22:12] I should have gone ahead and fixed that last night when you mentioned it [22:12] couldn't sleep anyway [22:15] do you not love me [22:16] SketchCow on npr, eep [22:23] 03registrar 05master 93b954b 06other 10SHARD6/pubkeys registration of carl.moden on SHARD6 [22:39] *** mariusz has joined #internetarchive.bak [23:05] *** Atluxity has quit IRC (Ping timeout: 360 seconds) [23:10] *** Atluxity has joined #internetarchive.bak [23:12] 03registrar 05master 240d6ac 06other 10SHARD6/pubkeys registration of paul.chambers on SHARD6 [23:18] *** beardicus has quit IRC (Quit: Sleep.) [23:19] *** Atluxity has quit IRC (Ping timeout: 360 seconds) [23:23] *** Start has joined #internetarchive.bak [23:26] *** Atluxity has joined #internetarchive.bak [23:27] *** beardicus has joined #internetarchive.bak [23:31] closure: ah, an interesting clue [23:32] so where is the ffi function that calls CreateProcess? [23:36] I'm looking at http://hackage.haskell.org/package/process-1.2.3.0/docs/src/System-Process.html#createProcess, but I don't see where it actually calls the win32 api... [23:37] *** Atluxity has quit IRC (Ping timeout: 360 seconds) [23:38] *** Atluxity has joined #internetarchive.bak [23:45] db48x: oh, I just nailed that problem [23:45] *** zottelbey has quit IRC (Quit: Leaving) [23:46] writing the PR for the library that needs changes now.. [23:46] you were in the right place, but it has a side of C files :) [23:49] *** Atluxity has quit IRC (Ping timeout: 360 seconds) [23:52] 03registrar 05master de4a487 06other 10SHARD6/pubkeys registration of iabak on SHARD6 [23:53] ah, good [23:53] is https://github.com/haskell/process/blob/master/System/Process/Internals.hs#L414 closer? [23:54] ah, https://github.com/haskell/process/blob/master/cbits/runProcess.c#L557 [23:56] presumably you're adding a way to add that flag where that's called here: https://github.com/haskell/process/blob/master/System/Process/Internals.hs#L452 [23:57] that's the plan, but I'm actually watching austraian cooking show :P [23:57] feel free to send patch to https://github.com/haskell/process/issues/32 [23:58] *** Atluxity has joined #internetarchive.bak