[09:35] emijrp: my replacement hdd arrived, I'll have to stop everything and reinstall OS today
[09:36] it would be nice to have the upload but not overwrite existing files feature, to resume (not necessary of course)
[09:36] it doesnt overwrite if you conservate the log
[21:44] ok, upload restarted
[21:45] nemo is happy
[21:50] emijrp: yes because the reupload filter is working reasonably well, it's even finding some forgotten dumps http://archive.org/details/wiki-enanarchopediaorg
[21:50] emijrp: btw I'm not sure the Citizendium dump is perfect, does it contain all the revisions it should?
[21:51] I never verified
[21:51] i checked integrity and looks ok
[21:51] how is the anarcho wiki smaller a month later?
[21:52] emijrp: did you compare it with their special:statistics ?
[21:52] that's the point, for a while I've been running a broken dumpgenerator I'm afraid
[21:52] yes, and it seems ok for me
[21:52] good
[21:52] http://archive.org/details/wiki-enanarchopediaorg it also doesn't manage to update last-updated-date
[21:55] metadata is not changed
[21:56] probably need a special flag in curl
[22:00] emijrp: yes, let me check
[22:03] emijrp: should be --header 'x-archive-ignore-preexisting-bucket:1' but I'm not completely sure
[22:04] is there a list of wikis we need to dump?
[22:04] I have some spare boxen I'd like to use to help
[22:05] underscor: http://code.google.com/p/wikiteam/wiki/TaskForce
[22:07] (to start with :p)
[22:07] emijrp: why don't you try to rerun Pavlo's crawler? we'll soon need more wikis to download, that list was really old :)
[22:08] i will check, but it uses yahoo api and dont know if that still works
[22:09] hm
[22:10] you might need someone with an old key or something
[22:10] Are we going to repeat these at some point?
[22:10] or do we have a plan in place for that?
[22:11] yearly...
[22:12] many wikis will dead, others will born, and the index needs to be updated
[22:12] ah, okay
[22:12] when did we start?
[22:12] I forgot
[22:12] haha
[22:13] start what?
[22:14] April
[22:14] for these TaskForce wikis
[22:15] we had a big leap waiting for the uploader script being coded by gnomes
[22:15] aha
[22:16] yeah, I remember Nemo_bis (I think) complaining about that
[22:16] hahaha
[22:16] ok, so once these launcher.pys finish their pieces of a list, what do I do next?
[22:17] upload
[22:17] it is explained at the bottom
[22:17] i dont know if move that section above the table
[22:18] oh, found it!
[22:18] have to go
[22:18] would it be helpful just to have a set of s3 keys for wikiteam uploader?
[22:19] I can get a special "group" account type thing like we used for mobileme
[22:26] Nemo_bis: any idea why this happened?
[22:26] http://hastebin.com/xifamifuka.coffee
[22:26] chronomex: is4 SketchCow soultcer +o?
[22:27] underscor: what a horrible pastebin
[22:27] yeah, the syntax selector fucked itself
[22:28] no, I mainly hate the JS
[22:30] omg they even boast about i "the leegant pastebin" sigh
[22:31] underscor: try now
[22:31] like, redownload launcher.py?
[22:31] underscor: yes
[22:32] kk
[22:32] File "launcher.py", line 98
[22:32] IndentationError: unexpected indent
[22:32] ^
[22:32] alex@angstrom:/remote/storage5$ python launcher.py list066-ah
[22:32] os.system('7z a -ms=off ../%s-history.xml.7z %s-history.xml %s-titles.txt index.html Special:Version.html errors.log' % (prefix, prefix, prefix))
[22:32] SketchCow: <3
[22:33] oh wait
[22:33] dammit, SketchCow
[22:33] :P
[22:34] Nemo_bis: ^
[22:37] underscor: can't you just fix it? you're the programmer :p https://code.google.com/p/wikiteam/source/browse/trunk/batchdownload/launcher.py
[22:38] Nemo_bis: I don't have svn set up here
[22:38] you just have an extra space
[22:38] on line 98
[22:38] oh wait
[22:38] I can just edit online
[22:38] WOAH
[22:45] yes
[23:26] underscor: how is it going?
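
A minimal sketch of two ideas from the log: skipping dumps already recorded in the upload log (the [09:36] remark that nothing is overwritten if the log is kept) and adding the header suggested at [22:03] when re-uploading to an existing item. This is not the actual wikiteam uploader. The function names, the one-filename-per-line log format, the s3.us.archive.org endpoint, and the "LOW key:secret" authorization header are assumptions for illustration, and the log itself is unsure whether the x-archive-ignore-preexisting-bucket header is the right one.

```python
import os


def already_uploaded(log_path):
    """Return the set of dump filenames recorded in the upload log.

    Assumes a hypothetical log format: one uploaded filename per line.
    """
    if not os.path.exists(log_path):
        return set()
    with open(log_path) as f:
        return {line.strip() for line in f if line.strip()}


def pending_uploads(dump_files, log_path):
    """Keep only the dumps whose basename is not yet in the log."""
    done = already_uploaded(log_path)
    return [f for f in dump_files if os.path.basename(f) not in done]


def curl_upload_command(identifier, filename, access_key, secret_key,
                        update_existing=False):
    """Build (but do not run) a curl argument list for one upload.

    The endpoint and authorization scheme are assumptions, not taken
    from the log. The extra header is the one proposed at [22:03].
    """
    cmd = ["curl", "--location",
           "--header", "authorization: LOW %s:%s" % (access_key, secret_key)]
    if update_existing:
        # Header quoted in the log; whether it refreshes item metadata
        # as hoped is unconfirmed there.
        cmd += ["--header", "x-archive-ignore-preexisting-bucket:1"]
    cmd += ["--upload-file", filename,
            "http://s3.us.archive.org/%s/%s"
            % (identifier, os.path.basename(filename))]
    return cmd
```

The command is returned as a list so it can be inspected or handed to subprocess.run without shell quoting issues; nothing is sent until the caller chooses to run it.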