[01:15] hmmm [01:16] I have a wiki looping infinitely on "XML for Main_Page is incorrect" [01:16] Solution? [06:50] underscor: ctrl-c [06:51] that's what I've been doing [06:51] but then how do I pipe the output to create a log? [06:52] underscor: a log of what? [06:53] the log of output [06:53] like http://p.defau.lt/?3qpqOZdAwinhcBFqCAg4vg [06:53] underscor: interrupting that dump doesn't kill the launcher.py process [06:54] oh, I thought it would kill the > bit [06:54] ok [18:04] Nemo_bis: do you remember which script we used to discard dead wikis of Andrew Pavlo 22,000 wikis list? [18:09] Nemo_bis: checkalive.py ok [18:12] emijrp: you did it all :) [18:12] emijrp: do you have a new list to run it on? [18:15] no [18:37] this failed to download he images, are corrupt http://archive.org/details/wiki-startrekfreedomcom_wiki [18:51] emijrp: differently than in the Citizendium dump, the list was created [18:52] oic, images were downloaded but are bogus [18:54] O_o The requested method POST is not allowed for the URL /wiki/images/9/9d/1238704362854102575papapishu_albatross_2_svg_med.png. [18:54] mebbe the downloader should check HTTP headers [19:01] emijrp: I'm downloading those images for real now [19:02] there is a bug downloading the .desc files [19:02] im fixing now [19:03] ah, also .desc? [19:03] I'm downloading only files [19:12] how only files? [19:13] .desc are downloaded with every file [19:26] emijrp: too lazy tor erun the script right now, just wget'ing everything [19:26] actually, already did [19:31] surely, it is not the only dump with corrupt files [19:32] i check some more and are ok, but corrupt image downloads is an old issue [19:38] emijrp: do they all give error 405? [19:38] I could just grep for it [19:38] i dont know [19:58] emijrp: there's lots of them, I'll send you a list [19:59] lots of what? dumps with corrupt imageS? [19:59] with 405 errors on images