[10:39] hey, i was trying to dump http://wiki.quakeworld.nu/ but no success. their server is a bit misconfigured so i guess that might be it?
[10:39] i tried: python2 /mnt/ramdisk/wikiteam-read-only/dumpgenerator.py --index=http://wiki.quakeworld.nu/ --xml --images --delay=2
[10:39] and got a python error in return in CleanHTML
[15:12] Schbirid: http://hastebin.com/wacuxijavu
[15:15] Nemo_bis: ^
[15:15] you broke it in r842; see paste for the fix
[15:18] sweet!
[16:51] balrog: ok thanks, will commit in a moment
[17:11] sadly my downloads are all failing with weird errors
[17:25] For http://www.editthis.info/1337/Main_Page it's because it can't find the index.php
[17:49] https://code.google.com/p/wikiteam/issues/detail?id=49 wouldn't harm
[18:13] ok, that may be working now
[18:14] now why do I get timeouts for URLs such as http://1605.wiki-site.com/api.php which load fine in browser http://p.defau.lt/?wYcu3e3yU_adEePS0Lm__Q
[18:59] ah well, no wonder, it doesn't resolve on my server
[19:02] actually, not true
[19:05] ah, now I get 403 also at home, so some stupid throttling
[19:51] Okay done.
[20:07] w0rp: done what? :)
[20:08] I joined the channel.
[20:26] w0rp: ah, saw your comment on the other channel now :) I'm afraid we don't produce WARC files here
[20:26] those would be extremely expensive to generate, instead we export the underlying data from which the HTML is produced
[20:27] do you have some specific MediaWiki site in mind?
[20:30] I'm using the Python dump script now to take a copy of a site probably only I care about. tanasinn.info
[20:31] w0rp: good; please add it to the index we most work with nowadays http://wikiapiary.com/wiki/Special:FormEdit/Website
[20:33] hm, it's a 2007 wiki but it wasn't in Pavlo's list... we really need better lists of wikis
[20:36] It's pretty much about a bunch of stuff strange people who speak English around the world discovered from strange Japanese speaking HTML BBS people.
[20:37] http://tanasinn.info/wiki/Kopipe:PIG_DISGUSTING For example.
[20:46] omg
[20:49] interesting, hmm http://halcy.de/pages/bottrop
[20:52] That's a really cool trick, and it's been in use with 2-ch style BBSes for a while.
[20:52] *2ch