Time |
Nickname |
Message |
10:39
🔗
|
Schbirid |
hey, i was trying to dump http://wiki.quakeworld.nu/ but no success. their server is a bit misconfigured so i guess that might be it? |
10:39
🔗
|
Schbirid |
i tried: python2 /mnt/ramdisk/wikiteam-read-only/dumpgenerator.py --index=http://wiki.quakeworld.nu/ --xml --images --delay=2 |
10:39
🔗
|
Schbirid |
and got a python error in return in CleanHTML |
15:12
🔗
|
balrog |
Schbirid: http://hastebin.com/wacuxijavu |
15:15
🔗
|
balrog |
Nemo_bis: ^ |
15:15
🔗
|
balrog |
you broke it in r842; see paste for the fix |
15:18
🔗
|
Schbirid |
sweet! |
16:51
🔗
|
Nemo_bis |
balrog: ok thanks, will commit in a moment |
17:11
🔗
|
Nemo_bis |
sadly my downloads are all failing with weird errors |
17:25
🔗
|
Nemo_bis |
For http://www.editthis.info/1337/Main_Page it's because it can't find the index.php |
17:49
🔗
|
Nemo_bis |
https://code.google.com/p/wikiteam/issues/detail?id=49 wouldn't harm |
18:13
🔗
|
Nemo_bis |
ok, that may be working now |
18:14
🔗
|
Nemo_bis |
now why do I get timeouts for URLs such as http://1605.wiki-site.com/api.php which load fine in browser http://p.defau.lt/?wYcu3e3yU_adEePS0Lm__Q |
18:59
🔗
|
Nemo_bis |
ah well, no wonder, it doesn't resolve on my server |
19:02
🔗
|
Nemo_bis |
actually, not true |
19:05
🔗
|
Nemo_bis |
ah, now I get 403 also at home, so some stupid throttling |
19:51
🔗
|
w0rp |
Okay done. |
20:07
🔗
|
Nemo_bis |
w0rp: done what? :) |
20:08
🔗
|
w0rp |
I joined the channel. |
20:26
🔗
|
Nemo_bis |
w0rp: ah, saw your comment on the other channel now :) I'm afraid we don't produce WARC files here |
20:26
🔗
|
Nemo_bis |
those would be extremely expensive to generate, instead we export the underlying data from which the HTML is produced |
20:27
🔗
|
Nemo_bis |
do you have some specific MediaWiki site in mind? |
20:30
🔗
|
w0rp |
I'm using the Python dump script now to take a copy of a site probably only I care about. tanasinn.info |
20:31
🔗
|
Nemo_bis |
w0rp: good; please add it to the index we most work with nowadays http://wikiapiary.com/wiki/Special:FormEdit/Website |
20:33
🔗
|
Nemo_bis |
hm, it's a 2007 wiki but it wasn't in Pavlo's list... we really need better lists of wikis |
20:36
🔗
|
w0rp |
It's pretty much about a bunch of stuff strange people who speak English around the world discovered from strange Japanese speaking HTML BBS people. |
20:37
🔗
|
w0rp |
http://tanasinn.info/wiki/Kopipe:PIG_DISGUSTING For example. |
20:46
🔗
|
Nemo_bis |
omg |
20:49
🔗
|
Nemo_bis |
interesting, hmm http://halcy.de/pages/bottrop |
20:52
🔗
|
w0rp |
That's a really cool trick, and it's been in use with 2-ch style BBSes for a while. |
20:52
🔗
|
w0rp |
*2ch |