[21:00] Hmm, I should throw an issue in
[21:01] Used dumpgenerator.py to grab a dump of http://www.irc-wiki.org/ , but it didn't grab all the pages in http://www.irc-wiki.org/index.php?title=Special%3AAllPages&from=&to=&namespace=274
[21:02] I think it might've gotten one or two pages at first, but the rest were missing (tried setting it up locally to make sure it worked, had to grab the Widget pages and import them into my local instance myself)
[21:06] danneh_: can you please paste the exact command you used?
[21:07] Nemo_bis: ./dumpgenerator.py --api=http://www.irc-wiki.org/api.php --xml --images --delay=10
[21:08] The dump it produced is here, if you wanna take a look: https://archive.org/details/wiki-irc_wikiorg
[21:08] Didn't realise it missed a few 'til I uploaded it, unfortunately
[21:13] danneh_: what page is missing, specifically?
[21:15] The titles list finds them all.
[21:15] So they must have failed at some point in download.
[21:15] Hmm, gimmie a sec
[21:18] Special:Export downloads them all (though not as file, the webserver must have some issue).
[21:18] Wait a minute, they're all in the history file, just grepped it
[21:18] Damn, I must've munged the importing, not had the correct Namespaces setup or something
[21:18] Sorry, should have checked that out before!
[21:20] danneh_: ah, you tried to import them somewhere else?
[21:21] I tried to import it to a new mediawiki instance (wanted to check it out, make sure my dumps actually worked properly)
[21:21] It may not be your fault, import is quite hacky. Namespaces shouldn't prevent download, unless they make some titles illegal.
[21:21] It might be because I didn't have the widget extension loaded when I imported the history, and then I loaded it later
[21:22] There are many reasons an import can fail... it's very useful that you're testing that, because very few do. Be sure to report issues to bugzilla.wikimedia.org, even though you're not sure they're actual bugs.
[21:22] Could've refused to import, or could've wiped the namespace when it was first loaded
[21:24] Sure. I'll have a bit more of a play, see what I can come up with
[21:24] danneh_: ah, I know what happened
[21:24] Your pages are not gone, they're just hidden
[21:25] danneh_: you need to run this script, then they'll magically reappear https://www.mediawiki.org/wiki/Manual:NamespaceDupes.php
[21:28] hmm, fair enough
[21:29] it's giving me a bit of an error, but that could also be because I was silly and readded those missing pages
[21:30] Ah, then you'll need the --suffix
[21:34] Hmm: http://pastebin.com/DX81f2Vs
[21:35] Don't worry too much about it, I'll probably end up wiping the wikidata and importing from scratch
[21:36] Also just wondering, because the 'main page' that's created by MediaWiki by default is newer than the one I'm importing, after I import I need to go into the history of Main_Page and set it as it was one or two versions ago. Would you possibly know a way to avoid that, deleting Main_Page completely before I bother importing anything, perhaps?
[22:08] Not really, import always overwrites whatever you already have
[22:18] Hmm, probably just a difference between how Special:Import and importDump.php work
[22:19] Special:Import made it so all the history for Main_Page was there, but the most recent change was MediaWiki user setting it to the default mediawiki main page, but importDump seems to have just set it correctly off the bat \o/
[22:45] Nemo_bis: Just wondering, any ideas with this? http://pastebin.com/pNxuqt0a
[22:45] All else fails, I'll have a chat to the MediaWiki guys later
[23:01] Wiki is running hot!
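
Note on the 21:15-21:18 exchange: the check done there by hand (grepping the history file for the titles that looked missing) can be roughly sketched in Python. This is only an illustration and not part of dumpgenerator.py. The function names, the sample XML and the page titles below are made up for the example, and it assumes the dump is a standard MediaWiki XML export in which every page has a <title> element.

```python
import io
import xml.etree.ElementTree as ET


def titles_in_dump(xml_source):
    """Return the set of page titles found in a MediaWiki XML export."""
    found = set()
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        # Export tags carry an XML namespace, e.g. "{http://...}title";
        # strip it so the check does not depend on the schema version.
        tag = elem.tag.rsplit("}", 1)[-1]
        if tag == "title" and elem.text:
            found.add(elem.text.strip())
        elif tag == "page":
            elem.clear()  # release the page's content once it has been read
    return found


def missing_titles(expected_titles, xml_source):
    """Return the expected titles that do not appear in the dump, sorted."""
    expected = {t.strip() for t in expected_titles if t.strip()}
    return sorted(expected - titles_in_dump(xml_source))


# Hypothetical miniature dump, standing in for the real history XML file.
SAMPLE_DUMP = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.8/">
  <page><title>Main Page</title><revision><text>hello</text></revision></page>
  <page><title>Widget:Example</title><revision><text>x</text></revision></page>
</mediawiki>"""

if __name__ == "__main__":
    # Hypothetical expected titles, e.g. copied from the wiki's AllPages list.
    expected = ["Main Page", "Widget:Example", "Widget:Other"]
    print(missing_titles(expected, io.BytesIO(SAMPLE_DUMP)))
```

For a real dump, an open file (or a path) for the history XML would be passed in place of the in-memory sample. An empty result would suggest, as turned out to be the case here, that the pages were downloaded fine and the problem lies on the import side.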