#wikiteam 2014-02-11,Tue

Time Nickname Message
21:00 🔗 danneh_ Hmm, I should throw an issue in
21:01 🔗 danneh_ Used dumpgenerator.py to grab a dump of http://www.irc-wiki.org/ , but it didn't grab all the pages in http://www.irc-wiki.org/index.php?title=Special%3AAllPages&from=&to=&namespace=274
21:02 🔗 danneh_ I think it might've gotten one or two pages at first, but the rest were missing (tried setting it up locally to make sure it worked, had to grab the Widget pages and import them into my local instance myself)
21:06 🔗 Nemo_bis danneh_: can you please paste the exact command you used?
21:07 🔗 danneh_ Nemo_bis: ./dumpgenerator.py --api=http://www.irc-wiki.org/api.php --xml --images --delay=10
21:08 🔗 danneh_ The dump it produced is here, if you wanna take a look: https://archive.org/details/wiki-irc_wikiorg
21:08 🔗 danneh_ Didn't realise it missed a few 'til I uploaded it, unfortunately
21:13 🔗 Nemo_bis danneh_: what page is missing, specifically?
21:15 🔗 Nemo_bis The titles list finds them all.
21:15 🔗 Nemo_bis So they must have failed at some point in download.
21:15 🔗 danneh_ Hmm, gimmie a sec
21:18 🔗 Nemo_bis Special:Export downloads them all (though not as file, the webserver must have some issue).
21:18 🔗 danneh_ Wait a minute, they're all in the history file, just grepped it
21:18 🔗 danneh_ Damn, I must've munged the importing, not had the correct Namespaces setup or something
21:18 🔗 danneh_ Sorry, should have checked that out before!
21:20 🔗 Nemo_bis danneh_: ah, you tried to import them somewhere else?
21:21 🔗 danneh_ I tried to import it to a new mediawiki instance (wanted to check it out, make sure my dumps actually worked properly)
21:21 🔗 Nemo_bis It may not be your fault, import is quite hacky. Namespaces shouldn't prevent download, unless they make some titles illegal.
21:21 🔗 danneh_ It might be because I didn't have the widget extension loaded when I imported the history, and then I loaded it later
21:22 🔗 Nemo_bis There are many reasons an import can fail... it's very useful that you're testing that, because very few do. Be sure to report issues to bugzilla.wikimedia.org, even if you're not sure they're actual bugs.
21:22 🔗 danneh_ Could've refused to import, or could've wiped the namespace when it was first loaded
21:24 🔗 danneh_ Sure. I'll have a bit more of a play, see what I can come up with
21:24 🔗 Nemo_bis danneh_: ah, I know what happened
21:24 🔗 Nemo_bis Your pages are not gone, they're just hidden
21:25 🔗 Nemo_bis danneh_: you need to run this script, then they'll magically reappear https://www.mediawiki.org/wiki/Manual:NamespaceDupes.php
21:28 🔗 danneh_ hmm, fair enough
21:29 🔗 danneh_ it's giving me a bit of an error, but that could also be because I was silly and readded those missing pages
21:30 🔗 Nemo_bis Ah, then you'll need the --suffix
21:34 🔗 danneh_ Hmm: http://pastebin.com/DX81f2Vs
21:35 🔗 danneh_ Don't worry too much about it, I'll probably end up wiping the wikidata and importing from scratch
21:36 🔗 danneh_ Also just wondering, because the 'main page' that's created by MediaWiki by default is newer than the one I'm importing, after I import I need to go into the history of Main_Page and set it as it was one or two versions ago. Would you possibly know a way to avoid that, deleting Main_Page completely before I bother importing anything, perhaps?
22:08 🔗 Nemo_bis Not really, import always overwrites whatever you already have
22:18 🔗 danneh_ Hmm, probably just a difference between how Special:Import and importDump.php work
22:19 🔗 danneh_ Special:Import made it so all the history for Main_Page was there, but the most recent change was MediaWiki user setting it to the default mediawiki main page, but importDump seems to have just set it correctly off the bat \o/
22:45 🔗 danneh_ Nemo_bis: Just wondering, any ideas with this? http://pastebin.com/pNxuqt0a
22:45 🔗 danneh_ All else fails, I'll have a chat to the MediaWiki guys later
23:01 🔗 SketchCow Wiki is running hot!
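
Note on the 21:18 check: danneh_ confirmed the Widget pages were in the history file by grepping it. A rough Python equivalent is sketched below, in case a title-level check is handier than grep. Everything named here is an assumption for illustration and not part of dumpgenerator.py: the helper `titles_in_dump`, the inline `SAMPLE` data, the `export-0.8` schema URI, and the guess that the namespace 274 pages carry a "Widget:" title prefix.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace-uri}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def titles_in_dump(source, prefix=""):
    """Yield page titles from a MediaWiki XML export that start with `prefix`.

    `source` is a filename or a binary file object (hypothetical helper).
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) == "page":
            for child in elem:
                if local_name(child.tag) == "title":
                    title = child.text or ""
                    if title.startswith(prefix):
                        yield title
                    break
            elem.clear()  # drop the finished page to keep memory use low


# Tiny stand-in for a real history XML file (assumed shape, not real data).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.8/">
  <page><title>Main Page</title><revision><text>hi</text></revision></page>
  <page><title>Widget:Example</title><revision><text>w</text></revision></page>
</mediawiki>"""

found = list(titles_in_dump(io.BytesIO(SAMPLE), prefix="Widget:"))
print(found)  # ['Widget:Example']
```

To run it against a real dump, pass the path of the history XML file instead of the `io.BytesIO` object. If the expected titles show up, the download worked and the problem is on the import side, which is what the conversation concluded.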
