[06:52] is someone grabbing tcrf.net ? I see that archivebot is munching through the history [06:53] * exmic starts a dumpgenerator [06:53] hopefully this will work quickly and we can tell archivebot to ignore history pages [06:54] .............................................................................................................. 54946 titles retrieved in the namespace 6 [06:54] holy [07:52] That's all spam. [07:53] Ah, maybe not. https://www.mediawiki.org/wiki/Manual:Namespace#Built-in_namespaces [07:55] exmic: 6 GB in April https://archive.org/details/wiki-tcrfnet , dunno if a duplicate would be in compliance with SketchCow's rules. :) [07:55] dupes are fiiiine [07:55] :D [07:56] Suggestions for keywords? Seems a cute wiki. [07:56] I'm also doing an images dump, which that may not include [07:56] SketchCow: do you think it would be possible to make your keyword-machine also unpack 7z files and parse XML? :) We could add keywords to thousands wikis. (Yes, of course I'd try to code it myself.) [07:57] 6 GB is the image dump, history is only 20 MB [07:57] ahh [07:57] well, doing it anyway [08:00] :) [16:44] ttp://trends.builtwith.com/cms/MediaWiki [16:44] * http://trends.builtwith.com/cms/MediaWiki