Time |
Nickname |
Message |
06:52
🔗
|
exmic |
is someone grabbing tcrf.net ? I see that archivebot is munching through the history |
06:53
🔗
|
* |
exmic starts a dumpgenerator |
06:53
🔗
|
exmic |
hopefully this will work quickly and we can tell archivebot to ignore history pages |
06:54
🔗
|
exmic |
.............................................................................................................. 54946 titles retrieved in the namespace 6 |
06:54
🔗
|
exmic |
holy |
07:52
🔗
|
Nemo_bis |
That's all spam. |
07:53
🔗
|
Nemo_bis |
Ah, maybe not. https://www.mediawiki.org/wiki/Manual:Namespace#Built-in_namespaces |
07:55
🔗
|
Nemo_bis |
exmic: 6 GB in April https://archive.org/details/wiki-tcrfnet , dunno if a duplicate would be in compliance with SketchCow's rules. :) |
07:55
🔗
|
exmic |
dupes are fiiiine |
07:55
🔗
|
Nemo_bis |
:D |
07:56
🔗
|
Nemo_bis |
Suggestions for keywords? Seems a cute wiki. |
07:56
🔗
|
exmic |
I'm also doing an images dump, which that may not include |
07:56
🔗
|
Nemo_bis |
SketchCow: do you think it would be possible to make your keyword-machine also unpack 7z files and parse XML? :) We could add keywords to thousands wikis. (Yes, of course I'd try to code it myself.) |
07:57
🔗
|
Nemo_bis |
6 GB is the image dump, history is only 20 MB |
07:57
🔗
|
exmic |
ahh |
07:57
🔗
|
exmic |
well, doing it anyway |
08:00
🔗
|
Nemo_bis |
:) |
16:44
🔗
|
Nemo_bis |
ttp://trends.builtwith.com/cms/MediaWiki |
16:44
🔗
|
Nemo_bis |
* http://trends.builtwith.com/cms/MediaWiki |