#archiveteam 2014-03-25,Tue


Time Nickname Message
05:20 🔗 DFJustin so http://mjh.playpark.net/notices/details/mahjonghime_official_closure_notice - I grabbed the main site through archivebot but there are forums
05:21 🔗 DFJustin because of how the forums are structured, archivebot can't really get them without downloading hundreds of thousands of threads in other subforums
05:45 🔗 exmic there are forum scraper lua scripts for wget on the archiveteam github, do any of them apply?
05:45 🔗 exmic https://github.com/ArchiveTeam/wget-lua-forum-scripts
06:19 🔗 SketchCow godane: The dark collections was a bug unrelated to me, and the collections will be darked again.
12:57 🔗 SadDM exmic DFJustin: I've used some of those scripts and they're fantastic. Also, there is one for this particular flavour of forum software. Since the forum in question is just a sub-section of a larger board though, grabbing it individually resists a naive attempt. I'm currently grabbing all the forums (2.7M posts and several thousand topics). I'll let you know how it goes.
15:55 🔗 SadDM DFJustin: the lua script got caught in a wicked loop... so that didn't work. Fortunately the sub-forum in question was fairly small, so I manually built a URL list and grabbed it like that.
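[Editor's note: the log does not show how the URL list was built. The following is only a minimal sketch of the general idea, assuming a phpBB-style `viewtopic.php?t=<id>&start=<offset>` URL scheme and a hand-collected mapping of topic IDs to post counts. The base URL, the topic IDs, the page size of 20, and the function names are all hypothetical and are not taken from the actual grab.]

```python
from urllib.parse import urlencode


def build_url_list(base_url, topics, per_page=20):
    """Return one URL per page of every topic.

    `topics` maps a topic id to its post count. The viewtopic.php
    pattern and the `t`/`start` parameters are assumptions about a
    phpBB-style board, not the scheme of the forum in the log.
    """
    urls = []
    for topic_id, post_count in sorted(topics.items()):
        # Ceiling division; an empty topic still has one page.
        pages = max(1, -(-post_count // per_page))
        for page in range(pages):
            query = {"t": topic_id}
            if page:
                # Later pages are addressed by post offset.
                query["start"] = page * per_page
            urls.append(f"{base_url}/viewtopic.php?{urlencode(query)}")
    return urls


def write_url_list(urls, path):
    """Write the URLs one per line, ready for a downloader to read."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(urls) + "\n")


if __name__ == "__main__":
    # Hypothetical topic ids and post counts for a small sub-forum.
    example = build_url_list("http://forum.example", {3: 5, 7: 45})
    write_url_list(example, "urls.txt")
```

[A list written this way could then be handed to wget with `--input-file=urls.txt`, together with `--warc-file` if a WARC is wanted. Whether that matches what was actually done here is not stated in the log.]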
15:55 🔗 SadDM SketchCow: when you have a moment could you move https://archive.org/details/mahjong_hime_forums-20140325 to the archive team collection?
15:58 🔗 DFJustin \o/
16:18 🔗 exmic hm, that's good
16:20 🔗 SadDM For the record, I heartily endorse using those lua scripts to grab forums. The phpbb ones have worked great for me so far.
16:21 🔗 exmic :D
16:38 🔗 DFJustin hmm those would be awfully nice to have as an archivebot parameter
16:46 🔗 Jonimus is there a link somewhere to these scripts?
16:48 🔗 Jonimus I'd guess github but I don't see a generic forum grab repo.
16:49 🔗 SadDM Jonimus: https://github.com/ArchiveTeam/wget-lua-forum-scripts
16:50 🔗 Jonimus thanks for solving my blindness, I will have to put this to good use soon.
20:17 🔗 SketchCow SadDM: Done.
