[13:11] awesome scan
[14:53] http://i.imgur.com/cZrr3bS.gif ARCHIVE TEAM HERE TO SAVE THE DAY
[14:53] The manuals are STILL uploading
[14:57] Anything that needs my attention?
[15:03] Well, apparently it's "National Preservation Week" (http://www.ala.org/alcts/confevents/preswk/origins)... which is every week for us.
[15:04] http://www.globalpost.com/sites/default/files/imagecache/gp3_slideshow_large/photos/2013-April/download_all_the_things.jpg
[15:06] http://image.slidesharecdn.com/lovenotestothefuture-111129201939-phpapp01/95/slide-17-728.jpg?cb=1322619877
[15:07] Ugh, don't remind me. Adding metadata to all of these fanzines I've got is a PITA... but I do love the future.
[15:08] http://image.slidesharecdn.com/ux-of-urls-130820105348-phpapp02/95/slide-55-638.jpg?cb=1377014195
[15:09] I'm sorry, did I miss the memo where we're only supposed to communicate via images today? ;-)
[15:14] http://i.imgur.com/EKIb8Rt.jpg
[15:15] lol... I've got nothing.
[15:20] how many, SadDM?
[15:20] I'm uploading 1200+ PDFs of radio stuff to SketchCow.... I haven't added metadata... ;)
[15:21] I've downloaded 500+ so far and am starting to sort and upload those.
[15:21] If you want to follow my progress: http://archive.org/search.php?query=uploader%3A%22aeakett%40gmail.com%22%20AND%20subject%3A%22fanzine%22&sort=-publicdate
[15:23] When you say uploading to me.... you mean archive.org?
[15:34] i mean fos.
[15:34] the ftp.
[15:34] I wouldn't upload to archive.org without metadata :O
[15:47] SketchCow: I have some PDFs which have been sitting on my hdd for too long as well
[16:05] Smiley: I meant SadDM.
[16:06] No, wait.
[16:06] Wow, you two talked at the same time I was distracted.
[16:06] Nemo_bis: Upload that stuff
[16:06] I'm forcing the issue with metadata
[16:06] Want to build an interface for it.
[16:10] SketchCow: no worries.
[16:26] SketchCow: I'm uploading to IA
[16:50] Got it.
[16:50] More, more!
[16:50] https://www.youtube.com/watch?v=L7SkrYF8lCU
[16:56] is the subtext there that I should murder somebody for their bandwidth?
[17:47] Yes
[19:57] Another 100+ ezines going up as we speak. Unfortunately that's about the end of big runs. Now the *real* sorting begins.
[20:37] uploading my jamendo api scrapes, i forgot them earlier when i uploaded the dbdumps
[20:37] need hdd space, so...
[21:51] http://archiveteam.org/index.php?title=4chan doesn't list http://fgts.eu/
[21:52] probably because it's very new afaik
[21:53] they have a great list of other Fuuka archives here http://fgts.eu/_/articles/faq/
[21:54] http://www.comicbookresources.com/?page=article&id=52513
[21:54] Can people take it on?
[21:56] They are leaving the forums up for 14 days at the oldforums link and then deleting it all.
[22:10] That's a lot of posts
[22:11] if the vbulletin lua script works with it, it should be fairly easy, but I've found it to be hit and miss
[22:26] OK, it looks like the lua script doesn't work.
[22:26] Anybody here fluent in lua?
[22:27] I think it's getting tripped up by the forum id being more than just a number (forumdisplay.php?23-Community-Forum instead of forumdisplay.php?23)
[22:37] oho
[22:37] I think I might have actually reworked the vbulletin script to work with that style
[22:37] let me check
[22:39] dingding yep
[22:40] http://files.q931.fastmail.fm/vbulletin.lua
[22:40] it should be properly worked into the thing, but I didn't feel like that
[22:41] SadDM: ^
[22:45] nice! I'm sitting down to eat now, but will try it later.
[23:13] exmic: still no love
[23:14] hm, ok
[23:20] what's going on?
[23:21] well, all I can tell you is that your script downloads the same 50 urls as the original one.
[23:23] hm
[23:23] ok, does line 30 add the urls found to a list of some kind? to retrieve later?
because it still has "f="
[23:24] maybe I didn't actually run that successfully
[23:24] I have no idea, the file is like a year old
[23:24] hah, may 13 2013
[23:24] fair enough
[23:24] but I think all you should need to do is change the regexp
[23:25] because if you use the shorter style of url, it'll redirect you to the longer one
[23:25] on line 28?
[23:25] and warc of course will capture all the requests
[23:25] I mean use the existing vbulletin script and modify it
[23:26] yours isn't that different. it looks like you modified the regexps.
[23:26] this *might* be enough to get me started
[23:28] that's about what I did, yes
[23:28] you have to match a different url and generate something slightly different
[23:28] the important bit is you grab the number and you can usually ditch the slug
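
Editor's sketch of the "grab the number, ditch the slug" idea discussed above. This is not the vbulletin.lua script from the log; it is written in Python purely for illustration. The regular expression, the function names `extract_id` and `short_url`, and the example.com base URL are all assumptions. It only shows how one pattern could accept both the older `f=`/`t=` links and the slug-style links, then rebuild the short form that, per the discussion, redirects to the long one.

```python
import re

# Hypothetical pattern: accepts forumdisplay.php?f=23, forumdisplay.php?23
# and forumdisplay.php?23-Community-Forum (likewise for showthread.php).
VB_LINK = re.compile(r"(forumdisplay|showthread)\.php\?(?:[ft]=)?(\d+)")


def extract_id(url):
    """Return (page, numeric_id) for a forum or thread link, else None."""
    match = VB_LINK.search(url)
    if match is None:
        return None
    return match.group(1), int(match.group(2))


def short_url(base, page, numeric_id):
    """Build the short link; the slug after the number is dropped."""
    return f"{base}/{page}.php?{numeric_id}"


if __name__ == "__main__":
    found = [
        "http://example.com/forumdisplay.php?23-Community-Forum",
        "http://example.com/forumdisplay.php?23",
        "http://example.com/showthread.php?4567-Some-Thread-Title",
    ]
    # Collapse slug and non-slug variants into one queue entry each.
    queue = {extract_id(u) for u in found if extract_id(u)}
    for page, numeric_id in sorted(queue):
        print(short_url("http://example.com", page, numeric_id))
```

Keying the queue on (page, number) instead of the raw URL means the slug and non-slug variants of the same forum are only fetched once. Whether the short form actually redirects depends on the forum's configuration.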