[00:04] Trying it
[00:04] This program is a little fatty
[00:06] I can live with it.
[00:06] It needed bison, flex, libtool and I assume some sort of hooker delivering pizza
[00:06] who doesn't?
[00:07] sorry about that - I just remembered using it to look at some stuff during the ptch effort, but I wasn't paying much attention beyond 'someone else mentioned it'
[00:07] it seemed to do the job though
[00:09] Baljem: that looks like it might come in handy for the dayjob... *bookmarked*
[00:09] Except for how fatty it is, and I mean it should be renamed Notorious J.Q., it does do the job perfectly.
[00:09] Once you learn its crazy little moon language.
[00:27] root@teamarchive0:/0/CDROMS/homelessnation-bliptv-2013.12# ../homeland "youthskillszone_0002-20061208-112888"
[00:27] UPLOAD DATE: 20061208
[00:28] TITLE: youthskillszone_0002
[00:28] DESCRIPTION:
[00:28] URL Basename: youthskillszone_0002-116485
[00:28] So good.
[00:30] it took me far too long to parse that as Youth Skills Zone, instead of some sort of incitement to underage homicide
[04:52] http://www.computerworld.com.au/article/536478/target_breach_unfolds_information_vanishes_from_web/
[07:01] "cloud party joins yahoo" http://www.reddit.com/r/shutdown/comments/1w8dbj/cloudparty_shuts_down/
[07:14] anyone know perl? some porting to lua needed for https://github.com/ArchiveTeam/dogster-grab/blob/master/fliqz.lua
[07:57] root@teamarchive0:/0/CDROMS/upload_in_progress_do_not_delete# tar vtf 4chandata.tar | wc -l
[07:57] 375793
[07:57] Advice: Don't actually look at these 375,793 images
[08:10] lol
[08:13] now I'm just tempted to look!
[16:38] is anyone actively grabbing http://www.oldgamemags.com ?
[16:51] So, can somebody explain to me in broad strokes what the issues are with using wget to mirror forums?
[16:53] SadDM: wget is almost guaranteed to get lost in search pages and such
[16:53] i'm grabbing parts of it
[16:55] SadDM: imagine a calendar function with a "next month" button
[16:55] or search, yeah
[16:55] or gazillion of "you are not authorised" pages for PMs to people, profiles, etc
[16:56] Schbirid: i uploaded his NGC Magazine collection: https://archive.org/details/ngc_magazine
[16:57] Oh, OK. I think I'm starting to understand.
[16:57] SadDM: imagine a calendar function with a "next month" button
[16:57] hehe
[16:57] the dreaded calendar
[16:58] calendars are probably archivebot's worst enemy
[16:58] arch nemesis kind of enemy
[16:58] to be honest, i have never let a grab run until 2038 ;)
[16:59] So I guess that currently the only sane way to do it would be to pre-scrape out forum and thread ids and then assemble a bunch of individual wgets... ugh.
[16:59] SadDM: ignore patterns :)
[16:59] is that something built into wget?
[17:00] SadDM: check out our wiki
[17:00] * SadDM runs off to man wget
[17:00] godane: would be ace if you could add attribution to the scanners at least
[17:02] SadDM: eg http://archiveteam.org/index.php?title=PhpBB http://archiveteam.org/index.php?title=VBulletin
[17:03] oh nice... thanks
[23:13] just got an email saying that canv.as is being shut down
[23:13] ... for what it's worth
[23:14] I'm working with moot on it.
[23:14] woot to moot
[23:14] :D I expected as much but I didn't want to assume anything.
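
Editor's note on the "ignore patterns" suggestion from the 16:51-17:03 wget discussion: the idea can be sketched roughly as below. This is an illustration only, not the patterns from the Archive Team wiki or any tool's actual configuration. The regexes, the `forum.example.com` URLs, and the function names `should_fetch` and `filter_urls` are all hypothetical, chosen to mirror the traps named in the chat (search pages, calendars, private-message pages). On the "is that built into wget?" question, the GNU wget manual documents a `--reject-regex` option that serves a similar purpose; whether it fits a given forum is something to check against the manual.

```python
import re

# Hypothetical ignore patterns for a phpBB-style forum. These are
# assumptions for illustration, not a vetted list for any real site.
IGNORE_PATTERNS = [
    r"/search\.php",      # search result pages
    r"/calendar\.php",    # the endless "next month" calendar
    r"[?&]i=pm\b",        # private-message pages ("not authorised")
    r"/memberlist\.php",  # member profiles and listings
]


def should_fetch(url, patterns=IGNORE_PATTERNS):
    """Return True if the URL matches none of the ignore patterns."""
    return not any(re.search(p, url) for p in patterns)


def filter_urls(urls, patterns=IGNORE_PATTERNS):
    """Keep only the URLs a crawler should follow, preserving order."""
    return [u for u in urls if should_fetch(u, patterns)]


if __name__ == "__main__":
    candidates = [
        "http://forum.example.com/viewtopic.php?t=42",
        "http://forum.example.com/search.php?keywords=foo",
        "http://forum.example.com/calendar.php?month=2&year=2038",
        "http://forum.example.com/ucp.php?i=pm&mode=compose&u=7",
        "http://forum.example.com/viewforum.php?f=3",
    ]
    for url in filter_urls(candidates):
        print(url)
```

The point of the design is that the crawler still discovers links recursively, so no pre-scraping of forum and thread ids is needed; it simply declines to follow anything that matches a known trap.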