Time |
Nickname |
Message |
00:04
🔗
|
SketchCow |
Trying it |
00:04
🔗
|
SketchCow |
This program is a little fatty |
00:06
🔗
|
SketchCow |
I can live with it. |
00:06
🔗
|
SketchCow |
It needed bison, flex, libtool and I assume some sort of hooker delivering pizza |
00:06
🔗
|
Baljem |
who doesn't? |
00:07
🔗
|
Baljem |
sorry about that - I just remembered using it to look at some stuff during the ptch effort, but I wasn't paying much attention beyond 'someone else mentioned it' |
00:07
🔗
|
Baljem |
it seemed to do the job though |
00:09
🔗
|
SadDM |
Baljem: that looks like it might come in handy for the dayjob... *bookmarked* |
00:09
🔗
|
SketchCow |
Except for how fatty it is, and I mean it should be renamed Notorious J.Q., it does do the job perfectly. |
00:09
🔗
|
SketchCow |
Once you learn its crazy little moon language. |
00:27
🔗
|
SketchCow |
root@teamarchive0:/0/CDROMS/homelessnation-bliptv-2013.12# ../homeland "youthskillszone_0002-20061208-112888" |
00:27
🔗
|
SketchCow |
UPLOAD DATE: 20061208 |
00:28
🔗
|
SketchCow |
TITLE: youthskillszone_0002 |
00:28
🔗
|
SketchCow |
DESCRIPTION: |
00:28
🔗
|
SketchCow |
URL Basename: youthskillszone_0002-116485 |
00:28
🔗
|
SketchCow |
So good. |
00:30
🔗
|
Baljem |
it took me far too long to parse that as Youth Skills Zone, instead of some sort of incitement to underage homicide |
04:52
🔗
|
S[h]O[r]T |
http://www.computerworld.com.au/article/536478/target_breach_unfolds_information_vanishes_from_web/ |
07:01
🔗
|
chfoo |
"cloud party joins yahoo" http://www.reddit.com/r/shutdown/comments/1w8dbj/cloudparty_shuts_down/ |
07:14
🔗
|
chfoo |
anyone know perl? some porting to lua needed for https://github.com/ArchiveTeam/dogster-grab/blob/master/fliqz.lua |
07:57
🔗
|
SketchCow |
root@teamarchive0:/0/CDROMS/upload_in_progress_do_not_delete# tar vtf 4chandata.tar | wc -l |
07:57
🔗
|
SketchCow |
375793 |
07:57
🔗
|
SketchCow |
Advice: Don't actually look at these 375,793 images |
08:10
🔗
|
joepie91 |
lol |
08:13
🔗
|
BlueMax |
now I'm just tempted to look! |
16:38
🔗
|
Schbirid |
is anyone actively grabbing http://www.oldgamemags.com ? |
16:51
🔗
|
SadDM |
So, can somebody explain to me in broad strokes what the issues are with using wget to mirror forums? |
16:53
🔗
|
joepie91 |
SadDM: wget is almost guaranteed to get lost in search pages and such |
16:53
🔗
|
godane |
i'm grabbing parts of it |
16:55
🔗
|
Schbirid |
SadDM: imagine a calendar function with a "next month" button |
16:55
🔗
|
Schbirid |
or search, yeah |
16:55
🔗
|
Schbirid |
or gazillion of "you are not authorised" pages for PMs to people, profiles, etc |
16:56
🔗
|
godane |
Schbirid: i uploaded his NGC Magazine collection: https://archive.org/details/ngc_magazine |
16:57
🔗
|
SadDM |
Oh, OK. I think I'm starting to understand. |
16:57
🔗
|
joepie91 |
<Schbirid>SadDM: imagine a calendar function with a "next month" button |
16:57
🔗
|
joepie91 |
hehe |
16:57
🔗
|
joepie91 |
the dreaded calendar |
16:58
🔗
|
joepie91 |
calendars are probably archivebot's worst enemy |
16:58
🔗
|
joepie91 |
arch nemesis kind of enemy |
16:58
🔗
|
Schbirid |
to be honest, i have never let a grab run until 2038 ;) |
16:59
🔗
|
SadDM |
So I guess that currently the only sane way to do it would be to pre-scrape out forum and thread ids and then assemble a bunch of individual wgets... ugh. |
16:59
🔗
|
joepie91 |
SadDM: ignore patterns :) |
16:59
🔗
|
SadDM |
is that something built into wget? |
17:00
🔗
|
Schbirid |
SadDM: check out our wiki |
17:00
🔗
|
* |
SadDM runs off to man wget |
17:00
🔗
|
Schbirid |
godane: would be ace if you could add attribution to the scanners at least |
17:02
🔗
|
Schbirid |
SadDM: eg http://archiveteam.org/index.php?title=PhpBB http://archiveteam.org/index.php?title=VBulletin |
17:03
🔗
|
SadDM |
oh nice... thanks |
23:13
🔗
|
jfranusic |
just got an email saying that canv.as is being shut down |
23:13
🔗
|
jfranusic |
... for what it's worth |
23:14
🔗
|
SketchCow |
I'm working with moot on it. |
23:14
🔗
|
ersi |
woot to moot |
23:14
🔗
|
jfranusic |
:D I expected as much but I didn't want to assume anything. |
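
The 16:51 to 17:03 exchange above describes why a naive recursive grab gets lost on forums (search pages, a calendar with an endless "next month" link, piles of "you are not authorised" pages for PMs and profiles) and suggests ignore patterns as the fix. A minimal sketch of that idea follows, assuming a phpBB/vBulletin-style URL layout. The pattern list, the `forum.example` host, and the names `IGNORE_PATTERNS`, `should_fetch`, and `filter_queue` are all hypothetical and for illustration only. They are not taken from the chat, the wiki pages linked above, or any crawler's real ignore set.

```python
import re

# Hypothetical ignore patterns for a phpBB/vBulletin-style forum.
# The script names are assumptions; adjust them to the forum being grabbed.
IGNORE_PATTERNS = [
    r"/search\.php",                           # search result pages
    r"/calendar\.php",                         # calendar with endless "next month" links
    r"/(ucp|private|memberlist|member)\.php",  # PMs, profiles, login walls
    r"[?&]sid=[0-9a-f]+",                      # session IDs that make every URL look new
]
_IGNORE_RE = [re.compile(p) for p in IGNORE_PATTERNS]


def should_fetch(url: str) -> bool:
    """Return False when the URL matches any ignore pattern."""
    return not any(rx.search(url) for rx in _IGNORE_RE)


def filter_queue(urls):
    """Keep only the URLs a crawler should still visit, preserving order."""
    return [u for u in urls if should_fetch(u)]
```

A crawler would call `should_fetch` on each discovered link before queueing it, so a link such as `calendar.php?month=1&year=2038` is dropped rather than followed month by month.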