[00:01] has anyone tried one of the browser addons to download everything on the page? [00:04] what's up with the menu on the wiki? [00:06] What do you mean? [00:06] downthemall is good, dashcloud. Dunno about using it for archiving though. [00:06] You can do regex based file selection on pages with it [00:06] does it present a normal user agent string, or a special one? [00:07] I think it uses firefox's, but don't quote me on that [00:20] folks, little issue here- you have to accept the terms & conditions via the pop up box every time [00:22] actually, you may only need to do it once- downthemall re-started the last 3 I had from a previous try, and actually got the PDFs this time [00:28] that looks like it- I just re-started the whole batch, and I'm actually getting PDFs now [03:39] ooh neat, IPv6 [03:45] magic [03:57] :) [06:44] Whee [06:44] man... I really need to do a better job of managing my various collections [06:47] Hmm, 12gb of Amiga Format Cover CDs? Sure. [16:27] SketchCow: You only have 55 days left [16:27] and you're only at 49% [16:27] Better get cracking [16:28] Er, 47% [16:28] Even worse! [16:46] nnnnnnnn-netsplit! [18:16] SketchCow, all: I made a 'JSTOR liberator' example. (Only works in Google Chrome at the moment.) [18:17] Give it a try at http://severe-samurai-6114.heroku.com/ (it doesn't really save anything, just an example of how it would work) [19:57] So.. curious, does anyone know if all Geocities sites are completely gone? Because I just ran across a site that is only half-dead [19:57] http://www.geocities.com/xanderubi/html/john_hughes_shooting_locations.html [20:18] MMovie: that's kind of strange. I seem to recall something about paid sites [20:59] Update: the JSTOR download-upload thing now also works in Firefox. [21:07] thanks- works just fine in FF 6 here [21:11] The whole idea is a bit silly, but that's probably the point. [22:00] alard: That's really neat! [22:00] I've been working on seeing what I can get the conventional way too [22:00] 12969 [22:00] 61617 [22:00] [5:53:56 PM] Alex Buie: 0 9:47PM:abuie@anonymous-finland-11:~/jstor 2381 Ã cat ids.txt|sort -u|wc -l [22:00] [5:54:31 PM] Alex Buie: 0 9:48PM:abuie@anonymous-brazil-2:~/jstor 2381 Ã cat ids.txt|sort -u|wc -l [22:00] [5:54:40 PM] Alex Buie: Not too shabby so far [22:00] [5:54:48 PM] Alex Buie: Those are only the IDs though [22:00] [5:55:28 PM] Alex Buie: root@anonymous-norway-1:~# ls *pdf|wc -l [22:00] 1147 [22:00] [5:55:56 PM] Alex Buie: root@anonymous-norway-1:~# du -sh . [22:00] 1.1G . [22:00] [5:56:19 PM] Alex Buie: Downloads go a bit slower [22:03] underscor: That's nice. Finding and downloading is not the problem, I think. (I have a bunch of scripts and a list of issue IDs that can be used to download everything.) [22:03] I saw :) [22:03] This is more about the 'hilarious game' idea mentioned by SketchCow earlier. [22:04] Yeah, I know [22:04] One thing I noticed though [22:04] is that there are some documents that are "free" that aren't properly associated with an issue [22:04] That's strange. [22:04] ie, they only show up in search results [22:05] Let me see if I can go find an example [22:07] alard: Hmm, I'll have to keep looking [22:07] I know I had one [22:07] Here's what made me suspicious though [22:07] "Journals are listed under the current title; the articles included in the Early Journal Content may be under a previous title. [22:07] " [22:08] http://about.jstor.org/sites/default/files/jstor-ejc_title_2011-09-12.3.pdf [22:08] Strange examples are hard to find. :) [22:08] Isn't that just the 'this journal was continued as X' line? [22:09] Where is that? [22:09] I must have missed it [22:09] Looking for an example :) [22:10] http://www.jstor.org/action/showPublication?journalCode=ameranth [22:10] http://www.jstor.org/action/showPublication?journalCode=trananthsociwash [22:11] In those cases both names appear in the browse-by-title list. [22:13] Aha [22:13] That's cool [22:18] we should try to archive all techtv videos