[00:46] here's the solution to a problem you didn't know you had: http://gifprint.com/ Now you can keep your gifs animated after you print them out! [00:59] also, if you love reading reverse-engineering stories, here's one about reversing the 1974 Sinclair Scientific calculator: http://files.righto.com/calculator/sinclair_scientific_simulator.html [02:09] so i have uploaded the breaking news coverage during the hunt in boston for the bomber [02:09] here it is: https://archive.org/details/GBTV_04_19_2013 [02:10] i think there was more breaking news after the wilkow show on this day too [02:11] and i have it [04:34] how would someone download fanfiction.net? thought it might be a good idea to get a more up to date backup [04:57] I mean the last one was twenty months ago [05:37] BlueMax: the way I did it was by spidering it [05:37] BlueMax: https://github.com/ArchiveTeam/ffnet-grab [05:37] I tried using that yipdw but the tracker is down [05:37] yeah, I killed it when the project finished [05:38] the code may still work, or at least its ideas [05:38] that code is also pre-seesaw [05:38] I'm not smart enough to edit that code to a way that it'll work, if I'm looking at it right it seems to be tied to use your server [05:38] (but considering this is me I'm probably wrong) [05:39] the server name is hardcoded, but it can be any tracker [05:39] ah, well I don't know how to set any of that up [05:39] however at this point I think you'd be better off taking the ideas in that project, evaluating whether or not they're still good, and turning it into a Warrior-ready Seesaw application [05:40] I don't even know what the seesaw is :/ [05:40] man I'm a bit behind aren't I [05:40] it's what the warrior uses [05:41] https://github.com/ArchiveTeam/seesaw-kit [05:41] hey everyone [05:42] i got a 1.5tb hard drive today [05:42] I'm in over my head already [05:43] couldnt' we check the old fanfiction dump and use that as a start point for author names [05:43] yes, but you'd need to check if they've done anything new [05:43] or just spider by sections [05:43] or if there's new sections, etc. [05:44] I do not know of way to make it work without spidering the whole site [05:44] since all of it (well, almost) is subject to change [05:44] spidering it isn't too hard tho [05:45] best way is to do it this way [05:45] for example: http://www.fanfiction.net/anime/?l=a [05:47] then spider the index [05:48] to get all ratings you spider using this: http://www.fanfiction.net/anime/C-Sword-and-Cornett/?&srt=1&r=10 [05:49] then add a --accept-regex="(p=)" [05:50] also i know of a way to do a crappy brute force at it too [05:52] just do like seq 1 to 10000 or something then add to this url: http://www.fanfiction.net/s/# [05:52] # is the number of the seq [05:52] godane: previously, the work was chunked by profile [05:52] see https://github.com/ArchiveTeam/ffnet-grab/blob/client/retrieve.py [05:52] they did, however, change their URL scheme, so that's not going to work anymore [05:52] however, an adaptation should still be oky [05:57] man I was hoping it'd be simple. Never is simple with archiving is it [05:59] FF isn't too bad [05:59] I'm simple :P [05:59] these days, we have way better tools too [05:59] ffnet-grab was pre-Warrior [05:59] alard's seesaw etc. help out a lot [06:00] the warrior is a work of genius [06:04] i must have push over 100 items to archiveteam-fire cause of my guardian world articles dump [06:04] wow its 186 items [06:08] If anybody ever gets around to doing another archive of ff.net I'll jump on and grab a copy [06:14] jesus christ SketchCow cannot type to save his live on a phone :P [06:14] ... [06:15] and neither can I apparently. [06:16] Hah, people usign pghones [06:16] ffs [06:18] *facepalm* [06:19] I went and downloaded the TAR of the 2012 backup [06:19] surprised they were all converted to ePubs [06:22] I wonder how much was added over the last 20 months... [06:25] BlueMax: quite a bit [06:25] BlueMax: another neat question is how much was deleted :P [06:27] man, if I knew how to do all this I would :P [07:36] i'm grabbing the gallery pages of theguardian in my world articles index [07:37] i have to make another dump cause there is stuff like ?picture= urls [07:37] and my grabs was not getting it [07:42] good news is that there are no in 2007/dec urls so this will be a full dump of 2007 galleries [07:44] looks there dec/2007 urls [07:44] oh well [08:09] sooo [08:10] Tradehill told customers last week it would transfer their accounts to a U.S. credit union to make it easier to complete transactions. The company said in an e-mail to clients that customer accounts were being frozen as of Aug. 23 for the move, and clients who didnâ??t want to switch to the Internet Archive Federal Credit Union were offered the option to liquidate their holdings. [08:10] ?! [08:35] https://www.facebook.com/media/set/?set=a.497149920370098.1073741831.451026158315808&type=1 [08:36] That is cool [08:41] so my gallery dump of 2007 urls is over 200mb now [08:41] biggest dump of the guardian so far [09:59] its over 400mb now [12:53] looks like the early video episodes of the tech guy on twit was spoty [12:54] just look here: http://web.archive.org/web/20100715144706/http://feeds.twit.tv/ttg_video_large [12:54] there is like none in the 670s [13:03] there maybe some more in the 660s then what is posted in the old rss video [13:03] *video feed [19:38] so i'm going to grab the bomb patrol afghanistan series that aired on g4 [20:55] http://forums.somethingawful.com/showthread.php?threadid=3567802