#archiveteam-bs 2013-09-01,Sun

โ†‘back Search

Time Nickname Message
00:46 ๐Ÿ”— dashcloud here's the solution to a problem you didn't know you had: http://gifprint.com/ Now you can keep your gifs animated after you print them out!
00:59 ๐Ÿ”— dashcloud also, if you love reading reverse-engineering stories, here's one about reversing the 1974 Sinclair Scientific calculator: http://files.righto.com/calculator/sinclair_scientific_simulator.html
02:09 ๐Ÿ”— godane so i have uploaded the breaking news coverage during the hunt in boston for the bomber
02:09 ๐Ÿ”— godane here it is: https://archive.org/details/GBTV_04_19_2013
02:10 ๐Ÿ”— godane i think there was more breaking news after the wilkow show on this day too
02:11 ๐Ÿ”— godane and i have it
04:34 ๐Ÿ”— BlueMax how would someone download fanfiction.net? thought it might be a good idea to get a more up to date backup
04:57 ๐Ÿ”— BlueMax I mean the last one was twenty months ago
05:37 ๐Ÿ”— yipdw BlueMax: the way I did it was by spidering it
05:37 ๐Ÿ”— yipdw BlueMax: https://github.com/ArchiveTeam/ffnet-grab
05:37 ๐Ÿ”— BlueMax I tried using that yipdw but the tracker is down
05:37 ๐Ÿ”— yipdw yeah, I killed it when the project finished
05:38 ๐Ÿ”— yipdw the code may still work, or at least its ideas
05:38 ๐Ÿ”— yipdw that code is also pre-seesaw
05:38 ๐Ÿ”— BlueMax I'm not smart enough to edit that code to a way that it'll work, if I'm looking at it right it seems to be tied to use your server
05:38 ๐Ÿ”— BlueMax (but considering this is me I'm probably wrong)
05:39 ๐Ÿ”— yipdw the server name is hardcoded, but it can be any tracker
05:39 ๐Ÿ”— BlueMax ah, well I don't know how to set any of that up
05:39 ๐Ÿ”— yipdw however at this point I think you'd be better off taking the ideas in that project, evaluating whether or not they're still good, and turning it into a Warrior-ready Seesaw application
05:40 ๐Ÿ”— BlueMax I don't even know what the seesaw is :/
05:40 ๐Ÿ”— BlueMax man I'm a bit behind aren't I
05:40 ๐Ÿ”— yipdw it's what the warrior uses
05:41 ๐Ÿ”— yipdw https://github.com/ArchiveTeam/seesaw-kit
05:41 ๐Ÿ”— godane hey everyone
05:42 ๐Ÿ”— godane i got a 1.5tb hard drive today
05:42 ๐Ÿ”— BlueMax I'm in over my head already
05:43 ๐Ÿ”— godane couldnt' we check the old fanfiction dump and use that as a start point for author names
05:43 ๐Ÿ”— yipdw yes, but you'd need to check if they've done anything new
05:43 ๐Ÿ”— godane or just spider by sections
05:43 ๐Ÿ”— yipdw or if there's new sections, etc.
05:44 ๐Ÿ”— yipdw I do not know of way to make it work without spidering the whole site
05:44 ๐Ÿ”— yipdw since all of it (well, almost) is subject to change
05:44 ๐Ÿ”— yipdw spidering it isn't too hard tho
05:45 ๐Ÿ”— godane best way is to do it this way
05:45 ๐Ÿ”— godane for example: http://www.fanfiction.net/anime/?l=a
05:47 ๐Ÿ”— godane then spider the index
05:48 ๐Ÿ”— godane to get all ratings you spider using this: http://www.fanfiction.net/anime/C-Sword-and-Cornett/?&srt=1&r=10
05:49 ๐Ÿ”— godane then add a --accept-regex="(p=)"
05:50 ๐Ÿ”— godane also i know of a way to do a crappy brute force at it too
05:52 ๐Ÿ”— godane just do like seq 1 to 10000 or something then add to this url: http://www.fanfiction.net/s/#
05:52 ๐Ÿ”— godane # is the number of the seq
05:52 ๐Ÿ”— yipdw godane: previously, the work was chunked by profile
05:52 ๐Ÿ”— yipdw see https://github.com/ArchiveTeam/ffnet-grab/blob/client/retrieve.py
05:52 ๐Ÿ”— yipdw they did, however, change their URL scheme, so that's not going to work anymore
05:52 ๐Ÿ”— yipdw however, an adaptation should still be oky
05:57 ๐Ÿ”— BlueMax man I was hoping it'd be simple. Never is simple with archiving is it
05:59 ๐Ÿ”— yipdw FF isn't too bad
05:59 ๐Ÿ”— BlueMax I'm simple :P
05:59 ๐Ÿ”— yipdw these days, we have way better tools too
05:59 ๐Ÿ”— yipdw ffnet-grab was pre-Warrior
05:59 ๐Ÿ”— yipdw alard's seesaw etc. help out a lot
06:00 ๐Ÿ”— BlueMax the warrior is a work of genius
06:04 ๐Ÿ”— godane i must have push over 100 items to archiveteam-fire cause of my guardian world articles dump
06:04 ๐Ÿ”— godane wow its 186 items
06:08 ๐Ÿ”— BlueMax If anybody ever gets around to doing another archive of ff.net I'll jump on and grab a copy
06:14 ๐Ÿ”— BlueMax jesus christ SketchCow cannot type to save his live on a phone :P
06:14 ๐Ÿ”— BlueMax ...
06:15 ๐Ÿ”— BlueMax and neither can I apparently.
06:16 ๐Ÿ”— GLaDOS Hah, people usign pghones
06:16 ๐Ÿ”— GLaDOS ffs
06:18 ๐Ÿ”— BlueMax *facepalm*
06:19 ๐Ÿ”— BlueMax I went and downloaded the TAR of the 2012 backup
06:19 ๐Ÿ”— BlueMax surprised they were all converted to ePubs
06:22 ๐Ÿ”— BlueMax I wonder how much was added over the last 20 months...
06:25 ๐Ÿ”— yipdw BlueMax: quite a bit
06:25 ๐Ÿ”— yipdw BlueMax: another neat question is how much was deleted :P
06:27 ๐Ÿ”— BlueMax man, if I knew how to do all this I would :P
07:36 ๐Ÿ”— godane i'm grabbing the gallery pages of theguardian in my world articles index
07:37 ๐Ÿ”— godane i have to make another dump cause there is stuff like ?picture= urls
07:37 ๐Ÿ”— godane and my grabs was not getting it
07:42 ๐Ÿ”— godane good news is that there are no in 2007/dec urls so this will be a full dump of 2007 galleries
07:44 ๐Ÿ”— godane looks there dec/2007 urls
07:44 ๐Ÿ”— godane oh well
08:09 ๐Ÿ”— joepie91 sooo
08:10 ๐Ÿ”— joepie91 Tradehill told customers last week it would transfer their accounts to a U.S. credit union to make it easier to complete transactions. The company said in an e-mail to clients that customer accounts were being frozen as of Aug. 23 for the move, and clients who didnรƒยข??t want to switch to the Internet Archive Federal Credit Union were offered the option to liquidate their holdings.
08:10 ๐Ÿ”— joepie91 ?!
08:35 ๐Ÿ”— DFJustin https://www.facebook.com/media/set/?set=a.497149920370098.1073741831.451026158315808&type=1
08:36 ๐Ÿ”— omf_ That is cool
08:41 ๐Ÿ”— godane so my gallery dump of 2007 urls is over 200mb now
08:41 ๐Ÿ”— godane biggest dump of the guardian so far
09:59 ๐Ÿ”— godane its over 400mb now
12:53 ๐Ÿ”— godane looks like the early video episodes of the tech guy on twit was spoty
12:54 ๐Ÿ”— godane just look here: http://web.archive.org/web/20100715144706/http://feeds.twit.tv/ttg_video_large
12:54 ๐Ÿ”— godane there is like none in the 670s
13:03 ๐Ÿ”— godane there maybe some more in the 660s then what is posted in the old rss video
13:03 ๐Ÿ”— godane *video feed
19:38 ๐Ÿ”— godane so i'm going to grab the bomb patrol afghanistan series that aired on g4
20:55 ๐Ÿ”— DFJustin http://forums.somethingawful.com/showthread.php?threadid=3567802

irclogger-viewer