[04:44] wpull now has experimental phantomjs support: https://github.com/chfoo/wpull/blob/master/wpull/phantomjs_test.py [05:27] cool [05:27] trying it out, pip3 install . didn't install sqlalchemy, I think it's missing in your install_requires [05:32] chfoo: I see an error when using --phantomjs Can't open '/home/at/.local/lib/python3.3/site-packages/wpull/phantomjs.js' [05:33] ah, i forgot to update setup.py [05:37] try a blogspot dynamic views blog, I am seeing a lot of very long tracebacks [05:37] python3 -m wpull --phantomjs -r --warc-cdx --warc-file=thehamiltonproject.blogspot.com http://thehamiltonproject.blogspot.com/ [05:37] -> https://www.refheap.com/af99a5602b5666ba880fd4ca2/raw [05:46] seeing similar things with phantomjs 1.9.7 instead of ubuntu's 1.9.0 too [05:50] hmm, i think i'll need to test it in a clean environment [06:50] i'm uploading my bearbone grab of nxtbooks [06:51] this will make it possible to to grab the pdf files [06:53] i doing a grab of Pillsbury section of that [14:35] are there any trackers still up for the geocities torrent? all the ones in the _PATCHED_ torrent are timing out for me. [14:40] yes, I'm seeding [14:41] let me find the torrent file for you [14:41] cool, thanks [14:51] sep332: try this one http://piratebaytorrents.info/6353395/Geocities_-_The_PATCHED_Torrent.6353395.TPB.torrent [14:55] sep332: you know it's over 600 GB right ? takes a while if your torrent client creates all the files at start [14:55] yeah, i've done this one before :) [14:55] i'm specifically getting tracker timeout errors [14:57] not getting any peers ? [14:57] ccc.de times out, thepiratebay.org doesn't have a tracker anymore, openbittorrent and publicbt are refusing connections [14:58] nope, left it running all night and no peers. i checked that my ports are open [14:58] could you /msg me your ip and i'll add your peer manually? [14:58] hmm, if you can manually add a peer try 162.220.26.146 [15:00] got it :) [15:00] thanks biggiejon! [15:00] alright, PEX kicked in and i have a lot of peers now [15:01] ya, sometimes just haev to hit one and it will get many more [15:01] maybe someone should update the "official" torrent to have some working trackers... [15:02] :) [15:19] all the stuff is at https://archive.org/details/archiveteam-geocities [15:20] Is the 8-part snapshot the same as what's in that torrent? [15:22] I believe so [17:52] SketchCow: Threw on your DC17 talk while I'm working [17:52] Somehow stories about Coke bottles in vaginas isn't that distracting [18:53] is there a way to search content of pages in the wayback machine? [19:18] not at present [19:18] ok, i figured indexing that would be pretty hard [19:21] yeah [19:21] that would be cool though [19:24] i guess google doesn't crawl the wayback? [19:25] https://web.archive.org/robots.txt [19:25] nope [19:26] although that'd be a great source of fun [19:27] indeed. [19:27] wayback has an archived copy of a google search page that has an archived copy of the google search page that [19:27] etc [19:28] yeah [19:58] i'm starting to upload svoboda newspaper issues for year 1920 [23:11] i got another episode of the screen savers [23:11] thanks to myspleen [23:12] its the episode with Steve Wozniak and Kevin Mitnik hosting the screen savers [23:40] This reminds me of Jason: http://i.imgur.com/VW1jQvx.gif [23:53] a little bit