[08:01] ... [08:01] My grab is done. [08:02] grab of what? [08:02] A website I'd rather not name. Sadly I think the connection just timed out. [08:02] k, upload to IA? [08:03] Maybe, need to make sure it's relatively complete/doesn't have my username at the top of every page. [08:03] you know about warc-proxy? [08:04] That wouldn't help here, forum grab. [08:04] hmmm, so you didn't make a warc? [08:04] I did. [08:04] I think I did anyway. [08:04] lol [08:04] I used --warc-file [08:04] yah [08:04] Though I'm still not exactly sure what that option is supposed to do. [08:04] well if you use warc proxy, you can "see" what's in the warc [08:05] Ah. [08:05] as if it was live [08:05] Less won't do it? [08:05] --warc-file="$WARC_NAME" [08:05] less will show you the code. [08:05] SO you can grep and stuff [08:07] I think it saved it as raw html files and then as a warc. [08:07] Anyway, where do I get this warc-proxy utility? [08:08] github has instructions i believe [08:08] yah, it will do both a mirror + the warc [08:09] Thanks. [08:10] morning folks [09:17] WE NEED MORE SPEED ON XANGA. [10:01] are they throttling? [13:27] not that I've noticed. [15:01] Hi, is there a possibility to kill specific threads in the warrior? [15:02] I get the following error: [15:02] Traceback (most recent call last): File "/usr/local/lib/python2.6/dist-packages/tornado/stack_context.py", line 258, in _nested File "/usr/local/lib/python2.6/dist-packages/tornado/stack_context.py", line 228, in wrapped File "", line 102, in handle_response IOError: [Errno 24] Too many open files: [15:02] and after I'm unable to upload anything... [15:11] reboot your warrior is prob fastest fix [15:11] you could by logging in to the warrior [15:11] but that sounds like the entire webserver fell over [15:14] reboot it [15:15] * Smiley points to 16:11:05 GMT [15:17] Smiley: there are still a couple of packages ready... [15:17] isn't there a possibilty to use run-pipeline or so? [15:18] Just to push them out? [15:21] yes [15:22] http://archiveteam.org/index.php?title=Posterous#Seesaw_script_.28for_advanced_users.29 [15:22] need to swap posterous for xanga [15:23] thx [15:23] will check this [15:39] okay... i cannot figure out how to start rsync automatically.... going to kill it then... [15:40] or is there a possbility to increment concurrent uploads while running? [15:43] dunno [16:40] did you know Google Reader is actually capable of reading feeds hosted on FTP servers [16:41] https://www.google.com/reader/view/#stream/feed%2Fftp%3A%2F%2Fftp.tcrc.edu.tw%2FMySQL%2Fworkbench%2Findex.html%253Ffeed%3Drss2 [17:30] wow. the warrior web UI is pretty nice [17:31] so, xanga? [17:41] #jenga Coderjoe [17:56] I mean should I choose that rather than one of the other options [18:45] Coderjoe: yes [21:34] This is exciting news from LibreOffice: http://fridrich.blogspot.ch/2013/06/libreoffice-import-filter-for-legacy.html - many pre-OS X word/text formats are now supported [21:35] nice. wonder if that can be used to automate batch conversions easily. [21:38] probably