Time |
Nickname |
Message |
08:01
🔗
|
namespace |
... |
08:01
🔗
|
namespace |
My grab is done. |
08:02
🔗
|
Smiley |
grab of what? |
08:02
🔗
|
namespace |
A website I'd rather not name. Sadly I think the connection just timed out. |
08:02
🔗
|
Smiley |
k, upload to IA? |
08:03
🔗
|
namespace |
Maybe, need to make sure it's relatively complete/doesn't have my username at the top of every page. |
08:03
🔗
|
Smiley |
you know about warc-proxy? |
08:04
🔗
|
namespace |
That wouldn't help here, forum grab. |
08:04
🔗
|
Smiley |
hmmm, so you didn't make a warc? |
08:04
🔗
|
namespace |
I did. |
08:04
🔗
|
namespace |
I think I did anyway. |
08:04
🔗
|
Smiley |
lol |
08:04
🔗
|
namespace |
I used --warc-file |
08:04
🔗
|
Smiley |
yah |
08:04
🔗
|
namespace |
Though I'm still not exactly sure what that option is supposed to do. |
08:04
🔗
|
Smiley |
well if you use warc proxy, you can "see" what's in the warc |
08:05
🔗
|
namespace |
Ah. |
08:05
🔗
|
Smiley |
as if it was live |
08:05
🔗
|
namespace |
Less won't do it? |
08:05
🔗
|
Smiley |
--warc-file="$WARC_NAME" |
08:05
🔗
|
Smiley |
less will show you the code. |
08:05
🔗
|
Smiley |
SO you can grep and stuff |
08:07
🔗
|
namespace |
I think it saved it as raw html files and then as a warc. |
08:07
🔗
|
namespace |
Anyway, where do I get this warc-proxy utility? |
08:08
🔗
|
Smiley |
github has instructions i believe |
08:08
🔗
|
Smiley |
yah, it will do both a mirror + the warc |
08:09
🔗
|
namespace |
Thanks. |
08:10
🔗
|
winr4r |
morning folks |
09:17
🔗
|
Smiley |
WE NEED MORE SPEED ON XANGA. |
10:01
🔗
|
winr4r |
are they throttling? |
13:27
🔗
|
Smiley |
not that I've noticed. |
15:01
🔗
|
frogainci |
Hi, is there a possibility to kill specific threads in the warrior? |
15:02
🔗
|
frogainci |
I get the following error: |
15:02
🔗
|
frogainci |
Traceback (most recent call last): File "/usr/local/lib/python2.6/dist-packages/tornado/stack_context.py", line 258, in _nested File "/usr/local/lib/python2.6/dist-packages/tornado/stack_context.py", line 228, in wrapped File "<string>", line 102, in handle_response IOError: [Errno 24] Too many open files: |
15:02
🔗
|
frogainci |
and after I'm unable to upload anything... |
15:11
🔗
|
Smiley |
reboot your warrior is prob fastest fix |
15:11
🔗
|
Smiley |
you could by logging in to the warrior |
15:11
🔗
|
Smiley |
but that sounds like the entire webserver fell over |
15:14
🔗
|
omf_ |
reboot it |
15:15
🔗
|
* |
Smiley points to 16:11:05 GMT |
15:17
🔗
|
frogainci |
Smiley: there are still a couple of packages ready... |
15:17
🔗
|
frogainci |
isn't there a possibilty to use run-pipeline or so? |
15:18
🔗
|
frogainci |
Just to push them out? |
15:21
🔗
|
Smiley |
yes |
15:22
🔗
|
Smiley |
http://archiveteam.org/index.php?title=Posterous#Seesaw_script_.28for_advanced_users.29 |
15:22
🔗
|
Smiley |
need to swap posterous for xanga |
15:23
🔗
|
frogainci |
thx |
15:23
🔗
|
frogainci |
will check this |
15:39
🔗
|
frogainci |
okay... i cannot figure out how to start rsync automatically.... going to kill it then... |
15:40
🔗
|
frogainci |
or is there a possbility to increment concurrent uploads while running? |
15:43
🔗
|
Smiley |
dunno |
16:40
🔗
|
ivan` |
did you know Google Reader is actually capable of reading feeds hosted on FTP servers |
16:41
🔗
|
ivan` |
https://www.google.com/reader/view/#stream/feed%2Fftp%3A%2F%2Fftp.tcrc.edu.tw%2FMySQL%2Fworkbench%2Findex.html%253Ffeed%3Drss2 |
17:30
🔗
|
Coderjoe |
wow. the warrior web UI is pretty nice |
17:31
🔗
|
Coderjoe |
so, xanga? |
17:41
🔗
|
omf_ |
#jenga Coderjoe |
17:56
🔗
|
Coderjoe |
I mean should I choose that rather than one of the other options |
18:45
🔗
|
winr4r |
Coderjoe: yes |
21:34
🔗
|
dashcloud |
This is exciting news from LibreOffice: http://fridrich.blogspot.ch/2013/06/libreoffice-import-filter-for-legacy.html - many pre-OS X word/text formats are now supported |
21:35
🔗
|
arrith1 |
nice. wonder if that can be used to automate batch conversions easily. |
21:38
🔗
|
dashcloud |
probably |