Time |
Nickname |
Message |
08:56
🔗
|
ivan` |
http://www.h-online.com/open/news/item/The-H-is-closing-down-1920027.html |
10:41
🔗
|
alih-duck |
hrm, is anyone around who understands the wget-lua stuff? i |
10:42
🔗
|
alih-duck |
'm trying to rewrite a url on failure, and have it still recurse as if it was the original |
10:42
🔗
|
alih-duck |
seems to not call any recursion in taht case if you just insert it from get_urls |
11:03
🔗
|
omf_ |
I just had a brain fart. Does archive.org support all the nifty features on .tar.gz or .tar.bz2 files like with zips? |
11:05
🔗
|
xmc |
think so yes |
11:34
🔗
|
Cameron_D |
http://www.h-online.com/open/news/item/The-H-is-closing-down-1920027.html |
11:34
🔗
|
Cameron_D |
oh, already linked |
13:35
🔗
|
Cameron_D |
attempting to grab H-online |
14:11
🔗
|
DFJustin |
omf_: .tar yes, .tar.gz or .tar.bz2 no |
15:22
🔗
|
nathanm |
hi. |
15:23
🔗
|
nathanm |
I just wanted to bring to your attention the fact that The H Online is shutting down, so you might want to archive it. |
15:25
🔗
|
nathanm |
For over 6 years, it's been a useful Linux news source. Not anymore. |
15:27
🔗
|
balrog |
they do say that "Work is taking place to create an archive to ensure that the content of the site will remain publicly accessible." but yeah |
15:27
🔗
|
balrog |
also that's unfortunate :/ |
15:34
🔗
|
Jonimus |
08:34:22 @Cameron_+| attempting to grab H-online |
15:34
🔗
|
Jonimus |
So yeah, someone is already on it ;) |
17:02
🔗
|
ivan` |
tumblr "Along with the fact that if your blog is flagged adult they set robots.txt to noindex on your whole subdomain, so you're nuked from google" https://news.ycombinator.com/item?id=6070931 |
17:26
🔗
|
DFJustin |
lovely, no ia_archiver exception so goodbye wayback machine http://hot-redheads.tumblr.com/robots.txt |
17:27
🔗
|
DFJustin |
if we can get in touch with tumblr staff that may be something they could fix |
17:27
🔗
|
DFJustin |
since wayback has no search |
17:52
🔗
|
soultcer |
I think Yahoo! is the sworn arch enemy of archivists everywhere. Why would they do something to help the Internet Archive? |
18:15
🔗
|
DFJustin |
well not management obviously but some lower-level server jockey |