#archiveteam 2013-07-19,Fri

↑back Search

Time Nickname Message
08:56 🔗 ivan` http://www.h-online.com/open/news/item/The-H-is-closing-down-1920027.html
10:41 🔗 alih-duck hrm, is anyone around who understands the wget-lua stuff? i
10:42 🔗 alih-duck 'm trying to rewrite a url on failure, and have it still recurse as if it was the original
10:42 🔗 alih-duck seems to not call any recursion in taht case if you just insert it from get_urls
11:03 🔗 omf_ I just had a brain fart. Does archive.org support all the nifty features on .tar.gz or .tar.bz2 files like with zips?
11:05 🔗 xmc think so yes
11:34 🔗 Cameron_D http://www.h-online.com/open/news/item/The-H-is-closing-down-1920027.html
11:34 🔗 Cameron_D oh, already linked
13:35 🔗 Cameron_D attempting to grab H-online
14:11 🔗 DFJustin omf_: .tar yes, .tar.gz or .tar.bz2 no
15:22 🔗 nathanm hi.
15:23 🔗 nathanm I just wanted to bring to your attention the fact that The H Online is shutting down, so you might want to archive it.
15:25 🔗 nathanm For over 6 years, it's been a useful Linux news source. Not anymore.
15:27 🔗 balrog they do say that "Work is taking place to create an archive to ensure that the content of the site will remain publicly accessible." but yeah
15:27 🔗 balrog also that's unfortunate :/
15:34 🔗 Jonimus 08:34:22 @Cameron_+| attempting to grab H-online
15:34 🔗 Jonimus So yeah, someone is already on it ;)
17:02 🔗 ivan` tumblr "Along with the fact that if your blog is flagged adult they set robots.txt to noindex on your whole subdomain, so you're nuked from google" https://news.ycombinator.com/item?id=6070931
17:26 🔗 DFJustin lovely, no ia_archiver exception so goodbye wayback machine http://hot-redheads.tumblr.com/robots.txt
17:27 🔗 DFJustin if we can get in touch with tumblr staff that may be something they could fix
17:27 🔗 DFJustin since wayback has no search
17:52 🔗 soultcer I think Yahoo! is the sworn arch enemy of archivists everywhere. Why would they do something to help the Internet Archive?
18:15 🔗 DFJustin well not management obviously but some lower-level server jockey

irclogger-viewer