[08:56] http://www.h-online.com/open/news/item/The-H-is-closing-down-1920027.html
[10:41] hrm, is anyone around who understands the wget-lua stuff?
[10:42] i'm trying to rewrite a url on failure, and have it still recurse as if it was the original
[10:42] seems to not call any recursion in that case if you just insert it from get_urls
[11:03] I just had a brain fart. Does archive.org support all the nifty features on .tar.gz or .tar.bz2 files like with zips?
[11:05] think so yes
[11:34] http://www.h-online.com/open/news/item/The-H-is-closing-down-1920027.html
[11:34] oh, already linked
[13:35] attempting to grab H-online
[14:11] omf_: .tar yes, .tar.gz or .tar.bz2 no
[15:22] hi.
[15:23] I just wanted to bring to your attention the fact that The H Online is shutting down, so you might want to archive it.
[15:25] For over 6 years, it's been a useful Linux news source. Not anymore.
[15:27] they do say that "Work is taking place to create an archive to ensure that the content of the site will remain publicly accessible." but yeah
[15:27] also that's unfortunate :/
[15:34] 08:34:22 @Cameron_+| attempting to grab H-online
[15:34] So yeah, someone is already on it ;)
[17:02] tumblr "Along with the fact that if your blog is flagged adult they set robots.txt to noindex on your whole subdomain, so you're nuked from google" https://news.ycombinator.com/item?id=6070931
[17:26] lovely, no ia_archiver exception so goodbye wayback machine http://hot-redheads.tumblr.com/robots.txt
[17:27] if we can get in touch with tumblr staff that may be something they could fix
[17:27] since wayback has no search
[17:52] I think Yahoo! is the sworn arch enemy of archivists everywhere. Why would they do something to help the Internet Archive?
[18:15] well not management obviously but some lower-level server jockey