[02:38] http://www.reddit.com/r/shutdown/comments/1p3wah/ultimate_guitar_is_looking_ill/ [02:55] im trying to upload to ftp and i once again cant see the folder [02:55] ersi [02:56] Cameron_D: is someone mirroring it? we may have negative time here [03:09] I'm not sure [03:09] and its too complex for ArchiveBot (stuff is spread accross multiple subdomains with redirects everywhere) [03:09] Q_Q [03:10] IA currently has a fair bit stored though [04:19] Cameron_D: where is it spread amongst subdomains? [04:21] A large amount of stuff is hosted on tabs.ultimateguitar.com, but all the listings are on the main domain [04:49] ... there is an /r/shutdown? [04:49] how did I not know of this? [04:49] Wouldn't that simply be an --allow-domain or something? [04:49] I know we did something like that for IGN, was it? [04:50] Er, --domains it was [04:50] GLaDOS: +1 for that comment [04:51] \o/ [04:54] so tabs.ultimateguitar.com, www.ultimateguitar.com, cdn.ustatik.com, any other URLS to include? [04:54] I think my.ug and plus.ug require login [05:06] ultimateguitar is huge [05:15] ^ [05:51] mobileme is also huge [05:58] hehe [06:02] just set three warriors on it and let them strip the skeleton [06:14] am imagining warriors as piranhas skeletonizing a website [06:14] (sorry, offtopic) [14:15] I've watched enough Jason Scott videos, I'm angry, I'm technical, and I love collecting shit. I'm in. [14:18] welcome :) [14:19] Welcome aboard [14:21] I'll just let the Warrior tick over in the background for now. [14:21] That's an awesome start :) [14:33] * w0rp didn't even know blip.tv was shutting down. [14:33] *sigh* [14:41] changed TOS [14:45] so they're deleting a bunch [14:47] I think someone mentioned this a while ago, but twitch.tv has decided they're going to prune old broadcasts (somewhat randomly it appears), and at least some of it has already happened- so if you like a particular broadcaster, make sure to download their streams [14:50] Does anyone have a good YouTube archiving program? I've been looking into writing my own, because I did it once before, but then some detail changed a while back which made it not work anymore. [14:50] I reckon I can do better than youtube-dl. [14:53] I think youtube-dl is probably the best one- someone here had a fork of it I think (can't remember the name), but otherwise I think it's the finest one. If you've got a problem, ask and maybe someone has the answer. [14:55] Well, youtube-dl is pretty good on the outside, but the code is... bad. [14:55] So when I looked at it to make it do things a little differently, I gave up pretty quickly. [14:57] I did used to have a script which read a plaintext file with usernames in separate lines, watch API feeds or something, and it would just automatically save YouTube videos, running forever. [15:52] How do people here feel about .mht? [15:52] I've been saving pages in .mht files for a while. [16:08] We generally prefer WARC, since then we can upload stuff to Internet Archive (archive.org) and make it available in their Wayback Machine [16:16] Yeah, I read about WARC. I'll probably use that in future. [16:21] :) [16:21] I've got like 700MB of imageboard and modern Japanese style BBS threads, it seems. [18:44] http://www.rollingstone.com/music/news/lou-reed-velvet-underground-leader-and-rock-pioneer-dead-at-71-20131027 [18:47] :( [18:47] 71's a good run, though. [19:47] #WooZu7# [19:47] uh, ignore that [19:48] :P [20:03] Everyone does that at least once. [20:03] if that was a password, you should probably change it. [21:29] phillipsj: don't worry; oh he could see it; to everyone else it just looks like ********* [22:04] so, just watched the Internet Archive celebration talk, and that's some amazing things there- especially the 30 year collection of news on VHS tape [22:09] oh my [22:09] thats insane [22:09] I work at a tech recycling, next time I get some mag tapes, im sending them to IA [22:19] are all of these on IA? http://retropdfs.wordpress.com/currently-available-collections/ [22:19] also, does anyone have issues of the German magazine "Happy Computer"? [22:19] many were scanned but not all [23:17] SketchCow: the Number Squares program is stream_only collection for some reason [23:22] another recovery sucess story http://sideline.ghegs.com/