[00:07] wow- bitcoin is popular enough for regular spammers? [00:24] manual downloading is coming along- after an fruitless search of HP's ftp site, a google site search is proving far more useful, and I've got a nice pile of manuals cued for download [00:53] Coppernic Agent is a good way of doing multiple searches you can finlter and save [01:08] so how's it work with wine? [01:40] Sorry, not run it in Linux. Copernic has been around since the 90's, (It's how we searched before web engines became popular) they may have a Linux version. I'll check. [01:54] Looks like version 6.11 was tested and given a gold rating under Wine, so it looks like it should work well. [01:55] http://appdb.winehq.org/objectManager.php?sClass=version&iId=8645 [01:56] no native Linux version, but then they are a commercial company [02:02] is aws ec2 worth it for these types of grabs? [02:03] not sure [02:03] also has SketchCow been dumping textfiles.com into IA, cause AFAIK, they have only the time capsule, and thats old [02:06] getting the urls for the manuals is the hardest part- after that any computer with wget can grab them [02:41] I'm chatting with some of my Unreal Tournament colleagues. There are those of us who have been archiving everything we can for a while [02:43] There is a sexy Frenchman (he's damn sexy now!) Who claims he already has the complete Fileplanet ! [02:44] ! [02:45] Medor or Darkmedor, has his mirror of www.ut-files.com and says once he has sorted it, it will be going live again [02:46] ut-files recently had a crisis as it is hosted by another UT mad private individual by the tag "SKILLS" [02:47] The site is payed for by donations, and that is a risk [02:50] medors mirror is the only other complete and live copy of ut-files.com. I have been suggesting combining mirrors via a portal like the Aminet uses. Then using CoralCDN in all the links, and create a massive UnrealCDN [03:04] Question to the team: Have you been able to use "FXP" protocol to do site to site transfers ? [03:12] I've heard of it but never tried [03:19] folks, this is a little flakey, but otherwise it's perfect- you get the website url and the title of the page, which is great for service manuals since they seem to have numeric-only identifiers a lot: http://jurnsearch.wordpress.com/2012/01/27/how-to-extract-google-search-results-with-url-title-and-snippet-in-a-csv-file/ [03:26] FXP is very handy, for those who have the time, but not the bandwidth or storage capacity. I used to use it at work to move stuff from sites to my private folder on a friends server. Igloo has a Linux version, have a look. Again their devs may be able to work with you for a custom version if needed. http://www.iglooftp.com/linux/ [03:33] can anyone get this on to archive.org: http://en.wikipedia.org/wiki/The_Site [03:33] its old enough that i shouldn't be taking down [03:33] thats if anyone can find full episodes of it [03:43] Here's a blog with some info on setting up FXP for your servers http://blog.b2netsolutions.com/server-administration-guides/setup-fxp-on-ftp-servers/ [04:05] Can't find the signup for the wiki :( [04:10] iTunes Ping is closing its doors September 30th. Backup your data if possible. [04:19] I don't know if there is anything to save but Google Music China is shuttering. http://www.businessweek.com/ap/2012-09-21/google-says-it-will-shut-china-music-service [04:20] Just thought I'd announce that. [04:30] 2 [04:31] m [05:13] BTW peeps, I asked the author of the Opera Webcache plugin, to add CoralCDN and the Web Archive's live URLs. you can now manually drag pages into the Wayback Machine. I do it regularly with pages and files I find important. (saving the web a little at a time) Another good reason to use Opera browser :D [05:39] coralcdn isn't a permanent archive though right, just a load balancer that will purge old stuff out as new stuff comes in? [05:54] correct [05:54] coralcdn has a ~1h expiry, iirc [08:36] @chronomex The flush time of Coral is something I have been wanting to know. is this based on entry to the CDN, or last access? [08:37] I'm pretty sure it's X minutes from last access, with some upper bound on lifetime without refresh [08:40] I have been suggesting to a few admins of some Unreal based servers, that DLs and redirects for game servers, be routed through it as a test. That is good to know [08:41] Game files tend to be temporary, but private hosted servers sometimes take a hammering [08:46] So as long as people keep joining the server frequently, even if the redirect fails, gamers should experience little impact. Groovy. Can't let anything stop people shooting their best friends.