[04:36] has yahoo blogs and wretch been saved yet? [08:07] I have no idea if those have been saved already [08:20] hmm, the yahoo blog and wretched channels are op-less. everyone join #shipwretched! please [08:25] ---> #shipwretched! <--- for yahoo blogs and wretch [08:32] I'm there [08:32] I'll take a look at those webistes now [08:41] is someone who archived Fileplanet availabel here right now? [12:39] oh yeah, it's #shipwretched (no exclamation mark) [14:21] linea is going away on the december 15th [14:21] so I'm downloading these: [14:21] http://blog.getlinea.com/ [14:21] http://info.getlinea.com/ [14:21] https://www.getlinea.com/ [14:24] also [14:25] I'm doing a full pastebin grab [14:25] not just of the urls with the codes [14:25] but the full site [14:25] crawl [15:10] arkiver: grab this too: http://www.youtube.com/user/GetLinea [15:44] i need some help with this: http://computerpoweruser.com/articles/archive/G0803/36g03/36g03.asp?guid= [15:45] this error is in it: [15:45] The include file '/includes/security.inc' was not found. [15:45] /articles/archive/G0803/36g03/36g03.asp, line 3 [15:45] that tells me the file still exist [16:11] Thinking.... [16:11] Why don't we archive EBAY? [16:11] As a rolling, ongoing project? [16:11] grabbing new item description pages, etc. [16:11] archiving them for eternity instead of ebay's usual 90 days (or so) [16:12] could be a nice rolling project. [16:12] if we can get it setup with minimal admistration then it'd be great for idle warriors. [17:08] i added cookies to my steam app page dump [17:09] turns out its needed for games that a M rating [17:09] other wise i don't get the page [17:20] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [17:21] anyone? [17:21] lol [17:22] no one? :( [17:22] Nova_: yahoosucks [17:22] thanks. [17:22] no worries [17:24] a site I want to go on is archived, is there a way I can get on it so that I can look trough old photos of me and my friends and stuff? [17:25] Nova_: which site? [17:25] hyves.nl [17:25] not sure if it's gone into wayback yet..... hmmm [17:25] we normally get some kind of browsing page setup... however i don't know if it's been done yet [17:25] the guy who was running the project isn't here atm [17:26] ah okay I understand, but someday it will go online? I am not in a hurry so yeah. [17:26] yup [17:26] might already be listed on archive.org somewhere [17:27] https://archive.org/details/hyves << it'll be in there somewhere. [17:49] antomic, that's a good idea! [17:49] I would like to start download ebay [17:49] but do you think 10 GB memory is enough for ebay download? [17:50] Nova_ What was the exact name of your hyves url? [18:01] If you are archiving a big website or know a website which is going to die, please add it here to the list: http://archiveteam.org/index.php?title=Projects [18:20] IMPORTANT QUESTION: is winamp already fully download??? [18:26] well, www.winamp.com is [18:26] http://archivebot.at.ninjawedding.org:4567/#/histories/http://www.winamp.com/ [18:26] forums.winamp.com, not sure [18:26] well [18:26] I got the following domains listed: [18:27] http://blog.winamp.com/ [18:27] http://dev.winamp.com/ [18:27] http://forums.winamp.com/ [18:27] http://www.winamp.com/ [18:27] plug them into IA and see what their snapshots are [18:27] I'm downloading them all again just to be 100% sure they are really downloaded [18:27] or plug them into that histories URL [18:29] http://dev.winamp.com/ and http://blog.winamp.com/ are downloaded by the archivebot, but they are very small??? [18:29] not sure if they are downloaded 100%... [18:29] if it doesn't say "aborted", it's done [18:30] with the exception of links not discoverable by wget [18:30] also, check IA [18:30] there are snapshots for all sites dating to December 12 [18:30] which is pretty recent [18:30] yes I saw it [18:30] it's probably ok then [18:30] :) [18:30] pfieuw [18:30] if you want to do another one, I suggest forums.winamp.com [18:30] be aware that that requires a lot of space [18:31] yes I'm doing all 4 again [18:31] http://archiveteam.org/index.php?title=Projects [18:31] see the first line here from the table: [18:31] but I need to go now [18:31] will let you know how my download goes [18:31] and I hope I will be finished by the 20th of december [18:31] (which I doubt...) [18:32] (since it's whole full forum...) [18:32] brb [19:39] interesting article and project: http://arstechnica.com/information-technology/2013/12/british-library-sticks-1-million-pics-on-flickr-asks-for-help-making-them-useful/ photos here: http://www.flickr.com/photos/britishlibrary [19:45] ohhhh archivebot . . . . [19:45] im not sure he'll grqb flickr [19:45] jdownloader will [19:48] youtube-dl says it handles flickr too [19:48] the problem with flickr, as with many other sites these days, is that retrieving URLs from the web interface requires an event loop that executes page stuff [19:48] if someone has a good way to do this in a stable manner, a pull request would be good [19:48] does google have some way of doing this for their cache? [19:49] ][#'aszxxxxxhnjbgflk ;http://www.newstatesman.com/sci-tech/2013/12/trawling-dark-web [19:49] wow my cat is good at typing [19:56] I'll take a look at it if I can download that flickr account... [20:01] will leave it runnig for some time [20:01] and then I'll test a wrc.gz file if it is going ok