[00:23] I don't remember when I first used altavista, other than it was still well within the altavista.digital.com days
[00:25] to those with access to IA, could I get the google answers backup I did moved to the ArchiveTeam collection? I talked to somebody already
[00:25] http://archive.org/details/google-answers-archive
[01:00] Found some new stuff to archive in the RSS. Guess what? It's Yahoo!
[01:00] http://yahoo.tumblr.com/post/54125001066/keeping-our-focus-on-whats-next
[01:03] http://downloads.yahoo.com
[01:03] let's see
[01:03] that's probably important :|
[01:05] It's not GeoCities, but there must be something to archive!
[01:08] I mean, we don't need shareware downloads, but maybe Citizen Sports, FoxyTunes, Yahoo! Neighbors, and Yahoo! Stars India can be archived.
[01:09] It seems like Archive Team is turning into Archive Yahoo's Unprofitable Services Team.
[01:18] FoxyTunes makes sense
[01:18] it's dead since FF14
[01:19] What about the other stuff?
[01:20] Stars hasn't been updated since april and it's an India exclusive
[01:30] Citizen Sports is gone
[01:34] Ah, somebody already posted.
[01:37] Yup
[02:14] Anyone online?
[03:31] Citizen Sports is 100% gone.
[03:31] oh
[04:20] What's the best way to back up Direct Messages?
[04:21] On Twitter
[06:54] "we don't need shareware downloads"? ORLY? What if it isn't something that was only available for download there and hasn't been archived yet?
[06:56] granted, we don't really need the AVG, FF, IE, Flash, etc. at this time
[06:56] most of the popular shareware programs are already available cracked on other sites
[07:33] Impressions about this: http://www.zdnet.com/googles-blogger-to-delete-all-adult-blogs-with-ads-in-three-days-7000017451/ ?
[07:33] woah
[07:36] If 255k users is an accurate estimate of Geocities @ http://answers.google.com/answers/threadview/id/13908.html
[07:36] (note: 2002)
[07:36] How do you back up images on blogspot?
[07:36] then this probably dwarfs the Geocities shutdown... even a small fraction of 110M blogs that get automatically marked as 'adult' for something or other, which have ads
[07:37] and, to boot, a lot of it *is porn*
[07:37] People love porn.
[07:37] I doubt we'll lose any porn from this
[07:38] I'm betting most of it is reposts from other sites
[07:39] Still
[07:39] there are probably some interesting sites that might be lost
[07:39] like this for example: http://blackboardsinporn.blogspot.com/
[07:39] What I mean is, no shortage of volunteers if someone gets on organizing the project very quickly
[07:39] I am backing this up ATM
[07:39] Is there any way to find out what blogs are marked as adult?
[07:40] so...
[07:40] the policy is variously described as
[07:41] 1) all sites labelled by Google with an adult content warning which have ads (either automatic or opt-in)
[07:41] all adult content sites with adult content ads
[07:41] and all sites with adult ads
[07:42] however, here is one of the takedown notices -
[07:42] http://static02.mediaite.com/geekosystem/uploads/2013/06/blogger-policy-640x376.jpg
[07:43] "For clarification, the current Terms of Service calls adult content, “images or videos that contain nudity or sexual activity” and says that it permits adult-themed blogs as long as they are properly marked as such in the Blogger settings. It also states that it does not allow users to “create blogs where a significant percentage of the content is ads or links to commercial porn sites,” which would appear to gel with this upcoming purge. However, some people are receiving this notice despite never having used their Google account to create any kind of blog in the first place, which doesn’t inspire a lot of faith in Google’s ability to differentiate between exploitative commercial ventures and thoughtful sex-positive websites for grown-ups. It’s also incredibly unclear what Google considers to be an “advertisement to adult websites” in these new guidelines: if you run a site that reviews adult toys, for example, but that never links to “commercial porn,” you might have been safe under the old TOS but not under the upcoming one. Or you might be fine."
[07:45] "The letter, apparently sent out earlier this week – just days ahead of the deletion – is part of a crackdown on pornographic advertising on its Blogger network, reckoned to have around 100m blogs. Normal estimates suggest that around 10% of blogs on large networks are pornographic."
[07:47] how are new projects initiated?
[08:33] write wiki page, write pipeline code, configure a tracker, set up an upload target, set up megawarcing on upload target
[08:34] see https://github.com/ArchiveTeam for sample -grabs and universal-tracker for a tracker you can test locally
[11:11] Hi folks. I've been running a warrior for a couple of months now. I'd like to spin up a few instances on the same host, so to do that I need to change the default HTTP listen port that run-warrior connects to.
[11:12] I've modified /home/warrior/warrior-code2/src/seesaw/run-warrior, but that's not being picked up.
[11:12] ...nor is a change to /usr/local/bin/run-warrior
[11:13] Any tips on what to change?
[11:14] do you need the http server? I just --disable-web-server
[11:14] Hi ivan. Don't I need that in order to configure my nick and the project I'm helping?
[11:15] This is my first time actually logging in to a shell on the warrior appliance, so I'm just beginning to explore
[11:15] I dunno, sorry, I've been running pipeline without warrior code
[11:15] I'm comfortable with Linux
[11:15] Ah OK.
[11:15] Thanks.
[11:16] You listed a command line argument there, I'm not even sure which script I ought to give the argument to! :-)
[11:17] wherever the run-pipeline command is
[11:17] Ah, thanks. I'll have a rummage!
[11:17] Hm nothing of that name in the process table
[11:19] The file is in /home/warrior/warrior-code2/src/seesaw
[11:25] Nope, that didn't do it.
[13:29] benklaase: Are you trying to run several warrior instances? Or several pipeline-projects in one warrior?
[14:19] Hi @ersi, just seeing your reply now.
[14:19] I'm running several (well, two) warrior instances.
[14:20] What would you recommend?
[14:21] The google thing is a fuckin mess
[14:22] Burninate: Run it
[16:40] just got another 400gb of game files for the ispygames collection
[18:00] weird. my warrior vm rebooted at some point
[18:02] 18 hours 2 minutes ago
[18:03] What is AT doing about AltaVista closing? Is there anything we can save?
[18:24] altavista has just been a branding around search.yahoo.com for years, nothing to do afaics
[18:24] well, we can remember the first time we used altavista. I think I used it in 1995 :)
[18:25] 1.5 days left for greader
[18:29] Yeah, Altavista's a bearskin at this point.
[19:09] I used it along with Ask Jeeves (no more jeeves now, haha) in elementary school.
[19:10] idk how long it's been ask.com. I stopped using those when I got to sixth grade and started yahoo
[22:39] so it looks like a very good thing i'm archiving g4
[22:39] it looks like it's not in the tvnews search page at all
[22:40] but there is a collection for g4 in the tv archive collection for some reason
[22:43] evening
[23:32] ugh
[23:32] 145G googlegargle
[23:32] where may I upload?
[23:32] 145 gigs of google video stuff from april 2011...
[23:33] give me an rsync server and I'll upload it. it's eating space here ;(
[23:33] SmileyG: did you grab the rest?
[23:40] joepie91: you there?
[23:41] yes, hai
[23:41] quick question, may I PM?
[23:41] :P
[23:41] sure, you can always do that without asking
[23:41] (if it's a useful question that is, doesn't go for random people asking me 'asl')