[00:09] have you guys seen this? https://twitter.com/#!/herpderpedia [00:10] Yeah [00:10] hahahaha [00:11] the oatmeal's kitten bbq gif is also hilarious [00:14] there was also this, but he stopped: https://twitter.com/wikipediawtf [00:23] http://www.vice.com/en_uk/read/lamar-smith-sopa-copyright-whoops [00:28] http://techfleece.com/2012/01/18/screenshots-of-sopa-pipa-website-protests/ [00:35] man [00:35] the news is an echo chamber [00:39] which for us is a good thing i guess [00:43] "Don't cry. Disney owns the rights to that emotion." [00:44] hehe [00:44] (old: http://www.youtube.com/watch?v=uvXo4sGB7zM ) [00:44] yeah, i remember it [00:46] for the news files: http://www.zdnet.com/blog/facebook/facebook-ceo-mark-zuckerberg-talks-sopa-pipa/7600?tag=mantle_skin;content and http://www.facebook.com/zuck/posts/10100210345757211 [00:51] http://s3.amazonaws.com/theoatmeal-img/comics/sopa/sopa.gif - The Oatmeal's kitten bbq anti-sopa gif [00:54] thats funny [00:55] cool [00:55] Firefox default home page is black [00:55] not non-functional, but black with links to information about SOPA [00:59] Isn't it just about:home? [00:59] yes [01:01] hm, not black here [01:07] they just changed it, for some reason [01:08] [8:08:21 PM EDT] Kenji Nagahashi: (wide ingesting speed is approaching 6TB/day...) [01:09] wide is the crawl that feeds the wayback [01:17] ah, no [01:17] they didn't change it [01:19] it's just that the black out ended at 8pm eastern [01:20] mere seconds before you loaded the page [01:21] fucking easterners [01:23] heh, yea [01:23] Mozilla is in PST though [01:32] SketchCow: you've probably seen this, but: http://www.vice.com/en_uk/read/lamar-smith-serial-copyright-violator [01:32] db48x: I'm in favor of a blackout that lasts until 23:59 everywhere in the world [01:32] which means, largest possible window [01:44] i think most sites worked off EST [01:45] est is for losers [01:46] whee I got half a gig :3 [01:51] balrog: yea, same here [01:51] btw, have you guys ever read Eastern Standard Tribe? [01:52] yeah, thats a neat story [01:52] indeed [01:52] a funny idea behind it too [01:56] Back [01:56] Yeah, it's amazing [01:57] (the Lamar thing) [02:03] yeah [02:03] specifically blocking IA [02:05] http://www.reddit.com/r/SOPA/comments/omdac/thanking_those_who_didnt_make_the_news_a_guide_to/ [02:44] weird [02:45] I have no sensation of taste or temperature in the left half of my tongue [03:08] SketchCow: I have more complete captures of sites yipdw listed (with a custom tool instead of wget) - how do I send them to you ? [03:08] I also have a shit load of dupes to process first [03:22] * db48x yawns [03:22] I need a movie to watch [03:22] haven't gotten much programming done, alas [03:22] but I'm calling it a day [03:54] * lemonkey waves [03:54] * PatC waves [03:54] I see myself via irssi [03:54] HALP [03:54] :) [03:55] CHANCE OF PUBLIC INVITE NEXT TIME: 0 [03:55] heh [03:55] sketchcow: LOL [03:55] :P [03:56] Yes, this is actually happening. [03:57] :D [04:01] I almost want to try and make g+ work to see what the fuck is happening [04:02] Someone just got offtopic [04:11] I just left [04:12] SketchCow: hey, you're in SF. how long are you sticking around? [04:13] Just to Friday [04:16] ah, so probably can't take you to dinner on friday night then [04:16] Man, fuck trixter. [04:16] Yeah, busy, sorry [04:19] well, next time you're here then [04:19] No problem, I'll be back in February AND march [04:19] Going to GDC [04:19] ah, cool [04:21] looking forward to seeing anything in particular there? [04:23] having the left half of my tongue not report taste or temperature is kinda surreal [04:24] my macaroni and cheese feels both hot and cold [04:33] What is this, some sort of stroke [04:36] SketchCow: I asked a question before: what do you think are needed default metadata fields in a floppy archival/preservation format? [04:36] there's probably a page on metadata that has the answer [04:37] https://wiki.ucop.edu/display/Curation/BagIt [04:38] SketchCow: I hope it's just the ansthesia wearing off [04:39] (after all, it's just my tongue and not the whole left side of my body :) [04:39] Right [04:39] NUMB ARM [04:39] TELL SHEILA I LOVED HER urk [04:39] heh [04:39] hah [04:40] Also: http://www.loc.gov/standards/premis/registry/premis-project_name.php?proj_ID=637 [04:42] What you will find is there's standards floating around and everyone is not done with them and everyone's a fucking wimp and they all drag in the mud for years. [04:43] OK, I'm heading home. Got a lot done but never enough. [04:43] BagIt seems usable [04:43] That guy, holy shit, he wiped me out. [04:43] oh? [04:47] SketchCow: just as an FYI, pre-blackout news stories are being uploaded to my module under batcave; the rest + an index are coming shortly [04:47] SketchCow: also, if you haven't already gotten tef upload access, his archives of some blackout pages are likely more complete than mine due to Javascript fuckery that I couldn't get wget to properly handle [04:48] also for shits and giggles I want to put all this stuff into a torrent and link it off TPB [04:54] also [04:54] http://questionablecontent.net/ [06:45] yipdw: zombo.com has a sopa thing [13:23] yipdw: also maddox xmission [14:28] yipdw: I can always just send you the warcs eh [14:37] yipdw: also I think I have screenshots of the sites on the list [15:58] hi everybody [16:00] Am I in time to help for splinder ? [16:47] altlabel: splinder is basically done [16:48] but there are other projects we'd be happy to have help with [16:50] db48x: oh, I see. Sorry I arrived late, I managed to get some free disk-space just yesterday night :) [16:50] :) [16:50] if you've got plenty of disk space, then the MobileMe project needs downloaders [16:50] I'd be happy to help [16:50] http://archiveteam.org/index.php?title=MobileMe [19:43] heh [19:44] of these news stories, I am finding that if we be incredibly generous and assume that ALL of the HTML in the news page itself is news content [19:45] it still only comes out to like 3% of the size of a WARC of said story [19:45] the rest is Javascript, HTML, and aids [19:45] er [19:45] ads [19:45] and s/HTML, // [19:45] AIDS [19:45] AIIIIIIIDS [19:45] lol [19:50] http://www.foxnews.com/scitech/2012/01/19/feds-shut-down-file-sharing-website/ [19:50] shit shit shit shit shit [19:50] Too bad we didn't archive it [19:51] ooooh http://www.hopenumbernine.net/ [19:51] oh crap :( [19:51] kimble, noooo [19:58] non fox http://online.wsj.com/article_email/SB10001424052970204616504577171060611948408-lMyQjAxMTAyMDEwOTExNDkyWj.html [19:58] wait, really? [19:58] I just accessed megaupload yesterdayt [19:59] oh [19:59] but they shut it down today [19:59] I GUESS THE FEDS HAVE MY IP [19:59] UH OH [19:59] minutes ago [19:59] nah [19:59] yeah I don't really acre [19:59] care [19:59] i do [19:59] that site was fast and nice [20:00] even though kimble is kind of a weirdo [20:00] I meant about the Feds having my IP [20:02] 127.0.0.1 [20:10] What I don't get is that they're a hong kong company [20:10] How does that work?????? [20:19] Some people were arrested in New Zealand apparently? [20:19] oh shit man [20:36] https://gist.github.com/1641705 [20:36] B R I L L I A N T [20:49] That is excellent [20:51] ! [20:51] hmm wayback can't archive gist because it always redirects to https [21:21] DFJustin: just git clone it [21:29] lol [21:31] ahahahaha nice [21:41] wayback can't access https only sites? [21:44] Megaupload was closed O.O [21:46] as far as I can tell it doesn't do https at all [21:47] Yeah [21:48] What I don't get is how they can do that when they're a hong kong company? [21:48] I mean, I guess they can seize the servers that are in the USA, but they had some in NL and other places [21:48] USA thinks they are "the best" [21:48] I guess NZ complied with the extradition request too [21:48] The images server is still online [21:49] "Federal agents and other law enforcement agencies simultaneously moved to search bank records and server farms in multiple locations around the globe, authorities said." [21:49] http://wwwstatic.megaupload.com/muimg/logo.gif [21:49] so they called their buddies elsewhere and said "hey shut this down on your end too kthx" [21:50] well now that's a poor show http://wwwstatic.megaupload.com/robots.txt [21:50] :( [21:51] :( [21:51] I like the US less and less every day [21:51] they actually expect people to believe that they would be making $500 million MORE if it weren't for megaupload??? [21:51] I hate the lost sales arguement [21:51] argument* [21:51] >:| [21:52] http://webcache.googleusercontent.com/search?q=cache:BUzX5jpdk8cJ:megaupload.com/+Megaupload&cd=2&hl=pt-BR&ct=clnk&gl=br [21:55] so it seems like downloading as much as possible from mediafire, rapidshare, etc would be a good idea [21:56] it appears so [21:56] i wonder if bayfiles will get bigger [21:56] Yes, [21:57] Thing about downloading mediafire/rapidshare/etc is that there's a pretty good chance you'll end up with illegal content [21:57] D: [21:57] Welcome to the World War III [21:58] This time there aren't guns... [21:58] Illegal content != Illegally distributed content [21:59] underscor: plenty of legal and legally distributed content on MU/RS/etc [21:59] I use mediafire for temporary uploads of stuff I produce [21:59] nitro2k01: Touché [21:59] balrog: No, I know that [21:59] I'm just saying that people who participate (if we decided to do such a project) need to be aware of the possible ramifications [22:00] underscor: well yeah [22:03] @underscor How many TB do you think it will be? [22:03] if you have to ask, it's too many [22:08] DFJustin: way more TB than mac.com/mobileme? [22:09] I think yes [22:09] how mant tb would what be? hosting something like MU? [22:10] think PB, not TB [22:10] RapidShare is one of the world's largest file-hosting sites, with 10 petabytes of files on its servers, and handling up to three million users simultaneously.[3] [22:11] O.O [22:11] it's a lot of data [22:13] how large is Usenet, in comparison? [22:14] Rapidshare is so dead [22:15] We're just kind of fucked with Archive Team getting that. [22:15] All I can say is the best policy, if we want to pursue, is look for prominent linkage and grab down things as they get prominence. [22:16] I would hope someone is doing that. [22:16] We should make a term for it and encourage people out there to do it. [22:16] Like Livetap [22:19] Out of curiosity, did you ever get any SERIOUS legal threats, SketchCow? (Not the million dollar one...) [22:19] Sure. [22:20] underscor, is there any public graphs for the overall IA server stats, like ganglia.wikimedia.org? [22:27] Nemo_ter: Not overall [22:28] underscor, awww [22:28] Every IA server has graphs at ia[6|7]00xxxx.us.archive.org:8088/mrtg [22:28] yep, I want moar [22:28] http://ia700002.us.archive.org:8088/mrtg/ [22:28] Oh, nope [22:28] Sorry [22:28] I was just looking at how much memory my derive is consuming [22:29] (40 GiB apparently? :o) [22:29] :D [22:29] SketchCow: You don't think we can archive 10 petabytes of data? [22:29] Nemo_ter: That... doesn't make sense [22:29] underscor, it's cached http://iw600302.us.archive.org:8088/mrtg/memv2.html http://www.us.archive.org/log_show.php?task_id=94284874 [22:30] Oh, yeah, okay [22:31] There's also the host_stats page, but I don't know if regular people can access them [22:35] no, login required