[00:27] So the common crawl is 81 TB and just the url list is 437gb
[00:27] 5 billion unique urls
[00:28] i got a 3 part techtv bestbuy ad
[00:29] all 3 parts puts it at 18mins
[00:29] so looks like we have most of the half hour ad
[00:51] i published warcat 2.1 which seems to be stable. if anyone has some time and some large warc files, try the verify feature on them. (see github.com/chfoo/warcat)
[01:13] Heiii
[02:11] What privacy plugins do people use for their web browser? I currently use: BetterPrivacy, Flashblock, Ghostery, HTTPS-Everywhere, and NoScript
[02:12] Also I setup a separate firefox profile which I run just for social media and the like so no cookies or tracking
[02:13] I use nothing at all. I've got flash turned to "click to play"
[02:14] flashblock isn't needed anymore with current firefox, there's a pref now
[02:14] plugins.click_to_play
[02:18] Also social media sites are fucking javascript hogs
[02:19] they could pull a session down before, now fuck them
[02:22] you must hate yourself
[02:28] ^
[02:56] omf_: i use adblock plus, noscript and beef taco
[02:56] ghosetry has some ..annoyances... with noscript
[03:01] I am installing beef taco now
[03:02] I like how adblock plus does not need a restart to work
[03:05] sad that the original taco had its trademark/IP sold and is now actually adware/spyware itself, which was once software which PROTECTED privacy
[03:05] major irony
[03:05] hence the creation of beef taco
[03:05] oh and don't forget to enable 'do not track' in firefox as well
[03:06] options->privacy->tell websites i don't want to be tracked
[03:06] and also disable 3rd party cookies.
its very VERY rare to have a site that uses those for anything except tracking
[03:07] to do that, go to options->privacy, set the 'firefox will' thing to 'use custom settings for history' and uncheck 'accept 3rd party cookies'
[03:08] if you are sufficiently paranoid you can make it clear history on every shutdown
[03:08] I think that's set to be the default for an upcoming release- no third party cookies
[03:08] I have DNT enabled and already have 3rd party cookies disable. Yes dashcloud it is in the next version
[03:24] http://securitywatch.pcmag.com/security/310268-bing-delivers-five-times-as-many-malicious-websites-as-google
[03:26] btw bing is the default 'external' search engine in thunderbird
[03:26] and there is no way short of patching xml and ini files by hand to shut it off
[03:26] which i find completely idiotic
[03:27] there's no 'manage search engines' thing like in firefox
[03:28] that is dumb
[03:31] apparently its because microsoft made a deal with mozilla for both firefox and thunderbird, while google only made one for firefox
[03:39] For chromium I got adblock plus, disconnect, ghostery, https everywhere,
[03:39] The TACO extension is the same losers who did the firefox one
[03:40] the major problem with chrome is theres no way to shim noscript in there to block javascript completely, which may have been a deliberate decision by google to prevent people from completely blocking their ad network tracking stuff
[03:40] theres some similar 'stuff' but its nowhere as complete as noscript is, and most can be bypassed trivially
[03:41] that's the main reason i use firefox
[03:41] if chrome had a fully functional noscript version i would probably switch to it
[03:41] I keep chromium around for testing websites
[03:42] The firefox ecosystem has way more plugins. Just take a look at the chrome main categories vs addons.mozilla.
Mozilla has a privacy category, google does not
[03:43] theres also the original 'adblock' for chrome (which has the cute 'donate and enable catblock' thing)
[03:43] catblock replaces all ads with pictures of cats
[04:16] sounds good
[07:39] :D
[07:40] my computer at home features random catsploion technology
[07:40] Randomly when using it cats will appear
[07:40] all around me...
[07:40] this is what happens when you live with 5 cats.
[07:43] all the crazy people I know live with cats
[07:43] this scares me
[07:45] well cat poop makes you crazy for a start
[07:48] Well, maybe you should record that, the internet is full with cats anyway :)
[07:48] grumpy cat, cheesburger....
[07:49] ...sockington...
[08:48] hmmm whiskey and whisky
[08:48] I like boht
[09:02] GLaDOS: we broke anarchive? I can't ssh in
[09:03] Possible it's the upload of the wowvault to IA...
[09:14] so i found out that fireflyfans.net has podcasts
[09:15] gong to try to get firefly talk podcast
[09:20] Smiley: how did you..
[09:21] how did I what?
[09:21] I blame omf_ :D
[09:22] Well then..
[09:23] debug1: Connecting to 37.59.60.160 [37.59.60.160] port 22.
[09:23] debug1: Connection established.
[09:23] hangs after provinding the keys
[09:24] use ssh -v
[09:24] it is more chatty that way
[09:26] yea hI have
[09:26] -h
[09:26] It's just waiting for the box to either accept hte key or not
[09:33] Gah, host provides no management options.
[09:35] D:
[09:35] can just wait
[09:35] omf_: might know what happened
[09:37] Smiley: have an NIC handle that I can tie the server to? (OVH)
[09:39] hmmm a wat?
[09:39] Basically, an OVH account.
[09:39] Can
[09:39] 't get one, don't reside in the UK
[09:39] https://www.ovh.co.uk/cgi-bin/en/nic/newNic.cgi here have fun
[09:41] weird that page just plain doesn't work for me
[09:41] Oh, no script ¬_¬
[09:42] An error 500 happened.
[09:42] ait
[09:42] Wait
[09:42] There's a .com version
[09:42] it works now....
[09:43] You have just created your customer ID: bt59399-ovh
[09:44] Just as I made one on a different site of theirs..
[09:44] They really need to sort that out.
[09:46] lol
[09:51] Rebooting..
[09:54] Smiley: accessible again.
[10:21] ok
[10:21] rebooted :(
[10:22] sry
[10:22] tis ok
[10:22] :D
[10:22] least it means my mega script has stopped finally
[10:23] M-M-M-MEGA SCRIPT
[10:24] * Smiley loads up BlueMax in the script
[10:24] baibai now
[10:24] * BlueMax bluescreens
[10:26] * BlueMax throws Smiley out the window
[10:28] * GLaDOS windows BlueMax out the Smiley
[10:28] * Smiley sounds the trumpet
[10:28] * BlueMax is windowed
[10:29] one eight of posterous left to download
[10:29] jesus christ the Warrior is a powerful thing
[10:30] 1/8th?
[10:30] Wait, where are you getting those figures from? The tracker doesn't list everyone :(
[10:31] I was going by items
[10:31] 3.7 million done 670,000 to go
[10:31] yeah, hmmm
[10:31] I mean nearly 6TB has gone through the warrior network
[10:31] that just astounds me
[10:33] *7TB
[10:33] Yah it's impressive.
[10:55] I see an archiveteam warrior twitter account has popped out of nowhere.
[10:57] * GLaDOS hides
[10:59] You are very energetic. Just don't overstep.
[11:00] I'll try not to.
[11:00] You will find that "hmm, nobody has gotten back to me with what I think I need in a time limit of my own choosing, I will go steam ahead" is a life of thrills and lonliness.
[11:04] Ah, the way I feel about posterous
[11:04] * Smiley hopes to soon fuel the steamroler.
[11:04] +l
[11:08] That reminds me, I have to continue migrating all of my things over to a different VPS
[11:08] why? :(
[11:09] 55AUD/month compared to 19AUD/semi-annually
[11:09] For about the same
[11:09] Just less IPs.
[11:11] fair enough.
[11:11] Oh, and there's the wiki cleanup..
[11:12] Yeah, I'll enqueue that now.
[11:14] Hopefully I kept the scripts that I used to do so..
[11:25] SketchCow, it was my idea, the Twitter feed
[11:27] Ooh, yes, found the cleanup scripts.
[11:30] Wow, you had an idea to have a twitter feed?
[11:30] Patent that shit
[11:30] Isn't that illegal?
[11:31] Patent that illegal shit
[11:32] you know, there's beating a dead horse, then there's beating the dustpile remains of a heavily bashed dead horse
[11:32] But I love beating dust..
[11:32] no you love beating Chell
[11:33] Nah..
[11:33] And that is how you defeat a BlueMax.
[11:34] * BlueMax notes the giant red horse target on his back
[11:34] Or you use a twist tie
[11:36] What's a twist tie
[11:36] I see..
[11:39] Isn't twist ties illegal?
[11:40] I heard mentioning that they were illegal was illegal..
[11:40] oh they're cable ties
[11:40] duuuuuuuh
[11:41] * BlueMax slaps himself so hard his head turns black
[11:41] We're on day 4-5 of no FOS.
[11:41] I'm getting unhappy.
[11:43] do you know who's fault it is?
[11:45] Also http://i.imgur.com/NhGpv3c.jpg
[11:46] now I'm pretty sure that's illegal
[11:53] About to set the Bitsavers process to be automatic.
[11:53] That should be exciting - that collection will just grow randomly each day.
[12:11] hmmm
[12:11] I ponder if I should rant.
[12:12] On one hand I have some stronger growing feelings, on the other hand I have a charity which does something amazing.
[12:12] I also wonder if this is a simular thing SketchCow is feeling (?)
[12:20] I feel many things.
[12:21] http://bitcoin.clarkmoody.com/ is worth watching
[12:29] http://www.youtube.com/watch?feature=player_detailpage&v=GdV4pr4frd4#t=10s
[12:30] SketchCow: I meant about FOS being down.
[12:36] ok that site just died SketchCow
[12:36] did you tweet it or something o_O
[13:45] Still up
[13:46] I am not THAT influential.
[13:49] ;)
[13:49] I'm just tring to see if I can sort out mining here on my work machine for lols
[14:07] never know when your gonna get randomly lucky and hit a block.
[14:07] maybe we should have a archiveteam pool ;D
[14:07] What can the warriors do when idle? BITCOIN MINING ;)
[14:07] Or a project to help raise money? :/
[14:16] Oh wow, let's do the opposite of all that
[14:19] yes, I'm joking
[14:19] bored at work and thinking random stuff.
[14:19] Oh wow, record companies clearly buying views via bitcoin, fun :D
[14:32] so i'm getting g4 images in warc
[14:32] based on the g4tv.com/images/ html
[14:33] the index file is 10.6mb
[14:33] should take sometime
[15:02] GLaDOS, or Smiley around?
[15:02] I actually get some sleep and the server needs to be rebooted
[15:03] yo
[15:03] it has been
[15:03] I saw
[15:03] GLaDOS: did it earlier, we wanted to ask you first ;D
[15:03] Oh, AGAIn?
[15:03] I am logged in, can someone reset my password so I can switch to root, I had a hard crash locally yesterday so I did not save that pass
[15:04] I want to look at the logs and find out what happened
[15:04] arugh use keys :P
[15:04] and yes I'll fix now for you
[15:04] Keys does not allow sudo -s
[15:04] I mean I cannot use ssh keys to do sudo -s
[15:04] sudo su works...
[15:05] shell?
[15:05] what is sudo -s?
[15:05] yep
[15:05] o_O
[15:05] Smiley@anarchive:~$ sudo su -s /bin/sh
[15:05] hi
[15:05] sh-4.1# echo "hi"
[15:05] Bitsavers is now automatically mirrored at archive.org.
[15:05] :D
[15:05] hmmmm that works tho
[15:05] BING
[15:06] Smiley@anarchive:~$ sudo -s /bin/sh
[15:06] sh-4.1#
[15:06] Also that works omf_ so.... not sure what you broke
[15:07] SketchCow: as in, automatically grabbed, or they are actually pushing into IA?
[15:07] Smiley, how are you not getting this. I do not know my user account password. I can log into the machine because I setup ssh keys. I was still using the pass glados had generated
[15:08] I pm'ed you new pass, but.... sudo shouldn't even ask pass
[15:08] Also I am pissed but not at any of you.
[15:08] We had huge losses because of the reboot
[15:08] lol i know the feeling bud, fix what you need to fix, and we talk later.
[15:08] Yeah it sucks, :/
[15:08] Need to learn from this.
[15:09] I had a tool monitoring the network connection I checked every hour
[15:09] nethogs
[15:09] To see if anything was timing out
[15:11] Nod.
[15:12] messages has nothing useful
[15:12] Was the wowvault upload going?
[15:12] no. The uploads I had done were finished before I crashed out
[15:13] Ok, weird.
[15:41] http://hima.gptouch.com/games/jurassic_heart/
[15:45] http://commons.wikimedia.org/wiki/File:Muzzle_gag2.jpg THANKS WIKIMEDIA
[15:47] * Smiley doesn't click.
[16:08] SketchCow: https://commons.wikimedia.org/wiki/File:Caprice_des_Dieux.JPG
[16:08] yes, they actually pixelated the photo because the drawing in the logo is copyrighted.
[16:33] Hilarious
[17:09] SketchCow: they have saved us from a global copyright disaster!
[17:09] :P
[17:47] i'm hoping to get this video here: http://web.archive.org/web/20001017234457/http://techtv.com/techtvnews/politicsandlaw/story/0,3685,2591998,00.html
[17:47] George W. Bush's Exclusive Interview with TechTV: Transcript
[18:05] urgh nI need to do more tidying
[18:07] but first, a bit of gaminjg :D
[18:48] found 2 part interview with Jeffrey Katzenberg
[18:49] one of the founders of dreamworks
[22:51] "Suspension Reason: Huge connections"
[23:08] jk[SVP]: what host?
[23:10] uploading april 2008 shows of spark
[23:30] semoweb, there was a promo on lowendbox
[23:31] "You have totally made more than 2.5K connections."
[23:31] lol