[00:06] Sixty (60) Dell R710 computer servers; [00:07] I imagined megaupload had more [00:07] there are probably still servers in HK [00:09] true [00:09] another salvo in big content's fight to cut off independants? [00:10] Nah, I think independents are just collateral damage [00:10] It's a bit tinfoil-hat-y to believe these things happen due to independent/non-signed artists [00:10] After all, bandcamp [00:13] from wikipedia it sounds like they're incorporated in hk but don't do anything there [00:34] so SketchCow, is this the start of another big floppy copying push? [00:34] excuse me, didn't you guys learn NOT to copy that floppy? [00:35] also home taping is destorying music [00:35] destroying also [00:36] and remember VHS tape recording is like the Boston Strangler [00:36] fuck, I'm afraid now [00:37] what if they come to take my 1985 warez? [00:37] extreme home taping! [00:39] I just finished downloading all game patches on direct2drive.com [00:39] just in case they fuck up the transfer to gamefly [00:47] is there even any interest om archving RS/MU though? [00:47] most of it is either warez, or personal files that are irrelevant to anyone but the owner [00:48] and even if you want to archive the warez, legal issues aside, it would be useful because most of it is uploaded in passworded .rar files (the passwords are given out in the forums/sites that provide the links) [00:48] so you would end up with petabytes of duplicated information (there must be thousands of copies of a popular movie at RS or MU) which is passworded, and thus useless [00:49] fbi.gov down for anyone else? [00:49] Getting a footprint 503 error [00:49] damnt you're making us all part of your ddos effort [00:49] HTTP/1.1 503 Service Unavailable [00:49] Server: Footprint 4.8/FPMCP [00:49] hahaha [00:49] lol [00:49] yeah down for me too [00:49] yup, down for me too [00:50] I get a valid HTTP response and body [00:50] buess anon is ddosing it too [00:50] *guess [00:50] i don't even get a response, it times out [00:50] increase your timeout [00:51] how can i do that? :/ [00:52] what might be a decent archive effort would be, maybe, .torrent sites. you could archive/index different torrents and save the .torrent file, allowing it to be saved and re-used in the future - if you're interested in backing up such sites [00:53] I'm already doing piratebay [00:54] the problem comes when they turn to magnet URLs only [00:54] maybe I should look into creating a simple magnet scraper [00:54] but still they'll onyl work as long as the tracker is up [00:54] DHT? [00:54] not necesariyl theirs, but someone else's tracker [00:54] you should start a magnet scraper that creates .torrent files. [00:55] That would be rad [00:55] chronomex [00:55] we already do that ;) [00:55] what hi [00:55] ah yesh i was forgetting about DHT... i guess that would work [00:55] ok [00:56] DHT and magnet == no chance of takedown [00:56] unless they outlaw non-recognized protocols [00:56] problem with DHT is distribution of torrent meta on low seeded content [00:56] will take a while to search the DHT network [00:56] to get ot [00:56] it** [00:56] but as i said [00:56] we are indexing all DHT torrent meta [00:56] well, slowly [00:56] i think we are at 14mil torrents [00:57] linux is like that knife i keep in the vase behind my sink that i use to cut big poops so i don't have to use the plunger. it's useful on rare occasion and for dirty jobs and i feel awkward about it afterwards :3 [00:57] scraping the .torrent files? [00:57] http://zoink.it/sync/all201112.txt.bz2 <-- hash list as of end of 2011 [00:57] that is a list of everything zoink.it is hosting [00:57] cool [00:57] we have a DHT torrent scraper [00:58] developed by the guy who made Vuze [00:58] Sweet, I wanted to do one when I have the time [00:59] it requires about 260 IPv6 IPs atm [00:59] Though, I am more interested if the theory with the DDoS via fake announces on DHT really works or if you can't reach a high enough volume [00:59] Why does it need so many IPs? [00:59] due to how DHT works [00:59] and to spread the listeners [00:59] to catch as many torrent meta as possible [00:59] wait, only 260? [01:00] it grows [01:00] it's at 260 atm [01:00] started with 10 [01:00] also, why IPv6? [01:00] cause we have many IPv6's available [01:00] but can you cover the whole hash space with so little? [01:00] no [01:00] also, won't IPv6 exclude you from seeing a lot of the DHT? [01:01] but it has to naturally grow [01:01] otherwise DHT will be flooded [01:01] which is not good [01:01] but yeah, we are at about 14mil torrents [01:01] it's growing [01:01] we do probably 250k a week [01:01] can you inject hashes? [01:01] we just listen [01:01] nothing more [01:01] i could ask [01:02] I mean, take hashes from another source to populate your db [01:02] like piratebay [01:02] or http://publicbt.com/all.txt.bz2 [01:03] how resent is that? [01:03] http://publicbt.com/ - "An aggregated scrape file for the entire cluster can be found here. The file is updated every 5 minutes. and is ~50 MB, instead of the ~125 MB from each node in the cluster." [01:03] 60MB [01:03] mine is 293MB [01:04] Do you only store the infohash or do you download the torrent metadata as well? [01:04] well [01:04] Awesome. [01:04] if you /torrent/{hash}.torrent [01:04] you can get the torrent [01:04] So you put all scraped results on zoink? [01:04] it was also pushed onto torrage.com [01:04] but it had a hdd crash [01:05] "Torrents that have not been downloaded for a period of 6 months will automatically be removed from the system." [01:05] :( [01:05] it doesn't get nuked [01:05] we just say that :) [01:05] ah [01:05] NovaKing: Is it ok if I download all torrents from zoink.it? Any rules to follow? [01:05] well [01:05] i prefer to push to you [01:05] it's about 550gb atm i think [01:06] Just imagine all this imported into a searchable db [01:07] it exists ;) [01:07] Filesystem Size Used Avail Use% Mounted on [01:07] /dev/sda1 927G 368G 560G 40% / [01:07] 560GB [01:07] http://torrindex.com/ <-- searchable index of what torrage had in it's system [01:08] To be honest I am not so much interested in "where can I find House season 5 as torrent" but more in other stuff like average torrent size, biggest torrent and so on [01:09] Even cooler if could have scrape (as in tracker scrape) data as well [01:09] NovaKing: you got stuff that's not on torrage, apparently [01:10] NovaKing: Can I PM? [01:12] don't fucking ask permission to /msg [01:12] I kinda hope someone with too much money and a fat connection is sitting somewhere fetching everything on usenet [01:12] drives me up the wall [01:13] It's the polite thing to do. [01:13] no [01:13] it doesn't make sense [01:13] do you ask permission before you email someone? [01:14] Nope, but email is not the same as IRC private message [01:14] if you don't want random /msg, set yourself +g [01:14] soultcer: right, irc is less personal [01:15] That's why asking for permission to PM makes it so much more classy ;-) [01:15] that's true [01:15] fuck that shit [01:15] I'm just used to channels where that's required [01:15] i agree chronomex, i think most people ask permission because some people get a lot of them and have developed quite teh sensitivity to being msg'd hehe :) [01:15] the only reason to talk about /msg in public channel is if you think someone is ignoring you / has a client that isn't dinging them [01:16] if you go to a tech-support channel with thousands of users, a single expert can easily be flooded with questions for anything he says [01:16] and that can make it hard for them to distinsguish urgent messages from those hundreds of tech-support messages... but yeah they should jjust +g themselves [01:22] I just watched the Megaupload song for the first time [01:22] Me too [01:22] I do now fully support the US government in shutting them down [01:22] That song is awful and needs to be nuked from orbit [01:23] haha [01:23] the point of the song wasn't to be good. [01:24] the song will be exhibit A on the trial that the guys were a menace to society [01:26] yipdw: are you sticking your sopa warcs anywhere ? [01:29] I've got < 0.5 gig but it has screenshots of sites too [01:53] http://i.imgur.com/rR592.png [01:58] well this has to take the price for 'most effort on an under construction logo' http://kvartirakrasivo.ru/404/index.php [01:58] prize [02:00] wow [02:01] every day he's shufflin' [02:01] woah [02:01] it rotates with the mouse position [02:36] http://9gag.com/gag/1951427 [02:38] Hahahahahah [03:05] ok, back [03:05] Ymgve: torrage.com's hdd died [03:05] we keep our sites in sync [03:05] i hadn't done it in 2 months though as i was on holidays [03:05] so we were about 1mil out of sync [03:05] ah [03:06] i will be syncing it back up with torrage when he gets the new hdd cluster up [03:06] underscor: pm away [03:06] soultcer: scrapping that many torrents = a lot of resources [03:07] and relies on trackers being up at the time [03:07] scraping multiple tracker [03:07] getting the uniques [03:07] and DHT scraping is not viable [03:07] there is a bloom scrape spec in proposal [03:07] but that is only a rough guess average [03:08] as DHT is decentralized [03:23] chronomex: I think people tend to ask permission because there are some networks/people who get really cranky if you don't, so it's like you can't win either way. :( Though I personally agree that it's kind of stupid to expect someone to ask permission first. [06:58] tef: yeah [06:58] tef: I'd like to make an index of them first and dedupe them [06:58] as I had an error in my grabber that grabbed like 50 copies of many articleds [06:58] but I have been just rsyncing to batcave [07:58] ah [08:00] cool [09:55] so, I've deleted my local copy of Splinder because I needed some space for a while and have now some space free [09:56] how ca I help with Proust (or something else)? [10:10] Something is wrong with the wiki. A user that has been blocked months ago just created a spam page?!? [10:12] Hm, maybe because the account got deleted back then [10:15] who deletes accounts :o [10:15] ah, that evil extension [10:16] We had thousands of fake spam accounts in the form of "name nn" where nn was some integer and needed to get rid of them [11:07] https://www.youtube.com/watch?v=wyzwA5Qjd20 [12:42] The day after SOPA protest, MegaUpload is closed. LOLOLOL. [12:43] By the way, I had an account with Jamendo albums and free culture books there. [12:50] Has anyone got a copy of MegaUpload? [12:50] Rapidshare probably ;-) [12:52] If they take down MU, they will take down all them. [12:52] This is probably only a test. [13:03] Hmm... MU is down here too, Sweden [13:04] down everywhere [13:05] click on More on the side here http://www.filestube.com/search.html?q=gameboy&select=All [13:05] thats how many file services there are, not counting all chinese ones [13:05] mission impossible :) [13:06] What.s really eye-opening about this indictment is the property that the Feds have seized from the defendants. It lists a number of bank accounts, PayPal accounts, 15 Mercedes-Benz vehicles, a Rolls-Royce with the license plate .GOD,. a rare Lamborghini and a Maserati. It seems the defendants had a number of vehicles with creative license plates including .HACKER,. .POLICE,. .STONED,. .GOOD,. .CEO,. and the ominous .GUILTY.. (See below for the full l [13:06] Whoa, a license plate that says guilty? [13:06] They're SOOO screwed [13:13] I hate when shit like this happens because I don't remember exactly what I had on my account [13:13] And worse, if anything there was my only copy [13:20] MegaUpload has probably a lot of money, so, this may be a cool legal fight. But, they may pay a big amount and avoid the case. [13:23] He MIGHT be able to settle this out of court, BUT I doubt my beloved files are coming back ._. [13:26] On 18 January the Supreme Court of the United States upheld the Uruguay Round Agreements Act (URAA), which may result in the deletion of thousands of files on Wikimedia Commons published outside the US between 1923 and 1977. The community is discussing appropriate action to take. [13:27] http://commons.wikimedia.org/wiki/Commons:Deletion_requests/All_files_copyrighted_in_the_US_under_the_URAA [13:28] nitro2k01: It's always a bad idea to only have one copy [13:28] Of course [13:28] If it was the only copy of something, it probably wasn't something important [13:29] The problem more has to do with not knowing what I actually had on there [13:29] A bit obsessive, yes [13:30] No, I feel ya'. I hate the thought that I didn't know what was lost, more than the fact that I lost something [16:27] cool scored a superdisk drive at work [18:03] Nemo_bis: if you want to help with Proust, I've got some scripts but no way to deduplicate [18:04] well actally [18:04] what I *really* need for Proust is some way to access private content of profiles [18:04] I have not been able to find one [18:08] Nemo_bis: although I think where effort is really needed is mobileme [18:26] We have to move to mobileme now. [18:27] on that note, I've been uploading a few hundred gigs of mobileme for the past week now [18:27] why do Apple users store so much shit in the cloud [19:00] SketchCow: I've had good success using apple II + adt for flippies [19:00] it seems to be more successful at reading disks than the kryoflux even although that may be down to a crappy pc 5.25" drive [19:01] caveats: copy protected disks don't work, and in my experience you need a pc with a real serial port, the usb serial is crap (at least the one I got) [19:02] I'll need to get my real commercial disks properly kryofluxed at some point but for the stacks of warez and user group disks, a sector dump is fine [19:02] wow.. adt.. had forgotten about that.. used to use it all the time [19:05] Yeah, that's true. [19:06] yipdw, ok, then I'll do some MobileMe... though it seems a huge task, too huge for my hard disk [19:09] how much storage do you have? [19:09] I'm just offloading to a 1TB disk [19:10] I'm using my desktop, 500 GiB total and some 160 free or possibly free [19:10] although I can rsync --delete [19:10] $ sh get-wget-warc.sh [19:10] get-wget-warc.sh: 19: builtin: not found [19:12] GNU bash, version 3.2.48(1)-release (x86_64-apple-darwin10.0) [19:12] haven't seen that one before [19:12] Copyright (C) 2007 Free Software Foundation, Inc. [19:12] I know this version works [19:12] as should bash 4 [19:12] are you sure sh is actually bash? [19:12] that's not true on many systems [19:13] hm, you're right [19:13] sorry [19:14] I'm not tech-savvy, just pretending [19:15] how many streams are needed to download at 10 Mb/s, more or less? [19:15] and for what it's worth, I think downloading Mobileme and e.g. RapidShare are equally risky wrt copyright minefields [19:15] there is some serious shit in these public.me.com directories [19:15] oh? [19:15] look not, lest ye be corrupted [19:15] I'm just looking at the URL files [19:15] they could be lying [19:16] but that's all the MPAA does, so [19:16] ahahaha [19:16] hipsters got in a huge tizzy over 37signals looking at log files last week [19:16] so.... [19:16] what, them looking at their server and app log files? [19:17] http://37signals.com/svn/posts/3076-i-heard-you-like-numbers?58#comments [19:17] oh [19:17] jesus christ [19:17] MMhMMMM [19:18] SHIT YOU SEND TO OTHER PEOPLE CAN BE SEEN [19:18] NO FILM AT 11 [19:18] I'm more worried about them having /svn/ in the url [19:19] the followup is pretty lame [19:19] http://37signals.com/svn/posts/3078-trust-is-fragile#extended [19:21] there's a post in there by an Oracle employee that is ridiculous and completely misses the trust relationship [19:21] "you can tell your customers you are using ORACLE DATABASE VAULT and give them absolutely no way to verify that anything you are saying is not bullshit, but IT WILL MAKE THEM FEEL BETTER" [19:22] yipdw, do I need to change the grep errors checks to fit Italian output again? [19:22] certainly there are probably business and technical reasons why MySQL has no similar feature, but I think the fact that server-side encryption is completely useless in a user-generated content setting also has something to do with it [19:22] Nemo_bis: not sure -- it's possible [19:49] anyone know who might have archived the downloads from oracle.com from some years ago? I'm looking for a retired product that was available on the download page a few years ago, but since they retired support for it they're being less than helpful :( [20:03] Dark_Star: check ftp sites [20:03] they never get cleaned up [20:07] Dark_Star: What are you looking for, exactly? [20:20] Oracle for VAX (don't laugh ;-) [20:27] excellent [20:27] wait they had a free download of that? [20:46] yep, they had a lot of their DBs available for free in the last years, but for VMS/VAX you maybe still needed a PAK to activate it, I guess. That's what I wanted to find out. [20:51] But I can get the PAKs (DECs fancy term for "license keys") for free from the OpenVMS hobbyist program if they're indeed needed [21:55] http://depot.ninjawedding.org/cds.png [21:55] dum di dum [21:56] guess I should archive his statements now [23:59] the Whitehouse has released a response to the petitions about scanning all of the governement's private records: https://wwws.whitehouse.gov/petitions#!/response/digitizing-federal-public-records