[00:16] SketchCow, payment in bitcoin, I'm interested in your view. [02:15] It's the opposite of a traditional automated teller that dispenses currency. Instead, these Bitcoin ATMs will accept dollar bills -- using the same validation mechanism as vending machines -- and instantly convert the amount to Bitcoins and deposit the result in your account. [02:15] http://news.cnet.com/8301-13578_3-57570925-38/need-bitcoins-this-atm-takes-dollars-and-funds-your-account/ [02:15] cc SmileyG [07:59] joepie91: hmmm [07:59] so they are buying bitcoin from the market, interesting [08:50] archiveteam is the only non-scummy scraping group [08:51] when you find a forum or site about scrapers it usually has SEO and other nonsense as the uses [08:51] they put the value on making money, not on solving the problem [09:13] glad to hear [09:13] in a way, sad in another [09:15] screen scraping has a weird stigma to it [09:15] Like it is bad [09:18] its bad for a infomation collection purpose [09:18] however thats not what we are doing. [09:20] I think the deep problem is very simple [09:20] People need to understand that when you put something on the internet it is for EVERYONE [09:20] nod [09:20] That is the core problem [09:20] 5th dimention. [09:21] We see these programmers writing about how big websites just need APIs [09:21] reddit has a great api [09:21] indeed, heard a stupid story about how someone at my wife's work posted photos of her baby info (scan date, etc) on instagram [09:21] take any url [09:21] people started talking about, and she went bonkers about how people were talking about her at work [09:21] :/ [09:22] add /.json to the end and bam [09:22] xbox.com is one which people screen scrape [09:22] they DO have a api, but its limited by number of requests and they only allow some people access each year. [09:22] well that is fucking MS for you [09:22] ;) [09:22] destroyer of everything [09:23] i thought that was IGN ;) [09:23] shit IGN was a couple of website [09:23] Cannot write to 'planetcivilization.gamespy.com/index.html?start_from=11' (Connection timed out). [09:23] FINISHED --2013-02-24 09:17:48-- [09:23] MS killed encarta [09:23] Total wall clock time: 6m 55s [09:23] Downloaded: 110 files, 4.2M in 58s (74.6 KB/s) [09:23] MSN, hotmail and hundreds of communities [09:24] You're making Yahoo sound good.. [09:24] :D [09:24] scary isn't it [09:24] * SmileyG sets MS speed to 1082mph [09:24] what about all the MSDN data that now only lives on stacks of cds in the back rooms of offices [09:25] they killed xda, how long till those forums are gone [09:26] we come down harder because Yahoo is an internet company and they should "know" better [09:26] Maybe we should get an insider into Microsoft and Yahoo, and get archives that way.. [09:28] Yahoo just killed all remote jobs and I am not moving [09:28] for MS it would have to be system admin at redmond [09:28] well easier would be to find someone who has all the MSDN cds [09:29] my old job had hundreds upon hundreds of them [09:30] a research university with data partnerships would be easest [09:45] if your looking for old M$ or msdn stuff i know someone. its a lot of stuff though [09:46] I know [09:46] it is important stuff [09:46] take directx for example [09:46] they documented the shit out of that [09:47] now as a side project I would be interested in mapping feature growth in it vs opengl [09:47] and factor in how game consoles slow api development [09:47] and the result would be some valuable information for graphics programmers now [09:48] graphics programming as a whole still seems like all black magic to most programmers [09:49] When I get the right job I am going to do all kinds of science stuff to help build up our computer knowledge [09:49] the discipline is so young and we have so much data. It is a unique opportunity [09:59] http://www.thenational.ae/news/world/africa/how-timbuktu-s-heritage-was-saved-in-rice-sacks-and-canoes [10:01] they need to scan that shit in [10:03] they talk about saving the books, and re-cataloging but what about digital [10:03] you want to save information. Take a high res picture of each page [10:03] well they had a building doing that but it's a giant target [10:03] I didn't see that in the article [10:04] the newly built south african one that got ransacked but luckily only had a small percentage of the collection [10:06] there's a little bit here but most of it is behind a registration wall http://www.tombouctoumanuscripts.org/db/ [10:09] I think everybody involved knows it needs digitizing but it's much easier said than done when dealing with thousands of fragile documents (no speed-scanning) and an african country with no money and difficult logistics [10:09] yeah I see the whole thing as a logistics problem not a people problem [10:09] we are here, they are way over thre [15:17] For the videos on archive.org, is there a way to tell which encode is the one that was originally uploaded and which are re-encodes? [15:19] For example, I'm looking at http://archive.org/details/Quicksand_clear and I don't want to download the 1 G MPEG2 if the 592 M Cineoack is the original. [15:26] Never mind. I found the answer in the files.xml [15:51] looks like i have 710.4gb of g4tv.com video so far [15:51] thats about the about of video i still have to upload [15:52] i moved the older stuff thats uploaded to another drive [16:42] hi folks, back from vacation- anything new and exciting happen over the past week? [16:44] dashcloud: ign is going down [16:44] g4tv is going down [16:44] the whole site? [16:44] that's a pretty damn big site [16:45] i got maybe over 800gb of videos from g4tv.com so far [16:46] good news is i may have almost all non hd videos from there [16:46] that's a lot of video [16:46] i have uploaded over 9000+ so far [16:47] i'm trying to grab the bigger hd videos [16:47] like pressconfs [16:47] podcast stuff [16:48] also guys i have the first 4 episodes of conan late night show from 1993 [16:49] there are also 3 tapes of conan from 1997 [16:49] on myspleen [17:34] so i found something interesting [17:36] flvhd videos for e3 2010 don't exist but for some reason microsoft had one [17:36] 2010 wasn't the year e3 skipped? [17:37] no [17:37] i have e3 2010 [17:37] just the hd videos don't exist [17:37] does e3 self host the videos? [17:37] but when i tryed grabing the flvhd for microsoft it was the same video non-hd flv [17:38] md5sum is the same [18:52] A sha256sum of 2tb worth of files took 15 hours to complete [18:53] finally done [20:41] http://dilbert.com/strips/comic/2013-02-24/ - sometimes the code wins [22:20] omf_: heh [23:49] http://www.epiclol.com/cdn/pictures/2012/06/monopoly-com-edition-circa-2000-weve-come-a-long-way-baby_1338542678_epiclolcom.jpg