#archiveteam-bs 2013-03-10,Sun

โ†‘back Search

Time Nickname Message
00:47 ๐Ÿ”— balrog_ SketchCow: not quite AT-related, but netflix just killed their API: http://developer.netflix.com/blog/read/Changes_to_the_Public_API_Program
00:48 ๐Ÿ”— balrog_ see: "The changes, outlined below, are designed to allow us to focus our API efforts on supporting the products and features used most by our members. They are also designed to allow us to continue to offer the public API program in a way that aligns with our goals." ...then ... "We will no longer issue new public API developer keys." ... "We will no longer accept new API affiliates."
00:59 ๐Ÿ”— ersi that sucks
01:19 ๐Ÿ”— omf_ These companies think they are helping themselves
01:19 ๐Ÿ”— omf_ someone is probably already working on a way to exploit netflix not having this feature anymore into a money making possibility
01:20 ๐Ÿ”— Smiley paid public api's!
01:20 ๐Ÿ”— omf_ that too
01:20 ๐Ÿ”— omf_ there are many I would pay to have brought back
01:22 ๐Ÿ”— omf_ why don't they think people would pay
01:22 ๐Ÿ”— Smiley because "pirates!!!!"
01:22 ๐Ÿ”— Smiley or something
01:22 ๐Ÿ”— omf_ :)
01:22 ๐Ÿ”— Smiley you know
01:22 ๐Ÿ”— Smiley it's not like the music industry is recovering or anything
01:23 ๐Ÿ”— omf_ I know, a bunch of bullshit so a fucking manager can act all powerful
01:23 ๐Ÿ”— Smiley It's not like linux users pay more for humble bundles than anyone else...
01:23 ๐Ÿ”— omf_ My word is LAW!!!1!
01:23 ๐Ÿ”— Smiley omf_: ah yes
01:23 ๐Ÿ”— Smiley like my office
01:23 ๐Ÿ”— Smiley "you must be in the office".
01:23 ๐Ÿ”— Smiley ..... even when we are doing a full reneveation and there isn't actually any room to sit down?
01:23 ๐Ÿ”— Smiley "Yes!"
01:23 ๐Ÿ”— Smiley ok, I'll come in, and not get any work done all day. Cool, thanks.
01:24 ๐Ÿ”— omf_ Managers who do that are worthless. They think that is how to do a good job
01:24 ๐Ÿ”— Smiley oh and we might ask you to come in at the weekend to help with moving stuff
01:24 ๐Ÿ”— omf_ Smiley, I had to come in at night a few times and help install servers and shit
01:24 ๐Ÿ”— Smiley me: heh, ok good luck with that
01:24 ๐Ÿ”— Smiley concidering I do what is basically 13hour shifts due to trains etc because you _won't_ let me work from home sometimes.... lets see how well that goes down.
01:24 ๐Ÿ”— Smiley omf_: occasionally is fine
01:24 ๐Ÿ”— omf_ we had a sys admin staff of 5 and yet programmers still had to come in and help
01:25 ๐Ÿ”— omf_ but not every single time
01:25 ๐Ÿ”— Smiley Could you of done those servers some other time if someone had let you?
01:26 ๐Ÿ”— omf_ nope. Scheduled maintainence windows were at night only
01:27 ๐Ÿ”— omf_ Hardcore bureaucracy
01:30 ๐Ÿ”— Smiley fun times.
01:30 ๐Ÿ”— Smiley we just don't do that
01:30 ๐Ÿ”— Smiley small enough to just laugh most of the time
04:08 ๐Ÿ”— godane so i have to redownload the video pages again
04:08 ๐Ÿ”— godane did something thing stupid
04:09 ๐Ÿ”— godane tryed to open a text file when selecting a everything
04:09 ๐Ÿ”— godane including the warc.gz
04:10 ๐Ÿ”— godane so if you guys don't get this sorry
06:54 ๐Ÿ”— omf_ Just saw the newest Star Trek trailer. I still have no clue what the plot is for that movie
10:09 ๐Ÿ”— godane so i'm near the end of the images ids of g4
10:10 ๐Ÿ”— godane only about 40k ids to check to go
10:33 ๐Ÿ”— godane looks like my item for commodore 64 training tape is hot right now
10:34 ๐Ÿ”— godane its number 6 download in computersandtechvideos collection
10:41 ๐Ÿ”— omf_ I just uploaded a grab of the twit cleaner site
11:50 ๐Ÿ”— omf_ So an opinion/experience question
11:51 ๐Ÿ”— omf_ How much working space do you set aside for data projects?
11:51 ๐Ÿ”— omf_ I am currently setting aside 2tb and it is not enough
11:55 ๐Ÿ”— omf_ I just stacked up a few hundred gigs of archiveteam stuff and it takes forever to upload
11:58 ๐Ÿ”— Schbirid you can get 1TB servers from ovh at your local kimsufi for <20รขย‚ยฌ / month
12:01 ๐Ÿ”— omf_ I am looking at their site now
12:40 ๐Ÿ”— ersi http://googleresearch.blogspot.com/2013/03/learning-from-big-data-40-million.html
13:25 ๐Ÿ”— omf_ ersi, thanks for that article
13:29 ๐Ÿ”— omf_ aaaawww they don't talk about how they tie this information into freebase
13:29 ๐Ÿ”— omf_ It was mentioned in a freebase talk from 2 years ago, they were trying it out then
13:30 ๐Ÿ”— omf_ disambiguation is hard, but the work google has done benefits everyone
13:30 ๐Ÿ”— omf_ the data is in freebase which is CC-BY so everyone has it now. It is also cross referenced against the universal rdf ids
13:31 ๐Ÿ”— omf_ so more complex relations can be formed and not worry about stale data sources as the project continues
13:34 ๐Ÿ”— omf_ hell the schema system for freebase is a master class in taxonomy
13:34 ๐Ÿ”— ersi RIP MetaWeb
13:35 ๐Ÿ”— omf_ everything they did except graphd is still open and updated by google. I think metaweb is resting nicely
13:35 ๐Ÿ”— omf_ I always tell people about metaweb so they realize google didn't start this, they bought it
13:35 ๐Ÿ”— ersi Indeed
13:36 ๐Ÿ”— omf_ And people pissed on metaweb's head until google bought them
13:36 ๐Ÿ”— omf_ the common sentiment "It cannot be done"
13:36 ๐Ÿ”— ersi Heh, Pitbull - Back In Time is a fitting tune to listen to while chatting in AT channels
13:36 ๐Ÿ”— ersi yeah
13:36 ๐Ÿ”— ersi which is quite unfortunate in itself
13:36 ๐Ÿ”— omf_ People never recognize greatness
13:37 ๐Ÿ”— omf_ semantic meaning on the web is now possible because of the decade of work metaweb did
13:37 ๐Ÿ”— omf_ 10 fucking years but it is worth it
13:38 ๐Ÿ”— omf_ I could take freebase, tie it against the wayback machine, do a sample set of learning and then be able to do facebook style graph searches against the past of the web
13:39 ๐Ÿ”— omf_ I never understood why google doesn't factor time into search
13:39 ๐Ÿ”— omf_ for a large majority of their searchable content they have reliable date information.
13:40 ๐Ÿ”— omf_ They could make billions just selling analytics off that
13:40 ๐Ÿ”— omf_ how a product or brand changes over time and map out the growth over the internet
13:41 ๐Ÿ”— omf_ then factor in they have incoming and outgoing link data on these sites. They could rank these people and then compare against a human tested corpus and bam new information
13:42 ๐Ÿ”— omf_ For us mere mortals the major problem is the fact that a server cluster to run this costs a small fortune
13:43 ๐Ÿ”— ersi I'd like to know how much data they keep. Especially web history.
13:44 ๐Ÿ”— omf_ The n-gram data google released even as large as it is, is still manageable on a single server
14:34 ๐Ÿ”— omf_ http://www.osnews.com/story/26849/Google_called_the_MPEG-LA_s_bluff_and_won
18:28 ๐Ÿ”— godane so i have save a few missing clips
18:28 ๐Ÿ”— godane found there file names cause alot of images use the video name for image files
18:40 ๐Ÿ”— godane so i may try to grab the tech guy audio archive
18:59 ๐Ÿ”— godane i have uploaded over 15k videos to g4video-web collection
19:03 ๐Ÿ”— godane PresserTestH261_G4750.swf: http://archive.org/details/g4tv.com-video35810
19:04 ๐Ÿ”— godane its a ces presser test video
19:04 ๐Ÿ”— godane also PresserTestH261_G4750.swf is the desc

irclogger-viewer