[00:06] *** godane has joined #archiveteam-bs
[00:42] *** schbirid has quit IRC (Read error: Operation timed out)
[00:52] *** schbirid has joined #archiveteam-bs
[01:38] *** useretail has joined #archiveteam-bs
[01:42] *** primus104 has quit IRC (Leaving.)
[02:17] *** schbirid has quit IRC (Read error: Operation timed out)
[02:28] http://www.pcworld.com/article/2863878/microsofts-reported-spartan-browser-will-be-lighter-more-flexible-than-internet-explorer.html
[02:52] *** mistym has quit IRC (Remote host closed the connection)
[04:32] *** snowman_ has joined #archiveteam-bs
[05:01] *** snowman_ has quit IRC (Excess Flood)
[05:29] *** aaaaaaaaa has quit IRC (Leaving)
[06:09] well here's to hoping MS doesn't screw spartan up
[06:09] otherwise they may as well label it IE12 if it's a piece of shite
[07:12] *** brayden has quit IRC (Ping timeout: 606 seconds)
[07:18] *** brayden has joined #archiveteam-bs
[07:19] antomatic: there are some projects for mass subtitling; the biggest problem they face is not lack of user contributions, but rather copyright issues
[07:19] that is, "I have to distribute this video to be able to let people add subtitles... but I can't"
[07:20] [08:19] antomatic: there are some projects for mass subtitling; the biggest problem they face is not lack of user contributions, but rather copyright issues
[07:20] [08:19] that is, "I have to distribute this video to be able to let people add subtitles... but I can't"
[07:20] (not sure if those arrived, client derp)
[07:20] also, funny sidenote: in one of the talks at 31c3, the speaker was having great fun with the automated mistranslation :)
[07:20] er
[07:20] missubtitling*
[07:21] let me see what one it was...
[07:22] joepie91: Need some kind of "Take the video from there and overlay subtitles from somewhere else" player perhaps. ;)
[07:22] antomatic: technically very hard, if not impossible
[07:23] problem is not so much presenting finished subtitles
[07:23] as it is the subtitle development environment
[07:23] antomatic: it was dotsub!
[07:23] hrm
[07:23] http://dotsub.com/
[07:24] it appears to have gotten considerably more commercial..
[07:24] yikes, I see what you mean
[07:24] example is http://dotsub.com/view/aed3b8b2-1889-4df5-ae63-ad85f5572f27
[07:25] http://dotsub.com/tutorials still shows some traces from the pre-enterprise times..
[07:26] All comes back to that fundamental problem of initial transcription.
[07:26] This just does it with the cloud and 'crowdsourcing' (i.e. lots of people paid almost nothing)
[07:27] and ultimately because they want to get paid, most videos just don't get captioned at all.
[07:27] Amara is similar, although volunteery.
[07:27] (.org)
[07:27] * joepie91 wonders why he still sees buffering on YT despite 100mbps FttH
[07:28] antomatic: meh-ish
[07:28] run by non-profit, but doesn't seem all that non-commercial..
[07:29] which reminds me
[07:29] yeah, it's very much "but come through this door for our Pro service"
[07:29] I should try and do a talk on the importance of non-commercialness for next 31c3 or something
[07:29] er
[07:29] next c3 *
[07:29] that'd be 32c3 :P
[07:29] :)
[07:29] antomatic: that's pretty much the impression I get
[07:31] some essential things have to become free to be sustainable, if the alternative is that nobody will pay for them
[07:31] hm
[07:31] it's more complicated than that
[07:31] it's not so much about price as it is about priorities
[07:32] a commercial thing can never grow to its full potential, because there are always profit-related considerations at stake
[07:32] some aspect of a service or thing may be important or really useful or really innovative, but if it doesn't increase or at the very least maintain the profit margin, it likely won't ever be implemented
[07:32] yes.
[07:32] this happens in non-profit organizations too
[07:38] Need some incredible milspec-or-better voice recognition technology - better than the best of what's currently commercially available - or a way to massively parallelise the transcription effort in a way that requires no skill (and elicits no feeling of 'work') to the humans involved
[07:38] e.g. audio captcha-style stuff
[07:41] and even then...
[07:41] ugh. this should be a solved problem by now.
[07:42] oh well. one day.
[07:48] antomatic: cut up videos into spoken text segments (this is not very hard, basic audio processing stuff), then present them as a game?
[07:49] hell, make it a 'typing speed battle' game
[07:49] but hey, copyright
[08:03] ignore copyright
[08:29] Ctrl-S: works until you do organized efforts with a single point of failure
[08:29] ie. "get too big"
[08:29] *** primus104 has joined #archiveteam-bs
[09:08] *** mistym has joined #archiveteam-bs
[09:45] *** brayden has quit IRC (Ping timeout: 606 seconds)
[10:42] *** mistym has quit IRC (Remote host closed the connection)
[11:05] *** BlueMaxim has quit IRC (Quit: Leaving)
[11:17] *** wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES)
[11:23] *** wp494 has joined #archiveteam-bs
[14:00] *** Incurza has joined #archiveteam-bs
[14:02] SketchCow:: i'm starting to upload marshill audio collection
[14:03] 31c3: https://pbs.twimg.com/media/B6QtkYQCEAECnxi.jpg:large
[14:05] SketchCow: good news is i think i got every sermons in audio
[14:05] going back to 2000
[14:16] looks like i can get chosun libo english urls going back to 1997
[14:16] http://english.chosun.com/svc/list.html?pn=9109
[14:17] its the english pages of a south korean daily news site
[14:19] *** schbirid has joined #archiveteam-bs
[16:32] *** aaaaaaaaa has joined #archiveteam-bs
[16:52] *** SN4T14_ has joined #archiveteam-bs
[16:55] *** Gfy has quit IRC (Ping timeout: 265 seconds)
[16:57] *** Gfy has joined #archiveteam-bs
[16:57] *** SN4T14 has quit IRC (Ping timeout: 369 seconds)
[17:10] *** godane has quit IRC (Read error: Connection reset by peer)
[17:13] *** godane has joined #archiveteam-bs
[17:14] *** x3hc230 has joined #archiveteam-bs
[17:15] hai
[17:15] ho
[17:15] seems a bit odd to talk here first but
[17:16] what...is the topic about?
[17:16] that AT shouldn't save things and right to be forgotten etc etc?
[17:16] nah
[17:16] that AT is archiving things
[17:17] even if assholes are deleting their services
[17:17] ...against the wish of their users?
[17:17] eg http://ourincrediblejourney.tumblr.com/
[17:17] no
[17:17] we dont care about that
[17:18] "if it's public, it's archived" ?
[17:18] (not familiar with your philosophy as a team, genuine Q)
[17:19] different people have different philosophies
[17:19] but "if it's public, it's archived" seems a good description of most
[17:20] i don't see how (technically) it could be otherwise
[17:20] sometimes sites cooperate and give us more access
[17:20] x3hc230: turns out that people being aware of their shit being archived somehow makes them much more angry than it being done silently
[17:20] sometimes we find more
[17:20] sure tumblr.com might go poof one day and most of the data gone, but i'm pretty sure there are other ATs out there working for other purposes than historical
[17:21] joepie91, the person i spoke to you about earlier was very very EXTREMELY happy that the site was saved
[17:21] x3hc230: sure, and they're not the only one - but there's an equal amount of people who get disproportionately offended :)
[17:22] but that just comes with the territory, I suppose
[17:22] i would have suggested offering a "forget me" option, but how the hell do you verify who is who
[17:22] that already exists in the form of an abusemail address
[17:22] barely anybody ever uses it
[17:22] eps when the host is non-coop / aggressive
[17:22] esp
[17:23] and/or robots.txt
[17:24] x3hc230: you should watch some of Jason Scott's talks, they're highly amusing and relevant
[17:33] https://www.youtube.com/watch?v=vp2eG0TQubg ??
[17:42] *** lytv has quit IRC (Read error: Operation timed out)
[17:43] *** lytv has joined #archiveteam-bs
[17:43] *** schbirid has quit IRC (Read error: Operation timed out)
[17:46] .t
[17:46] Thu, 01 Jan 2015 17:46:26 GMT
[17:46] .title
[17:46] joepie91: Jason Scott On Selling Your Home - YouTube
[17:46] wat
[17:46] ... no
[17:46] lol
[17:46] x3hc230: http://anarchivism.org/w/Jason_Scott_Talks
[17:47] *** lytv has quit IRC (Client Quit)
[17:50] *** schbirid has joined #archiveteam-bs
[17:59] *** lytv has joined #archiveteam-bs
[18:00] so looks like kbs news930 started to have sign language person on every episode since August 2006 it looks like
[18:04] *** sep332 has quit IRC (bye)
[18:20] looks like jibs pid=21 changed from general news name to 820 news name
[18:20] http://www.jibstv.com/tv/vod_list.asp?pid=21&page=37
[18:21] its only 820 news from the start 2014
[18:23] it 'could' be called Synthesis News
[18:23] if you break this up: 종합뉴스
[18:23] *** xtr-201 has joined #archiveteam-bs
[18:23] 종 = Species
[18:24] 합 = Sum
[18:24] 뉴 = New
[18:25] 스 = Switch
[18:25] this is translated with google
[18:27] i'm most likely going to call it general news 820
[18:43] *** x3hc230 has quit IRC (Read error: Connection reset by peer)
[18:46] *** x3hc23 has joined #archiveteam-bs
[18:53] so i got a clip of barry the dinosaur in south korea
[18:53] its date is 2006-12-18
[19:11] *** balrog has quit IRC (Read error: Operation timed out)
[19:17] *** x3hc230 has joined #archiveteam-bs
[19:17] *** x3hc23 has quit IRC (Read error: Connection reset by peer)
[19:27] *** x3hc23 has joined #archiveteam-bs
[19:27] *** x3hc230 has quit IRC (Read error: Connection reset by peer)
[19:35] *** mistym has joined #archiveteam-bs
[20:13] *** BlueMaxim has joined #archiveteam-bs
[20:26] *** dashcloud has quit IRC (No Ping reply in 180 seconds.)
[20:27] *** dashcloud has joined #archiveteam-bs
[20:33] *** Jonimus has quit IRC (Quit: WeeChat 1.0.1)
[21:02] *** Jonimus has joined #archiveteam-bs
[21:31] *** schbirid has quit IRC (Leaving)
[21:54] *** garyrh has quit IRC (Read error: Operation timed out)
[22:03] *** wp494 has quit IRC (hub.se efnet.portlane.se)
[22:03] *** jk[SVP] has quit IRC (hub.se efnet.portlane.se)
[22:04] *** jk[[SVP]] has joined #archiveteam-bs
[22:04] *** wp494 has joined #archiveteam-bs
[22:07] *** garyrh has joined #archiveteam-bs
[22:18] *** jk[[SVP]] is now known as jk[SVP]
[22:40] *** dashcloud has quit IRC (Read error: Operation timed out)
[22:49] *** dashcloud has joined #archiveteam-bs
[22:51] *** x3hc23 has quit IRC (Read error: Operation timed out)
[22:57] *** dashcloud has quit IRC (Read error: Operation timed out)
[22:58] *** dashcloud has joined #archiveteam-bs
[23:05] i'm at 328k items uploaded
[23:05] 329k based on this link: https://archive.org/metamgr.php?&w_uploader=slaxemulator@gmail.com&mode=more
[23:28] jeepers creepers, guy
[23:35] its also 39tb of data
[23:49] What is the record for people who are not archive.org staff?
[23:54] *** mistym has quit IRC (Remote host closed the connection)