[05:16] can anyone download this from the wayback machine: https://web.archive.org/web/20070626122847/http://msnbc.vo.llnwd.net/e1/video/podcast/pdv_nn_netcast_m4v-08-01-2007-193947.m4v
[05:16] it's very troubling when the wayback machine says it has a video file but only gives like 33k of it
[05:18] because if i can get those links working i can then give you guys about 3 months of nbc night news from 2007
[05:18] http://web.archive.org/cdx/search/cdx?url=http://msnbc.vo.llnwd.net/e1/video/podcast/pdv_nn_netcast_m4v-08-01-2007-193947.m4v
[05:18] It only got 33k (27897 bytes)
[05:18] that sucks
[05:20] http://web.archive.org/cdx/search/cdx?url=http://msnbc.vo.llnwd.net/* btw
[05:20] note that's on a somewhat fragile app server and is also prone to breaking or going away
[05:21] I think there's a max it will return per page, too
[05:21] (though all of that one fits on one page)
[05:27] is there a file search for wayback machine urls
[05:28] not per domain
[05:28] not publicly
[05:28] they're not indexed in a way to make that cheap
[05:29] it basically runs as a map-reduce job on the global wayback index in hadoop, afaik
[05:30] (our indexes are done by SURT, which is basically "reverse subdomain order")
[05:30] so like foo.archive.org/bar.txt becomes org,archive,foo)/bar.txt
[05:31] so we can efficiently look up something like "all org domains" or "all files on archive.org and all subdomains", etc
[05:50] i'm going to check all vo.llnwd.net domains
[06:54] anyways i'm fixing a typo i made with cbsradio
[06:55] the creator for those items has a typo
[07:14] Famicoman: All downloaded, now to inject into the archive.
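The reverse-subdomain key ordering described at [05:30] (usually written SURT) can be sketched roughly as below. This is a simplified illustration, not the Wayback Machine's actual canonicalizer: the function names `surt_key` and `keys_under` are made up for this example, and real SURT handling also deals with ports, `www` prefixes and other normalization that is skipped here.

```python
from bisect import bisect_left
from urllib.parse import urlsplit


def surt_key(url: str) -> str:
    """Turn a URL into a simplified reverse-subdomain sort key."""
    # urlsplit only recognises the host when a scheme or leading "//" is present.
    if "://" not in url:
        url = "//" + url
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    # foo.archive.org -> org,archive,foo
    reversed_host = ",".join(reversed(host.split(".")))
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    return f"{reversed_host}){path}"


def keys_under(sorted_keys: list[str], domain: str) -> list[str]:
    """Return keys for `domain` and all its subdomains from a sorted key list."""
    base = ",".join(reversed(domain.lower().split(".")))
    # Keys for the domain itself start with base + ")", keys for subdomains
    # start with base + ",". In ASCII, ")" < "," < "-", so both groups sit in
    # one contiguous slice that ends just before base + "-".
    lo = bisect_left(sorted_keys, base + ")")
    hi = bisect_left(sorted_keys, base + "-")
    return sorted_keys[lo:hi]


if __name__ == "__main__":
    urls = [
        "foo.archive.org/bar.txt",
        "archive.org/",
        "archiveteam.org/x",
        "example.com/",
    ]
    keys = sorted(surt_key(u) for u in urls)
    print(surt_key("foo.archive.org/bar.txt"))
    print(keys_under(keys, "archive.org"))
```

Because the host labels are reversed, everything under one domain ends up adjacent in the sorted index, so "all files on archive.org and all subdomains" is a binary search plus one slice. Searching by file name across every domain gets no such help from the sort order, which fits the comment at [05:28] about that lookup not being cheap.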
[07:20] just know my fix for the cbs radio typo is going to create some fake cbs radio dates
[07:20] i will go through those and deindex them later
[07:29] i'm uploading one of my Best Computer Games issue dvds
[07:29] that will be about 16gb of data on 2 isos
[07:30] one is a video disc and the other is the game files
[10:05] wasn't there a gnu tool to transpose (rotate the table so rows become columns and vice-versa) csv files?
[10:07] looked through coreutils, i guess i had "pr" in mind but that does not do it
[13:10] Cool cool, so in the span of a week I've met both kennethreitz and SketchCow, without having to travel somewhere
[13:11] Sketchy is out exploring Stockholm atm
[13:14] ersi: you were at pycon?
[13:16] Yeah, 'course
[13:16] man, I was a volunteer there
[13:16] I even helped organise it, slightly, like, totally minimally
[13:17] oh, huh :O
[13:17] :O
[13:17] we must have met without knowing :P
[13:17] Indeed
[13:18] well, there were only like 260-290 attendees.. so we've *def* met.. but yeah :D
[13:18] :P
[13:20] should really try to get together sometime
[13:20] indeed~
[14:18] silly yahoo, one of my clients was sending spam: temporarily deferred due to user complaints - 4.16.55.1; see http://postmaster.yahoo.com/421-ts01.html
[14:18] only they removed the postmaster url.
[14:20] yeah, they be silly
[14:22] they like to be silly
[15:28] uploaded: https://archive.org/details/dvdrom-lki-62
[15:37] so looks like i can't get anything uploaded right now
[15:38] keep getting slow down errors
[15:42] godane: fwiw, I'm uploading right now without any problems.
[15:43] https://archive.org/details/Talislanta-wizard_hunter
[15:53] it's working again
[15:54] nevermind
[15:54] it got to 100% with one then started to fail again
[15:55] now it's saying 400 bad request
[16:34] uploaded: https://archive.org/search.php?query=creator%3A%22The+Midday%22
[16:34] Heh http://www.wjla.com/articles/2012/01/jason-scott-sentenced-to-100-years-71267.html @sketchcow
[18:58] SketchCow: there is going to be a Wisconsin Public Radio collection
[18:58] with a sub-collection for each of the shows
[18:58] it will have to be that way so i can upload to it
[18:59] since i'm at that 30 collection limit or something
[19:46] The Midday collection so far: https://archive.org/search.php?query=collection%3Agodaneinbox%20AND%20subject%3A%22The%20Midday%22&sort=-date
[21:48] 2013 of The Midday collection is getting uploaded
[21:49] i'm getting stuff done
[22:56] Great.
[22:57] All hail, met esri.
[22:59] esri or ersi?
[23:01] ersi
[23:01] I just woke up from a nap.
[23:01] I did a lot of walking in Stockholm.
[23:01] I mean, a lot. Miles and miles.
[23:01] And I got my goddamn Swedish meatballs in Sweden
[23:01] All I wanted
[23:04] did you visit an Ikea as well?
[23:05] bought an arkhiv?
[23:05] i'm going after more global national
[23:15] SketchCow: do you know if the TV Archive project saves the Global News channel?
[23:15] i only ask because my search for global national came up with nothing
[23:17] For what it's worth, I don't know how important it is to get that over other things.
[23:18] But I honestly don't know. underscor is in much better shape to answer.
[23:28] all i know is global news doesn't do a good job of keeping stuff
[23:29] i will be lucky to get stuff over a year old
[23:29] for example
[23:29] only 20 episodes of march 2014 global national episodes still work
[23:30] feb 2014 only has 11 episodes still working
[23:31] jan 2014 is also 11
[23:32] what's more funny is that between 2013-09-26 and 2014-01-05 only 4 episodes are not working
[23:32] Well go for it.
[23:35] also the stupid podcasts they release only go back 6 weeks
[23:35] so the streams are doing better but not by much
[23:46] Achieved newsblur zero.
[23:46] inbox zero is perhaps a bit too ambitious.
[23:47] Is such a thing possible?!
[23:51] I had it once.
[23:51] a man can dream
[23:52] https://www.youtube.com/watch?v=Ad9U3h2UmcA