| Time |
Nickname |
Message |
|
05:16
|
godane |
can anyone download this from the wayback machine: https://web.archive.org/web/20070626122847/http://msnbc.vo.llnwd.net/e1/video/podcast/pdv_nn_netcast_m4v-08-01-2007-193947.m4v |
|
05:16
|
godane |
it's very troubling when the wayback machine says it has a video file but only gives like 33k of it |
|
05:18
|
godane |
cause if i can get those links working i can then give you guys about 3 months of nbc night news from 2007 |
|
05:18
|
underscor |
http://web.archive.org/cdx/search/cdx?url=http://msnbc.vo.llnwd.net/e1/video/podcast/pdv_nn_netcast_m4v-08-01-2007-193947.m4v |
|
05:18
|
underscor |
It only got 33k (27897 bytes) |
|
05:18
|
godane |
that sucks |
|
05:20
|
underscor |
http://web.archive.org/cdx/search/cdx?url=http://msnbc.vo.llnwd.net/* btw |
|
05:20
|
underscor |
note that's on a somewhat fragile app server and is also prone to breaking or going away |
|
05:21
|
underscor |
I think there's a max it will return per page, too |
|
05:21
|
underscor |
(though all of that one fits on one page) |
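A minimal sketch of building the CDX lookups pasted above, for anyone scripting them. The endpoint and the `url` parameter are taken from the links in the log; the helper name `cdx_query_url` is made up for illustration, and the optional `limit` parameter is an assumption about how a per-page cap might be requested, so check it against the server before relying on it.

```python
from typing import Optional
from urllib.parse import urlencode

# Endpoint as it appears in the links pasted in the log.
CDX_ENDPOINT = "http://web.archive.org/cdx/search/cdx"


def cdx_query_url(target: str, limit: Optional[int] = None) -> str:
    """Build a CDX query URL for one target URL or a trailing-* prefix.

    `limit` is an assumed parameter for capping the number of rows.
    """
    params = {"url": target}
    if limit is not None:
        params["limit"] = str(limit)
    # Keep ':', '/' and '*' unescaped so the result matches the pasted links.
    return CDX_ENDPOINT + "?" + urlencode(params, safe=":/*")


if __name__ == "__main__":
    # Everything captured under the host, as in the second link above.
    print(cdx_query_url("http://msnbc.vo.llnwd.net/*"))
```

The function only builds the URL. Fetching it is left out on purpose, since the server is described above as fragile.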
|
05:27
|
godane |
is there a file search for wayback machine urls |
|
05:28
|
godane |
not per a domain |
|
05:28
|
underscor |
not publicly |
|
05:28
|
underscor |
they're not indexed in a way to make that cheap |
|
05:29
|
underscor |
it basically runs as a map-reduce job on the global wayback index in hadoop, afaik |
|
05:30
|
underscor |
(our indexes are done by SURT, which is basically "reverse subdomain order") |
|
05:30
|
underscor |
so like foo.archive.org/bar.txt becomes org,archive,foo)/bar.txt |
|
05:31
|
underscor |
so we can efficiently look up something like "all org domains" or "all files on archive.org and all subdomains", etc |
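A rough sketch of the "reverse subdomain order" idea described above, showing why a sorted index of such keys makes per-domain lookups cheap. This is a simplified illustration, not the Wayback Machine's actual canonicalization: the function name `surt_like_key` is hypothetical, and it ignores schemes, ports, query strings and the other normalization a real index would apply.

```python
from urllib.parse import urlsplit


def surt_like_key(url: str) -> str:
    """Turn foo.archive.org/bar.txt into org,archive,foo)/bar.txt (simplified)."""
    if "://" not in url:
        url = "http://" + url  # urlsplit needs a scheme to find the host
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    reversed_host = ",".join(reversed(host.split(".")))
    path = parts.path or "/"
    return f"{reversed_host}){path}"


if __name__ == "__main__":
    urls = [
        "foo.archive.org/bar.txt",
        "example.com/index.html",
        "archive.org/about",
        "blog.archive.org/post",
    ]
    keys = sorted(surt_like_key(u) for u in urls)
    # In sorted order, archive.org and all of its subdomains sit next to
    # each other, so the lookup is a single contiguous range.
    for key in keys:
        if key.startswith("org,archive"):
            print(key)
```

Because the most significant part of the hostname comes first, "all org domains" or "archive.org and all subdomains" becomes a prefix scan. A search by file name across every domain has no such prefix, which is why it needs a full pass over the index.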
|
05:50
|
godane |
i'm going to check all vo.llnwd.net domains |
|
06:54
|
godane |
anyways i'm fixing a typo i did with cbsradio |
|
06:55
|
godane |
the creator for those items has a typo |
|
07:14
|
SketchCow |
Famicoman: All downloaded, now to inject into the archive. |
|
07:20
|
godane |
just know my fix for the cbs radio typo is going to create some fake cbs radio dates |
|
07:20
|
godane |
i will go through those and deindex them later |
|
07:29
|
godane |
i'm uploading one of my Best Computer Games issue dvds |
|
07:29
|
godane |
that will be about 16gb of data on 2 isos |
|
07:30
|
godane |
one is a video disc and another is the game files |
|
10:05
|
schbirid |
wasn't there a gnu tool to transpose (rotate the table so rows become columns and vice-versa) csv files? |
|
10:07
|
schbirid |
looked through coreutils, i guess i had "pr" in mind but that does not do it |
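Not the GNU tool being asked about, but a short Python sketch of the transpose itself, using only the standard library. The function name `transpose_csv` is made up here, and padding ragged rows with empty cells is an assumption about the desired behaviour.

```python
import csv
import io
from itertools import zip_longest


def transpose_csv(text: str) -> str:
    """Return CSV text with rows and columns swapped."""
    rows = list(csv.reader(io.StringIO(text)))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    # zip_longest pads short rows with "" so no cells are silently dropped.
    writer.writerows(zip_longest(*rows, fillvalue=""))
    return out.getvalue()


if __name__ == "__main__":
    print(transpose_csv("a,b,c\n1,2,3\n"), end="")
```

Going through the `csv` module rather than splitting on commas keeps quoted fields that contain commas intact.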
|
13:10
|
ersi |
Cool cool, so in the span of a week I've met both kennethreitz and SketchCow, without having to travel somewhere |
|
13:11
|
ersi |
Sketchy is out exploring Stockholm atm |
|
13:14
|
tephra |
ersi: you were at pycon? |
|
13:16
|
ersi |
Yeah, 'course |
|
13:16
|
tephra |
man, I was a volunteer there |
|
13:16
|
ersi |
I even helped organise it, slightly, like, totally minimally |
|
13:17
|
ersi |
oh, huh :O |
|
13:17
|
tephra |
:O |
|
13:17
|
tephra |
we must have met without knowing :P |
|
13:17
|
ersi |
Indeed |
|
13:18
|
ersi |
well, there were only like 260-290 attendees.. so we've *def* met.. but yeah :D |
|
13:18
|
tephra |
:P |
|
13:20
|
tephra |
should really try to get together sometime |
|
13:20
|
ersi |
indeed~ |
|
14:18
|
midas |
silly yahoo, one of my clients was sending spam: temporarily deferred due to user complaints - 4.16.55.1; see http://postmaster.yahoo.com/421-ts01.html |
|
14:18
|
midas |
only they removed the postmaster url. |
|
14:20
|
ersi |
yeah, they be silly |
|
14:22
|
midas |
they like to be silly |
|
15:28
|
godane |
uploaded: https://archive.org/details/dvdrom-lki-62 |
|
15:37
|
godane |
so looks like i can't get anything uploaded right now |
|
15:38
|
godane |
keep getting slow down errors |
|
15:42
|
SadDM |
godane: fwiw, I'm uploading right now without any problems. |
|
15:43
|
SadDM |
https://archive.org/details/Talislanta-wizard_hunter |
|
15:53
|
godane |
its working again |
|
15:54
|
godane |
nevermind |
|
15:54
|
godane |
it got to 100% with one then started to fail again |
|
15:55
|
godane |
now its saying 400 bad request |
|
16:34
|
godane |
uploaded: https://archive.org/search.php?query=creator%3A%22The+Midday%22 |
|
16:34
|
is4 |
Heh http://www.wjla.com/articles/2012/01/jason-scott-sentenced-to-100-years-71267.html @sketchcow |
|
18:58
|
godane |
SketchCow: there is going to be a Wisconsin Public Radio collection |
|
18:58
|
godane |
with sub-collection for each of the shows |
|
18:58
|
godane |
it will have to be that way so i can upload to it |
|
18:59
|
godane |
since i'm at that 30 collection limit or something |
|
19:46
|
godane |
The Midday collection so far: https://archive.org/search.php?query=collection%3Agodaneinbox%20AND%20subject%3A%22The%20Midday%22&sort=-date |
|
21:48
|
godane |
2013 of The Midday collection is getting uploaded |
|
21:49
|
godane |
i'm getting stuff done |
|
22:56
|
SketchCow |
Great. |
|
22:57
|
SketchCow |
All hail, met esri. |
|
22:59
|
yipdw |
esri or ersi? |
|
23:01
|
SketchCow |
ersi |
|
23:01
|
SketchCow |
I just woke up from a nap. |
|
23:01
|
SketchCow |
I did a lot of walking in Stockholm. |
|
23:01
|
SketchCow |
I mean, a lot. Miles and miles. |
|
23:01
|
SketchCow |
And I got my goddamn swedish meatballs in sweden |
|
23:01
|
SketchCow |
All I wanted |
|
23:04
|
dashcloud |
did you visit an Ikea as well? |
|
23:05
|
midas |
bought a arkhiv? |
|
23:05
|
godane |
i'm going after more global national |
|
23:15
|
godane |
SketchCow: do you know if TV Archive project saves Global News channel? |
|
23:15
|
godane |
i only ask cause my search for global national came up with nothing |
|
23:17
|
SketchCow |
For what it's worth, I don't know how important it is to get that over other things. |
|
23:18
|
SketchCow |
But I honestly don't know. underscor is in much better shape to answer. |
|
23:28
|
godane |
all i know is global news doesn't do a good job of keeping stuff |
|
23:29
|
godane |
i will be lucky to get stuff over a year old |
|
23:29
|
godane |
for example |
|
23:29
|
godane |
only 20 episodes of march 2014 global national episodes still work |
|
23:30
|
godane |
feb 2014 only has 11 episodes still working |
|
23:31
|
godane |
jan 2014 is also 11 |
|
23:32
|
godane |
what's more funny is that between 2013-09-26 and 2014-01-05 only 4 episodes are not working |
|
23:32
|
SketchCow |
Well go for it. |
|
23:35
|
godane |
also the stupid podcasts they release only go back 6 weeks |
|
23:35
|
godane |
so the streams are doing better but not by much |
|
23:46
|
SketchCow |
Achieved newsblur zero. |
|
23:46
|
SketchCow |
inbox zero is perhaps a bit too ambitious. |
|
23:47
|
garyrh |
Is such a thing possible?! |
|
23:51
|
SketchCow |
I had it once. |
|
23:51
|
Ravenloft |
a man can dream |
|
23:52
|
SketchCow |
https://www.youtube.com/watch?v=Ad9U3h2UmcA |