Time |
Nickname |
Message |
00:04
🔗
|
SketchCow |
Understood and known. |
00:04
🔗
|
SketchCow |
Hey, anyone got a server they can give me an account on to run firefox and an X server (small one) on? |
00:35
🔗
|
SN4T14 |
SketchCow, still need that server? |
00:45
🔗
|
SketchCow |
I'd like to play around on one, yes. |
00:46
🔗
|
SN4T14 |
ohhdemgir, are you there? |
09:16
🔗
|
midas |
https://blog.box.com/2014/06/box-acquiring-streem-bringing-the-cloud-to-your-desktop/ |
09:20
🔗
|
garyrh |
"Streem has been acquired by Box! We're creating an optional migration path so all of your data will be safe!" |
09:39
🔗
|
Nemo_bis |
"optional" and "all" don't get along well |
09:43
🔗
|
garyrh |
looks like streem has disabled all public videos. |
10:01
🔗
|
garyrh |
actually, the videos are still technically accessible |
10:01
🔗
|
garyrh |
e.g. https://streem.s3.amazonaws.com/objects/9b38131adb0b0c6e36830d8fbeeb3fb4/LinuxCon_and_CloudOpen_North_America_2013_-_Linux_Kernel_Panel.mp4 |
10:02
🔗
|
ohhdemgir |
SN4T14, you can give SketchCo1 an account on arc01 if you want |
11:30
🔗
|
* |
SketchCow jumps up and down at the counter |
11:32
🔗
|
BlueMax |
one million hard drives, sir? |
11:45
🔗
|
midas |
Ryan Kearney is delivering his 1PB drive |
14:14
🔗
|
SN4T14 |
SketchCow, hang on, let me get you an account. :p |
14:19
🔗
|
SN4T14 |
Although, you will have to set up the DE and everything yourself (or you can set up a VM and have the installer do it for you) |
15:28
🔗
|
Arkiver2 |
http://www.theguardian.com/technology/2014/jun/17/youtube-indie-labels-music-subscription |
15:33
🔗
|
db48x |
Arkiver2: fun |
15:34
🔗
|
db48x |
Arkiver2: how will we identify the videos that are likely to be taken down? |
15:34
🔗
|
Arkiver2 |
db48x: I have no idea |
15:34
🔗
|
Arkiver2 |
is it public with which music labels google made an agreement? |
15:34
🔗
|
Arkiver2 |
Does google release those knd of contracts? |
15:35
🔗
|
db48x |
I don't suppose there's a list of independant artists and their youtube channels... |
15:35
🔗
|
db48x |
hah, no |
15:38
🔗
|
Arkiver2 |
found some |
15:38
🔗
|
Arkiver2 |
Adele and Arctic Monkeys are two examples of artists that are going to be blocked |
15:38
🔗
|
Arkiver2 |
so let's do those at least |
15:39
🔗
|
db48x |
https://www.youtube.com/playlist?list=PL55DF5F0E7C2C2DD3 |
15:39
🔗
|
db48x |
https://www.youtube.com/playlist?list=PLV7t4yekvqhv9x2_P8Jvue6IvtH4LCm20 |
15:39
🔗
|
db48x |
https://www.youtube.com/user/notsignedtv |
15:40
🔗
|
ivan` |
prepare for blocked youtube content http://www.theguardian.com/technology/2014/jun/17/youtube-indie-labels-music-subscription |
15:40
🔗
|
Arkiver2 |
ivan`: we're discussing it already :P |
15:40
🔗
|
ivan` |
I missed it :) |
15:40
🔗
|
Arkiver2 |
lol |
15:40
🔗
|
Arkiver2 |
we're searching for indepenent artists |
15:41
🔗
|
db48x |
the best way I know of to download a youtube channel is to use http://www.jwz.org/hacks/youtubefeed.pl |
15:41
🔗
|
ivan` |
does that get you 1080p and 256kbit DASH? |
15:41
🔗
|
Arkiver2 |
db48x: but that doesn't create a warc with youtube vids right? |
15:41
🔗
|
db48x |
yea, it sorts by quality and grabs the best one |
15:41
🔗
|
db48x |
Arkiver2: nope |
15:42
🔗
|
db48x |
ivan`: of course the list of video types could easily be wrong or out of date, we'd want to double-check :) |
15:42
🔗
|
ivan` |
db48x: have you observed it download a 1080p video after 2013-10? |
15:42
🔗
|
db48x |
yes |
15:42
🔗
|
db48x |
well |
15:42
🔗
|
Arkiver2 |
db48x: I'll start crawls with heritrix on the channels of some artists so their pages of the videos are saved, BUT NOT THE VIDEOS THEMSELVES are in the warcs |
15:42
🔗
|
db48x |
in 2013 yes, dunno about after november actually... |
15:43
🔗
|
db48x |
ivan`: we could modify it to just grab all the offered videos, that would do the trick |
15:43
🔗
|
ivan` |
anyway this is how I use youtube-dl https://www.refheap.com/d97ee2660f3ebec52c8265f1e/raw |
15:43
🔗
|
Nemo_bis |
emijrp made me upload thousands videos with https://code.google.com/p/emijrp/source/browse/trunk/scrapers/youtube2internetarchive.py :) |
15:43
🔗
|
db48x |
already have to modify it not to skip videos older than 2 days |
15:45
🔗
|
db48x |
I think the article is probably a bit alarmist though |
15:46
🔗
|
db48x |
I can't see them banning every account where someone sat down with a guitar in front of a camera |
15:46
🔗
|
yipdw |
they'll ban you if you don't use your Real Name |
15:46
🔗
|
yipdw |
at least |
15:46
🔗
|
db48x |
on the other hand, they could just review all videos that seem to have music but that don't fall afoul of their stupid copyrighted-music detector |
15:47
🔗
|
yipdw |
Rumour Has It that that's the case |
15:47
🔗
|
yipdw |
I guess everyone will have to find Someone Like Youtube |
15:48
🔗
|
db48x |
still, let's engage our paranoia anyway |
15:51
🔗
|
joepie91_ |
Arkiver2, db48x: not a whole lot of "you" left in "youtube" |
15:51
🔗
|
db48x |
heh |
15:52
🔗
|
joepie91_ |
also |
15:52
🔗
|
joepie91_ |
[17:46] <db48x> I can't see them banning every account where someone sat down with a guitar in front of a camera |
15:52
🔗
|
joepie91_ |
you'd be amazed |
16:01
🔗
|
db48x |
yea, I couldn't imagine them finding them all, but then I remembered their copyrighted-music detector |
16:02
🔗
|
db48x |
anything that _doesn't_ get flagged by that but that doesn't look like speech is basically going to be independant music |
16:05
🔗
|
yipdw |
it's good to know that Youtube is no better than the labels |
16:05
🔗
|
yipdw |
creative destruction of expectations |
16:16
🔗
|
SN4T14 |
db48x, stop being silly and start using youtube-dl. :p |
16:20
🔗
|
db48x |
youtube-dl won't download a whole rss feed |
16:20
🔗
|
SN4T14 |
Why do you need RSS feeds? |
16:20
🔗
|
db48x |
so that I can go around finding and downloading whole channels |
16:21
🔗
|
db48x |
rather than individual videos |
16:21
🔗
|
Arkiver2 |
I have a windows program here that downloads videos in the best quality |
16:22
🔗
|
SN4T14 |
db48x, youtube-dl can download entire channels |
16:22
🔗
|
schbirid |
youtube-dl does that just fine afaik |
16:22
🔗
|
schbirid |
and playlists etc |
16:22
🔗
|
SN4T14 |
No need to mess around with RSS feeds when there's simpler ways. ;) |
16:23
🔗
|
db48x |
it'd be nice if the documentation mentioned that |
16:24
🔗
|
schbirid |
man youtube-dl :P |
16:25
🔗
|
SN4T14 |
^ |
16:25
🔗
|
db48x |
obviously that's no help if you haven't installed it because it looks like it won't do what you want :) |
16:25
🔗
|
schbirid |
heh true |
16:26
🔗
|
SN4T14 |
also https://www.google.is/search?q=youtube+dl+entire+channel |
16:26
🔗
|
SN4T14 |
;) |
16:26
🔗
|
SN4T14 |
Yes, that is an Icelandic Google link because I'm lazy. :p |
16:27
🔗
|
db48x |
you're supposed to use lmgtfy.com :) |
16:28
🔗
|
SN4T14 |
www.lmgtfy.com/?q=no |
16:28
🔗
|
SN4T14 |
:p |
16:28
🔗
|
db48x |
heh |
16:28
🔗
|
db48x |
should we build a list of channels on an etherpad or the wiki or something? |
16:29
🔗
|
SN4T14 |
piratepad! :D |
16:29
🔗
|
db48x |
:) |
16:29
🔗
|
schbirid |
wiki |
16:30
🔗
|
schbirid |
isnt it what its for? |
16:30
🔗
|
db48x |
wikis are great, but there's more round-trip time |
16:31
🔗
|
SN4T14 |
Make a wiki page for it, and link to the piratepad list. ;) |
16:31
🔗
|
joepie91_ |
db48x: or, y'know, the bottom of http://rg3.github.io/youtube-dl/supportedsites.html |
16:31
🔗
|
joepie91_ |
:P |
16:31
🔗
|
db48x |
http://piratepad.net/C2ioWiy8fG |
16:32
🔗
|
SN4T14 |
youtube-dl supports downloading archive.org, we should archive it! :D |
16:32
🔗
|
joepie91_ |
lol |
16:32
🔗
|
SN4T14 |
db48x, I liked your old name! :p |
16:33
🔗
|
db48x |
what was it? |
16:35
🔗
|
SN4T14 |
Add name here |
16:35
🔗
|
SN4T14 |
:p |
16:36
🔗
|
db48x |
ah |
16:40
🔗
|
db48x |
does freedb have information about labels in it? |
16:46
🔗
|
yipdw |
db48x: not sure, but MusicBrainz does |
16:46
🔗
|
yipdw |
db48x: and their data is freely available -> http://musicbrainz.org/doc/MusicBrainz_Database |
16:48
🔗
|
db48x |
nice |
16:48
🔗
|
db48x |
want to find all the artists with no label, then do some youtube searches? |
16:49
🔗
|
yipdw |
at some point, but it's also artists on non-participating labels |
16:49
🔗
|
yipdw |
e.g. for Adele you'd probably want to look up XL |
16:49
🔗
|
yipdw |
XL Recordings that is |
16:51
🔗
|
db48x |
yea |
16:53
🔗
|
yipdw |
the constitutents of the Worldwide Independent Network might be a good place to start for that list |
16:54
🔗
|
db48x |
good idea |
16:55
🔗
|
db48x |
not many actual artists in a youtube search for 'indenendant artist' |
16:56
🔗
|
db48x |
however I spell it |
16:56
🔗
|
db48x |
mostly interviews, promoters and consultants |
16:59
🔗
|
db48x |
this is good though: https://www.youtube.com/watch?v=Hbxy9xvpZ10&list=PLC32FEF51263DD92C |
16:59
🔗
|
SN4T14 |
db48x, it's spelled "independent" |
17:00
🔗
|
db48x |
yes, I spelled it correctly when I did the search |
18:37
🔗
|
SketchCow |
http://instagram.com/p/pWpuz6MxuB/ (Server decomissioned) |
18:39
🔗
|
SN4T14 |
That looks so fun. :D |
18:48
🔗
|
godane |
i think i surpass my old record in godaneinbox |
18:48
🔗
|
godane |
its at 18735 now |
18:51
🔗
|
DFJustin |
cripes |
18:55
🔗
|
midas |
so, about this RAWPORTER, did I miss anything? |
18:56
🔗
|
SketchCow |
I think I punched them |
18:56
🔗
|
SketchCow |
Then we sat |
18:56
🔗
|
SketchCow |
But if we can pull ANYTHING out of them, do it. |
18:57
🔗
|
SketchCow |
Chances might be it's not possible. |
18:57
🔗
|
SketchCow |
Might be limited release, but scan them |
18:57
🔗
|
midas |
someone did a scan already if im not mistaken |
18:58
🔗
|
midas |
or was that steem |
18:59
🔗
|
midas |
joepie91_: you did some rawporter work yesterday with the markers |
19:01
🔗
|
midas |
i think the s3 wasnt secured |
19:01
🔗
|
midas |
so we can grab all pictures and video's |
19:01
🔗
|
midas |
http://rawporter.s3.amazonaws.com/ |
19:03
🔗
|
SketchCow |
Well, do it. |
19:11
🔗
|
midas |
s3cmd du s3://rawporter |
19:11
🔗
|
midas |
WARNING: Retrying failed request: /?marker=thumbs/l_f5fnivczoddwq7.jpg (timed out) |
19:11
🔗
|
midas |
WARNING: Waiting 3 sec... |
19:11
🔗
|
midas |
78880173037 s3://rawporter/ |
19:11
🔗
|
midas |
well peeps? |
19:12
🔗
|
SN4T14 |
78GB? That's pretty small... |
19:12
🔗
|
midas |
thats what is on the s3 |
19:14
🔗
|
SN4T14 |
Weird |
19:18
🔗
|
joepie91_ |
:P |
19:18
🔗
|
joepie91_ |
just grab all of S3 |
19:19
🔗
|
midas |
meh, good point |
19:20
🔗
|
joepie91_ |
grab first, assess later |
19:21
🔗
|
yipdw |
that advice has also served me well on the North Side of Chicago |
19:21
🔗
|
joepie91_ |
yipdw: ? |
19:21
🔗
|
yipdw |
bad regional joke |
19:21
🔗
|
joepie91_ |
(also, do we have a way of grabbing an entire S3 bucket with WARC?) |
19:21
🔗
|
joepie91_ |
lol |
19:22
🔗
|
* |
joepie91_ is not from that region |
19:22
🔗
|
yipdw |
basically the north side and the rest of the North Shore area is unusually sexually active |
19:22
🔗
|
SN4T14 |
So basically sex apartheid |
19:23
🔗
|
yipdw |
nah |
19:23
🔗
|
yipdw |
we have real racism in Chicag |
19:23
🔗
|
yipdw |
o |
19:23
🔗
|
joepie91_ |
"unusually sexually active"? |
19:23
🔗
|
yipdw |
it's on the high end of the curve |
19:23
🔗
|
yipdw |
anyway |
19:24
🔗
|
yipdw |
I think I passed the -bs threshold on line 1 |
19:27
🔗
|
joepie91_ |
lol |
19:30
🔗
|
midas |
grabbing s3 now |
19:32
🔗
|
SN4T14 |
midas, according to my calculations, you're going to cost them $4-$9.5 in S3 costs from grabbing all of that. :p |
19:33
🔗
|
midas |
im going to grab it 20 times SN4T14 ;) |
19:35
🔗
|
schbirid |
earbits.com mp3 tars are incoming, grab them while you can: https://archive.org/search.php?query=subject%3A%22earbits.com%22%20mp3 |
19:35
🔗
|
SN4T14 |
while true; do curl https://rawporter.s3.amazonaws.com/AWOL/gallery.swf -o /dev/null; done |
19:35
🔗
|
SN4T14 |
:p |
19:35
🔗
|
SN4T14 |
Not sure if that's correct, I rarely use curl. :p |
19:36
🔗
|
midas |
s3cmd get s3://rawporter --recursive /hurr/durr |
19:36
🔗
|
SN4T14 |
while true; do s3cmd get s3://rawporter --recursive /dev/null &; done |
19:36
🔗
|
SN4T14 |
:D |
19:36
🔗
|
schbirid |
lol, open s3 is the new ID iteration |
19:37
🔗
|
SN4T14 |
ID iteration? |
19:37
🔗
|
midas |
open s3 is running around with your middlefingers in the air and screaming |
19:38
🔗
|
joepie91_ |
schbirid: nah, open S3 is much more efficient |
19:38
🔗
|
joepie91_ |
than ID iteration |
19:38
🔗
|
joepie91_ |
:P |
19:38
🔗
|
schbirid |
:> |
19:38
🔗
|
schbirid |
SN4T14: wget http:///www.internet.com/file?id=123 |
19:38
🔗
|
joepie91_ |
jesus wtf, 28KB/sec |
19:38
🔗
|
joepie91_ |
how congested is IA |
19:39
🔗
|
midas |
jeez joepie91_ |
19:39
🔗
|
midas |
are you on dialup? |
19:39
🔗
|
joepie91_ |
20kb/sec now, cancelled it |
19:39
🔗
|
joepie91_ |
midas: IA is, apparently |
19:40
🔗
|
schbirid |
not from here |
19:41
🔗
|
midas |
i |
19:41
🔗
|
midas |
i've seen worse according to the weathermap |
19:41
🔗
|
midas |
slowdown rate on s3 is also very low |
19:41
🔗
|
midas |
are you downloading or uploading joepie91_ |
19:41
🔗
|
midas |
? |
19:41
🔗
|
joepie91_ |
dfl |
19:41
🔗
|
joepie91_ |
dl * |
19:43
🔗
|
joepie91_ |
it's going over HE |
19:43
🔗
|
joepie91_ |
now brb |
19:49
🔗
|
Muad-Dib |
http://arstechnica.com/business/2014/06/artists-who-dont-sign-with-youtubes-new-subscription-service-to-be-blocked/ |
19:52
🔗
|
exmic |
yeah that's fucked up |
19:55
🔗
|
db48x |
Muad-Dib: we need someone to build a list of independent artists |
19:56
🔗
|
db48x |
yipdw suggested looking at the members of the Worldwide Independent Network |
20:40
🔗
|
Nemo_bis |
congrats midas and ohhdemgir :) https://archive.org/metamgr.php?f=histogram&group=uploader&w_collection=ftpsites |
20:40
🔗
|
SN4T14 |
"You must be logged in to access this service." >.> |
20:40
🔗
|
schbirid |
:) |
20:41
🔗
|
Nemo_bis |
and why aren't you logged in on archive.org, aren't you going after spam and support requests in forums etc. etc. |
20:42
🔗
|
db48x |
aww, I'm not authorized |
20:42
🔗
|
Nemo_bis |
oh, look, there is only one 2331388015 KB item https://archive.org/metamgr.php?f=histogram&group=size&w_collection=ftpsites |
20:42
🔗
|
Nemo_bis |
The second is 893892396 KB from another wikisourceror, I swear I didn't suggest him |
20:45
🔗
|
schbirid |
how can i see how much i uploaded? |
20:48
🔗
|
Nemo_bis |
schbirid: https://archive.org/metamgr.php?f=histogram&group=size&w_uploader=spirit@quaddicted.com but I'm not sure if there's a way to sum the first column |
20:49
🔗
|
schbirid |
oi, dont leak my mail address to irc please |
20:49
🔗
|
schbirid |
thanks |
20:49
🔗
|
exmic |
well, meatmgr |
20:49
🔗
|
Nemo_bis |
sorry, I had a doubt for a moment but then thought it's in all the xml files anyway :p |
20:49
🔗
|
exmic |
:P |
20:50
🔗
|
Nemo_bis |
I should have used mine as example |
20:50
🔗
|
schbirid |
yeah but those are for admins only |
20:50
🔗
|
schbirid |
while in this channel i maybe know 10% |
20:50
🔗
|
schbirid |
no biggie |
20:50
🔗
|
exmic |
what, the xml files? |
20:50
🔗
|
schbirid |
yeah |
20:51
🔗
|
Nemo_bis |
everyone can download them |
20:51
🔗
|
exmic |
hmm |
20:51
🔗
|
exmic |
that's what I thought, Nemo_bis |
20:51
🔗
|
schbirid |
its kinda crazy how much archive.org shows stupid admins like me :D |
20:51
🔗
|
schbirid |
oh? :( |
20:51
🔗
|
* |
db48x spams schbirid |
20:52
🔗
|
Nemo_bis |
they're not even hidden behind the "HTTPS"/download link in items like https://archive.org/details/wiki-wikiurbandeadcom |
20:54
🔗
|
garyrh |
joepie91, i've been downloading rawporter |
20:54
🔗
|
garyrh |
nearly done |
20:55
🔗
|
garyrh |
is anyone else downloading rawporter? |
20:56
🔗
|
SN4T14 |
I think midas was as well. |
20:58
🔗
|
midas |
just the s3 files |
20:58
🔗
|
midas |
6200 of 39K |
20:59
🔗
|
midas |
probably done in the morning |
21:00
🔗
|
joepie91_ |
schbirid: anybody can see uploader, yes |
21:02
🔗
|
Nemo_bis |
OTOH, http://blog.archive.org/2013/10/25/reader-privacy-at-the-internet-archive/ : almost nobody on the web is so good |
21:07
🔗
|
DFJustin |
I do wish there was an uploader privacy option for items though |
21:07
🔗
|
DFJustin |
other than registering a throwaway email |
21:08
🔗
|
midas |
DFJustin: darken it directly after uploading? |
21:08
🔗
|
midas |
altho, it wont be findable anymore |
21:09
🔗
|
exmic |
also won't be downloadable or anything |
21:24
🔗
|
Nemo_bis |
lol mistym |
21:25
🔗
|
Nemo_bis |
* midas |
21:51
🔗
|
ohhdemgir |
Nemo_bis, "User: ohhdemgirls is not authorized to access this service." |
21:52
🔗
|
Nemo_bis |
well, you're second with most items in ftpsites |
21:53
🔗
|
ohhdemgir |
i wanna see!! |
21:57
🔗
|
SN4T14 |
I think midas was as well. |
21:57
🔗
|
SN4T14 |
Whoops, this isn't Cygwin |
21:57
🔗
|
SN4T14 |
lol |
23:25
🔗
|
underscor |
schbirid: Nemo_bis: yeah, they're totally open in the current system |
23:25
🔗
|
underscor |
Much of the system is architected on that being the case but eventually we want to move to a different user ID |
23:25
🔗
|
underscor |
as manpower allows |