Time |
Nickname |
Message |
00:00
π
|
|
killsushi has joined #archiveteam-bs |
00:05
π
|
|
LowLevelM has joined #archiveteam-bs |
00:06
π
|
|
astrid has joined #archiveteam-bs |
00:06
π
|
|
Fusl sets mode: +o astrid |
00:07
π
|
LowLevelM |
Ok Fusl, How do you get 5Tbps? I assume that is spread across many servers. right? |
00:07
π
|
Fusl |
500 servers each 10gbit |
00:07
π
|
LowLevelM |
500 servers. What? |
00:08
π
|
LowLevelM |
That must be soo expensive |
00:08
π
|
Fusl |
¯\_(ツ)_/¯ |
00:09
π
|
JAA |
-ot this? |
00:09
π
|
Fusl |
no more comments need to be made so no |
00:13
π
|
|
systwi has joined #archiveteam-bs |
00:20
π
|
arkiver |
Fusl: :O |
00:21
π
|
arkiver |
that is niiiiice |
00:54
π
|
|
qwebirc20 has quit IRC (Ping timeout: 261 seconds) |
00:54
π
|
|
LowLevelM has quit IRC (Ping timeout: 261 seconds) |
01:14
π
|
|
LowLevelM has joined #archiveteam-bs |
01:19
π
|
|
LowLevelM has quit IRC (Ping timeout: 260 seconds) |
01:38
π
|
|
LowLevelM has joined #archiveteam-bs |
01:38
π
|
|
HashbangI has quit IRC (Read error: Connection reset by peer) |
01:46
π
|
|
HashbangI has joined #archiveteam-bs |
02:31
π
|
|
BlueMax has joined #archiveteam-bs |
03:09
π
|
|
qw3rty118 has joined #archiveteam-bs |
03:15
π
|
|
qw3rty117 has quit IRC (Read error: Operation timed out) |
03:48
π
|
|
odemgi_ has joined #archiveteam-bs |
03:49
π
|
|
odemg has quit IRC (Read error: Operation timed out) |
03:50
π
|
|
odemgi has quit IRC (Read error: Operation timed out) |
04:04
π
|
|
odemg has joined #archiveteam-bs |
05:02
π
|
|
killsushi has quit IRC (Read error: Operation timed out) |
05:42
π
|
|
m007a83_ is now known as m007a83 |
06:19
π
|
|
LowLevelM has quit IRC (Ping timeout: 260 seconds) |
06:58
π
|
|
schbirid has joined #archiveteam-bs |
07:11
π
|
|
Atom has quit IRC (Ping timeout: 252 seconds) |
07:12
π
|
|
Atom has joined #archiveteam-bs |
07:53
π
|
SketchCow |
I'm now running a script to see how many of the mirrored youtube videos are missing from youtube. |
07:59
π
|
Fusl |
SketchCow: on a related note, have you noticed that youtube is now aggressively blocking/rate-limiting youtube-dl and similar downloaders such that mass mirroring of youtube channels is not easily possible anymore? |
08:01
π
|
Fusl |
https://torrentfreak.com/youtube-blocks-popular-mp3-stream-ripping-sites-190710/ |
08:02
π
|
Flashfire |
it wasn't as big a problem when they were blocking the sites, but now they're blocking the software itself |
08:27
π
|
|
MillerBOS has quit IRC (Read error: Connection reset by peer) |
08:27
π
|
|
pikami_ has quit IRC (Write error: Broken pipe) |
08:27
π
|
|
odemgi_ has quit IRC (Write error: Broken pipe) |
08:27
π
|
|
thejsa has quit IRC (Write error: Broken pipe) |
08:27
π
|
|
dashcloud has quit IRC (Write error: Broken pipe) |
08:27
π
|
|
m007a83_ has joined #archiveteam-bs |
08:27
π
|
|
benjinss has joined #archiveteam-bs |
08:27
π
|
|
odemgi_ has joined #archiveteam-bs |
08:27
π
|
|
benjinss has quit IRC (Read error: Connection reset by peer) |
08:27
π
|
|
MillerBOS has joined #archiveteam-bs |
08:28
π
|
|
thejsa has joined #archiveteam-bs |
08:28
π
|
|
dashcloud has joined #archiveteam-bs |
08:28
π
|
|
pikami has joined #archiveteam-bs |
08:29
π
|
|
benjinss has joined #archiveteam-bs |
08:33
π
|
|
stapler11 has quit IRC (Read error: Operation timed out) |
08:33
π
|
|
benjinsmi has quit IRC (Ping timeout: 604 seconds) |
08:33
π
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
08:34
π
|
|
stapler11 has joined #archiveteam-bs |
08:40
π
|
|
Igloo has quit IRC (Read error: Operation timed out) |
08:40
π
|
|
Igloo has joined #archiveteam-bs |
08:44
π
|
|
LeG0ax has joined #archiveteam-bs |
08:45
π
|
|
RichardG has quit IRC (Read error: Operation timed out) |
08:45
π
|
|
RichardG has joined #archiveteam-bs |
08:46
π
|
|
Ing3b0rg has quit IRC (Ping timeout: 506 seconds) |
08:46
π
|
|
LeG0ax is now known as Ing3b0rg |
08:47
π
|
|
nyany has quit IRC (Read error: Operation timed out) |
08:48
π
|
|
svchfoo3 has quit IRC (Ping timeout: 506 seconds) |
08:49
π
|
|
eientei95 has quit IRC (Ping timeout: 506 seconds) |
08:49
π
|
|
PurpleSym has quit IRC (Read error: Operation timed out) |
08:49
π
|
|
purplebot has quit IRC (Read error: Operation timed out) |
08:49
π
|
|
pikami has quit IRC (Ping timeout: 506 seconds) |
08:50
π
|
|
pikami has joined #archiveteam-bs |
08:50
π
|
|
PurpleSym has joined #archiveteam-bs |
08:51
π
|
|
eientei95 has joined #archiveteam-bs |
08:51
π
|
|
eientei95 has quit IRC (Handshake flooding) |
08:53
π
|
|
h3ndr1k_ has joined #archiveteam-bs |
08:53
π
|
|
eientei95 has joined #archiveteam-bs |
08:53
π
|
|
eientei95 has quit IRC (Handshake flooding) |
08:54
π
|
|
h3ndr1k has quit IRC (Ping timeout: 740 seconds) |
08:56
π
|
|
eientei95 has joined #archiveteam-bs |
09:00
π
|
|
h3ndr1k_ is now known as h3ndr1k |
09:43
π
|
|
nyany has joined #archiveteam-bs |
09:44
π
|
|
purplebot has joined #archiveteam-bs |
09:44
π
|
|
svchfoo3 has joined #archiveteam-bs |
09:44
π
|
|
Fusl sets mode: +o svchfoo3 |
10:10
π
|
|
betamax_ is now known as betamax |
10:22
π
|
|
deevious has joined #archiveteam-bs |
11:12
π
|
|
Raccoon has joined #archiveteam-bs |
11:28
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
11:46
π
|
Fusl_ |
JAA: where do you want the nratv stuff uploaded? |
12:39
π
|
Fusl_ |
fuzzy8021: you around? |
12:45
π
|
Fusl_ |
arkiver: fyi, i'm pulling flickr out of jrwr's storage now and soon doing the others as well so if you have anything still running that pulls data together from there, now is a good time to kill all of that |
12:46
π
|
JAA |
Fusl_: Ah, right, NRATV. So you have ~20k WARCs and ~20k video files, right? |
12:46
π
|
Fusl_ |
22188 |
12:47
π
|
JAA |
Probably best to coordinate this with IA. |
12:47
π
|
JAA |
We'll want the video files as items I assume, with the appropriate metadata. |
12:47
π
|
Fusl_ |
do we need JS for this or do you have contact with people at IA? |
12:48
π
|
JAA |
Not sure about the WARCs, either as they are or megawarcs I guess. |
12:48
π
|
Fusl |
they're currently not megawarced |
12:48
π
|
JAA |
Jason's probably the guy for that. I haven't spoken with anyone about that. |
12:48
π
|
Igloo |
What do you need from IA Fusl? |
12:48
π
|
Igloo |
I can go poke the slack. |
12:48
π
|
JAA |
We'll want an "NRATV" collection I think. |
12:49
π
|
Fusl |
ideally we want two i guess, one for the videos and one for the raw warc files that contains the videos |
12:50
π
|
JAA |
Yeah, "NRATV" for the videos and "NRATV WARCs" for the WARCs? |
12:50
π
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
12:50
π
|
Fusl |
whatever is fine for them |
12:51
π
|
JAA |
It would be even better if we could throw video and WARC in one item, but that doesn't work I think due to the mediatype. |
12:51
π
|
|
Mateon1 has joined #archiveteam-bs |
12:51
π
|
JAA |
Will have to extract the metadata also. I'll look into that later. |
12:59
π
|
Igloo |
If you don't get a response from JS or arkiver etc I can ping the slack when we know what we want. |
13:26
π
|
fuzzy8021 |
sup Fusl_ |
13:29
π
|
Fusl |
fuzzy8021: 95.216.12.47 is yours, right? |
13:29
π
|
fuzzy8021 |
yep |
13:30
π
|
fuzzy8021 |
do you need it? |
13:32
π
|
|
luckcolor has quit IRC (Ping timeout: 246 seconds) |
13:34
π
|
Fusl |
if you dont need it anymore, i'd like to take over the server into my hetzner account so you dont have to pay for it anymore |
13:37
π
|
fuzzy8021 |
sure why not. havent gotten around to using it yet |
14:36
π
|
arkiver |
Fusl: I don't have anything pulling from there |
14:36
π
|
arkiver |
and thanks for working on it! |
14:38
π
|
|
deevious has quit IRC (Quit: deevious) |
15:18
π
|
|
luckcolor has joined #archiveteam-bs |
15:22
π
|
SketchCow |
What up |
15:26
π
|
JAA |
SketchCow: 22k NRATV videos, each has a video file and a WARC (containing the playlist and all video segments) |
15:27
π
|
|
Verified_ has quit IRC (Ping timeout: 252 seconds) |
15:27
π
|
JAA |
Metadata isn't ready yet, but I think I have it somewhere. |
15:27
π
|
SketchCow |
OK... so we want to make a collection? OK. |
15:27
π
|
SketchCow |
Isn't some stuff up |
15:27
π
|
JAA |
Yeah |
15:29
π
|
SketchCow |
archiveteam_nratv now exists |
15:29
π
|
|
killsushi has joined #archiveteam-bs |
15:29
π
|
SketchCow |
Is there a consistency of naming of what's already up I can use to shove them in? |
15:45
π
|
JAA |
I don't think anything's uploaded yet. At least not from us. |
15:46
π
|
SketchCow |
OK, so just upload them, I'll shove them into the collection when you're ready. |
15:46
π
|
SketchCow |
Or someone can ping me with access requests |
15:46
π
|
SketchCow |
But I set it up and gave it an NRATV bio and whee |
15:46
π
|
JAA |
Fusl_: ^ (Or if you want me to do it, let me know.) |
15:46
π
|
SketchCow |
So I tried an experiment that failed |
15:46
π
|
SketchCow |
I want to take a Youtube iD and know if the video's gone or not. |
15:47
π
|
SketchCow |
I can't find a consistent way to check. |
15:47
π
|
SketchCow |
There MUST be something out there |
15:57
π
|
Fusl |
SketchCow: `test 200 == $(curl -sfo/dev/null -w '%{http_code}' "http://www.youtube.com/oembed?url=http://www.youtube.com/watch?v=${ID}")` |
15:58
π
|
SketchCow |
Damn, that's dense |
15:58
π
|
SketchCow |
Is that bash? |
15:58
π
|
Fusl |
aye |
16:03
π
|
SketchCow |
What are the possible outputs |
16:03
π
|
SketchCow |
Because for me it outputs blank |
16:03
π
|
Fusl |
it will give an exit value of either 0 or 1 |
16:03
π
|
Fusl |
so you can use it within an if-condition |
16:03
π
|
SketchCow |
Not here |
16:03
π
|
astrid |
or follow it with && echo $? |
16:04
π
|
Fusl |
; echo $? |
16:04
π
|
astrid |
er right |
16:04
π
|
Fusl |
&& echo $? would only print if it succeeds |
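A minimal sketch of the exit-status idea discussed above, assuming bash. The function names `status_is_ok` and `video_exists` are hypothetical and do not appear in the log; the curl flags and the oEmbed URL are the ones Fusl quoted.

```shell
#!/usr/bin/env bash
# Sketch only: turn an HTTP status code into an exit status usable in `if`.

# Hypothetical helper: exit 0 when the given status code is 200, exit 1 otherwise.
status_is_ok() {
  [ "$1" = "200" ]
}

# Hypothetical wrapper around the curl call quoted in the log.
# -w '%{http_code}' prints only the status code; the response body goes to /dev/null.
video_exists() {
  local code
  code=$(curl -sfo /dev/null -w '%{http_code}' \
    "http://www.youtube.com/oembed?url=http://www.youtube.com/watch?v=$1")
  status_is_ok "$code"
}
```

Because the result is an exit status and not printed text, running the check alone shows nothing. `status_is_ok 404; echo $?` prints `1` whether or not the test passed, while `status_is_ok 404 && echo $?` prints nothing, since `&&` only runs the second command after a success. Inside a script, `if video_exists "$ID"; then ...; fi` uses the exit status directly.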
16:04
π
|
SketchCow |
I don't want to seem ungrateful |
16:04
π
|
astrid |
computers. |
16:04
π
|
SketchCow |
But man, that's dense |
16:05
π
|
SketchCow |
Also, the whole endeavor is getting right into my face how much absolute horseshit people upload to the archive |
16:05
π
|
SketchCow |
Which is not a mood lightener |
16:06
π
|
SketchCow |
Oh, 5,000 hours of thai television..... thank you |
16:06
π
|
SketchCow |
Especially with the 100%, complete and utter lack of metadata |
16:06
π
|
SketchCow |
The robots after I'm dead will thank you |
16:07
π
|
SketchCow |
BOB=`test 200 == $(curl -sfo/dev/null -w '%{http_code}' "http://www.youtube.com/oembed?url=http://www.youtube.com/watch?v=${ID}");echo $?`;echo $BOB |
16:08
π
|
Raccoon |
start a streaming service that requires viewers to fill out metadata for you |
16:09
π
|
SketchCow |
for each in `ia search collection:archiveteam_youtube --itemlist`; do YT=`echo $each | sed 's/youtube-//g'`; FOF=`test 200 == $(curl -sfo/dev/null -w '%{http_code}' "http://www.youtube.com/oembed?url=http://www.youtube.com/watch?v=${ID}");echo $?`;echo "$FOF"; if [ "$FOF" = "1" ]; then echo "$YT exists."; else echo "Oh no.... $YT is gone gone gone!"; echo "$each" >> deads.txt; fi; |
16:09
π
|
SketchCow |
done |
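A note on the loop as quoted: it strips the item prefix into `YT` but interpolates `${ID}` in the URL, and it prints "exists" when the exit status is `1`, which is the failure case of `test 200 == ...`. Read literally, every item would be reported as existing. The following is a hedged sketch of the same check with those two points changed. The function names (`item_to_id`, `fetch_status`, `check_items`) and the `CHECK_DELAY` variable are hypothetical, and a non-200 answer may also mean rate limiting, as Igloo warns, so "looks gone" is a guess, not a verdict.

```shell
#!/usr/bin/env bash
# Sketch only: classify archived items by whether the YouTube ID still resolves.

# Strip the leading "youtube-" item prefix to recover the video ID.
item_to_id() {
  printf '%s\n' "${1#youtube-}"
}

# Hypothetical wrapper around the oEmbed check quoted in the log; prints the HTTP status code.
fetch_status() {
  curl -so /dev/null -w '%{http_code}' \
    "http://www.youtube.com/oembed?url=http://www.youtube.com/watch?v=$1"
}

# Read item names on stdin, print one line per item,
# and append items that did not return 200 to the file named in $1.
check_items() {
  local deads="$1" item id code
  while IFS= read -r item; do
    id=$(item_to_id "$item")
    code=$(fetch_status "$id")
    if [ "$code" = "200" ]; then
      echo "$id exists."
    else
      echo "$id looks gone (HTTP $code)."
      echo "$item" >> "$deads"
    fi
    # Pause between requests; CHECK_DELAY is a hypothetical knob, default 1 second.
    sleep "${CHECK_DELAY:-1}"
  done
}
```

With the item list command from the log, this could be driven as `ia search collection:archiveteam_youtube --itemlist | check_items deads.txt`.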
16:09
π
|
SketchCow |
What could possibly go wrong |
16:09
π
|
Igloo |
Be careful you don't get banned by YT |
16:09
π
|
Igloo |
Not sure how they're testing that. |
16:10
π
|
SketchCow |
Oh no |
16:10
π
|
SketchCow |
banned by YT |
16:10
π
|
SketchCow |
What will I do |
16:10
π
|
SketchCow |
How will I spend that free time |
16:10
π
|
Igloo |
It will break your script. |
16:11
π
|
Igloo |
That's all I was saying. |
16:11
π
|
Fusl |
for gods sake make JAA use mips for this! :P |
16:11
π
|
SketchCow |
0idOIGRrbHU exists. |
16:11
π
|
SketchCow |
1 |
16:11
π
|
SketchCow |
0ikhVJCblnk exists. |
16:11
π
|
SketchCow |
1 |
16:11
π
|
SketchCow |
0j6aV3YSue8 exists. |
16:12
π
|
SketchCow |
I'm mostly interested in seeing how many of these are actually missing |
16:12
π
|
SketchCow |
And how many are straight up mirrors |
16:12
π
|
arkiver |
I'm putting my money on 0.8% |
16:12
π
|
Fusl |
0.3% |
16:13
π
|
SketchCow |
I've only got to work from the de-indexed set |
16:13
π
|
SketchCow |
NON-de-indexed |
16:13
π
|
arkiver |
I'm putting my second money on Fusl being correct |
16:13
π
|
Fusl |
arkiver: thats not how it works :P |
16:13
π
|
arkiver |
:) |
16:13
π
|
SketchCow |
You bid one dollar over |
16:13
π
|
SketchCow |
And fuck them |
16:13
π
|
SketchCow |
(That's how the Price is Right works) |
16:14
π
|
Fusl |
im too young for this |
16:17
π
|
SketchCow |
By the way - so far none are missing. |
16:18
π
|
SketchCow |
I choose random youtube IDs to go make sure things are fine, and I have not been delighted at the video chosen to be mirrored. |
16:18
π
|
SketchCow |
Which tells me they're not choosing. They're mirroring almost random things |
16:19
π
|
arkiver |
mirroring whatever they find personally interesting |
16:19
π
|
SketchCow |
No, I don't think so |
16:19
π
|
SketchCow |
No, no. |
16:19
π
|
arkiver |
although there's exceptions among those people |
16:19
π
|
Igloo |
How many times have you been Rick Rolled? |
16:19
π
|
SketchCow |
Not when you mirror 15,000 videos |
16:19
π
|
SketchCow |
No, that's just high-spectrum grab-bag snowplowing through someone else's harddrives |
16:20
π
|
arkiver |
yeah true |
16:24
π
|
|
Raccoon has quit IRC (Ping timeout: 265 seconds) |
16:25
π
|
SketchCow |
Yeah, so far, zero percent down. |
16:25
π
|
SketchCow |
Waiting for my ban |
16:25
π
|
SketchCow |
DO IT |
16:25
π
|
SketchCow |
DOOOO IT |
16:25
π
|
schbirid |
i say 1.5% are gone |
16:41
π
|
SketchCow |
I say that before we're done, two will die, and one will be irrevocably changed |
17:39
π
|
|
Verified_ has joined #archiveteam-bs |
18:06
π
|
|
Ryz has joined #archiveteam-bs |
18:14
π
|
|
m007a83_ is now known as m007a83 |
19:57
π
|
betamax |
speaking of YouTube archiving, is ivan still the one running the GDrive-based archiver that only uploads videos once they're taken down? |
19:57
π
|
betamax |
or is that now someone else? |
20:00
π
|
|
icedice has joined #archiveteam-bs |
20:03
π
|
Igloo |
It was, but it's also been banned mostly. |
20:05
π
|
betamax |
ah, shame |
20:33
π
|
SketchCow |
With the caveat that we made this shit up on the spot, 0% of the URLs I had access to are not still in youtube. |
20:47
π
|
ivan_ |
SketchCow: I found that after a few years, ~8% of my YouTube was gone from YouTube |
20:47
π
|
ivan_ |
but I'm not a tubeupper |
20:49
π
|
|
stapler11 has quit IRC (Leaving) |
20:56
π
|
betamax |
Hypothetical question (asking here before I bother info@archive.org), anyone know if IAs system allows for items uploaded to one account to be transferred to another? |
20:57
π
|
Smiley |
so can we use Google Compute credit? |
21:00
π
|
SketchCow |
Tell me the circumstances this would happen |
21:02
π
|
Smiley |
guy I know goes "I have £283 worth of Google Compute Dealie credit though, if anyone can think of a use for it?" |
21:02
π
|
Smiley |
I'm not sure if we can use the warrior scripts on it, or something |
21:03
π
|
hook54321 |
PurpleSym: Awhile ago you asked if the Circavie archives exist anywhere, did you ever find them? |
21:06
π
|
Igloo |
Smiley: the outbound would be the issue |
21:06
π
|
ivan_ |
I ran grab-site on GCE trial credit and my servers and API project were removed with no warning |
21:06
π
|
Igloo |
and I think he was referring to the movement of items to another user |
21:06
π
|
Smiley |
ivan_: dafaq :/ |
21:20
π
|
astrid |
betamax: yes it can be done |
21:22
π
|
betamax |
good to know, thanks (don't want to waste time on impossible requests) |
21:29
π
|
SketchCow |
betamax: As I wrote: Tell me the circumstances this would happen |
21:33
π
|
betamax |
oh, sorry, thought you meant someone else |
21:33
π
|
SketchCow |
In general, we entertain all requests but it should be for a good reason. |
21:33
π
|
betamax |
basically I started writing scripts to mirror UK council webcasts (which are deleted after a set time) to IA, and initially used my personal IA account |
21:33
π
|
SketchCow |
If someone's trying to put one over, we'll suss it out. |
21:34
π
|
SketchCow |
But if you're able to prove you can log into both accounts, the effort is trivial. |
21:34
π
|
betamax |
now I realise there's so many that it would be better to have a dedicated account as all my other items on that account are getting buried |
21:34
π
|
SketchCow |
Yes. |
21:34
π
|
SketchCow |
What we would do is: |
21:34
π
|
betamax |
(this is currently hypothetical as I'm in the midst of re-writing the script and haven't made the second dedicated account yet) |
21:35
π
|
SketchCow |
- Mail your old account's mailing address saying "You requested we do this. Is this you?" |
21:35
π
|
SketchCow |
And you go yes. |
21:36
π
|
betamax |
great. It won't be for a few weeks (finishing scripts, updating VM to debian 10, etc...) but knowing it is possible is a big help |
22:01
π
|
|
LowLevelM has joined #archiveteam-bs |
22:06
π
|
|
LowLevelM has quit IRC (Ping timeout: 260 seconds) |
22:10
π
|
|
LowLevelM has joined #archiveteam-bs |
22:22
π
|
|
LowLevelM has quit IRC (Ping timeout: 260 seconds) |
22:23
π
|
SketchCow |
It is. |
22:23
π
|
SketchCow |
You can come to me. |
22:59
π
|
|
BlueMax has joined #archiveteam-bs |
23:01
π
|
|
LowLevelM has joined #archiveteam-bs |
23:04
π
|
|
schbirid has quit IRC (Remote host closed the connection) |
23:35
π
|
|
yano_ is now known as yano |
23:57
π
|
odemgi_ |
SketchCow, get this shit.... people think I'm you/you're me and that it's you that runs the-eye |