Time |
Nickname |
Message |
00:07
🔗
|
|
trc_ has joined #archiveteam-bs |
00:07
🔗
|
|
Arcorann has joined #archiveteam-bs |
00:07
🔗
|
|
trc has quit IRC (Read error: Connection reset by peer) |
00:08
🔗
|
|
trc__ has joined #archiveteam-bs |
00:10
🔗
|
|
trc_ has quit IRC (Read error: Connection reset by peer) |
02:32
🔗
|
|
Raccoon has quit IRC (Ping timeout: 265 seconds) |
03:14
🔗
|
|
Meli has quit IRC (Ping timeout: 272 seconds) |
03:14
🔗
|
|
sHATNER has quit IRC (Ping timeout: 272 seconds) |
03:15
🔗
|
|
Meli has joined #archiveteam-bs |
03:15
🔗
|
|
actually_ has quit IRC (Ping timeout: 272 seconds) |
03:15
🔗
|
|
obskyr has joined #archiveteam-bs |
03:16
🔗
|
|
brayden has quit IRC (Ping timeout: 272 seconds) |
03:16
🔗
|
|
Laverne has quit IRC (Ping timeout: 272 seconds) |
03:17
🔗
|
|
Terbium has quit IRC (Ping timeout: 272 seconds) |
03:17
🔗
|
|
Terbium has joined #archiveteam-bs |
03:40
🔗
|
|
HP_Archiv has joined #archiveteam-bs |
03:41
🔗
|
|
qw3rty__ has joined #archiveteam-bs |
03:42
🔗
|
|
ephemer0l has quit IRC (Read error: Connection reset by peer) |
03:48
🔗
|
|
qw3rty_ has quit IRC (Read error: Operation timed out) |
03:48
🔗
|
|
HP_Archiv has quit IRC (Quit: Leaving) |
04:13
🔗
|
|
sHATNER has joined #archiveteam-bs |
04:15
🔗
|
lennier1 |
Donald Trump says he's banning TikTok in the USA. From what I've read, some people doubt he has the authority to do this, so I'd expect a court battle if he follows through with an executive order. https://www.cnn.com/2020/07/31/tech/tiktok-trump-bytedance-sale/index.html |
04:19
🔗
|
|
Laverne has joined #archiveteam-bs |
04:19
🔗
|
|
brayden has joined #archiveteam-bs |
04:55
🔗
|
superkuh |
Dictator envy. |
05:45
🔗
|
|
ephemer0l has joined #archiveteam-bs |
06:08
🔗
|
|
Raccoon has joined #archiveteam-bs |
08:14
🔗
|
|
jshoard has joined #archiveteam-bs |
08:28
🔗
|
|
Craigle has quit IRC (Quit: The Lounge - https://thelounge.chat) |
08:44
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
08:44
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
08:45
🔗
|
|
DLoader_ has joined #archiveteam-bs |
08:47
🔗
|
|
jshoard_ has joined #archiveteam-bs |
08:47
🔗
|
|
jshoard has quit IRC (Read error: Connection reset by peer) |
08:56
🔗
|
|
DLoader has quit IRC (Ping timeout: 745 seconds) |
08:56
🔗
|
|
DLoader_ is now known as DLoader |
09:02
🔗
|
LowLevelM |
From what I understand now, he is doing it for the same reason he banned huawei. To prevent the Chinese from spying on us citizens. |
09:18
🔗
|
|
Craigle has joined #archiveteam-bs |
10:07
🔗
|
endrift |
that's the cover story at least |
10:08
🔗
|
endrift |
whether or not that's ACTUALLY why he's doing it... |
10:08
🔗
|
endrift |
that's a different matter |
10:11
🔗
|
|
jshoard__ has joined #archiveteam-bs |
10:11
🔗
|
|
jshoard_ has quit IRC (Read error: Connection reset by peer) |
10:14
🔗
|
LowLevelM |
hopefully archiveteam is not full of lefties, because if that's true, I may leave |
10:15
🔗
|
|
jshoard has joined #archiveteam-bs |
10:19
🔗
|
|
jshoard__ has quit IRC (Read error: Operation timed out) |
10:25
🔗
|
|
jshoard has quit IRC (Read error: Operation timed out) |
10:43
🔗
|
Kaz |
is that like |
10:43
🔗
|
Kaz |
a threat ot? |
10:52
🔗
|
Tugboat |
LowLevelM: don't worry, everyone is basically a communist |
11:09
🔗
|
endrift |
Offended that I implied your fuehrer-wannabe might be lying about yet another thing? |
11:10
🔗
|
endrift |
If that is a thorn enough in your thin skin to get you to leave, then that's fine by me |
11:28
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
12:40
🔗
|
|
kiska has joined #archiveteam-bs |
12:40
🔗
|
|
kiska2 has joined #archiveteam-bs |
12:40
🔗
|
|
kiska2 has quit IRC (Client Quit) |
12:40
🔗
|
|
kiska has quit IRC (Client Quit) |
12:41
🔗
|
|
kiska has joined #archiveteam-bs |
12:41
🔗
|
|
kiska2 has joined #archiveteam-bs |
12:48
🔗
|
|
LowLevelM has quit IRC (The Lounge - https://thelounge.chat) |
13:39
🔗
|
|
trc_ has joined #archiveteam-bs |
13:40
🔗
|
|
trc__ has quit IRC (Read error: Connection reset by peer) |
13:43
🔗
|
|
Nikchemny has joined #archiveteam-bs |
13:50
🔗
|
|
Nikchemny has quit IRC (Ping timeout: 252 seconds) |
14:21
🔗
|
|
systwi_ has joined #archiveteam-bs |
14:21
🔗
|
|
Nikchemny has joined #archiveteam-bs |
14:25
🔗
|
Nikchemny |
nico_32_: Is there a progress? |
14:29
🔗
|
|
systwi has quit IRC (Ping timeout: 622 seconds) |
14:55
🔗
|
Nikchemny |
OpenWayback by the Library of Congress isn'y ok: https://webarchive.loc.gov/all/*/lib.ru |
15:02
🔗
|
|
VADemon has joined #archiveteam-bs |
15:13
🔗
|
|
Arcorann has quit IRC (Read error: Connection reset by peer) |
15:40
🔗
|
|
kiska has quit IRC (The Lounge - https://thelounge.chat) |
15:40
🔗
|
|
kiska has joined #archiveteam-bs |
16:17
🔗
|
|
Nikchemny has quit IRC (Quit: Page closed) |
16:56
🔗
|
|
trc_ has quit IRC (Quit: Goodbye) |
17:01
🔗
|
|
Nikchemny has joined #archiveteam-bs |
17:10
🔗
|
|
wyatt8740 has quit IRC (Read error: Operation timed out) |
17:11
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
17:25
🔗
|
|
Larsenv has quit IRC (Quit: ZNC 1.8.0 - https://znc.in) |
17:27
🔗
|
Nikchemny |
nico_32_: http://wiki.laser.ru/index.php/%D0%9A%D0%B0%D1%82%D0%B0%D0%BB%D0%BE%D0%B3_wiki-%D1%81%D0%B0%D0%B9%D1%82%D0%BE%D0%B2 - another list of Russian wikis |
17:35
🔗
|
|
Larsenv has joined #archiveteam-bs |
17:36
🔗
|
|
jshoard has joined #archiveteam-bs |
17:47
🔗
|
|
Nikchemny has quit IRC (Quit: Page closed) |
17:58
🔗
|
|
Aoede has quit IRC (Quit: ZNC - https://znc.in) |
18:06
🔗
|
|
Aoede has joined #archiveteam-bs |
18:41
🔗
|
|
balrog has quit IRC (Bye) |
18:41
🔗
|
|
balrog has joined #archiveteam-bs |
19:09
🔗
|
|
slyphic has joined #archiveteam-bs |
20:10
🔗
|
|
ephemer0l has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) |
20:13
🔗
|
|
ephemer0l has joined #archiveteam-bs |
21:11
🔗
|
SketchCow |
06:14 < LowLevelM> hopefully archiveteam is not full of lefties, because if that's true, I may leave |
21:11
🔗
|
SketchCow |
<smiles in marx> |
21:15
🔗
|
SketchCow |
https://twitter.com/textfiles/status/1289670466399948800 |
21:25
🔗
|
|
jshoard_ has joined #archiveteam-bs |
21:30
🔗
|
|
jshoard has quit IRC (Read error: Operation timed out) |
21:44
🔗
|
JAA |
lol |
21:48
🔗
|
SketchCow |
JAA you sent me a huge "we should do a thing" letter |
21:48
🔗
|
SketchCow |
And I am actually at an impasse with it |
21:48
🔗
|
SketchCow |
Because my standard procedure is to throw these sorts of requests at JAA |
21:49
🔗
|
|
jshoard_ has quit IRC (Read error: Operation timed out) |
21:50
🔗
|
JAA |
SketchCow: Hm? I don't remember sending you anything recently. |
21:52
🔗
|
JAA |
On another note, Clutch is the third major(ish) game clip hosting site to shut down within less than a year after Plays.tv and Mixer. What's going on there? |
21:52
🔗
|
JAA |
SketchCow: Oh, are you confusing me with jrwr again? |
21:53
🔗
|
SketchCow |
This was about Tigris |
21:53
🔗
|
JAA |
Ah, that. |
21:53
🔗
|
SketchCow |
Not THIS time! |
21:53
🔗
|
JAA |
Heh :-) |
21:53
🔗
|
SketchCow |
Anyway, it's easier now |
21:53
🔗
|
SketchCow |
jrwr is the one with the hat |
21:54
🔗
|
JAA |
I mean... https://commons.wikimedia.org/wiki/File:Jason_Scott_(2017_Portrait).jpg |
21:54
🔗
|
SketchCow |
Anyway, I never knew what to do there |
21:54
🔗
|
SketchCow |
I mean identification for MY purposes |
21:54
🔗
|
SketchCow |
https://i1.wp.com/www.safer-computing.com/wp-content/uploads/2019/03/hatchan.jpg |
21:57
🔗
|
lennier1 |
Now that game streaming is such a big business, I guess it's not surprising a bunch of companies would try--and mostly fail--to compete with Twitch. |
21:58
🔗
|
JAA |
Yeah, I didn't really know either to be honest. I guess I thought you might know some people (since CollabNet/SVN is quite well-known) or launch a Twitter shitstorm or something else to somehow get them to reconsider their early shutdown. In the end though, the site did return for a few days after the original shutdown date for some reason, and I believe I got virtually everything from it. |
21:58
🔗
|
JAA |
Ah yes, that hat. :-) |
22:01
🔗
|
|
wyatt8740 has quit IRC (Read error: Operation timed out) |
22:02
🔗
|
JAA |
lennier1: Yeah, true. Plays.TV and Mixer launched in 2015/16, Clutch is apparently more recent. Just a bit strange that they fail so shortly after each other. Probably just a coincidence though. |
22:02
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
22:02
🔗
|
lennier1 |
FWIW, fourzerofour estimated that clutch.win is only about 1TB of video. https://www.reddit.com/r/Archiveteam/comments/i1wep6/clutchwin_gameplay_videos_is_shutting_down_august/ |
22:04
🔗
|
JAA |
That's surprisingly small. |
22:05
🔗
|
JAA |
The videos seem to be on Fastly, so that should be fast. |
22:06
🔗
|
lennier1 |
Yeah, not bad at all assuming the math is right. |
22:07
🔗
|
Jake4 |
That seems to be wrong? "about 180K videos" the front page "games" with clip numbers seems to add up to over 3,168,000 |
22:09
🔗
|
JAA |
Yeah, that seems way more realistic. |
22:09
🔗
|
JAA |
Fortnite alone has 2.1M clips. |
22:10
🔗
|
Jake4 |
(More around 3.8-4M based on the more games tab. with his estimate of 5mb per clip, something much bigger than a few TB) |
22:11
🔗
|
JAA |
Yeah, assuming 4M clips and their numbers otherwise, that suggests ~20 TB. |
22:12
🔗
|
JAA |
Still not too bad actually. |
22:12
🔗
|
Jake4 |
Agreed, not too bad. |
22:14
🔗
|
|
DopefishJ has quit IRC (Remote host closed the connection) |
22:25
🔗
|
SketchCow |
I got a lovely contact from a group of people who did a big console save |
22:25
🔗
|
SketchCow |
And they're using wayback for some of it and felt bad |
22:25
🔗
|
SketchCow |
I said not to feel bad |
22:38
🔗
|
|
ndiddy has joined #archiveteam-bs |
22:41
🔗
|
|
DFJustin has joined #archiveteam-bs |
22:52
🔗
|
JAA |
According to the API, there are 4026695 clips on Clutch currently. |
22:53
🔗
|
Jake4 |
20 something TB if the original 5mb per clip is correct then? |
22:54
🔗
|
JAA |
Somewhere around that, yeah. |
22:55
🔗
|
JAA |
I'll gather all the clip slugs through the API. |
22:56
🔗
|
JAA |
Or rather, they call them "posts". |
23:09
🔗
|
OrIdow6 |
JAA: Looks like the API is fairly rich in metadata (e.g. durations and video URLs, obviously useful for estimates), so maybe get the full thing if you can |
23:10
🔗
|
OrIdow6 |
Unless you're using a less detailed one than the popular list |
23:11
🔗
|
lennier1 |
Is every post a video? |
23:11
🔗
|
JAA |
OrIdow6: Nope, that's the API I'm using, though not the popular one. |
23:11
🔗
|
JAA |
Going after recent, games list, etc. |
23:11
🔗
|
JAA |
And yes, I'm getting the "entire" API, more or less. |
23:12
🔗
|
JAA |
lennier1: Yeah, "post" is just the internal name for clips. |
23:13
🔗
|
lennier1 |
Cool. The reddit post said there were a lot of non-video pages as well. Not sure what those are. Images? Text? |
23:15
🔗
|
JAA |
Huh |
23:16
🔗
|
JAA |
I haven't seen anything non-video so far at least, but I haven't gone very deep yet. |
23:18
🔗
|
Ryz |
Welp, the ArchiveBot job https://clutch.win/ ( bxf3kqpjaozbxf92xsvfegjs6 ) has videos accessible and being downloaded Oo; |
23:19
🔗
|
OrIdow6 |
JAA: Good |
23:20
🔗
|
JAA |
Ryz: Yeah, I expected that. The video URLs are in a JSON blob in the page. |
23:26
🔗
|
JAA |
Clips are available in three versions apparently: high resolution, high resolution with watermark (= "Clutch" + username), and low resolution. |
23:27
🔗
|
JAA |
They call these high_quality_video_url, watermark_video_url, and video_url, respectively. |
23:28
🔗
|
OrIdow6 |
For a video site, this has a very clean structure; wouldn't be surprised if it played back alright |
23:30
🔗
|
Ryz |
!ig dow85s79nfu00rlg6tozid6sm ^https?://www\.start\.co\.il\:6789/ |
23:30
🔗
|
Ryz |
Oops |
23:31
🔗
|
JAA |
Uhm... Their Fastly thing is actually an S3 bucket, and it's publicly listable. lol |
23:33
🔗
|
OrIdow6 |
How did you figure that out? |
23:34
🔗
|
JAA |
<magic.gif> |
23:34
🔗
|
OrIdow6 |
Headers on https://ftw.global.ssl.fastly.net/media/videos/uploads/78/78b5/78b5e970f6cc3a2a73abb1081a179f454c8990f3.mp4 look like Google Cloud storage through Fastly |
23:34
🔗
|
JAA |
Well yeah, maybe not AWS S3, but something S3-like. |
23:35
🔗
|
OrIdow6 |
Oh |
23:35
🔗
|
JAA |
https://ftw.global.ssl.fastly.net/ |
23:36
🔗
|
JAA |
Listing that now. |
23:37
🔗
|
JAA |
Can't get the post slugs or IDs from that, I think, but at least it gives us a very good size estimate. |
23:40
🔗
|
OrIdow6 |
Doesn't look like it, unless there's a databse dump or something like that hidden there |
23:40
🔗
|
OrIdow6 |
Video names are just sha1 of content |
23:44
🔗
|
JAA |
Right, and the internal DB IDs don't use the hash sadly. |
23:44
🔗
|
|
Gallifrey has quit IRC (Read error: Connection reset by peer) |
23:45
🔗
|
JAA |
Also, apparently the watermark_video_url is only added a bit after the upload. |
23:45
🔗
|
|
HP_Archiv has joined #archiveteam-bs |
23:46
🔗
|
|
acridAxid has joined #archiveteam-bs |
23:50
🔗
|
|
Gallifrey has joined #archiveteam-bs |
23:51
🔗
|
|
Gallifrey has quit IRC (Read error: Connection reset by peer) |
23:51
🔗
|
|
chirlu has joined #archiveteam-bs |
23:53
🔗
|
|
BlueMax has joined #archiveteam-bs |
23:55
🔗
|
|
Gallifrey has joined #archiveteam-bs |
23:58
🔗
|
JAA |
This will need some more work, I'll continue tomorrow. |
23:59
🔗
|
|
Gallifrey has quit IRC (Read error: Connection reset by peer) |