[00:07] *** trc_ has joined #archiveteam-bs [00:07] *** Arcorann has joined #archiveteam-bs [00:07] *** trc has quit IRC (Read error: Connection reset by peer) [00:08] *** trc__ has joined #archiveteam-bs [00:10] *** trc_ has quit IRC (Read error: Connection reset by peer) [02:32] *** Raccoon has quit IRC (Ping timeout: 265 seconds) [03:14] *** Meli has quit IRC (Ping timeout: 272 seconds) [03:14] *** sHATNER has quit IRC (Ping timeout: 272 seconds) [03:15] *** Meli has joined #archiveteam-bs [03:15] *** actually_ has quit IRC (Ping timeout: 272 seconds) [03:15] *** obskyr has joined #archiveteam-bs [03:16] *** brayden has quit IRC (Ping timeout: 272 seconds) [03:16] *** Laverne has quit IRC (Ping timeout: 272 seconds) [03:17] *** Terbium has quit IRC (Ping timeout: 272 seconds) [03:17] *** Terbium has joined #archiveteam-bs [03:40] *** HP_Archiv has joined #archiveteam-bs [03:41] *** qw3rty__ has joined #archiveteam-bs [03:42] *** ephemer0l has quit IRC (Read error: Connection reset by peer) [03:48] *** qw3rty_ has quit IRC (Read error: Operation timed out) [03:48] *** HP_Archiv has quit IRC (Quit: Leaving) [04:13] *** sHATNER has joined #archiveteam-bs [04:15] Donald Trump says he's banning TikTok in the USA. From what I've read, some people doubt he has the authority to do this, so I'd expect a court battle if he follows through with an executive order. https://www.cnn.com/2020/07/31/tech/tiktok-trump-bytedance-sale/index.html [04:19] *** Laverne has joined #archiveteam-bs [04:19] *** brayden has joined #archiveteam-bs [04:55] Dictator envy. [05:45] *** ephemer0l has joined #archiveteam-bs [06:08] *** Raccoon has joined #archiveteam-bs [08:14] *** jshoard has joined #archiveteam-bs [08:28] *** Craigle has quit IRC (Quit: The Lounge - https://thelounge.chat) [08:44] *** BlueMaxim has joined #archiveteam-bs [08:44] *** BlueMax has quit IRC (Read error: Connection reset by peer) [08:45] *** DLoader_ has joined #archiveteam-bs [08:47] *** jshoard_ has joined #archiveteam-bs [08:47] *** jshoard has quit IRC (Read error: Connection reset by peer) [08:56] *** DLoader has quit IRC (Ping timeout: 745 seconds) [08:56] *** DLoader_ is now known as DLoader [09:02] From what I understand now, he is doing it for the same reason he banned huawei. To prevent the Chinese from spying on us citizens. [09:18] *** Craigle has joined #archiveteam-bs [10:07] that's the cover story at least [10:08] whether or not that's ACTUALLY why he's doing it... [10:08] that's a different matter [10:11] *** jshoard__ has joined #archiveteam-bs [10:11] *** jshoard_ has quit IRC (Read error: Connection reset by peer) [10:14] hopefully archiveteam is not full of lefties, because if that's true, I may leave [10:15] *** jshoard has joined #archiveteam-bs [10:19] *** jshoard__ has quit IRC (Read error: Operation timed out) [10:25] *** jshoard has quit IRC (Read error: Operation timed out) [10:43] is that like [10:43] a threat ot? [10:52] LowLevelM: don't worry, everyone is basically a communist [11:09] Offended that I implied your fuehrer-wannabe might be lying about yet another thing? [11:10] If that is a thorn enough in your thin skin to get you to leave, then that's fine by me [11:28] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [12:40] *** kiska has joined #archiveteam-bs [12:40] *** kiska2 has joined #archiveteam-bs [12:40] *** kiska2 has quit IRC (Client Quit) [12:40] *** kiska has quit IRC (Client Quit) [12:41] *** kiska has joined #archiveteam-bs [12:41] *** kiska2 has joined #archiveteam-bs [12:48] *** LowLevelM has quit IRC (The Lounge - https://thelounge.chat) [13:39] *** trc_ has joined #archiveteam-bs [13:40] *** trc__ has quit IRC (Read error: Connection reset by peer) [13:43] *** Nikchemny has joined #archiveteam-bs [13:50] *** Nikchemny has quit IRC (Ping timeout: 252 seconds) [14:21] *** systwi_ has joined #archiveteam-bs [14:21] *** Nikchemny has joined #archiveteam-bs [14:25] nico_32_: Is there a progress? [14:29] *** systwi has quit IRC (Ping timeout: 622 seconds) [14:55] OpenWayback by the Library of Congress isn'y ok: https://webarchive.loc.gov/all/*/lib.ru [15:02] *** VADemon has joined #archiveteam-bs [15:13] *** Arcorann has quit IRC (Read error: Connection reset by peer) [15:40] *** kiska has quit IRC (The Lounge - https://thelounge.chat) [15:40] *** kiska has joined #archiveteam-bs [16:17] *** Nikchemny has quit IRC (Quit: Page closed) [16:56] *** trc_ has quit IRC (Quit: Goodbye) [17:01] *** Nikchemny has joined #archiveteam-bs [17:10] *** wyatt8740 has quit IRC (Read error: Operation timed out) [17:11] *** wyatt8740 has joined #archiveteam-bs [17:25] *** Larsenv has quit IRC (Quit: ZNC 1.8.0 - https://znc.in) [17:27] nico_32_: http://wiki.laser.ru/index.php/%D0%9A%D0%B0%D1%82%D0%B0%D0%BB%D0%BE%D0%B3_wiki-%D1%81%D0%B0%D0%B9%D1%82%D0%BE%D0%B2 - another list of Russian wikis [17:35] *** Larsenv has joined #archiveteam-bs [17:36] *** jshoard has joined #archiveteam-bs [17:47] *** Nikchemny has quit IRC (Quit: Page closed) [17:58] *** Aoede has quit IRC (Quit: ZNC - https://znc.in) [18:06] *** Aoede has joined #archiveteam-bs [18:41] *** balrog has quit IRC (Bye) [18:41] *** balrog has joined #archiveteam-bs [19:09] *** slyphic has joined #archiveteam-bs [20:10] *** ephemer0l has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [20:13] *** ephemer0l has joined #archiveteam-bs [21:11] 06:14 < LowLevelM> hopefully archiveteam is not full of lefties, because if that's true, I may leave [21:11] [21:15] https://twitter.com/textfiles/status/1289670466399948800 [21:25] *** jshoard_ has joined #archiveteam-bs [21:30] *** jshoard has quit IRC (Read error: Operation timed out) [21:44] lol [21:48] JAA you sent me a huge "we should do a thing" letter [21:48] And I am actually at an impasse with it [21:48] Because my standard procedure is to throw these sorts of requests at JAA [21:49] *** jshoard_ has quit IRC (Read error: Operation timed out) [21:50] SketchCow: Hm? I don't remember sending you anything recently. [21:52] On another note, Clutch is the third major(ish) game clip hosting site to shut down within less than a year after Plays.tv and Mixer. What's going on there? [21:52] SketchCow: Oh, are you confusing me with jrwr again? [21:53] This was about Tigris [21:53] Ah, that. [21:53] Not THIS time! [21:53] Heh :-) [21:53] Anyway, it's easier now [21:53] jrwr is the one with the hat [21:54] I mean... https://commons.wikimedia.org/wiki/File:Jason_Scott_(2017_Portrait).jpg [21:54] Anyway, I never knew what to do there [21:54] I mean identification for MY purposes [21:54] https://i1.wp.com/www.safer-computing.com/wp-content/uploads/2019/03/hatchan.jpg [21:57] Now that game streaming is such a big business, I guess it's not surprising a bunch of companies would try--and mostly fail--to compete with Twitch. [21:58] Yeah, I didn't really know either to be honest. I guess I thought you might know some people (since CollabNet/SVN is quite well-known) or launch a Twitter shitstorm or something else to somehow get them to reconsider their early shutdown. In the end though, the site did return for a few days after the original shutdown date for some reason, and I believe I got virtually everything from it. [21:58] Ah yes, that hat. :-) [22:01] *** wyatt8740 has quit IRC (Read error: Operation timed out) [22:02] lennier1: Yeah, true. Plays.TV and Mixer launched in 2015/16, Clutch is apparently more recent. Just a bit strange that they fail so shortly after each other. Probably just a coincidence though. [22:02] *** wyatt8740 has joined #archiveteam-bs [22:02] FWIW, fourzerofour estimated that clutch.win is only about 1TB of video. https://www.reddit.com/r/Archiveteam/comments/i1wep6/clutchwin_gameplay_videos_is_shutting_down_august/ [22:04] That's surprisingly small. [22:05] The videos seem to be on Fastly, so that should be fast. [22:06] Yeah, not bad at all assuming the math is right. [22:07] That seems to be wrong? "about 180K videos" the front page "games" with clip numbers seems to add up to over 3,168,000 [22:09] Yeah, that seems way more realistic. [22:09] Fortnite alone has 2.1M clips. [22:10] (More around 3.8-4M based on the more games tab. with his estimate of 5mb per clip, something much bigger than a few TB) [22:11] Yeah, assuming 4M clips and their numbers otherwise, that suggests ~20 TB. [22:12] Still not too bad actually. [22:12] Agreed, not too bad. [22:14] *** DopefishJ has quit IRC (Remote host closed the connection) [22:25] I got a lovely contact from a group of people who did a big console save [22:25] And they're using wayback for some of it and felt bad [22:25] I said not to feel bad [22:38] *** ndiddy has joined #archiveteam-bs [22:41] *** DFJustin has joined #archiveteam-bs [22:52] According to the API, there are 4026695 clips on Clutch currently. [22:53] 20 something TB if the original 5mb per clip is correct then? [22:54] Somewhere around that, yeah. [22:55] I'll gather all the clip slugs through the API. [22:56] Or rather, they call them "posts". [23:09] JAA: Looks like the API is fairly rich in metadata (e.g. durations and video URLs, obviously useful for estimates), so maybe get the full thing if you can [23:10] Unless you're using a less detailed one than the popular list [23:11] Is every post a video? [23:11] OrIdow6: Nope, that's the API I'm using, though not the popular one. [23:11] Going after recent, games list, etc. [23:11] And yes, I'm getting the "entire" API, more or less. [23:12] lennier1: Yeah, "post" is just the internal name for clips. [23:13] Cool. The reddit post said there were a lot of non-video pages as well. Not sure what those are. Images? Text? [23:15] Huh [23:16] I haven't seen anything non-video so far at least, but I haven't gone very deep yet. [23:18] Welp, the ArchiveBot job https://clutch.win/ ( bxf3kqpjaozbxf92xsvfegjs6 ) has videos accessible and being downloaded Oo; [23:19] JAA: Good [23:20] Ryz: Yeah, I expected that. The video URLs are in a JSON blob in the page. [23:26] Clips are available in three versions apparently: high resolution, high resolution with watermark (= "Clutch" + username), and low resolution. [23:27] They call these high_quality_video_url, watermark_video_url, and video_url, respectively. [23:28] For a video site, this has a very clean structure; wouldn't be surprised if it played back alright [23:30] !ig dow85s79nfu00rlg6tozid6sm ^https?://www\.start\.co\.il\:6789/ [23:30] Oops [23:31] Uhm... Their Fastly thing is actually an S3 bucket, and it's publicly listable. lol [23:33] How did you figure that out? [23:34] [23:34] Headers on https://ftw.global.ssl.fastly.net/media/videos/uploads/78/78b5/78b5e970f6cc3a2a73abb1081a179f454c8990f3.mp4 look like Google Cloud storage through Fastly [23:34] Well yeah, maybe not AWS S3, but something S3-like. [23:35] Oh [23:35] https://ftw.global.ssl.fastly.net/ [23:36] Listing that now. [23:37] Can't get the post slugs or IDs from that, I think, but at least it gives us a very good size estimate. [23:40] Doesn't look like it, unless there's a databse dump or something like that hidden there [23:40] Video names are just sha1 of content [23:44] Right, and the internal DB IDs don't use the hash sadly. [23:44] *** Gallifrey has quit IRC (Read error: Connection reset by peer) [23:45] Also, apparently the watermark_video_url is only added a bit after the upload. [23:45] *** HP_Archiv has joined #archiveteam-bs [23:46] *** acridAxid has joined #archiveteam-bs [23:50] *** Gallifrey has joined #archiveteam-bs [23:51] *** Gallifrey has quit IRC (Read error: Connection reset by peer) [23:51] *** chirlu has joined #archiveteam-bs [23:53] *** BlueMax has joined #archiveteam-bs [23:55] *** Gallifrey has joined #archiveteam-bs [23:58] This will need some more work, I'll continue tomorrow. [23:59] *** Gallifrey has quit IRC (Read error: Connection reset by peer)