[00:08] *** Joseph_ has joined #archiveteam-bs [00:08] *** VerfiedJ has quit IRC (Read error: Connection reset by peer) [00:09] *** Joseph_ has quit IRC (Client Quit) [01:13] *** Despatche has quit IRC (Connection reset by deer) [01:40] *** omarroth has joined #archiveteam-bs [01:41] *** omarroth has quit IRC (Client Quit) [01:42] *** omarroth has joined #archiveteam-bs [02:36] *** bithippo has joined #archiveteam-bs [03:01] *** bithippo has quit IRC (Textual IRC Client: www.textualapp.com) [03:56] *** exoire has quit IRC (Read error: Operation timed out) [03:59] How much of Twitter have we archived? [04:04] *** RedType has quit IRC (leaving) [04:06] From the number of deleted tweets that keep coming up from years ago I assumed there was a database somewhere with all of it. Is that not the case? [04:23] *** qw3rty118 has joined #archiveteam-bs [04:28] *** qw3rty117 has quit IRC (Read error: Operation timed out) [04:57] *** odemgi has joined #archiveteam-bs [04:58] *** odemg has quit IRC (Ping timeout: 265 seconds) [05:01] *** odemgi_ has quit IRC (Read error: Operation timed out) [05:10] *** odemg has joined #archiveteam-bs [05:25] *** kiskabak has quit IRC (Ping timeout: 265 seconds) [05:25] *** w0rmybak has quit IRC (Ping timeout: 265 seconds) [05:30] *** Albardin has quit IRC (Ping timeout: 600 seconds) [05:31] *** SomeoneEl has quit IRC (Ping timeout: 633 seconds) [05:45] *** omarroth has quit IRC (Read error: Operation timed out) [05:47] *** tomaspark has joined #archiveteam-bs [06:12] *** wp494 has quit IRC (Ping timeout: 633 seconds) [06:13] *** wp494 has joined #archiveteam-bs [06:34] https://www.reddit.com/r/RevEng_TutsAndTools/comments/afr10k/full_siterip_of_exetools_forum_except_attachments/ee0tqlk/ [08:56] *** odemg has quit IRC (Quit: Leaving) [10:01] *** BlueMax has quit IRC (Quit: Leaving) [11:59] *** VerfiedJ has joined #archiveteam-bs [12:13] *** antomat__ is now known as antomatic [12:24] *** antomati_ has joined #archiveteam-bs [12:26] *** antomatic has quit IRC (Read error: Operation timed out) [12:26] *** antomati_ is now known as antomatic [12:35] *** Joseph_ has joined #archiveteam-bs [12:36] *** VerfiedJ has quit IRC (Read error: Connection reset by peer) [12:36] *** Joseph_ is now known as VerfiedJ [14:08] *** yano has quit IRC (Quit: WeeChat, The Better IRC Client, https://weechat.org/) [14:17] *** yano has joined #archiveteam-bs [14:52] *** yano has quit IRC (Read error: Connection reset by peer) [14:52] *** VADemon has quit IRC (west.us.hub irc.Prison.NET) [14:52] *** achip has quit IRC (west.us.hub irc.Prison.NET) [14:52] *** marked has quit IRC (west.us.hub irc.Prison.NET) [14:52] *** yano has joined #archiveteam-bs [14:59] *** marked has joined #archiveteam-bs [14:59] *** VADemon has joined #archiveteam-bs [14:59] *** achip has joined #archiveteam-bs [15:09] *** enowaldo has joined #archiveteam-bs [15:09] kiska: Here I am. [15:09] enowaldo: Are we just watching them for signs of death [15:10] kiska: Yes. https://ello.co [15:10] Or are there signs of death occurring? [15:10] Founded ~2015, all the original crew are gone, owned by a deathly quiet Los Angeles artworks management group with virtually no contact info. [15:10] Hrm I see... [15:10] Site's been up, but strangeness going on since ~August, organisationally. [15:11] Very heavily graphics-art oriented, a small bit of discussion. ~1-10 million users, I suspect, actives a very small fraction of that. [15:11] Hrm: "engage our global community of 625K artists in 175 countries." [15:11] Just happened to see the Deathwatch section at archiveteam.org [15:11] There you go. [15:12] It's also reasonably culturally interesting. There are some local talents who are pretty damned good. A poet and an SVG graphics wizard among them, several writers. [15:13] And tons and tons of visual art, as I said. [15:13] *** wp494 has quit IRC (Read error: Operation timed out) [15:13] To "discover" artists, you need an account [15:13] This would fit "owners missing in action" off the Dev/Infra page. [15:13] Regwalls are stupid. [15:13] *** wp494 has joined #archiveteam-bs [15:14] There's *no* indication of imminent death, but a lot of creaking going on. [15:19] Right adding this to my todo list [15:22] kiska: Founders leaving, post: https://ello.co/dredmorbius/post/bl2c2s7jdyexthpvybp3rw [15:23] One of the bigger flags: the site had been registered as a B-Corp, but there's no sign that it still is, and no announcements or clarity on that point. [15:23] I see... [15:25] David Dailey is the SVG wizard (CMU emeritus) I mentioned, long thread on various oddness and disappointments, from 4 months ago: https://ello.co/ddailey/post/h86db_x3malsegwcy6mtwq [15:25] File those two among your evidence, along with the corp status vagueness. [15:31] arkiver: Can I also get you to talk to IA about ello as well? [15:36] kiska: sure, need a project? [15:37] I believe it would be best to have it [15:37] There are apparently 625k artists of varying size [15:38] And fanfiction.net and writing.com as well? Since we had a ArchiveBot job for them and they crashed [15:39] fanfiction and writing are probably not too large [15:39] I´d say anything below 50 TB is totally fine [15:39] as long as it´s in real danger [15:40] Alright, ello is probably the most in danger of the 3 projects on my todo, since owners have gone awol and various oddities have emerged [15:42] kiska: Time to archive it. [15:43] Yep! Also it looks like we have time so testing is a thing I can do unlike tumblr [15:44] Yay! [15:45] *** VerfiedJ has quit IRC (Read error: Connection reset by peer) [15:45] *** VerfiedJ has joined #archiveteam-bs [15:52] t3: This is probably the website in question: https://adamowicz.pl/ [15:53] WTF? YouTube are removing annotations? [15:54] VerfiedJ: http://adamowicz.pl/ [15:55] enowaldo: Yes they are [15:55] VerfiedJ: I'm having it archived through ArchiveBot. It's now in the queue. [15:55] Oh, wait, those are the on-screen nags, not the captions, right? [15:56] t3: ok, thanks [16:05] VerfiedJ: Does he have a Facebook account? [16:06] t3: https://www.facebook.com/Pawel.Adamowicz [16:06] and instagram: https://www.instagram.com/adamowiczpawel/ [16:12] VerfiedJ: Thanks! [17:14] *** Joseph_ has joined #archiveteam-bs [17:17] *** VerfiedJ has quit IRC (Ping timeout: 252 seconds) [17:20] kiska: any estimates on ello? [17:21] 625k artists, assuming 30 images per artist and 3MiB/image, I estimate 375TiB, but assume it will be alot more [17:22] riht [17:22] right [17:22] I might have put an extra 0 somewhere, but I am going to estimate 400TiB for ello [17:22] not sure we can get a project for that tbh, it´s a lot of data and afaik it´s not shutting down [17:22] just instable [17:22] but I´ll bring it up [17:22] don´t have too high hopes though [17:46] *** step has quit IRC (ZNC 1.7.1 - https://znc.in) [17:46] PurpleSym: I'm currently archiving onsemi.com. [17:47] It seems to be grabbing onsemi.jp and onsemi.cn URLs too. [17:56] *** step has joined #archiveteam-bs [17:58] t3: Are you saving to WARC and uploading to IA? [18:02] PurpleSym: Yes. [18:15] *** zhongfu has quit IRC (Remote host closed the connection) [18:16] *** zhongfu has joined #archiveteam-bs [18:18] Good, t3. I looked into recovering metadata from PDF’s today, but PDF to text conversion is a nightmare. So we’ll need scrape the corresponding websites. [18:20] *** schbirid has joined #archiveteam-bs [18:28] *** Wizzito has joined #archiveteam-bs [18:28] Yep [18:29] Flickr is definitely going to beat MobileMe as the biggest ArchiveTeam project [18:29] 200+ TB already [18:40] :) [18:47] kiska: regarding ello. since the size it quite big we probably won´t start that project yet [18:48] but feel free to write the scripts and as soon as we have signals it´s going down we´ll get started on it [18:50] *** Kenshin has quit IRC (Read error: Operation timed out) [18:51] *** Kenshin has joined #archiveteam-bs [18:56] *** tuluu has quit IRC (Ping timeout: 252 seconds) [19:03] *** nikow has quit IRC (Read error: Connection reset by peer) [19:50] fyi it is possible to get your gdrive in a state when emptying the trash is not possible anymore... [19:51] it just says it does but nothing happens for me >.> [19:53] *** Wizzito has quit IRC (Quit: Leaving) [20:01] schbirid: Who deletes things? [20:07] :P [20:07] not google, evitably [20:10] *** Kenshin has quit IRC (Read error: Connection reset by peer) [20:10] *** Kenshin has joined #archiveteam-bs [20:21] *** VerfiedJ has joined #archiveteam-bs [20:23] *** Joseph_ has quit IRC (Ping timeout: 252 seconds) [20:58] *** Mateon1 has quit IRC (Ping timeout: 265 seconds) [20:59] *** Mateon1 has joined #archiveteam-bs [21:01] *** phuzion has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [21:42] *** odemg has joined #archiveteam-bs [21:58] *** znak has quit IRC (Quit: leaving) [22:01] *** znak has joined #archiveteam-bs [22:04] *** phuzion has joined #archiveteam-bs [22:07] *** phuzion has quit IRC (Client Quit) [22:07] *** phuzion has joined #archiveteam-bs [22:19] *** Hani111 has joined #archiveteam-bs [22:20] *** Hani111_ has joined #archiveteam-bs [22:22] *** Hani111 has quit IRC (Read error: Connection reset by peer) [22:22] *** Hani111_ has quit IRC (Read error: Connection reset by peer) [22:22] *** Hani has quit IRC (Read error: Connection reset by peer) [22:23] *** Hani111_ has joined #archiveteam-bs [22:23] *** Hani111_ is now known as Hani [22:27] *** Hani111 has joined #archiveteam-bs [22:29] arkiver: Re: Ello: OK. But keep a watch on it. [22:29] *** m007a83_ is now known as m007a83 [22:31] *** Hani111_ has joined #archiveteam-bs [22:32] *** Hani has quit IRC (Read error: Operation timed out) [22:32] *** Hani111_ is now known as Hani [22:33] *** Hani111 has quit IRC (Read error: Operation timed out) [22:36] *** C4K3 has quit IRC (Quit: leaving) [22:37] *** VerfiedJ has quit IRC (Quit: Leaving) [22:43] *** VerfiedJ has joined #archiveteam-bs [22:43] *** VerfiedJ has quit IRC (Remote host closed the connection) [22:43] *** C4K3 has joined #archiveteam-bs [22:49] enowaldo: exactly [23:17] kiska: I think lots of Ello users haven't uploaded anything [23:22] I just had a look at what they serve to users, 1 ~200KiB image. So I assume 100 images per artist, 625k artists, 2MiB/image. Gives me ~125TiB ±10%. And if what hook54321 says is true, that number can be further reduced because they haven't uploaded anything [23:23] Also the initial ~400TB estimate was made using 30MiB/image which was a slight mistake on my part for inserting an extra 0 in my calculation [23:27] *** BlueMax has joined #archiveteam-bs