[00:16] *** Arcorann has joined #archiveteam-bs
[00:16] *** Arcorann has quit IRC (Read error: Connection reset by peer)
[00:16] *** Arcorann has joined #archiveteam-bs
[00:50] *** cf has quit IRC (Ping timeout: 745 seconds)
[00:54] *** Craigle has quit IRC (Quit: Ping timeout (120 seconds))
[00:54] *** Craigle has joined #archiveteam-bs
[00:55] *** cf has joined #archiveteam-bs
[01:12] *** igloo25 has quit IRC (Ping timeout: 745 seconds)
[01:25] *** LowLevelM has joined #archiveteam-bs
[01:36] *** Gallifrey has quit IRC (Remote host closed the connection)
[01:37] *** Gallifrey has joined #archiveteam-bs
[01:44] Shadiness over phone numbers with Grubhub ( https://www.buzzfeednews.com/article/venessawong/grubhub-phone-order-call-fee-coronavirus ) and Yelp ( https://www.vice.com/amp/en_us/article/wjwebw/yelp-is-sneakily-replacing-restaurants-phone-numbers-so-grubhub-can-take-a-cut ): they charge fees even if you try to call the restaurants directly
[01:44] ...Unsure if we have enough power to do a proactive archiving task of both of their websites
[01:45] They didn't die due to their shady stuff in the past, they probably won't die from this either (sadly).
[01:45] Also, it sounds worth archiving those countless websites Grubhub is setting up (this was reported back in June 2019): https://www.theverge.com/2019/6/28/19154220/grubhub-seamless-fake-restaurant-domain-names-commission-fees
[01:53] https://www.cnbc.com/2020/03/27/yelp-stops-adding-gofundme-pages-to-businesses-after-opt-out-complaints.html
[02:39] *** lunik13 has joined #archiveteam-bs
[03:17] *** qw3rty_ has joined #archiveteam-bs
[03:24] *** qw3rty__ has quit IRC (Read error: Operation timed out)
[03:50] *** HP_Archiv has joined #archiveteam-bs
[04:00] *** DogsRNice has quit IRC (Read error: Connection reset by peer)
[04:02] Forgot to mention, Turiver has been done for almost 24 hours. I'll consider this complete now; at least there were no further spikes of 404s.
(The AB job is still running and won't finish anytime soon.)
[05:24] *** HP_Archiv has quit IRC (Quit: Leaving)
[06:40] *** robogoat has quit IRC (Read error: Operation timed out)
[06:43] *** robogoat has joined #archiveteam-bs
[06:48] *** BeefyBoot has joined #archiveteam-bs
[06:57] archive.today & alternative domains down today?
[07:11] *** Stilett0 is now known as Stiletto
[08:17] BeefyBoot: using cloudflare dns?
[08:18] 5.196.68.232 archive.is archive.today archive.li archive.vn
[08:21] Yes, I was
[08:37] OrIdow6: Do you think your Winnipeg thing is going to finish in time?
[08:38] (Just wondering, I had a note about that site.)
[09:44] *** schbirid has quit IRC (Quit: Leaving)
[10:34] *** BlueMax has quit IRC (Read error: Connection reset by peer)
[10:49] *** Gallifrey has quit IRC (Remote host closed the connection)
[11:07] *** BeefyBoot has quit IRC (Quit: Connection closed for inactivity)
[14:02] *** chfoo has quit IRC (Read error: Operation timed out)
[14:10] *** chfoo has joined #archiveteam-bs
[14:31] *** jmtd has joined #archiveteam-bs
[14:32] *** sknebel has quit IRC (Write error: Broken pipe)
[14:32] *** Jon| has quit IRC (Write error: Broken pipe)
[14:32] *** thejsa_ has quit IRC (Write error: Broken pipe)
[14:32] *** thejsa has joined #archiveteam-bs
[14:32] *** sknebel has joined #archiveteam-bs
[15:49] *** Arcorann has quit IRC (Read error: Connection reset by peer)
[16:14] *** DogsRNice has joined #archiveteam-bs
[17:35] *** jshoard has joined #archiveteam-bs
[17:59] *** Gallifrey has joined #archiveteam-bs
[19:07] *** jshoard has quit IRC (Leaving)
[19:07] *** jshoard has joined #archiveteam-bs
[20:05] *** LowLevelM has quit IRC (The Lounge - https://thelounge.chat)
[20:15] *** qw3rty_ has quit IRC (Leaving)
[21:03] *** Mayonaise has quit IRC (Read error: Operation timed out)
[21:03] *** dxrt_ has quit IRC (Read error: Operation timed out)
[21:04] *** paul2520 has quit IRC (Read error: Operation timed out)
[21:04] *** Wingy has quit IRC (Read error: Operation timed out)
[21:04] *** Lord_Nigh has quit IRC (Read error: Operation timed out)
[21:04] *** fredgido_ has joined #archiveteam-bs
[21:04] *** Lord_Nigh has joined #archiveteam-bs
[21:04] *** jshoard_ has joined #archiveteam-bs
[21:05] *** Jake has quit IRC (Read error: Operation timed out)
[21:05] *** Mayonaise has joined #archiveteam-bs
[21:05] *** lennier2 has joined #archiveteam-bs
[21:05] *** _niklas has quit IRC (Read error: Operation timed out)
[21:05] *** twigfoot has quit IRC (Read error: Operation timed out)
[21:06] *** asdf0101 has quit IRC (Read error: Operation timed out)
[21:06] *** MrRadar has quit IRC (Read error: Operation timed out)
[21:06] *** drcd has quit IRC (Read error: Operation timed out)
[21:06] *** twigfoot has joined #archiveteam-bs
[21:06] *** MrRadar has joined #archiveteam-bs
[21:06] *** drcd has joined #archiveteam-bs
[21:06] *** sembiance has quit IRC (Read error: Operation timed out)
[21:06] *** ranma has joined #archiveteam-bs
[21:07] *** pie_ has quit IRC (Read error: Operation timed out)
[21:07] *** sivoais_ has quit IRC (Read error: Operation timed out)
[21:07] *** sivoais has joined #archiveteam-bs
[21:07] *** Gallifrey has quit IRC (Read error: Operation timed out)
[21:07] *** Gfy_ has quit IRC (Read error: Operation timed out)
[21:08] *** Gallifrey has joined #archiveteam-bs
[21:08] *** ranma_ has quit IRC (Read error: Operation timed out)
[21:08] *** kisspunch has quit IRC (Read error: Operation timed out)
[21:09] *** Gfy has joined #archiveteam-bs
[21:09] *** Yurume_ has quit IRC (Read error: Operation timed out)
[21:09] *** jshoard has quit IRC (Read error: Operation timed out)
[21:10] *** fredgido has quit IRC (Read error: Operation timed out)
[21:10] *** Yurume has joined #archiveteam-bs
[21:10] *** Kenshin has quit IRC (Read error: Operation timed out)
[21:10] *** lennier1 has quit IRC (Read error: Operation timed out)
[21:10] *** lennier2 is now known as lennier1
[21:10] *** Kenshin has joined #archiveteam-bs
[21:10] *** pie_ has joined #archiveteam-bs
[21:10] *** wp494 has quit IRC (Read error: Operation timed out)
[21:10] *** _niklas has joined #archiveteam-bs
[21:10] *** kisspunch has joined #archiveteam-bs
[21:10] *** systwi_ has joined #archiveteam-bs
[21:10] *** asdf0101 has joined #archiveteam-bs
[21:10] *** notroot2 has joined #archiveteam-bs
[21:10] *** Jake has joined #archiveteam-bs
[21:10] *** notroot has quit IRC (Read error: Operation timed out)
[21:10] *** dxrt_ has joined #archiveteam-bs
[21:11] *** systwi has quit IRC (Read error: Operation timed out)
[21:11] *** sembiance has joined #archiveteam-bs
[21:11] *** Wingy has joined #archiveteam-bs
[21:12] *** paul2520 has joined #archiveteam-bs
[21:32] *** wp494 has joined #archiveteam-bs
[21:36] *** jshoard_ has quit IRC (Leaving)
[21:39] Huh, WBM gets 15-25 TiB a day: https://twitter.com/textfiles/status/1282231639515504640
[21:39] Does that include all the ArchiveBot business of sending stuff via FOS and onto WBM
[21:39] ?
[21:58] Why wouldn't it?
[21:59] AB data is roughly 2 TB/day on average since 1 July.
[22:39] *** xit has joined #archiveteam-bs
[23:11] Is AB the biggest contributor to integrating content to WBM every day? oo;
[23:13] *** Arcorann has joined #archiveteam-bs
[23:19] it certainly isn't by pure size
[23:19] may be by unique URLs though
[23:22] Seems like SPN2 is about the same size as AB by both data size and URL count, and SPN is smaller.
[23:34] jodizzle: At the current rate, and with the current queue, it would; right now, I'm going through a bunch of old posts that all have 0 comments (suggesting to me that they didn't have comments when originally put up); in any case the more recent pages are well-covered
[23:49] Great, good to hear
[23:50] *** Arcorann_ has joined #archiveteam-bs
[23:57] *** Arcorann has quit IRC (Read error: Operation timed out)