[00:24] *** systwi_ has joined #internetarchive
[00:31] *** systwi has quit IRC (Read error: Operation timed out)
[01:18] *** OrIdow6^2 has joined #internetarchive
[01:20] *** OrIdow6 has quit IRC (Ping timeout: 265 seconds)
[01:40] *** OrIdow6^2 has quit IRC (Ping timeout: 265 seconds)
[02:25] *** Stilett0 has joined #internetarchive
[02:25] *** DopefishJ has joined #internetarchive
[02:31] *** sircmpwn has quit IRC (Read error: Connection reset by peer)
[02:31] *** OrIdow6 has joined #internetarchive
[02:34] *** tonsofpcs has quit IRC (Ping timeout: 745 seconds)
[02:34] *** Stiletto has quit IRC (Ping timeout: 745 seconds)
[02:34] *** DFJustin has quit IRC (Ping timeout: 745 seconds)
[02:39] *** sircmpwn has joined #internetarchive
[03:10] *** tonsofpcs has joined #internetarchive
[03:30] *** DogsRNice has quit IRC (Read error: Connection reset by peer)
[03:43] *** qw3rty__ has joined #internetarchive
[03:51] *** qw3rty_ has quit IRC (Read error: Operation timed out)
[04:41] Nemo_bis, do you have the link to the complaint
[05:11] *** Ryz has quit IRC (Remote host closed the connection)
[05:11] *** kiska1825 has quit IRC (Remote host closed the connection)
[05:12] *** Ryz has joined #internetarchive
[05:12] *** kiska1825 has joined #internetarchive
[06:57] *** wessel152 has quit IRC (Read error: Operation timed out)
[07:14] *** atphoenix has quit IRC (Read error: Connection reset by peer)
[07:14] *** atphoenix has joined #internetarchive
[08:12] Raccoon: https://www.courtlistener.com/docket/17211300/hachette-book-group-inc-v-internet-archive/
[08:28] *** Ryz has quit IRC (Remote host closed the connection)
[08:28] *** kiska1825 has quit IRC (Remote host closed the connection)
[08:29] *** Ryz has joined #internetarchive
[08:29] *** kiska1825 has joined #internetarchive
[13:23] Ah, very nice. thanks.
[13:24] ( complaint: https://www.courtlistener.com/recap/gov.uscourts.nysd.537900/gov.uscourts.nysd.537900.1.0.pdf -- response: https://www.courtlistener.com/recap/gov.uscourts.nysd.537900/gov.uscourts.nysd.537900.33.0.pdf )
[14:29] *** Craigle3 has joined #internetarchive
[14:34] *** Craigle3 has quit IRC (Read error: Connection reset by peer)
[14:35] *** Craigle4 has joined #internetarchive
[14:37] *** Craigle has quit IRC (Ping timeout: 745 seconds)
[14:37] *** Craigle4 is now known as Craigle
[16:09] *** Craigle has quit IRC (Quit: The Lounge - https://thelounge.chat)
[16:11] *** Craigle has joined #internetarchive
[17:59] *** t3 has joined #internetarchive
[19:31] *** DogsRNice has joined #internetarchive
[20:07] *** Stilett0 is now known as Stiletto
[22:17] wish the wbm could detect when a proper page has been replaced by domain hosting boiler plates, 404s, and other post-mortem not-the-page-you-were-looking-for
[22:18] so that those can be identified and avoided while navigating the timeline for the most fresh version of the page before its demise
[22:19] ie. https://web.archive.org/web/20150907215822/http://www.merlyn.demon.co.uk/critdate.htm
[22:20] s/404/304/ etc
[22:31] I've thought that the timeline could indicate how much the page has changed in between each snapshot (rather, group of snapshots)
[22:33] Something like, for each one, change the color based on the SimHash vs. the previous one
[22:33] Also the calendar
[22:33] Obviously
[22:35] Though that would obviously be expensive to do, as you can't just use the CDX, you have to look into the page
[22:35] contents
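
The SimHash idea floated at 22:33 could look roughly like the sketch below. It is only an illustration under stated assumptions, not how the Wayback Machine works: the snapshot text is assumed to have been fetched and stripped of markup already (the expensive "look into the page contents" step from 22:35 is not shown), the tokenizer is a plain word split, and the names `simhash`, `hamming`, `change_color` and the color thresholds are all hypothetical.

```python
import hashlib
import re


def simhash(text: str, bits: int = 64) -> int:
    """Compute a SimHash fingerprint of `text` (bits must be a multiple of 8, max 128)."""
    weights = [0] * bits
    for token in re.findall(r"\w+", text.lower()):
        # Hash each token to a `bits`-wide integer using the first bytes of MD5.
        digest = hashlib.md5(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[: bits // 8], "big")
        # Each set bit votes +1 for its position, each clear bit votes -1.
        for i in range(bits):
            weights[i] += 1 if (h >> i) & 1 else -1
    # The fingerprint keeps the bits whose votes came out positive.
    fingerprint = 0
    for i, w in enumerate(weights):
        if w > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


def change_color(previous_text: str, current_text: str, bits: int = 64) -> str:
    """Map the SimHash distance between two snapshots to a timeline color.

    The thresholds are arbitrary placeholders chosen for illustration.
    """
    ratio = hamming(simhash(previous_text, bits), simhash(current_text, bits)) / bits
    if ratio < 0.05:
        return "green"   # essentially unchanged
    if ratio < 0.25:
        return "yellow"  # moderate edits
    return "red"         # likely a different page (parking page, error page, ...)
```

Something like `change_color(text_of_snapshot_n_minus_1, text_of_snapshot_n)` would then be evaluated once per snapshot (or group of snapshots) to pick the color of each timeline or calendar entry, so a run of "green" followed by a single "red" would hint at the point where the real page was replaced.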