[00:25] *** Zerote has quit IRC (Ping timeout: 260 seconds) [02:11] *** balrog has quit IRC (Quit: Bye) [02:16] *** balrog has joined #wikiteam [08:21] *** Zerote has joined #wikiteam [13:46] Nemo_bis: Hey, I'll continue in here, reckon it's a little easier [13:46] I ran a modified version of the script that just spits out links whenever more than one entry on IA that matches the search string is found: https://pastebin.com/v6Bq2a2p [13:46] The script selects the first on the list, which is not guaranteed to be the newest dump. For example, for the Archlinux page (line 42-45 in the pastebin), the script will select the first of the two IA dumps, which is from 2012, instead of the second, which has a dump as new as 2018. [14:53] Zerote: it doesn't help that uploader.py doesn't manage to update the "lastupdateddate" field [14:54] but you could just add "&sort=-publicdate" to the search URL to get the most recent item and that would increase the chances to get the most recent dump [14:54] (though we update older items with newer dumps so that's not certain either) [15:07] Yeah, it might be a good idea to search through all the results to find the one containing the newest dump [16:37] Zerote: with screenscraping that's slightly tedious but I think you can get a list of files from the internetarchive library [16:37] anyway not rocket science, just needs to be tested :) [23:38] *** Zerote has quit IRC (Ping timeout: 262 seconds)