#wikiteam 2019-04-21,Sun

↑back Search

Time Nickname Message
00:25 🔗 Zerote has quit IRC (Ping timeout: 260 seconds)
02:11 🔗 balrog has quit IRC (Quit: Bye)
02:16 🔗 balrog has joined #wikiteam
08:21 🔗 Zerote has joined #wikiteam
13:46 🔗 Zerote Nemo_bis: Hey, I'll continue in here, reckon it's a little easier
13:46 🔗 Zerote I ran a modified version of the script that just spits out links whenever more than one entry on IA that matches the search string is found: https://pastebin.com/v6Bq2a2p
13:46 🔗 Zerote The script selects the first on the list, which is not guaranteed to be the newest dump. For example, for the Archlinux page (line 42-45 in the pastebin), the script will select the first of the two IA dumps, which is from 2012, instead of the second, which has a dump as new as 2018.
14:53 🔗 Nemo_bis Zerote: it doesn't help that uploader.py doesn't manage to update the "lastupdateddate" field
14:54 🔗 Nemo_bis but you could just add "&sort=-publicdate" to the search URL to get the most recent item and that would increase the chances to get the most recent dump
14:54 🔗 Nemo_bis (though we update older items with newer dumps so that's not certain either)
15:07 🔗 Zerote Yeah, it might be a good idea to search through all the results to find the one containing the newest dump
16:37 🔗 Nemo_bis Zerote: with screenscraping that's slightly tedious but I think you can get a list of files from the internetarchive library
16:37 🔗 Nemo_bis anyway not rocket science, just needs to be tested :)
23:38 🔗 Zerote has quit IRC (Ping timeout: 262 seconds)

irclogger-viewer