Time |
Nickname |
Message |
00:25
🔗
|
|
Zerote has quit IRC (Ping timeout: 260 seconds) |
02:11
🔗
|
|
balrog has quit IRC (Quit: Bye) |
02:16
🔗
|
|
balrog has joined #wikiteam |
08:21
🔗
|
|
Zerote has joined #wikiteam |
13:46
🔗
|
Zerote |
Nemo_bis: Hey, I'll continue in here, reckon it's a little easier |
13:46
🔗
|
Zerote |
I ran a modified version of the script that just spits out links whenever more than one entry on IA that matches the search string is found: https://pastebin.com/v6Bq2a2p |
13:46
🔗
|
Zerote |
The script selects the first item on the list, which is not guaranteed to contain the newest dump. For example, for the Archlinux page (lines 42-45 in the pastebin), the script will select the first of the two IA items, which is from 2012, instead of the second, which has a dump as new as 2018. |
14:53
🔗
|
Nemo_bis |
Zerote: it doesn't help that uploader.py doesn't manage to update the "lastupdateddate" field |
14:54
🔗
|
Nemo_bis |
but you could just add "&sort=-publicdate" to the search URL to get the most recent item and that would increase the chances to get the most recent dump |
14:54
🔗
|
Nemo_bis |
(though we update older items with newer dumps so that's not certain either) |
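To illustrate the "&sort=-publicdate" suggestion above, here is a minimal sketch that appends the parameter to an existing search URL. The helper name `with_newest_first` and the example query `wiki-example` are hypothetical; the script's real search URL is not shown in this log.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_newest_first(search_url: str) -> str:
    """Return search_url with sort=-publicdate set, replacing any existing sort."""
    parts = urlsplit(search_url)
    # Keep every existing query parameter except a previous "sort".
    params = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != "sort"
    ]
    params.append(("sort", "-publicdate"))
    return urlunsplit(parts._replace(query=urlencode(params)))


# Hypothetical search URL, used only for illustration.
url = with_newest_first("https://archive.org/search.php?query=wiki-example")
print(url)
```

As noted in the message above, this only raises the odds of hitting the newest dump, because older items can receive newer dumps after publication.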
15:07
🔗
|
Zerote |
Yeah, it might be a good idea to search through all the results to find the one containing the newest dump |
16:37
🔗
|
Nemo_bis |
Zerote: with screenscraping that's slightly tedious but I think you can get a list of files from the internetarchive library |
16:37
🔗
|
Nemo_bis |
anyway not rocket science, just needs to be tested :) |
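A rough sketch of the idea discussed above: look at the files of every matching item and pick the item holding the newest dump. It assumes dump file names carry a YYYYMMDD date followed by `-wikidump` or `-history`, which is an assumption about the naming and may not hold for every item. The names `newest_dump`, `list_files` and the sample identifiers are hypothetical. `list_files` uses `get_item(...).files` from the internetarchive library, needs network access, and is untested here.

```python
import re
from typing import Dict, Iterable, Optional, Tuple

# Assumption: dump files are named like "<wiki>-YYYYMMDD-wikidump.7z"
# or "<wiki>-YYYYMMDD-history.xml.7z".
DUMP_DATE = re.compile(r"-(\d{8})-(?:wikidump|history)")


def newest_dump(files_by_item: Dict[str, Iterable[str]]) -> Optional[Tuple[str, str]]:
    """Return (identifier, YYYYMMDD) for the item holding the newest dated dump."""
    best = None
    for identifier, names in files_by_item.items():
        for name in names:
            match = DUMP_DATE.search(name)
            # Fixed-width YYYYMMDD strings compare correctly as plain strings.
            if match and (best is None or match.group(1) > best[1]):
                best = (identifier, match.group(1))
    return best


def list_files(identifiers: Iterable[str]) -> Dict[str, list]:
    """Fetch file names per item (requires network and the internetarchive package)."""
    from internetarchive import get_item  # imported lazily; optional dependency

    return {i: [f["name"] for f in get_item(i).files] for i in identifiers}


# Made-up sample data standing in for two search results for the same wiki.
sample = {
    "wiki-example-old": ["example-20120304-wikidump.7z", "example-20120304-history.xml.7z"],
    "wiki-example-new": ["example-20150101-wikidump.7z", "example-20180520-history.xml.7z"],
}
result = newest_dump(sample)
print(result)
```

Comparing dates taken from file names sidesteps the unreliable "lastupdateddate" field and the fact that older items may be updated with newer dumps.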
23:38
🔗
|
|
Zerote has quit IRC (Ping timeout: 262 seconds) |