Time |
Nickname |
Message |
02:07
🔗
|
|
balrog has quit IRC (Quit: Bye) |
02:12
🔗
|
|
balrog has joined #wikiteam |
09:27
🔗
|
|
balrog has quit IRC (Read error: Operation timed out) |
09:37
🔗
|
|
balrog has joined #wikiteam |
09:37
🔗
|
|
kiska1 has quit IRC (Read error: Operation timed out) |
09:37
🔗
|
|
logchfoo1 has quit IRC (Ping timeout: 600 seconds) |
09:38
🔗
|
|
logchfoo2 starts logging #wikiteam at Fri Jan 04 09:38:42 2019 |
09:38
🔗
|
|
logchfoo2 has joined #wikiteam |
09:38
🔗
|
|
kiska1 has joined #wikiteam |
10:55
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
13:04
🔗
|
|
hook54321 has joined #wikiteam |
14:19
🔗
|
|
Fae has joined #wikiteam |
14:21
🔗
|
Fae |
Hi, I have around a million public domain image files on Wikimedia Commons that I'd like to gradually add source archive links for using a bot, and I'd like to use Python to run it. Does anyone know of an existing module or example Python script I can crib from? |
14:28
🔗
|
Fae |
BTW, started to look at https://github.com/jjjake/internetarchive |
15:02
🔗
|
Nemo_bis |
Fae: what are "source archive links"? |
15:06
🔗
|
Nemo_bis |
Are you talking about external links like https://commons.wikimedia.org/wiki/Category:Uploads_by_F%C3%A6_with_linkrot ? |
16:24
🔗
|
|
hook54321 has quit IRC () |
16:25
🔗
|
|
hook54321 has joined #wikiteam |
18:17
🔗
|
Fae |
Yes, but working links that I can add to IA via a housekeeping task |
18:18
🔗
|
Fae |
I think I can do this with the internetarchive module above, but it'll take some customization per batch if I have to add in useful metadata |
18:19
🔗
|
Fae |
I'd rather just throw web links at IA; not sure if that's sufficient |
21:36
🔗
|
Fae |
Got my archives going, using an initial tranche of train photos https://commons.wikimedia.org/w/index.php?search=incategory%3A%22Photographs+from+trainpix.org%22+hastemplate%3AWayback |