Time |
Nickname |
Message |
00:19
🔗
|
|
dan- has joined #wikiteam |
01:26
🔗
|
|
kyan has joined #wikiteam |
04:07
🔗
|
|
kyan has quit IRC (Ping timeout: 258 seconds) |
04:46
🔗
|
|
kyan has joined #wikiteam |
04:59
🔗
|
|
kyan has quit IRC (Quit: Leaving) |
08:44
🔗
|
|
phuzion has quit IRC (Ping timeout: 186 seconds) |
11:55
🔗
|
|
vitzli has joined #wikiteam |
13:16
🔗
|
|
phuzion has joined #wikiteam |
14:00
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
16:12
🔗
|
|
Start has joined #wikiteam |
16:14
🔗
|
|
Start has quit IRC (Client Quit) |
16:17
🔗
|
|
chfoo has quit IRC (Ping timeout: 310 seconds) |
16:19
🔗
|
|
Start has joined #wikiteam |
16:19
🔗
|
|
chfoo has joined #wikiteam |
16:20
🔗
|
|
svchfoo1 sets mode: +o chfoo |
17:04
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
18:13
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
18:22
🔗
|
arkiver |
Nemo_bis: how does wikiteam currently get the page prefix for the URL? |
18:23
🔗
|
arkiver |
for example, prefix for archiveteam wiki URLs is http://archiveteam.org/index.php?title= |
18:24
🔗
|
Nemo_bis |
arkiver: there are many cases and often wikis are misconfigured |
18:25
🔗
|
arkiver |
Nemo_bis: I'm currently naming items like: |
18:25
🔗
|
arkiver |
mediawikieu:archiveteam.org/api.php:archiveteam.org/index.php?title= |
18:26
🔗
|
Nemo_bis |
arkiver: part of the logic is in https://github.com/WikiTeam/wikiteam/blob/master/dumpgenerator.py#L1952 |
18:26
🔗
|
Nemo_bis |
uh? I don't get it |
18:26
🔗
|
arkiver |
well |
18:27
🔗
|
arkiver |
mediawiki:apilink:urlprefix |
18:27
🔗
|
arkiver |
mediawikieu is for grabbing external links |
18:27
🔗
|
arkiver |
Nemo_bis: scripts are here https://github.com/ArchiveTeam/wikis-grab/blob/master/pipeline.py#L212 |
18:31
🔗
|
arkiver |
Nemo_bis: items are split here https://github.com/ArchiveTeam/wikis-grab/blob/master/pipeline.py#L201 |
18:32
🔗
|
arkiver |
itemnames* |
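Editor's sketch of the item name layout arkiver describes above (`mediawiki:apilink:urlprefix`). This is not the code from pipeline.py. The names `WikiItem`, `parse_item_name` and `page_url` are hypothetical, and the `http` scheme and underscore handling for titles are assumptions, since the example item name carries no scheme.

```python
from typing import NamedTuple
from urllib.parse import quote


class WikiItem(NamedTuple):
    """Hypothetical container for the three parts of an item name."""
    item_type: str   # e.g. "mediawikieu"
    api_link: str    # e.g. "archiveteam.org/api.php"
    url_prefix: str  # e.g. "archiveteam.org/index.php?title="


def parse_item_name(item_name: str) -> WikiItem:
    # Split on the first two colons only, so any later colon stays
    # inside the URL prefix.
    parts = item_name.split(":", 2)
    if len(parts) != 3:
        raise ValueError(f"expected type:apilink:urlprefix, got {item_name!r}")
    return WikiItem(*parts)


def page_url(item: WikiItem, title: str, scheme: str = "http") -> str:
    # Assumption: spaces become underscores, as MediaWiki does for titles;
    # the scheme is not part of the item name, so it is passed in.
    return f"{scheme}://{item.url_prefix}{quote(title.replace(' ', '_'))}"


item = parse_item_name(
    "mediawikieu:archiveteam.org/api.php:archiveteam.org/index.php?title="
)
print(item.api_link)
print(page_url(item, "Main Page"))
```

A plain colon split would misparse an API link that includes a port (for example `example.org:8080/api.php`), so a real implementation may need a different separator or escaping.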
18:42
🔗
|
arkiver |
I just made an update, the wikis project is now ready for mediawikieu items |
18:42
🔗
|
arkiver |
So we can queue those up :) |
19:50
🔗
|
|
VADemon has joined #wikiteam |
19:52
🔗
|
VADemon |
arkiver, a bug? 302's and repeated index.php's: https://dl.dropbox.com/u/53753604/screenshots/2015.10.26_20-48-53__firefox.png |
20:13
🔗
|
|
Start has joined #wikiteam |
21:16
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
22:47
🔗
|
|
Muad-Dib has quit IRC (Ping timeout: 252 seconds) |
23:00
🔗
|
|
Muad-Dib has joined #wikiteam |
23:10
🔗
|
arkiver |
VADemon: I'll have a look at it, thanks! |
23:10
🔗
|
arkiver |
The scripts are still kind of in the testing phase |
23:11
🔗
|
arkiver |
VADemon: I just had a look at it and I think I can't fix that |
23:20
🔗
|
arkiver |
VADemon: I see it's possible to have a loop though, will fix that |
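Editor's sketch of one way to guard against the redirect loop mentioned above. The log does not show how the scripts were actually fixed, so this is an assumption: `follow_redirects`, `RedirectLoopError` and the `redirects` mapping are hypothetical names, and the mapping stands in for real 302 responses. A visited set catches exact cycles, and a hop cap catches chains where each URL differs slightly (such as a path that keeps gaining another `index.php`).

```python
class RedirectLoopError(Exception):
    """Raised when a redirect chain cycles or grows too long."""


def follow_redirects(start, redirects, max_hops=20):
    # `redirects` maps a URL to the URL it redirects to; a URL that is
    # absent from the mapping is treated as a final, non-redirecting page.
    chain = [start]
    visited = {start}
    current = start
    while current in redirects:
        # Cap the chain length: this also stops "loops" where every URL
        # is new, which a visited set alone would never detect.
        if len(chain) > max_hops:
            raise RedirectLoopError(
                f"more than {max_hops} redirects from {start}"
            )
        current = redirects[current]
        if current in visited:
            raise RedirectLoopError(f"redirect cycle at {current}")
        visited.add(current)
        chain.append(current)
    return chain


print(follow_redirects("a", {"a": "b", "b": "c"}))
```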
23:27
🔗
|
arkiver |
VADemon: scripts are updated. |
23:38
🔗
|
arkiver |
1878 new mediawikieu items queued |
23:45
🔗
|
arkiver |
scripts are updated again. |
23:45
🔗
|
|
Start has joined #wikiteam |