Time |
Nickname |
Message |
02:22
🔗
|
|
SmileyG has quit IRC (Remote host closed the connection) |
02:22
🔗
|
|
Smiley has joined #internetarchive |
03:05
🔗
|
JAA |
VoynichCr: Define "doesn't work"? Is there an error or an example item/task? Also, "Language" or "language"? The metadata docs list it as lowercase (as all official metadata fields): https://archive.org/services/docs/api/metadata-schema/index.html#language |
03:22
🔗
|
|
Jake has joined #internetarchive |
03:23
🔗
|
|
qw3rty__ has joined #internetarchive |
03:31
🔗
|
|
qw3rty_ has quit IRC (Read error: Operation timed out) |
07:06
🔗
|
|
sivoais has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
Somebody2 has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
dtm has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
Ryz has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
legoktm has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
Jonimoose has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
edsu_ has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
Stiletto has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
fredgido has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
Jake has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
systwi has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
ats has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
dxrt_ has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
namespace has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
jrwr has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
sknebel has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
Larsenv has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
bugZPDX has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
kiska1825 has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
yano has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
jodizzle has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
JAA has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
simon816 has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
kiska has quit IRC (se.hub hub.efnet.us) |
07:06
🔗
|
|
swebb has quit IRC (se.hub hub.efnet.us) |
07:07
🔗
|
|
Jake has joined #internetarchive |
07:07
🔗
|
|
Stiletto has joined #internetarchive |
07:07
🔗
|
|
namespace has joined #internetarchive |
07:07
🔗
|
|
jrwr has joined #internetarchive |
07:07
🔗
|
|
sivoais has joined #internetarchive |
07:07
🔗
|
|
kiska1825 has joined #internetarchive |
07:07
🔗
|
|
Ryz has joined #internetarchive |
07:07
🔗
|
|
sknebel has joined #internetarchive |
07:07
🔗
|
|
fredgido has joined #internetarchive |
07:07
🔗
|
|
edsu_ has joined #internetarchive |
07:07
🔗
|
|
Jonimoose has joined #internetarchive |
07:07
🔗
|
|
swebb has joined #internetarchive |
07:07
🔗
|
|
simon816 has joined #internetarchive |
07:07
🔗
|
|
kiska has joined #internetarchive |
07:07
🔗
|
|
JAA has joined #internetarchive |
07:07
🔗
|
|
jodizzle has joined #internetarchive |
07:07
🔗
|
|
legoktm has joined #internetarchive |
07:07
🔗
|
|
yano has joined #internetarchive |
07:07
🔗
|
|
Somebody2 has joined #internetarchive |
07:07
🔗
|
|
dtm has joined #internetarchive |
07:07
🔗
|
|
dxrt_ has joined #internetarchive |
07:07
🔗
|
|
bugZPDX has joined #internetarchive |
07:07
🔗
|
|
ats has joined #internetarchive |
07:07
🔗
|
|
systwi has joined #internetarchive |
07:07
🔗
|
|
Larsenv has joined #internetarchive |
07:07
🔗
|
|
ny.us.hub sets mode: +o JAA |
07:07
🔗
|
|
AlsoJAA sets mode: +o JAA |
07:08
🔗
|
|
JAA sets mode: +o AlsoJAA |
07:25
🔗
|
VoynichCr |
JAA: yeah I wanted the official metadata field |
07:26
🔗
|
VoynichCr |
I asked about the command line, but now I am using the library: r = internetarchive.modify_metadata(itemid, metadata=dict(language='English')) |
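For readers following along, the call quoted above can be wrapped in a small helper. This is only a sketch: `build_language_metadata`, `set_language`, and the injectable `modify` parameter are hypothetical names introduced here so the logic can be tried offline. The only library call used is `internetarchive.modify_metadata(itemid, metadata=...)`, as quoted in the message.

```python
def build_language_metadata(language):
    """Return the metadata dict for the lowercase `language` field."""
    return {"language": language}


def set_language(itemid, language, modify=None):
    """Set the language field on one item.

    `modify` is a hypothetical injection point for offline testing; by
    default it is internetarchive.modify_metadata, as in the message above.
    """
    if modify is None:
        # Third-party dependency (pip install internetarchive); imported
        # lazily so the helper can be exercised without it.
        import internetarchive
        modify = internetarchive.modify_metadata
    return modify(itemid, metadata=build_language_metadata(language))
```

Calling `set_language(itemid, 'English')` without `modify` would send a real metadata write, so it needs configured credentials and write access to the item.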
07:27
🔗
|
VoynichCr |
It works fine, I have added language metadata to over 10,000 ZIM files of Wikipedia, Wiktionary... |
07:28
🔗
|
VoynichCr |
Check the left sidebar for langs https://archive.org/search.php?query=subject%3A%22kiwix%22%20AND%20subject%3A%22zim%22 |
07:28
🔗
|
VoynichCr |
Over 250 different languages |
07:29
🔗
|
VoynichCr |
Some languages has only 2, 1 or zero items, now there is content on IA in those 250+ languages |
07:29
🔗
|
VoynichCr |
Some languages had* |
10:18
🔗
|
Nemo_bis |
Thanks VoynichCr , nice. |
10:19
🔗
|
Nemo_bis |
We also still have to fix uploader.py which doesn't update existing metadata fields (in particular lastupdateddate), I think |
10:37
🔗
|
VoynichCr |
Nemo_bis: does it show any error? |
12:32
🔗
|
Nemo_bis |
VoynichCr: no; it's the usual old bug report |
14:43
🔗
|
JAA |
VoynichCr: Nice! So it all worked now but only through the library? |
14:44
🔗
|
|
vitzli has joined #internetarchive |
14:48
🔗
|
|
vitzli has quit IRC (Client Quit) |
15:06
🔗
|
VoynichCr |
JAA: yeah, I still don't know what the command-line parameter for the Language field is; anyway, I am using the Python library, which works better for me |
15:07
🔗
|
JAA |
Hmm, I don't see anything special in the code for that field. |
15:29
🔗
|
|
systwi has quit IRC (Read error: Operation timed out) |
15:41
🔗
|
|
systwi has joined #internetarchive |
16:06
🔗
|
|
fredgido_ has joined #internetarchive |
16:13
🔗
|
|
fredgido has quit IRC (Ping timeout: 622 seconds) |
16:28
🔗
|
VoynichCr |
Another little idea is uploading all the distros in the DistroWatch Torrent Archive https://distrowatch.com/dwres.php?resource=bittorrent&sortorder=date |
16:28
🔗
|
VoynichCr |
feeding every .torrent into an individual item, and letting IA BitTorrent do the magic |
16:30
🔗
|
VoynichCr |
I think there are almost 1,000 distros there |
17:32
🔗
|
|
systwi_ has joined #internetarchive |
17:38
🔗
|
|
systwi has quit IRC (Ping timeout: 622 seconds) |
17:44
🔗
|
JAA |
error uploading (file): Please reduce your request rate. - total_tasks_queued exceeds global_limit |
17:44
🔗
|
JAA |
IA busy today, huh? |
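One common way to cope with a "Please reduce your request rate" response is to retry with exponential backoff. The sketch below is not taken from the log: `is_rate_limited`, `retry_with_backoff`, and all parameter names and defaults are hypothetical, and matching on the message text is an assumption, since the log does not say which exception type the upload tooling raises.

```python
import time


def is_rate_limited(error):
    """Heuristic (an assumption): match the message text quoted in the log."""
    return "reduce your request rate" in str(error).lower()


def retry_with_backoff(action, attempts=5, base_delay=60.0, sleep=time.sleep):
    """Call `action()` and retry with exponential backoff on rate-limit errors.

    `sleep` is injectable so the loop can be exercised without waiting.
    """
    for attempt in range(attempts):
        try:
            return action()
        except Exception as error:
            # Re-raise anything that is not a rate-limit error, and give up
            # after the final attempt.
            if not is_rate_limited(error) or attempt == attempts - 1:
                raise
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** attempt)
```

Here `action` would be a zero-argument callable wrapping the upload; backing off only helps with transient queue limits, not with an outage like the one discussed in the following messages.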
17:53
🔗
|
JAA |
Looks like things broke around 16:20. |
17:54
🔗
|
JAA |
https://analytics0.archive.org/stats/s3.php?tz=UTC |
18:44
🔗
|
arkiver |
yeah |
18:44
🔗
|
arkiver |
stuff is crashing |
18:44
🔗
|
arkiver |
JAA: should be back soon |
18:55
🔗
|
JAA |
:-) |
19:02
🔗
|
arkiver |
all coming back now |
19:07
🔗
|
JAA |
Yes, the graphs are beginning to look much prettier again. |
19:13
🔗
|
|
JAA sets mode: +o arkiver |
19:14
🔗
|
arkiver |
wooh |
19:14
🔗
|
* |
arkiver has ops |
19:16
🔗
|
JAA |
VoynichCr: I just tried it. `ia metadata test_language_20200707 --modify='language:Zulu'` worked fine: https://archive.org/details/test_language_20200707 |
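The command JAA ran can also be driven from a script. A minimal sketch, assuming the `ia` tool is installed and configured; `build_ia_metadata_command` and `run_ia_metadata` are hypothetical helper names, and the only CLI form used is the `ia metadata <identifier> --modify=<field>:<value>` shape shown in the message above.

```python
import subprocess


def build_ia_metadata_command(identifier, field, value):
    """Build the argv list for `ia metadata <identifier> --modify=<field>:<value>`."""
    return ["ia", "metadata", identifier, f"--modify={field}:{value}"]


def run_ia_metadata(identifier, field, value):
    """Run the command; raises CalledProcessError if `ia` exits non-zero."""
    command = build_ia_metadata_command(identifier, field, value)
    return subprocess.run(command, check=True)
```

Passing the arguments as a list avoids shell quoting issues for values that contain spaces.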
19:28
🔗
|
VoynichCr |
interesting... I used Language |
19:30
🔗
|
JAA |
That's why I mentioned that it's lowercase in the docs. |
19:35
🔗
|
VoynichCr |
dumb Internet Archive, it's case-sensitive for tags and case-insensitive for URLs in WBM |
19:38
🔗
|
arkiver |
metadata is case sensitive |
19:38
🔗
|
arkiver |
WBM is something different entirely |
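Because metadata keys are case sensitive, a script can guard against the `Language` vs. `language` mix-up before sending anything. This is a sketch only: `normalize_metadata_keys` is a hypothetical helper, and lowercasing every key is an assumption that fits the official fields JAA mentions but may not suit custom fields that are mixed-case on purpose.

```python
def normalize_metadata_keys(metadata):
    """Return a copy of `metadata` with every key lowercased.

    Raises ValueError if two keys differ only by case, since silently
    keeping one of them would hide a mistake.
    """
    normalized = {}
    for key, value in metadata.items():
        lower = key.lower()
        if lower in normalized:
            raise ValueError(f"keys differ only by case: {key!r}")
        normalized[lower] = value
    return normalized
```

For example, `normalize_metadata_keys({'Language': 'Zulu'})` yields a dict keyed by `language`, matching the field name in the metadata docs.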
21:52
🔗
|
arkiver |
JAA: and things have become a lot faster now for processing tasks |
21:53
🔗
|
JAA |
Nice! |
22:50
🔗
|
|
jrwr has quit IRC (Ping timeout: 260 seconds) |
22:59
🔗
|
|
fallenoak has quit IRC (Ping timeout: 1230 seconds) |
22:59
🔗
|
|
justcool3 has quit IRC (Ping timeout: 1230 seconds) |
23:00
🔗
|
|
t3 has quit IRC (Ping timeout: 1230 seconds) |
23:00
🔗
|
|
amelia386 has quit IRC (Ping timeout: 1230 seconds) |
23:00
🔗
|
|
diggan has quit IRC (Ping timeout: 1230 seconds) |
23:01
🔗
|
|
hook54321 has quit IRC (Ping timeout: 1230 seconds) |
23:02
🔗
|
|
lenary has quit IRC (Read error: Connection timed out) |
23:03
🔗
|
|
tech234a has quit IRC (Ping timeout: 1230 seconds) |
23:03
🔗
|
|
xit has quit IRC (Ping timeout: 1230 seconds) |
23:03
🔗
|
|
Ctrl-S___ has quit IRC (Read error: Connection timed out) |
23:06
🔗
|
|
HCross has quit IRC (Read error: Connection timed out) |
23:06
🔗
|
|
Kaz has quit IRC (Read error: Connection timed out) |