#internetarchive 2018-04-17,Tue


Who | What | When
*** zhongfu has quit IRC (Remote host closed the connection)
zhongfu has joined #internetarchive
[00:08]
zhongfu has quit IRC (Remote host closed the connection) [00:17]
..... (idle for 24mn)
zhongfu has joined #internetarchive [00:41]
.................... (idle for 1h38mn)
Stilett0- is now known as Stiletto [02:19]
.................. (idle for 1h25mn)
qw3rty113 has joined #internetarchive [03:44]
qw3rty112 has quit IRC (Ping timeout: 600 seconds)
odemg has quit IRC (Read error: Operation timed out)
[03:50]
odemg has joined #internetarchive [04:05]
........................................................................................................................................................... (idle for 12h53mn)
<ivan> does anyone know the exact set of characters you can't use in an archive.org item? (if you want a working directory listing)
so far I have figured out that "+" is incorrectly converted to " " probably by using the wrong php quoting function, and that you can't start or end filenames with whitespace because of how browsers interpret hrefs
[16:58]
.... (idle for 15mn)
ah # and % are also bad because they aren't encoded [17:14]
.... (idle for 19mn)
also ? and \
I accidentally discovered that you can add a /? to the end of a /download/ URL to get redirected from the broken PHP directory lister to a working nginx directory lister
[17:33]
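
The observations above can be sketched as a small check. This is only an illustration built from what ivan reports in this log: the character set, the whitespace rule, and the "/?" behaviour are taken from the chat, not from any archive.org documentation, and the function names are hypothetical.

```python
from urllib.parse import quote

# Characters reported in the chat as breaking the directory listing.
# Assumption: this set is only what was found so far, not a complete list.
PROBLEM_CHARS = set("+#%?\\")


def filename_problems(name: str) -> list[str]:
    """Return the reasons a filename may break directory listing links."""
    problems = []
    found = sorted(PROBLEM_CHARS.intersection(name))
    if found:
        problems.append("contains " + " ".join(found))
    if name != name.strip():
        problems.append("starts or ends with whitespace")
    return problems


def safe_href(name: str) -> str:
    """Percent-encode a filename for use as a single URL path segment."""
    # safe="" also encodes "/", so the result is always one path segment.
    return quote(name, safe="")


def listing_url(download_url: str) -> str:
    """Append "/?" to a /download/ URL, per the redirect described in the chat."""
    return download_url.rstrip("/") + "/?"
```

safe_href shows the kind of escaping a lister would need to apply so that "+", "#", "%" and spaces survive in an href; whether the "/?" redirect still works is not something this sketch can confirm.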
........... (idle for 51mn)
*** Lord_Nigh has quit IRC (Ping timeout: 252 seconds) [18:26]
Lord_Nigh has joined #internetarchive [18:33]
....... (idle for 32mn)
<JAA> "you can't start or end filenames with whitespace because of how browsers interpret hrefs" Really? Can't you just encode it as %20?
ivan: "An identifier is composed of any unique combination of alphanumeric characters, underscore (_) and dash (-)."
[19:05]
<ivan> JAA: well, of course you can, but IA doesn't apply any sane escaping [19:06]
<JAA> https://internetarchive.readthedocs.io/en/latest/metadata.html#archive-org-identifiers
So basically avoid anything else, I guess.
[19:06]
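
The sentence JAA quotes can be turned into a conservative check. A minimal sketch, assuming the rule is exactly the quoted one (ASCII alphanumerics, underscore and dash); the quote is about identifiers, and applying it to filenames is only JAA's "avoid anything else" suggestion. The function name is hypothetical, and any length or other limits the linked docs may state are not modeled.

```python
import re

# Pattern derived from the quoted sentence only:
# alphanumeric characters, underscore (_) and dash (-).
IDENTIFIER_RE = re.compile(r"[A-Za-z0-9_-]+")


def looks_like_identifier(name: str) -> bool:
    """True if name uses only the characters named in the quoted rule."""
    return IDENTIFIER_RE.fullmatch(name) is not None
```

For example, looks_like_identifier("my_item-01") holds, while a name containing a space or "+" is rejected.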
<ivan> it's got just enough escaping to stop XSS but not to do the right thing [19:06]
<JAA> Ah, right. [19:06]
<ivan> would help if archive.org put their PHP spaghetti on github and took PRs :-) [19:07]
<JAA> Oh yes, I'd love to see their software open-sourced. [19:08]
<ivan> would you? you haven't looked inside yet [19:08]
<JAA> Oh, I'm absolutely sure it's horrible.
I'm a strong supporter of open-source software, so I consider horrible open-source code better than any closed-source code.
And once it's open-source, others can contribute and make it better.
[19:14]
................................ (idle for 2h35mn)
*** Coderjo has quit IRC (Read error: Operation timed out) [21:50]
Coderjo has joined #internetarchive [21:58]
..... (idle for 20mn)
Lord_Nigh has quit IRC (Read error: Operation timed out)
Lord_Nigh has joined #internetarchive
[22:18]
