Time |
Nickname |
Message |
15:42 |
soultcer |
So |
15:43 |
soultcer |
Some shorteners allow adding newlines to the URLs |
15:43 |
soultcer |
This includes at least tinyurl and is.gd, probably most others as well |
15:43 |
soultcer |
Any ideas on how to add those shorturls as well? |
15:44 |
soultcer |
our current data format is "short code"|"long url"\n |
15:45 |
soultcer |
So a newline in the long url breaks our data format. Previously I just threw away URLs with newlines, or removed them if they were at the end of a URL, but there should be a better way |
15:46 |
soultcer |
The easiest solution would be to escape newlines, for example with a backslash. This has the advantage that we don't have to rewrite all files to a new format |
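A minimal sketch of the backslash-escaping idea, not the actual tinyback2 code. It assumes each record is written as `code|url` followed by a newline (reading the quotes in the format description as descriptive rather than literal), and that a newline is written as `\n` and a literal backslash as `\\`. All function names here are hypothetical.

```python
from typing import Tuple


def escape_url(url: str) -> str:
    # Assumed scheme: escape backslashes first so that an escaped
    # newline can never be confused with a literal backslash + "n".
    return url.replace("\\", "\\\\").replace("\n", "\\n")


def unescape_url(escaped: str) -> str:
    out = []
    chars = iter(escaped)
    for ch in chars:
        if ch != "\\":
            out.append(ch)
            continue
        # A backslash always consumes the following character.
        nxt = next(chars, "")
        # "\n" decodes to a newline; any other pair decodes to its
        # second character (so "\\" becomes a single backslash).
        out.append("\n" if nxt == "n" else nxt)
    return "".join(out)


def format_record(code: str, url: str) -> str:
    # One record per line: the escaped URL contains no raw newline.
    return f"{code}|{escape_url(url)}\n"


def parse_record(line: str) -> Tuple[str, str]:
    # Split on the first "|" only, since the URL itself may contain "|".
    code, escaped = line.rstrip("\n").split("|", 1)
    return code, unescape_url(escaped)


record = format_record("abc", "http://example.com/a\nb")
print(repr(record))
print(parse_record(record))
```

Under this scheme, existing records whose URLs contain no backslash parse exactly as before, which matches the goal of not rewriting old files. Old records that do contain a literal backslash would be decoded differently, so they would need checking first.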
15:48 |
soultcer |
If I don't hear anyone complain about the new format, I'll probably add it to tinyback2 (yeah, yet another rewrite :D) |
15:48 |
chronomex |
oh dear. |
15:50 |
soultcer |
The good news is that tinyback2 has Unicode support. Say hello to tinyarro.ws scraping ;-) |