Time |
Nickname |
Message |
02:58
🔗
|
JAA |
HMV has a shortener hmv.co which apparently uses the po.st platform. |
02:58
🔗
|
JAA |
Codes seen on ArchiveBot job b9rmt1xo20l3o3qf34tsoeg92 seem to be 6-char 0-9a-zA-Z. |
02:59
🔗
|
JAA |
Examples: http://hmv.co/N4CKkp http://hmv.co/xy2Cmh http://hmv.co/PndERo |
04:43
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
04:55
🔗
|
|
odemg has joined #urlteam |
05:20
🔗
|
|
coldon2dr has quit IRC (Quit: https://quassel-irc.org - Chat comfortably. Anywhere.) |
05:39
🔗
|
|
w0rmybak has joined #urlteam |
05:39
🔗
|
|
kiskabak has joined #urlteam |
06:55
🔗
|
|
cascode has quit IRC (Quit: cascode) |
09:54
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
12:53
🔗
|
Flashfire |
https://bitly.com/pages/landing/branded-short-domains-powered-by-bitly?bsd=cup.cm |
14:57
🔗
|
Kagee |
that's not a good way to check, as it says the same for any domain |
16:23
🔗
|
|
hook54321 has joined #urlteam |
17:39
🔗
|
|
mtntmnky_ has quit IRC (Remote host closed the connection) |
17:40
🔗
|
|
mtntmnky_ has joined #urlteam |
17:50
🔗
|
|
mtntmnky_ has quit IRC (Remote host closed the connection) |
18:39
🔗
|
|
mtntmnky_ has joined #urlteam |
18:59
🔗
|
t3 |
Is doi.org considered a URL shortener? |
19:08
🔗
|
JAA |
I guess it could be called one, but we won't scan DOIs. Besides the fact that each journal has its own format for the second part of the DOI, that wouldn't finish before the heat death of the universe probably. |
19:17
🔗
|
|
mtntmnky_ has quit IRC (Remote host closed the connection) |
19:22
🔗
|
|
mtntmnky_ has joined #urlteam |
19:29
🔗
|
t3 |
JAA: How about http://shortdoi.org/ |
19:35
🔗
|
t3 |
It is similar to a traditional URL shortener. |
19:37
🔗
|
JAA |
Interesting, hadn't seen that before. |
19:38
🔗
|
JAA |
It's not as much at risk as the traditional URL shorteners though since it's run by the same organisation as the DOI management itself. |
19:39
🔗
|
JAA |
At least I think of it like that. |
19:42
🔗
|
t3 |
JAA: But still... It can be archived. |
19:45
🔗
|
Somebody2 |
and it looks pretty short... I'd be open to grabbing it. |
19:46
🔗
|
JAA |
Yeah sure, we can do it. |
19:47
🔗
|
Somebody2 |
setting it up now |
19:48
🔗
|
JAA |
301 on existing short DOI, 404 on inexistent. |
19:48
🔗
|
Somebody2 |
whoops -- or not |
19:48
🔗
|
Somebody2 |
we're currently exporting |
19:48
🔗
|
Somebody2 |
the important part is that the URL is: http://dx.doi.org/10/ |
19:49
🔗
|
Somebody2 |
not shortdoi.org |
19:49
🔗
|
t3 |
Yay! |
19:49
🔗
|
JAA |
https://doi.org/10/x |
19:50
🔗
|
|
hook54321 has quit IRC () |
19:50
🔗
|
|
hook54321 has joined #urlteam |
19:51
🔗
|
Somebody2 |
hm, dx.doi.org is what I get redirected to |
19:51
🔗
|
Somebody2 |
not doi.org |
19:52
🔗
|
Somebody2 |
but doi.org works too, and it's slightly shorter |
20:01
🔗
|
JAA |
https://doi.org/10/aabce redirects to https://doi.org/10.1093/ref:odnb/30479 for me. |
20:01
🔗
|
JAA |
And with dx.doi.org it also redirects to dx.doi.org. |
20:06
🔗
|
Somebody2 |
we *ALREADY* did this project |
20:06
🔗
|
Somebody2 |
https://tracker.archiveteam.org:1338/api/project_settings?name=shortdoi-org |
20:06
🔗
|
Somebody2 |
although we seem to have missed uppercase letters, oddly |
20:07
🔗
|
Somebody2 |
https://ia801202.us.archive.org/zipview.php?zip=/17/items/urlteam_2016-07-02-18-17-02/shortdoi-org.2016-07-02-18-17-02.zip |
20:08
🔗
|
Somebody2 |
we can run it again, I suppose... |
20:10
🔗
|
Somebody2 |
I'll start that now |
20:10
🔗
|
t3 |
Let's just fix it first. |
20:10
🔗
|
Somebody2 |
yep |
20:12
🔗
|
Somebody2 |
ok, expanded the alphabet, and restarted from the beginning |
20:14
🔗
|
t3 |
Somebody2: What does https://ia801202.us.archive.org/zipview.php?zip=/17/items/urlteam_2016-07-02-18-17-02/shortdoi-org.2016-07-02-18-17-02.zip contain? |
20:14
🔗
|
t3 |
I don't know how to open the files. |
20:15
🔗
|
Somebody2 |
That's the previous run's results |
20:15
🔗
|
Somebody2 |
the zip file contains the results, as xz files (another compression format) |
20:15
🔗
|
Somebody2 |
separated by length |
20:15
🔗
|
Somebody2 |
it's a bizare format, but it's what we have |