Time |
Nickname |
Message |
00:26
🔗
|
|
JesseW has joined #urlteam |
01:01
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
01:22
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
01:33
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
01:33
🔗
|
|
Start has joined #urlteam |
02:24
🔗
|
|
marvinw is now known as ivan` |
02:58
🔗
|
|
JesseW has joined #urlteam |
03:31
🔗
|
JesseW |
ur1-ca seems to be freaking out. looking into it |
03:32
🔗
|
JesseW |
ur1-ca hasn't returned results since yesterday -- so I'm just going to turn it off |
03:34
🔗
|
JesseW |
the server seems *very* speedy -- did you do something to improve it, xmc (or chfoo)? |
03:36
🔗
|
JesseW |
We have now finished scanning all of the 5 character entries for da-gd . Stopping it for now until phuzion clarifies if it includes 6 character ones (we haven't found any so far) |
04:30
🔗
|
chfoo |
oh, yeah. fixed an overlooked sqlite setting |
04:49
🔗
|
JesseW |
today? |
04:49
🔗
|
JesseW |
I know you did so a little while ago, but it seemed noticeably faster than yesterday for me |
05:09
🔗
|
chfoo |
oh, nope, i didn't touch anything today |
05:09
🔗
|
|
phuzion has quit IRC (Read error: Operation timed out) |
05:10
🔗
|
JesseW |
Hm, IDK then. |
05:25
🔗
|
|
DoomTay has joined #urlteam |
05:28
🔗
|
DoomTay |
So if I understand correctly, the project has a collection of shortened URLs and their unshortened counterparts? |
05:28
🔗
|
JesseW |
yep, many GBs worth |
05:28
🔗
|
DoomTay |
Do you think that data could be compiled into something machine-accessible so, say, a browser extension could be made to make use of it? |
05:29
🔗
|
JesseW |
it's already machine-accessible, but not efficiently random-accessible (due to compression) |
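
To illustrate the point about random access: with a compressed dump, finding one shortcode means decompressing and scanning from the start. The sketch below assumes xz-compressed text with one `shortcode|url` pair per line; that layout and the function names are assumptions for illustration, not a confirmed description of the archives.

```python
import lzma


def find_shortcode(lines, shortcode, sep="|"):
    """Linear scan over decompressed lines; returns the URL or None.

    Assumes each line looks like 'shortcode|url' (hypothetical layout).
    """
    for line in lines:
        code, found, url = line.rstrip("\n").partition(sep)
        if found and code == shortcode:
            return url
    return None


def find_in_xz(path_or_file, shortcode):
    """Stream-decompress an .xz dump and scan it for one shortcode.

    Every lookup costs a pass over the file, since xz offers no cheap
    way to seek to an arbitrary line.
    """
    with lzma.open(path_or_file, "rt", encoding="utf-8", errors="replace") as f:
        return find_shortcode(f, shortcode)
```

Each lookup is O(size of the dump), which is why a lookup service would likely want the data converted into an indexed format first.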
05:29
🔗
|
DoomTay |
Dang |
05:29
🔗
|
JesseW |
look over http://archiveteam.org/index.php?title=URLTeam#Archives to see how it is currently stored |
05:30
🔗
|
JesseW |
it would be pretty straightforward to convert it into other formats -- just takes someone bothering to do it |
05:31
🔗
|
|
phuzion has joined #urlteam |
05:32
🔗
|
JesseW |
If I were to make a service to use the data, I'd lean towards excluding data from still-existing shortening services, to avoid unnecessarily irritating them (thereby maybe making it harder to scrape more from them). |
05:32
🔗
|
JesseW |
But we do have data from a number of dead shortening services, and I see *no* reason not to create a service that will translate those shortcodes on demand. |
05:33
🔗
|
DoomTay |
Agreed on both counts |
05:36
🔗
|
JesseW |
It would require someone willing to pay for the (probably limited) hosting costs |
05:37
🔗
|
DoomTay |
Hmm |
05:41
🔗
|
DoomTay |
I was about to suggest whoever's hosting urlte.am then I saw that that's actually a tracker under archive.org |
05:41
🔗
|
DoomTay |
Er, archiveteam.org |
05:42
🔗
|
DoomTay |
Speaking of which, what's up with http://tracker.archiveteam.org:1337/calculator ? |
05:42
🔗
|
JesseW |
xmc currently hosts the box that the tracker runs on (thanks xmc!) |
05:43
🔗
|
JesseW |
DoomTay: it's not particularly useful to random users, but it's very helpful for tracker admins, as it lets us convert shortcodes (like a4nG) into sequence numbers (which various places in the admin interface expect). |
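
A rough sketch of the kind of conversion the calculator performs, treating a shortcode as a number written in base N over some alphabet. The alphabet and its ordering below are assumptions; each shortener (and the tracker's configuration for it) may use a different one.

```python
# Hypothetical alphabet: digits, then lowercase, then uppercase.
ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ"


def shortcode_to_seq(code, alphabet=ALPHABET):
    """Interpret a shortcode as a base-len(alphabet) number."""
    base = len(alphabet)
    index = {ch: i for i, ch in enumerate(alphabet)}
    n = 0
    for ch in code:
        n = n * base + index[ch]  # KeyError if ch is not in the alphabet
    return n


def seq_to_shortcode(n, alphabet=ALPHABET):
    """Inverse of shortcode_to_seq for codes without leading 'zero' characters."""
    if n < 0:
        raise ValueError("sequence number must be non-negative")
    if n == 0:
        return alphabet[0]
    base = len(alphabet)
    chars = []
    while n:
        n, r = divmod(n, base)
        chars.append(alphabet[r])
    return "".join(reversed(chars))
```

With a different alphabet order the same shortcode maps to a different sequence number, so the alphabet has to match the shortener being scanned.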
05:44
🔗
|
DoomTay |
Ah |
05:44
🔗
|
JesseW |
it doesn't really need to be made public, but it's harmless for it to be, so it is |
05:50
🔗
|
|
DoomTay has quit IRC (Quit: Page closed) |
06:59
🔗
|
|
dashcloud has quit IRC (Ping timeout: 250 seconds) |
07:05
🔗
|
|
dashcloud has joined #urlteam |
07:06
🔗
|
|
svchfoo1 sets mode: +o dashcloud |
07:06
🔗
|
|
svchfoo3 sets mode: +o dashcloud |
07:19
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
07:39
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
07:39
🔗
|
|
dashcloud has joined #urlteam |
07:40
🔗
|
|
svchfoo1 sets mode: +o dashcloud |
07:40
🔗
|
|
svchfoo3 sets mode: +o dashcloud |
08:18
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
08:21
🔗
|
|
dashcloud has joined #urlteam |
08:21
🔗
|
|
svchfoo1 sets mode: +o dashcloud |
08:21
🔗
|
|
svchfoo3 sets mode: +o dashcloud |
08:33
🔗
|
|
hawc145 has joined #urlteam |
08:35
🔗
|
|
zhongfu has quit IRC (Quit: No Ping reply in 180 seconds.) |
08:37
🔗
|
|
chfoo has quit IRC (Read error: Operation timed out) |
08:39
🔗
|
|
HCross has quit IRC (Ping timeout: 370 seconds) |
08:40
🔗
|
|
luckcolor has quit IRC (Read error: Operation timed out) |
08:42
🔗
|
|
svchfoo1 has quit IRC (Ping timeout: 370 seconds) |
08:44
🔗
|
|
chfoo has joined #urlteam |
08:45
🔗
|
|
zhongfu has joined #urlteam |
08:45
🔗
|
|
svchfoo3 sets mode: +o chfoo |
08:46
🔗
|
|
luckcolor has joined #urlteam |
08:47
🔗
|
|
svchfoo1 has joined #urlteam |
08:47
🔗
|
|
svchfoo3 sets mode: +o svchfoo1 |
09:42
🔗
|
|
WinterFox has joined #urlteam |
10:22
🔗
|
|
hawc145 is now known as HCross |
11:38
🔗
|
|
VADemon has joined #urlteam |
12:34
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
12:38
🔗
|
|
dashcloud has joined #urlteam |
12:38
🔗
|
|
svchfoo1 sets mode: +o dashcloud |
13:12
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
13:45
🔗
|
|
VADemon has joined #urlteam |
15:08
🔗
|
|
JesseW has joined #urlteam |
15:29
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
15:42
🔗
|
|
JW_work has joined #urlteam |
15:57
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
17:12
🔗
|
xmc |
JesseW: the other admins did some serious speedups on the disk, so ... yes |
17:13
🔗
|
xmc |
& i would love to see e.g. tr.im.urlte.am/aw3at redirect to the right place |
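
A minimal sketch of how such a redirect lookup could work, assuming the shortener name is carried in the subdomain and the mappings live in a SQLite table. The `links` table, its column names, and the `.urlte.am` suffix handling are all hypothetical.

```python
import sqlite3
from urllib.parse import urlsplit

SUFFIX = ".urlte.am"  # assumed host suffix for the redirect service


def parse_request(url):
    """Split 'tr.im.urlte.am/aw3at' into ('tr.im', 'aw3at'), or None."""
    parts = urlsplit(url if "//" in url else "//" + url)
    host = (parts.hostname or "").lower()
    if not host.endswith(SUFFIX):
        return None
    shortener = host[: -len(SUFFIX)]
    code = parts.path.lstrip("/")  # shortcodes are case-sensitive, keep as-is
    if not shortener or not code:
        return None
    return shortener, code


def resolve(conn, url):
    """Look up the long URL for a request, or return None if unknown."""
    parsed = parse_request(url)
    if parsed is None:
        return None
    row = conn.execute(
        "SELECT target FROM links WHERE shortener = ? AND code = ?", parsed
    ).fetchone()
    return row[0] if row else None
```

A web front end would then answer with an HTTP redirect to the resolved URL, or a 404 when `resolve` returns None.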
17:15
🔗
|
xmc |
i would probably have done it but i don't really have the storage budget for ten grillion shortlinks to be on hot storage |
17:19
🔗
|
JW_work |
ah, cool — good to know |
17:23
🔗
|
JW_work |
OK, well, I think a good next step would be to identify which of the shorteners we have data for are now dead, and how much (uncompressed) data that comes to. I'll see about doing that. |
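
One way to measure the uncompressed size without writing the expanded data to disk is to stream-decompress and count bytes. This sketch assumes xz-compressed dumps; the function name is made up for illustration.

```python
import lzma


def uncompressed_size(path_or_file, chunk_size=1 << 20):
    """Return the number of bytes an .xz file expands to, reading in chunks."""
    total = 0
    with lzma.open(path_or_file, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    return total
```

Summing this over the dumps for each dead shortener would give the hot-storage figure being discussed.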
17:24
🔗
|
|
DoomTay has joined #urlteam |
18:18
🔗
|
|
DoomTay has left |
23:35
🔗
|
|
WinterFox has joined #urlteam |