| Time |
Nickname |
Message |
|
01:12
🔗
|
|
phuzion has quit IRC (Remote host closed the connection) |
|
01:23
🔗
|
|
JesseW has joined #urlteam |
|
02:28
🔗
|
|
phuzion has joined #urlteam |
|
03:18
🔗
|
JesseW |
Pausing zurl-ws ; it hasn't generated results since the last export |
|
04:05
🔗
|
JesseW |
p-ly is still generating results ... albeit VERY SLOWLY |
|
04:05
🔗
|
JesseW |
or, to put it another way, I think we're grabbing custom URLs at this point |
|
04:05
🔗
|
JesseW |
yatuc-com is done |
|
04:08
🔗
|
JesseW |
grumble. Apparently we had grabbed yatuc before, but I didn't see it in the list :-( |
|
04:09
🔗
|
bwn |
oof |
|
04:09
🔗
|
JesseW |
well, it was back in 2014 |
|
04:09
🔗
|
JesseW |
and the name didn't follow the convention |
|
04:55
🔗
|
bwn |
is the terror of tinytown scraper able to extract elements from the returned html? |
|
04:55
🔗
|
JesseW |
yep, via regex |
|
04:56
🔗
|
JesseW |
the default scraper, I mean. Custom code can do pretty much anything -- but it's more of a hassle to write |
|
05:00
🔗
|
bwn |
ah, i was looking at coinurl.com (cur.lv): the redirect page that shows the interstitial ads uses a bit of javascript to grab the ad, and the destination url is in <a .. id="skip-ad" |
|
05:00
🔗
|
bwn |
http://cur.lv/ypc7w |
|
05:00
🔗
|
JesseW |
nice, we should be able to handle that |
|
05:00
🔗
|
JesseW |
although they will probably be somewhat vigilant about scrapers |
|
05:10
🔗
|
bwn |
added a couple notes to that one on the wiki re the js |
|
05:11
🔗
|
JesseW |
nice, thanks |
|
05:47
🔗
|
bwn |
fav.me (deviantart) and flip.it look pretty heavily used |
|
06:10
🔗
|
JesseW |
hm |
|
06:35
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
|
06:39
🔗
|
bwn |
JesseW: fav.me and flip.it should hopefully have enough info now, let me know if you need more |
|
08:41
🔗
|
|
WinterFox has joined #urlteam |
|
10:52
🔗
|
|
SilSte has quit IRC (Remote host closed the connection) |
|
11:57
🔗
|
|
SilSte has joined #urlteam |
|
12:28
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
|
13:01
🔗
|
|
phuzion has quit IRC (Quit: Bye) |
|
13:02
🔗
|
|
phuzion has joined #urlteam |
|
13:11
🔗
|
|
phuzion has quit IRC (Quit: Bye) |
|
13:13
🔗
|
|
phuzion has joined #urlteam |
|
16:11
🔗
|
|
JesseW has joined #urlteam |
|
16:19
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
|
16:30
🔗
|
|
JesseW has joined #urlteam |
|
16:38
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
|
17:02
🔗
|
|
JW_work has joined #urlteam |
|
17:03
🔗
|
JW_work |
bwn: OK, will look when I get home tonight. |
|
23:08
🔗
|
|
JW_work has quit IRC (Read error: Operation timed out) |
|
23:18
🔗
|
|
JW_work has joined #urlteam |
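
The 04:55 to 05:00 exchange above mentions that the default scraper can pull values out of returned HTML via regex, and that the cur.lv interstitial page carries the destination URL in an `<a .. id="skip-ad"` element. A minimal sketch of that kind of extraction follows. It is not the terroroftinytown configuration or API: the pattern, the `extract_destination` helper, and the sample markup are all assumptions made for illustration, and the real page may differ (for example if the link is filled in by JavaScript after load).

```python
import re
from typing import Optional

# Hypothetical pattern (not taken from the project): find an <a> tag whose
# id attribute is "skip-ad" and capture its href, whatever the attribute order.
SKIP_AD_RE = re.compile(
    r'<a\b(?=[^>]*\bid=["\']skip-ad["\'])[^>]*\bhref=["\']([^"\']+)["\']',
    re.IGNORECASE,
)


def extract_destination(html: str) -> Optional[str]:
    """Return the href of the skip-ad link, or None if no such link is found."""
    match = SKIP_AD_RE.search(html)
    return match.group(1) if match else None


# Made-up sample markup; the real interstitial page was not inspected here.
SAMPLE_HTML = (
    '<div><a class="btn" id="skip-ad" '
    'href="http://example.com/target">Skip ad</a></div>'
)

if __name__ == "__main__":
    print(extract_destination(SAMPLE_HTML))
```

The lookahead keeps the match inside a single tag (it cannot cross a `>`), so an unrelated link earlier on the page is not picked up by mistake. A regex like this is brittle against markup changes, which is one reason custom scraper code may still be needed for some shorteners.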