Time |
Nickname |
Message |
01:12
🔗
|
|
phuzion has quit IRC (Remote host closed the connection) |
01:23
🔗
|
|
JesseW has joined #urlteam |
02:28
🔗
|
|
phuzion has joined #urlteam |
03:18
🔗
|
JesseW |
Pausing zurl-ws ; it hasn't generated results since the last export |
04:05
🔗
|
JesseW |
p-ly is still generating results ... albeit VERY SLOWLY |
04:05
🔗
|
JesseW |
or, to put it another way, I think we're grabbing custom URLs at this point |
04:05
🔗
|
JesseW |
yatuc-com is done |
04:08
🔗
|
JesseW |
grumble. Apparently we had grabbed yatuc before, but I didn't see it in the list :-( |
04:09
🔗
|
bwn |
oof |
04:09
🔗
|
JesseW |
well, it was back in 2014 |
04:09
🔗
|
JesseW |
and the name didn't follow the convention |
04:55
🔗
|
bwn |
is the terror of tinytown scraper able to extract elements from the returned html? |
04:55
🔗
|
JesseW |
yep, via regex |
04:56
🔗
|
JesseW |
the default scraper, I mean. Custom code can do pretty much anything -- but it's more of a hassle to write |
05:00
🔗
|
bwn |
ah, i was looking at coinurl.com(cur.lv), it the redirect page that shows the interstitial ads uses a bit of javascript to grab the ad, and the destination url is in <a .. id="skip-ad" |
05:00
🔗
|
bwn |
http://cur.lv/ypc7w |
05:00
🔗
|
JesseW |
nice, we should be able to handle that |
05:00
🔗
|
JesseW |
although they will probably be somewhat vigilant about scrapers |
05:10
🔗
|
bwn |
added a couple notes to that one on the wiki re the js |
05:11
🔗
|
JesseW |
nice, thanks |
05:47
🔗
|
bwn |
fav.me (deviantart) and flip.it look pretty heavily used |
06:10
🔗
|
JesseW |
hm |
06:35
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
06:39
🔗
|
bwn |
JesseW: fav.me and flip.it should hopefully have enough info now, let me know if you need more |
08:41
🔗
|
|
WinterFox has joined #urlteam |
10:52
🔗
|
|
SilSte has quit IRC (Remote host closed the connection) |
11:57
🔗
|
|
SilSte has joined #urlteam |
12:28
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
13:01
🔗
|
|
phuzion has quit IRC (Quit: Bye) |
13:02
🔗
|
|
phuzion has joined #urlteam |
13:11
🔗
|
|
phuzion has quit IRC (Quit: Bye) |
13:13
🔗
|
|
phuzion has joined #urlteam |
16:11
🔗
|
|
JesseW has joined #urlteam |
16:19
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
16:30
🔗
|
|
JesseW has joined #urlteam |
16:38
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
17:02
🔗
|
|
JW_work has joined #urlteam |
17:03
🔗
|
JW_work |
bwn: OK, will look when I get home tonight. |
23:08
🔗
|
|
JW_work has quit IRC (Read error: Operation timed out) |
23:18
🔗
|
|
JW_work has joined #urlteam |