#newsgrabber 2017-09-06,Wed

Logs of this channel are not protected. You can protect them by a password.

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)


WhoWhatWhen
***newsbuddy has quit IRC (Remote host closed the connection)
newsbuddy has joined #newsgrabber
[01:40]
newsbuddyHello! I've just been (re)started. Follow my newsgrabs in #newsgrabberbot [01:40]
***newsbuddy has quit IRC (Remote host closed the connection)
newsbuddy has joined #newsgrabber
[01:40]
newsbuddyHello! I've just been (re)started. Follow my newsgrabs in #newsgrabberbot [01:40]
.............. (idle for 1h5mn)
***kyan has quit IRC (Read error: Operation timed out) [02:45]
................................... (idle for 2h53mn)
underscor has quit IRC (Read error: Operation timed out) [05:38]
.......... (idle for 48mn)
MrRadar has quit IRC (Read error: Operation timed out)
MrRadar has joined #newsgrabber
[06:26]
............................................................ (idle for 4h55mn)
Sait0_san has quit IRC (Quit: Page closed) [11:24]
........... (idle for 53mn)
HCross2FYI - BT.dk seem to be sending ISP abuse reports
https://www.irccloud.com/pastebin/jLMJr34U/
arkiver: ^
[12:17]
^ am resolving - Can someone please accept https://github.com/ArchiveTeam/NewsGrabber-Services/pull/1 [12:28]
....... (idle for 34mn)
arkiver: can you please accept that git pull I sent in for the BT.dk service? I worked with Michael from BT and this is the best compromise so we can still get something [13:02]
......... (idle for 42mn)
JAA70k requests over 5 days gets them worried? Oh, please... [13:44]
.............. (idle for 1h5mn)
arkiverHCross2: merged
but not in the discoverers yet I think
[14:49]
HCross2Thanks
I forwarded you my email chain with him
[14:50]
.............................. (idle for 2h25mn)
jrwrI ioniced the ssdb
Check to.ser how fast I'm processing HCross2
[17:15]
Aoedehttp://www.cbc.ca/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/scripts/scripts/scripts/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/scripts/scripts/scripts/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scri
pts/scripts/scripts/scripts/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/2017/whatsyourstory/scripts/scripts/scripts/scripts/scripts/22.f8670aff1fd5781d825d.js
wpull gets stuck in loops like this
[17:20]
................... (idle for 1h30mn)
mlsIt's actually the discovered urls being fed to the script
Aoede: Did you also notice how it's slowing down wpull?
[18:50]
Aoedemls: didn't notice that
wait nevermind
it does slow down
[18:59]
mlsHCross2 arkiver Think it's worth looking into the url issue? [19:03]
HCross2thats interesting, so its the discovery end doing it [19:04]
mlsThat's what I already figured, apologies for not sharing my suspicions [19:05]
............ (idle for 56mn)
HCross2arkiver: that capping you mentioned on grabbed URLs, we may need to actually deploy that further back in the chain almost, on the discovery side [20:01]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)