#newsgrabber 2018-06-16,Sat

Logs of this channel are not protected. You can protect them by a password.

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)


WhoWhatWhen
***odemg has quit IRC (Ping timeout: 260 seconds) [01:51]
odemg has joined #newsgrabber [02:03]
................ (idle for 1h18mn)
odemg has quit IRC (Ping timeout: 260 seconds) [03:21]
odemg has joined #newsgrabber
qw3rty119 has joined #newsgrabber
[03:34]
qw3rty118 has quit IRC (Read error: Operation timed out) [03:40]
............ (idle for 57mn)
anonymoos has quit IRC (west.us.hub irc.Prison.NET)
phirephly has quit IRC (west.us.hub irc.Prison.NET)
phirephl- has joined #newsgrabber
[04:37]
.... (idle for 17mn)
anonymoos has joined #newsgrabber [04:55]
............................ (idle for 2h18mn)
blitzed has quit IRC (Quit: Leaving) [07:13]
..... (idle for 21mn)
Smiley has quit IRC (Read error: Operation timed out)
Smiley has joined #newsgrabber
[07:34]
.................................................................................... (idle for 6h55mn)
HCrossFYI - Transition to the new server will be happening within the hour [14:33]
odemgkinky
HCross, treating you well?
[14:36]
HCrossoh cock, still getting the settings init error
https://www.irccloud.com/pastebin/jDdLVMS8/
now on 2.7.9 - same version as the old box
[14:37]
.... (idle for 18mn)
arkiver: just a quick question. I seem to be tripping up on https://github.com/ArchiveTeam/NewsGrabber-Main/blob/master/settings.py#L16 - what should that file rsync_targets be please?
its not on the old server
[14:57]
***newsbuddy has joined #newsgrabber [15:03]
newsbuddyHello! I've just been (re)started. Follow my newsgrabs in #newsgrabberbot [15:03]
HCrossok... uncommenting settings.init() worked [15:03]
arkiverHCross: hmm
just back
[15:05]
HCrossah ok
ive got it to go
[15:05]
arkiverok good [15:06]
HCrossit might be worth clearing the tracker and starting again as well [15:06]
arkiveryeah [15:06]
HCrosslast time I tried tho, that page crashed
although the tracker reboot might have fixed that
[15:07]
arkiverah hold on [15:07]
HCrossive updated DNS [15:07]
arkiverHCross: do you want to requeue everything?
go to Workarounds and Requeue
[15:07]
HCrossyes, but ill need to copy the warriorfiles over first
theyll take a time
[15:08]
arkiverk
ok
[15:08]
***newsbuddy has quit IRC (Remote host closed the connection) [15:08]
arkiverI hope to replace most of newsbuddy eventually with WebArchiver
writing lots of documentation for that now
[15:10]
HCrossperfect [15:10]
arkiverJAA: for the outdated version of wget in wget-lua. I plan on using only the normal wget for WebArchiver and extract URLs at the same time when WARCs are being deduplicated [15:11]
HCrossI can see DNS is beginning to propgagate [15:12]
arkiverawesome :) [15:13]
HCrosswarriorfiles may take a while to copy, theres 3255752 of them
meanwhile, im going to fix all the discoveries - and replace LA
[15:14]
arkiverdiscoveries as in the server working on discovery? [15:15]
HCrossyes [15:15]
...... (idle for 27mn)
***JAA has quit IRC (Read error: Connection reset by peer) [15:42]
........ (idle for 37mn)
JAA has joined #newsgrabber
bakJAA sets mode: +o JAA
[16:19]
......................... (idle for 2h4mn)
HCrossarkiver: is it possible to totally reset the tracker? These warriorlists are going to take ages to copy [18:23]
***MrRadar has quit IRC (Read error: Operation timed out) [18:33]
MrRadar has joined #newsgrabber [18:44]
KazHCross: https://tracker.archiveteam.org/newsgrabber/admin/queues
there's a destroy button, but I've never used it..
[18:46]
...... (idle for 26mn)
HCross6.6GB/15GB of warriorfiles copied, its only doing 3Mbit [19:12]
Kazburn it down [19:16]
......... (idle for 40mn)
arkiverHCross: want me to remove all out and to be done items? [19:56]
HCrossyes please [19:56]
arkiverdone. everything is gone
except done items
I guess we can leave those?
[19:57]
HCrossyes [19:57]
arkiverok [19:57]
***newsbuddy has joined #newsgrabber [19:59]
newsbuddyHello! I've just been (re)started. Follow my newsgrabs in #newsgrabberbot [19:59]
***newsbuddy has quit IRC (Remote host closed the connection)
newsbuddy has joined #newsgrabber
[19:59]
newsbuddyHello! I've just been (re)started. Follow my newsgrabs in #newsgrabberbot [19:59]
HCrossarkiver: odd one, we seem to be getting a lot of the URL updates [20:13]
arkiverwhat do you mean with URL updates? [20:17]
HCross"0 urls added in the past 15 minutes" [20:20]
arkiverah
let's see how that goes in a little bit
will have a look if it's still there in a few minutes
[20:24]
HCrossok
rebooting Singapore, something doesnt seem to be happy
also.. is there an IP whitelist on being able to upload to the tracker?
[20:25]
arkiverafaik no [20:27]
HCrossok, I see we have warrior tasks ready to go too [20:27]
....... (idle for 33mn)
arkiverlooking into this URL adding problem now [21:00]
HCrossta [21:01]
arkiveralso WebArchiver is now fully documented https://github.com/ArchiveTeam/WebArchiver/commit/370006eef29e8a1aaad2ed7a2e24845f42d08ee5 [21:01]
HCrossvery nice [21:01]
arkivernote that WebArchiver is not totally ready yet for use for projects
but the docs will make the code a lot better understandable
JAA: ^ now with docstrings
next up is lots of logging, that has been missing too
[21:02]
JAANice [21:06]
...... (idle for 26mn)
HCrosshmm, not seeing any output when it tries to update the tracker [21:32]
***newsbuddy has quit IRC (Remote host closed the connection)
newsbuddy has joined #newsgrabber
[21:41]
newsbuddyHello! I've just been (re)started. Follow my newsgrabs in #newsgrabberbot [21:41]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)