Time |
Nickname |
Message |
00:07
🔗
|
|
chimyatta has quit IRC (Read error: Connection reset by peer) |
00:19
🔗
|
|
picklefac has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
00:56
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
00:57
🔗
|
|
VADemon has joined #archiveteam-ot |
01:47
🔗
|
|
picklefac has joined #archiveteam-ot |
01:56
🔗
|
|
VerfiedJ has quit IRC (Quit: Leaving) |
02:02
🔗
|
|
BlueMax has joined #archiveteam-ot |
02:19
🔗
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
02:39
🔗
|
|
m007a83 has joined #archiveteam-ot |
03:25
🔗
|
eientei95 |
ivan_: Oh huh, that does explain stuff |
04:43
🔗
|
|
Despatche has quit IRC (Read error: Operation timed out) |
04:47
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
04:59
🔗
|
|
odemg has joined #archiveteam-ot |
05:13
🔗
|
|
Despatche has joined #archiveteam-ot |
05:34
🔗
|
|
wp494 has joined #archiveteam-ot |
05:41
🔗
|
|
wp494_ has quit IRC (Read error: Operation timed out) |
05:46
🔗
|
|
yano_ has joined #archiveteam-ot |
05:46
🔗
|
|
swebb has quit IRC (Read error: Operation timed out) |
05:46
🔗
|
|
Frogging has quit IRC (Read error: Operation timed out) |
05:46
🔗
|
|
Frogging has joined #archiveteam-ot |
05:47
🔗
|
|
yano has quit IRC (Read error: Operation timed out) |
05:48
🔗
|
|
swebb has joined #archiveteam-ot |
05:48
🔗
|
|
bithippo has quit IRC (Ping timeout: 246 seconds) |
05:48
🔗
|
|
JAA has quit IRC (Ping timeout: 246 seconds) |
05:52
🔗
|
|
godane has quit IRC (Read error: Operation timed out) |
06:03
🔗
|
|
godane has joined #archiveteam-ot |
06:10
🔗
|
systwi |
is jwplayer a b**** to archive into the wayback machine? |
06:13
🔗
|
systwi |
and is there a difference between "pipeline", "worker" and "warrior"? |
06:21
🔗
|
systwi |
(ok last question i promise) what would one do if they were archiving a website with archivebot, but that website decided to block that archivebot's IP under suspicion of, for example, a DoS attack |
06:22
🔗
|
systwi |
in laymen's terms they mistake archivebot for a spammer |
06:47
🔗
|
|
JAA has joined #archiveteam-ot |
06:48
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
06:48
🔗
|
|
bakJAA sets mode: +o JAA |
06:49
🔗
|
|
odemg has joined #archiveteam-ot |
07:08
🔗
|
|
nataraj has joined #archiveteam-ot |
07:30
🔗
|
|
nataraj has quit IRC (Read error: Operation timed out) |
08:51
🔗
|
|
Hani has quit IRC (Read error: Connection reset by peer) |
08:52
🔗
|
|
Hani has joined #archiveteam-ot |
09:02
🔗
|
|
Oddly has joined #archiveteam-ot |
09:13
🔗
|
|
Despatche has quit IRC (Read error: Operation timed out) |
09:31
🔗
|
psi |
haha good joke https://usercontent.irccloud-cdn.com/file/Q8hptmDJ/image.png |
09:44
🔗
|
|
nataraj has joined #archiveteam-ot |
09:59
🔗
|
|
nataraj has quit IRC (Read error: Operation timed out) |
10:22
🔗
|
|
nataraj has joined #archiveteam-ot |
10:50
🔗
|
|
nataraj has quit IRC (Read error: Operation timed out) |
12:00
🔗
|
|
Oddly has quit IRC (Ping timeout: 259 seconds) |
12:19
🔗
|
JAA |
systwi: The warrior is the VM. The pipeline is what communicates with the tracker and spawns workers. A worker processes a single item (e.g. one Tumblr blog). At least that's how I use those terms. |
12:20
🔗
|
JAA |
systwi: And if AB jobs get banned, we typically increase the delay massively (usually to 3 minutes) and hope that the ban gets lifted. If that doesn't help, we might restart the job with a browser user agent. |
12:20
🔗
|
kiska |
And if its an IP ban, then we queue it on a different pipeline |
12:24
🔗
|
JAA |
systwi: And re #archiveteam: That's exactly why I have JS disabled by default. XMR miners are another fairly frequent annoyance in certain parts of the web. And well, tracking and all that crap. |
12:26
🔗
|
kiska |
PEP8: continuation line under-indented for visual indent.... |
12:26
🔗
|
JAA |
Fuck PEP8's indentation rules. |
12:26
🔗
|
JAA |
Tabs FTW |
12:28
🔗
|
kiska |
Yeah I am using pycharm to make sure I don't make any variable errors, but.... |
12:29
🔗
|
JAA |
I'm sure you can disable it somewhere. |
12:29
🔗
|
JAA |
Haven't used PyCharm in years though. |
12:30
🔗
|
kiska |
Yeah I am now coding up the wedpics discovery project |
12:31
🔗
|
JAA |
Ah, sweet. I wonder how much we will be able to get out of that. The codes seem too long. |
12:31
🔗
|
kiska |
Yeah... |
12:32
🔗
|
kiska |
Do you mind writing the generation code? a-z0-9 any number ranges |
12:32
🔗
|
kiska |
I am going to prefix it with id: |
12:33
🔗
|
JAA |
Oh, only lowercase? |
12:33
🔗
|
kiska |
Actually now would be a good time for our own pastebin service |
12:33
🔗
|
kiska |
Let me check that |
12:35
🔗
|
JAA |
Also, why are we discussing this in -ot? :-) |
13:14
🔗
|
|
Oddly has joined #archiveteam-ot |
14:33
🔗
|
|
wp494_ has joined #archiveteam-ot |
14:36
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
14:40
🔗
|
|
Oddly has quit IRC (Ping timeout: 255 seconds) |
15:53
🔗
|
|
VerfiedJ has joined #archiveteam-ot |
15:57
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
16:00
🔗
|
|
odemg has joined #archiveteam-ot |
16:19
🔗
|
|
yano_ is now known as yano |
16:32
🔗
|
|
Oddly has joined #archiveteam-ot |
17:03
🔗
|
|
nataraj has joined #archiveteam-ot |
17:07
🔗
|
|
Oddly has quit IRC (Ping timeout: 255 seconds) |
17:23
🔗
|
|
nataraj has quit IRC (Read error: Operation timed out) |
17:32
🔗
|
astrid |
fuck computers. |
18:10
🔗
|
|
LFlare has quit IRC (Quit: The Lounge - https://thelounge.chat) |
18:23
🔗
|
|
Oddly has joined #archiveteam-ot |
19:09
🔗
|
|
picklefac has quit IRC (Remote host closed the connection) |
19:10
🔗
|
|
picklefac has joined #archiveteam-ot |
19:24
🔗
|
|
nataraj has joined #archiveteam-ot |
19:35
🔗
|
|
nataraj has quit IRC (Read error: Operation timed out) |
19:44
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
19:45
🔗
|
|
odemg has joined #archiveteam-ot |
19:53
🔗
|
|
Oddly has quit IRC (Ping timeout: 255 seconds) |
21:01
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 360 seconds) |
21:01
🔗
|
|
Mateon1 has joined #archiveteam-ot |
21:16
🔗
|
|
LFlare has joined #archiveteam-ot |
21:42
🔗
|
|
nataraj has joined #archiveteam-ot |
21:45
🔗
|
|
robogoat_ is now known as robogoat |
21:51
🔗
|
|
nataraj has quit IRC (Read error: Operation timed out) |
22:17
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
22:20
🔗
|
|
odemg has joined #archiveteam-ot |
23:33
🔗
|
|
wp494 has joined #archiveteam-ot |
23:42
🔗
|
|
wp494_ has quit IRC (Ping timeout: 615 seconds) |