Time |
Nickname |
Message |
00:13
🔗
|
|
schbirid2 has joined #archiveteam |
00:16
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
00:24
🔗
|
|
ZexaronS- has quit IRC (Quit: Leaving) |
00:44
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
00:44
🔗
|
|
RichardG has joined #archiveteam |
00:49
🔗
|
|
username1 has joined #archiveteam |
00:52
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
01:11
🔗
|
|
schbirid2 has joined #archiveteam |
01:15
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
01:15
🔗
|
|
username1 has joined #archiveteam |
01:17
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
01:23
🔗
|
|
j08nY has quit IRC (Quit: Leaving) |
01:42
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
01:42
🔗
|
|
RichardG has joined #archiveteam |
01:43
🔗
|
|
mls has quit IRC (Ping timeout: 250 seconds) |
02:11
🔗
|
|
pizzaiolo has quit IRC (Quit: pizzaiolo) |
02:12
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
02:12
🔗
|
|
RichardG has joined #archiveteam |
02:16
🔗
|
|
Odd0002 has quit IRC (Remote host closed the connection) |
02:39
🔗
|
|
schbirid2 has joined #archiveteam |
02:42
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
02:44
🔗
|
|
Guest has joined #archiveteam |
02:50
🔗
|
SketchCow |
Do what you can. |
02:50
🔗
|
|
username1 has joined #archiveteam |
02:52
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
02:52
🔗
|
|
Odd0002 has joined #archiveteam |
02:58
🔗
|
SketchCow |
I'm trying to crack the code with mixes.djfez.com |
03:03
🔗
|
|
schbirid2 has joined #archiveteam |
03:06
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
03:07
🔗
|
|
Asparagir has joined #archiveteam |
03:10
🔗
|
|
username1 has joined #archiveteam |
03:11
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
03:28
🔗
|
|
schbirid2 has joined #archiveteam |
03:30
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
03:49
🔗
|
SketchCow |
HI I CRACKED THE CODE WITH HELP AND I'M DOWNLOADING IT |
03:52
🔗
|
|
qw3rty has joined #archiveteam |
03:58
🔗
|
|
qw3rty2 has quit IRC (Read error: Operation timed out) |
04:07
🔗
|
wp494 |
OKAY WONDERFUL |
04:21
🔗
|
SketchCow |
This is a lot of goddamn music |
04:22
🔗
|
|
Froggypwn has quit IRC (Read error: Operation timed out) |
04:23
🔗
|
|
Froggypwn has joined #archiveteam |
04:33
🔗
|
|
Soni has quit IRC (Read error: Operation timed out) |
04:33
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
04:38
🔗
|
SketchCow |
https://www.twitch.tv/textfilesdotcom |
04:38
🔗
|
SketchCow |
I'm livestreaming archive and other work now. |
04:41
🔗
|
* |
Somebody2 is watching |
04:42
🔗
|
|
BubuAnabe has joined #archiveteam |
04:42
🔗
|
BubuAnabe |
taringa.net may be upgrading to a new version and some sections of the social network may be discontinued |
04:43
🔗
|
|
Sk1d has joined #archiveteam |
04:43
🔗
|
BubuAnabe |
Do you think something could be done about that? |
04:55
🔗
|
|
Soni has joined #archiveteam |
04:58
🔗
|
Somebody2 |
SketchCow: What *is* that noise? |
05:00
🔗
|
SketchCow |
Which |
05:04
🔗
|
Somebody2 |
The mechanical noise -- likely the floppy drive reading disks. |
05:09
🔗
|
|
mls has joined #archiveteam |
05:12
🔗
|
|
godane has quit IRC (Read error: Operation timed out) |
05:35
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
05:35
🔗
|
|
dashcloud has joined #archiveteam |
05:35
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
05:35
🔗
|
|
RichardG has joined #archiveteam |
05:44
🔗
|
|
godane has joined #archiveteam |
06:12
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
06:12
🔗
|
|
RichardG has joined #archiveteam |
06:38
🔗
|
|
Odd0002 has quit IRC (Remote host closed the connection) |
06:38
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
06:39
🔗
|
|
RichardG has joined #archiveteam |
07:06
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
07:06
🔗
|
|
RichardG has joined #archiveteam |
07:26
🔗
|
|
atomotic has joined #archiveteam |
07:29
🔗
|
|
r1c0 has joined #archiveteam |
07:30
🔗
|
|
r1c0 has quit IRC (Client Quit) |
07:30
🔗
|
|
r1c0 has joined #archiveteam |
07:49
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
07:49
🔗
|
|
RichardG has joined #archiveteam |
08:03
🔗
|
|
icedice has joined #archiveteam |
08:04
🔗
|
|
bwn has quit IRC (Ping timeout: 268 seconds) |
08:12
🔗
|
|
bwn has joined #archiveteam |
08:18
🔗
|
|
kyounko has joined #archiveteam |
08:28
🔗
|
|
pikhq has quit IRC (Read error: Operation timed out) |
08:37
🔗
|
|
pikhq has joined #archiveteam |
08:56
🔗
|
|
schbirid2 has quit IRC (Quit: Leaving) |
09:08
🔗
|
|
schbirid has joined #archiveteam |
09:19
🔗
|
SketchCow |
Tell the lune person I downloaded all of djfez. |
09:24
🔗
|
JAA |
BubuAnabe: We could always try throwing it into ArchiveBot, but it's pretty massive and will take a long time. Do you have any more details regarding which parts of Taringa will disappear? (Link to an announcement or something?) |
09:25
🔗
|
JAA |
SketchCow: Sweet. Did you only grab the actual audio or the entire website? (I.e. should we stop the ArchiveBot job or not?) |
09:32
🔗
|
|
j08nY has joined #archiveteam |
09:49
🔗
|
|
Boppen has quit IRC (Quit: Nettalk6 - www.ntalk.de) |
09:49
🔗
|
|
schbirid2 has joined #archiveteam |
09:51
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
09:54
🔗
|
|
Boppen has joined #archiveteam |
10:02
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
10:15
🔗
|
|
redlob has joined #archiveteam |
10:34
🔗
|
|
r1c0 is now known as enr1c0^aw |
11:03
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
11:03
🔗
|
|
RichardG has joined #archiveteam |
11:04
🔗
|
|
kitties has quit IRC (Quit: Connection closed for inactivity) |
11:22
🔗
|
|
ZexaronS has joined #archiveteam |
11:22
🔗
|
|
kyounko has quit IRC (Read error: Connection reset by peer) |
11:24
🔗
|
|
pizzaiolo has joined #archiveteam |
11:24
🔗
|
|
kyounko has joined #archiveteam |
11:34
🔗
|
|
atomotic has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
11:35
🔗
|
|
Jonison has joined #archiveteam |
11:38
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
11:45
🔗
|
|
enr1c0^aw is now known as r1c0 |
11:46
🔗
|
|
Rasierkan has joined #archiveteam |
11:49
🔗
|
|
Guest has quit IRC (Ping timeout: 250 seconds) |
12:08
🔗
|
|
ZexaronS- has joined #archiveteam |
12:08
🔗
|
|
kyounko has quit IRC (Ping timeout: 246 seconds) |
12:09
🔗
|
|
ZexaronS has quit IRC (Ping timeout: 260 seconds) |
12:12
🔗
|
|
atomotic has joined #archiveteam |
12:12
🔗
|
|
ZexaronS has joined #archiveteam |
12:13
🔗
|
|
ZexaronS- has quit IRC (Ping timeout: 260 seconds) |
12:16
🔗
|
pizzaiolo |
folks, the huge Spanish Terra website is going down tomorrow |
12:16
🔗
|
pizzaiolo |
they have news sites in many Spanish-speaking countries |
12:16
🔗
|
pizzaiolo |
https://www.terra.com.br/noticias/sala-de-imprensa/faq-clientes-terramail,fbd5387ca5d4564236664d5f8ed8bb94uwqbteac.html |
12:17
🔗
|
pizzaiolo |
shutting down: terra.com, terra.com.ar, mi.terra.cl, terra.com.co, terra.com.mx, terra.com.pe, terra.com.ve and terra.com.ec |
12:17
🔗
|
|
ZexaronS has quit IRC (Ping timeout: 268 seconds) |
12:57
🔗
|
|
r1c0 has quit IRC (Quit: Broken pipe) |
12:58
🔗
|
|
r1c0 has joined #archiveteam |
13:24
🔗
|
|
r1c0 has quit IRC (Quit: Broken pipe) |
13:25
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
13:25
🔗
|
|
RichardG has joined #archiveteam |
13:25
🔗
|
|
n00b859 has joined #archiveteam |
13:32
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
13:35
🔗
|
|
odemg has joined #archiveteam |
14:02
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
14:02
🔗
|
|
RichardG has joined #archiveteam |
14:07
🔗
|
SketchCow |
Let the archivebot job keep going. |
14:13
🔗
|
|
brayden_ has joined #archiveteam |
14:13
🔗
|
|
swebb sets mode: +o brayden_ |
14:25
🔗
|
|
brayden has quit IRC (Read error: Operation timed out) |
14:25
🔗
|
|
brayden_ is now known as brayden |
14:42
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
14:51
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
14:51
🔗
|
|
RichardG has joined #archiveteam |
15:04
🔗
|
|
alfie has quit IRC (Remote host closed the connection) |
15:06
🔗
|
|
alfie has joined #archiveteam |
15:21
🔗
|
|
thuban4 has joined #archiveteam |
15:24
🔗
|
|
thuban3 has quit IRC (Read error: Operation timed out) |
15:33
🔗
|
|
n00b859 has quit IRC (Quit: Page closed) |
15:48
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
15:49
🔗
|
|
RichardG has joined #archiveteam |
15:57
🔗
|
|
acro has joined #archiveteam |
16:14
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
16:14
🔗
|
|
RichardG has joined #archiveteam |
16:41
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
16:41
🔗
|
|
RichardG has joined #archiveteam |
17:03
🔗
|
|
thuban has joined #archiveteam |
17:05
🔗
|
|
thuban4 has quit IRC (Read error: Operation timed out) |
17:31
🔗
|
BubuAnabe |
JAA: There's a really poor FAQ about the new version, but it seems http://taringa.net/mi will disappear and the http://taringa.net/posts/ and http://taringa.net/comunidades/ sections will merge into "canales" (Spanish for channels) |
17:31
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
17:31
🔗
|
|
RichardG has joined #archiveteam |
17:58
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
17:58
🔗
|
|
RichardG has joined #archiveteam |
18:25
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
18:25
🔗
|
|
RichardG has joined #archiveteam |
18:47
🔗
|
JAA |
BubuAnabe: Hmm. Content is spread over all kinds of URLs, for example the individual posts on /mi are under /username/mi/something. I guess archiving the whole thing has some similarities with archiving all of Reddit: it's not as simple as doing a recursive grab of the website but requires much more work to discover all posts, comments, etc. And the whole thing being in Spanish doesn't exactly make it easier either. Anyway, let's take this to #archiveteam-bs. |
18:53
🔗
|
|
Kalroth_ has joined #archiveteam |
18:56
🔗
|
|
bRick5772 has joined #archiveteam |
18:58
🔗
|
|
Kalroth has quit IRC (Quit: Bye!) |
18:58
🔗
|
|
Kalroth_ is now known as Kalroth |
19:04
🔗
|
|
TheLovina has joined #archiveteam |
19:10
🔗
|
|
wp494 has quit IRC (Ping timeout: 250 seconds) |
19:39
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
19:39
🔗
|
|
RichardG has joined #archiveteam |
20:06
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
20:06
🔗
|
|
RichardG has joined #archiveteam |
20:12
🔗
|
|
Jonison has quit IRC (Read error: Connection reset by peer) |
20:12
🔗
|
|
Jonison has joined #archiveteam |
20:30
🔗
|
|
whydomain has quit IRC (Remote host closed the connection) |
20:30
🔗
|
|
whydomain has joined #archiveteam |
20:31
🔗
|
|
deetwelve has quit IRC (Ping timeout: 260 seconds) |
20:36
🔗
|
|
deetwelve has joined #archiveteam |
20:45
🔗
|
|
Jonison has quit IRC (Read error: Connection reset by peer) |
20:46
🔗
|
|
ItsYoda has quit IRC (Quit: rippppp to the yoda you used to know!) |
20:49
🔗
|
|
luckcolor has quit IRC (Remote host closed the connection) |
20:50
🔗
|
|
ItsYoda has joined #archiveteam |
20:50
🔗
|
|
luckcolor has joined #archiveteam |
20:50
🔗
|
|
deetwelve has quit IRC (Ping timeout: 260 seconds) |
20:51
🔗
|
|
deetwelve has joined #archiveteam |
20:51
🔗
|
|
Jonison has joined #archiveteam |
21:10
🔗
|
|
username1 has joined #archiveteam |
21:11
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
21:18
🔗
|
|
schbirid2 has joined #archiveteam |
21:19
🔗
|
|
zerkalo has joined #archiveteam |
21:20
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
21:44
🔗
|
|
bRick5772 has quit IRC (Quit: Leaving.) |
22:02
🔗
|
|
wp494 has joined #archiveteam |
22:03
🔗
|
|
Jonison has quit IRC (Read error: Connection reset by peer) |
22:22
🔗
|
|
r1c0 has joined #archiveteam |
22:51
🔗
|
|
r1c0 has quit IRC (Quit: Broken pipe) |
22:52
🔗
|
|
r1c0 has joined #archiveteam |
22:56
🔗
|
|
icedice has quit IRC (Quit: Leaving) |
23:16
🔗
|
|
scyther has quit IRC (Remote host closed the connection) |
23:16
🔗
|
|
Sanqui has quit IRC (Remote host closed the connection) |
23:22
🔗
|
Rasierkan |
how many instances can I safely run on the same docker node? |
23:22
🔗
|
Rasierkan |
without getting banned that is |
23:23
🔗
|
JAA |
That depends on the project. Some services ban you very quickly (e.g. Yahoo), others don't care if you hit them with dozens of connections. |
23:23
🔗
|
Rasierkan |
am doing the tinyurl one right now I think |
23:23
🔗
|
Rasierkan |
got it set on automatic |
23:24
🔗
|
Rasierkan |
does it auto detect a ban and switch projects for me? |
23:28
🔗
|
JAA |
With "ArchiveTeam's Choice", it will switch between projects, yes. If you want to maximise your contribution, you can of course run one instance of the warrior per project or use the scripts directly (lower overhead) and adjust the concurrency as appropriate for each project. |
23:30
🔗
|
Rasierkan |
Can I just spin up 5-6 docker instances |
23:30
🔗
|
Rasierkan |
and let it idle on automatic for some weeks |
23:30
🔗
|
Rasierkan |
if I understood you right, I can? |
23:32
🔗
|
JAA |
That would probably not be a good idea, unless you have multiple IP addresses. Maybe I wasn't clear above: the "Our choice" selection is just a short-hand for "whichever project we currently deem most important". So all your instances would run the same project, and while it's not a problem for URLTeam, once we switch the project, you'll likely quickly get banned. |
23:32
🔗
|
Rasierkan |
I see |
23:32
🔗
|
Rasierkan |
I could put it up on multiple addresses but that would probably be more hassle |
23:33
🔗
|
Rasierkan |
so I just run a single instance |
23:37
🔗
|
Rasierkan |
or rather a single instance on automatic and the rest of the instances on the urlteam because there's no problem? |
23:45
🔗
|
|
Sanqui has joined #archiveteam |
23:49
🔗
|
|
scyther has joined #archiveteam |
23:50
🔗
|
JAA |
Rasierkan: I'm not sure where the limit is. I've been running 10 threads per IP for several months. At that concurrency, the tracker's actually the limit because it doesn't generate new items quickly enough (URLTeam is the one project where items are generated dynamically on the tracker). My machines rarely process more than three items at once, i.e. seven threads are just waiting for the tracker to catch up. |
23:50
🔗
|
JAA |
Let's take this to #archiveteam-bs if you have further questions. |
23:55
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
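
JAA's 23:50 remark (10 threads per IP, but rarely more than three items in flight because the tracker cannot hand out items fast enough) can be illustrated with a rough back-of-the-envelope sketch. This is not ArchiveTeam code: the function names and the rates below are hypothetical, chosen only so the result matches the "three busy, seven waiting" figure from the log. It assumes a steady item supply and uses Little's law (average items in flight = arrival rate × time per item), capped at the thread count.

```python
def busy_threads(threads: int, items_per_minute: float, minutes_per_item: float) -> float:
    """Average number of threads doing work at once.

    Little's law gives the average number of items in flight as
    arrival rate * time per item; it can never exceed the thread count.
    """
    demand = items_per_minute * minutes_per_item
    return min(float(threads), demand)


def idle_threads(threads: int, items_per_minute: float, minutes_per_item: float) -> float:
    """Average number of threads waiting for the tracker to supply an item."""
    return threads - busy_threads(threads, items_per_minute, minutes_per_item)


if __name__ == "__main__":
    # Hypothetical numbers: the tracker supplies 1.5 items per minute to this
    # machine, and each item takes 2 minutes to process.
    print(busy_threads(10, 1.5, 2.0))
    print(idle_threads(10, 1.5, 2.0))
```

Under these assumed rates, adding more threads beyond three or so would not increase throughput; only a faster item supply from the tracker would.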