#archiveteam 2017-06-29,Thu

Time Nickname Message
00:13 🔗 schbirid2 has joined #archiveteam
00:16 🔗 schbirid has quit IRC (Read error: Operation timed out)
00:24 🔗 ZexaronS- has quit IRC (Quit: Leaving)
00:44 🔗 RichardG has quit IRC (Read error: Operation timed out)
00:44 🔗 RichardG has joined #archiveteam
00:49 🔗 username1 has joined #archiveteam
00:52 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
01:11 🔗 schbirid2 has joined #archiveteam
01:15 🔗 username1 has quit IRC (Read error: Operation timed out)
01:15 🔗 username1 has joined #archiveteam
01:17 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
01:23 🔗 j08nY has quit IRC (Quit: Leaving)
01:42 🔗 RichardG has quit IRC (Read error: Operation timed out)
01:42 🔗 RichardG has joined #archiveteam
01:43 🔗 mls has quit IRC (Ping timeout: 250 seconds)
02:11 🔗 pizzaiolo has quit IRC (Quit: pizzaiolo)
02:12 🔗 RichardG has quit IRC (Read error: Operation timed out)
02:12 🔗 RichardG has joined #archiveteam
02:16 🔗 Odd0002 has quit IRC (Remote host closed the connection)
02:39 🔗 schbirid2 has joined #archiveteam
02:42 🔗 username1 has quit IRC (Read error: Operation timed out)
02:44 🔗 Guest has joined #archiveteam
02:50 🔗 SketchCow Do what you can.
02:50 🔗 username1 has joined #archiveteam
02:52 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
02:52 🔗 Odd0002 has joined #archiveteam
02:58 🔗 SketchCow I'm trying to crack the code with mixes.djfez.com
03:03 🔗 schbirid2 has joined #archiveteam
03:06 🔗 username1 has quit IRC (Read error: Operation timed out)
03:07 🔗 Asparagir has joined #archiveteam
03:10 🔗 username1 has joined #archiveteam
03:11 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
03:28 🔗 schbirid2 has joined #archiveteam
03:30 🔗 username1 has quit IRC (Read error: Operation timed out)
03:49 🔗 SketchCow HI I CRACKED THE CODE WITH HELP AND I'M DOWNLOADING IT
03:52 🔗 qw3rty has joined #archiveteam
03:58 🔗 qw3rty2 has quit IRC (Read error: Operation timed out)
04:07 🔗 wp494 OKAY WONDERFUL
04:21 🔗 SketchCow This is a lot of goddamn music
04:22 🔗 Froggypwn has quit IRC (Read error: Operation timed out)
04:23 🔗 Froggypwn has joined #archiveteam
04:33 🔗 Soni has quit IRC (Read error: Operation timed out)
04:33 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
04:38 🔗 SketchCow https://www.twitch.tv/textfilesdotcom
04:38 🔗 SketchCow I'm livestreaming archive and other work now.
04:41 🔗 * Somebody2 is watching
04:42 🔗 BubuAnabe has joined #archiveteam
04:42 🔗 BubuAnabe taringa.net may be upgrading to a new version and some sections of the social network may be discontinued
04:43 🔗 Sk1d has joined #archiveteam
04:43 🔗 BubuAnabe Do you think something could be done about that?
04:55 🔗 Soni has joined #archiveteam
04:58 🔗 Somebody2 SketchCow: What *is* that noise?
05:00 🔗 SketchCow Which
05:04 🔗 Somebody2 The mechanical noise -- likely the floppy drive reading disks.
05:09 🔗 mls has joined #archiveteam
05:12 🔗 godane has quit IRC (Read error: Operation timed out)
05:35 🔗 dashcloud has quit IRC (Read error: Operation timed out)
05:35 🔗 dashcloud has joined #archiveteam
05:35 🔗 RichardG has quit IRC (Read error: Operation timed out)
05:35 🔗 RichardG has joined #archiveteam
05:44 🔗 godane has joined #archiveteam
06:12 🔗 RichardG has quit IRC (Read error: Operation timed out)
06:12 🔗 RichardG has joined #archiveteam
06:38 🔗 Odd0002 has quit IRC (Remote host closed the connection)
06:38 🔗 RichardG has quit IRC (Read error: Operation timed out)
06:39 🔗 RichardG has joined #archiveteam
07:06 🔗 RichardG has quit IRC (Read error: Operation timed out)
07:06 🔗 RichardG has joined #archiveteam
07:26 🔗 atomotic has joined #archiveteam
07:29 🔗 r1c0 has joined #archiveteam
07:30 🔗 r1c0 has quit IRC (Client Quit)
07:30 🔗 r1c0 has joined #archiveteam
07:49 🔗 RichardG has quit IRC (Read error: Operation timed out)
07:49 🔗 RichardG has joined #archiveteam
08:03 🔗 icedice has joined #archiveteam
08:04 🔗 bwn has quit IRC (Ping timeout: 268 seconds)
08:12 🔗 bwn has joined #archiveteam
08:18 🔗 kyounko has joined #archiveteam
08:28 🔗 pikhq has quit IRC (Read error: Operation timed out)
08:37 🔗 pikhq has joined #archiveteam
08:56 🔗 schbirid2 has quit IRC (Quit: Leaving)
09:08 🔗 schbirid has joined #archiveteam
09:19 🔗 SketchCow Tell the lune person I downloaded all of djfez.
09:24 🔗 JAA BubuAnabe: We could always try throwing it into ArchiveBot, but it's pretty massive and will take a long time. Do you have any more details regarding which parts of Taringa will disappear? (Link to an announcement or something?)
09:25 🔗 JAA SketchCow: Sweet. Did you only grab the actual audio or the entire website? (I.e. should we stop the ArchiveBot job or not?)
09:32 🔗 j08nY has joined #archiveteam
09:49 🔗 Boppen has quit IRC (Quit: Nettalk6 - www.ntalk.de)
09:49 🔗 schbirid2 has joined #archiveteam
09:51 🔗 schbirid has quit IRC (Read error: Operation timed out)
09:54 🔗 Boppen has joined #archiveteam
10:02 🔗 redlob has quit IRC (Read error: Operation timed out)
10:15 🔗 redlob has joined #archiveteam
10:34 🔗 r1c0 is now known as enr1c0^aw
11:03 🔗 RichardG has quit IRC (Read error: Operation timed out)
11:03 🔗 RichardG has joined #archiveteam
11:04 🔗 kitties has quit IRC (Quit: Connection closed for inactivity)
11:22 🔗 ZexaronS has joined #archiveteam
11:22 🔗 kyounko has quit IRC (Read error: Connection reset by peer)
11:24 🔗 pizzaiolo has joined #archiveteam
11:24 🔗 kyounko has joined #archiveteam
11:34 🔗 atomotic has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…)
11:35 🔗 Jonison has joined #archiveteam
11:38 🔗 BlueMaxim has quit IRC (Quit: Leaving)
11:45 🔗 enr1c0^aw is now known as r1c0
11:46 🔗 Rasierkan has joined #archiveteam
11:49 🔗 Guest has quit IRC (Ping timeout: 250 seconds)
12:08 🔗 ZexaronS- has joined #archiveteam
12:08 🔗 kyounko has quit IRC (Ping timeout: 246 seconds)
12:09 🔗 ZexaronS has quit IRC (Ping timeout: 260 seconds)
12:12 🔗 atomotic has joined #archiveteam
12:12 🔗 ZexaronS has joined #archiveteam
12:13 🔗 ZexaronS- has quit IRC (Ping timeout: 260 seconds)
12:16 🔗 pizzaiolo folks, the huge Spanish Terra website is going down tomorrow
12:16 🔗 pizzaiolo they have news sites in many Spanish-speaking countries
12:16 🔗 pizzaiolo https://www.terra.com.br/noticias/sala-de-imprensa/faq-clientes-terramail,fbd5387ca5d4564236664d5f8ed8bb94uwqbteac.html
12:17 🔗 pizzaiolo shutting down: terra.com, terra.com.ar, mi.terra.cl, terra.com.co, terra.com.mx, terra.com.pe, terra.com.ve and terra.com.ec
12:17 🔗 ZexaronS has quit IRC (Ping timeout: 268 seconds)
12:57 🔗 r1c0 has quit IRC (Quit: Broken pipe)
12:58 🔗 r1c0 has joined #archiveteam
13:24 🔗 r1c0 has quit IRC (Quit: Broken pipe)
13:25 🔗 RichardG has quit IRC (Read error: Operation timed out)
13:25 🔗 RichardG has joined #archiveteam
13:25 🔗 n00b859 has joined #archiveteam
13:32 🔗 odemg has quit IRC (Read error: Operation timed out)
13:35 🔗 odemg has joined #archiveteam
14:02 🔗 RichardG has quit IRC (Read error: Operation timed out)
14:02 🔗 RichardG has joined #archiveteam
14:07 🔗 SketchCow Let the archivebot job keep going.
14:13 🔗 brayden_ has joined #archiveteam
14:13 🔗 swebb sets mode: +o brayden_
14:25 🔗 brayden has quit IRC (Read error: Operation timed out)
14:25 🔗 brayden_ is now known as brayden
14:42 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
14:51 🔗 RichardG has quit IRC (Read error: Operation timed out)
14:51 🔗 RichardG has joined #archiveteam
15:04 🔗 alfie has quit IRC (Remote host closed the connection)
15:06 🔗 alfie has joined #archiveteam
15:21 🔗 thuban4 has joined #archiveteam
15:24 🔗 thuban3 has quit IRC (Read error: Operation timed out)
15:33 🔗 n00b859 has quit IRC (Quit: Page closed)
15:48 🔗 RichardG has quit IRC (Read error: Operation timed out)
15:49 🔗 RichardG has joined #archiveteam
15:57 🔗 acro has joined #archiveteam
16:14 🔗 RichardG has quit IRC (Read error: Operation timed out)
16:14 🔗 RichardG has joined #archiveteam
16:41 🔗 RichardG has quit IRC (Read error: Operation timed out)
16:41 🔗 RichardG has joined #archiveteam
17:03 🔗 thuban has joined #archiveteam
17:05 🔗 thuban4 has quit IRC (Read error: Operation timed out)
17:31 🔗 BubuAnabe JAA: There's a really poor FAQ about the new version but it seems http://taringa.net/mi will disappear and the http://taringa.net/posts/ and http://taringa.net/comunidades/ sections will merge into "canales" (Spanish for channels)
17:31 🔗 RichardG has quit IRC (Read error: Operation timed out)
17:31 🔗 RichardG has joined #archiveteam
17:58 🔗 RichardG has quit IRC (Read error: Operation timed out)
17:58 🔗 RichardG has joined #archiveteam
18:25 🔗 RichardG has quit IRC (Read error: Operation timed out)
18:25 🔗 RichardG has joined #archiveteam
18:47 🔗 JAA BubuAnabe: Hmm. Content is spread over all kinds of URLs, for example the individual posts on /mi are under /username/mi/something. I guess archiving the whole thing has some similarities with archiving all of Reddit: it's not as simple as doing a recursive grab of the website but requires much more work to discover all posts, comments, etc. And the whole thing being in Spanish doesn't exactly make it easier either. Anyway, let's take this to #archiveteam-bs .
18:53 🔗 Kalroth_ has joined #archiveteam
18:56 🔗 bRick5772 has joined #archiveteam
18:58 🔗 Kalroth has quit IRC (Quit: Bye!)
18:58 🔗 Kalroth_ is now known as Kalroth
19:04 🔗 TheLovina has joined #archiveteam
19:10 🔗 wp494 has quit IRC (Ping timeout: 250 seconds)
19:39 🔗 RichardG has quit IRC (Read error: Operation timed out)
19:39 🔗 RichardG has joined #archiveteam
20:06 🔗 RichardG has quit IRC (Read error: Operation timed out)
20:06 🔗 RichardG has joined #archiveteam
20:12 🔗 Jonison has quit IRC (Read error: Connection reset by peer)
20:12 🔗 Jonison has joined #archiveteam
20:30 🔗 whydomain has quit IRC (Remote host closed the connection)
20:30 🔗 whydomain has joined #archiveteam
20:31 🔗 deetwelve has quit IRC (Ping timeout: 260 seconds)
20:36 🔗 deetwelve has joined #archiveteam
20:45 🔗 Jonison has quit IRC (Read error: Connection reset by peer)
20:46 🔗 ItsYoda has quit IRC (Quit: rippppp to the yoda you used to know!)
20:49 🔗 luckcolor has quit IRC (Remote host closed the connection)
20:50 🔗 ItsYoda has joined #archiveteam
20:50 🔗 luckcolor has joined #archiveteam
20:50 🔗 deetwelve has quit IRC (Ping timeout: 260 seconds)
20:51 🔗 deetwelve has joined #archiveteam
20:51 🔗 Jonison has joined #archiveteam
21:10 🔗 username1 has joined #archiveteam
21:11 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
21:18 🔗 schbirid2 has joined #archiveteam
21:19 🔗 zerkalo has joined #archiveteam
21:20 🔗 username1 has quit IRC (Read error: Operation timed out)
21:44 🔗 bRick5772 has quit IRC (Quit: Leaving.)
22:02 🔗 wp494 has joined #archiveteam
22:03 🔗 Jonison has quit IRC (Read error: Connection reset by peer)
22:22 🔗 r1c0 has joined #archiveteam
22:51 🔗 r1c0 has quit IRC (Quit: Broken pipe)
22:52 🔗 r1c0 has joined #archiveteam
22:56 🔗 icedice has quit IRC (Quit: Leaving)
23:16 🔗 scyther has quit IRC (Remote host closed the connection)
23:16 🔗 Sanqui has quit IRC (Remote host closed the connection)
23:22 🔗 Rasierkan how many instances can I safely run on the same docker node?
23:22 🔗 Rasierkan without getting banned that is
23:23 🔗 JAA That depends on the project. Some services ban you very quickly (e.g. Yahoo), others don't care if you hit them with dozens of connections.
23:23 🔗 Rasierkan am doing the tinyurl one right now I think
23:23 🔗 Rasierkan got it set on automatic
23:24 🔗 Rasierkan does it auto detect a ban and switch projects for me?
23:28 🔗 JAA With "ArchiveTeam's Choice", it will switch between projects, yes. If you want to maximise your contribution, you can of course run one instance of the warrior per project or use the scripts directly (lower overhead) and adjust the concurrency as appropriate for each project.
23:30 🔗 Rasierkan Can I just spin up 5-6 docker instances
23:30 🔗 Rasierkan and let it idle on automatic for some weeks
23:30 🔗 Rasierkan if I understood you right I can ?
23:32 🔗 JAA That would probably not be a good idea, unless you have multiple IP addresses. Maybe I wasn't clear above: the "Our choice" selection is just a short-hand for "whichever project we currently deem most important". So all your instances would run the same project, and while it's not a problem for URLTeam, once we switch the project, you'll likely quickly get banned.
23:32 🔗 Rasierkan I see
23:32 🔗 Rasierkan I could put it up on multiple addresses but that would probably be more hassle
23:33 🔗 Rasierkan so I just run a single instance
23:37 🔗 Rasierkan or rather single instance on automatic and the rest of the instances on the urlteam because there's no problem?
23:45 🔗 Sanqui has joined #archiveteam
23:49 🔗 scyther has joined #archiveteam
23:50 🔗 JAA Rasierkan: I'm not sure where the limit is. I've been running 10 threads per IP for several months. At that concurrency, the tracker's actually the limit because it doesn't generate new items quickly enough (URLTeam is the one project where items are generated dynamically on the tracker). My machines rarely process more than three items at once, i.e. seven threads are just waiting for the tracker to catch up.
23:50 🔗 JAA Let's take this to #archiveteam-bs if you have further questions.
23:55 🔗 odemg has quit IRC (Read error: Operation timed out)
