[00:16] *** pizzaiolo has quit IRC (Quit: pizzaiolo)
[00:17] *** pizzaiolo has joined #archiveteam
[00:40] *** drumstick has quit IRC (Read error: Operation timed out)
[00:45] *** BlueMaxim has joined #archiveteam
[01:26] *** Lagittaja has quit IRC (Quit: Leaving)
[01:26] *** pizzaiolo has quit IRC (Quit: pizzaiolo)
[01:49] *** drumstick has joined #archiveteam
[01:59] *** odemg has quit IRC (Read error: Operation timed out)
[02:08] *** odemg has joined #archiveteam
[02:52] *** drumstick has quit IRC (Read error: Connection reset by peer)
[02:59] *** drumstick has joined #archiveteam
[03:05] *** Asparagir has joined #archiveteam
[04:21] *** Jogie has quit IRC (Read error: Operation timed out)
[04:26] *** Jogie has joined #archiveteam
[04:49] *** Sk1d has quit IRC (Ping timeout: 194 seconds)
[04:55] *** Sk1d has joined #archiveteam
[05:25] *** dserodio has quit IRC (Read error: Operation timed out)
[05:32] *** dserodio has joined #archiveteam
[05:54] *** Asparagir has quit IRC (Asparagir)
[06:28] *** pam has joined #archiveteam
[06:33] *** pam has quit IRC (Ping timeout: 268 seconds)
[07:01] *** Soni has quit IRC (Ping timeout: 272 seconds)
[07:02] *** Soni has joined #archiveteam
[07:07] *** kevinr has quit IRC (Read error: Operation timed out)
[07:08] *** flyingzum has quit IRC (Read error: Operation timed out)
[07:26] *** Selanda has quit IRC (Ping timeout: 250 seconds)
[07:27] *** Selanda has joined #archiveteam
[07:28] *** flyingzum has joined #archiveteam
[07:36] *** treora has quit IRC (Read error: Operation timed out)
[07:37] *** cache_ has quit IRC (Read error: Operation timed out)
[07:40] *** treora has joined #archiveteam
[07:41] *** cache_ has joined #archiveteam
[07:49] *** kevinr has joined #archiveteam
[07:54] *** trvz has joined #archiveteam
[07:57] *** atomotic has joined #archiveteam
[08:40] *** luckcolor has quit IRC (Read error: Operation timed out)
[08:40] *** MrRadar2 has quit IRC (Read error: Operation timed out)
[08:40] *** bluesoul has quit IRC (Read error: Operation timed out)
[08:43] *** luckcolor has joined #archiveteam
[08:44] *** tsr has quit IRC (Read error: Operation timed out)
[08:48] *** bluesoul has joined #archiveteam
[08:51] *** MrRadar2 has joined #archiveteam
[08:53] *** Honno has joined #archiveteam
[09:00] *** tsr has joined #archiveteam
[09:06] *** Dimtree has quit IRC (Read error: Operation timed out)
[09:52] *** Dimtree has joined #archiveteam
[10:48] *** Soni has quit IRC (Ping timeout: 272 seconds)
[10:49] *** mls has quit IRC (Ping timeout: 250 seconds)
[11:01] *** mls has joined #archiveteam
[11:03] *** BlueMaxim has quit IRC (Quit: Leaving)
[11:18] *** drumstick has quit IRC (Ping timeout: 600 seconds)
[11:18] *** drumstick has joined #archiveteam
[11:27] *** drumstick has quit IRC (Ping timeout: 255 seconds)
[11:30] *** pizzaiolo has joined #archiveteam
[11:30] *** Soni has joined #archiveteam
[11:37] *** is-_ has joined #archiveteam
[11:39] *** is- has quit IRC (Read error: Operation timed out)
[12:00] *** winworldp has joined #archiveteam
[12:05] *** Lagittaja has joined #archiveteam
[12:05] *** winworldp has quit IRC (Ping timeout: 268 seconds)
[12:13] *** Dimtree has quit IRC (Peace)
[12:20] *** mls has quit IRC (Ping timeout: 250 seconds)
[12:23] *** Dimtree_ has joined #archiveteam
[12:37] *** mls has joined #archiveteam
[12:38] *** Soulflare has quit IRC (Quit: http://drsclan.net)
[12:38] *** Soulflare has joined #archiveteam
[12:39] *** Dimtree_ is now known as Dimtree
[12:48] *** pizzaiolo has quit IRC (Quit: pizzaiolo)
[12:48] *** pizzaiolo has joined #archiveteam
[12:52] *** refeed has joined #archiveteam
[14:19] *** Guest has quit IRC (Read error: Operation timed out)
[14:40] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[14:40] *** second has joined #archiveteam
[14:44] *** sep332 has joined #archiveteam
[15:34] *** TheLovina has quit IRC (Ping timeout: 370 seconds)
[15:34] *** atomotic has joined #archiveteam
[16:30] *** BartoCH has joined #archiveteam
[16:34] *** refeed has quit IRC (Quit: Leaving)
[16:42] *** Mateon1 has quit IRC (Read error: Operation timed out)
[16:42] *** Mateon1 has joined #archiveteam
[16:49] *** Aranje has joined #archiveteam
[16:50] *** jrra has left part
[17:09] *** etudier has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…)
[17:11] *** etudier has joined #archiveteam
[17:14] *** TheLovina has joined #archiveteam
[17:23] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[18:07] *** atomotic has joined #archiveteam
[18:15] Hello
[18:15] Can I run the warrior in docker?
[18:15] And what kind of websites will you be hitting?
[18:16] What is the best way to archive a website right now? A javascript website that is.
[18:16] wpull seems to be unmaintained at the moment
[18:18] *** Asparagir has joined #archiveteam
[18:18] Yes, there is a Docker image for the warrior. We're archiving whatever needs to be archived; at the moment, there are only the usual suspects running (URL shorteners and news articles, both very long-running projects), but MiiVerse will be coming up soonish.
[18:20] There isn't any proper way to archive very JS-heavy websites currently. You can try with wpull and PhantomJS, but that setup has its own problems. If the site isn't too large, i.e. it's feasible to click through everything manually, try using your browser with warcprox.
[18:20] second: ^
[18:22] The latter is something I was trying with chrome headless.
[18:22] Yes, I'll look into that as well with Firefox once the headless mode is released (sometime this month, I think?).
[18:24] chrome headless actually crashed because the page I'm trying to archive has so much JS on it
[18:24] Well, good luck then. :-D
[18:24] Also, I think we should move this to #archiveteam-bs.
[18:24] Is there any tool right now to categorize data?
[19:11] *** TheLovina has quit IRC (Read error: Operation timed out)
[19:30] *** Mateon1 has quit IRC (Remote host closed the connection)
[19:30] *** Mateon1 has joined #archiveteam
[19:35] *** atomotic has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…)
[19:37] *** is-_ is now known as is-
[19:45] *** dzho has joined #archiveteam
[19:49] *** K4k has quit IRC (Quit: WeeChat 1.6)
[19:49] *** K4k has joined #archiveteam
[19:52] *** K4k has quit IRC (Client Quit)
[19:53] *** K4k has joined #archiveteam
[20:03] *** K4k has quit IRC (Quit: WeeChat 1.6)
[20:04] *** K4k has joined #archiveteam
[20:06] *** atomotic has joined #archiveteam
[20:33] *** Aranje has quit IRC (Ping timeout: 245 seconds)
[20:49] *** Aranje has joined #archiveteam
[21:57] *** Stiletto has quit IRC (Ping timeout: 260 seconds)
[21:58] *** drumstick has joined #archiveteam
[22:16] *** dashcloud has quit IRC (Read error: Operation timed out)
[22:19] *** dashcloud has joined #archiveteam
[22:37] *** Honno has quit IRC (Read error: Operation timed out)
[22:42] *** BartoCH has quit IRC (Quit: WeeChat 1.9)
[22:46] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[22:46] *** dashcloud has quit IRC (Remote host closed the connection)
[22:50] *** Mateon1 has quit IRC (Remote host closed the connection)
[22:50] *** Mateon1 has joined #archiveteam
[22:52] *** dashcloud has joined #archiveteam
[23:43] *** nertzy has quit IRC (Read error: Operation timed out)