[00:06] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [00:13] *** Sk1d has joined #archiveteam-bs [00:23] so using my rpi archivebox [00:24] i'm using my setup-wifi script piggyback on my wifi router and connect rpi to internet [00:24] this way a address like 192.168.1.9:8000 will give me kiwix server on it [00:25] without going to a offline wifi network [00:26] i also add a --threads=4 to my kiwix.sh script [00:27] that making kiwix work alot of faster [00:33] so i'm digitize another tape [00:34] first recording is 'How to win at Blackjack' [00:34] on Action Pay-Per-View [00:34] with some promos at the start [00:35] its John Patricks How To Win At Blackjack [00:38] so my rpi kiwix server crash a 2nd time [00:39] How To Win At Jack Black [00:39] i'm unpluging it for now [00:51] so i'm downloading NH Outlook, digitizing a tape, and uploading DTIC for 137xxx range [01:11] Random question, does anyone know if there's some method to search IA for all subdomains they've crawled of a certain domain? [01:18] *** Darkstar has quit IRC (Ping timeout: 1212 seconds) [01:23] *** prb has quit IRC (Read error: Operation timed out) [01:30] *** Darkstar has joined #archiveteam-bs [01:42] phuzion: nice to see you around again! [01:43] Somebody2: I'm still around-ish, just been busy with work and such, you know. [02:19] *** yuitimoth has joined #archiveteam-bs [02:20] *** Darkstar has quit IRC (Ping timeout: 633 seconds) [02:29] phuzion: type the domain into the search box (possibly without the TLD) and it'll show you some [02:29] possibly all [02:36] *** schbirid has quit IRC (Ping timeout: 255 seconds) [02:39] *** dashcloud has quit IRC (Ping timeout: 633 seconds) [02:42] *** dashcloud has joined #archiveteam-bs [02:43] *** qw3rty110 has joined #archiveteam-bs [02:46] *** Darkstar has joined #archiveteam-bs [02:48] *** schbirid has joined #archiveteam-bs [02:49] *** qw3rty19 has quit IRC (Read error: Operation timed out) [02:58] *** WubTheCap has joined #archiveteam-bs [03:35] so looks like i got a making of Virtuosity on HBO [04:02] *** qw3rty111 has joined #archiveteam-bs [04:08] *** qw3rty110 has quit IRC (Read error: Operation timed out) [05:02] *** qw3rty112 has joined #archiveteam-bs [05:08] *** qw3rty111 has quit IRC (Read error: Operation timed out) [05:14] *** RichardG has quit IRC (Ping timeout: 360 seconds) [05:47] *** Atros has joined #archiveteam-bs [05:47] *** Petri152 has quit IRC (Ping timeout: 246 seconds) [05:48] *** Petri152 has joined #archiveteam-bs [05:49] *** atrocity has quit IRC (Read error: Operation timed out) [05:52] *** Ravenloft has quit IRC (Ping timeout: 492 seconds) [05:54] *** yipdw has quit IRC (Remote host closed the connection) [05:56] *** me_ has joined #archiveteam-bs [06:13] *** RichardG has joined #archiveteam-bs [06:35] phuzion: I'm also here on-and-off, myself. [06:54] Anyone know what happened to the original copy of this video? https://www.mediamatters.org/video/2018/01/16/nra-s-media-outlet-launches-anti-trans-attacks-against-chelsea-manning/219081 [07:18] SketchCow: your FOS keeps trying to reupload the file i'm uploading [07:19] like at once percent it tries to start uploading it again [07:39] *** godane1 has joined #archiveteam-bs [07:40] *** Jonimus has quit IRC (Quit: WeeChat 1.4) [07:41] *** godane has quit IRC (Read error: Operation timed out) [07:43] SketchCow: one of the tapes a home movie [07:44] showing newspaper articles about gangs [08:08] *** odemg has quit IRC (Read error: Operation timed out) [08:24] *** odemg has joined #archiveteam-bs [08:45] *** MrRadar2 has quit IRC (Read error: Operation timed out) [08:45] *** BnAboyZ has quit IRC (Read error: Operation timed out) [08:45] *** BnAboyZ has joined #archiveteam-bs [08:50] *** yuitimoth has quit IRC (Ping timeout: 480 seconds) [08:56] *** yuitimoth has joined #archiveteam-bs [08:58] *** MrRadar2 has joined #archiveteam-bs [09:04] *** MrRadar2 has quit IRC (Read error: Operation timed out) [09:04] *** yuitimoth has quit IRC (Remote host closed the connection) [09:06] *** yuitimoth has joined #archiveteam-bs [09:18] *** zhongfu has joined #archiveteam-bs [09:19] *** yuitimoth has quit IRC (Read error: Operation timed out) [09:22] *** yuitimoth has joined #archiveteam-bs [09:36] *** Darkstar has quit IRC (Ping timeout: 252 seconds) [09:53] GMT-morning [09:54] *** Darkstar has joined #archiveteam-bs [10:10] *** MrRadar2 has joined #archiveteam-bs [10:16] *** MrRadar2 has quit IRC (Read error: Operation timed out) [10:22] What would be the best way to archive an entire subreddit + threads + comments + text posts that is private (not public) [10:26] Use AB and use the ignore set I think [10:26] That works [10:27] No, it won't work for private subreddits. [10:27] Also, it won't grab the entire subreddit because the website only returns 1000 threads. [10:27] Ah I missed that about the private bit [10:29] I'm not really sure if there's any good solution. Reddit broke the cloudsearch syntax, so I'm not even aware of any method to find older threads in subreddits currently. [10:29] its not a big subreddit [10:29] And they went closed-source, so you can't just read the code to figure out a way around that. [10:29] about 400 posts maybe a dozen comments on tem [10:29] them* [10:29] *** jello has joined #archiveteam-bs [10:30] *** jello has quit IRC (Client Quit) [10:30] *** MrRadar2 has joined #archiveteam-bs [10:31] Ok, you could probably use wpull then. Run it twice, once logging in and writing a cookiejar, then a recursive grab of the subreddit (with a complex --reject-regex). [10:31] Perhaps there's a better method, not sure. [10:32] *** BlueMaxim has quit IRC (Leaving) [10:43] *** Ravenloft has joined #archiveteam-bs [10:44] *** yuitimoth has quit IRC (Read error: Operation timed out) [10:46] *** Darkstar has quit IRC (Ping timeout: 245 seconds) [10:51] *** yuitimoth has joined #archiveteam-bs [10:58] *** Darkstar has joined #archiveteam-bs [11:34] *** Darkstar has quit IRC (Ping timeout: 480 seconds) [11:53] *** Darkstar has joined #archiveteam-bs [12:07] *** dashcloud has quit IRC (Ping timeout: 633 seconds) [12:13] *** dashcloud has joined #archiveteam-bs [12:22] *** Darkstar has quit IRC (Ping timeout: 246 seconds) [12:34] *** Darkstar has joined #archiveteam-bs [12:34] I finally confirmed my suspicion why wpull 1.2.3 is so slow at adding URLs from hook scripts: it adds each URL to the database individually. That's only the case for hooks though; the other ways to add URLs (input URLs and HTML scraping etc.) are done in batches of 1000 URLs and therefore *much* faster. [12:51] *** schbirid has quit IRC (Ping timeout: 255 seconds) [13:03] *** schbirid has joined #archiveteam-bs [13:05] *** Darkstar has quit IRC (Ping timeout: 260 seconds) [13:10] *** odemg has quit IRC (Ping timeout: 260 seconds) [13:14] *** Darkstar has joined #archiveteam-bs [13:16] *** MrDignity has joined #archiveteam-bs [13:22] *** odemg has joined #archiveteam-bs [13:26] *** Mateon1 has quit IRC (Remote host closed the connection) [13:26] *** Mateon1 has joined #archiveteam-bs [13:33] *** Ceryn has quit IRC (Read error: Operation timed out) [13:34] *** Ceryn has joined #archiveteam-bs [13:41] *** ndiddy has joined #archiveteam-bs [14:10] *** Jonimus has joined #archiveteam-bs [14:10] *** swebb sets mode: +o Jonimus [14:11] *** rsznik has quit IRC (Ping timeout: 264 seconds) [14:11] *** Jj__ has joined #archiveteam-bs [14:13] *** Jj__ has left [14:14] *** Jj__ has joined #archiveteam-bs [14:14] *** Jj__ has left [15:19] *** icedice has joined #archiveteam-bs [15:24] *** dashcloud has quit IRC (Read error: Operation timed out) [15:34] *** dashcloud has joined #archiveteam-bs [15:36] *** superkuh has quit IRC (Read error: Operation timed out) [15:36] *** Fletcher has joined #archiveteam-bs [15:42] *** C4K3 has quit IRC (Read error: Operation timed out) [15:44] *** Stilett0 has quit IRC (Ping timeout: 250 seconds) [15:53] *** C4K3 has joined #archiveteam-bs [15:56] *** Stilett0 has joined #archiveteam-bs [16:27] *** Mateon1 has quit IRC (Ping timeout: 245 seconds) [16:27] *** Mateon1 has joined #archiveteam-bs [16:42] *** odemg has quit IRC (Ping timeout: 506 seconds) [16:54] *** odemg has joined #archiveteam-bs [16:55] so i have to recapture a tape [16:59] the stupid easycap got stuck at 1:17:30 [17:04] *** schbirid has quit IRC (Remote host closed the connection) [17:11] *** RichardG has quit IRC (Read error: Operation timed out) [17:14] *** RichardG has joined #archiveteam-bs [17:43] *** rsznik has joined #archiveteam-bs [18:21] *** icedice has quit IRC (Ping timeout: 252 seconds) [18:21] *** pizzaiolo has joined #archiveteam-bs [18:24] *** icedice has joined #archiveteam-bs [18:27] *** pizzaiolo has quit IRC (Read error: Operation timed out) [19:02] *** DFJustin has quit IRC (Ping timeout: 260 seconds) [19:02] *** rsznik has quit IRC (Ping timeout: 260 seconds) [19:28] *** schbirid has joined #archiveteam-bs [19:34] *** jacketcha has quit IRC (Read error: Operation timed out) [19:49] *** icedice has quit IRC (Quit: Leaving) [20:53] *** superkuh has joined #archiveteam-bs [21:02] *** REiN^ has joined #archiveteam-bs [21:31] *** dashcloud has quit IRC (Remote host closed the connection) [21:33] *** dashcloud has joined #archiveteam-bs [21:44] *** BlueMaxim has joined #archiveteam-bs [21:47] *** ReimuHaku has quit IRC (Ping timeout: 250 seconds) [21:49] *** ReimuHaku has joined #archiveteam-bs [22:18] *** ranavalon has quit IRC (Read error: Connection reset by peer) [22:21] *** ranavalon has joined #archiveteam-bs [23:10] *** jschwart has quit IRC (Read error: Connection reset by peer) [23:11] *** jschwart has joined #archiveteam-bs [23:14] *** ReimuHaku has quit IRC (Read error: Operation timed out) [23:15] *** jschwart has quit IRC (Client Quit) [23:23] *** ReimuHaku has joined #archiveteam-bs [23:57] *** ReimuHaku has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)