[00:13] *** dxrt_ is now known as dxrt [00:26] rocode: [00:26] hook54321, [00:26] Could kaspersky be the problem/. [00:26] *? [00:28] It could be. [00:28] Is there a way to fix it? [00:28] Researching now. [00:29] https://github.com/zaphoyd/websocketpp/issues/246 seems to mention your specific problem. [00:30] But Kaspersky documentation doesn't seem to list an option relating to it. [00:31] I can't change the kaspersky settings unfortunately [00:31] Unless there's a way around that [00:40] *** Ravenloft has quit IRC (Ping timeout: 633 seconds) [01:19] *** BlueMaxim has joined #archiveteam-bs [01:28] added one more [01:43] *** ravetcofx has quit IRC (Read error: Operation timed out) [01:46] *** ravetcofx has joined #archiveteam-bs [01:46] *** DiscantX has joined #archiveteam-bs [02:11] *** Honno has quit IRC (Read error: Connection reset by peer) [02:16] *** DiscantX has quit IRC (Read error: Operation timed out) [02:19] *** Start has quit IRC (Quit: Disconnected.) [02:27] *** Start has joined #archiveteam-bs [02:42] *** zhongfu has joined #archiveteam-bs [03:06] *** pizzaiolo has quit IRC (Remote host closed the connection) [03:26] *** jrwr has quit IRC (Remote host closed the connection) [03:47] *** ndiddy has quit IRC (Quit: Leaving) [03:47] *** brayden_ is now known as brayden [04:00] *** rocode has quit IRC (hub.efnet.us irc.Prison.NET) [04:00] *** tpw_rules has quit IRC (hub.efnet.us irc.Prison.NET) [04:00] *** dxdx has quit IRC (hub.efnet.us irc.Prison.NET) [04:00] *** achip has quit IRC (hub.efnet.us irc.Prison.NET) [04:30] *** VADemon has quit IRC (Quit: left4dead) [05:07] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [05:14] *** Sk1d has joined #archiveteam-bs [05:33] *** rocode has joined #archiveteam-bs [05:38] *** rocode has quit IRC (Client Quit) [05:38] *** rocode has joined #archiveteam-bs [05:43] *** yeoldetoa has joined #archiveteam-bs [05:56] *** fie has quit IRC (Read error: Operation timed out) [05:58] *** Start has quit IRC (Quit: Disconnected.) [05:59] *** achip has joined #archiveteam-bs [05:59] *** tpw_rules has joined #archiveteam-bs [05:59] *** dxdx has joined #archiveteam-bs [06:02] *** Frogging has quit IRC (Read error: Operation timed out) [06:03] *** Frogging has joined #archiveteam-bs [06:03] *** rduser has quit IRC (Read error: Operation timed out) [06:03] *** Mayonaise has quit IRC (Read error: Operation timed out) [06:03] *** marvinw has quit IRC (Read error: Operation timed out) [06:03] *** balrog has quit IRC (Read error: Operation timed out) [06:03] *** dashcloud has quit IRC (Read error: Operation timed out) [06:03] *** jspiros_ has quit IRC (Read error: Operation timed out) [06:03] *** SadDM has quit IRC (Read error: Operation timed out) [06:03] *** yakfish has quit IRC (Read error: Operation timed out) [06:04] *** Mayonaise has joined #archiveteam-bs [06:04] *** swebb has quit IRC (Read error: Operation timed out) [06:04] *** acridAxid has quit IRC (Read error: Operation timed out) [06:04] *** remsen has quit IRC (Read error: Operation timed out) [06:04] *** kvieta has quit IRC (Ping timeout: 246 seconds) [06:05] *** Baljem has quit IRC (Ping timeout: 246 seconds) [06:05] *** chfoo has quit IRC (Read error: Operation timed out) [06:05] *** marvinw has joined #archiveteam-bs [06:05] *** balrog has joined #archiveteam-bs [06:07] *** rocode has quit IRC (Read error: Operation timed out) [06:11] *** fie has joined #archiveteam-bs [06:11] *** dashcloud has joined #archiveteam-bs [06:12] *** remsen has joined #archiveteam-bs [06:12] *** Baljem has joined #archiveteam-bs [06:12] *** acridAxid has joined #archiveteam-bs [06:12] *** swebb has joined #archiveteam-bs [06:12] *** rocode has joined #archiveteam-bs [06:13] *** chfoo has joined #archiveteam-bs [06:17] *** kvieta has joined #archiveteam-bs [06:36] *** rduser has joined #archiveteam-bs [07:03] *** SadDM has joined #archiveteam-bs [07:04] *** yakfish has joined #archiveteam-bs [07:07] *** jspiros has joined #archiveteam-bs [07:21] *** GE has joined #archiveteam-bs [07:23] *** ravetcofx has quit IRC (Read error: Operation timed out) [08:20] *** zhongfu has quit IRC (Ping timeout: 260 seconds) [08:22] *** zhongfu has joined #archiveteam-bs [08:51] *** Honno has joined #archiveteam-bs [09:05] *** Aranje has quit IRC (Ping timeout: 260 seconds) [09:27] *** zerkalo_ has quit IRC (Ping timeout: 250 seconds) [09:27] *** zerkalo has joined #archiveteam-bs [09:28] *** brayden has quit IRC (Read error: Operation timed out) [10:55] *** brayden has joined #archiveteam-bs [11:29] *** DiscantX has joined #archiveteam-bs [11:32] *** GE has quit IRC (Remote host closed the connection) [11:35] *** schbirid has joined #archiveteam-bs [11:36] so looks like have sitemaps for www.independent.co.uk [11:36] i grabbed them over 18 months ago i have not uploaded them yet [11:37] doing it now [11:37] i only have articles from sitemap from 1992 and 1993-01 [11:45] so good news and bad news [11:46] bad news is the old .xml.gz sitemaps doesn't exist on independent.co.uk anymore [11:47] good news is there is a sitemap.xml?page=${number} format one the site that can use [11:48] with lastmod date one same line as url [11:49] the only bad news about this is they are doing it in alphabet order [11:50] so i will have to do make a sitemap warc.gz of that then grab urls by date order [11:53] *** BlueMaxim has quit IRC (Quit: Leaving) [12:25] so i uploaded the last 4 of Revision3 show called DIY Tryin [12:34] *** Mayonaise has quit IRC (Read error: Operation timed out) [12:34] *** DiscantY has joined #archiveteam-bs [12:35] *** Mayonaise has joined #archiveteam-bs [12:39] *** hook54321 has quit IRC (Ping timeout: 244 seconds) [12:39] *** antonizoo has quit IRC (Ping timeout: 250 seconds) [12:39] *** DiscantX has quit IRC (Read error: Operation timed out) [12:41] *** antonizoo has joined #archiveteam-bs [12:51] *** pizzaiolo has joined #archiveteam-bs [12:55] *** GreenObse has joined #archiveteam-bs [12:56] do we have an instagram user scrapper? [13:13] *** GE has joined #archiveteam-bs [13:36] *** Start has joined #archiveteam-bs [13:42] *** Start has quit IRC (Quit: Disconnected.) [13:45] *** pizzaiolo has quit IRC (Read error: Operation timed out) [13:52] *** pizzaiolo has joined #archiveteam-bs [13:55] *** pizzaiolo has quit IRC (Remote host closed the connection) [13:56] *** pizzaiolo has joined #archiveteam-bs [14:06] *** VADemon has joined #archiveteam-bs [14:37] *** hook54321 has joined #archiveteam-bs [15:21] *** xmc has quit IRC (Read error: Operation timed out) [15:24] *** godane has quit IRC (Ping timeout: 244 seconds) [15:25] *** DiscantY has quit IRC (Read error: Operation timed out) [15:31] *** midas1 has quit IRC (Ping timeout: 633 seconds) [15:38] *** godane has joined #archiveteam-bs [15:47] *** SadDM has quit IRC (Read error: Operation timed out) [15:48] *** yakfish has quit IRC (Read error: Operation timed out) [15:48] *** jspiros has quit IRC (Read error: Operation timed out) [16:02] *** DiscantY has joined #archiveteam-bs [16:19] *** midas1 has joined #archiveteam-bs [16:21] *** godane has quit IRC (Quit: Leaving.) [16:21] *** godane has joined #archiveteam-bs [16:27] xmc: yipdw: looks like the tracker is down. [16:27] anything going on in the background? [16:29] als chfoo ^ [16:29] also* [16:34] *** midas1 has quit IRC (Ping timeout: 260 seconds) [16:47] *** DiscantY has quit IRC (Ping timeout: 633 seconds) [16:48] *** SadDM has joined #archiveteam-bs [16:48] *** yakfish has joined #archiveteam-bs [16:52] *** jspiros has joined #archiveteam-bs [16:59] *** midas1 has joined #archiveteam-bs [17:08] anyone know a good source for the silent Buster Keaton movies? [17:08] should be PD [17:38] *** midas1 has quit IRC (Ping timeout: 250 seconds) [17:41] *** ravetcofx has joined #archiveteam-bs [17:42] *** midas1 has joined #archiveteam-bs [17:54] GreenObse: Unfortunately, while we can scrape Instagram accounts, we do not yet have an "ignore pattern" for doing it in a focused way. [17:54] Here are the ignore patterns we have so far: https://github.com/ArchiveTeam/ArchiveBot/tree/master/db/ignore_patterns [18:00] *** ravetcofx has quit IRC (Read error: Operation timed out) [18:07] Asparagir, thank for the link, I found a paid/lic app for $50 that can get accounts and filter what it scrapes based on post,followers and following count but no free code online as far as i can see, short of writing my own.... using existing none specific auto follow bots i could get users but only 7500 per maintained account over the course of weeks and that's not fast enough to hold my interest in w [18:07] hat im doing.... urgh [18:07] arkiver: SSH on the tracker looks down too [18:09] GreenObse: why do you need to save people's photos at high speed [18:09] it sounds suspicious [18:11] *** ravetcofx has joined #archiveteam-bs [18:12] I was an advocate for gonewilder and saving reddit user content in an automated fashion, I've moved my interest toward ig accounts of all types but only getting snapshots right now and not automated the process, I want to do this to create an image database searchable by filename/image like googles reverse image search but for content taken from ig, mostly as a tool against the rising trend of people u [18:12] sing ig girls photos to catfish people. Google isn't generally caching ig accounts and sites that do aren't well maintained/ dont keep a copy of deleted/ now private images... or because i can [18:13] GreenObse: I wrote a script for instagram. It fetches images and metadata, but does not create WARCs. See https://6xq.net/paste/cikeszun.html [18:14] ok that's what I thought [18:14] PurpleSym, thank you, metadata was my next task but wasn't primary as I wasn't sure about the functionality of my end database and if i should link back to the original authors page/ metadata idk [18:34] Google is literally the new Yahoo. [18:36] I don't know why the hell I decided to rely on their APIs. I should of learned better by now. [18:38] *** ravetcofx has quit IRC (Read error: Operation timed out) [18:38] *** Stil3tt0 has joined #archiveteam-bs [18:38] *** Stil3tt0 is now known as Stiletto [18:38] *** ravetcofx has joined #archiveteam-bs [18:47] *** Ravenloft has joined #archiveteam-bs [18:56] *** Famicoman has quit IRC (Ping timeout: 260 seconds) [19:09] *** Ravenloft has quit IRC (Ping timeout: 633 seconds) [19:14] SketchCow: couldn't the revision3 collection be moved over to computersandtechvideos collection? [19:14] cause most of that stuff is computer and tech videos [19:16] and maybe by moving it there we can have make it visible [19:17] also hoping that doesn't make everything in computersandtechvideos un-viewible [19:29] i'm now grabbing 2015-11-21 to 2015-11-30 medium.com sitemap dumps [19:33] *** GreenObse has quit IRC (Ping timeout: 255 seconds) [19:39] *** GreenObse has joined #archiveteam-bs [19:57] *** ndiddy has joined #archiveteam-bs [20:08] *** rduser has quit IRC (Quit: ZNC - http://znc.in) [21:37] *** xmc has joined #archiveteam-bs [21:47] *** fie has quit IRC (Ping timeout: 245 seconds) [22:09] *** schbirid has quit IRC (Quit: Leaving) [22:29] *** fie has joined #archiveteam-bs [22:50] *** GE has quit IRC (Remote host closed the connection) [23:12] *** BlueMaxim has joined #archiveteam-bs [23:13] *** ndiddy has quit IRC (Read error: Connection reset by peer) [23:14] *** ndiddy has joined #archiveteam-bs [23:41] *** DiscantX has joined #archiveteam-bs [23:52] *** kristian_ has joined #archiveteam-bs