[00:01] *** closure has quit IRC (Read error: Connection reset by peer) [00:03] *** closure has joined #archiveteam-bs [00:10] *** caff has quit IRC (Read error: Connection reset by peer) [00:18] *** m007a83_ has joined #archiveteam-bs [00:21] *** m007a83 has quit IRC (Read error: Operation timed out) [00:21] *** schbirid has quit IRC (Read error: Operation timed out) [00:22] *** schbirid has joined #archiveteam-bs [00:33] *** m007a83_ is now known as m007a83 [00:33] *** m007a83 has quit IRC (Quit: Fuck you Comcast) [00:33] *** m007a83 has joined #archiveteam-bs [00:57] *** ndiddy has quit IRC (Ping timeout: 252 seconds) [00:58] *** closure has quit IRC (Read error: Operation timed out) [01:02] *** schbirid has quit IRC (Read error: Operation timed out) [01:03] *** schbirid has joined #archiveteam-bs [01:03] *** closure has joined #archiveteam-bs [01:23] *** ndiddy has joined #archiveteam-bs [01:23] *** ndiddy has quit IRC (Client Quit) [01:27] *** odemg_ has quit IRC (Ping timeout: 268 seconds) [01:38] *** odemg_ has joined #archiveteam-bs [01:41] *** schbirid has quit IRC (Read error: Operation timed out) [01:41] *** schbirid has joined #archiveteam-bs [02:00] *** closure has quit IRC (Read error: Connection reset by peer) [02:02] *** closure has joined #archiveteam-bs [02:31] *** PhrackD has quit IRC (bye) [02:31] SketchCow: any news about me getting return labels? [02:38] One of the ironies of that "plan" is, I can't help but imagine that even people opposed to the CoC would be *really* pissed by someone trying that. [02:40] *** PhrackD has joined #archiveteam-bs [02:59] *** closure has quit IRC (Read error: Connection reset by peer) [03:02] *** closure has joined #archiveteam-bs [03:10] zyphlar: there are a few of these supposedly-hyper-neutral sites that inevitably become alt-right rags pretending to be 'fair and balanced' [03:10] in NL, we have The Post Online doing the same [03:10] they may have once had the stated purpose, but if so, then the people running the place apparently didn't understand that no, not all ideologies should be on equal footing [03:11] with this inevitable result :P [03:16] *** davidar has joined #archiveteam-bs [03:20] Yaaaay [03:20] *** Ctrl-S has joined #archiveteam-bs [03:25] *** godane has quit IRC (Ping timeout: 252 seconds) [03:27] *** odemg_ has quit IRC (Ping timeout: 268 seconds) [03:39] *** odemg_ has joined #archiveteam-bs [03:44] *** schbirid has quit IRC (Read error: Operation timed out) [03:44] *** schbirid has joined #archiveteam-bs [03:46] *** archodg__ has joined #archiveteam-bs [03:48] *** archodg_ has quit IRC (Ping timeout: 252 seconds) [03:49] *** odemg_ has quit IRC (Ping timeout: 268 seconds) [04:00] *** closure has quit IRC (Read error: Connection reset by peer) [04:00] *** closure has joined #archiveteam-bs [04:01] *** odemg_ has joined #archiveteam-bs [04:11] *** TC04 has joined #archiveteam-bs [04:12] *** TC01 has quit IRC (Read error: Operation timed out) [04:27] *** godane has joined #archiveteam-bs [04:27] *** svchfoo3 sets mode: +o godane [04:29] *** schbirid has quit IRC (Read error: Operation timed out) [04:32] *** schbirid has joined #archiveteam-bs [04:59] *** closure has quit IRC (Read error: Connection reset by peer) [05:00] *** closure has joined #archiveteam-bs [05:54] *** HCross has joined #archiveteam-bs [06:00] *** closure has quit IRC (Read error: Connection reset by peer) [06:28] *** closure_ has joined #archiveteam-bs [06:38] *** closure has joined #archiveteam-bs [06:38] *** closure_ has quit IRC (Read error: Connection reset by peer) [06:41] *** bsmith093 has joined #archiveteam-bs [06:57] *** schbirid has quit IRC (Read error: Operation timed out) [06:58] *** schbirid has joined #archiveteam-bs [06:59] *** closure has quit IRC (Read error: Operation timed out) [08:32] *** schbirid has quit IRC (Read error: Operation timed out) [08:42] VoynichCr: Do you have plans for adding other URLs to your bot? https://www.wikidata.org/wiki/Property:P3265 for example? [08:48] There’s more social media accounts hidden behind P3040 P2002 P2397 P2003 and P2013. [10:01] *** chferfa has quit IRC () [10:38] *** chferfa has joined #archiveteam-bs [12:06] *** BlueMax has quit IRC (Read error: Connection reset by peer) [12:11] PurpleSym: i could, but the table can get too complicated, and searching for TW/FB/etc urls on ArchiveBot Viewer isn't as easy as website domains [12:12] suggestions are welcome [12:37] *** eientei95 has joined #archiveteam-bs [12:53] VoynichCr: You could rowspan the common lines (Name, Description, …) and have one line per URL for ArchiveBot and Archive details. Since nobody edits the table by hand this should be fine. [12:54] I think we lack the metadata to search for URL prefixes. You could search the IRC logs though. [12:57] Indeed, but it's not easy to search for social media jobs in any case because we usually use !ao < jobs for that, not !a on the actual social media site. [12:58] True, that’s an issue. Your metadata collection is not the right place for stuff like that, JAA? [13:01] PurpleSym: You mean my archivebot-archives tool? That would only find the URL of the URL list. Not the URLs inside that list. [13:02] Hmm, actually, the URL list is also on IA, so I guess it could be extended that way. [13:04] Yep, that one. I didn’t know there were .json files with metadata for each grab. [13:04] Neat. [13:04] Yeah, the JSON file contains some very basic metadata on the job, and the -urls.txt file contains the URL list for !ao < and !a < jobs. [13:05] The JSON file is where the viewer gets the job URL from (well, if that weren't broken). [13:29] IA api shows where a grab come from? spider, archivebot... [13:32] *** wp494 has quit IRC (Read error: Operation timed out) [13:32] *** wp494 has joined #archiveteam-bs [13:36] PurpleSym: Speaking of that tool, do you have any ideas how I could make it more useful? grepping a directory of ~16k YAML files works, I guess, but it's definitely not optimal. And a DB (e.g. sqlite3) wouldn't work well with version control. [13:37] I'd have to restructure it anyway if I wanted to integrate the information from JSON and URL lists in there. [13:51] I suppose I could create a fake DB using one YAML file per table. I wonder if that would actually be better though. [13:56] JAA: Sure, sqlite is not optimal for git storage, but if you want search functionality it is definitely the way to go. [13:58] PurpleSym: Yeah. Or I could keep the data in a text format in the repo for git purposes and have an import script which inserts it into an SQLite DB for searches etc. [14:00] Having the actual DB in git could get nasty quickly due to size and diffs. The repo is already several hundred MiB in the most basic format possible. [14:00] SQL dump? [14:31] *** Pixi has quit IRC (Quit: Pixi) [15:14] *** Jens has quit IRC (Remote host closed the connection) [15:14] *** Jens has joined #archiveteam-bs [15:15] *** closure_ has joined #archiveteam-bs [15:20] *** Pixi has joined #archiveteam-bs [15:41] *** zhongfu has joined #archiveteam-bs [15:58] *** closure_ has quit IRC (Read error: Connection reset by peer) [15:59] *** closure has joined #archiveteam-bs [16:58] *** closure has quit IRC (Read error: Connection reset by peer) [16:58] *** closure_ has joined #archiveteam-bs [17:28] *** arbin_ has joined #archiveteam-bs [18:00] *** closure_ has quit IRC (Read error: Connection reset by peer) [18:08] *** closure has joined #archiveteam-bs [18:23] *** jmtd is now known as Jon [18:40] *** closure has quit IRC (Read error: Connection reset by peer) [19:16] *** closure has joined #archiveteam-bs [20:01] *** closure has quit IRC (Read error: Connection reset by peer) [20:03] *** closure has joined #archiveteam-bs [20:08] *** Mateon1 has quit IRC (Ping timeout: 268 seconds) [20:08] *** Mateon1 has joined #archiveteam-bs [20:16] *** vectr0n_ has joined #archiveteam-bs [20:17] *** vectr0n has quit IRC (Read error: Operation timed out) [20:18] *** vectr0n_ is now known as vectr0n [20:18] *** closure has quit IRC (Read error: Connection reset by peer) [20:23] *** closure_ has joined #archiveteam-bs [20:27] is there a single canonical template for warrior pipelines or are they just copied over from whatever the most recent project was? [20:27] wanted to send a patch but I'm not sure which repo would be appropriate [20:29] They're just copied from project to project. But the idea of a common template isn't new, and I think it would be very useful indeed. [20:35] JAA: started tubeup on the ZAM youtube [20:36] HCross: Sweet. [20:36] We should create a wiki page for ZAM. [20:43] tubeup is running, albeit only at 100Mbit [20:52] *** S1mpbrain has joined #archiveteam-bs [20:53] *** SimpBrain has quit IRC (Read error: Operation timed out) [20:57] lol, someone "hijacked" (reregistered?) ZAM's Twitter page: https://twitter.com/zamofficial [20:58] *** closure_ has quit IRC (Read error: Operation timed out) [21:03] *** closure has joined #archiveteam-bs [21:13] *** S1mpbrain has quit IRC (Remote host closed the connection) [21:59] *** closure has quit IRC (Read error: Connection reset by peer) [22:03] *** closure has joined #archiveteam-bs [22:29] Created an initial page for tracking ZAM Network. This company is a mess. [22:30] Please feel free to expand this page with details on all the sites, the ArchiveBot jobs etc. [22:59] *** closure has quit IRC (Read error: Connection reset by peer) [23:08] *** closure has joined #archiveteam-bs [23:11] *** BlueMax has joined #archiveteam-bs [23:58] *** closure has quit IRC (Read error: Connection reset by peer)