[00:34] *** Rome has joined #archiveteam-bs
[00:40] *** RomeSilva has quit IRC (Read error: Operation timed out)
[00:42] *** enowaldo has joined #archiveteam-bs
[00:48] *** enowaldo has quit IRC (Ping timeout: 268 seconds)
[00:55] Kaz: I totally understand, I posted in several channels asking what people's opinions were on the move. I got the go-ahead from @arkiver but I don't mind that it's been reverted
[00:57] where did you get the go ahead? I didn't see it in the logs
[00:57] IIPC Slack
[00:57] *** BlueMax has quit IRC (Quit: Leaving)
[00:57] then PM with him directly
[00:57] @jasonscott also mentioned it could be done, but didn't explicitly approve it, so don't blame him
[00:58] I'll repost my original message here so you can see it
[00:58] does anyone know who manages the ArchiveTeam wiki? I was wondering if I could talk to them about potentially renaming this page to ArchiveToolbox https://www.archiveteam.org/index.php?title=ArchiveBox (due to name-conflict with archivebox.io)
[00:58] I am hoping to cut down on conflicting google results and it looks like a mostly dormant page stored only for historical reasons, but I also totally understand if they'd prefer to keep it with the current name, the wiki page has been there much longer than my project, so of course you guys get priority.
[01:00] a few people helped direct me to the right place and gave me the edit password, apologies as I'm not familiar with the ArchiveTeam wiki's governance process, I just joined this stream yesterday
[01:00] in general do you post things here and wait for some +1s? how are edits usually proposed?
[01:01] You guys are my heroes so I definitely don't want to go stepping on toes ;P
[01:08] Flashfire: would you be ok with me adding a disambiguation section to the top/bottom of that wiki that mentions it's unrelated? I'll also add a disambiguation section in the ArchiveBox.io docs
[01:09] I have no holding except for in URLTeam at this stage my friend
[01:10] Yeah, things like that would usually be discussed here.
[01:11] ^
[01:12] A disambiguation notice sounds good. I'm not sure if you really need to add one on your docs as well.
[01:12] I have a big community page where I list related projects, so i'd just make a small note there next to it, nbd https://github.com/pirate/ArchiveBox/wiki/Web-Archiving-Community
[01:13] Ah yeah, that sounds good.
[01:15] `Note: This page describes a collection of unix tools curated by ArchiveTeam and is unrelated to the ArchiveBox.io project.`
[01:15] how about that?
[01:19] *** Zerote has quit IRC (Ping timeout: 260 seconds)
[01:20] *** BlueMax has joined #archiveteam-bs
[01:22] *** tech234a has joined #archiveteam-bs
[01:26] *** Despatche has quit IRC (Read error: Connection reset by peer)
[02:36] *** kbtoo has joined #archiveteam-bs
[02:37] *** kbtoo_ has quit IRC (Ping timeout: 255 seconds)
[02:56] *** PhrackD has quit IRC (Read error: Connection reset by peer)
[03:00] *** PhrackD has joined #archiveteam-bs
[03:16] even though it's already in the dead section is it worth updating the fact that a.gd - redirects to eg.gg which is a (somewhat entertaining) parking page as of 02:13, 26 December 2015 (EST) now redirects to a page saying it is for sale? Or should I leave it as the 2015 thing for posterity?
[03:23] *** qw3rty113 has joined #archiveteam-bs
[03:27] *** Rome has quit IRC (Read error: Connection reset by peer)
[03:28] *** odemgi has joined #archiveteam-bs
[03:29] *** qw3rty112 has quit IRC (Read error: Operation timed out)
[03:30] *** odemgi_ has quit IRC (Read error: Operation timed out)
[03:34] tsquashsh: FWIW, I came across your ArchiveBot project recently, and was really impressed. I definitely think we should add a disambiguation note on the wiki page about it.
[03:37] *** odemg has quit IRC (Ping timeout: 615 seconds)
[03:44] *** odemg has joined #archiveteam-bs
[04:02] *** Tsuser has joined #archiveteam-bs
[04:03] Kaz: the tracker stopped feeding metrics to influxdb, can you take a look at why it b0rked?
[04:07] how was it feeding it initially?
[04:07] I thought you were scraping
[04:11] *** PhrackD has quit IRC (Read error: Operation timed out)
[04:12] *** tech234a has quit IRC (Quit: Connection closed for inactivity)
[04:12] *** PhrackD has joined #archiveteam-bs
[04:30] telegraf
[04:34] Oh
[04:35] I restarted telegraf, lmk if that fixes
[04:43] it did not
[04:45] actually, it did
[04:50] *** ndiddy has quit IRC (Ping timeout: 615 seconds)
[04:50] *** wyatt8740 has joined #archiveteam-bs
[05:11] *** tech234a has joined #archiveteam-bs
[05:11] *** tomaspark has quit IRC (Ping timeout: 255 seconds)
[06:15] *** RomeSilva has joined #archiveteam-bs
[06:30] *** dashcloud has quit IRC (Ping timeout: 265 seconds)
[06:31] *** dashcloud has joined #archiveteam-bs
[06:44] *** fredgido has quit IRC (Read error: Connection reset by peer)
[06:45] *** fredgido has joined #archiveteam-bs
[07:19] *** schbirid has joined #archiveteam-bs
[07:20] *** tech234a has quit IRC (Quit: Connection closed for inactivity)
[08:04] *** enowaldo has joined #archiveteam-bs
[08:06] *** Lord_Nigh has quit IRC (Ping timeout: 265 seconds)
[08:12] *** enowaldo has quit IRC (Read error: Operation timed out)
[08:15] *** BlueMax has quit IRC (Quit: Leaving)
[08:24] *** Lord_Nigh has joined #archiveteam-bs
[08:29] *** Zerote has joined #archiveteam-bs
[08:48] *** kbtoo_ has joined #archiveteam-bs
[08:55] *** kbtoo has quit IRC (Read error: Operation timed out)
[09:59] SketchCow: that guy is back: https://archive.org/details/@jerseyjack&tab=reviews
[11:08] *** frainz_ has quit IRC (Remote host closed the connection)
[11:13] *** frainz has joined #archiveteam-bs
[11:13] *** enowaldo has joined #archiveteam-bs
[11:14] *** sec^nd has quit IRC (Remote host closed the connection)
[11:23] *** enowaldo has quit IRC (Ping timeout: 265 seconds)
[11:33] hey, I've uploaded a WARC - https://archive.org/details/homepages.inspire.net.nz_2019-04-18_inspire-user-homepages - apparently I need to ask people here to get it made ready for going into the wayback machine?
[11:36] *** drcd has joined #archiveteam-bs
[11:49] *** enowaldo has joined #archiveteam-bs
[12:11] *** Despatche has joined #archiveteam-bs
[12:12] *** enowaldo has quit IRC (Read error: Operation timed out)
[12:16] *** cfarquhar has quit IRC (Read error: Operation timed out)
[12:17] *** cfarquhar has joined #archiveteam-bs
[12:19] *** Mateon1 has quit IRC (Remote host closed the connection)
[12:19] *** Mateon1 has joined #archiveteam-bs
[12:40] *** Shen has joined #archiveteam-bs
[12:47] *** enowaldo has joined #archiveteam-bs
[13:05] *** enowaldo has quit IRC (Read error: Operation timed out)
[13:08] *** Verified_ has joined #archiveteam-bs
[13:09] *** ndiddy has joined #archiveteam-bs
[13:09] *** ndiddy has quit IRC (Client Quit)
[13:26] *** enowaldo has joined #archiveteam-bs
[13:36] *** enowaldo has quit IRC (Read error: Operation timed out)
[13:41] *** enowaldo has joined #archiveteam-bs
[13:53] *** jspiros__ has quit IRC (Quit: ZNC - https://znc.in)
[13:54] *** jspiros__ has joined #archiveteam-bs
[14:05] *** Hani has quit IRC (Quit: Going offline, see ya! (www.adiirc.com))
[14:16] *** jspiros__ has quit IRC (Quit: ZNC - https://znc.in)
[15:02] *** enowaldo has quit IRC (Read error: Operation timed out)
[15:43] *** enowaldo has joined #archiveteam-bs
[15:56] Hi.
[15:56] Do you have a "go-to" scrapy project to archive websites, a "skeleton" you use when you try to archive a service?
[15:58] not really - other than starting at the root and crawling it
[15:58] Using something like https://github.com/archiveteam/grab-site might help you out?
[16:00] let's say you have dynamic resource loading, is it taken care of? (like, javascript loading more javascript). I'd say it's not taken care of, but might not be a real world problem too.
[16:02] No JS processing in wpull (or the tools that build on it like grab-site and ArchiveBot).
[16:43] *** Reventlov has quit IRC (Quit: WeeChat 2.4)
[16:43] *** Reventlov has joined #archiveteam-bs
[17:08] is there a tool that would passively download/archive youtube videos you put in a certain playlist? would be useful for convenience
[17:15] Sanqui: I think you could accomplish that by just pointing youtube-dl at the playlist and setting it to run on a certain schedule.
[17:16] Not quite sure what you mean by "passively" though
[17:20] i think Sanqui means put it in a cronjob or something similar and it runs on the regular ... kind of like a podcatcher?
[17:22] Yeah, that's what I figured. In that case, youtube-dl in a cronjob should work okay. Main annoyance is that youtube-dl has to re-download and iterate through the playlist each time, which can slow things down if the playlist gets really long.
[18:10] *** ave_ has joined #archiveteam-bs
[18:35] *** Stilettoo is now known as Stiletto
[18:47] Sanqui: check out https://archivebox.io
[18:48] or any one of the other projects that accomplish similar things here: https://github.com/pirate/ArchiveBox/wiki/Web-Archiving-Community#Web-Archiving-Projects
[19:13] *** enowaldo has quit IRC (Ping timeout: 265 seconds)
[20:04] *** killsushi has joined #archiveteam-bs
[20:18] *** enowaldo has joined #archiveteam-bs
[20:26] *** chirlu has quit IRC (Ping timeout: 255 seconds)
[20:30] *** ave_ has quit IRC (Quit: Connection closed for inactivity)
[20:32] *** enowaldo has quit IRC (Read error: Operation timed out)
[20:37] VoynichCr: I'm working on adding a bunch of libraries to https://www.archiveteam.org/index.php?title=ArchiveGLAM. Do you think it's alright to make https://www.archiveteam.org/index.php?title=ArchiveBot/National_Libraries a page for both national and state libraries?
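The youtube-dl-in-a-cronjob idea from the 17:15 to 17:22 exchange could be sketched roughly as follows. Nobody in the channel posted this; it is only an illustration. The function names, the `downloaded.txt` file name, and the output layout are hypothetical. It assumes youtube-dl is installed and on the PATH, and it relies on youtube-dl's `--download-archive` option so that videos fetched on an earlier run are skipped. The playlist listing itself is still re-read on every run, which is the slowdown mentioned at 17:22.

```python
import subprocess
from pathlib import Path


def build_command(playlist_url, target_dir):
    """Return the youtube-dl argument list for one sync run."""
    target = Path(target_dir)
    return [
        "youtube-dl",
        "--yes-playlist",    # treat the URL as a playlist
        "--ignore-errors",   # keep going if a single video fails
        # IDs of finished downloads are recorded here and skipped next time
        # (the file name is an arbitrary choice for this sketch).
        "--download-archive", str(target / "downloaded.txt"),
        # Output template: title, video ID, and extension.
        "-o", str(target / "%(title)s-%(id)s.%(ext)s"),
        playlist_url,
    ]


def sync_playlist(playlist_url, target_dir):
    """Run youtube-dl once and return its exit code."""
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    return subprocess.run(build_command(playlist_url, target_dir)).returncode
```

A small script that ends with a call such as `sync_playlist(url, "/some/dir")` (both values hypothetical) could then be scheduled with an ordinary crontab entry, for example hourly.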
[20:53] *** ndiddy has joined #archiveteam-bs
[21:09] *** enowaldo has joined #archiveteam-bs
[21:20] *** drcd has quit IRC (Read error: Connection reset by peer)
[21:45] *** enowaldo has quit IRC (Ping timeout: 265 seconds)
[21:46] *** Soni has joined #archiveteam-bs
[22:06] *** Selavi has quit IRC (Quit: verb. to stop or discontinue)
[22:11] *** Selavi has joined #archiveteam-bs
[22:41] *** enowaldo has joined #archiveteam-bs
[22:52] *** ndiddy has quit IRC ()
[23:02] *** enowaldo has quit IRC (Read error: Operation timed out)
[23:16] *** BlueMax has joined #archiveteam-bs
[23:39] *** enowaldo has joined #archiveteam-bs