#archiveteam-bs 2019-04-18,Thu

Time Nickname Message
00:34 🔗 Rome has joined #archiveteam-bs
00:40 🔗 RomeSilva has quit IRC (Read error: Operation timed out)
00:42 🔗 enowaldo has joined #archiveteam-bs
00:48 🔗 enowaldo has quit IRC (Ping timeout: 268 seconds)
00:55 🔗 tsquashsh Kaz: I totally understand, I posted in several channels asking what people's opinions were on the move. I got the go-ahead from @arkiver but I don't mind that it's been reverted
00:57 🔗 Flashfire where did you get the go-ahead? I didn't see it in the logs
00:57 🔗 tsquashsh IIPC Slack
00:57 🔗 BlueMax has quit IRC (Quit: Leaving)
00:57 🔗 tsquashsh then PM with him directly
00:57 🔗 tsquashsh @jasonscott also mentioned it could be done, but didn't explicitly approve it, so don't blame him
00:58 🔗 tsquashsh I'll repost my original message here so you can see it
00:58 🔗 tsquashsh does anyone know who manages the ArchiveTeam wiki? I was wondering if I could talk to them about potentially renaming this page to ArchiveToolbox https://www.archiveteam.org/index.php?title=ArchiveBox (due to name-conflict with archivebox.io)
00:58 🔗 tsquashsh I am hoping to cut down on conflicting google results and it looks like a mostly dormant page stored only for historical reasons, but I also totally understand if they'd prefer to keep it with the current name. The wiki page has been there much longer than my project, so of course you guys get priority.
01:00 🔗 tsquashsh a few people helped direct me to the right place and gave me the edit password, apologies as I'm not familiar with the ArchiveTeam wiki's governance process, I just joined this stream yesterday
01:00 🔗 tsquashsh in general do you post things here and wait for some +1s? how are edits usually proposed?
01:01 🔗 tsquashsh You guys are my heroes so I definitely don't want to go stepping on toes ;P
01:08 🔗 tsquashsh Flashfire: would you be ok with me adding a disambiguation section to the top/bottom of that wiki that mentions it's unrelated? I'll also add a disambiguation section in the ArchiveBox.io docs
01:09 🔗 Flashfire I have no holding except for in URLTeam at this stage my friend
01:10 🔗 JAA Yeah, things like that would usually be discussed here.
01:11 🔗 Flashfire ^
01:12 🔗 JAA A disambiguation notice sounds good. I'm not sure if you really need to add one on your docs as well.
01:12 🔗 tsquashsh I have a big community page where I list related projects, so i'd just make a small note there next to it, nbd https://github.com/pirate/ArchiveBox/wiki/Web-Archiving-Community
01:13 🔗 JAA Ah yeah, that sounds good.
01:15 🔗 tsquashsh `Note: This page describes a collection of unix tools curated by ArchiveTeam and is unrelated to the ArchiveBox.io project.`
01:15 🔗 tsquashsh how about that?
01:19 🔗 Zerote has quit IRC (Ping timeout: 260 seconds)
01:20 🔗 BlueMax has joined #archiveteam-bs
01:22 🔗 tech234a has joined #archiveteam-bs
01:26 🔗 Despatche has quit IRC (Read error: Connection reset by peer)
02:36 🔗 kbtoo has joined #archiveteam-bs
02:37 🔗 kbtoo_ has quit IRC (Ping timeout: 255 seconds)
02:56 🔗 PhrackD has quit IRC (Read error: Connection reset by peer)
03:00 🔗 PhrackD has joined #archiveteam-bs
03:16 🔗 Flashfire even though it's already in the dead section, is it worth updating the entry "a.gd - redirects to eg.gg which is a (somewhat entertaining) parking page as of 02:13, 26 December 2015 (EST)" to say that it now redirects to a page saying it is for sale? Or should I leave it as the 2015 thing for posterity?
03:23 🔗 qw3rty113 has joined #archiveteam-bs
03:27 🔗 Rome has quit IRC (Read error: Connection reset by peer)
03:28 🔗 odemgi has joined #archiveteam-bs
03:29 🔗 qw3rty112 has quit IRC (Read error: Operation timed out)
03:30 🔗 odemgi_ has quit IRC (Read error: Operation timed out)
03:34 🔗 Somebody2 tsquashsh: FWIW, I came across your ArchiveBox project recently, and was really impressed. I definitely think we should add a disambiguation note on the wiki page about it.
03:37 🔗 odemg has quit IRC (Ping timeout: 615 seconds)
03:44 🔗 odemg has joined #archiveteam-bs
04:02 🔗 Tsuser has joined #archiveteam-bs
04:03 🔗 Fusl Kaz: the tracker stopped feeding metrics to influxdb, can you take a look at why it b0rked?
04:07 🔗 Kaz how was it feeding it initially?
04:07 🔗 Kaz I thought you were scraping
04:11 🔗 PhrackD has quit IRC (Read error: Operation timed out)
04:12 🔗 tech234a has quit IRC (Quit: Connection closed for inactivity)
04:12 🔗 PhrackD has joined #archiveteam-bs
04:30 🔗 Fusl telegraf
04:34 🔗 Kaz Oh
04:35 🔗 Kaz I restarted telegraf, lmk if that fixes
04:43 🔗 Fusl it did not
04:45 🔗 Fusl actually, it did
04:50 🔗 ndiddy has quit IRC (Ping timeout: 615 seconds)
04:50 🔗 wyatt8740 has joined #archiveteam-bs
05:11 🔗 tech234a has joined #archiveteam-bs
05:11 🔗 tomaspark has quit IRC (Ping timeout: 255 seconds)
06:15 🔗 RomeSilva has joined #archiveteam-bs
06:30 🔗 dashcloud has quit IRC (Ping timeout: 265 seconds)
06:31 🔗 dashcloud has joined #archiveteam-bs
06:44 🔗 fredgido has quit IRC (Read error: Connection reset by peer)
06:45 🔗 fredgido has joined #archiveteam-bs
07:19 🔗 schbirid has joined #archiveteam-bs
07:20 🔗 tech234a has quit IRC (Quit: Connection closed for inactivity)
08:04 🔗 enowaldo has joined #archiveteam-bs
08:06 🔗 Lord_Nigh has quit IRC (Ping timeout: 265 seconds)
08:12 🔗 enowaldo has quit IRC (Read error: Operation timed out)
08:15 🔗 BlueMax has quit IRC (Quit: Leaving)
08:24 🔗 Lord_Nigh has joined #archiveteam-bs
08:29 🔗 Zerote has joined #archiveteam-bs
08:48 🔗 kbtoo_ has joined #archiveteam-bs
08:55 🔗 kbtoo has quit IRC (Read error: Operation timed out)
09:59 🔗 godane SketchCow: that guy is back: https://archive.org/details/@jerseyjack&tab=reviews
11:08 🔗 frainz_ has quit IRC (Remote host closed the connection)
11:13 🔗 frainz has joined #archiveteam-bs
11:13 🔗 enowaldo has joined #archiveteam-bs
11:14 🔗 sec^nd has quit IRC (Remote host closed the connection)
11:23 🔗 enowaldo has quit IRC (Ping timeout: 265 seconds)
11:33 🔗 eythian hey, I've uploaded a WARC - https://archive.org/details/homepages.inspire.net.nz_2019-04-18_inspire-user-homepages - apparently I need to ask people here to get it made ready for going into the wayback machine?
11:36 🔗 drcd has joined #archiveteam-bs
11:49 🔗 enowaldo has joined #archiveteam-bs
12:11 🔗 Despatche has joined #archiveteam-bs
12:12 🔗 enowaldo has quit IRC (Read error: Operation timed out)
12:16 🔗 cfarquhar has quit IRC (Read error: Operation timed out)
12:17 🔗 cfarquhar has joined #archiveteam-bs
12:19 🔗 Mateon1 has quit IRC (Remote host closed the connection)
12:19 🔗 Mateon1 has joined #archiveteam-bs
12:40 🔗 Shen has joined #archiveteam-bs
12:47 🔗 enowaldo has joined #archiveteam-bs
13:05 🔗 enowaldo has quit IRC (Read error: Operation timed out)
13:08 🔗 Verified_ has joined #archiveteam-bs
13:09 🔗 ndiddy has joined #archiveteam-bs
13:09 🔗 ndiddy has quit IRC (Client Quit)
13:26 🔗 enowaldo has joined #archiveteam-bs
13:36 🔗 enowaldo has quit IRC (Read error: Operation timed out)
13:41 🔗 enowaldo has joined #archiveteam-bs
13:53 🔗 jspiros__ has quit IRC (Quit: ZNC - https://znc.in)
13:54 🔗 jspiros__ has joined #archiveteam-bs
14:05 🔗 Hani has quit IRC (Quit: Going offline, see ya! (www.adiirc.com))
14:16 🔗 jspiros__ has quit IRC (Quit: ZNC - https://znc.in)
15:02 🔗 enowaldo has quit IRC (Read error: Operation timed out)
15:43 🔗 enowaldo has joined #archiveteam-bs
15:56 🔗 Reventlov Hi.
15:56 🔗 Reventlov Do you have a "go-to" scrapy project to archive websites, a "skeleton" you use when you try to archive a service?
15:58 🔗 Kaz not really - other than starting at the root and crawling it
15:58 🔗 Kaz Using something like https://github.com/archiveteam/grab-site might help you out?
16:00 🔗 Reventlov let's say you have dynamic resource loading, is it taken care of? (like, javascript loading more javascript). I'd say it's not taken care of, but it might not be a real-world problem either.
16:02 🔗 JAA No JS processing in wpull (or the tools that build on it like grab-site and ArchiveBot).
16:43 🔗 Reventlov has quit IRC (Quit: WeeChat 2.4)
16:43 🔗 Reventlov has joined #archiveteam-bs
17:08 🔗 Sanqui is there a tool that would passively download/archive youtube videos you put in a certain playlist? would be useful for convenience
17:15 🔗 jodizzle Sanqui: I think you could accomplish that by just pointing youtube-dl at the playlist and setting it to run on a certain schedule.
17:16 🔗 jodizzle Not quite sure what you mean by "passively" though
17:20 🔗 astrid i think Sanqui means put it in a cronjob or something similar and it runs on the regular ... kind of like a podcatcher?
17:22 🔗 jodizzle Yeah, that's what I figured. In that case, youtube-dl in a cronjob should work okay. Main annoyance is that youtube-dl has to re-download and iterate through the playlist each time, which can slow things down if the playlist gets really long.
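The scheduled youtube-dl approach discussed above could be sketched roughly as follows. This is only an illustration, not something posted in the channel: the function names, the `downloaded.txt` file name, and the directory layout are assumptions. The one youtube-dl feature it leans on is `--download-archive`, which records the IDs of finished videos so they are skipped on later runs. youtube-dl still has to fetch and walk the whole playlist each time, so the slowdown jodizzle mentions for long playlists remains.

```python
import subprocess
from pathlib import Path


def build_command(playlist_url, archive_file="downloaded.txt"):
    """Return the youtube-dl argument list for one scheduled run.

    archive_file is a hypothetical file name; youtube-dl appends the ID of
    each completed video to it and skips those IDs on later runs.
    """
    return [
        "youtube-dl",
        "--ignore-errors",                    # keep going if one video fails
        "--download-archive", archive_file,   # remember what is already saved
        playlist_url,
    ]


def sync_playlist(playlist_url, target_dir):
    """Download any videos from the playlist that are not yet in target_dir.

    Assumes youtube-dl is installed and on PATH. Returns its exit code.
    """
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    # Run inside target_dir so both the videos and the archive file land there.
    completed = subprocess.run(build_command(playlist_url), cwd=target)
    return completed.returncode
```

A cronjob (or any other scheduler) would then call `sync_playlist(...)` with the playlist URL and a destination directory at whatever interval is convenient.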
18:10 🔗 ave_ has joined #archiveteam-bs
18:35 🔗 Stilettoo is now known as Stiletto
18:47 🔗 tsquashsh Sanqui: check out https://archivebox.io
18:48 🔗 tsquashsh or any one of the other projects that accomplish similar things here: https://github.com/pirate/ArchiveBox/wiki/Web-Archiving-Community#Web-Archiving-Projects
19:13 🔗 enowaldo has quit IRC (Ping timeout: 265 seconds)
20:04 🔗 killsushi has joined #archiveteam-bs
20:18 🔗 enowaldo has joined #archiveteam-bs
20:26 🔗 chirlu has quit IRC (Ping timeout: 255 seconds)
20:30 🔗 ave_ has quit IRC (Quit: Connection closed for inactivity)
20:32 🔗 enowaldo has quit IRC (Read error: Operation timed out)
20:37 🔗 jodizzle VoynichCr: I'm working on adding a bunch of libraries to https://www.archiveteam.org/index.php?title=ArchiveGLAM. Do you think it's alright to make https://www.archiveteam.org/index.php?title=ArchiveBot/National_Libraries a page for both national and state libraries?
20:53 🔗 ndiddy has joined #archiveteam-bs
21:09 🔗 enowaldo has joined #archiveteam-bs
21:20 🔗 drcd has quit IRC (Read error: Connection reset by peer)
21:45 🔗 enowaldo has quit IRC (Ping timeout: 265 seconds)
21:46 🔗 Soni has joined #archiveteam-bs
22:06 🔗 Selavi has quit IRC (Quit: verb. to stop or discontinue)
22:11 🔗 Selavi has joined #archiveteam-bs
22:41 🔗 enowaldo has joined #archiveteam-bs
22:52 🔗 ndiddy has quit IRC ()
23:02 🔗 enowaldo has quit IRC (Read error: Operation timed out)
23:16 🔗 BlueMax has joined #archiveteam-bs
23:39 🔗 enowaldo has joined #archiveteam-bs
