#archiveteam-bs 2020-08-28,Fri

↑back Search

Time Nickname Message
00:12 🔗 paul2520 has joined #archiveteam-bs
00:13 🔗 Jake has joined #archiveteam-bs
00:16 🔗 Jake has quit IRC (Remote host closed the connection)
00:19 🔗 gandalf has joined #archiveteam-bs
00:24 🔗 dxrt_ has joined #archiveteam-bs
00:32 🔗 Jake has joined #archiveteam-bs
01:22 🔗 SketchCo1 is now known as SketchCow
01:24 🔗 nepeat has joined #archiveteam-bs
01:24 🔗 apache2 has joined #archiveteam-bs
01:25 🔗 DFJustin has joined #archiveteam-bs
01:25 🔗 step has joined #archiveteam-bs
01:25 🔗 zhongfu has joined #archiveteam-bs
01:25 🔗 PotcFdk has joined #archiveteam-bs
01:32 🔗 atg has joined #archiveteam-bs
01:51 🔗 Arcorann has joined #archiveteam-bs
01:52 🔗 Arcorann There's a mailing list I'm part of that's shutting down today, and I'd like to back up their archives (reference link: http://calndr-l.10958.n7.nabble.com/Calndr-l-is-Closing-td21135.html)
01:54 🔗 Arcorann Their Nabble archives go back to 2006, while their full archives (https://listserv.ecu.edu/scripts/wa.exe?A0=calndr-l) are login-only but I have an account
02:12 🔗 benjinss Arcorann: if you join the archivebot channel, you can request someone do a crawl of the site
02:12 🔗 Arcorann What about the login-only archives?
02:13 🔗 benjinss I'm not too sure. It'd still be possible to grab the data, but it might need to be processed to strip out any login info
02:13 🔗 benjinss Other people have more experience w/ that sort of thing
02:14 🔗 nico_32 it need some experiment to see if logout kill the session
02:14 🔗 nico_32 so the cookie would be meaningless
02:20 🔗 jodizzle Yes, might be possible to get the data that requires a login, but it wouldn't be in AB
02:20 🔗 jodizzle I think the nabble should work out, though.
02:25 🔗 Arcorann Thanks. While I'm at it could you throw in https://moonphase.hatenablog.com as well? The blog's been defunct for a while but due to how it moved sites I'm not sure if archive.org got everything
02:29 🔗 jodizzle Okay, I threw it in.
02:30 🔗 jodizzle For reference, while it's usually good to archive anyway, you can check wayback coverage for a site like this: https://web.archive.org/web/*/https://moonphase.hatenablog.com/*
02:37 🔗 BlueMax has joined #archiveteam-bs
03:00 🔗 Arcorann I did have a look at that, and it looks like when combined with the old URL the coverage is decent, but there are over 10000 posts and ensuring that none are missing looked like it was more trouble than it was worth
03:13 🔗 jodizzle Yep, grabbing it is definitely a good idea
03:19 🔗 qw3rty has quit IRC (Ping timeout: 610 seconds)
05:04 🔗 Mayonaise has quit IRC (Read error: Operation timed out)
05:57 🔗 cascode1 has joined #archiveteam-bs
06:00 🔗 wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES)
06:22 🔗 wp494 has joined #archiveteam-bs
07:38 🔗 qw3rty has joined #archiveteam-bs
07:43 🔗 JAA Arcorann, benjinss: ArchiveBot can't do login things (except theoretically in some very special circumstances). And login things in the WBM need to be done very carefully. What nico_32 said is one part of it. Stripping things out is a no-go.
07:48 🔗 Arcorann As I suspected. Has this sort of thing (listserv archive backups) been looked into before?
07:49 🔗 JAA We archived some public ones in AB before. Otherwise, I'm not sure.
07:49 🔗 nico_32 we could do the same things as the python yahoo groups grab
07:49 🔗 JAA Does LISTSERV let you download the emails as mbox or similar?
07:49 🔗 nico_32 one mail == one json in a folder
08:12 🔗 bsmith093 has quit IRC (Ping timeout: 265 seconds)
08:17 🔗 Arcorann I have yet to find any option to download the emails from the listserv archive
08:19 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
08:27 🔗 bsmith093 has joined #archiveteam-bs
09:06 🔗 jshoard has joined #archiveteam-bs
09:06 🔗 K4k__ has quit IRC (Read error: Operation timed out)
09:06 🔗 step has quit IRC (Quit: ZNC 1.8.0 - https://znc.in)
09:08 🔗 step has joined #archiveteam-bs
09:14 🔗 K4k__ has joined #archiveteam-bs
09:37 🔗 Ryz has quit IRC (Quit: Ping timeout (120 seconds))
09:54 🔗 Doran has quit IRC (Remote host closed the connection)
09:54 🔗 Doran has joined #archiveteam-bs
10:18 🔗 britmob has joined #archiveteam-bs
10:18 🔗 britm0b has quit IRC (Remote host closed the connection)
10:20 🔗 Raccoon` has quit IRC (Read error: Connection reset by peer)
11:13 🔗 schbirid has joined #archiveteam-bs
11:30 🔗 VerifiedJ has joined #archiveteam-bs
13:12 🔗 Mayonaise has joined #archiveteam-bs
13:25 🔗 Jon- is now known as Jon
13:55 🔗 schbirid has quit IRC (Quit: Leaving)
14:37 🔗 Arcorann Notch deleted his Twitter account?
14:40 🔗 bsmith093 has quit IRC (Read error: Operation timed out)
14:56 🔗 bsmith093 has joined #archiveteam-bs
14:57 🔗 phuzion Probably deactivated. I can't squat the name
15:17 🔗 Raccoon has joined #archiveteam-bs
15:29 🔗 systwi_ has joined #archiveteam-bs
15:34 🔗 systwi has quit IRC (Read error: Operation timed out)
15:40 🔗 Ivy has joined #archiveteam-bs
15:40 🔗 Arcorann has quit IRC (Read error: Connection reset by peer)
15:55 🔗 lennier1 Re Notch: https://twitter.com/gamemakerstk/status/1299318360505749504
15:57 🔗 Ryz has joined #archiveteam-bs
16:20 🔗 HP_Archiv has joined #archiveteam-bs
16:26 🔗 HP_Archiv has quit IRC (Quit: Leaving)
17:02 🔗 nyany has quit IRC (Read error: Operation timed out)
17:03 🔗 nyany has joined #archiveteam-bs
17:04 🔗 underscor has quit IRC (Quit: No Ping reply in 180 seconds.)
17:04 🔗 underscor has joined #archiveteam-bs
17:30 🔗 Craigle has quit IRC (Quit: The Lounge - https://thelounge.chat)
18:17 🔗 trc has left Goodbye
18:45 🔗 superkuh has quit IRC (Remote host closed the connection)
18:51 🔗 superkuh has joined #archiveteam-bs
19:09 🔗 Craigle has joined #archiveteam-bs
19:43 🔗 lennier2 has joined #archiveteam-bs
19:53 🔗 lennier1 has quit IRC (Ping timeout: 745 seconds)
19:54 🔗 lennier2 is now known as lennier1
20:32 🔗 semisimpl has joined #archiveteam-bs
21:15 🔗 cascode1 has quit IRC (Remote host closed the connection)
21:16 🔗 cascode1 has joined #archiveteam-bs
21:37 🔗 semisimpl has quit IRC (Quit: semisimpl)
21:40 🔗 VerifiedJ has quit IRC (Quit: Leaving)
23:30 🔗 Arcorann has joined #archiveteam-bs
23:30 🔗 Arcorann has quit IRC (Read error: Connection reset by peer)
23:31 🔗 Arcorann has joined #archiveteam-bs
23:34 🔗 BlueMax has joined #archiveteam-bs
23:46 🔗 jshoard has quit IRC (Leaving)
23:49 🔗 RichardG_ is now known as RichardG

irclogger-viewer