#archiveteam-bs 2020-01-09,Thu

↑back Search

Time Nickname Message
00:01 🔗 icedice2 has quit IRC (Client Quit)
00:01 🔗 icedice2 has joined #archiveteam-bs
00:02 🔗 icedice has quit IRC (Read error: Operation timed out)
00:03 🔗 icedice2 has quit IRC (Client Quit)
00:03 🔗 icedice has joined #archiveteam-bs
00:54 🔗 nicolas17 has joined #archiveteam-bs
01:53 🔗 wyatt8740 has quit IRC (Remote host closed the connection)
02:12 🔗 Joseph__ has quit IRC (Quit: Leaving)
02:23 🔗 Selavi has quit IRC (Quit: verb. to stop or discontinue)
02:24 🔗 Selavi has joined #archiveteam-bs
02:47 🔗 MeeDee I just read that roots-web - one of the largest genelogy forums will be shut down.
02:48 🔗 MeeDee https://lists.rootsweb.com/hyperkitty/list/rootsweb-listowners-announcements.rootsweb.com/thread/35491416/
02:49 🔗 MeeDee lists https://mailinglists.rootsweb.com/listindexes/
02:50 🔗 MeeDee a lot of the content is public and datws back to the late 1990s https://lists.rootsweb.com/hyperkitty/list/posen.rootsweb.com/1998/9/
02:51 🔗 nicolas17 a lot of lists are missing archives
02:51 🔗 oxguy3 has joined #archiveteam-bs
02:51 🔗 nicolas17 "Please note that we are working hard to bring mailing list archives back online, but will not be finished for some time. Thus, searching archives does not currently return complete results."
02:51 🔗 nicolas17 I browsed to a random list and the archives were empty
02:52 🔗 nicolas17 geez that's a lot of lists
02:52 🔗 nicolas17 the community seems spread unnecessarily thin
02:52 🔗 OrIdow6 MeeDee: Is it this whole site that's going down, or just the lists?
02:53 🔗 nicolas17 "Beginning March 2nd, 2020 the Mailing Lists functionality on RootsWeb will be discontinued"
02:54 🔗 DiscantX has joined #archiveteam-bs
02:59 🔗 MeeDee more recent years are empty but from 1998-2005 or so there are many posts
02:59 🔗 MeeDee much of this is irreplaceable
03:00 🔗 MeeDee this is still active https://lists.rootsweb.com/hyperkitty/list/prussia-roots.rootsweb.com/2020/1/
03:01 🔗 MeeDee first post is 1980? https://lists.rootsweb.com/hyperkitty/list/prussia-roots.rootsweb.com/1980/8/
03:02 🔗 MeeDee 1997-onwards seems more likely
03:03 🔗 oxguy3 has quit IRC (Ping timeout: 745 seconds)
03:04 🔗 MeeDee rootsweb was bought by ancestry which allowed it to die of negelct. it was the nest first free genelogy forum and family tree website https://en.wikipedia.org/wiki/Ancestry.com#RootsWeb
03:05 🔗 OrIdow6 They're not doing well with dates at all
03:05 🔗 MeeDee no, but from 2997 onwards it is more reliable.
03:05 🔗 hook54321 when they bought it, part of the deal apparently was that they would keep it online.
03:06 🔗 MeeDee is this something the archiveteam can work on?
03:06 🔗 MeeDee sorry.."from 12997 onwards..."
03:06 🔗 nicolas17 ...heh
03:06 🔗 OrIdow6 This (https://lists.rootsweb.com/hyperkitty/list/roots.rootsweb.com/thread/16460426/) references a Usenet post
03:06 🔗 OrIdow6 Year will overflow?
03:06 🔗 OrIdow6 Haven't checked it
03:07 🔗 hook54321 is it being shut down on March 2 or being put into read-only mode?
03:07 🔗 nicolas17 I just re-read the announcement, they're keeping the archives available and searchable
03:08 🔗 nicolas17 so I'd say it should be archived just in case but it's no hurry
03:09 🔗 OrIdow6 Assuming that MeeDee isn't getting information from anywhere else, it looks like just archived - "After that [March 2nd], mailing list archives will remain available and searchable on RootsWeb. "
03:10 🔗 MeeDee Yes, I basically wanted to get is on the AT's radar because ancestry has a habit of dumping functionality to the point that the data becomes inaccessible/non-browse-able/non-searchable
03:11 🔗 OrIdow6 And it would might be a positively good idea to wait until after then, to get messages on the tail end, or those introduced into the archives because of their mysterious "processing" (migration between list archive programs?)
03:18 🔗 hook54321 In my opinion, we should just recrawl those parts after.
03:55 🔗 MeeDee I agree with hook54321 -- we don't know how stable the site will be after March
04:09 🔗 jamiew has joined #archiveteam-bs
04:10 🔗 qw3rty__ has joined #archiveteam-bs
04:14 🔗 qw3rty_ has quit IRC (Ping timeout: 276 seconds)
04:24 🔗 hook54321 MeeDee: do you know if there's any lists not linked to in their directory or where you have to join it to view the archives?
04:28 🔗 DogsRNice has quit IRC (Read error: Connection reset by peer)
04:50 🔗 odemgi_ has joined #archiveteam-bs
04:53 🔗 odemgi has quit IRC (Ping timeout: 276 seconds)
05:13 🔗 MeeDee @hook54321 I'll look around at the lists and will let you know
05:18 🔗 MeeDee going back to the 2019 capture of the mailing list index on RootsWeb it says: "A complete index to RootsWeb's 32,740 genealogy mailing lists!" - so at least you have a ballpark number
05:20 🔗 nicolas17 ok so it's even more absurdly split into separate lists than I thought
05:27 🔗 MeeDee very few of the mailing list categories have been captured by the IA..this may have to do with how ancestry set up rootsweb when they took over. Navigating from the Index---Categories -- List of Groups in the Categories --- details of a grouyp --- finally the archives stored under 'hyperkitty" ex: https://lists.rootsweb.com/hyperkitty/list/greek-surnames@rootsweb.com/ Once you are
05:27 🔗 MeeDee there archives are set up by date: https://lists.rootsweb.com/hyperkitty/list/greek-surnames.rootsweb.com/2004/10/ To read the posts you have to go one step further to thread: https://lists.rootsweb.com/hyperkitty/list/greek-surnames.rootsweb.com/thread/23264192/
05:30 🔗 MeeDee going back to before ancestry however, it seems that the top level urls have not changed much. Ex 2002 Index http://web.archive.org/web/20010201065600/http://lists.rootsweb.com/ says "A complete index to RootsWeb's 20,733 genealogy mailing lists!"
05:34 🔗 MeeDee So I am thinking the index is where you start
06:16 🔗 Mayonaise has quit IRC (Read error: Operation timed out)
06:17 🔗 gtwy has quit IRC (Read error: Operation timed out)
06:17 🔗 jamiew has quit IRC (Read error: Operation timed out)
06:17 🔗 dxrt_ has quit IRC (Read error: Operation timed out)
06:18 🔗 asdf0101 has quit IRC (Read error: Operation timed out)
06:18 🔗 Mayonaise has joined #archiveteam-bs
06:19 🔗 ndiddy has quit IRC (Read error: Operation timed out)
06:19 🔗 icedice2 has joined #archiveteam-bs
06:19 🔗 icedice has quit IRC (Read error: Connection reset by peer)
06:19 🔗 ndiddy has joined #archiveteam-bs
06:19 🔗 Maylay has quit IRC (Read error: Operation timed out)
06:19 🔗 luckcolor has quit IRC (Read error: Operation timed out)
06:20 🔗 Maylay has joined #archiveteam-bs
06:20 🔗 luckcolor has joined #archiveteam-bs
06:20 🔗 jamiew has joined #archiveteam-bs
06:21 🔗 wabu has quit IRC (Read error: Operation timed out)
06:22 🔗 wabu has joined #archiveteam-bs
06:24 🔗 ats_ has joined #archiveteam-bs
06:24 🔗 arktek has quit IRC (Read error: Operation timed out)
06:25 🔗 systwi has quit IRC (Read error: Operation timed out)
06:28 🔗 ats has quit IRC (Ping timeout: 622 seconds)
06:29 🔗 wyatt8740 has joined #archiveteam-bs
06:29 🔗 balrog_ has joined #archiveteam-bs
06:29 🔗 Gfy has quit IRC (Read error: Connection reset by peer)
06:32 🔗 Wingy has quit IRC (Read error: Operation timed out)
06:32 🔗 jamiew has quit IRC (Read error: Operation timed out)
06:33 🔗 arktek has joined #archiveteam-bs
06:33 🔗 jamiew has joined #archiveteam-bs
06:34 🔗 asdf0101 has joined #archiveteam-bs
06:36 🔗 Gfy has joined #archiveteam-bs
06:36 🔗 balrog has quit IRC (Read error: Operation timed out)
06:36 🔗 balrog_ is now known as balrog
06:37 🔗 morgandaw has joined #archiveteam-bs
06:38 🔗 MeeDee has quit IRC (Read error: Operation timed out)
06:39 🔗 systwi has joined #archiveteam-bs
06:42 🔗 nicolas17 has quit IRC (Read error: Operation timed out)
06:53 🔗 gtwy has joined #archiveteam-bs
08:10 🔗 schbirid has joined #archiveteam-bs
08:53 🔗 DiscantX has quit IRC (Remote host closed the connection)
09:19 🔗 VoynichCr jodizzle: great, socialbot has a "since:" option?
09:22 🔗 jodizzle VoynichCr: socialbot doesn't (to my knowledge), but snscrape does. E.g., `snscrape twitter-search "<account name> since:<date>"`
09:22 🔗 jodizzle So I would prepare a list locally.
09:23 🔗 VoynichCr ah ok
09:24 🔗 VoynichCr i think some suspended accounts will be back, so i hope we can archive them
10:13 🔗 VADemon_ is now known as VADemon
10:36 🔗 betamax has quit IRC (Read error: Operation timed out)
11:27 🔗 betamax has joined #archiveteam-bs
11:43 🔗 katocala has joined #archiveteam-bs
11:43 🔗 katocala has left
11:44 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
11:44 🔗 RichardG has joined #archiveteam-bs
11:56 🔗 BlueMax has quit IRC (Quit: Leaving)
12:41 🔗 HP_Archiv has joined #archiveteam-bs
12:56 🔗 DiscantX has joined #archiveteam-bs
13:16 🔗 DiscantX has quit IRC (Remote host closed the connection)
15:29 🔗 sirvy_ has joined #archiveteam-bs
15:33 🔗 sirvy has quit IRC (Ping timeout: 615 seconds)
16:19 🔗 X-Scale` has joined #archiveteam-bs
16:28 🔗 X-Scale has quit IRC (Ping timeout: 610 seconds)
16:28 🔗 X-Scale` is now known as X-Scale
16:33 🔗 systwi_ has joined #archiveteam-bs
16:39 🔗 systwi has quit IRC (Ping timeout: 622 seconds)
16:50 🔗 fangfufu has joined #archiveteam-bs
16:50 🔗 X-Scale` has joined #archiveteam-bs
16:57 🔗 X-Scale has quit IRC (Read error: Operation timed out)
16:57 🔗 X-Scale` is now known as X-Scale
17:04 🔗 synm0nger has joined #archiveteam-bs
17:05 🔗 underscor has quit IRC (Read error: Operation timed out)
17:05 🔗 underscor has joined #archiveteam-bs
17:05 🔗 SynMonger has quit IRC (Read error: Connection reset by peer)
17:06 🔗 JAA jodizzle, VoynichCr: It's `snscrape twitter-search "from:username since:date"`. Searching for the username without `from:` will produce different results. The twitter-user scraper is in fact just a convenience alias to `twitter-search from:username`.
17:08 🔗 Mateon1 has quit IRC (Read error: Operation timed out)
17:10 🔗 jamiew_ has joined #archiveteam-bs
17:10 🔗 jamiew has quit IRC (Read error: Operation timed out)
17:10 🔗 Mateon1 has joined #archiveteam-bs
17:22 🔗 schbirid has quit IRC (Quit: Leaving)
17:59 🔗 VerifiedJ has joined #archiveteam-bs
18:24 🔗 arkiver has quit IRC (Ping timeout: 745 seconds)
18:24 🔗 arkiver has joined #archiveteam-bs
18:25 🔗 svchfoo3 sets mode: +o arkiver
18:25 🔗 svchfoo1 sets mode: +o arkiver
21:01 🔗 VoynichCr JAA: interesting, i didn't know about twitter-search
21:27 🔗 BlueMax has joined #archiveteam-bs
21:40 🔗 Wingy has joined #archiveteam-bs
22:04 🔗 nicolas17 has joined #archiveteam-bs
22:41 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
22:48 🔗 jodizzle Oops, yeah, I forgot the `from:`. Thankfully I included it in my recent re-grabs, though.
22:48 🔗 JAA :-)

irclogger-viewer