[00:01] *** icedice2 has quit IRC (Client Quit) [00:01] *** icedice2 has joined #archiveteam-bs [00:02] *** icedice has quit IRC (Read error: Operation timed out) [00:03] *** icedice2 has quit IRC (Client Quit) [00:03] *** icedice has joined #archiveteam-bs [00:54] *** nicolas17 has joined #archiveteam-bs [01:53] *** wyatt8740 has quit IRC (Remote host closed the connection) [02:12] *** Joseph__ has quit IRC (Quit: Leaving) [02:23] *** Selavi has quit IRC (Quit: verb. to stop or discontinue) [02:24] *** Selavi has joined #archiveteam-bs [02:47] I just read that roots-web - one of the largest genelogy forums will be shut down. [02:48] https://lists.rootsweb.com/hyperkitty/list/rootsweb-listowners-announcements.rootsweb.com/thread/35491416/ [02:49] lists https://mailinglists.rootsweb.com/listindexes/ [02:50] a lot of the content is public and datws back to the late 1990s https://lists.rootsweb.com/hyperkitty/list/posen.rootsweb.com/1998/9/ [02:51] a lot of lists are missing archives [02:51] *** oxguy3 has joined #archiveteam-bs [02:51] "Please note that we are working hard to bring mailing list archives back online, but will not be finished for some time. Thus, searching archives does not currently return complete results." [02:51] I browsed to a random list and the archives were empty [02:52] geez that's a lot of lists [02:52] the community seems spread unnecessarily thin [02:52] MeeDee: Is it this whole site that's going down, or just the lists? [02:53] "Beginning March 2nd, 2020 the Mailing Lists functionality on RootsWeb will be discontinued" [02:54] *** DiscantX has joined #archiveteam-bs [02:59] more recent years are empty but from 1998-2005 or so there are many posts [02:59] much of this is irreplaceable [03:00] this is still active https://lists.rootsweb.com/hyperkitty/list/prussia-roots.rootsweb.com/2020/1/ [03:01] first post is 1980? https://lists.rootsweb.com/hyperkitty/list/prussia-roots.rootsweb.com/1980/8/ [03:02] 1997-onwards seems more likely [03:03] *** oxguy3 has quit IRC (Ping timeout: 745 seconds) [03:04] rootsweb was bought by ancestry which allowed it to die of negelct. it was the nest first free genelogy forum and family tree website https://en.wikipedia.org/wiki/Ancestry.com#RootsWeb [03:05] They're not doing well with dates at all [03:05] no, but from 2997 onwards it is more reliable. [03:05] when they bought it, part of the deal apparently was that they would keep it online. [03:06] is this something the archiveteam can work on? [03:06] sorry.."from 12997 onwards..." [03:06] ...heh [03:06] This (https://lists.rootsweb.com/hyperkitty/list/roots.rootsweb.com/thread/16460426/) references a Usenet post [03:06] Year will overflow? [03:06] Haven't checked it [03:07] is it being shut down on March 2 or being put into read-only mode? [03:07] I just re-read the announcement, they're keeping the archives available and searchable [03:08] so I'd say it should be archived just in case but it's no hurry [03:09] Assuming that MeeDee isn't getting information from anywhere else, it looks like just archived - "After that [March 2nd], mailing list archives will remain available and searchable on RootsWeb. " [03:10] Yes, I basically wanted to get is on the AT's radar because ancestry has a habit of dumping functionality to the point that the data becomes inaccessible/non-browse-able/non-searchable [03:11] And it would might be a positively good idea to wait until after then, to get messages on the tail end, or those introduced into the archives because of their mysterious "processing" (migration between list archive programs?) [03:18] In my opinion, we should just recrawl those parts after. [03:55] I agree with hook54321 -- we don't know how stable the site will be after March [04:09] *** jamiew has joined #archiveteam-bs [04:10] *** qw3rty__ has joined #archiveteam-bs [04:14] *** qw3rty_ has quit IRC (Ping timeout: 276 seconds) [04:24] MeeDee: do you know if there's any lists not linked to in their directory or where you have to join it to view the archives? [04:28] *** DogsRNice has quit IRC (Read error: Connection reset by peer) [04:50] *** odemgi_ has joined #archiveteam-bs [04:53] *** odemgi has quit IRC (Ping timeout: 276 seconds) [05:13] @hook54321 I'll look around at the lists and will let you know [05:18] going back to the 2019 capture of the mailing list index on RootsWeb it says: "A complete index to RootsWeb's 32,740 genealogy mailing lists!" - so at least you have a ballpark number [05:20] ok so it's even more absurdly split into separate lists than I thought [05:27] very few of the mailing list categories have been captured by the IA..this may have to do with how ancestry set up rootsweb when they took over. Navigating from the Index---Categories -- List of Groups in the Categories --- details of a grouyp --- finally the archives stored under 'hyperkitty" ex: https://lists.rootsweb.com/hyperkitty/list/greek-surnames@rootsweb.com/ Once you are [05:27] there archives are set up by date: https://lists.rootsweb.com/hyperkitty/list/greek-surnames.rootsweb.com/2004/10/ To read the posts you have to go one step further to thread: https://lists.rootsweb.com/hyperkitty/list/greek-surnames.rootsweb.com/thread/23264192/ [05:30] going back to before ancestry however, it seems that the top level urls have not changed much. Ex 2002 Index http://web.archive.org/web/20010201065600/http://lists.rootsweb.com/ says "A complete index to RootsWeb's 20,733 genealogy mailing lists!" [05:34] So I am thinking the index is where you start [06:16] *** Mayonaise has quit IRC (Read error: Operation timed out) [06:17] *** gtwy has quit IRC (Read error: Operation timed out) [06:17] *** jamiew has quit IRC (Read error: Operation timed out) [06:17] *** dxrt_ has quit IRC (Read error: Operation timed out) [06:18] *** asdf0101 has quit IRC (Read error: Operation timed out) [06:18] *** Mayonaise has joined #archiveteam-bs [06:19] *** ndiddy has quit IRC (Read error: Operation timed out) [06:19] *** icedice2 has joined #archiveteam-bs [06:19] *** icedice has quit IRC (Read error: Connection reset by peer) [06:19] *** ndiddy has joined #archiveteam-bs [06:19] *** Maylay has quit IRC (Read error: Operation timed out) [06:19] *** luckcolor has quit IRC (Read error: Operation timed out) [06:20] *** Maylay has joined #archiveteam-bs [06:20] *** luckcolor has joined #archiveteam-bs [06:20] *** jamiew has joined #archiveteam-bs [06:21] *** wabu has quit IRC (Read error: Operation timed out) [06:22] *** wabu has joined #archiveteam-bs [06:24] *** ats_ has joined #archiveteam-bs [06:24] *** arktek has quit IRC (Read error: Operation timed out) [06:25] *** systwi has quit IRC (Read error: Operation timed out) [06:28] *** ats has quit IRC (Ping timeout: 622 seconds) [06:29] *** wyatt8740 has joined #archiveteam-bs [06:29] *** balrog_ has joined #archiveteam-bs [06:29] *** Gfy has quit IRC (Read error: Connection reset by peer) [06:32] *** Wingy has quit IRC (Read error: Operation timed out) [06:32] *** jamiew has quit IRC (Read error: Operation timed out) [06:33] *** arktek has joined #archiveteam-bs [06:33] *** jamiew has joined #archiveteam-bs [06:34] *** asdf0101 has joined #archiveteam-bs [06:36] *** Gfy has joined #archiveteam-bs [06:36] *** balrog has quit IRC (Read error: Operation timed out) [06:36] *** balrog_ is now known as balrog [06:37] *** morgandaw has joined #archiveteam-bs [06:38] *** MeeDee has quit IRC (Read error: Operation timed out) [06:39] *** systwi has joined #archiveteam-bs [06:42] *** nicolas17 has quit IRC (Read error: Operation timed out) [06:53] *** gtwy has joined #archiveteam-bs [08:10] *** schbirid has joined #archiveteam-bs [08:53] *** DiscantX has quit IRC (Remote host closed the connection) [09:19] jodizzle: great, socialbot has a "since:" option? [09:22] VoynichCr: socialbot doesn't (to my knowledge), but snscrape does. E.g., `snscrape twitter-search " since:"` [09:22] So I would prepare a list locally. [09:23] ah ok [09:24] i think some suspended accounts will be back, so i hope we can archive them [10:13] *** VADemon_ is now known as VADemon [10:36] *** betamax has quit IRC (Read error: Operation timed out) [11:27] *** betamax has joined #archiveteam-bs [11:43] *** katocala has joined #archiveteam-bs [11:43] *** katocala has left [11:44] *** RichardG has quit IRC (Read error: Connection reset by peer) [11:44] *** RichardG has joined #archiveteam-bs [11:56] *** BlueMax has quit IRC (Quit: Leaving) [12:41] *** HP_Archiv has joined #archiveteam-bs [12:56] *** DiscantX has joined #archiveteam-bs [13:16] *** DiscantX has quit IRC (Remote host closed the connection) [15:29] *** sirvy_ has joined #archiveteam-bs [15:33] *** sirvy has quit IRC (Ping timeout: 615 seconds) [16:19] *** X-Scale` has joined #archiveteam-bs [16:28] *** X-Scale has quit IRC (Ping timeout: 610 seconds) [16:28] *** X-Scale` is now known as X-Scale [16:33] *** systwi_ has joined #archiveteam-bs [16:39] *** systwi has quit IRC (Ping timeout: 622 seconds) [16:50] *** fangfufu has joined #archiveteam-bs [16:50] *** X-Scale` has joined #archiveteam-bs [16:57] *** X-Scale has quit IRC (Read error: Operation timed out) [16:57] *** X-Scale` is now known as X-Scale [17:04] *** synm0nger has joined #archiveteam-bs [17:05] *** underscor has quit IRC (Read error: Operation timed out) [17:05] *** underscor has joined #archiveteam-bs [17:05] *** SynMonger has quit IRC (Read error: Connection reset by peer) [17:06] jodizzle, VoynichCr: It's `snscrape twitter-search "from:username since:date"`. Searching for the username without `from:` will produce different results. The twitter-user scraper is in fact just a convenience alias to `twitter-search from:username`. [17:08] *** Mateon1 has quit IRC (Read error: Operation timed out) [17:10] *** jamiew_ has joined #archiveteam-bs [17:10] *** jamiew has quit IRC (Read error: Operation timed out) [17:10] *** Mateon1 has joined #archiveteam-bs [17:22] *** schbirid has quit IRC (Quit: Leaving) [17:59] *** VerifiedJ has joined #archiveteam-bs [18:24] *** arkiver has quit IRC (Ping timeout: 745 seconds) [18:24] *** arkiver has joined #archiveteam-bs [18:25] *** svchfoo3 sets mode: +o arkiver [18:25] *** svchfoo1 sets mode: +o arkiver [21:01] JAA: interesting, i didn't know about twitter-search [21:27] *** BlueMax has joined #archiveteam-bs [21:40] *** Wingy has joined #archiveteam-bs [22:04] *** nicolas17 has joined #archiveteam-bs [22:41] *** BlueMax has quit IRC (Read error: Connection reset by peer) [22:48] Oops, yeah, I forgot the `from:`. Thankfully I included it in my recent re-grabs, though. [22:48] :-)