[00:02] Raccoon, how can you tell if someone is under 12?
[00:03] abstract: i believe this applies to logged on google accounts. beats me how a 12 year old is allowed to register a google account, or why would they bother since they can no longer write comments on youtube videos now.
[00:04] does google have a setting for parents "my kid is using the browser / phone now"
[00:19] *** godane has joined #archiveteam-bs
[00:44] google doesn't allow accounts for under age. the FTC's complaint said parents give devices to kids on the parents' accounts
[00:45] *** SynMonger has quit IRC (Read error: Operation timed out)
[00:45] google's official product for underage was youtube kids, but the complaint was not enough parents even know about it or see why it's worth the trouble
[01:05] *** manjaro-u has quit IRC (Konversation terminated!)
[01:10] i suppose that the FTC action is a step in the right direction to making those products available and useful for effective parenting decisions
[01:11] i can't find anything in chrome for creating a guest user/account that specifies an age or $is_child checkbox
[01:11] without first registering a gmail account for that user
[01:11] (you can only input a date of birth in the gmail account registration)
[01:21] *** Video has joined #archiveteam-bs
[02:31] *** BlueMax has quit IRC (Ping timeout: 745 seconds)
[02:38] *** BlueMax has joined #archiveteam-bs
[02:51] so i bought 6 issues of Mobile Beat i found at savers today
[02:51] turns out that luckily the full back catalog of issues is online
[02:51] https://www.mobilebeat.com/digital-edition/
[02:52] only bad news is that they're not all pdf with ocr
[02:56] turns out the pdfs may be incomplete anyways
[02:59] or maybe they're shorter on pages with more recent issues
[03:00] i only say that cause page 25 of issue 184 says 'The Last Word' for the page
[03:30] *** programme is now known as prq
[03:46] JAA: So far, 184 responses.
[03:47] "Thank you for doing this. My mom passed away in 2018.
Her tweets weren’t groundbreaking (mostly tweeting at her favorite shows or at democrat politicians she didn’t like) and I don’t agree with a lot of what she tweeted, but it’s one of the few things I have from her that I can readily visit when I miss her."
[03:47] "Thank you so much for doing this. I'm really really upset about this, as my dad's twitter account means a lot to me."
[03:47] "Thank you for doing this. This is my deceased wife’s account. She was only 35. Next month will be five years."
[04:13] related to that^, I don't think I have access to the main page so can someone pls update it for me?
[04:16] *** odemgi_ has joined #archiveteam-bs
[04:17] *** qw3rty2 has joined #archiveteam-bs
[04:18] I just went through old twitter notifications trying to figure out the public twitter account of a friend who passed away
[04:18] sigh
[04:19] *** odemgi has quit IRC (Ping timeout: 252 seconds)
[04:19] couldn't find the public one yet and it seems most of it was in protected accounts anyway
[04:23] I guess I need to grab all DMs too
[04:26] *** qw3rty has quit IRC (Ping timeout: 745 seconds)
[04:27] *** odemg has quit IRC (Ping timeout: 745 seconds)
[04:31] *** odemg has joined #archiveteam-bs
[04:38] found one by digging through old website snapshots in the wayback machine. can somebody query me to get the account added to the list?
[04:50] *** kiska18 has quit IRC (Remote host closed the connection)
[04:50] *** Ryz has quit IRC (Remote host closed the connection)
[04:50] *** Ryz has joined #archiveteam-bs
[04:50] *** kiska18 has joined #archiveteam-bs
[05:01] Aighgh -- those are (routine, sadly) horrifying responses to the Twittering Dead form.
[05:07] *** mtntmnky has quit IRC (Remote host closed the connection)
[05:08] *** SynMonger has joined #archiveteam-bs
[05:12] *** mtntmnky has joined #archiveteam-bs
[05:16] *** SynMonger has quit IRC (Wait, what?)
[05:43] *** akierig has joined #archiveteam-bs
[05:44] *** SmileyG has quit IRC (Read error: Operation timed out)
[05:44] *** Smiley has joined #archiveteam-bs
[06:01] *** kiska has quit IRC (Remote host closed the connection)
[06:01] *** Flashfire has quit IRC (Remote host closed the connection)
[06:02] *** kiska has joined #archiveteam-bs
[06:02] *** Fusl sets mode: +o kiska
[06:02] *** Fusl__ sets mode: +o kiska
[06:02] *** Fusl_ sets mode: +o kiska
[06:02] *** Flashfire has joined #archiveteam-bs
[06:03] *** HP_Archiv has joined #archiveteam-bs
[06:25] *** m007a83 has quit IRC (Quit: Fuck you Comcast)
[06:33] *** akierig has quit IRC (Quit: later_gator)
[08:45] *** MrRadar has quit IRC (Read error: Operation timed out)
[09:57] *** VADemon has joined #archiveteam-bs
[10:21] *** eientei95 has quit IRC (Remote host closed the connection)
[10:27] *** eientei95 has joined #archiveteam-bs
[10:27] *** eientei95 has quit IRC (Handshake flooding)
[10:29] *** eientei95 has joined #archiveteam-bs
[11:02] *** X-Scale` has joined #archiveteam-bs
[11:04] *** X-Scale has quit IRC (Ping timeout: 252 seconds)
[11:04] *** X-Scale` is now known as X-Scale
[11:11] *** raeyulca has joined #archiveteam-bs
[11:31] *** X-Scale has quit IRC (Quit: HydraIRC -> http://www.hydrairc.com <- Organize your IRC)
[11:43] *** X-Scale has joined #archiveteam-bs
[11:46] *** X-Scale has quit IRC (Client Quit)
[11:59] *** X-Scale has joined #archiveteam-bs
[12:10] *** BlueMax has quit IRC (Read error: Connection reset by peer)
[12:14] SketchCow I'd love to see the results of the "Twitter is:" poll on that google form lol
[12:58] *** SynMonger has joined #archiveteam-bs
[13:14] *** anonymiga has quit IRC (Quit: Lost terminal)
[13:20] *** bluefoo has quit IRC (Read error: Operation timed out)
[13:25] *** kiska18 has quit IRC (Read error: Operation timed out)
[13:26] *** kiska18 has joined #archiveteam-bs
[13:26] *** Fusl sets mode: +o kiska18
[13:26] *** Fusl__ sets mode: +o kiska18
[13:26] *** Fusl_
sets mode: +o kiska18
[13:26] *** Ryz has quit IRC (Read error: Connection reset by peer)
[13:27] *** Ryz7 has joined #archiveteam-bs
[13:51] *** Sanky is now known as Sanqui
[13:57] when you create a new repo on github, it gets a few git clones on the first day (according to Insights tab)
[13:58] they must be some archiving, mirroring projects
[13:59] *** mtntmnky has quit IRC (Remote host closed the connection)
[13:59] *** mtntmnky has joined #archiveteam-bs
[14:07] uploading to github a repo with a wiktionary dump (5GB) split on < 100MB files
[14:09] kind of a rosetta stone, it has 6 million words in 4000 languages
[14:10] (anyone) I'm looking for help with page scraping for metadata, and hoping there might be some ready-made tools to batch process locally saved html pages by creating custom templates with simple rules.
[14:43] *** Craigle has quit IRC (Ping timeout: 496 seconds)
[14:43] *** Craigle has joined #archiveteam-bs
[15:24] *** superkuh_ is now known as superkuh
[15:32] *** K4k has quit IRC (Read error: Connection reset by peer)
[15:56] *** Jamesatja has joined #archiveteam-bs
[16:24] Soo, VoynichCr gave me a list of 181k Twitter accounts from Wikidata. Turns out it has some duplicates, didn't check for that before. It's actually 178960 accounts, and 38288 of them have either never tweeted or last tweeted before 2019-01-01T00:00:00Z.
[16:26] There are also 7796 accounts which no longer exist and 1898 are presumably suspended (status 302).
[16:36] I guess these can be considered important enough that we should probably archive them.
[16:37] There's no way ArchiveBot/socialbot is going to tear through 38k Twitter accounts in two weeks though.
[16:37] (Unless we fully dedicate it to that task.)
[16:37] interesting results, i didn't think there were so many closed accounts
[16:41] FYI, Etsy has instituted a bunch of new rules screwing over their sellers -- probably good to raise the priority on grabbing it, eventually.
I don't have a link with more details, sadly.
[16:43] *** RichardG_ has joined #archiveteam-bs
[16:43] Here are the Twitter scan results: https://transfer.notkiska.pw/2BIyq/wikidata-twitter-20191127-181k-results
[16:44] It's lines of username, HTTP status, and timestamp of last activity.
[16:44] The last column can be 0 if the account has never tweeted or -1 if it doesn't exist (deleted or suspended).
[16:44] *** Ryz7 is now known as Ryz
[16:45] is that a csv then? what format
[16:46] JAA: Which accounts need a backup?
[16:46] All of them?
[16:46] could probably back up all of them just to identify the account registration time and profile data/photos
[16:47] to tell apart new owners of old names?
[16:47] I can extract the registration time if needed.
[16:47] Grabbed all the profile pages into WARCs.
[16:47] that would be the most useful feature, i think, in telling apart new owners from old owners
[16:47] No images or anything though.
[16:48] PurpleSym: Primarily those which haven't been active since June I guess, since those might get purged on the 11th.
[16:48] @twitterhandle%20071231 vs @twitterhandle%20201231
[16:49] (Some of them will have been active in other ways than tweeting on their timeline, obviously.)
[16:49] With lower priority, I think it would be nice to grab all of them.
[16:49] *** RichardG has quit IRC (Ping timeout: 615 seconds)
[16:50] chromebot will be way too slow, but I can offer to at least try and grab as many as possible (without using the IRC frontend).
[16:51] I've been thinking about coupling snscrape with qwarc.
[16:52] darn, @raccoon isn't in the list :'(
[16:54] 51152 accounts have not tweeted since 2019-06-12, the presumable cutoff date for the purge.
[16:54] awk '$3 >= 0 && $3 < 1560297600' wikidata-twitter-20191127-181k-results
[16:54] JAA: Sure, that would certainly be faster. Well, my offer stands.
[16:54] lots of good 3 letter names in there
[16:55] Raccoon: Take that to -ot please.
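(Editor's sketch, not part of the channel log.) The awk filter above keeps accounts that still exist (third column >= 0) and whose last activity predates 2019-06-12T00:00:00Z (1560297600). A rough Python equivalent, assuming the results file is whitespace-separated as the awk one-liner implies; the function name and the sample rows, including their status codes, are made up for illustration and are not taken from the real file:

```python
from datetime import datetime, timezone

# 2019-06-12T00:00:00Z as a Unix timestamp; same value as in the awk filter.
CUTOFF = int(datetime(2019, 6, 12, tzinfo=timezone.utc).timestamp())


def inactive_accounts(lines, cutoff=CUTOFF):
    """Yield usernames that exist (last >= 0) and were last active before cutoff.

    Each line is assumed to be: username, HTTP status, last-activity timestamp.
    """
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        username, last = fields[0], int(fields[2])
        if 0 <= last < cutoff:
            yield username


# Hypothetical sample rows, only to show the three cases described in the log.
sample = [
    "alice 200 1500000000",  # last tweeted before the cutoff
    "bob 200 0",             # never tweeted
    "carol 404 -1",          # does not exist (status here is a guess)
    "dave 200 1570000000",   # active after the cutoff
]
print(list(inactive_accounts(sample)))
```

To run it against the real scan results, the same function could be fed an open file object instead of the sample list.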
[16:56] JAA: #twitterdead
[17:18] *** bluefoo has joined #archiveteam-bs
[17:29] *** Jamesatja has quit IRC (Read error: Connection reset by peer)
[17:31] *** bluefoo has quit IRC (Remote host closed the connection)
[17:37] *** wyatt8740 has quit IRC (Read error: Operation timed out)
[17:37] *** Ing3b0rg has quit IRC (Ping timeout: 252 seconds)
[17:39] *** deevious has quit IRC (Ping timeout: 252 seconds)
[17:40] *** Zerote_ has joined #archiveteam-bs
[17:40] *** kiska has quit IRC (Ping timeout: 252 seconds)
[17:40] *** Ing3b0rg has joined #archiveteam-bs
[17:42] *** Zerote has quit IRC (Ping timeout: 252 seconds)
[17:43] *** britmob_ has joined #archiveteam-bs
[17:44] *** kiska has joined #archiveteam-bs
[17:44] *** Fusl sets mode: +o kiska
[17:44] *** Fusl__ sets mode: +o kiska
[17:44] *** Fusl_ sets mode: +o kiska
[17:46] *** bluefoo has joined #archiveteam-bs
[17:46] *** britmob has quit IRC (Ping timeout: 252 seconds)
[17:46] *** anarcat has quit IRC (Ping timeout: 252 seconds)
[17:48] *** kiska has quit IRC (Ping timeout: 252 seconds)
[17:51] *** katocala has quit IRC (Ping timeout: 252 seconds)
[17:51] *** katocala has joined #archiveteam-bs
[17:53] *** _niklas has joined #archiveteam-bs
[17:53] *** cppchrisc has quit IRC (Ping timeout: 252 seconds)
[17:54] *** cppchrisc has joined #archiveteam-bs
[17:54] *** cppchrisc has quit IRC (Connection closed)
[17:54] *** cppchrisc has joined #archiveteam-bs
[17:54] *** cppchrisc has quit IRC (Connection closed)
[17:55] *** Flashfire has quit IRC (Ping timeout: 252 seconds)
[17:56] *** cppchrisc has joined #archiveteam-bs
[17:57] *** anarcat has joined #archiveteam-bs
[17:57] *** anarcat has quit IRC (Handshake flooding)
[18:00] *** Flashfire has joined #archiveteam-bs
[18:02] *** MrRadar has joined #archiveteam-bs
[18:02] *** anarcat has joined #archiveteam-bs
[18:03] SketchCow: i suggest you move that over to hackint
[18:03] JAA arkiver: watch me
[18:03] *** Fusl has quit IRC (Quit: Moving to hackint)
[18:04] ***
Fusl__ has quit IRC (Quit: Moving to hackint)
[18:04] *** Fusl_ has quit IRC (Quit: Moving to hackint)
[18:04] *** systwiAL_ has joined #archiveteam-bs
[18:04] Ro what
[18:09] *** systwiALT has quit IRC (Read error: Operation timed out)
[18:12] *** systwiAL_ is now known as systwiALT
[18:12] *** raeyulca has quit IRC (Ping timeout: 496 seconds)
[18:18] *** deevious has joined #archiveteam-bs
[18:20] JAA: i think you were looking for something like that at some point? https://github.com/Truelite/closesocks
[18:24] *** kiska has joined #archiveteam-bs
[18:25] *** svchfoo3 sets mode: +o kiska
[18:25] *** svchfoo1 sets mode: +o kiska
[18:27] anarcat: Yes, that sounds exactly like what we'd need with wpull. I don't see how it actually closes any connections with the options passed to ss, but with a modern enough ss version, it seems that there's a -K switch that can be added to do so.
[18:27] The ss man page is maximally unhelpful though regarding the filter syntax. "Please take a look at the official documentation for details regarding filters." Uh, ok...
[18:29] It's part of the iproute2 package on Debian, and the docs for iproute2 don't even mention the tool at all. Wat?
[18:30] But thanks, definitely something to dig deeper into.
:-)
[18:35] *** Deewiant has quit IRC (Ping timeout: 258 seconds)
[18:35] *** Deewiant has joined #archiveteam-bs
[18:38] *** _niklas has quit IRC (Ping timeout: 258 seconds)
[18:38] JAA: enrico (on OFTC) said the same about the "ss" documentation
[18:38] *** _niklas has joined #archiveteam-bs
[18:38] *** Deewiant has quit IRC (se.hub efnet.portlane.se)
[18:38] *** katocala has quit IRC (se.hub efnet.portlane.se)
[18:38] *** Smiley has quit IRC (se.hub efnet.portlane.se)
[18:38] *** mls has quit IRC (se.hub efnet.portlane.se)
[18:38] *** klg has quit IRC (se.hub efnet.portlane.se)
[18:38] *** Gfy has quit IRC (se.hub efnet.portlane.se)
[18:38] *** Jon has quit IRC (se.hub efnet.portlane.se)
[18:38] *** Laverne_ has quit IRC (se.hub efnet.portlane.se)
[18:38] *** VoynichCr has quit IRC (se.hub efnet.portlane.se)
[18:39] I guess it might be the same filter syntax as on some of the other iproute2 tools?
[18:39] *** bluefoo has quit IRC (Ping timeout: 745 seconds)
[18:40] no idea at all, ask enrico :)
[18:40] Will do once I have time to look into this in more detail. Thanks again!
[18:51] *** SmileyG has joined #archiveteam-bs
[18:53] *** katocala has joined #archiveteam-bs
[18:54] *** yuitimoth has joined #archiveteam-bs
[18:55] *** MrRadar has quit IRC (Read error: Operation timed out)
[18:55] *** Gfy has joined #archiveteam-bs
[18:55] *** Deewiant has joined #archiveteam-bs
[18:56] *** systwiAL_ has joined #archiveteam-bs
[18:58] *** MrRadar has joined #archiveteam-bs
[18:58] *** bluefoo has joined #archiveteam-bs
[18:59] *** klg has joined #archiveteam-bs
[19:02] *** MrRadar has quit IRC (Read error: Operation timed out)
[19:03] *** systwiALT has quit IRC (Read error: Operation timed out)
[19:10] *** HP_Archiv has quit IRC (Quit: Leaving)
[19:13] *** Jon has joined #archiveteam-bs
[19:21] ^ Added even though it doesn't explicitly mention AT since it's obviously about AT.
[19:37] *** mls has joined #archiveteam-bs
[19:42] *** VoynichCr has joined #archiveteam-bs
[19:58] *** Laverne has joined #archiveteam-bs
[20:03] *** bluefoo has quit IRC (Read error: Connection reset by peer)
[20:04] JAA: re enrico, you can also email him, enrico@debian.org
[20:30] *** HP_Archiv has joined #archiveteam-bs
[20:37] *** systwiAL_ is now known as systwiALT
[20:45] *** Raccoon` has joined #archiveteam-bs
[20:51] *** Cameron_D has quit IRC (Quit: :(){ :|:& };:)
[20:51] *** bluefoo has joined #archiveteam-bs
[20:53] *** Raccoon has quit IRC (Ping timeout: 622 seconds)
[20:53] *** Raccoon` is now known as Raccoon
[20:53] *** Cameron_D has joined #archiveteam-bs
[20:54] *** Smiley has joined #archiveteam-bs
[20:54] *** SmileyG has quit IRC (Remote host closed the connection)
[20:55] *** Raccoon` has joined #archiveteam-bs
[20:58] *** Raccoon has quit IRC (Ping timeout: 258 seconds)
[20:58] *** Raccoon` is now known as Raccoon
[21:02] *** MrRadar has joined #archiveteam-bs
[21:02] *** Smiley has quit IRC (bye.)
[21:03] *** Smiley has joined #archiveteam-bs
[21:19] *** Raccoon has quit IRC (Remote host closed the connection)
[21:29] *** manjaro-u has joined #archiveteam-bs
[22:03] *** Smiley has quit IRC (http://www.milkme.co.uk - You'll never understand.)
[22:04] *** Smiley has joined #archiveteam-bs
[22:10] *** Smiley has quit IRC (Quit: http://www.milkme.co.uk - You'll never understand.)
[22:11] *** Smiley has joined #archiveteam-bs
[22:11] *** Smiley has quit IRC (Client Quit)
[22:11] *** Smiley has joined #archiveteam-bs
[22:56] *** Dark_Star has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
[23:05] *** ryry has joined #archiveteam-bs
[23:15] *** VerifiedJ has quit IRC (Read error: Connection reset by peer)
[23:17] *** Dark_Star has joined #archiveteam-bs
[23:21] *** dewdropaw has quit IRC (Quit: I object! That was... objectionable!)
[23:37] *** BlueMax has joined #archiveteam-bs
[23:53] *** manjaro-u has quit IRC (Ping timeout: 252 seconds)