[00:17] *** wp494 has quit IRC (Ping timeout: 492 seconds) [00:18] *** VerfiedJ has quit IRC (Quit: Leaving) [00:20] *** wp494 has joined #archiveteam-bs [00:25] *** kbtoo_ has joined #archiveteam-bs [00:32] *** kbtoo has quit IRC (Read error: Operation timed out) [01:16] *** Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) [01:57] *** twoTBHetz has quit IRC (Read error: Operation timed out) [03:41] #textdreamer? [04:17] *** qw3rty114 has joined #archiveteam-bs [04:22] #textreamer [04:22] *** qw3rty113 has quit IRC (Read error: Operation timed out) [04:56] *** odemg has quit IRC (Ping timeout: 265 seconds) [05:09] *** odemg has joined #archiveteam-bs [05:09] *** odemg has quit IRC (Connection closed) [06:25] *** zerkalo has quit IRC (Remote host closed the connection) [06:40] *** Dj-Wawa has joined #archiveteam-bs [07:15] *** edisded has quit IRC (Quit: Connection closed for inactivity) [08:40] *** twoTBHetz has joined #archiveteam-bs [09:18] *** wp494 has quit IRC (Read error: Operation timed out) [09:18] *** wp494 has joined #archiveteam-bs [09:25] *** BlueMax has quit IRC (Quit: Leaving) [09:28] *** ubahn has joined #archiveteam-bs [10:36] *** ubahn has quit IRC (Quit: ubahn) [11:55] *** agris has quit IRC (Ping timeout: 252 seconds) [11:59] *** agris has joined #archiveteam-bs [12:08] *** agris has quit IRC (Remote host closed the connection) [12:18] *** LFlare has quit IRC (Read error: Operation timed out) [12:26] *** LFlare has joined #archiveteam-bs [13:28] *** VerfiedJ has joined #archiveteam-bs [13:51] *** odemgi has joined #archiveteam-bs [13:57] *** odemgi_ has quit IRC (Read error: Operation timed out) [14:13] *** jeekl has quit IRC (Ping timeout: 260 seconds) [14:58] *** Mateon1 has quit IRC (Ping timeout: 255 seconds) [14:58] *** Mateon1 has joined #archiveteam-bs [15:04] *** Pixi has quit IRC (Ping timeout: 255 seconds) [15:09] *** Pixi has joined #archiveteam-bs [15:25] *** phirephl- has quit IRC (Read error: Operation timed out) [15:25] *** yipdw has quit IRC (Read error: Operation timed out) [15:26] *** me has joined #archiveteam-bs [15:28] *** phirephly has joined #archiveteam-bs [15:30] *** Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) [15:42] *** ubahn has joined #archiveteam-bs [16:01] *** ubahn has quit IRC (Quit: ubahn) [16:14] *** Pixi has quit IRC (Read error: Connection reset by peer) [16:15] *** Pixi has joined #archiveteam-bs [16:20] *** Pixi` has joined #archiveteam-bs [16:21] let´s do textreamer then :) [16:21] #textreamer [16:24] *** ubahn has joined #archiveteam-bs [16:26] *** Pixi has quit IRC (Read error: Operation timed out) [16:27] *** TC01 has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [16:29] *** ubahn has quit IRC (Client Quit) [16:32] *** edisded has joined #archiveteam-bs [16:49] *** TC01 has joined #archiveteam-bs [17:18] *** coldon2dr has quit IRC (Quit: https://quassel-irc.org - Chat comfortably. Anywhere.) [17:19] *** coldon2dr has joined #archiveteam-bs [17:30] *** jeekl has joined #archiveteam-bs [17:37] *** Stiletto has quit IRC (Ping timeout: 255 seconds) [17:38] *** Stiletto has joined #archiveteam-bs [17:49] *** Stilett0 has joined #archiveteam-bs [17:53] *** Stiletto has quit IRC (Read error: Operation timed out) [17:54] *** Stilett0 has quit IRC (Ping timeout: 264 seconds) [17:56] *** Stiletto has joined #archiveteam-bs [18:15] *** ubahn has joined #archiveteam-bs [18:16] *** wp494 has quit IRC (Ping timeout: 492 seconds) [18:17] *** wp494 has joined #archiveteam-bs [18:18] *** ubahn has quit IRC (Client Quit) [18:20] *** ubahn has joined #archiveteam-bs [18:25] *** ubahn has quit IRC (Quit: ubahn) [18:26] it occurs to me [18:26] we should maybe save how long a grab is taking [18:26] as we can publish average length / size of grabs [18:26] in the warrior I mean [18:27] *** ubahn has joined #archiveteam-bs [18:28] *** ubahn_ has joined #archiveteam-bs [18:29] *** ubahn_ has quit IRC (Client Quit) [18:31] *** TC01 has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [18:31] *** TC01 has joined #archiveteam-bs [18:31] *** ubahn has quit IRC (Ping timeout: 268 seconds) [18:36] *** wmvhater has quit IRC (Read error: Operation timed out) [18:36] *** kiska1 has quit IRC (Read error: Operation timed out) [18:36] *** wmvhater has joined #archiveteam-bs [18:36] *** kiska1 has joined #archiveteam-bs [19:06] *** Smiley has joined #archiveteam-bs [19:07] *** SmileyG has quit IRC (Quit: Lost terminal) [19:09] *** twoTBHetz has quit IRC (Remote host closed the connection) [19:12] *** horkermon has quit IRC (Quit: Connection closed for inactivity) [19:21] *** edisded has quit IRC (Quit: Connection closed for inactivity) [19:29] *** ubahn has joined #archiveteam-bs [19:30] *** ubahn_ has joined #archiveteam-bs [19:31] *** ubahn__ has joined #archiveteam-bs [19:31] *** ubahn__ has quit IRC (Client Quit) [19:31] *** ubahn has quit IRC (Read error: Operation timed out) [19:34] *** ubahn_ has quit IRC (Read error: Operation timed out) [19:45] *** edisded has joined #archiveteam-bs [19:47] would also need tracker integration [19:47] and there's *so many* variables that could affect it, that aren't due to the job itself [19:49] yah [19:57] *** HashbangI has quit IRC (net_error) [19:57] *** HashbangI has joined #archiveteam-bs [20:56] *** ubahn has joined #archiveteam-bs [20:56] *** ubahn has quit IRC (Client Quit) [21:38] *** BlueMax has joined #archiveteam-bs [22:37] is this new or known ? > Attackers on fos.textfiles.com might attempt to trick you into installing programs that harm your browsing experience (for example, by changing your homepage or showing extra ads on sites you visit). Learn more [22:37] *** ubahn has joined #archiveteam-bs [22:38] *** ubahn has quit IRC (Client Quit) [22:38] Oh no, Google Safe Browsing is upset about textfiles? [22:39] I got a warning in FF and Chrome [22:39] I get a warning in Safari; all of them use Google Safe Browsing. [22:39] SketchCow: ^ [22:40] *** ubahn has joined #archiveteam-bs [22:40] *** ubahn has quit IRC (Client Quit) [22:44] *** edisded has quit IRC (Quit: Connection closed for inactivity) [22:54] loads ok for me [22:55] https://transparencyreport.google.com/safe-browsing/search?url=fos.textfiles.com&hl=en-US [22:55] *** sims has joined #archiveteam-bs [23:01] it also affects the main domain not only the subdomain, meh [23:02] the google search console will list the url's it found objectionable , that is if you register as administrator [23:02] *** MR9K has joined #archiveteam-bs [23:06] Does IA keep a copy of reddit?, Does anyone here keep a copy? pushshift.io currently maintains a copy of posts and comments including most removed comments and posts for the last 3 years but right now there is discussion of either deleting all the information removed by reddit or making it only available to a select elite. I don't know if anyone else maintains the same set. Current [23:06] comments will still be available and could be refetched anyway but anything removed could be lost forever. Discussion: https://www.reddit.com/r/pushshift/comments/a8xq4h/feedback_and_discussion_regarding_concerns_reddit/ files: https://files.pushshift.io/ I don't know the current total size but I know it's larger than I have available. [23:07] *** william has joined #archiveteam-bs [23:08] Hi. Has anyone archived the website codeisfreespeech . com? It could be under legal threat. [23:10] I have a copy archived [23:11] https://khuxkm.tilde.team/codeisfreespeech.com-20181226.zip [23:11] warc and cdx [23:13] Nice. Hope you aren't in New Jersey! [23:15] sims: Ew, thanks for pointing that out. I'll look into it. I was planning on going through that data anyway for other reasons. [23:16] Some of the older data is on IA, but not the newer one, at least last time I checked. [23:17] *** william has quit IRC (Quit: william) [23:21] So the submissions files are around 155 GB total and the comments about 477 GB. [23:22] he left, but tilde.team is in germany [23:23] so fairly certain we'll be fine [23:23] MR9K: Mind if I throw it into ArchiveBot? [23:24] *** horkermon has joined #archiveteam-bs [23:24] <_niklas> eh this material is probably legally dicey in most parts of the world [23:25] MR9K: archive.org/upload [23:27] JAA Oh that small? [23:31] so do I upload the whole zip file or do I upload the WARC and CDX separately? [23:32] separately as different files in the same item [23:32] or you can just upload the warc, the cdx will be recreated from it automatically [23:35] *** VerfiedJ has quit IRC (Quit: Leaving) [23:44] sims: Not that surprising really. It's very well compressible since it's JSON. [23:44] https://archive.org/details/codeisfreespeech.com-20181226 now what? [23:46] MR9K: then you're done. you can ask info@archive.org to bless it for inclusion into web.archive.org but that's unlikely