#archiveteam-bs 2018-12-26,Wed

↑back Search

Time Nickname Message
00:17 πŸ”— wp494 has quit IRC (Ping timeout: 492 seconds)
00:18 πŸ”— VerfiedJ has quit IRC (Quit: Leaving)
00:20 πŸ”— wp494 has joined #archiveteam-bs
00:25 πŸ”— kbtoo_ has joined #archiveteam-bs
00:32 πŸ”— kbtoo has quit IRC (Read error: Operation timed out)
01:16 πŸ”— Dj-Wawa has quit IRC (Quit: Connection closed for inactivity)
01:57 πŸ”— twoTBHetz has quit IRC (Read error: Operation timed out)
03:41 πŸ”— edisded #textdreamer?
04:17 πŸ”— qw3rty114 has joined #archiveteam-bs
04:22 πŸ”— marked #textreamer
04:22 πŸ”— qw3rty113 has quit IRC (Read error: Operation timed out)
04:56 πŸ”— odemg has quit IRC (Ping timeout: 265 seconds)
05:09 πŸ”— odemg has joined #archiveteam-bs
05:09 πŸ”— odemg has quit IRC (Connection closed)
06:25 πŸ”— zerkalo has quit IRC (Remote host closed the connection)
06:40 πŸ”— Dj-Wawa has joined #archiveteam-bs
07:15 πŸ”— edisded has quit IRC (Quit: Connection closed for inactivity)
08:40 πŸ”— twoTBHetz has joined #archiveteam-bs
09:18 πŸ”— wp494 has quit IRC (Read error: Operation timed out)
09:18 πŸ”— wp494 has joined #archiveteam-bs
09:25 πŸ”— BlueMax has quit IRC (Quit: Leaving)
09:28 πŸ”— ubahn has joined #archiveteam-bs
10:36 πŸ”— ubahn has quit IRC (Quit: ubahn)
11:55 πŸ”— agris has quit IRC (Ping timeout: 252 seconds)
11:59 πŸ”— agris has joined #archiveteam-bs
12:08 πŸ”— agris has quit IRC (Remote host closed the connection)
12:18 πŸ”— LFlare has quit IRC (Read error: Operation timed out)
12:26 πŸ”— LFlare has joined #archiveteam-bs
13:28 πŸ”— VerfiedJ has joined #archiveteam-bs
13:51 πŸ”— odemgi has joined #archiveteam-bs
13:57 πŸ”— odemgi_ has quit IRC (Read error: Operation timed out)
14:13 πŸ”— jeekl has quit IRC (Ping timeout: 260 seconds)
14:58 πŸ”— Mateon1 has quit IRC (Ping timeout: 255 seconds)
14:58 πŸ”— Mateon1 has joined #archiveteam-bs
15:04 πŸ”— Pixi has quit IRC (Ping timeout: 255 seconds)
15:09 πŸ”— Pixi has joined #archiveteam-bs
15:25 πŸ”— phirephl- has quit IRC (Read error: Operation timed out)
15:25 πŸ”— yipdw has quit IRC (Read error: Operation timed out)
15:26 πŸ”— me has joined #archiveteam-bs
15:28 πŸ”— phirephly has joined #archiveteam-bs
15:30 πŸ”— Dj-Wawa has quit IRC (Quit: Connection closed for inactivity)
15:42 πŸ”— ubahn has joined #archiveteam-bs
16:01 πŸ”— ubahn has quit IRC (Quit: ubahn)
16:14 πŸ”— Pixi has quit IRC (Read error: Connection reset by peer)
16:15 πŸ”— Pixi has joined #archiveteam-bs
16:20 πŸ”— Pixi` has joined #archiveteam-bs
16:21 πŸ”— arkiver letΒ΄s do textreamer then :)
16:21 πŸ”— arkiver #textreamer
16:24 πŸ”— ubahn has joined #archiveteam-bs
16:26 πŸ”— Pixi has quit IRC (Read error: Operation timed out)
16:27 πŸ”— TC01 has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
16:29 πŸ”— ubahn has quit IRC (Client Quit)
16:32 πŸ”— edisded has joined #archiveteam-bs
16:49 πŸ”— TC01 has joined #archiveteam-bs
17:18 πŸ”— coldon2dr has quit IRC (Quit: https://quassel-irc.org - Chat comfortably. Anywhere.)
17:19 πŸ”— coldon2dr has joined #archiveteam-bs
17:30 πŸ”— jeekl has joined #archiveteam-bs
17:37 πŸ”— Stiletto has quit IRC (Ping timeout: 255 seconds)
17:38 πŸ”— Stiletto has joined #archiveteam-bs
17:49 πŸ”— Stilett0 has joined #archiveteam-bs
17:53 πŸ”— Stiletto has quit IRC (Read error: Operation timed out)
17:54 πŸ”— Stilett0 has quit IRC (Ping timeout: 264 seconds)
17:56 πŸ”— Stiletto has joined #archiveteam-bs
18:15 πŸ”— ubahn has joined #archiveteam-bs
18:16 πŸ”— wp494 has quit IRC (Ping timeout: 492 seconds)
18:17 πŸ”— wp494 has joined #archiveteam-bs
18:18 πŸ”— ubahn has quit IRC (Client Quit)
18:20 πŸ”— ubahn has joined #archiveteam-bs
18:25 πŸ”— ubahn has quit IRC (Quit: ubahn)
18:26 πŸ”— SmileyG it occurs to me
18:26 πŸ”— SmileyG we should maybe save how long a grab is taking
18:26 πŸ”— SmileyG as we can publish average length / size of grabs
18:26 πŸ”— SmileyG in the warrior I mean
18:27 πŸ”— ubahn has joined #archiveteam-bs
18:28 πŸ”— ubahn_ has joined #archiveteam-bs
18:29 πŸ”— ubahn_ has quit IRC (Client Quit)
18:31 πŸ”— TC01 has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.)
18:31 πŸ”— TC01 has joined #archiveteam-bs
18:31 πŸ”— ubahn has quit IRC (Ping timeout: 268 seconds)
18:36 πŸ”— wmvhater has quit IRC (Read error: Operation timed out)
18:36 πŸ”— kiska1 has quit IRC (Read error: Operation timed out)
18:36 πŸ”— wmvhater has joined #archiveteam-bs
18:36 πŸ”— kiska1 has joined #archiveteam-bs
19:06 πŸ”— Smiley has joined #archiveteam-bs
19:07 πŸ”— SmileyG has quit IRC (Quit: Lost terminal)
19:09 πŸ”— twoTBHetz has quit IRC (Remote host closed the connection)
19:12 πŸ”— horkermon has quit IRC (Quit: Connection closed for inactivity)
19:21 πŸ”— edisded has quit IRC (Quit: Connection closed for inactivity)
19:29 πŸ”— ubahn has joined #archiveteam-bs
19:30 πŸ”— ubahn_ has joined #archiveteam-bs
19:31 πŸ”— ubahn__ has joined #archiveteam-bs
19:31 πŸ”— ubahn__ has quit IRC (Client Quit)
19:31 πŸ”— ubahn has quit IRC (Read error: Operation timed out)
19:34 πŸ”— ubahn_ has quit IRC (Read error: Operation timed out)
19:45 πŸ”— edisded has joined #archiveteam-bs
19:47 πŸ”— Kaz would also need tracker integration
19:47 πŸ”— Kaz and there's *so many* variables that could affect it, that aren't due to the job itself
19:49 πŸ”— Smiley yah
19:57 πŸ”— HashbangI has quit IRC (net_error)
19:57 πŸ”— HashbangI has joined #archiveteam-bs
20:56 πŸ”— ubahn has joined #archiveteam-bs
20:56 πŸ”— ubahn has quit IRC (Client Quit)
21:38 πŸ”— BlueMax has joined #archiveteam-bs
22:37 πŸ”— marked is this new or known ? > Attackers on fos.textfiles.com might attempt to trick you into installing programs that harm your browsing experience (for example, by changing your homepage or showing extra ads on sites you visit). Learn more
22:37 πŸ”— ubahn has joined #archiveteam-bs
22:38 πŸ”— ubahn has quit IRC (Client Quit)
22:38 πŸ”— pawbs Oh no, Google Safe Browsing is upset about textfiles?
22:39 πŸ”— marked I got a warning in FF and Chrome
22:39 πŸ”— pawbs I get a warning in Safari; all of them use Google Safe Browsing.
22:39 πŸ”— JAA SketchCow: ^
22:40 πŸ”— ubahn has joined #archiveteam-bs
22:40 πŸ”— ubahn has quit IRC (Client Quit)
22:44 πŸ”— edisded has quit IRC (Quit: Connection closed for inactivity)
22:54 πŸ”— Kaz loads ok for me
22:55 πŸ”— marked https://transparencyreport.google.com/safe-browsing/search?url=fos.textfiles.com&hl=en-US
22:55 πŸ”— sims has joined #archiveteam-bs
23:01 πŸ”— buckket it also affects the main domain not only the subdomain, meh
23:02 πŸ”— marked the google search console will list the url's it found objectionable , that is if you register as administrator
23:02 πŸ”— MR9K has joined #archiveteam-bs
23:06 πŸ”— sims Does IA keep a copy of reddit?, Does anyone here keep a copy? pushshift.io currently maintains a copy of posts and comments including most removed comments and posts for the last 3 years but right now there is discussion of either deleting all the information removed by reddit or making it only available to a select elite. I don't know if anyone else maintains the same set. Current
23:06 πŸ”— sims comments will still be available and could be refetched anyway but anything removed could be lost forever. Discussion: https://www.reddit.com/r/pushshift/comments/a8xq4h/feedback_and_discussion_regarding_concerns_reddit/ files: https://files.pushshift.io/ I don't know the current total size but I know it's larger than I have available.
23:07 πŸ”— william has joined #archiveteam-bs
23:08 πŸ”— william Hi. Has anyone archived the website codeisfreespeech . com? It could be under legal threat.
23:10 πŸ”— MR9K I have a copy archived
23:11 πŸ”— MR9K https://khuxkm.tilde.team/codeisfreespeech.com-20181226.zip
23:11 πŸ”— MR9K warc and cdx
23:13 πŸ”— william Nice. Hope you aren't in New Jersey!
23:15 πŸ”— JAA sims: Ew, thanks for pointing that out. I'll look into it. I was planning on going through that data anyway for other reasons.
23:16 πŸ”— JAA Some of the older data is on IA, but not the newer one, at least last time I checked.
23:17 πŸ”— william has quit IRC (Quit: william)
23:21 πŸ”— JAA So the submissions files are around 155 GB total and the comments about 477 GB.
23:22 πŸ”— MR9K he left, but tilde.team is in germany
23:23 πŸ”— MR9K so fairly certain we'll be fine
23:23 πŸ”— JAA MR9K: Mind if I throw it into ArchiveBot?
23:24 πŸ”— horkermon has joined #archiveteam-bs
23:24 πŸ”— _niklas eh this material is probably legally dicey in most parts of the world
23:25 πŸ”— astrid MR9K: archive.org/upload
23:27 πŸ”— sims JAA Oh that small?
23:31 πŸ”— MR9K so do I upload the whole zip file or do I upload the WARC and CDX separately?
23:32 πŸ”— astrid separately as different files in the same item
23:32 πŸ”— astrid or you can just upload the warc, the cdx will be recreated from it automatically
23:35 πŸ”— VerfiedJ has quit IRC (Quit: Leaving)
23:44 πŸ”— JAA sims: Not that surprising really. It's very well compressible since it's JSON.
23:44 πŸ”— MR9K https://archive.org/details/codeisfreespeech.com-20181226 now what?
23:46 πŸ”— astrid MR9K: then you're done. you can ask info@archive.org to bless it for inclusion into web.archive.org but that's unlikely

irclogger-viewer