#archiveteam-bs 2017-12-26,Tue

↑back Search

Time Nickname Message
00:06 πŸ”— icedice2 has quit IRC (Ping timeout: 260 seconds)
00:10 πŸ”— icedice has joined #archiveteam-bs
00:56 πŸ”— ola_norsk has quit IRC (Remote host closed the connection)
01:10 πŸ”— tomatokin has quit IRC (Ping timeout: 360 seconds)
01:12 πŸ”— drumstick has joined #archiveteam-bs
01:23 πŸ”— kristian_ has joined #archiveteam-bs
01:39 πŸ”— tar-xvf has joined #archiveteam-bs
01:43 πŸ”— odemg_ has quit IRC (Read error: Operation timed out)
02:09 πŸ”— schbirid has quit IRC (Ping timeout: 255 seconds)
02:20 πŸ”— icedice has quit IRC (Ping timeout: 245 seconds)
02:21 πŸ”— schbirid has joined #archiveteam-bs
03:02 πŸ”— drumstick has quit IRC (Ping timeout: 248 seconds)
03:06 πŸ”— drumstick has joined #archiveteam-bs
03:07 πŸ”— Asparagir has joined #archiveteam-bs
03:19 πŸ”— Asparagir has quit IRC (Asparagir)
04:48 πŸ”— Stilett0 has quit IRC (Ping timeout: 264 seconds)
04:50 πŸ”— SketchCow Jason Scott, c/o Internet Archive, San Francisco, CA 94118
04:50 πŸ”— SketchCow Did jrwr just explain to lord nightmare who I am
04:50 πŸ”— SketchCow aaawww
04:51 πŸ”— qw3rty112 has joined #archiveteam-bs
04:51 πŸ”— jrwr I told him in a pm to email you SketchCow
04:52 πŸ”— jrwr Mailbox at textfiles
04:52 πŸ”— SketchCow no, jason@textfiles.com or jscott@archive.org
04:53 πŸ”— jrwr It was from the whois, Google was turning up.empty for me, I'll add it to.my noted
04:53 πŸ”— jrwr Lord_Nigh: you around
04:54 πŸ”— Lord_Nigh yes, I think I have those emails already
04:54 πŸ”— jrwr Cool
04:55 πŸ”— jrwr You are still my hero Mr scott
04:56 πŸ”— qw3rty111 has quit IRC (Read error: Operation timed out)
04:57 πŸ”— Lord_Nigh 300 Funston Avenue address, i assume?
04:58 πŸ”— jrwr I think since it was books, right to the internet archive for them
04:59 πŸ”— jrwr 11:50 PM <BA1719@ SketchCow> Jason Scott, c/o Internet Archive, San Francisco, CA 94118
05:15 πŸ”— Stilett0 has joined #archiveteam-bs
05:38 πŸ”— Lord_Nigh yes, but there's no address within san francisco in that line
05:41 πŸ”— SketchCow Jason Scott, c/o Internet Archive, 300 Funston Avenue, San Francisco, CA 94118
07:03 πŸ”— Somebody2 Sorry if my poking at WARC uploading was what prompted the discovery of the bug.
07:04 πŸ”— Somebody2 (actually, I'm not sure if this is a "sorry", "you're welcome" kind of situation)
07:08 πŸ”— Pixi has quit IRC (Ping timeout: 255 seconds)
07:12 πŸ”— Pixi has joined #archiveteam-bs
07:14 πŸ”— kristian_ has quit IRC (Quit: Leaving)
07:30 πŸ”— Specular has joined #archiveteam-bs
07:34 πŸ”— Pixi has quit IRC (Quit: Pixi)
07:35 πŸ”— Pixi has joined #archiveteam-bs
08:59 πŸ”— odemg_ has joined #archiveteam-bs
09:01 πŸ”— schbirid has quit IRC (Quit: Leaving)
09:02 πŸ”— tar-xvf has quit IRC (Read error: Operation timed out)
09:05 πŸ”— drumstick has quit IRC (Read error: Operation timed out)
09:05 πŸ”— drumstick has joined #archiveteam-bs
09:27 πŸ”— drumstick Is there any whitelisted archiving service besides waybackmachine's 'save page now'?
09:30 πŸ”— PurpleSym ArchiveBot
09:34 πŸ”— ZexaronS has joined #archiveteam-bs
09:36 πŸ”— PurpleSym SketchCow: Is there no room for a compromise here? Hide the β€œunauthorized” WARCs until the user confirms he understands they might be fake? Show a big red banner including user and collection name?
10:21 πŸ”— kimmer12 has joined #archiveteam-bs
10:27 πŸ”— kimmer1 has quit IRC (Read error: Operation timed out)
10:52 πŸ”— kimmer1 has joined #archiveteam-bs
10:54 πŸ”— kimmer13 has joined #archiveteam-bs
11:00 πŸ”— kimmer12 has quit IRC (Ping timeout: 633 seconds)
11:01 πŸ”— kimmer12 has joined #archiveteam-bs
11:02 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
11:05 πŸ”— kimmer13 has quit IRC (Ping timeout: 633 seconds)
11:16 πŸ”— kimmer1 has joined #archiveteam-bs
11:19 πŸ”— kimmer12 has quit IRC (Ping timeout: 633 seconds)
11:20 πŸ”— dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
11:21 πŸ”— dashcloud has joined #archiveteam-bs
11:39 πŸ”— drumstick has quit IRC (Ping timeout: 248 seconds)
11:42 πŸ”— tomatokin has joined #archiveteam-bs
11:51 πŸ”— kimmer12 has joined #archiveteam-bs
11:55 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
11:56 πŸ”— kimmer1 has joined #archiveteam-bs
12:00 πŸ”— kimmer13 has joined #archiveteam-bs
12:02 πŸ”— kimmer12 has quit IRC (Ping timeout: 633 seconds)
12:05 πŸ”— tomatokin I love to see that compromise. I have been archiving myself because waybackmachine fails quite often.
12:06 πŸ”— BnAboyZ has quit IRC (Quit: The Lounge - https://thelounge.github.io)
12:06 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
12:06 πŸ”— dashcloud has quit IRC (Read error: Connection reset by peer)
12:08 πŸ”— dashcloud has joined #archiveteam-bs
12:11 πŸ”— kimmer13 has quit IRC (Ping timeout: 633 seconds)
12:12 πŸ”— BnAboyZ has joined #archiveteam-bs
12:17 πŸ”— tomatokin I'm surprised not many people here talking about it, Isn't this really big deal for amateur archiver?
12:18 πŸ”— Sanqui we mostly just use archivebot
12:18 πŸ”— kimmer1 has joined #archiveteam-bs
12:28 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
12:37 πŸ”— Specular hadn't really considered manipulated WARCs before, do many try uploading them?
12:46 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
12:51 πŸ”— kimmer1 has joined #archiveteam-bs
12:59 πŸ”— kimmer12 has joined #archiveteam-bs
13:02 πŸ”— kimmer1 has quit IRC (Ping timeout: 632 seconds)
13:03 πŸ”— kimmer1 has joined #archiveteam-bs
13:06 πŸ”— kimmer12 has quit IRC (Read error: Operation timed out)
14:04 πŸ”— kimmer12 has joined #archiveteam-bs
14:04 πŸ”— SketchCow Everyone is adorable.
14:05 πŸ”— SketchCow Here's the problem.
14:05 πŸ”— SketchCow We budgeted for 1pb of disk space last year
14:05 πŸ”— SketchCow We used 2pb
14:05 πŸ”— SketchCow At some point, it'll be noticed that "just folks" are slamming thousands of WARCs into the opensource uploads and they were getting into the wayback.
14:05 πŸ”— SketchCow We don't delete data
14:06 πŸ”— SketchCow But we may only whitelist a set that comes through an authorized channel.
14:06 πŸ”— SketchCow To be honest, it wasn't supposed to be accepting them before.
14:06 πŸ”— SketchCow Also, WAY too many people, once they realize they can upload "anything" do an excellent job of deciding 200-1tb collections are great to have "just because" and suddenly we're youtube.bak
14:06 πŸ”— SketchCow That's all. We'll see how it plays out
14:10 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
14:15 πŸ”— Specular sounds like a significant problem. Whitelisting or getting approved doesn't seem like a bad step, assuming legit archiving efforts can still get through.
14:18 πŸ”— jrwr SketchCow: Leave it to us nerds to archive too much
14:19 πŸ”— kimmer1 has joined #archiveteam-bs
14:24 πŸ”— kimmer13 has joined #archiveteam-bs
14:26 πŸ”— kimmer12 has quit IRC (Ping timeout: 633 seconds)
14:30 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
14:37 πŸ”— sep332 has joined #archiveteam-bs
14:39 πŸ”— tomatokin has quit IRC (Ping timeout: 360 seconds)
14:52 πŸ”— kimmer13 has quit IRC (Ping timeout: 633 seconds)
14:54 πŸ”— kimmer1 has joined #archiveteam-bs
14:59 πŸ”— kimmer12 has joined #archiveteam-bs
15:05 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
15:07 πŸ”— dashcloud has quit IRC (Ping timeout: 250 seconds)
15:09 πŸ”— kimmer1 has joined #archiveteam-bs
15:13 πŸ”— dashcloud has joined #archiveteam-bs
15:15 πŸ”— kimmer12 has quit IRC (Read error: Operation timed out)
15:19 πŸ”— jrwr I mean SketchCow, Vid.me total was something like 600TB
15:20 πŸ”— jrwr there are more and sites closing that are like that
15:20 πŸ”— jrwr we only ended up getting 200TB~ due to limitations
15:25 πŸ”— Specular I think the total was 1.4PB after de-duping, according to the staff guy. Pretty sure some of that chunk would have been Youtube mirrors as well since they offered an import ability (and just shared content in general across some channels). Crazy.
15:27 πŸ”— jrwr Ya
15:27 πŸ”— jrwr since youtube is going down the shitter with videos being removed
15:34 πŸ”— icedice has joined #archiveteam-bs
15:36 πŸ”— HCross2 jrwr: how was the AWS bill in the end :p
15:37 πŸ”— jrwr no idea
15:37 πŸ”— jrwr its STILL online
15:37 πŸ”— jrwr but not
15:42 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
15:50 πŸ”— Specular has quit IRC (Leaving)
15:52 πŸ”— kimmer1 has joined #archiveteam-bs
16:30 πŸ”— kimmer12 has joined #archiveteam-bs
16:33 πŸ”— jschwart has joined #archiveteam-bs
16:36 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
16:57 πŸ”— kimmer1 has joined #archiveteam-bs
17:03 πŸ”— kimmer12 has quit IRC (Ping timeout: 633 seconds)
17:27 πŸ”— Kaz kimmer1: fix your connection
17:29 πŸ”— kimmer12 has joined #archiveteam-bs
17:34 πŸ”— kimmer13 has joined #archiveteam-bs
17:35 πŸ”— kimmer1 has quit IRC (Ping timeout: 633 seconds)
17:38 πŸ”— kimmer12 has quit IRC (Read error: Operation timed out)
17:44 πŸ”— icedice has quit IRC (Quit: Leaving)
17:53 πŸ”— kimmer1 has joined #archiveteam-bs
17:54 πŸ”— astrid has joined #archiveteam-bs
17:55 πŸ”— swebb sets mode: +o astrid
18:02 πŸ”— kimmer13 has quit IRC (Ping timeout: 633 seconds)
18:15 πŸ”— ZexaronS has quit IRC (Quit: Leaving)
19:17 πŸ”— ndiddy_ has quit IRC ()
19:31 πŸ”— SketchCow someone archive https://twitter.com/OrrinHatch/status/945375067927490560
20:20 πŸ”— RichardG has quit IRC (Read error: Connection reset by peer)
20:21 πŸ”— RichardG has joined #archiveteam-bs
20:31 πŸ”— godane SketchCow: we are up to 1997-01-31 with tagesschau evening news videos i got
20:52 πŸ”— JAA Does anyone know of a website which currently has CloudFlare's I'm Under Attack mode activated? (That's the "Checking your browser before accessing X" message.)
20:53 πŸ”— godane i got to love the fact that install dead rising 4 needs a 42gb update
20:58 πŸ”— schbirid has joined #archiveteam-bs
20:59 πŸ”— godane i'm very sure we are fucking screwed with backing up current games
21:07 πŸ”— jschwart has quit IRC (Quit: Konversation terminated!)
21:12 πŸ”— Stilett0 is now known as Stiletto
21:17 πŸ”— Mateon1 has quit IRC (Ping timeout: 260 seconds)
21:17 πŸ”— Mateon1 has joined #archiveteam-bs
21:25 πŸ”— dd0a13f37 has joined #archiveteam-bs
21:29 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
21:30 πŸ”— dashcloud has joined #archiveteam-bs
21:30 πŸ”— MrDignity has quit IRC (Read error: Connection reset by peer)
21:34 πŸ”— icedice has joined #archiveteam-bs
21:51 πŸ”— icedice has quit IRC (Quit: Leaving)
22:04 πŸ”— joepie91 JAA: archive.is zip downloads have it enabled permanently
22:04 πŸ”— joepie91 iirc
22:05 πŸ”— joepie91 or whatever their current TLD is
22:07 πŸ”— JAA Ah sweet, thanks.
22:08 πŸ”— JAA The website uses .fo, but the downloads are on .today.
22:09 πŸ”— JAA Oh, website's available on .is as well, but HTTP redirects to .fo.
22:09 πŸ”— JAA Whatever.
22:09 πŸ”— dd0a13f37 I guess he wants to spread it out
22:10 πŸ”— dd0a13f37 Isn't "I'm under attack" when they want you to complete a captcha?
22:11 πŸ”— JAA Ew, I get the captcha on those downloads from my server.
22:11 πŸ”— JAA No, attack mode is that message I believe.
22:12 πŸ”— JAA https://support.cloudflare.com/hc/en-us/articles/200170076-What-does-I-m-Under-Attack-Mode-do-
22:12 πŸ”— dd0a13f37 bisnode.se gave me a captcha just now
22:14 πŸ”— dd0a13f37 joepie91: You can't enable it for specific parts of the site, it has to do with caching
22:14 πŸ”— dd0a13f37 Their zip downloads aren't cached, but the main page probably is
22:17 πŸ”— JAA Yes, you can.
22:18 πŸ”— dd0a13f37 Huh? Since when?
22:18 πŸ”— JAA No idea.
22:19 πŸ”— MrDignity has joined #archiveteam-bs
22:26 πŸ”— dd0a13f37 godane: https://thepiratebay.org/torrent/17957059/Dead_Rising_4-BALDMAN_(Inclu_Update_1)
22:26 πŸ”— godane i'm doing this on xbox one s
22:27 πŸ”— godane my pc is not powerful enough to play it anyways
22:28 πŸ”— dd0a13f37 For archival purposes pc is better
22:40 πŸ”— ArgyroNet has joined #archiveteam-bs
22:44 πŸ”— drumstick has joined #archiveteam-bs
22:44 πŸ”— ArgyroNet has left
22:48 πŸ”— JAA Interesting. CF changed the code for their attack mode challenge slightly at some point in the past few months.
22:48 πŸ”— svchost03 has quit IRC (Ping timeout: 360 seconds)
22:49 πŸ”— JAA Nothing that changes how it works though.
22:51 πŸ”— JAA Actually looks like a small bugfix.
22:59 πŸ”— Kaz ..are you trying to break cloudflare?
22:59 πŸ”— JAA Yes.
23:00 πŸ”— JAA Well, succeeding, mostly. ;-)
23:00 πŸ”— Kaz ah lord
23:04 πŸ”— JAA I ported joepie91's parser to Python and am just implementing the final missing parts to get it working. I'll then look into how it can be integrated into wpull.
23:15 πŸ”— BlueMaxim has joined #archiveteam-bs
23:23 πŸ”— Mateon1 has quit IRC (Remote host closed the connection)
23:24 πŸ”— Mateon1 has joined #archiveteam-bs
23:34 πŸ”— JAA TIL that tr sucks at handling multi-byte characters.
23:57 πŸ”— JAA Yay, it seems to work correctly. :-)
23:58 πŸ”— JAA 100 test cases gave the same results as the Node interpreter.

irclogger-viewer