#archiveteam-bs 2017-12-26,Tue


Who What When
***icedice2 has quit IRC (Ping timeout: 260 seconds)
icedice has joined #archiveteam-bs
[00:06]
.......... (idle for 46mn)
ola_norsk has quit IRC (Remote host closed the connection) [00:56]
tomatokin has quit IRC (Ping timeout: 360 seconds)
drumstick has joined #archiveteam-bs
[01:10]
kristian_ has joined #archiveteam-bs [01:23]
.... (idle for 16mn)
tar-xvf has joined #archiveteam-bs
odemg_ has quit IRC (Read error: Operation timed out)
[01:39]
...... (idle for 26mn)
schbirid has quit IRC (Ping timeout: 255 seconds) [02:09]
icedice has quit IRC (Ping timeout: 245 seconds)
schbirid has joined #archiveteam-bs
[02:20]
......... (idle for 41mn)
drumstick has quit IRC (Ping timeout: 248 seconds)
drumstick has joined #archiveteam-bs
Asparagir has joined #archiveteam-bs
[03:02]
Asparagir has quit IRC (Asparagir) [03:19]
.................. (idle for 1h29mn)
Stilett0 has quit IRC (Ping timeout: 264 seconds) [04:48]
SketchCowJason Scott, c/o Internet Archive, San Francisco, CA 94118
Did jrwr just explain to lord nightmare who I am
aaawww
[04:50]
***qw3rty112 has joined #archiveteam-bs [04:51]
jrwrI told him in a pm to email you SketchCow
Mailbox at textfiles
[04:51]
SketchCowno, jason@textfiles.com or jscott@archive.org [04:52]
jrwrIt was from the whois, Google was turning up empty for me, I'll add it to my notes
Lord_Nigh: you around
[04:53]
Lord_Nighyes, I think I have those emails already [04:54]
jrwrCool
You are still my hero Mr scott
[04:54]
***qw3rty111 has quit IRC (Read error: Operation timed out) [04:56]
Lord_Nigh300 Funston Avenue address, i assume? [04:57]
jrwrI think since it was books, right to the internet archive for them
11:50 PM <BA1719@ SketchCow> Jason Scott, c/o Internet Archive, San Francisco, CA 94118
[04:58]
.... (idle for 16mn)
***Stilett0 has joined #archiveteam-bs [05:15]
..... (idle for 23mn)
Lord_Nighyes, but there's no address within san francisco in that line [05:38]
SketchCowJason Scott, c/o Internet Archive, 300 Funston Avenue, San Francisco, CA 94118 [05:41]
................. (idle for 1h22mn)
Somebody2Sorry if my poking at WARC uploading was what prompted the discovery of the bug.
(actually, I'm not sure if this is a "sorry", "you're welcome" kind of situation)
[07:03]
***Pixi has quit IRC (Ping timeout: 255 seconds)
Pixi has joined #archiveteam-bs
kristian_ has quit IRC (Quit: Leaving)
[07:08]
.... (idle for 16mn)
Specular has joined #archiveteam-bs
Pixi has quit IRC (Quit: Pixi)
Pixi has joined #archiveteam-bs
[07:30]
................. (idle for 1h24mn)
odemg_ has joined #archiveteam-bs
schbirid has quit IRC (Quit: Leaving)
tar-xvf has quit IRC (Read error: Operation timed out)
drumstick has quit IRC (Read error: Operation timed out)
drumstick has joined #archiveteam-bs
[08:59]
..... (idle for 22mn)
drumstickIs there any whitelisted archiving service besides waybackmachine's 'save page now'? [09:27]
PurpleSymArchiveBot [09:30]
***ZexaronS has joined #archiveteam-bs [09:34]
PurpleSymSketchCow: Is there no room for a compromise here? Hide the “unauthorized” WARCs until the user confirms he understands they might be fake? Show a big red banner including user and collection name? [09:36]
.......... (idle for 45mn)
***kimmer12 has joined #archiveteam-bs [10:21]
kimmer1 has quit IRC (Read error: Operation timed out) [10:27]
...... (idle for 25mn)
kimmer1 has joined #archiveteam-bs
kimmer13 has joined #archiveteam-bs
[10:52]
kimmer12 has quit IRC (Ping timeout: 633 seconds)
kimmer12 has joined #archiveteam-bs
kimmer1 has quit IRC (Ping timeout: 633 seconds)
kimmer13 has quit IRC (Ping timeout: 633 seconds)
[11:00]
kimmer1 has joined #archiveteam-bs
kimmer12 has quit IRC (Ping timeout: 633 seconds)
dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
dashcloud has joined #archiveteam-bs
[11:16]
.... (idle for 18mn)
drumstick has quit IRC (Ping timeout: 248 seconds)
tomatokin has joined #archiveteam-bs
[11:39]
kimmer12 has joined #archiveteam-bs
kimmer1 has quit IRC (Ping timeout: 633 seconds)
kimmer1 has joined #archiveteam-bs
kimmer13 has joined #archiveteam-bs
kimmer12 has quit IRC (Ping timeout: 633 seconds)
[11:51]
tomatokinI'd love to see that compromise. I have been archiving myself because waybackmachine fails quite often. [12:05]
***BnAboyZ has quit IRC (Quit: The Lounge - https://thelounge.github.io)
kimmer1 has quit IRC (Ping timeout: 633 seconds)
dashcloud has quit IRC (Read error: Connection reset by peer)
dashcloud has joined #archiveteam-bs
kimmer13 has quit IRC (Ping timeout: 633 seconds)
BnAboyZ has joined #archiveteam-bs
[12:06]
tomatokinI'm surprised not many people here are talking about it. Isn't this a really big deal for amateur archivers? [12:17]
Sanquiwe mostly just use archivebot [12:18]
***kimmer1 has joined #archiveteam-bs [12:18]
BlueMaxim has quit IRC (Quit: Leaving) [12:28]
Specularhadn't really considered manipulated WARCs before, do many try uploading them? [12:37]
***kimmer1 has quit IRC (Ping timeout: 633 seconds) [12:46]
kimmer1 has joined #archiveteam-bs [12:51]
kimmer12 has joined #archiveteam-bs
kimmer1 has quit IRC (Ping timeout: 632 seconds)
kimmer1 has joined #archiveteam-bs
kimmer12 has quit IRC (Read error: Operation timed out)
[12:59]
............ (idle for 58mn)
kimmer12 has joined #archiveteam-bs [14:04]
SketchCowEveryone is adorable.
Here's the problem.
We budgeted for 1pb of disk space last year
We used 2pb
At some point, it'll be noticed that "just folks" are slamming thousands of WARCs into the opensource uploads and they were getting into the wayback.
We don't delete data
But we may only whitelist a set that comes through an authorized channel.
To be honest, it wasn't supposed to be accepting them before.
Also, WAY too many people, once they realize they can upload "anything" do an excellent job of deciding 200-1tb collections are great to have "just because" and suddenly we're youtube.bak
That's all. We'll see how it plays out
[14:04]
***kimmer1 has quit IRC (Ping timeout: 633 seconds) [14:10]
Specularsounds like a significant problem. Whitelisting or getting approved doesn't seem like a bad step, assuming legit archiving efforts can still get through. [14:15]
jrwrSketchCow: Leave it to us nerds to archive too much [14:18]
***kimmer1 has joined #archiveteam-bs [14:19]
kimmer13 has joined #archiveteam-bs
kimmer12 has quit IRC (Ping timeout: 633 seconds)
kimmer1 has quit IRC (Ping timeout: 633 seconds)
[14:24]
sep332 has joined #archiveteam-bs
tomatokin has quit IRC (Ping timeout: 360 seconds)
[14:37]
kimmer13 has quit IRC (Ping timeout: 633 seconds)
kimmer1 has joined #archiveteam-bs
[14:52]
kimmer12 has joined #archiveteam-bs [14:59]
kimmer1 has quit IRC (Ping timeout: 633 seconds)
dashcloud has quit IRC (Ping timeout: 250 seconds)
kimmer1 has joined #archiveteam-bs
dashcloud has joined #archiveteam-bs
kimmer12 has quit IRC (Read error: Operation timed out)
[15:05]
jrwrI mean SketchCow, Vid.me total was something like 600TB
there are more and more sites closing that are like that
we only ended up getting 200TB~ due to limitations
[15:19]
SpecularI think the total was 1.4PB after de-duping, according to the staff guy. Pretty sure some of that chunk would have been Youtube mirrors as well since they offered an import ability (and just shared content in general across some channels). Crazy. [15:25]
jrwrYa
since youtube is going down the shitter with videos being removed
[15:27]
***icedice has joined #archiveteam-bs [15:34]
HCross2jrwr: how was the AWS bill in the end :p [15:36]
jrwrno idea
it's STILL online
but not
[15:37]
***kimmer1 has quit IRC (Ping timeout: 633 seconds) [15:42]
Specular has quit IRC (Leaving)
kimmer1 has joined #archiveteam-bs
[15:50]
........ (idle for 38mn)
kimmer12 has joined #archiveteam-bs
jschwart has joined #archiveteam-bs
kimmer1 has quit IRC (Ping timeout: 633 seconds)
[16:30]
..... (idle for 21mn)
kimmer1 has joined #archiveteam-bs [16:57]
kimmer12 has quit IRC (Ping timeout: 633 seconds) [17:03]
..... (idle for 24mn)
Kazkimmer1: fix your connection [17:27]
***kimmer12 has joined #archiveteam-bs [17:29]
kimmer13 has joined #archiveteam-bs
kimmer1 has quit IRC (Ping timeout: 633 seconds)
kimmer12 has quit IRC (Read error: Operation timed out)
[17:34]
icedice has quit IRC (Quit: Leaving) [17:44]
kimmer1 has joined #archiveteam-bs
astrid has joined #archiveteam-bs
swebb sets mode: +o astrid
[17:53]
kimmer13 has quit IRC (Ping timeout: 633 seconds) [18:02]
ZexaronS has quit IRC (Quit: Leaving) [18:15]
............. (idle for 1h2mn)
ndiddy_ has quit IRC () [19:17]
SketchCowsomeone archive https://twitter.com/OrrinHatch/status/945375067927490560 [19:31]
.......... (idle for 49mn)
***RichardG has quit IRC (Read error: Connection reset by peer)
RichardG has joined #archiveteam-bs
[20:20]
godaneSketchCow: we are up to 1997-01-31 with tagesschau evening news videos i got [20:31]
..... (idle for 21mn)
JAADoes anyone know of a website which currently has CloudFlare's I'm Under Attack mode activated? (That's the "Checking your browser before accessing X" message.) [20:52]
godanei got to love the fact that installing dead rising 4 needs a 42gb update [20:53]
***schbirid has joined #archiveteam-bs [20:58]
godanei'm very sure we are fucking screwed with backing up current games [20:59]
***jschwart has quit IRC (Quit: Konversation terminated!) [21:07]
Stilett0 is now known as Stiletto [21:12]
Mateon1 has quit IRC (Ping timeout: 260 seconds)
Mateon1 has joined #archiveteam-bs
[21:17]
dd0a13f37 has joined #archiveteam-bs
dashcloud has quit IRC (Read error: Operation timed out)
dashcloud has joined #archiveteam-bs
MrDignity has quit IRC (Read error: Connection reset by peer)
icedice has joined #archiveteam-bs
[21:25]
.... (idle for 17mn)
icedice has quit IRC (Quit: Leaving) [21:51]
joepie91JAA: archive.is zip downloads have it enabled permanently
iirc
or whatever their current TLD is
[22:04]
JAAAh sweet, thanks.
The website uses .fo, but the downloads are on .today.
Oh, website's available on .is as well, but HTTP redirects to .fo.
Whatever.
[22:07]
dd0a13f37I guess he wants to spread it out
Isn't "I'm under attack" when they want you to complete a captcha?
[22:09]
JAAEw, I get the captcha on those downloads from my server.
No, attack mode is that message I believe.
https://support.cloudflare.com/hc/en-us/articles/200170076-What-does-I-m-Under-Attack-Mode-do-
[22:11]
dd0a13f37bisnode.se gave me a captcha just now
joepie91: You can't enable it for specific parts of the site, it has to do with caching
Their zip downloads aren't cached, but the main page probably is
[22:12]
JAAYes, you can. [22:17]
dd0a13f37Huh? Since when? [22:18]
JAANo idea. [22:18]
***MrDignity has joined #archiveteam-bs [22:19]
dd0a13f37godane: https://thepiratebay.org/torrent/17957059/Dead_Rising_4-BALDMAN_(Inclu_Update_1) [22:26]
godanei'm doing this on xbox one s
my pc is not powerful enough to play it anyways
[22:26]
dd0a13f37For archival purposes pc is better [22:28]
***ArgyroNet has joined #archiveteam-bs
drumstick has joined #archiveteam-bs
ArgyroNet has left
[22:40]
JAAInteresting. CF changed the code for their attack mode challenge slightly at some point in the past few months. [22:48]
***svchost03 has quit IRC (Ping timeout: 360 seconds) [22:48]
JAANothing that changes how it works though.
Actually looks like a small bugfix.
[22:49]
Kaz..are you trying to break cloudflare? [22:59]
JAAYes.
Well, succeeding, mostly. ;-)
[22:59]
Kazah lord [23:00]
JAAI ported joepie91's parser to Python and am just implementing the final missing parts to get it working. I'll then look into how it can be integrated into wpull. [23:04]
***BlueMaxim has joined #archiveteam-bs [23:15]
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam-bs
[23:23]
JAATIL that tr sucks at handling multi-byte characters. [23:34]
..... (idle for 23mn)
Yay, it seems to work correctly. :-)
100 test cases gave the same results as the Node interpreter.
[23:57]
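(Editor's illustration of JAA's remark about tr and multi-byte characters. This is a rough sketch, assuming a byte-oriented tr such as GNU coreutils tr has historically been; other implementations may behave differently. The helper names bytewise_tr and charwise_tr are hypothetical and do not come from the log. The first function imitates translating UTF-8 text one byte at a time, the second translates whole characters.)

```python
def bytewise_tr(text: str, src: str, dst: str) -> bytes:
    """Rough imitation of a byte-oriented tr (an assumption, not tr's actual code)."""
    src_b = src.encode("utf-8")
    dst_b = dst.encode("utf-8")
    # When the second set is shorter, pad it with its last byte,
    # similar to what GNU tr documents for a short SET2.
    if len(dst_b) < len(src_b):
        dst_b += dst_b[-1:] * (len(src_b) - len(dst_b))
    table = bytes.maketrans(src_b, dst_b[:len(src_b)])
    return text.encode("utf-8").translate(table)


def charwise_tr(text: str, src: str, dst: str) -> str:
    """Character-aware translation: each code point in src maps to one in dst."""
    return text.translate(str.maketrans(src, dst))


if __name__ == "__main__":
    # "é" is two bytes in UTF-8 (0xC3 0xA9), so the byte-wise version
    # replaces each of the two bytes separately.
    print(bytewise_tr("café", "é", "e"))   # b'cafee'
    print(charwise_tr("café", "é", "e"))   # cafe
```

The byte-wise result has one character too many because the two bytes of "é" are mapped independently, which is the kind of surprise the remark describes.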
