Time |
Nickname |
Message |
00:13
🔗
|
|
wp494 has quit IRC (Ping timeout: 633 seconds) |
00:13
🔗
|
|
wp494 has joined #archiveteam |
00:15
🔗
|
|
hdch has joined #archiveteam |
00:23
🔗
|
|
Nemo_bis has joined #archiveteam |
01:40
🔗
|
|
jut has quit IRC (Read error: Connection reset by peer) |
01:45
🔗
|
|
jut has joined #archiveteam |
02:41
🔗
|
|
DFJustin has quit IRC (Ping timeout: 260 seconds) |
02:48
🔗
|
|
tango111 has joined #archiveteam |
03:43
🔗
|
JAA |
Reminder that the UOL forums (Brazilian equivalent to AOL) will go down on the 7th. There are ArchiveBot jobs for it (after the first ones crashed), but they won't finish in time... |
04:08
🔗
|
|
exoire has quit IRC (Read error: Operation timed out) |
04:17
🔗
|
|
guest has joined #archiveteam |
04:17
🔗
|
guest |
say, does the internet archive still retroactively de-list sites that exclude it in robots.txt? |
04:17
🔗
|
guest |
i thought it was supposed to have stopped that a while ago |
04:19
🔗
|
guest |
i just found out that i'm unable to access the old truecrypt website because the *current* website excludes the IA user agent... |
04:19
🔗
|
guest |
(i know you guys aren't archive.org, but i'd think you would know the answer since you're associated with them) |
04:23
🔗
|
ivan_ |
if you're seeing "This URL has been excluded from the Wayback Machine." that's a manual exclusion |
04:24
🔗
|
guest |
ivan_: it's being caused by robots.txt *retroactively* |
04:24
🔗
|
guest |
i'm pretty sure it wasn't there when the site was first archived |
04:24
🔗
|
JAA |
Well, then you have your answer: yes, they still do that. |
04:26
🔗
|
guest |
so it's not being caused by manual exclusion via a dmca or something? it's definitely the fault of robots.txt? |
04:28
🔗
|
ivan_ |
why not give us a link to the page you're looking at |
04:28
🔗
|
JAA |
Pay attention to the error message. If it's what ivan_ wrote, then it's a manual exclusion due to a request by the website owner. If the error message mentions robots.txt, then it's robots.txt. |
04:28
🔗
|
JAA |
www.truecrypt.org was excluded manually. |
04:28
🔗
|
guest |
oh i thought manually meant via robots.txt explicitly specifying ia_archiver |
04:29
🔗
|
JAA |
No, manual exclusion means that someone at the Internet Archive blocked that site in the Wayback Machine. |
04:29
🔗
|
guest |
at the request of the site owner or by their own volution? |
04:30
🔗
|
guest |
volition* |
04:30
🔗
|
JAA |
The former. Or perhaps DMCA or similar. Certainly not just because they felt like doing it. |
04:31
🔗
|
Somebody2 |
as I say over and over again -- if you want to preserve something, keep a LOCAL copy of it, don't rely on any 3rd party |
04:32
🔗
|
Somebody2 |
happily, it's still pretty straightforward to extract things from the Wayback Machine (one page at a time) |
04:33
🔗
|
guest |
Somebody2: well this was brought about because i'm fixing link rot. thankfully i did find a copy at archive.fo but who knows if that archive site will last for long like IA |
04:34
🔗
|
Somebody2 |
right, so *grab a local copy* (from archive.fo in this case) right now |
04:35
🔗
|
Somebody2 |
and at a minimum, keep it locally |
04:35
🔗
|
guest |
aight |
04:35
🔗
|
Somebody2 |
if you choose to also distribute it in various other places ... you probably shouldn't mention that here |
04:35
🔗
|
Somebody2 |
but there are lots of places to post textual material |
04:36
🔗
|
guest |
why would it be bad to mention if it were distributed to someone else? or do you mean like, distributed amongst many small redundant systems if i get what you're saying |
04:36
🔗
|
|
qw3rty115 has joined #archiveteam |
04:37
🔗
|
guest |
it's just some html content and i'm pretty sure other people have copies too, so i'll just keep it on my local computer |
04:37
🔗
|
Somebody2 |
the reason not to discuss it here is ... this channel has public logs, and is pretty well known. And the material in question has at least some people who don't want it distributed... |
04:38
🔗
|
guest |
ah |
04:38
🔗
|
guest |
well it's just the truecrypt web pages. not some kind of illicit porn or anything. |
04:38
🔗
|
Somebody2 |
so it's best not to discuss where it might be made available in such well-known locations. |
04:39
🔗
|
JAA |
Besides, this channel isn't for discussion anyway. |
04:39
🔗
|
Somebody2 |
sure, but apparently the author of it (or someone associated with them) would prefer it not be distributed (at least not thru the wayback machine) |
04:39
🔗
|
guest |
oah |
04:39
🔗
|
guest |
*ah |
04:39
🔗
|
guest |
JAA: ok sorry |
04:39
🔗
|
Somebody2 |
whoops, right. Moving to #archiveteam-bs! |
04:42
🔗
|
|
qw3rty114 has quit IRC (Read error: Operation timed out) |
04:44
🔗
|
|
odemgi_ has joined #archiveteam |
04:46
🔗
|
|
odemgi has quit IRC (Read error: Operation timed out) |
04:46
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
04:58
🔗
|
|
odemg has joined #archiveteam |
05:19
🔗
|
|
ndiddy has quit IRC (Quit: nighty night) |
05:38
🔗
|
|
DFJustin has joined #archiveteam |
05:38
🔗
|
|
swebb sets mode: +o DFJustin |
06:10
🔗
|
|
nertzy3 has quit IRC (Quit: This computer has gone to sleep) |
06:24
🔗
|
|
riley has joined #archiveteam |
06:37
🔗
|
|
macker has quit IRC (Ping timeout: 240 seconds) |
06:45
🔗
|
|
jut has quit IRC (Ping timeout: 252 seconds) |
06:46
🔗
|
|
jut has joined #archiveteam |
06:49
🔗
|
|
cascode has joined #archiveteam |
06:52
🔗
|
|
macker has joined #archiveteam |
06:52
🔗
|
|
cascode has quit IRC (Client Quit) |
06:53
🔗
|
|
cascode has joined #archiveteam |
07:15
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
07:25
🔗
|
|
redlob has joined #archiveteam |
07:54
🔗
|
|
hdch has quit IRC (Ping timeout: 265 seconds) |
08:16
🔗
|
|
hdch has joined #archiveteam |
08:16
🔗
|
|
tomaspark has quit IRC (Read error: Connection reset by peer) |
08:22
🔗
|
|
LFlare has joined #archiveteam |
08:49
🔗
|
|
Matt07211 has joined #archiveteam |
09:07
🔗
|
|
wp494 has quit IRC (Ping timeout: 265 seconds) |
09:07
🔗
|
|
wp494 has joined #archiveteam |
09:31
🔗
|
|
ubahn has joined #archiveteam |
09:40
🔗
|
|
ubahn_ has joined #archiveteam |
09:42
🔗
|
|
ubahn has quit IRC (Ping timeout: 260 seconds) |
09:48
🔗
|
|
exoire has joined #archiveteam |
09:50
🔗
|
|
AliceSCT has quit IRC (Remote host closed the connection) |
09:58
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
10:11
🔗
|
|
exoire has quit IRC (Read error: Operation timed out) |
10:22
🔗
|
|
ubahn has joined #archiveteam |
10:23
🔗
|
|
ubahn_ has quit IRC (Read error: Operation timed out) |
10:26
🔗
|
|
ubahn_ has joined #archiveteam |
10:28
🔗
|
|
ubahn has quit IRC (Ping timeout: 360 seconds) |
10:53
🔗
|
|
pizzaiolo has joined #archiveteam |
11:09
🔗
|
|
hdch has quit IRC (Ping timeout: 265 seconds) |
11:23
🔗
|
|
pizzaiolo has quit IRC (Quit: pizzaiolo) |
11:25
🔗
|
|
pizzaiolo has joined #archiveteam |
11:56
🔗
|
|
odemgi_ has quit IRC (Remote host closed the connection) |
12:07
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
12:10
🔗
|
|
hook54321 has joined #archiveteam |
12:10
🔗
|
|
svchfoo1 sets mode: +o hook54321 |
12:44
🔗
|
|
tango111 has quit IRC (Read error: Operation timed out) |
13:21
🔗
|
|
benjinsmi has joined #archiveteam |
13:25
🔗
|
|
benjins has quit IRC (Read error: Operation timed out) |
13:30
🔗
|
|
jut has quit IRC (Ping timeout: 252 seconds) |
13:31
🔗
|
|
jut has joined #archiveteam |
13:40
🔗
|
|
Hani has quit IRC (Read error: Connection reset by peer) |
13:56
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
13:59
🔗
|
|
redlob has joined #archiveteam |
14:01
🔗
|
|
pizzaiolo has quit IRC (Quit: pizzaiolo) |
14:38
🔗
|
|
Despatche has joined #archiveteam |
14:39
🔗
|
|
guest has quit IRC (Quit: y ppl s) |
14:54
🔗
|
|
Mindbleac has joined #archiveteam |
15:09
🔗
|
SketchCow |
AND HOW THE FUCK IS EVERYTHING |
15:09
🔗
|
SketchCow |
Because I've been hit with dozens of privmessages from Brazilians |
15:10
🔗
|
VoynichCr |
why |
15:17
🔗
|
Kaz |
SketchCow: need access to the tumblr collection please, username kaz_ |
15:22
🔗
|
SketchCow |
msg me the e-mail of the account |
15:27
🔗
|
|
matthusby has joined #archiveteam |
16:10
🔗
|
|
Hani has joined #archiveteam |
16:12
🔗
|
Nemo_bis |
"14.6 TB of residual data (5.8 million files) from more than 10 years of operation were deleted" http://www.ipernity.com/blog/team/4715014 |
16:32
🔗
|
SketchCow |
=========================================== |
16:32
🔗
|
SketchCow |
INTERNET ARCHIVE IS UNDERGOING MAINTENANCE TODAY |
16:32
🔗
|
SketchCow |
Disruption is meant to be minimized but might not be |
16:32
🔗
|
SketchCow |
It will affect a range of things, just keep an eye |
16:33
🔗
|
SketchCow |
Starts shortly, will go on for a few hours |
16:33
🔗
|
SketchCow |
=========================================== |
16:37
🔗
|
|
nertzy has joined #archiveteam |
16:37
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 255 seconds) |
17:00
🔗
|
|
benjinsmi has quit IRC (Leaving) |
17:00
🔗
|
|
benjins has joined #archiveteam |
17:34
🔗
|
|
ubahn_ has quit IRC (Quit: ubahn_) |
17:37
🔗
|
yano |
SketchCow: is it over? or has it not started yet? |
17:49
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
17:50
🔗
|
yano |
i found it, https://twitter.com/internetarchive/status/1080871414389972992 |
17:58
🔗
|
BartoCH |
yep, my archive crawler started to have troubles just now |
18:00
🔗
|
|
jut has quit IRC (Ping timeout: 252 seconds) |
18:03
🔗
|
|
jut has joined #archiveteam |
18:06
🔗
|
yano |
BartoCH: mine is still working |
18:08
🔗
|
|
wp494 has quit IRC (Ping timeout: 260 seconds) |
18:08
🔗
|
|
wp494 has joined #archiveteam |
18:23
🔗
|
|
ubahn has joined #archiveteam |
18:24
🔗
|
|
ubahn has quit IRC (Client Quit) |
18:24
🔗
|
|
ubahn has joined #archiveteam |
18:25
🔗
|
|
ubahn has quit IRC (Client Quit) |
18:57
🔗
|
SketchCow |
It's for two hours roughly |
18:57
🔗
|
SketchCow |
It's still going on |
18:57
🔗
|
SketchCow |
I'm having S3 explode |
18:57
🔗
|
astrid |
oops sorry |
19:04
🔗
|
|
Mateon1 has joined #archiveteam |
19:09
🔗
|
|
Despatche has quit IRC (Quit: Error: Connection reset by peer) |
19:26
🔗
|
|
hook54321 has joined #archiveteam |
19:26
🔗
|
|
svchfoo3 sets mode: +o hook54321 |
19:27
🔗
|
|
chimyatta has quit IRC (Ping timeout: 252 seconds) |
19:28
🔗
|
|
chimyatta has joined #archiveteam |
19:35
🔗
|
|
Stilett0 has joined #archiveteam |
19:36
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
20:05
🔗
|
|
daan101 has joined #archiveteam |
20:07
🔗
|
|
daan101 has quit IRC (Client Quit) |
20:41
🔗
|
JAA |
UOL Forums warrior project started. |
20:41
🔗
|
|
rektide has joined #archiveteam |
21:00
🔗
|
|
BasDub has joined #archiveteam |
21:02
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
21:03
🔗
|
|
DasBub has quit IRC (Read error: Operation timed out) |
21:08
🔗
|
|
Mateon1 has joined #archiveteam |
21:10
🔗
|
|
VerfiedJ has joined #archiveteam |
21:18
🔗
|
|
nico_32 has quit IRC (Read error: Operation timed out) |
21:21
🔗
|
|
nico_32 has joined #archiveteam |
21:22
🔗
|
|
antomatic has quit IRC (Read error: Operation timed out) |
21:33
🔗
|
|
DosBob has joined #archiveteam |
21:37
🔗
|
|
BasDub has quit IRC (Ping timeout: 252 seconds) |
21:37
🔗
|
|
DosBob is now known as DasBub |
21:39
🔗
|
SketchCow |
It should all be back now, by the way. |
21:39
🔗
|
|
Stiletto has joined #archiveteam |
21:45
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
21:49
🔗
|
|
tomaspark has joined #archiveteam |
21:52
🔗
|
|
Wizzito has joined #archiveteam |
21:52
🔗
|
Wizzito |
okay, so i guess we're focusing on the uol forums rn? i see that it's on "warrior based projects" now |
22:05
🔗
|
|
Dj-Wawa has joined #archiveteam |
22:07
🔗
|
|
Matt07211 has quit IRC (Ping timeout: 265 seconds) |
22:11
🔗
|
Wizzito |
So many 0/0.1 MBs on the UOL tracker |
22:11
🔗
|
Wizzito |
We've done 2 GB so far, eh |
22:11
🔗
|
JAA |
Wizzito: Come to #archiveteam-bs please. This channel is for announcements. |
22:11
🔗
|
Wizzito |
Ohh okay |
22:16
🔗
|
|
antomatic has joined #archiveteam |
22:16
🔗
|
|
swebb sets mode: +o antomatic |
22:18
🔗
|
JAA |
It looks like the UOL Forums are just the tip of the iceberg. Grupo Abril, the parent company of UOL, was recently sold for a symbolic 100k Brazilian real (~27k USD) after amassing 1.6 billion real (~430 million USD) of debt. So there will likely be more service shutdowns soon. https://naaju.com/brazil/grupo-abril-is-sold-and-banks-and-employees-of-the-billionaire-by-default/ |
22:18
🔗
|
JAA |
For starters, the blog platform at http://blog.uol.com.br/ is shutting down end of the month. |
22:21
🔗
|
Wizzito |
oh boy, are we gonna archive that? |
22:22
🔗
|
astrid |
-> #archiveteam-bs |
22:23
🔗
|
Wizzito |
????? i cant post a response to anything without getting sent there smh |
22:23
🔗
|
Wizzito |
ok |
22:26
🔗
|
|
BlueMax has joined #archiveteam |
22:27
🔗
|
|
Stilett0 has joined #archiveteam |
22:29
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
22:34
🔗
|
|
DasBub has quit IRC (Quit: rebeught) |
22:34
🔗
|
|
thesame_ has joined #archiveteam |
22:36
🔗
|
|
DasBub has joined #archiveteam |
22:37
🔗
|
|
thesame has quit IRC (Ping timeout: 252 seconds) |
22:38
🔗
|
|
thesame__ has joined #archiveteam |
22:38
🔗
|
|
exoire has joined #archiveteam |
22:41
🔗
|
|
thesame_ has quit IRC (Ping timeout: 252 seconds) |
22:56
🔗
|
|
Stilett0 has quit IRC (Ping timeout: 246 seconds) |
22:59
🔗
|
|
trc has joined #archiveteam |
23:00
🔗
|
|
Stiletto has joined #archiveteam |
23:19
🔗
|
|
Stilett0 has joined #archiveteam |
23:21
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
23:31
🔗
|
|
trc has quit IRC (Quit: AndroIRC - Android IRC Client ( http://www.androirc.com )) |
23:33
🔗
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
23:49
🔗
|
|
chimyatta has quit IRC (Quit: quitting) |