Time |
Nickname |
Message |
00:07
🔗
|
|
j08nY has quit IRC (Quit: Leaving) |
00:09
🔗
|
|
DoomTay has quit IRC (Ping timeout: 268 seconds) |
00:12
🔗
|
|
DoomTay has joined #archiveteam-bs |
00:19
🔗
|
|
JesseW has joined #archiveteam-bs |
00:23
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
00:26
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
00:27
🔗
|
|
ris has quit IRC () |
00:42
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
00:46
🔗
|
|
dashcloud has joined #archiveteam-bs |
00:57
🔗
|
SketchCow |
DoomTay: What is your deal? |
00:58
🔗
|
JesseW |
DoomTay: btw, your edit to http://archiveteam.org/index.php?title=Template:IRC stuffed all the pages using that template into Category:Templates. Fixing now. |
00:59
🔗
|
DoomTay |
Huh. I was wondering why that one wasn't categorized into Templates |
01:00
🔗
|
JesseW |
and apparently my fix borked the site. wheee |
01:01
🔗
|
JesseW |
but it's back now |
01:09
🔗
|
DoomTay |
..though it frequently resultsi n bouts of a 508 |
01:11
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
01:20
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:21
🔗
|
|
RichardG_ has quit IRC (Ping timeout: 258 seconds) |
01:24
🔗
|
|
RichardG has joined #archiveteam-bs |
01:27
🔗
|
|
dashcloud has joined #archiveteam-bs |
01:28
🔗
|
|
vitzli has joined #archiveteam-bs |
01:33
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:36
🔗
|
|
dashcloud has joined #archiveteam-bs |
01:38
🔗
|
DoomTay |
Is there anything that resuscitates deleted IMDb comments? Because Wayback Machine isn't helping |
01:42
🔗
|
|
aschmitz has joined #archiveteam-bs |
01:46
🔗
|
aschmitz |
godane: Do you have a full copy of NTRS, or are you just going slowly at it? |
01:48
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:49
🔗
|
godane |
i'm grabbing them slowly |
01:49
🔗
|
godane |
year by year |
01:52
🔗
|
|
dashcloud has joined #archiveteam-bs |
02:16
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
02:18
🔗
|
|
nickname_ has joined #archiveteam-bs |
02:31
🔗
|
|
JesseW has joined #archiveteam-bs |
02:52
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
03:21
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
03:21
🔗
|
|
RichardG has joined #archiveteam-bs |
03:49
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
03:49
🔗
|
|
RichardG has joined #archiveteam-bs |
03:51
🔗
|
|
nickname_ has quit IRC (Read error: Operation timed out) |
03:53
🔗
|
|
DoomTay has quit IRC (Ping timeout: 270 seconds) |
03:57
🔗
|
|
DoomTay has joined #archiveteam-bs |
04:05
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
04:05
🔗
|
|
Start has quit IRC (Read error: Connection reset by peer) |
04:14
🔗
|
|
Sk1d has joined #archiveteam-bs |
04:47
🔗
|
|
tomwsmf-a has quit IRC (Read error: Operation timed out) |
04:57
🔗
|
JesseW |
Lord_Nigh: I had no idea your actual nick was Nightmare. I thought it was a reference to Monty Python... |
04:57
🔗
|
JesseW |
We are the lords who say Nigh! |
05:02
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
05:09
🔗
|
Lord_Nigh |
nope |
05:10
🔗
|
Lord_Nigh |
its lord_nightmare but for historical reasons (oldest surviving irc network) efnet has a 9 char nickname limit |
05:10
🔗
|
Lord_Nigh |
on all other irc networks i'm Lord_Nightmare |
05:11
🔗
|
|
Sk1d has joined #archiveteam-bs |
05:13
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
05:16
🔗
|
|
DoomTay has left |
05:20
🔗
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
05:22
🔗
|
|
DoomTay has joined #archiveteam-bs |
05:40
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
06:02
🔗
|
|
hook54321 has joined #archiveteam-bs |
06:14
🔗
|
|
DoomTay has quit IRC (Quit: Page closed) |
06:57
🔗
|
|
dashcloud has quit IRC (Ping timeout: 244 seconds) |
06:58
🔗
|
|
dashcloud has joined #archiveteam-bs |
07:23
🔗
|
|
schbirid has joined #archiveteam-bs |
07:31
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
07:34
🔗
|
|
dashcloud has joined #archiveteam-bs |
07:38
🔗
|
|
remsen has quit IRC (ZNC 1.6.2 - http://znc.in) |
07:38
🔗
|
|
remsen has joined #archiveteam-bs |
08:01
🔗
|
|
godane has quit IRC (Quit: Leaving.) |
08:03
🔗
|
|
godane has joined #archiveteam-bs |
08:37
🔗
|
|
vitzli has quit IRC (Leaving) |
09:01
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
09:35
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
09:38
🔗
|
|
dashcloud has joined #archiveteam-bs |
10:13
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
10:16
🔗
|
|
dashcloud has joined #archiveteam-bs |
10:19
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
10:23
🔗
|
|
dashcloud has joined #archiveteam-bs |
11:02
🔗
|
Medowar |
I think i have to kill 2 google code tasks running on my machine. They are on 3,5mio requests right now, with 8 more million to do. Currently using 6Gig Ram each. |
11:04
🔗
|
luckcolor |
Yeah sometime you get some infinite recurring ones |
11:04
🔗
|
luckcolor |
either it loops on the same url over and over again or it just gets confused on all the tags of the pages |
11:27
🔗
|
|
Fusl has quit IRC (Ping timeout: 260 seconds) |
11:58
🔗
|
luckcolor |
Does any body here knows json and can help me? |
11:59
🔗
|
luckcolor |
i have this very long and inline json data wich i would like to have formatted normally |
12:09
🔗
|
Fletcher |
luckcolor, do you just need it formatted once (online tool) or are you looking for a long term solution? (code) |
12:10
🔗
|
Fletcher |
for the former (first result on google) https://jsonformatter.curiousconcept.com/ |
12:11
🔗
|
luckcolor |
ah this work |
12:11
🔗
|
luckcolor |
*works |
12:11
🔗
|
luckcolor |
but i just discovered that this doesn't help me a lot |
12:11
🔗
|
luckcolor |
100 urls over 700 i readed about |
12:12
🔗
|
luckcolor |
Anyway thanks Fletcher :P |
12:13
🔗
|
Fletcher |
np |
12:22
🔗
|
|
jut has joined #archiveteam-bs |
12:26
🔗
|
|
Fusl has joined #archiveteam-bs |
12:38
🔗
|
SketchCow |
THE NEXT GREAT GODANE INBOX CATTLE DRIVE HAS BEGUN |
12:38
🔗
|
SketchCow |
29,000 items going into already existing or new collections |
12:41
🔗
|
SketchCow |
I've got four threads doing the moves, which I think is basically enough. |
12:42
🔗
|
Medowar |
rip IA |
12:45
🔗
|
SketchCow |
7,000 items moved already! |
12:45
🔗
|
SketchCow |
I'm using a method that was agreed upon that doesn't kill IA |
12:46
🔗
|
SketchCow |
Also removes a lot of error issues, where it will flat out reject "you done fucked up" instead of "well, let me try since you said URK" |
12:48
🔗
|
|
ItsYoda has quit IRC (Ping timeout: 260 seconds) |
12:54
🔗
|
luckcolor |
SketchCow you are mass uploading or mass moving to a disk drive? :D |
12:57
🔗
|
SketchCow |
Neither in this case. I am doing mass metadata changes so items go from a central godane upload pool into a few dozen potential collections on the archive. |
12:57
🔗
|
|
ItsYoda has joined #archiveteam-bs |
12:57
🔗
|
luckcolor |
ah ok |
12:58
🔗
|
SketchCow |
But 30,000 items done in the way I'm doing them (the script does one by one so if there's queue backup, it'll stop doing it), can still take quite a bit of time. |
12:59
🔗
|
SketchCow |
Examples of new collections created in the last hour for this: https://archive.org/details/the-laura-ingraham-show https://archive.org/details/the-sean-hannity-show |
12:59
🔗
|
luckcolor |
cool |
13:37
🔗
|
|
anjacks0n has joined #archiveteam-bs |
13:43
🔗
|
|
anjacks0n has quit IRC (anjacks0n) |
13:44
🔗
|
|
anjacks0n has joined #archiveteam-bs |
13:57
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
14:00
🔗
|
|
dashcloud has joined #archiveteam-bs |
14:34
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
14:55
🔗
|
|
DoomTay has joined #archiveteam-bs |
14:55
🔗
|
|
nickname_ has joined #archiveteam-bs |
14:55
🔗
|
|
Aranje has joined #archiveteam-bs |
15:04
🔗
|
|
j08nY has joined #archiveteam-bs |
15:06
🔗
|
SketchCow |
godane: When you have a chance, please look at https://archive.org/details/austinchronicle - several PDFs seem to be 100% blank (I checked) |
15:08
🔗
|
SketchCow |
In other news: There's "Ensign Magazine" https://archive.org/details/Ensign_Magazine which is a Mormon publication, and there's a boat magazine called The Ensign |
15:08
🔗
|
SketchCow |
They must fucking HATE each other |
15:10
🔗
|
DoomTay |
Yeah well I once saw that a few of the stuff at https://archive.org/details/doom-cds seem tobe broken too |
15:10
🔗
|
DoomTay |
Lemme try downloading and pluggin in the smallest one there |
15:15
🔗
|
DoomTay |
So which issues in particular are 100% blank? |
15:15
🔗
|
SketchCow |
I asked Godane. |
15:20
🔗
|
DoomTay |
Yup, the ISO at https://archive.org/details/DoomFever1995MapleMedia is "corrupted" |
15:21
🔗
|
|
nickname_ has quit IRC (Read error: Operation timed out) |
15:44
🔗
|
|
nickname_ has joined #archiveteam-bs |
15:52
🔗
|
|
JesseW has joined #archiveteam-bs |
15:54
🔗
|
joepie91 |
https://en.wikipedia.org/wiki/MediaWiki_talk:Spam-blacklist#archive.is |
15:54
🔗
|
joepie91 |
:| |
16:09
🔗
|
|
mr-b has quit IRC (Ping timeout: 246 seconds) |
16:10
🔗
|
|
yakfish has quit IRC (Ping timeout: 246 seconds) |
16:10
🔗
|
|
yakfish has joined #archiveteam-bs |
16:11
🔗
|
|
mr-b has joined #archiveteam-bs |
16:12
🔗
|
|
nickname_ has quit IRC (Ping timeout: 492 seconds) |
16:12
🔗
|
|
nickname_ has joined #archiveteam-bs |
16:13
🔗
|
JesseW |
eh, from my point of view, banning archive.is from wikipedia makes it less well known, which means it will be longer before the pressure on it is sufficient to kill it -- which I'm happy about |
16:16
🔗
|
|
jut has quit IRC (Leaving) |
16:20
🔗
|
JesseW |
joepie91: btw, as of today, it appears that it may be un-blacklisted: https://en.wikipedia.org/wiki/Wikipedia:Requests_for_comment/Archive.is_RFC_4 |
16:23
🔗
|
joepie91 |
I'm not really sure why grab-site is consuming a full CPU core grabbing a site...? |
16:23
🔗
|
joepie91 |
that seems unnecessary |
16:24
🔗
|
joepie91 |
... okay, so it only does that when it's failing to fetch a URL... |
16:24
🔗
|
joepie91 |
it still uses a lot otherwise but not a full core |
16:28
🔗
|
|
anjacks0n has quit IRC (anjacks0n) |
16:36
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
16:37
🔗
|
godane |
SketchCow: i think some pdfs from them are just blank |
16:38
🔗
|
godane |
from Austin Chronicle |
16:39
🔗
|
DoomTay |
Oh, I see it now |
16:43
🔗
|
joepie91 |
so |
16:43
🔗
|
joepie91 |
I might make a PR for wpull soon to try and implement cloudflare "ddos protection" bypass :| |
16:43
🔗
|
joepie91 |
fucking cloudflare |
16:46
🔗
|
DoomTay |
Something tells me that ddos protection is the real reason why the job on http://archive.fbi.ninja/lmao/ "finished" so quickly |
16:54
🔗
|
godane |
example pdf thats blank: http://www.austinchronicle.com/download/2007-10-12/chronicle.pdf |
16:58
🔗
|
joepie91 |
DoomTay: yeah, it is. |
16:58
🔗
|
joepie91 |
DoomTay: irritatingly it seems to ask for the 'ddos captcha' again after X requiests |
16:58
🔗
|
joepie91 |
so just exporting cookies and useragent from the browser will not get you past it for the entire job |
16:58
🔗
|
joepie91 |
meaning this needs to be supported in the actual downloading tool to work |
16:58
🔗
|
joepie91 |
because it can encounter the wall again at any point |
16:58
🔗
|
|
anjacks0n has joined #archiveteam-bs |
16:58
🔗
|
joepie91 |
and you get handed a fresh 'clearance cookie' every time you encounter the wall |
16:59
🔗
|
joepie91 |
it's also a hilariously bad captcha anyway: http://storage3.static.itmages.com/i/16/0622/h_1466613477_7341936_022a37d420.png |
16:59
🔗
|
joepie91 |
but still breaks archival |
16:59
🔗
|
joepie91 |
so, good job cloudflare, you fucked up legitimate bots and didn't hamper the ddos kids |
16:59
🔗
|
joepie91 |
thanks for breaking the web |
16:59
🔗
|
joepie91 |
</rant> |
17:00
🔗
|
DoomTay |
Maybe persude the guy behind the site to not use cloudflare, at least for a while? |
17:00
🔗
|
DoomTay |
Unless it wasn't his decision to make? |
17:01
🔗
|
|
ris has joined #archiveteam-bs |
17:01
🔗
|
joepie91 |
DoomTay: doesn't solve the bigger problem |
17:01
🔗
|
joepie91 |
a ton of sites use cloudflare |
17:01
🔗
|
joepie91 |
we need to be able to deal with that |
17:04
🔗
|
SketchCow |
Can I just say, joepie |
17:04
🔗
|
SketchCow |
My favorite part of the post-IA-DDOS fallout was watching anonymous groups throw each other under a bus |
17:07
🔗
|
xmc |
<3 |
17:17
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
17:29
🔗
|
|
VADemon has joined #archiveteam-bs |
17:40
🔗
|
|
JW_work1 has joined #archiveteam-bs |
17:41
🔗
|
|
JW_work has quit IRC (Read error: Operation timed out) |
17:44
🔗
|
|
anjacks0n has quit IRC (anjacks0n) |
17:50
🔗
|
|
JW_work1 has quit IRC (Quit: Leaving.) |
17:51
🔗
|
|
JW_work has joined #archiveteam-bs |
17:59
🔗
|
|
JW_work has quit IRC (Quit: Leaving.) |
18:00
🔗
|
|
tomwsmf-a has quit IRC (Ping timeout: 258 seconds) |
18:02
🔗
|
|
JW_work has joined #archiveteam-bs |
18:06
🔗
|
|
zino has quit IRC (Quit: Leaving) |
18:12
🔗
|
|
Start has joined #archiveteam-bs |
18:16
🔗
|
|
zino has joined #archiveteam-bs |
18:16
🔗
|
|
JW_work has quit IRC (Quit: Leaving.) |
18:16
🔗
|
|
JW_work has joined #archiveteam-bs |
18:31
🔗
|
|
nickname_ has quit IRC (Ping timeout: 492 seconds) |
19:13
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:16
🔗
|
|
dashcloud has joined #archiveteam-bs |
19:21
🔗
|
|
anjacks0n has joined #archiveteam-bs |
19:30
🔗
|
yipdw |
it really fucks with me that Ruby Range objects have #cover?, #include?, and #overlaps? methods |
19:30
🔗
|
yipdw |
the first two especially, the difference is difficult to explain |
19:31
🔗
|
yipdw |
#include? appears to require iterable objects whereas #cover? requires only a partial order |
19:31
🔗
|
yipdw |
computers.txt |
20:25
🔗
|
|
anjacks0n has quit IRC (anjacks0n) |
20:33
🔗
|
|
j08nY has quit IRC (Quit: Leaving) |
21:03
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
22:03
🔗
|
|
anjacks0n has joined #archiveteam-bs |
22:04
🔗
|
|
anjacks0n has quit IRC (Client Quit) |
22:12
🔗
|
|
RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) |
22:23
🔗
|
|
anjacks0n has joined #archiveteam-bs |
22:46
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
22:50
🔗
|
|
dashcloud has joined #archiveteam-bs |
22:52
🔗
|
|
anjacks0n has quit IRC (anjacks0n) |
22:59
🔗
|
|
RichardG has joined #archiveteam-bs |
23:06
🔗
|
|
RichardG_ has joined #archiveteam-bs |
23:11
🔗
|
|
RichardG_ has quit IRC (Ping timeout: 250 seconds) |
23:11
🔗
|
|
RichardG has quit IRC (Ping timeout: 370 seconds) |
23:12
🔗
|
|
RichardG has joined #archiveteam-bs |
23:24
🔗
|
|
hook54321 has joined #archiveteam-bs |
23:26
🔗
|
|
tomwsmf-a has joined #archiveteam-bs |
23:54
🔗
|
|
BlueMaxim has joined #archiveteam-bs |