Time |
Nickname |
Message |
00:08
🔗
|
|
Honno has quit IRC (Read error: Operation timed out) |
00:24
🔗
|
SketchCow |
Well, that was definitely #-bs |
00:24
🔗
|
SketchCow |
So congrats to all involved |
00:26
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
00:31
🔗
|
arkiver |
PurpleSym: will have a look tomorrow |
00:44
🔗
|
|
kristian_ has quit IRC (Quit: Leaving) |
02:43
🔗
|
|
ndizzle has quit IRC (Quit: Leaving) |
03:15
🔗
|
|
Start has joined #archiveteam-bs |
03:27
🔗
|
|
Start has quit IRC (ircd.choopa.net irc.mzima.net) |
03:27
🔗
|
|
Stiletto has quit IRC (ircd.choopa.net irc.mzima.net) |
03:27
🔗
|
|
yipdw has quit IRC (ircd.choopa.net irc.mzima.net) |
03:27
🔗
|
|
joepie91 has quit IRC (ircd.choopa.net irc.mzima.net) |
03:27
🔗
|
|
robink has quit IRC (ircd.choopa.net irc.mzima.net) |
03:33
🔗
|
|
yipdw_ has joined #archiveteam-bs |
03:33
🔗
|
|
Frogging sets mode: +o yipdw_ |
03:36
🔗
|
|
Start has joined #archiveteam-bs |
03:36
🔗
|
|
Stiletto has joined #archiveteam-bs |
03:36
🔗
|
|
joepie91 has joined #archiveteam-bs |
03:36
🔗
|
|
robink has joined #archiveteam-bs |
03:36
🔗
|
|
walkeroh has joined #archiveteam-bs |
03:48
🔗
|
|
yipdw_ has quit IRC (Read error: Operation timed out) |
04:01
🔗
|
|
yipdw has joined #archiveteam-bs |
04:01
🔗
|
|
Frogging sets mode: +o yipdw |
04:20
🔗
|
|
tomwsmf_ has quit IRC (Read error: Operation timed out) |
04:39
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
04:49
🔗
|
|
tomwsmf_ has joined #archiveteam-bs |
04:53
🔗
|
|
bsmith093 has quit IRC (Read error: Operation timed out) |
04:56
🔗
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
05:02
🔗
|
|
Sk1d has joined #archiveteam-bs |
05:48
🔗
|
|
Start has joined #archiveteam-bs |
05:50
🔗
|
|
Start has quit IRC (Remote host closed the connection) |
05:55
🔗
|
|
tomwsmf_ has quit IRC (Read error: Operation timed out) |
05:56
🔗
|
|
Start has joined #archiveteam-bs |
06:19
🔗
|
|
t2t2 has joined #archiveteam-bs |
06:24
🔗
|
|
dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) |
06:27
🔗
|
|
dashcloud has joined #archiveteam-bs |
06:43
🔗
|
|
metalcamp has joined #archiveteam-bs |
07:28
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
07:36
🔗
|
|
schbirid has joined #archiveteam-bs |
08:13
🔗
|
|
robink has quit IRC (Ping timeout: 506 seconds) |
08:52
🔗
|
|
robink has joined #archiveteam-bs |
08:59
🔗
|
|
RedType has joined #archiveteam-bs |
09:16
🔗
|
|
Genericen has joined #archiveteam-bs |
09:22
🔗
|
|
Honno has joined #archiveteam-bs |
09:56
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
10:17
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
10:26
🔗
|
|
godane has quit IRC (Ping timeout: 244 seconds) |
10:34
🔗
|
|
BartoCH has joined #archiveteam-bs |
10:44
🔗
|
|
godane has joined #archiveteam-bs |
10:56
🔗
|
|
VADemon has joined #archiveteam-bs |
11:36
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
11:40
🔗
|
|
dashcloud has joined #archiveteam-bs |
12:34
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
12:35
🔗
|
|
Genericen has quit IRC (Remote host closed the connection) |
12:52
🔗
|
|
REiN^ has joined #archiveteam-bs |
13:27
🔗
|
|
Jeroen__ has quit IRC (Ping timeout: 268 seconds) |
13:58
🔗
|
|
zenguy_pc has joined #archiveteam-bs |
14:16
🔗
|
|
Genericen has joined #archiveteam-bs |
14:38
🔗
|
|
Genericen has left |
14:43
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
14:46
🔗
|
|
ndiddy has joined #archiveteam-bs |
14:53
🔗
|
|
dashcloud has joined #archiveteam-bs |
15:24
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
15:27
🔗
|
|
dashcloud has joined #archiveteam-bs |
16:19
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
16:55
🔗
|
|
Start has joined #archiveteam-bs |
18:04
🔗
|
|
Mayonaise has quit IRC (Ping timeout: 246 seconds) |
18:04
🔗
|
|
Mayonaise has joined #archiveteam-bs |
18:32
🔗
|
|
ItsYoda has quit IRC (Read error: Connection reset by peer) |
19:28
🔗
|
|
coretx has quit IRC (Ping timeout: 246 seconds) |
19:28
🔗
|
|
coretx has joined #archiveteam-bs |
19:35
🔗
|
|
kristian_ has joined #archiveteam-bs |
20:07
🔗
|
|
ItsYoda has joined #archiveteam-bs |
20:11
🔗
|
|
RichardG_ has joined #archiveteam-bs |
20:13
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
20:14
🔗
|
|
RichardG has quit IRC (Ping timeout: 250 seconds) |
20:24
🔗
|
|
RichardG_ is now known as RichardG |
20:38
🔗
|
|
Sanqui has quit IRC (Remote host closed the connection) |
20:38
🔗
|
|
metal_cam has joined #archiveteam-bs |
20:39
🔗
|
|
metalcamp has quit IRC (Ping timeout: 244 seconds) |
20:39
🔗
|
|
tomwsmf_ has joined #archiveteam-bs |
20:42
🔗
|
|
jspiros has quit IRC (Ping timeout: 492 seconds) |
20:43
🔗
|
|
Sanqui has joined #archiveteam-bs |
21:04
🔗
|
|
metalcamp has joined #archiveteam-bs |
21:04
🔗
|
|
PQHF5KD has joined #archiveteam-bs |
21:05
🔗
|
|
metal_cam has quit IRC (Ping timeout: 244 seconds) |
21:06
🔗
|
PQHF5KD |
Hey, what's the opinion of content living under both the media databases on Archive.org and on the original site on the Wayback Machine |
21:14
🔗
|
|
metalcamp has quit IRC (Read error: Operation timed out) |
21:22
🔗
|
ranma |
can't hurt, especially if it's a binary/archive |
21:22
🔗
|
ranma |
dedup will probably come about some time |
21:30
🔗
|
PQHF5KD |
It's podcasts in this case, the media collections make it easy to discover things rather than having to know about the site |
21:42
🔗
|
DFJustin |
podcasts are such small potatoes in terms of disk space that you might as well |
21:49
🔗
|
|
r3c0d3x has quit IRC (Ping timeout: 260 seconds) |
21:50
🔗
|
|
Jordan_ has quit IRC (Ping timeout: 260 seconds) |
22:01
🔗
|
|
r3c0d3x has joined #archiveteam-bs |
22:07
🔗
|
|
Jordan_ has joined #archiveteam-bs |
22:15
🔗
|
|
r3c0d3x has quit IRC (Read error: Connection timed out) |
22:16
🔗
|
|
r3c0d3x has joined #archiveteam-bs |
22:18
🔗
|
|
wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES) |
22:22
🔗
|
|
PQHF5KD has quit IRC (Quit: Page closed) |
23:05
🔗
|
godane |
SketchCow: i'm grabbing a linuxscreenshots.org database and screenshots |
23:05
🔗
|
godane |
its about 47gb |
23:05
🔗
|
MrRadar |
godane: I already created IA items for it |
23:05
🔗
|
godane |
oh |
23:05
🔗
|
|
Honno has quit IRC (Read error: Operation timed out) |
23:05
🔗
|
MrRadar |
https://archive.org/details/LinuxScreenshots.orgReleaseArchive |
23:05
🔗
|
MrRadar |
https://archive.org/details/LinuxScreenshots.orgScreenshotArchive |
23:06
🔗
|
MrRadar |
The screenshot archive is downloading very slowly: https://catalogd.archive.org/log/563111226 |
23:06
🔗
|
godane |
ok |
23:08
🔗
|
Frogging |
bay12games.com's robots.txt blocks ia_archiver. I talked to the owner and he seems open to allowing it, but he has reservations about server load. What can I tell him? |
23:13
🔗
|
Frogging |
I could just say try it and see... his question is mostly theoretical; the restriction is a relic from long ago when the transfer cap was an issue (it's not anymore) |
23:14
🔗
|
MrRadar |
This Webmasters Stack Exchange question says that the IA archiver bot sometimes obeys the crawl-delay directive: https://webmasters.stackexchange.com/questions/39850/is-there-a-way-to-make-alexas-ia-archiver-slow-down-its-crawling-of-my-website |
23:14
🔗
|
Frogging |
yah, I saw that one when searching earlier |
23:15
🔗
|
|
zenguy_pc has quit IRC (Ping timeout: 255 seconds) |
23:15
🔗
|
|
wp494 has joined #archiveteam-bs |
23:16
🔗
|
Frogging |
there's a bunch of other junk in the file as well, it'd be good if someone who is familiar with Wayback's robots.txt parsing quirks could take a look to make sure there's no "gotchas" in here http://www.bay12games.com/robots.txt |
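A rough way to sanity-check a robots.txt before asking a site owner to change it is to run it through a standard parser and see what it reports for the `ia_archiver` user agent. The sketch below uses Python's `urllib.robotparser`. The sample file contents, the `check_agent` helper and the `example.com` URLs are hypothetical and are not taken from bay12games.com. Note that this only shows how Python's parser reads the rules; the Wayback Machine's own parsing may differ, so it does not rule out Wayback-specific quirks.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents for illustration only;
# this is NOT the real bay12games.com file.
SAMPLE_ROBOTS = """\
User-agent: ia_archiver
Crawl-delay: 10
Disallow: /private/

User-agent: *
Disallow: /
"""


def check_agent(robots_text, agent, url):
    """Return (allowed, crawl_delay) for `agent` fetching `url`.

    `check_agent` is a hypothetical helper name. crawl_delay is None
    when no Crawl-delay applies to the agent.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(agent, url), parser.crawl_delay(agent)


if __name__ == "__main__":
    for agent, url in [
        ("ia_archiver", "http://example.com/index.html"),
        ("ia_archiver", "http://example.com/private/page.html"),
        ("SomeOtherBot", "http://example.com/index.html"),
    ]:
        allowed, delay = check_agent(SAMPLE_ROBOTS, agent, url)
        print(agent, url, "allowed:", allowed, "crawl-delay:", delay)
```

To check a live file instead, the same parser can be pointed at a URL with `set_url()` followed by `read()`, at the cost of a network request.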
23:22
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
23:41
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
23:41
🔗
|
|
wp494 has joined #archiveteam-bs |