Time |
Nickname |
Message |
00:00
🔗
|
SketchCow |
doomtay. shut up. |
00:01
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
00:03
🔗
|
|
Start has joined #archiveteam-bs |
00:44
🔗
|
ravetcofx |
here's a neat magnet link; it's for the This American Life podcast archive of episodes from 1995-2007 |
00:44
🔗
|
ravetcofx |
magnet:?xt=urn:btih:5e31b76cd01ff9426ca2bec078c712ff20e17af6&dn=This%20American%20Life%20-%20Complete%20Volume%201995-2007%20-%20Episodes%201-342&tr=udp%3A%2F%2Ftracker.publicbt.com%2Fannounce&tr=udp%3A%2F%2Fglotorrents.pw%3A6969%2Fannounce&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80%2Fannounce&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce |
00:51
🔗
|
|
Stiletto has quit IRC () |
01:15
🔗
|
|
schbirid2 has joined #archiveteam-bs |
01:18
🔗
|
|
schbirid has quit IRC (Ping timeout: 244 seconds) |
01:18
🔗
|
|
Stiletto has joined #archiveteam-bs |
02:00
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
02:01
🔗
|
|
Stiletto has joined #archiveteam-bs |
02:01
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
02:02
🔗
|
|
dashcloud has joined #archiveteam-bs |
02:24
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
02:25
🔗
|
|
Stiletto has joined #archiveteam-bs |
02:49
🔗
|
|
SketchCow has quit IRC (Read error: Connection reset by peer) |
02:55
🔗
|
|
SketchCow has joined #archiveteam-bs |
02:55
🔗
|
|
midas sets mode: +o SketchCow |
02:55
🔗
|
|
swebb sets mode: +o SketchCow |
03:43
🔗
|
|
DoomTay has quit IRC (Quit: Page closed) |
03:58
🔗
|
|
kristian_ has quit IRC (Leaving) |
04:06
🔗
|
|
godane has quit IRC (Leaving.) |
04:29
🔗
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
04:35
🔗
|
|
Sk1d has joined #archiveteam-bs |
04:37
🔗
|
|
Frogging sets mode: +o arkiver |
04:52
🔗
|
|
Meroje has quit IRC (Quit: bye!) |
04:53
🔗
|
|
Meroje has joined #archiveteam-bs |
05:35
🔗
|
|
Fusl has quit IRC (Contact: http://hallowe.lt/) |
05:39
🔗
|
yipdw |
nice, whoever was in #internetarchive as "obama" sent me like 40 queries |
05:40
🔗
|
yipdw |
some boys don't like the +b |
05:41
🔗
|
yipdw |
under 40 different handles and CTCP VERSIONed me twice |
05:53
🔗
|
|
godane has joined #archiveteam-bs |
05:56
🔗
|
HCross2 |
Nice yipdw - was holding my tongue back on him myself |
06:10
🔗
|
godane |
i'm uploading 25gb of The Doug Urbanski Show |
06:10
🔗
|
godane |
i'm hoping IA can handle me uploading since one item is still waiting to be derived |
06:28
🔗
|
hook54321 |
DoomTay, Frogging : It's a genealogy library. |
06:33
🔗
|
|
fusl has joined #archiveteam-bs |
06:42
🔗
|
|
Honno has joined #archiveteam-bs |
07:13
🔗
|
|
Start_ has joined #archiveteam-bs |
07:13
🔗
|
|
Start has quit IRC (Read error: Connection reset by peer) |
07:15
🔗
|
|
sep332 has quit IRC (Quit: konversation out) |
07:24
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
07:28
🔗
|
|
dashcloud has joined #archiveteam-bs |
07:34
🔗
|
|
atrocity has quit IRC (Read error: Connection reset by peer) |
07:35
🔗
|
|
atrocity has joined #archiveteam-bs |
07:54
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
07:57
🔗
|
|
dashcloud has joined #archiveteam-bs |
07:58
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
08:00
🔗
|
|
BartoCH has joined #archiveteam-bs |
08:10
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
08:14
🔗
|
|
Start_ is now known as Start |
08:23
🔗
|
|
wp494 has quit IRC (Read error: Connection reset by peer) |
08:23
🔗
|
|
wp494 has joined #archiveteam-bs |
09:03
🔗
|
|
tomwsmf has quit IRC (Read error: Operation timed out) |
09:57
🔗
|
|
VADemon has joined #archiveteam-bs |
10:07
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
10:28
🔗
|
|
yakfish has quit IRC (Ping timeout: 246 seconds) |
10:43
🔗
|
|
yakfish has joined #archiveteam-bs |
11:30
🔗
|
|
VADemon has quit IRC (Read error: Operation timed out) |
11:53
🔗
|
luckcolor |
hook54321: after you start the grab-site crawl, the crawl folder contains several files (those are for the settings); if you change any of these, wpull will pick up the new settings |
11:54
🔗
|
luckcolor |
there should be one for ignore regexes |
11:54
🔗
|
luckcolor |
one per line |
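A minimal sketch of what luckcolor describes: appending ignore patterns, one regular expression per line, to a settings file inside a running crawl's folder. The file name `ignores`, the helper names `add_ignores` and `is_ignored`, and the use of `re.search` to preview matches are assumptions made for illustration; check the crawl folder for the actual file name before relying on this.

```python
import re
from pathlib import Path


def add_ignores(crawl_dir, patterns, filename="ignores"):
    """Append new ignore patterns, one per line, to the crawl's ignore file.

    `filename` is an assumed name; adjust it to whatever the crawl folder
    actually contains. Returns the patterns that were newly written.
    """
    path = Path(crawl_dir) / filename
    text = path.read_text() if path.exists() else ""
    existing = text.splitlines()

    new = []
    for pattern in patterns:
        re.compile(pattern)  # raises re.error early if the regex is invalid
        if pattern not in existing and pattern not in new:
            new.append(pattern)

    with path.open("a") as f:
        # Make sure we start on a fresh line if the file lacks a trailing newline.
        if text and not text.endswith("\n"):
            f.write("\n")
        for pattern in new:
            f.write(pattern + "\n")
    return new


def is_ignored(url, patterns):
    """Preview whether a URL would match any pattern (assumes search semantics)."""
    return any(re.search(p, url) for p in patterns)
```

Calling `add_ignores("path/to/crawl", [r"/calendar/"])` a second time with the same pattern writes nothing, so the helper is safe to rerun while the crawl is in progress.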
13:45
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
14:03
🔗
|
|
sep332 has joined #archiveteam-bs |
15:11
🔗
|
|
DoomTay has joined #archiveteam-bs |
15:19
🔗
|
|
goekesmi has left |
15:25
🔗
|
HCross2 |
yipdw: could we have a test endpoint on FOS to try and sort these speed issues please? |
15:26
🔗
|
yipdw |
HCross2: an iperf endpoint, or something else |
15:27
🔗
|
HCross2 |
rsync |
15:27
🔗
|
yipdw |
note that fos is currently doing a bunch of disk work so you're going to get that interference |
15:28
🔗
|
SketchCow |
:) |
15:28
🔗
|
yipdw |
as a result your measurements are going to be noisy |
15:28
🔗
|
SketchCow |
Every time I try to clean up FOS I end up with 20% more used disk space |
15:28
🔗
|
HCross2 |
Yeah, I see. It's just that from the EU OVH it's painful and they want to test. |
15:29
🔗
|
SmileyG |
they're actually interested :O |
15:35
🔗
|
godane |
so the doug urbanski show is mostly uploaded |
15:37
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
15:42
🔗
|
|
dashcloud has joined #archiveteam-bs |
15:45
🔗
|
yeoldeto1 |
rip Krautchan |
16:22
🔗
|
|
kristian_ has joined #archiveteam-bs |
16:55
🔗
|
|
DoomTay has quit IRC (Quit: Page closed) |
17:21
🔗
|
|
atrocity has quit IRC (Read error: Connection reset by peer) |
17:39
🔗
|
Frogging |
is it possible that pastebin.com expires/removes old pastes even if they didn't have an expiry set? there are a lot of broken pastebin links in my IRC logs and I could swear they never had an expiry |
17:46
🔗
|
|
robink has quit IRC (Ping timeout: 501 seconds) |
17:47
🔗
|
yipdw |
there's plenty of ways to remove stuff from pastebin.com |
17:47
🔗
|
yipdw |
logged-in users can delete their pastes, abuse reports, DMCA reports |
17:49
🔗
|
|
DoomTay has joined #archiveteam-bs |
17:50
🔗
|
|
brayden__ has joined #archiveteam-bs |
17:50
🔗
|
|
swebb sets mode: +o brayden__ |
17:54
🔗
|
|
brayden_ has quit IRC (Read error: Operation timed out) |
17:57
🔗
|
Atluxity |
yes, pastebin.com may remove stuff without notice |
17:57
🔗
|
Atluxity |
but I would not rate that highest on the likelihood |
17:57
🔗
|
Frogging |
i guess people removing them happens more frequently than I'd expect |
17:57
🔗
|
Frogging |
it's just random stuff like code snippets my friend sent me last year, I wouldn't expect him to have deleted it manually or anything |
18:01
🔗
|
HCross |
yipdw, getting a chroot failed error :/ |
18:05
🔗
|
godane |
kpfa archives are now up to 2016-08-09 |
18:07
🔗
|
|
robink has joined #archiveteam-bs |
18:08
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
18:16
🔗
|
|
DoomTay has quit IRC (Quit: Page closed) |
18:25
🔗
|
|
DoomTay has joined #archiveteam-bs |
18:27
🔗
|
DoomTay |
So http://4publicpurity.org/ is now a blank page |
18:28
🔗
|
Kaz |
correct |
18:31
🔗
|
DoomTay |
At least it was saved, even if there was really little to save |
18:39
🔗
|
|
VADemon has joined #archiveteam-bs |
18:41
🔗
|
|
DoomTay has quit IRC (Quit: Page closed) |
18:53
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
18:56
🔗
|
|
dashcloud has joined #archiveteam-bs |
19:32
🔗
|
|
DoomTay has joined #archiveteam-bs |
19:36
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:44
🔗
|
|
tomwsmf has joined #archiveteam-bs |
19:45
🔗
|
|
dashcloud has joined #archiveteam-bs |
19:51
🔗
|
|
Lord_Nigh has quit IRC (ZNC - http://znc.in) |
20:07
🔗
|
godane |
SketchCow: more Nintendo Power issues: https://www.reddit.com/r/DataHoarder/comments/4wzzsv/a_few_more_issues_of_nintendo_power/ |
20:07
🔗
|
Frogging |
yeah I just saw that, grabbed it |
20:08
🔗
|
godane |
they are smaller than the other releases of Nintendo Power |
20:08
🔗
|
godane |
i also have issue 171 and 180 |
20:11
🔗
|
|
Lord_Nigh has joined #archiveteam-bs |
20:49
🔗
|
SketchCow |
All set for Nintendo power at the moment thank youuuuuuuuuuuuuuuuuuuuu |
20:53
🔗
|
godane |
ok |
20:56
🔗
|
DoomTay |
Let's hope it doesn't go dark again |
22:04
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
22:07
🔗
|
|
dashcloud has joined #archiveteam-bs |
22:20
🔗
|
hook54321 |
Frogging: We could set something up that automatically archives pastebin (and possibly some other sites?) links that are posted in IRC. |
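A rough sketch of the first half of hook54321's idea: pulling pastebin links out of an IRC message and turning each one into a Wayback Machine "Save Page Now" URL. The assumption that paste IDs are eight alphanumeric characters, and the helper names `find_paste_ids` and `save_urls`, are illustrative only; the code builds the URLs but does not fetch them, so a bot would still need to request each one at a polite rate.

```python
import re

# Assumption: paste IDs are 8 alphanumeric characters, optionally under /raw/.
PASTE_RE = re.compile(
    r"https?://(?:www\.)?pastebin\.com/(?:raw/)?([A-Za-z0-9]{8})\b"
)


def find_paste_ids(message):
    """Return the distinct paste IDs found in a chat message, in order seen."""
    ids = []
    for match in PASTE_RE.finditer(message):
        paste_id = match.group(1)
        if paste_id not in ids:
            ids.append(paste_id)
    return ids


def save_urls(message):
    """Build one Wayback Machine save URL per distinct paste in the message."""
    return [
        "https://web.archive.org/save/https://pastebin.com/" + paste_id
        for paste_id in find_paste_ids(message)
    ]
```

Deduplicating by paste ID keeps the normal and `/raw/` forms of the same link from being submitted twice.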
22:24
🔗
|
|
Honno has quit IRC (Read error: Operation timed out) |
22:26
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
22:27
🔗
|
|
BartoCH has joined #archiveteam-bs |
22:51
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
22:51
🔗
|
|
DoomTay has quit IRC (Ping timeout: 268 seconds) |
23:08
🔗
|
|
BartoCH has joined #archiveteam-bs |
23:11
🔗
|
|
DoomTay has joined #archiveteam-bs |
23:20
🔗
|
hook54321 |
If I have a reprint of a book that was originally published in 1906, is it safe to scan it and put it on archive.org? |
23:21
🔗
|
xmc |
yes |
23:21
🔗
|
DoomTay |
I'd OCR it so we can have it in pure text |
23:22
🔗
|
hook54321 |
DoomTay: Like, in addition to the scans, or just OCR? |
23:22
🔗
|
DoomTay |
Either might work |
23:22
🔗
|
DoomTay |
Not sure which archive.org would accept |
23:23
🔗
|
DoomTay |
Yeah, both |
23:24
🔗
|
hook54321 |
What's the best OCR software? Are there any good free ones? |
23:25
🔗
|
DoomTay |
That I can't help you with |
23:30
🔗
|
dashcloud |
if you've got a good, clean scan, Internet Archive will OCR it for you as part of the process |
23:31
🔗
|
xmc |
they'll ocr anyway, but if it's a garbage scan you won't get much out of it |
23:35
🔗
|
hook54321 |
How good is their OCR? |
23:38
🔗
|
Frogging |
they're kind of in the business of digitizing books so I think it'd be good |
23:38
🔗
|
xmc |
quite |
23:58
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
23:58
🔗
|
|
BartoCH has joined #archiveteam-bs |