Time |
Nickname |
Message |
00:18
🔗
|
|
adinbied has joined #archiveteam-ot |
01:20
🔗
|
|
svchfoo1 has quit IRC (Read error: Operation timed out) |
01:21
🔗
|
|
svchfoo1 has joined #archiveteam-ot |
01:22
🔗
|
|
svchfoo3 sets mode: +o svchfoo1 |
01:25
🔗
|
|
chirlu has quit IRC (Read error: Operation timed out) |
01:28
🔗
|
adinbied |
Figured I'd ask people here and see what they think. I'm currently archiving a lot of my schools stuff (old newspapers scanned and OCR'ed, VHS tapes digitized and restored, photos, etc.) and I've been struggling for a while on how best to implement a browsing system or for how to actually make things work |
01:28
🔗
|
SketchCow |
You can upload them to archive.org. |
01:29
🔗
|
SketchCow |
I can give you an FTP drop or you can upload them yourself. |
01:29
🔗
|
SketchCow |
If it's enough items I can give you a collection. |
01:31
🔗
|
adinbied |
I was thinking about that, but wasn't sure if they would qualify as being notable enough - that does sound like the best way to do things, though. |
01:34
🔗
|
SketchCow |
I don't care about notability. |
01:35
🔗
|
SketchCow |
I only care about it being a notable amount of files. |
01:35
🔗
|
SketchCow |
Like over 75 |
01:37
🔗
|
|
chirlu has joined #archiveteam-ot |
01:37
🔗
|
adinbied |
Well, its still in progress, but there will probably be ~150 3-8 MB PDFs for the school newspaper archives, and then another 150 x264 videos or so |
01:41
🔗
|
adinbied |
Just threw an example newspaper that I scanned up to IA: https://archive.org/details/SavantWinter2001OCR |
01:41
🔗
|
Flashfire |
adinbied how are you scanning them? |
01:44
🔗
|
adinbied |
I've got a Canon flatbed scanner hooked up to my computer and because each newspaper page is larger than the scanner, I scan it in quadrants at 600 DPI. I then take the four files for each page and use a panorama stitcher (PTGUI) to combine it into one image. Then I'll go through and crop/rotate/color correct each page, and then downscale the resulting image to 25% of its 55 MP resolution. I'll then use a png to PDF |
01:44
🔗
|
adinbied |
converter to get everything into one PDF, and then I'll run it through ABBYY FineReader to OCR it before everything is done. |
01:47
🔗
|
SketchCow |
So yeah, just scan |
01:47
🔗
|
SketchCow |
We will automatically OCR them too |
01:49
🔗
|
|
alex____ has quit IRC (Quit: ZZzzz) |
01:50
🔗
|
adinbied |
SketchCow, here's a tree of what all I've got so far - currently trying to figure out how to organize everything and allow future people to be able to search through this easily: https://pastebin.com/raw/1w6TD0sH |
03:09
🔗
|
|
Stilett0 has joined #archiveteam-ot |
03:12
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
03:13
🔗
|
|
Stiletto has joined #archiveteam-ot |
03:13
🔗
|
|
Stilett0 has quit IRC (Ping timeout: 252 seconds) |
03:18
🔗
|
|
Stilett0 has joined #archiveteam-ot |
03:20
🔗
|
|
Stiletto has quit IRC (Ping timeout: 265 seconds) |
04:52
🔗
|
|
hiroi has joined #archiveteam-ot |
04:57
🔗
|
|
Stiletto has joined #archiveteam-ot |
04:59
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
05:30
🔗
|
|
Stilett0 has joined #archiveteam-ot |
05:33
🔗
|
|
Stiletto has quit IRC (Ping timeout: 492 seconds) |
05:36
🔗
|
|
Stiletto has joined #archiveteam-ot |
05:40
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
05:56
🔗
|
|
Stilett0 has joined #archiveteam-ot |
06:01
🔗
|
|
Stiletto has quit IRC (Ping timeout: 492 seconds) |
06:02
🔗
|
|
Stiletto has joined #archiveteam-ot |
06:02
🔗
|
|
Stilett0 has quit IRC (Ping timeout: 246 seconds) |
06:05
🔗
|
|
Stilett0 has joined #archiveteam-ot |
06:09
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
06:11
🔗
|
|
Stiletto has joined #archiveteam-ot |
06:11
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
06:15
🔗
|
|
Stilett0 has joined #archiveteam-ot |
06:16
🔗
|
|
Stiletto has quit IRC (Ping timeout: 260 seconds) |
06:21
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
06:23
🔗
|
|
Stiletto has joined #archiveteam-ot |
06:37
🔗
|
|
Stilett0 has joined #archiveteam-ot |
06:40
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
06:44
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
06:47
🔗
|
|
Stiletto has joined #archiveteam-ot |
06:58
🔗
|
|
Martle_ has joined #archiveteam-ot |
07:00
🔗
|
|
Martle has quit IRC (Read error: Operation timed out) |
07:10
🔗
|
|
Stilett0 has joined #archiveteam-ot |
07:12
🔗
|
|
Stiletto has quit IRC (Ping timeout: 252 seconds) |
07:12
🔗
|
|
Stiletto has joined #archiveteam-ot |
07:15
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
07:23
🔗
|
|
Stilett0 has joined #archiveteam-ot |
07:25
🔗
|
|
Stiletto has quit IRC (Ping timeout: 268 seconds) |
07:31
🔗
|
|
Stiletto has joined #archiveteam-ot |
07:33
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
07:36
🔗
|
|
Stiletto has quit IRC (Ping timeout: 255 seconds) |
07:39
🔗
|
|
Stiletto has joined #archiveteam-ot |
07:40
🔗
|
|
Martle_ has quit IRC (Remote host closed the connection) |
07:49
🔗
|
|
Stilett0 has joined #archiveteam-ot |
07:55
🔗
|
|
Stiletto has quit IRC (Ping timeout: 633 seconds) |
07:57
🔗
|
|
Stiletto has joined #archiveteam-ot |
07:59
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
08:00
🔗
|
|
Stilett0 has joined #archiveteam-ot |
08:01
🔗
|
|
Stiletto has quit IRC (Ping timeout: 252 seconds) |
08:35
🔗
|
|
Stiletto has joined #archiveteam-ot |
08:36
🔗
|
|
Stilett0 has quit IRC (Read error: Operation timed out) |
08:47
🔗
|
|
m007a83_ has joined #archiveteam-ot |
08:50
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
08:53
🔗
|
|
m007a83__ has joined #archiveteam-ot |
08:56
🔗
|
|
m007a83_ has quit IRC (Read error: Operation timed out) |
09:01
🔗
|
|
Stiletto has quit IRC (Ping timeout: 268 seconds) |
09:01
🔗
|
|
Stiletto has joined #archiveteam-ot |
09:11
🔗
|
|
alex__ has joined #archiveteam-ot |
09:30
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
09:31
🔗
|
|
Stiletto has joined #archiveteam-ot |
09:45
🔗
|
|
Stilett0 has joined #archiveteam-ot |
09:48
🔗
|
|
Stiletto has quit IRC (Ping timeout: 360 seconds) |
09:48
🔗
|
|
Stiletto has joined #archiveteam-ot |
09:50
🔗
|
|
Stilett0 has quit IRC (Ping timeout: 268 seconds) |
10:57
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 600 seconds) |
11:00
🔗
|
|
Mateon1 has joined #archiveteam-ot |
11:58
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
12:09
🔗
|
|
hiroi has left Be back later... |
12:30
🔗
|
|
m007a83 has joined #archiveteam-ot |
12:34
🔗
|
|
m007a83_ has joined #archiveteam-ot |
12:35
🔗
|
|
m007a83__ has quit IRC (Read error: Operation timed out) |
12:39
🔗
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
13:52
🔗
|
|
m007a83_ is now known as m007a83 |
14:18
🔗
|
|
Stilett0 has joined #archiveteam-ot |
14:20
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
14:34
🔗
|
|
Stiletto has joined #archiveteam-ot |
14:35
🔗
|
|
Stilett0 has quit IRC (Ping timeout: 264 seconds) |
14:57
🔗
|
|
Stilett0 has joined #archiveteam-ot |
15:01
🔗
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
15:05
🔗
|
|
Stiletto has joined #archiveteam-ot |
15:07
🔗
|
|
Stilett0 has quit IRC (Ping timeout: 268 seconds) |
17:00
🔗
|
|
alex____ has joined #archiveteam-ot |
17:01
🔗
|
|
alex__ has quit IRC (Ping timeout: 252 seconds) |
19:18
🔗
|
|
sep332 has quit IRC (Read error: Connection reset by peer) |
19:35
🔗
|
|
Martle has joined #archiveteam-ot |
20:13
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
20:16
🔗
|
|
wp494 has joined #archiveteam-ot |
20:22
🔗
|
|
S1mpbrain has quit IRC (Remote host closed the connection) |
20:25
🔗
|
|
SimpBrain has joined #archiveteam-ot |
20:26
🔗
|
|
BlueMax has joined #archiveteam-ot |
21:13
🔗
|
|
BlueMax has quit IRC (Remote host closed the connection) |
21:13
🔗
|
|
BlueMax has joined #archiveteam-ot |