[01:30] so, shipping boxes with USPS- what should I know about the options? should I use flat rate boxes instead of normal shipping cartons? [01:55] frb is usually nice and simple [01:56] frb? [01:56] flatratebox [02:03] i have "The Virtual Revolution" series from bbc now [02:09] nooo [02:09] bye bye mail [03:47] SketchCow: I have a Dutch translation of I Chose Freedom (some really old book) that I am currently scanning - the translation was made in 1948 so it's technically not public domain yet - what are the possibilities for placing it on the internet archive, if any? [03:47] it seems there are no PDFs or anything for it *anywhere* [03:47] (we have like a 70 year after death period for public domain) [03:52] joepie91: afaict usually stuff like that gets blacked out [03:52] was looking around today and could only find newer stuff that was in DAISY format with encryption (requires a key only accessible to the blind) [03:52] well, the PDFs and other files were listed if you looked, just not available to download [03:53] okay, but I can upload so that it's archived still, or...? [03:54] I mean, it doesn't have to be immediately accessible [03:54] as long as it's stored somewhere :P [04:00] context: http://en.wikipedia.org/wiki/Victor_Kravchenko_(defector) [04:02] Page 30 finished, press enter to continue with next page... [04:02] >.> [04:09] balrog-: ^? [04:09] :P [04:11] my understanding is that you upload and then get it "blacked out" [04:17] balrog-: once i'm done with these scans can you help me figure out how to appropriately chop em up etc? [04:18] the page borders for each page have to be removed [04:18] and the pages split in two down the middle [04:18] probably first split then debordered [04:18] and finally level adjust and mash to 16-level greyscale [04:18] though the last might not be needed after level adjust [04:19] Lord_Nigh: sure [04:56] joepie91: basically just upload it and if someone bitches it will get blacked out, but they'll keep the copy [04:56] but for stuff like what you described probably no one will ever care [04:57] okay :P [04:57] well [04:57] lol [04:57] yeah [04:58] * joepie91 is using a harddrive as weight to keep the book against the glass plate [05:06] also, this is the kind of scan I'm making now: http://aarnist.cryto.net:81/vrijheid2_0013.png [05:07] and yes, the book really is printed that way :P [05:10] you'll want to split each scan into two pages before uploading [05:11] of course :) [05:11] this is just faster to scan [05:11] yep been down that road :) [05:49] DFJustin: say one had access to a collection of arbitrary electriconic components, as well as a pile of old scanners [05:49] how would one produce a fully automated book scanner? [05:50] er [05:50] electronic * [05:50] as in, are there any guides on this topic? :P [05:50] (I do in fact have access to a pile of scanners and electronic components) [05:51] generally the diy designs use cameras rather than scanners [05:51] http://www.diybookscanner.org/ [05:52] alright, problem is I don't have high resolution cameras [05:53] and money is a constraint for me :/ [05:55] may still be worth a poke around the forums [05:59] DFJustin: you don't happen to know how zero edge scanners work? [05:59] by any chance? [05:59] because if I can figure that out, I have an idea [06:03] nope [06:04] my idea is two glass plates placed in a 90 degrees angle from each other [06:04] and some kind of clamp mechanism to automatically place a book on there [06:04] the book would be facing down, with zero edge scanners under both glass plates [06:05] to flip the page, mechanism could tilt back the book and somehow flip the page, then tilt back onto the glass plates [06:05] that should be reasonably fast while maintaining the high quality of a good flatbed scanner, not damage the spine [06:05] and it should be fully automated [06:05] need to figure out the specifics though [06:06] may also use a metal rod to let the spine rest on, but that would require a more complicated mechanism to tilt it back, since the page can't be turned if there's a metal spine inbetween the pages [06:06] er [06:06] metal rod* [06:09] I mean, if it's fully automated, it doesn't matter that the flatbed scanners are slow [06:09] hell, I could built multiple and run them in parallel :P [08:39] oh shit [08:39] just found scantailor [08:39] I think I'm in love [08:39] ll [08:39] lol * [08:47] mmmmm [08:47] also, unpaper [08:47] http://unpaper.berlios.de/ [08:48] scantailor seems much spiffier though [08:52] how *do* you turn the pages automatically? [08:53] I think it's a semimanual process [08:53] o [08:53] ffs [08:53] why do people think if they ring someone and they don't answer, that ringing back again and again and again will get htem anywhere? [08:53] 5th time [08:55] well, I do that a lot [08:55] sometimes people can't get to their phones in time, or maybe they won't hear it [08:55] my father for example is a prime example of the latter [08:55] I usually ring a second time for urgent calls if no answer, three if it's important as well as urgent [09:16] SmileyG: in the context of the idea I spammed above? [09:22] oh shit, the output from scantailor seems perfect [09:22] wow. [09:25] this man is a pro: http://imgur.com/DB3IO [09:27] hah [10:06] input: http://aarnist.cryto.net:81/vrijheid2_0007.png output: http://imgur.com/a/CZ4VZ [10:06] I am impressed. [10:28] chronomex, DFJustin, SmileyG, underscor, perfect book scanner design: http://www.youtube.com/watch?v=hlOQuuLYavY [10:28] fully automated [10:29] also seems reasonably possible to self-produce one of those at low cost [10:36] * SmileyG wibbles [10:42] http://what-if.xkcd.com/17/ [11:03] well there we go [11:03] processing the entire book now [11:03] :) [11:47] hey, um, does anyone here have actual experience with camera-based book scanners? [11:47] SmileyG, underscor, SketchCow, DFJustin, ? [11:48] if any of you does, do you think a camera with this kind of image quality would be capable of taking usable pictures of books under proper lighting? http://www.dealextreme.com/customerphotos/quarantined/201105/42179-a0ce62af-6a11-41f0-ac4e-72ac9b184ed5.jpg [11:52] I have no clue, but I'd be doubtful of that camera in perticular. [11:52] why is that? :P [11:54] small sensor, cheap camera [11:54] I have no clue how good any "webcam" type cameras are focusing on text... [11:59] found one that I think will suffice [11:59] http://dx.com/p/compact-1-3mp-pc-usb-webcam-with-built-in-microphone-black-51874 [11:59] but no sample pictures, annoyingly [11:59] one review even says it has autofocus but I sort of doubt that [11:59] although not impossible [11:59] but judging from the video this isn't the same crappy sensor that's in 99% of the cheap webcams [12:01] what I really want is this one: http://dx.com/p/stickman-2mp-pc-usb-2-0-webcam-14990?item=2 [12:01] but it's not manufactured anymore :( [12:01] I had one but I broke it - if you focused it correctly, the image was ridiculously sharp [12:34] SmileyG: finished postprocessing! [12:34] http://aarnist.cryto.net:81/vrijheid2.pdf :D [12:34] my first scanned book lol [12:36] not the final version, still missing 2 pages [13:45] Why hello. [13:46] I have a book scanner right here in my room. [14:17] SketchCow: nice :) [14:24] One of these: http://diybookscanner.org/ [14:28] yeah I've seen them [14:28] would like to have one, though auto page-turning would be nice [14:29] Auto page turning scanners require a person to be there to supervise [14:31] does google have people supervising all their scanners? [14:35] Not like they can't afford them? [14:36] Yes [14:36] Also, Google Scanners are by hand, not auto-page-turners. [14:36] you don't want the machine wreck your valuable books [14:41] http://www.buzzfeed.com/reyhan/the-hidden-hands-scanning-the-worlds-knowledge-fo [14:43] hm, ok... [14:45] http://archive.org/details/americanartposters1890s [14:45] I'm looking at porting an online collection into the archive. [14:45] Without losing their metadata. [16:01] FUCK SOLAR FLARES [16:03] WE MUST BLOW UP THE SUN TO SAVE THE INTERNET [16:21] On it [16:21] for each in 00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 [16:21] All my best scripts start like that [16:52] SketchCow: for each in `seq -w 0 42` [16:53] the w does consistent width [16:53] (so you get 00 instead of 0) [16:55] yawn [16:55] There's a joy factor here. [16:56] Also, seq ain't everywhere, kid. [16:56] 6.0-RELEASE FreeBSD [16:56] seq [16:56] seq: Command not found. [17:18] SketchCow: Oh, sorry, the bsd userland version is jot [17:32] * Aranje grins [17:35] Hmmm. [17:35] So you're saying if I do it my way, it just works [17:36] But your attempt to save 80 characters adds a new set of if statements to verify the existence of seq or jot [17:36] hmmmm [18:25] fuck yeah, ubuntu 12.10 seems to have only broken one thing [18:25] excellent [19:40] SketchCow: If you have bash you can do {00..42}} [19:40] Agreed. [19:40] THAT I always forget to do. [19:43] I'm an old tcsh guy, so my foreach looks look like 'foreach file in (blah[1234][0123456789])' [19:49] for each {00..42 [19:49] } [19:50] oh wait soultcer already said it, hahaha [19:50] bash understands sequences, even with variable gaps [19:50] for each in {00..69} ; do wget http://archive.org/download/archiveteam-picplz-$each/000000$each.tar; done [19:51] nice [19:51] See, I used your silly thing to download picplz [19:51] :D [19:51] wget .... & [19:51] DOWNLOAD ALL THE THINGS [19:51] ha ha, if you're an idiot [19:51] i think that'd work, :D [19:52] No, TECHNICALLY a torrent would work, but I'm inside the network, I'm maxing this machine as it is. [19:52] ah :) [19:52] then my way would be slightly bad :D [19:53] Anyway, as it is, this machine is fucking WHALING OUT [19:54] SketchCow: HMMMM [19:54] IOWAIT thru the roof? [19:55] * SmileyG is unsure if bsd has teh same terrible CFQ schedular as is default in linux. [19:55] It hates the idea of you writing a large file in one go [19:55] and goes "fuck it, I'll make everything wait for you!" [19:56] Dude, you do realize I'm on a machine I control and not some lame-ass sloppy-hundreds university machine I'm fighting 30 instances of Dwarf Fortress for domination of, right? [19:56] SketchCow: :D [19:56] dwarffortress is awesome tho :P [19:56] I just mean I dunno what the default schedular is on freeBSD. [19:57] The fact is, it's a virtual instance and it has the pros and cons of that. Pro is that I get 20tb of space, but con is right now, like this week, I'm rushing to get stuff into Wayback and that means basically downloading something like 12 terabytes of data, running processing on it, and shoving it back in. [19:57] that's just murdering this machine, and any attempts to slightly improve things is just tweaker hob-nob bullshit I have to time for. [19:58] wget *; megawarc convert *;shove all that shit back into archive.org * [19:58] yeah, ouch [19:58] unless you've got some crazy ass raid array for storage, thats gonna take awhile for any machine [19:58] then theres the "you want to work on HOW MUCH DATA AT ONCE?!" issue.... [19:59] Not an issue to me. [19:59] Just Solve The Problem and the DEFCON documentary are my two big things right now. [20:00] http://www.facebook.com/photo.php?fbid=10151130542264527&set=a.37023524526.46968.706829526&type=1&theater¬if_t=like [21:26] i hate to ask again since i did once before but i still dont really understand what just solve the problem is [21:26] last time i learned it basically needed programmers [21:26] so im sort of out of being any help [21:26] but i read the wiki and just dont understand what its about [21:26] not sure if im retarded [21:28] i guess the better question is, what needs to be done. and how can people who arent master programmers help [21:28] Go to #justsolve [22:35] so, I saw this, and thought you guys might like it: http://davidhunt.ie/wp/?p=232 (What to do when gigabit ethernet just isn't cutting it anymore- a cheap solution actually) [22:37] Infiniband at home!