[00:03] *** xk_id_ has quit IRC (Remote host closed the connection) [00:04] *** xk_id has joined #archiveteam [00:05] *** n00b742 has left [00:10] *** MMovie has joined #archiveteam [00:13] *** MMovie1 has quit IRC (Read error: Operation timed out) [00:21] *** wyatt8740 has quit IRC (Remote host closed the connection) [00:22] *** wyatt8740 has joined #archiveteam [00:36] *** vitzli has joined #archiveteam [00:37] *** xk_id has quit IRC (Read error: Operation timed out) [00:41] *** superkuh has joined #archiveteam [00:45] *** yakfish has quit IRC (Read error: Operation timed out) [00:45] *** yakfish has joined #archiveteam [01:11] *** yakfish has quit IRC (Read error: Operation timed out) [01:15] *** primus104 has quit IRC (Leaving.) [01:21] *** toad2 has joined #archiveteam [01:23] *** toad1 has quit IRC (Read error: Operation timed out) [01:47] *** JesseW has joined #archiveteam [02:13] *** yakfish has joined #archiveteam [02:35] *** dan- has quit IRC (Ping timeout: 483 seconds) [02:36] *** JesseW has quit IRC (Read error: Operation timed out) [02:39] *** dan- has joined #archiveteam [02:39] *** JesseW has joined #archiveteam [03:30] *** robink has quit IRC (Ping timeout: 492 seconds) [03:56] *** robink has joined #archiveteam [04:06] *** robink has quit IRC (Ping timeout: 492 seconds) [04:28] *** robink has joined #archiveteam [04:33] *** aschmitz has quit IRC (Ping timeout: 258 seconds) [04:40] *** robink has quit IRC (Ping timeout: 492 seconds) [04:46] *** aaaaaaaaa has quit IRC (Leaving) [04:47] *** robink has joined #archiveteam [04:50] *** Asparagir has joined #archiveteam [04:52] Guess who just became the first genealogist to use Freedom of Information laws to try to force the release of public vital records back into the public domain. [04:53] http://www.reclaimtherecords.org/ [04:53] Case filed in NY court yesterday against the NYC Department of Records and Information Systems. [04:55] This is vaguely relevant to ArchiveTeam because I win, those files are all going up on the Internet Archive. [04:55] *If I win... [04:56] Asparagir: neat. good luck [04:57] *** robink has quit IRC (Ping timeout: 492 seconds) [04:58] signed up for your mailing list [04:59] Thanks. [05:03] *** robink has joined #archiveteam [05:07] *** vitzli has quit IRC (Quit: Leaving) [05:10] Asparagir: That's fascinating, I guess it never occurred to me that those records *wouldn't* be available, but I guess someone has to get them up somewhere. Very cool. All the genealogy nuts in my family are in Canada but I'll put this project on their radar at least for information's sake. [05:11] Thanks! Yeah, NYC and to a lesser extent NY State (minus NYC) are both surprisingly bad at putting data out there for the public to use. [05:11] Other states are way, way better at making data available. [05:13] NY State has made the 1957-1965 death index available as open data on the state Socrata portal. But other years? You can only see them in microfilm in like eight state libraries. Nowhere else. Crazy. [05:13] Never mind births, marriages... [05:14] What sort of access do you have there? I've never worked with microfilm; could you somehow copy it while you're there? Would that be "allowed" if it were technically feasible? [05:14] So, Freedom of Information requests to the rescue (maybe). [05:14] NY state doesn't allow it! [05:14] *** robink has quit IRC (Read error: Connection reset by peer) [05:15] Oh interesting. Lawyer up, I guess! [05:15] Oh I have. Google "Rankin and Taylor". [05:15] Asparagir: is a copy of your legal complaint available? [05:15] indeed :) [05:15] Best known for forcing even the NYPD to answer FOIL. [05:16] Right now my Petition is a very basic one pager. We don't get serious until the city responds. [05:17] I got the NY State Committee on Open Government to write an "Advisory Opinion" on my case, agreeing with me! It's not legally binding but should help a lot. See the PDF on the bottom of the site. [05:18] More data will get posted to site as the case goes on. [05:20] * myself reads about FOSDIC, which is how microfilm data was scanned for the census in the the 1960s... [05:20] Nerd. [05:20] :-) [05:20] where?! [05:20] Hee. [05:21] Have other government agencies complied with similar requests for microfilm records? If so, what format do they deliver the response in? [05:22] For other agencies, yes. If the original records are in format X and the FOIL requester is willing to pay for copies and here are no glaring privacy issues, requester can also get Format X. [05:22] X includes microfilm. [05:23] You cannot force the government to make you a new format, though. You get what they use. [05:23] Which is fine. Microfilm is neat. [05:23] And the IA at Funston Street has, I hear, a microfilm scanner. :-) [05:24] So they also might not have this data in any other format? They don't use it for anything? [05:24] *** robink has joined #archiveteam [05:24] Yup. In fact, in the case I am specifically filing in NYC, they definitely don't have anything else. Just microfilm for this old stuff. [05:24] They do their daily work with microfilm. [05:25] Thug Lyfe. [05:26] You'd think they'd be scrambling to convert before all the people who know how to use it retire/die. The telephone industry has been hair-on-fire replacing old switching equipment because there's nobody left who can maintain it. [05:26] Yes, and no one makes new microfilm film anymore. Like Polaroids, kinda. [05:28] they should be begging you to do their scanning for them! [05:28] Microfilm-only records terrify the heck out of me. [05:28] Of course, it's better than CDR-only records... ;-/ [05:29] or the ultimate horror, nitracellulose film-only records -- "hi, would you like a f**king BOMB to store your data on? No? Too bad!" [05:33] For further terror, read: https://en.wikipedia.org/wiki/Nitrocellulose#Nitrate_film [05:37] That website just answered a question I had two weeks ago about the safety of taking 16 mm film (old [05:37] home movies) on an airplane. [05:38] Seems like film made for home use was the safe kind after all. [05:38] So, thanks! [05:39] "Eight, 9.5, and 16 mm film stocks, intended for amateur and other nontheatrical use, were never manufactured with a nitrate base in the west, but rumors exist of 16 mm nitrate film having been produced in the former Soviet Union and/or China." [05:40] glad to help [05:48] *** Asparagir has quit IRC (Ping timeout: 252 seconds) [06:35] *** PurpleSym has joined #archiveteam [06:39] *** JesseW has quit IRC (Read error: Operation timed out) [06:48] *** scyther has joined #archiveteam [06:55] *** Start has joined #archiveteam [07:07] *** JesseW has joined #archiveteam [07:17] botpie91: tell Asparagir -- your physical address is listed when anyone confirms signing up for the newsletter. You may want to switch that to a PO box or something, if possible. [07:17] JesseW: I'll pass that on when Asparagir is around. [07:17] botpie91: tell Asparagir -- the Reclaim the Records newsletter, I mean. [07:17] JesseW: I'll pass that on when Asparagir is around. [07:35] *** Stiletto has quit IRC (Read error: Operation timed out) [07:51] *** wyatt8740 has quit IRC (Remote host closed the connection) [07:53] *** JesseW has quit IRC (Read error: Operation timed out) [07:54] *** vitzli has joined #archiveteam [07:54] *** wyatt8740 has joined #archiveteam [08:25] *** vitzli has quit IRC (Quit: Leaving) [08:54] *** primus104 has joined #archiveteam [09:11] *** BlueMaxim has quit IRC (Quit: Leaving) [09:33] *** primus104 has quit IRC (Leaving.) [09:35] *** Start has quit IRC (Read error: Connection reset by peer) [09:37] *** Start has joined #archiveteam [10:08] *** primus104 has joined #archiveteam [10:20] *** vitzli has joined #archiveteam [10:22] *** VADemon has joined #archiveteam [10:57] *** scyther has quit IRC (Leaving) [11:35] *** schbirid has joined #archiveteam [12:21] *** primus104 has quit IRC (Leaving.) [12:22] *** Froggypwn has quit IRC (Read error: Operation timed out) [12:22] *** Froggypwn has joined #archiveteam [12:30] *** Emcy has quit IRC (Ping timeout: 306 seconds) [13:32] *** nertzy has joined #archiveteam [13:37] *** Start has quit IRC (Read error: Connection reset by peer) [13:38] *** Start has joined #archiveteam [13:38] *** boozehoun has quit IRC (Read error: Connection reset by peer) [13:38] *** boozehoun has joined #archiveteam [13:43] *** Emcy has joined #archiveteam [14:21] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [14:59] *** JesseW has joined #archiveteam [15:17] *** habi has joined #archiveteam [15:18] *** habi has left [15:25] *** habi1 has joined #archiveteam [15:29] *** habi1 has left [15:47] *** primus104 has joined #archiveteam [15:54] *** xk_id has joined #archiveteam [15:56] *** primus104 has quit IRC (Leaving.) [16:01] *** irl1 has joined #archiveteam [16:02] so i found a large collection of manuals [16:02] is anyone interested in manuals? [16:02] looks to be solaris manuals and manuals for crazy old networking software and hardware [16:03] i have literally no idea where to start with these, they're all in boxes and i've asked the uni library and we have no resources for digitising them easily available [16:04] ideally i would like these gone from my store room soon, but i can be flexible if there's a chance we can archive these digitally [16:04] Yes we get tons of manuals [16:05] the solaris ones i'm guessing aren't too interesting as they'll be pretty common maybe [16:05] but for some of the older x.25 hardware/software [16:05] Ah paper ones. Hmm dunno. [16:05] those things won't be so common [16:06] there are also some 8" floppies, although i guess the data on those is long gone [16:08] http://bitsavers.trailing-edge.com/ has a very nice collection [16:09] *** atomotic has joined #archiveteam [16:10] do you reckon it's safe to put ~25 year old paper through an ADF? [16:10] some of these manuals are just A4 in a binder [16:10] i could probably digitise those easily enough [16:15] depends on paper weight, but I don't know how binding holes would behave in ADF [16:16] vitzli: binding holes are typically not affected, most ADFs have rollers far enough towards the middle that they don't go over the holes [16:16] check your scanner model though [16:16] irl1: how solid is the paper? [16:16] like, solid enough that you could hold all 4 corners in place, and put something like a can of soda on it, without it tearing? [16:16] it's quite thin [16:16] oh yes [16:16] you could do that [16:17] then you're *probably* fine for ADF, as long as a) your rollers don't go over the holes, and b) the paper weight is less than the max of your ADF [16:17] :P [16:17] irl1: try it with a duplicate first if you can [16:17] there are a few blank sheets in here [16:17] so i'll try on one of those [16:18] yeah, that works [16:18] * irl1 -> scanning machine [16:18] * joepie91 watches the ADF turn into a fireball [16:19] Paper weight is definitely an issue. Few producers give reliable info on the matter though, last time I checked [16:30] irl1: I assume you've heard about SketchCow's ArchiveCorps mailing list? [16:30] http://archivecorps.org/ [16:32] irl1: If you have time to type up basic cataloging info on them first, post that, then other people can check if they're already digitalized and available somewhere. [16:36] *** Stiletto has joined #archiveteam [16:38] *** vitzli has quit IRC (Quit: Leaving) [17:02] *** JesseW has quit IRC (Read error: Operation timed out) [17:13] *** atomotic has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) [17:31] *** SimpBrain has quit IRC (Ping timeout: 258 seconds) [17:33] *** db48x has quit IRC (Ping timeout: 258 seconds) [17:36] *** SimpBrain has joined #archiveteam [17:37] *** yuvadm has quit IRC (Ping timeout: 258 seconds) [17:37] *** yuvadm has joined #archiveteam [17:38] ah cool, i'll take a look at doing that [17:38] *** Start has quit IRC (Read error: Connection reset by peer) [17:38] i've digitised one just to see how well it works [17:38] the ADF worked fine except for the first two pages which were curled a bit [17:39] once the scanner emails whatever it got to me, i'll take a look and see if it's worth using that process again for more or if i need to find another way [17:39] *** Start has joined #archiveteam [17:45] i've also scanned a bunch of ring-binder documents [17:46] sometimes i get much better feed results with the spine edge going in the machine last, so it grabs pages from their facing edge [17:49] *** aaaaaaaaa has joined #archiveteam [17:49] *** swebb sets mode: +o aaaaaaaaa [17:54] gaah [17:54] it's got the page size wrong and scanned only the bottom half of the pages [17:54] for all 500 pages [17:54] D: [17:55] i have discovered that i can get the scanner to upload to an ftp server though which is nice [17:55] and means next time i can do the whole lot in one go instead of splitting into email acceptable sizes [17:55] if i can get it to do the paper size right [17:56] *** xk_id has quit IRC (Read error: Operation timed out) [17:57] *** khaoohs has joined #archiveteam [17:59] *** khaoohs_ has quit IRC (Read error: Operation timed out) [17:59] *** xk_id has joined #archiveteam [18:02] ok, i will put these manuals aside and hopefully try again next weekend [18:02] * irl1 needs to eat a food [18:02] if anyone is around the aberdeen, scotland area next weekend and would like to help out that would be great [18:15] *** primus104 has joined #archiveteam [18:16] *** atomotic has joined #archiveteam [18:19] *** xk_id has quit IRC (Remote host closed the connection) [18:20] *** xk_id has joined #archiveteam [18:22] *** xk_id_ has joined #archiveteam [18:22] *** xk_id has quit IRC (Read error: Connection reset by peer) [18:23] i wish [18:23] *** xk_id_ has quit IRC (Read error: Connection reset by peer) [18:23] *** xk_id has joined #archiveteam [18:24] *** xk_id has quit IRC (Remote host closed the connection) [18:26] *** xk_id has joined #archiveteam [18:28] *** xk_id_ has joined #archiveteam [18:28] *** xk_id has quit IRC (Read error: Connection reset by peer) [18:28] *** xk_id_ has quit IRC (Remote host closed the connection) [18:43] *** xk_id has joined #archiveteam [18:54] *** arkiver_ has joined #archiveteam [18:55] *** kyan has joined #archiveteam [18:57] Hi :) Got an email that Docstoc (http://www.docstoc.com/) is closing down. It's a website that hosts a lot of user-uploaded documents (a la Scribd). I'd like to see it archived... "visit the site no later than November 30th, 2015 to retrieve copies of any documents you previously uploaded to your account" [18:57] kyan: 01 Sep 20:53Z tell kyan that his unicode job probably got banned again [18:58] aaaaaaaaa, oh darn. Thanks :) [18:58] i thought i had it slow enough, wow [19:07] *** arkiver_ has quit IRC (Ping timeout: 483 seconds) [19:13] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [19:37] *** SimpBrai1 has joined #archiveteam [19:41] *** SimpBrain has quit IRC (Ping timeout: 306 seconds) [19:43] *** SimpBrai1 has quit IRC (Ping timeout: 306 seconds) [20:02] docs appear to be sequential: http://www.docstoc.com/docs/173891525/ [20:02] http://www.docstoc.com/docs/173891524/ [20:03] we'll likely have to do a discovery project for this [20:03] *** Start_ has joined #archiveteam [20:03] *** Start has quit IRC (Read error: Connection reset by peer) [20:05] first page has the id number [20:05] then you get urls like this: http://d34s6f1vnlnqxq.cloudfront.net/31003386/3f321554-93cb-4674-b9c8-8f3202f1890c/norm/138554408_2.png [20:12] *** ripvanwin has quit IRC (Remote host closed the connection) [20:17] *** boozehoun has quit IRC (Read error: Connection reset by peer) [20:23] *** zenguy_pc has joined #archiveteam [20:29] ArchiveCorps is doing a-ok. [20:29] Now I can get back to The Rest Of All My Shit [20:35] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [20:35] *** zenguy_pc has joined #archiveteam [20:56] *** SimpBrain has joined #archiveteam [21:01] *** PurpleSym has quit IRC (Remote host closed the connection) [21:12] *** aaaaaaaa_ has joined #archiveteam [21:12] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer) [21:12] *** swebb sets mode: +o aaaaaaaa_ [21:13] *** aaaaaaaa_ is now known as aaaaaaaaa [21:39] *** Start_ has quit IRC (Read error: Connection reset by peer) [21:40] *** Start has joined #archiveteam [21:47] *** Start has quit IRC (Read error: Connection reset by peer) [21:47] *** Start has joined #archiveteam [21:49] kyan: holy fuck. docstoc is shutting down? [21:49] that's really bad news [21:49] like [21:49] REALLY REALLY REALLY bad news [21:49] because they are nazis about full document access [21:49] and have a ton of unique shit [21:50] that's what the email i got said [21:50] assuming its' not a troll or something [21:50] but i would assume it's real, since i do have a docstoc account [21:50] http://the-digital-reader.com/2015/09/03/docstoc-is-shutting-down/ [21:51] yeah [21:51] frntpgae confirms [21:51] frontpage* [22:02] *** Start_ has joined #archiveteam [22:07] *** Start has quit IRC (Ping timeout: 362 seconds) [22:11] *** arkiver2 has joined #archiveteam [22:17] *** Start_ is now known as Start [22:28] *** arkiver2 has quit IRC (Quit: Nettalk6 - www.ntalk.de) [22:28] *** arkiver2 has joined #archiveteam [22:32] kyan: What's your Unicode job about? Just wondering [22:32] bentpins: I'd like to have a complete record of how the Unicode standard was developed safely archived. [22:35] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [22:35] kyan: Oh awesome, I'm interested. Is there some curation involved or is it a copy of the Unicode Consortium site? [22:36] *** SimpBrain has quit IRC (Read error: Connection reset by peer) [22:36] just a WARC (Web ARChive) file archiving the useful parts of the site. Unfortunately it's hard to crawl due to the large amounts of un-needed URLs on the site making them ban the crawler :P [22:37] *** arkiver2 has joined #archiveteam [22:38] Also Long Access is shutting down: http://the-digital-reader.com/2015/07/16/digital-archival-site-longacess-shuts-down-youll-need-to-find-a-new-way-to-gift-your-ebook-collection-to-your-heirs/ [22:38] Ah I can imagine. Definately worth preserving, thank you [22:43] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [22:45] *** db48x` has joined #archiveteam [22:58] saw jason's tweet about docstoc - is there a mass download effort being organized? [22:59] I have a couple dozen terabytes I can put on a gigabit connection for a few weeks [23:05] Well, we JUST learned about it a few hours ago. [23:10] So we have two months for DocStoc [23:10] Will do my best to have the scripts ready soon. [23:57] *** brayden_ has joined #archiveteam [23:57] *** swebb sets mode: +o brayden_ [23:57] *** brayden has quit IRC (Read error: Connection reset by peer) [23:57] *** kyan has quit IRC (Quit: This computer has gone to sleep) [23:58] *** kyan has joined #archiveteam