[00:13] Anyone in here have experience supporting CJK? I am still missing some fonts in my screenshots [00:14] I am using sazanami-gothic-fonts sazanami-mincho-fonts but it is not providing enough character coverage [00:14] hmmm [00:14] you should install 'unifont' as a final backstop [00:15] no package for that on fedora [00:16] strange [00:16] well, keep looking, it's an attempt to have all the characters in at least a bitmap form [00:16] each time I pull new fonts in the rest got better [00:17] cool [00:23] cjkuni-ukai-fonts cjkuni-uming-fonts got me even further [01:08] omf_: I do [01:08] don't use unifont it's neat for what it is but it looks like ass compared to a real font and isn't updated [01:09] is wqy-unibit-fonts in fedora? [01:09] https://admin.fedoraproject.org/pkgdb/acls/name/wqy-unibit-fonts [01:10] yes balrog [01:10] do you have it? [01:10] that includes unifont [01:10] plus updates [01:11] This is what I am using now and they good sazanami-gothic-fonts sazanami-mincho-fonts cjkuni-ukai-fonts cjkuni-uming-fonts wqy-zenhei-fonts [01:12] as you probably noticed a font designed for japanese like sazanami will be missing a lot of chinese-only characters [01:13] yeah mutltiple fonts to cover the bases [01:14] if you get a japanese font, a simplified chinese, and a traditional chinese that should cover 99% of real world text [01:15] I don't recognize the chinese face names offhand to know what's simplified and what's traditional [01:15] if you want the wacky extended stuff no one uses there's http://vietunicode.sourceforge.net/fonts/fonts_hannom.html [01:16] extensions C and D exist now but even less than no one use them and I don't know of fonts offhand [01:16] I did it via trial and error [01:17] had pages that looked bad, searched around, installed a font, reloaded. If it looked better I moved on [01:17] Russia and Farsi were no trouble [01:17] farsi is harder than you might think [01:18] arab fonts do certain things wrong for farsi but it's an uphill battle and farsi readers are probably used to it by now [01:18] Considering the number of projects to have a full unicode font, it is frustrating [01:19] I mean projects to build a full unicode font [01:20] yeah there's unifont which is nice but bitmap and not updated, code2000 which looks like shite, freesans/freeserif/etc. which last I checked had various bugs and were completely unhinted [01:21] I think nowadays the thinking is to have good script-specific fonts rather than trying to cram everything into one huge font that does everything poorly and takes all your ram [01:21] It makes sense considering how big unicode is [01:21] ttf can't have more than 65,536 characters so you can't even fit it all in one font anyway [01:21] There should be a list somewhere of a "Cover all your bases font list" [01:23] there's http://unifont.org/fontguide/ but it's out of date and not really succinct enough for that purpose [01:24] I'll know by the end of this screenshot project [01:25] pages covered in blank squares are evil [01:25] SIL has the best quality stuff for minority languages http://scripts.sil.org/FontDownloads [01:30] I was thinking about installing every font on there but i know that would be problematic [01:37] yeah then you get stuff like fallback fonts and systems are not always good at assigning priority appropriately [01:38] right, when you install all the fonts everything that doesn't exactly specify what fonts it wants will look like crap [01:56] so, this might be of interest to some folks: http://blogs.adobe.com/typblography/2013/04/adobe-blank-redux.html a font that has All Unicode code points are covered & All code points are rendered using a non-spacing and non-marking glyph. [02:00] I've had it with all these "preservationists" who keep their dumping tools private [02:01] any particular incident that set you off? [02:01] see forums.bannister.org shoutbox [02:01] hopefully it didn't scroll off by now [02:03] probably did [02:03] this software called TruRip for ripping CDs [02:03] it's supposed to be the best but the group that made it absolutely refuses to release it [02:04] that's fascinating the level groups are going to now to ensure things are legit [02:04] If they cannot produce results they are full of shit [02:04] they produce rips [02:04] but won't provide their tools [02:05] I mean produce the program so others can verify it [02:05] this is sorta like SPS but SPS was forced to produce some stuff [02:05] isn't SPS the group that does Kyroflux? [02:05] think so [02:05] yeah... they at least have released something [02:06] though they still want people to send disk images to them for processing with their unreleased analysis software [02:07] well you can get it if you're an institution or business and have something like $3500/yr to spend [02:08] which also is a condition of using the KF hardware for commercial or institutional uses ... but this has already gone far enough OT [02:17] Thanks for the orgy of font information. I feel more knowledgeable already [02:19] if you're dumping floppies (in my case 3.5'' dos/windows-formatted ones), it's nice to be able to see what's on the images and get some basic info about them quicky- 7z can tell you about some images, and mtools is invaluable for working with DOS/Windows formatted floppies/images [02:48] so, I got a note from TiVo telling me they are pulling the downloads for TiVo desktop by June 5, 2013- I grabbed a copy of the software (it's free) and you may want to as well: https://www3.tivo.com/store/accessories-software.do [03:12] throw it on ia [03:17] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [03:17] wait "we rescue your shit" hahahahhahahahaha [03:18] This is why I never use IRC [03:18] hellllooooooooooooooooo [03:18] sterdgsrt: yahoosucks [03:18] chill out, not everyone is watching all the time [03:18] cool there's someone else alive [03:18] k im chill [03:18] lots of people at the moment, I would venture [03:19] k [03:20] el graciar chronomex [03:22] life tip: irc does not work like yelling into a room, you may have to leave the window open in the background for a few hours to get an answer depending on the channel [03:27] correct [04:19] Fire, fire everywhere [04:19] That font discussion could have stood a transition to -bs [04:23] maybe, maybe not [04:23] it was at least marginally on topic [04:29] The posterous screenshots no longer have bad glyphs in them [04:32] now this may explain spark having problems [04:33] the one of the interviews was last modifed around 2013-04-18 [04:33] this interview is from 2009 [07:35] alard: what's the simplest example of one of your crawlers to list URLs/IDs to grab? [07:36] (searching for some strings on Google or other search engine) [10:05] hello [11:09] hi jonas__ [14:15] https://archive.org/details/hotu_dos_2006 [15:05] DocCool.com, a forum for people cheating in relationships, has closed. Forums are still readable for the time being. [15:46] GRAB IT [15:46] JUST DON'T TELL THE WIFE [17:05] what about kids.yahoo.com? [17:21] http://archiveteam.org/index.php?title=Rescuing_Optical_Media in case anyone is interested in helping out [17:50] I have several CD's in flac + cue that I could send somewhere ... [17:50] andy0: upload erm [17:50] em* [17:52] how? & where? [18:00] archive.org [18:00] hit upload button [18:38] omf_: I've heard that FurAffinity has had similar internal political issues for what, the past 2 years? Not that I have an account there, as I don't [18:38] We should warriorize a backup of that [18:39] * Smiley wishes to learn how to do this process. [18:39] theres also that pouet site. which I'm grabbing a warc of... :/ [18:41] underscor: expect significant resistance [18:41] some people wrote a FA -> Weasyl migrator, they broke it by mucking with html tags [18:42] Oh, yes [18:42] I am *very* intimately familiar with the drama there, along with all the shit with yak [18:42] and [18:42] ugh [18:42] it all sucks [19:12] o_O [19:13] * Smiley continues to not understand that place [21:12] balrog cool, you mean help building that list, or actually ripping? [21:12] building for now [21:12] I'm working on ripping methods too [21:12] are you interested? [21:18] I'm interested in the issue of digital archiving (obviously). i don't know much about optical media in particular. [21:18] balrog: Hm, I could have sworn I had an AT wiki account. Anyway I'll try to find it when I get home and add something or two to the list after [21:18] ok [21:18] i do know that all of this rogue-archivist stuff is skirting the law in lots of areas. [21:18] yeah well [21:19] (bit-for-bit copies are scary to the rightsholders) [21:19] eh, it's been happening for years and years [21:19] libraries should be pushing for this stuff. most are clueless that this is the new library system. [21:19] In my experience analogue archivists also tend to break the law on a regular basis but aren't always aware they're doing it. [21:19] "rightsholders" usually have no leg to stand on [21:20] especially custodians of PD works [21:20] well anyway, a thing i like about archiveteam is that they aren't asking. [21:20] it's the right thing to do, so do it. [21:20] they/we/you/us, not sure the right collective noun to use :P [21:29] mistym: It might have been deleted after the spam clearout, by mistake [21:29] Good point, maybe I should just reregister. [21:29] Any wiki admins around and can check if mistym's account's still there? [21:29] hmmm [21:29] I had my rights removed after. [21:30] I get lost on the wiki every time [21:30] Smiley: Then don't bother answering that one... obviously [21:30] looking up archived content, reading endangered content, ... [21:30] yeah I realised after I'd said hmm :P [21:30] yan_: Anything specific that's confusing? Or do you mean some other kind of lost? :P [21:32] ersi: yes, the other kind :) [21:33] Ah, lost in time then ;-) [23:26] heh, nine inch nails uploaded the infamous "broken" video to vimeo, and almost immediately it is deleted [23:28] Ymgve: "broken"? [23:29] oh [23:29] http://en.wikipedia.org/wiki/Broken_(1993_film)