I'm no linguist, but I suspect the question is easier to answer if you first define the scope of the messages you might want to display. ----- Original Message ----- From: "Harold Hallikainen" To: "Pic List" Sent: Monday, July 02, 2007 2:41 PM Subject: [PIC] Unicode > I'm working on a project where I have to display Unicode using a PIC > driving a graphic display. I'm using Unifont > (http://czyborra.com/unifont/) for the character bitmaps. However, there > are THOUSANDS of bitmaps, more than will fit in flash in my 24hj256gp610. > I'm particularly interested in covering the Americas, Europe, Japan, and > China. So, I clearly have to include Unicode 0x0000 through 0x007f > (ASCII), plus some of the supplemental and extended characters through > 0x024f. The trick is what to cover for Chinese. > > Unifont defines the following Unicode code ranges: > > 128 U+0000..U+007F:Basic Latin > 128 U+0080..U+00FF:Latin-1 Supplement > 128 U+0100..U+017F:Latin Extended-A > 156 U+0180..U+024F:Latin Extended-B > 89 U+0250..U+02AF:IPA Extensions > 57 U+02B0..U+02FF:Spacing Modifier Letters > 72 U+0300..U+036F:Combining Diacritical Marks > 105 U+0370..U+03FF:Greek > 230 U+0400..U+04FF:Cyrillic > 85 U+0530..U+058F:Armenian > 82 U+0590..U+05FF:Hebrew > 62 U+0600..U+06FF:Arabic > 87 U+0E00..U+0E7F:Thai > 65 U+0E80..U+0EFF:Lao > 40 U+10A0..U+10FF:Georgian > 348 U+1200..U+137F:Ethiopic > 246 U+1E00..U+1EFF:Latin Extended Additional > 233 U+1F00..U+1FFF:Greek Extended > 77 U+2000..U+206F:General Punctuation > 28 U+2070..U+209F:Superscripts and Subscripts > 14 U+20A0..U+20CF:Currency Symbols > 20 U+20D0..U+20FF:Combining Marks for Symbols > 57 U+2100..U+214F:Letterlike Symbols > 48 U+2150..U+218F:Number Forms > 91 U+2190..U+21FF:Arrows > 242 U+2200..U+22FF:Mathematical Operators > 123 U+2300..U+23FF:Miscellaneous Technical > 37 U+2400..U+243F:Control Pictures > 11 U+2440..U+245F:Optical Character Recognition > 139 U+2460..U+24FF:Enclosed Alphanumerics > 128 U+2500..U+257F:Box Drawing > 22 U+2580..U+259F:Block Elements > 80 U+25A0..U+25FF:Geometric Shapes > 106 U+2600..U+26FF:Miscellaneous Symbols > 73 U+2700..U+27BF:Dingbats > 256 U+2800..U+28FF:Braille Pattern Symbols > 35 U+3000..U+303F:CJK Symbols and Punctuation > 87 U+3040..U+309F:Hiragana > 90 U+30A0..U+30FF:Katakana > 37 U+3100..U+312F:Bopomofo > 94 U+3130..U+318F:Hangul Compatibility Jamo > 69 U+3200..U+32FF:Enclosed CJK Letters and Months > 84 U+3300..U+33FF:CJK Compatibility > 18174 U+4E00..U+9FFF:CJK Unified Ideographs > 11172 U+AC00..U+D7A3:Hangul Syllables > 270 U+F900..U+FAFF:CJK Compatibility Ideographs > 57 U+FB00..U+FB4F:Alphabetic Presentation Forms > 12 U+FB50..U+FDFF:Arabic Presentation Forms-A > 4 U+FE20..U+FE2F:Combining Half Marks > 28 U+FE30..U+FE4F:CJK Compatibility Forms > 26 U+FE50..U+FE6F:Small Form Variants > 140 U+FE70..U+FEFF:Arabic Presentation Forms-B > 171 U+FF00..U+FFEF:Halfwidth and Fullwidth Forms > 2 U+FFF0..U+FFFF:Specials > > > Just covering CJK Compatibility Ideographs uses more flash than I have. > So, ideas on what character ranges I should include, trying to fit this in > flash? > > THANKS! > > Harold > > > > -- > FCC Rules Updated Daily at http://www.hallikainen.com - Advertising > opportunities available! > -- > http://www.piclist.com PIC/SX FAQ & list archive > View/change your membership options at > http://mailman.mit.edu/mailman/listinfo/piclist -- http://www.piclist.com PIC/SX FAQ & list archive View/change your membership options at http://mailman.mit.edu/mailman/listinfo/piclist