Subj : Re: Character codes To : Maurice Kinal From : Holger Granholm Date : Sun Mar 10 2019 16:15:00 In a message on 03-05-19 Maurice Kinal said to Holger Granholm: Hello Maurice, Excuse the delay. I was in Stockholm, Sweden for the Boat Show. HG> OK, the code 218 128 162 that i interpreted as hyphen actually HG> is the longer 'dash'. MK> I am not sure what you mean but using 218 (DA) as the leading byte MK> means you are restricted to a 2 byte or 16 bit character and not a MK> 24 bit character that is required for euro sign in utf8. The way MK> the leading byte works is like this; I understand, but this is how the UTF codes are represented in PC8, = 8bit ASCII, and I have come to the conclusion that I will, at least try, to use only the two following bytes in the translation table. That may be all that is needed but if not, I can always include the leading byte. Kind of cut and try . MK> The first zero shows that there are two leading ones which means MK> there is only one trailing byte following. MK> So that means either 218 128 and 162 is ignored. MK> For the utf8 euro character the prefix is; MK> dec 226 = bin 11100010 MK> ^ According to my interpretation of how the chracter is presented in PC8 it's as 218 130 172. All normal umlaut characters are presented with only two bytes, like 195 165 for the small angstrom character that is included in your "Moose" tagline. MK> and as you can see the first zero yields three leading ones which is MK> three bytes or 24 bits. I don't need more than 16-bit characters for that editor. UTF characters ARE presented with two bytes in it. MK> For the record 218 128 is U+0680 which we already know to be a 16 MK> bit Arabic character. That third byte (first 218 or 226) comes only as a prefix for other characters. MK> Thank you. Buenas noches mi amigo. :-) Gracias mi amigo. Have a good night, Holger ... Computers always win because they have inside information ;o) -- MR/2 2.30 --- PCBoard (R) v15.22 (OS/2) 2 * Origin: Coming to you from the Sunny Aland Islands. (2:20/228) .