2 Jul
2004
2 Jul
'04
7:12 a.m.
David Abrahams wrote:
As an example, there are two environment with a single string type: Qt and Java, and in both there's no issue of Unicode any more, AFAICT.
Har!
Java "unicode" is utf-16, I think. Unicode now has at least 32 bits per character, IIUC, so I don't think any simplistic interface choices can make a non-issue of Unicode.
Huh, the utf-16 is 16-bit *encoding* for 32-unicode, it's not 16-bit unicode. There are so called surrogate pairs which allows to represent 32-bit values. According to http://java.sun.com/j2se/1.5.0/docs/guide/intl/enhancements.html and http://java.sun.com/developer/technicalArticles/Intl/Supplementary/ this is not what Java did in 1.4, but with 1.5 release it really supports 32-bit encoding. - Volodya