
9 Oct
2015
9 Oct
'15
8:40 p.m.
Artyom Beilis wrote:
Codepoints above 10FFFF is like lets assume Pi=3.15..
No, sorry. This is not at all the same. The reason we're in this mess is precisely because codepoints above 0xFFFF were like pi=3.15. And then it turned out they weren't.
- probably more because Unicode is hard
Unicode isn't hard - it is just treated with ignorance by even big organization not talking about average programmers.
What I meant by that is for instance - is 0xCC 0x81 a valid UTF-8 string? - is 0x65 0xCC 0x81 0xCC 0x81 a valid UTF-8 string?