Hi,
This means that a lot of existing software doesn't need to be updated to actually handle UTF-8 (as long as they are 8 bit clean). For example I use UTF-8 for my source code, even though my compiler isn't UTF-8 aware
Just a thought; what happens when/if a unicode character is coded as <something><the code for "> in a string? I suppose C will fail in that case?
It's not possible. When you get a byte part of an UTF-8 sequence, you can know if its position in the sequence by checking the value of a few bits. An ASCII character (its value is less than 127) can only be alone.
Cheers, Vincent ISAMBART