Unicode encodings (was Re: [TxMt] Tidy and XML)

Chris Thomas chris at cjack.com
Fri Dec 2 14:49:59 UTC 2005


One more and I'm done. In reference to strings written in East Asian
languages (most of whose characters take three bytes each when encoded
in UTF-8, versus two in UTF-16), there's a good point made in passing here:

  http://jroller.com/page/bloritsch?entry=obsessed_with_speed

> Not to mention UTF16 is vastly simpler to work with when you have  
> to manipulate the strings than UTF8.
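
For anyone who wants to check the size difference, here is a minimal sketch in Python. The helper name encoded_sizes and the sample strings are my own, not from the linked post. It only counts encoded bytes and says nothing about speed. Note that characters outside the Basic Multilingual Plane take four bytes in both encodings.

```python
def encoded_sizes(text):
    """Return (UTF-8 byte count, UTF-16 byte count) for text.

    Hypothetical helper for illustration. "utf-16-le" is used so that
    no byte-order mark is added to the UTF-16 count.
    """
    return len(text.encode("utf-8")), len(text.encode("utf-16-le"))


# Three Japanese characters, all in the Basic Multilingual Plane:
# three bytes each in UTF-8, two bytes each in UTF-16.
print(encoded_sizes("日本語"))  # (9, 6)

# A character outside the BMP: four bytes in UTF-8, and a surrogate
# pair (also four bytes) in UTF-16.
print(encoded_sizes("𠮷"))  # (4, 4)

# Plain ASCII goes the other way: one byte each in UTF-8, two in UTF-16.
print(encoded_sizes("abc"))  # (3, 6)
```

As long as the text stays within the BMP, every UTF-16 code unit is one character, which is presumably what the quoted remark about easier manipulation refers to.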

Chris



