One more and I'm done. In reference to strings written in East Asian languages (which require three bytes for each character when encoded in UTF-8), there's a good point made in passing here:
http://jroller.com/page/bloritsch?entry=obsessed_with_speed
Not to mention UTF16 is vastly simpler to work with when you have to manipulate the strings than UTF8.
Chris