UTF-16 is about the most efficient way possible of representing Asian character strings [...]
I would have put my money on bzip2 for “the most efficient way possible” if the parameter was size :p
google for "criscione 100 bytes". I place my bets on this method ^_^