[TxMt] Re: Soft-wraping text containing multi-byte characters

30 Jun 2015


      On 27 Jun 2015, at 6:33, Yoichiro Hasebe wrote:
...
I suspect TM2 treats a sequence of multi-byte characters as if it was
a single word. If that is the case, with text in a language like
Japanese, where word boundaries are not indicated by spaces, a whole
sentence or even a paragraph will be processed as just one huge word.
Correct, TextMate will need to learn about word boundaries for languages 
that do not use space characters, to do proper wrapping.
I see CFString has hyphenation API since 10.7, so this might be usable, 
but I will need to investigate this a bit further, also, wrapping is not 
the only place where word boundaries come up, so the “fix” would 
need to go beyond just wrapping (e.g. word movement and selection should 
also use linguistic word boundary definitions).

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

[TxMt] Re: Soft-wraping text containing multi-byte characters