On Mar 25, 2008, at 7:42 AM, Robin Houston wrote:
On 25/03/2008, Adam R. Maxwell amaxwell@mac.com wrote:
[...] BibDesk uses Greg Ward's libbtparse [...] Nelson Beebe's TUGBoat article on bibclean is also quite helpful,
That's amazingly useful information. Thanks!
I'll look into both of these. Perhaps it will turn out that the best solution is to write a Ruby binding for btparse.
I'd definitely recommend that route if you can; it's a pretty good parser. Beware that >8 bit encodings aren't supported, though (and I think there's a Japanese encoding that fails as well). We work around that by converting problematic data to UTF-8 internally.