Re: [Boost-users] find japanese character with boost regex++

12 Dec 2003

      ...
Actually I have a text with a lot of strange characters and Japanese
ones (Hiragana, Katakana, Kanji, everything!) and I want to find these
Japanese sentences in order to translate them and replace them in the text.
Hence I need a way to identify a Japanese sentence. A function along the
lines of const bool isJap( const wchar ) const would be fine.

Do you need to use regexes? I've not tried Boost.Regex yet, so I cannot help there.

Is your text just ASCII and Japanese? Or do you need to distinguish it from
other languages as well?

If it is just ASCII and Japanese, you could define a Japanese char as anything
that is not ASCII (beware the Shift-JIS encoding though, as the 2nd byte of a
double-byte character can fall in the ASCII range). If your data is Unicode it
should also be easy to treat European characters as non-Japanese.
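
The two rules above could be sketched roughly as follows, assuming the text
has already been decoded to Unicode code points. The names is_non_ascii,
is_japanese and japanese_runs are hypothetical helpers, not part of Boost,
and char32_t is used instead of the wchar_t from the question because
wchar_t is only 16 bits wide on some platforms. The ranges are the Unicode
blocks for Hiragana, Katakana, CJK Unified Ideographs and halfwidth Katakana;
this is a sketch, not a complete definition of "Japanese".

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Simplest rule: anything outside 7-bit ASCII counts as Japanese.
// Only meaningful on decoded code points, not on raw Shift-JIS bytes.
constexpr bool is_non_ascii(char32_t c) { return c > 0x7F; }

// Stricter rule for Unicode data: test against the main Japanese blocks,
// so that European letters such as U+00E9 are rejected.
constexpr bool is_japanese(char32_t c) {
    return (c >= 0x3040 && c <= 0x309F)   // Hiragana
        || (c >= 0x30A0 && c <= 0x30FF)   // Katakana
        || (c >= 0x4E00 && c <= 0x9FFF)   // CJK Unified Ideographs (Kanji)
        || (c >= 0xFF66 && c <= 0xFF9F);  // Halfwidth Katakana
}

// Return [begin, end) index pairs of maximal runs of Japanese code points.
std::vector<std::pair<std::size_t, std::size_t>>
japanese_runs(const std::u32string& text) {
    std::vector<std::pair<std::size_t, std::size_t>> runs;
    std::size_t i = 0;
    while (i < text.size()) {
        if (!is_japanese(text[i])) { ++i; continue; }
        const std::size_t start = i;
        while (i < text.size() && is_japanese(text[i])) ++i;
        runs.emplace_back(start, i);
    }
    return runs;
}
```

Two caveats: the Kanji block is shared with Chinese, so this cannot tell the
two languages apart, and Japanese punctuation (U+3000 to U+303F) is left out,
so a sentence containing it would be split into several runs unless that
range is added.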

Darren

Darren Cook