Hello everyone, Since I regularly work with Unicode text, I have started to design a Unicode library for handling UTF 8,16 and 32. The current support in C++ is still limited. Mainly I'm trying to implement a STL like support for file reading/writing, I/O operations and memory storage. So a user could read/write and process Unicode text like an ASCII string without knowing how the text is encoded. I thought people might have use for it. Even though libraries like ICU exist, they aren't beginner friendly and I couldn't find one with decent support for I/O. Is there demand for this? And if so do you guys have any suggestions? Thank you. - Onur
Hi Can,
On Wed, Feb 27, 2019 at 11:22 AM Can Topal via Boost
Mainly I'm trying to implement a STL like support for file reading/writing, I/O operations and memory storage. So a user could read/write and process Unicode text like an ASCII string without knowing how the text is encoded.
To my best knowledge std::string in conjunction with UTF-8 does let you do exactly that -- process Unicode text like an ASCII string without caring for the details. To my best knowledge this covers 95% of use cases of strings. For the rest a fully fledged Unicode library is usually required, whether it's for Unicode normalization, collation, or text rendering. True -- it would be helpful to have good implementation of those on UTF-8 strings. However, wherever you take your design, take care not to introduce yet another string class. You probably shouldn't care where the bytes are stored. You can also look at Boost.Locale and Boost.NoWide. Cheers & Wish you GL,, -- Yakov Galka http://stannum.co.il/
On Wed, Feb 27, 2019 at 10:22 AM Can Topal via Boost
Hello everyone,
Since I regularly work with Unicode text, I have started to design a Unicode library for handling UTF 8,16 and 32. The current support in C++ is still limited.
Mainly I'm trying to implement a STL like support for file reading/writing, I/O operations and memory storage. So a user could read/write and process Unicode text like an ASCII string without knowing how the text is encoded.
I thought people might have use for it. Even though libraries like ICU exist, they aren't beginner friendly and I couldn't find one with decent support for I/O.
Is there demand for this? And if so do you guys have any suggestions?
I've been working on a proposed Boost library called Boost.Text for some time now. It's not yet ready for review, but I posted about initial interest a while back. You can find it here: https://github.com/tzlaine/text Zach
On Wed, 27 Feb 2019 at 19:22, Zach Laine via Boost
I've been working on a proposed Boost library called Boost.Text for some time now. It's not yet ready for review, but I posted about initial interest a while back. You can find it here:
Any ETA?, coz that looks impressive (and massive). degski -- *"Big boys don't cry" - **Eric Stewart, Graham Gouldman*
participants (4)
-
Can Topal
-
degski
-
Yakov Galka
-
Zach Laine