UNIC — Unicode Text Segmentation Algorithms
This UNIC component implements algorithms from Unicode® Standard Annex #29 - Unicode Text Segmentation, used for detecting boundaries of text element boundaries, such as user-perceived characters (a.k.a. Grapheme Clusters), Words, and Sentences.
Notes
Initial code for this component is based on
unicode-segmentation
.