Enumeration
unibreak
Detectable text elements.
Since v1.0
enum unibreak {
...
}
Constants 🔗
UNI_GRAPHEME | Extended grapheme cluster breaks. |
UNI_WORD | Word breaks. |
UNI_SENTENCE | Sentence break. |
Discussion 🔗
The elements of this enumeration describe the boundaries that can be detected. Conceptually, a break represents the space in between two code points.
The following illustration demonstrates a string with 7 possible break positions.
If the string is segmented into grapheme clusters, then it will break at offsets 0, 2, 3, 5, 6, and 7.