Enumeration
unibp
Binary properties.
enum unibp {
...
}
Constants 🔗
UNI_NONCHARACTER_CODE_POINT | The |
UNI_ALPHABETIC | The |
UNI_LOWERCASE | The |
UNI_UPPERCASE | The |
UNI_HEX_DIGIT | The |
UNI_WHITE_SPACE | The |
UNI_MATH | The |
UNI_DASH | The |
UNI_DIACRITIC | The |
UNI_EXTENDER | The |
UNI_IDEOGRAPHIC | The |
UNI_QUOTATION_MARK | The |
UNI_UNIFIED_IDEOGRAPH | The |
UNI_TERMINAL_PUNCTUATION | The |
Discussion 🔗
Unicorn supports a small subset of the binary character properties defined by the Unicode Standard. The binary properties supported are those that are useful when parsing plain text.
Most binary characters properties defined by the standard are only applicable in specific applications, i.e. text shaping or rendering. Other properties are informational, for example, a character’s name or the version it was introduced into the Unicode Standard. The remaining properties are relevant only when implementing various Unicode algorithms and are not “general” enough to expose.