Enumeration
unibp
Binary properties.
enum unibp {
...
}
Constants 🔗
UNI_NONCHARACTER_CODE_POINT | The |
UNI_ALPHABETIC | The |
UNI_LOWERCASE | The |
UNI_UPPERCASE | The |
UNI_HEX_DIGIT | The |
UNI_WHITE_SPACE | The White_Space character property. |
UNI_MATH | The |
UNI_DASH | The |
UNI_DIACRITIC | The |
UNI_EXTENDER | The |
UNI_IDEOGRAPHIC | The |
UNI_QUOTATION_MARK | The |
UNI_UNIFIED_IDEOGRAPH | The |
UNI_TERMINAL_PUNCTUATION | The |
Discussion 🔗
Unicorn supports a small subset of the binary character properties defined by the Unicode Standard. The binary properties supported are those that are useful when parsing plain text.
Most binary characters properties defined by the standard are only applicable in specific applications, i.e. text shaping or rendering. Other properties are informational, for example a character’s name, the version it was introduced into the Unicode Standard. The remaining are only relevant when implementing various Unicode algorithms and are not “general” enough to expose.