Enumeration

unistrength

Collation comparison levels.

enum unistrength {
    ...
}

Constants

UNI_PRIMARY

Represents differences in the base letter or symbol.

UNI_SECONDARY

Represents differences in accents.

UNI_TERTIARY

Represents differences in case or variants of symbols.

UNI_QUATERNARY

Represents differences in punctuation.

Discussion

The Unicode collation algorithm is a multilevel comparison algorithm. The number of levels that are considered in comparison is known as the collation strength. This enumeration defines constants for each level.

When two strings are compared, the most important feature is the identity of the base letters: for example, the difference between an A and a B. Accent differences typically matter only when the base letters are identical, and case differences (uppercase versus lowercase) typically matter only when both the base letters and their accents are identical.
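The level-by-level behavior described here can be illustrated with a small, self-contained sketch. It does not use this library's API: the `toy_strength` enum, the `toy_elem` struct, the `toy_compare` function, and the hand-assigned weights are all hypothetical, and real collation derives its weights from Unicode data rather than from a table like this. The sketch only shows how a strength setting limits how many levels take part in a comparison.

```c
#include <stddef.h>

/* Hypothetical stand-in for a collation strength; NOT the real unistrength. */
enum toy_strength { TOY_PRIMARY = 1, TOY_SECONDARY = 2, TOY_TERTIARY = 3 };

/* One toy collation element with a hand-assigned weight per level:
 * weight[0] = base letter, weight[1] = accent, weight[2] = case. */
struct toy_elem {
    int weight[3];
};

/* Compare level by level, stopping at the requested strength.
 * A lower level is consulted only if every higher level compared equal.
 * Returns -1, 0, or 1. */
static int toy_compare(const struct toy_elem *a, size_t na,
                       const struct toy_elem *b, size_t nb,
                       enum toy_strength strength)
{
    size_t n = na < nb ? na : nb;

    for (int level = 0; level < (int)strength; level++) {
        for (size_t i = 0; i < n; i++) {
            if (a[i].weight[level] != b[i].weight[level])
                return a[i].weight[level] < b[i].weight[level] ? -1 : 1;
        }
        /* A proper prefix sorts before the longer sequence. */
        if (na != nb)
            return na < nb ? -1 : 1;
    }
    return 0;
}

/* "role": plain lowercase letters. */
static const struct toy_elem role[] = {
    {{18, 0, 0}}, {{15, 0, 0}}, {{12, 0, 0}}, {{5, 0, 0}}
};

/* "rôle": same base letters, accent weight on the second letter. */
static const struct toy_elem role_accent[] = {
    {{18, 0, 0}}, {{15, 1, 0}}, {{12, 0, 0}}, {{5, 0, 0}}
};

/* "Role": same base letters, case weight on the first letter. */
static const struct toy_elem role_upper[] = {
    {{18, 0, 0}}, {{15, 0, 0}}, {{12, 0, 0}}, {{5, 0, 0}}
};
```

Under these assumed weights, "role" and "rôle" compare equal at primary strength and differ only once the secondary level is included, while "role" and "Role" stay equal until the tertiary level is included. Because the accent level is checked before the case level, "rôle" versus "Role" is decided by the accent even at tertiary strength.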