Expand description
Registry of rules that define the contexts in which particular
PROTOCOL-VALID characters, characters associated with a requirement
for Contextual Information, are permitted. These rules are expressed
as tests on the label in which the characters appear (all, or any part of,
the label may be tested).
Each rule is constructed as a Boolean expression that evaluates to
either true or false. A simple true or false rule sets the
default result value for the rule set. Subsequent conditional rules
that evaluate to true or false may re-set the result value.
A special value Undefined is used to deal with any error
conditions, such as an attempt to test a character before the start
of a label or after the end of a label. If any term of a rule
evaluates to Undefined, further evaluation of the rule immediately
terminates, as the result value of the rule will itself be Undefined.
Enums§
- Context
Rule Error - Error associated to the application of any context rule.
Functions§
- get_
context_ rule - Gets the context rule associated to an Unicode code point. Arguments
- rule_
arabic_ indic_ digits - Appendix A.8.
ARABIC-INDIC DIGITS
Can not be mixed with Extended Arabic-Indic Digits. - rule_
extended_ arabic_ indic_ digits - Appendix A.9.
EXTENDED ARABIC-INDIC DIGITS
Can not be mixed with Arabic-Indic Digits. - rule_
greek_ lower_ numeral_ sign_ keraia - Appendix A.4.
GREEK LOWER NUMERAL SIGN (
KERAIA
)
The script of the following character MUST be Greek. - rule_
hebrew_ punctuation - Appendix A.5.
Appendix A.6.
HEBREW PUNCTUATION
GERESH
and HEBREW PUNCTUATIONGERSHAYIM
The script of the preceding character MUST be Hebrew. - rule_
katakana_ middle_ dot - Appendix A.7.
KATAKANA MIDDLE DOT
Note that the Script ofKatakana Middle Dot
is not any ofHiragana
,Katakana
, orHan
. The effect of this rule is to require at least one character in the label to be in one of those scripts. - rule_
middle_ dot - Appendix A.3.
MIDDLE DOT
Between ‘l’U+006C
characters only, used to permit the Catalan characterela
geminada
to be expressed. - rule_
zero_ width_ joiner - Appendix A.2.
ZERO WIDTH JOINER
This may occur inIndic
scripts in a consonant-conjunct context (immediately following avirama
), to control required display of such conjuncts. - rule_
zero_ width_ nonjoiner - Appendix A.1.
ZERO WIDTH NON-JOINER
U+200C
This may occur in a formally cursive script (such as Arabic) in a context where it breaks a cursive connection as required for orthographic rules, as in the Persian language, for example. It also may occur inIndic
scripts in a consonant-conjunct context (immediately following avirama
), to control required display of such conjuncts.
Type Aliases§
- Context
Rule - Describes a context rule function