Network Working Group                                  P. Faltstrom, Ed.
Internet-Draft                                                     Cisco
Intended status: Standards Track                          April 27, 2008
Expires: October 29, 2008


                    The Unicode Codepoints and IDNA
                    draft-ietf-idnabis-tables-00.txt

Status of this Memo

   By submitting this Internet-Draft, each author represents that any
   applicable patent or other IPR claims of which he or she is aware
   have been or will be disclosed, and any of which he or she becomes
   aware will be disclosed, in accordance with Section 6 of BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as
   Internet-Drafts.

   Internet-Drafts are draft documents valid for a maximum of six
   months and may be updated, replaced, or obsoleted by other documents
   at any time.  It is inappropriate to use Internet-Drafts as
   reference material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt.

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.

   This Internet-Draft will expire on October 29, 2008.

Abstract

   This document specifies rules for deciding whether a codepoint,
   considered in isolation, is a candidate for inclusion in an
   Internationalized Domain Name.

   It is part of the specification of IDNA2008.

Faltstrom               Expires October 29, 2008                [Page 1]
Internet-Draft               IDNA Codepoints                  April 2008

Table of Contents

   1.  Introduction
   2.  Category definitions Used to Calculate Derived Property Value
     2.1.  Categories Based on Unicode Properties
       2.1.1.  LetterDigits (A)
       2.1.2.  Unstable (B)
       2.1.3.  IgnorableProperties (C)
       2.1.4.  IgnorableBlocks (D)
     2.2.  Other Categories
       2.2.1.  LDH (E)
       2.2.2.  Exceptions (F)
       2.2.3.  BackwardCompatible (G)
       2.2.4.  JoinControl (H)
       2.2.5.  Unassigned (J)
   3.  Calculation of the Derived Property
   4.  Codepoints
   5.  IANA Considerations
   6.  Security Considerations
   7.  Contributors
   8.  Acknowledgements
   Appendix A.  Codepoints 0x0000 - 0x10FFFD
   Appendix A.1.  Codepoints in Unicode Character Database (UCD) format
   9.  References
     9.1.  Normative References
     9.2.  Informative References
   Author's Address
   Intellectual Property and Copyright Statements
1.  Introduction

   RFC 4690 [RFC4690] suggests an inclusion based approach for selecting
   the codepoints from The Unicode Standard [Unicode51] that should be
   included in the list of codepoints that may be used in
   Internationalized Domain Names.  Specifically, RFC 4690 [RFC4690]
   says the following:

      The IAB has concluded that there is a consensus within the broader
      community that lists of code points should be specified by the use
      of an inclusion-based mechanism (i.e., identifying the characters
      that are permitted), rather than by excluding a small number of
      characters from the total Unicode set as Stringprep [RFC3454] and
      Nameprep [RFC3491] do today.  That conclusion should be reviewed
      by the IETF community and action taken as appropriate.

   This document reviews and classifies the collections of codepoints in
   the Unicode character set by examining various properties of the
   codepoints.  It then defines an algorithm for determining a derived
   property value.  It specifies a procedure and not a table of
   codepoints so that the algorithm can be used to determine code point
   sets independent of the version of Unicode that is in use.

   This document is not intended to specify precisely how these property
   values are to be applied in IDN labels.  That information appears in
   [IDNA2008-protocol], but it is important to understand that the
   assignment of a value of this property to a particular character is
   not sufficient to determine whether it can be used in a given label.
   In particular, some combinations of allowed codepoints are not
   advisable for use in IDNs due to rules specific to a script or class
   of characters.  The requirement for such rules is linked to the
   operations in [IDNA2008-protocol] and especially to the characters
   designated as requiring contextual rules.

   The value of the property is to be interpreted approximately as
   follows.

   o  PROTOCOL VALID: Those that are allowed to be used in IDNs.
      Codepoints with this property value are permitted for general use
      in IDNs.  However, a label that consists only of codepoints with
      this property value is not thereby guaranteed to be usable in the
      DNS.  See [IDNA2008-protocol] for the algorithm used to make
      decisions about domain names.  The abbreviated term PVALID is used
      to refer to this value in the rest of this document.

   o  CONTEXTUAL RULE REQUIRED: Some characteristics of the character,
      such as it being invisible in certain contexts or problematic in
      others, require that it not be used in labels unless specific
      other characters or properties are present.  The abbreviated term
      CONTEXT is used to refer to this value in the rest of this
      document.  There are two subdivisions of CONTEXTUAL RULE REQUIRED,
      one for Join_controls (called CONTEXTJ) and one for other
      characters (called CONTEXTO).  These are discussed in more detail
      below and in [IDNA2008-protocol].

   o  DISALLOWED: Those that should clearly not be included in IDNs.
      Codepoints with this property value will never be permitted in
      IDNs.

   o  UNASSIGNED: Those codepoints that are unassigned in the Unicode
      Standard.

   The (non-normative) table in Appendix A is derived from data in
   Unicode 5.1, rather than the earlier Unicode 3.2, in order to take
   advantage of the expanded character repertoire and better definitions
   in the newer version.  The mechanisms described here allow
   determination of the value of the property for future versions of
   Unicode (including characters added after Unicode 5.1).  These
   mechanisms should be suitable for newer revisions of Unicode, as long
   as the Unicode properties on which they are based remain stable.

   Some codepoints need to be allowed in exceptional circumstances, but
   should be excluded in all other cases; these rules are also described
   in other documents.  The most notable of these are the Join Control
   characters, ZERO WIDTH JOINER (U+200D) and ZERO WIDTH NON-JOINER
   (U+200C).  Both of them have the derived property value CONTEXTJ.  A
   character with the derived property value CONTEXTJ or CONTEXTO
   (CONTEXTUAL RULE REQUIRED) is not to be used unless an appropriate
   rule has been established and the context of the character is
   consistent with that rule.  It is invalid to either register a string
   containing these characters or even to look one up unless such a
   contextual rule is found and satisfied.  Please see Appendix A, The
   Contextual Rules Registry, in [IDNA2008-rationale] for more
   information.
   This document is part of a series that, together, constitute a
   preliminary proposal for updating the IDNA standards to resolve
   issues uncovered in recent years, cover a broader range of scripts,
   and provide for migration to newer versions of Unicode.  See
   [IDNA2008-rationale] for a broader discussion.

2.  Category definitions Used to Calculate Derived Property Value

2.1.  Categories Based on Unicode Properties

   The derived property obtains its value based on a two step procedure.
   First, characters are placed in one or more character categories
   based on core properties defined by the Unicode Standard.  These
   categories are not mutually exclusive.

   In the second step, set operations are used with these categories to
   determine the values for an IDN-specific property.  Those operations
   are specified in Section 3.

   In many cases aliases are used in the data in the Unicode Standard.
   This document uses both the alias and the spelled out terms (for
   example alias Ll for the General Category Lowercase_Letter).

2.1.1.  LetterDigits (A)

   A: generalCategory(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}

   These rules identify characters commonly used in mnemonics and often
   informally described as "language characters".  In general, only
   codepoints assigned to this category are suitable for use in IDN.

   The generalCategory() operation returns the General Category for a
   particular Unicode code point.  For more information, see section 4.5
   of The Unicode Standard [Unicode5].

   The categories used in this rule are:

   o  Ll - Lowercase_Letter
   o  Lu - Uppercase_Letter
   o  Lo - Other_Letter
   o  Nd - Decimal_Number
   o  Lm - Modifier_Letter
   o  Mn - Nonspacing_Mark
   o  Mc - Spacing_Mark

2.1.2.  Unstable (B)

   B: toNFKC(toCaseFolded(toNFKC(cp))) != cp

   The category is used to group the characters that are not stable
   under NFKC normalization and casefolding.  In general, these
   codepoints are not suitable for use for IDN.

   toCaseFolded is defined in Section 3.13 of The Unicode Standard
   [Unicode5].

   The toNFKC() operation returns the code point in normalization form
   KC.  For more information, see Section 5 of Unicode Standard Annex
   #15 [TR15].
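   The two rules above (A and B) can be sketched in Python with the
   standard unicodedata module.  This is only an illustration under
   stated assumptions: the function names is_letter_digit and
   is_unstable are hypothetical, str.casefold() is used as a stand-in
   for toCaseFolded, and the results reflect whichever Unicode version
   the Python interpreter ships with, not necessarily Unicode 5.1.

```python
import unicodedata

# General Category aliases listed for LetterDigits (A) in Section 2.1.1.
LETTER_DIGIT_CATEGORIES = frozenset({"Ll", "Lu", "Lo", "Nd", "Lm", "Mn", "Mc"})


def is_letter_digit(cp: int) -> bool:
    """Rule A: generalCategory(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}."""
    return unicodedata.category(chr(cp)) in LETTER_DIGIT_CATEGORIES


def is_unstable(cp: int) -> bool:
    """Rule B: toNFKC(toCaseFolded(toNFKC(cp))) != cp.

    Assumption: str.casefold() stands in for toCaseFolded.
    """
    original = chr(cp)
    folded = unicodedata.normalize("NFKC", original).casefold()
    return unicodedata.normalize("NFKC", folded) != original


if __name__ == "__main__":
    for cp in (0x0041, 0x0061, 0x00DF, 0x0300):
        print(f"{cp:04X}", is_letter_digit(cp), is_unstable(cp))
```

   A codepoint can be in both categories at once (for example, an
   uppercase letter is in LetterDigits and in Unstable), which is why
   the order of evaluation in Section 3 matters.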
2.1.3.  IgnorableProperties (C)

   C: property(cp) is in {Default_Ignorable_Code_Point, White_Space,
      Noncharacter_Code_Point}

   This category is used to group codepoints that are not recommended
   for use in identifiers.  In general, these codepoints are not
   suitable for use for IDN.

   The definition for Default_Ignorable_Code_Point can be found in
   DerivedCoreProperties.txt [1] (and erratum of 2007-January-25 [2])
   and is

      Other_Default_Ignorable_Code_Point
      + Cf
      + Cc
      + Cs
      + Noncharacter_Code_Point
      + Variation_Selector
      - White_Space
      - FFF9..FFFB (Annotation Characters)

2.1.4.  IgnorableBlocks (D)

   D: block(cp) in {Combining Diacritical Marks for Symbols, Musical
      Symbols, Ancient Greek Musical Notation, Private Use Area}

   This category is used to identify codepoints that are not useful in
   mnemonics or that are otherwise impractical for IDN use.  In general,
   these codepoints are not suitable for use for IDN.

2.2.  Other Categories

2.2.1.  LDH (E)

   E: cp is in {002D, 0030..0039, 0061..007A}

   This category is used in the second step to preserve the traditional
   "hostname" (LDH) characters ('-', 0-9 and a-z).  In general, these
   codepoints are suitable for use for IDN.

   Note that the codepoint 0x002D (HYPHEN-MINUS) is also in Exceptions
   (Section 2.2.2).  It is kept in the LDH category for clarity,
   although the LDH rule will never match it: as Section 3 shows,
   Exceptions (Section 2.2.2) is evaluated first and overrides what LDH
   (Section 2.2.1) specifies.

2.2.2.  Exceptions (F)

   F: cp in {002D, 00B7, 02B9, 0375, 0483, 05F3, 05F4, 06FD, 06FE, 0F0B,
      3005, 3007, 303B, 30FB}

   This category explicitly lists codepoints for which the category
   cannot be assigned using only the core property values that exist in
   the Unicode standard.  The values are according to the table below:

   002D; CONTEXTO # HYPHEN-MINUS
   00B7; CONTEXTO # MIDDLE DOT
   02B9; CONTEXTO # MODIFIER LETTER PRIME
   0375; CONTEXTO # GREEK LOWER NUMERAL SIGN (KERAIA)
   0483; CONTEXTO # COMBINING CYRILLIC TITLO
   05F3; CONTEXTO # HEBREW PUNCTUATION GERESH
   05F4; CONTEXTO # HEBREW PUNCTUATION GERSHAYIM
   06FD; PVALID   # ARABIC SIGN SINDHI AMPERSAND
   06FE; PVALID   # ARABIC SIGN SINDHI POSTPOSITION MEN
   0F0B; PVALID   # TIBETAN MARK INTERSYLLABIC TSHEG
   3005; CONTEXTO # IDEOGRAPHIC ITERATION MARK
   3007; PVALID   # IDEOGRAPHIC NUMBER ZERO
   303B; CONTEXTO # VERTICAL IDEOGRAPHIC ITERATION MARK
   30FB; CONTEXTO # KATAKANA MIDDLE DOT

   The characters 02B9, 0375 and 0483 are used in different scripts to
   indicate that an adjacent letter is being used with a numeric value.

2.2.3.  BackwardCompatible (G)

   G: cp in {}

   This category includes the codepoints whose property values in
   versions of Unicode after 5.0 have changed in such a way that the
   derived property value would no longer be PVALID or DISALLOWED.
   Because codepoints having either of these derived values MUST NOT
   have those values changed, this category is provided; initially, the
   set of codepoints is empty.

2.2.4.  JoinControl (H)

   H: property(cp) is in {Join_Control}

   This category consists of Join Control characters, which are not in
   LetterDigits (Section 2.1.1) but are still required in IDN labels
   under some circumstances.  They require extended special treatment in
   Lookup and Resolution.

2.2.5.  Unassigned (J)

   J: cp is unassigned

   This category consists of codepoints in the Unicode character set
   that are not (yet) assigned.
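   The explicitly enumerated categories (D through J) lend themselves
   to simple lookup structures.  The following Python sketch is an
   illustration only.  All names in it are hypothetical.  The block
   ranges for IgnorableBlocks are taken from the Unicode block
   definitions rather than from this document, the Join_Control set is
   written out as the two codepoints named in Section 1, and
   "unassigned" is approximated as General Category Cn in the Unicode
   version bundled with the Python interpreter (which is newer than 5.1
   and also reports noncharacters as Cn).

```python
import unicodedata

# Exceptions (F): values copied from the table in Section 2.2.2.
EXCEPTIONS = {
    0x002D: "CONTEXTO",  # HYPHEN-MINUS
    0x00B7: "CONTEXTO",  # MIDDLE DOT
    0x02B9: "CONTEXTO",  # MODIFIER LETTER PRIME
    0x0375: "CONTEXTO",  # GREEK LOWER NUMERAL SIGN (KERAIA)
    0x0483: "CONTEXTO",  # COMBINING CYRILLIC TITLO
    0x05F3: "CONTEXTO",  # HEBREW PUNCTUATION GERESH
    0x05F4: "CONTEXTO",  # HEBREW PUNCTUATION GERSHAYIM
    0x06FD: "PVALID",    # ARABIC SIGN SINDHI AMPERSAND
    0x06FE: "PVALID",    # ARABIC SIGN SINDHI POSTPOSITION MEN
    0x0F0B: "PVALID",    # TIBETAN MARK INTERSYLLABIC TSHEG
    0x3005: "CONTEXTO",  # IDEOGRAPHIC ITERATION MARK
    0x3007: "PVALID",    # IDEOGRAPHIC NUMBER ZERO
    0x303B: "CONTEXTO",  # VERTICAL IDEOGRAPHIC ITERATION MARK
    0x30FB: "CONTEXTO",  # KATAKANA MIDDLE DOT
}

# BackwardCompatible (G): initially empty, per Section 2.2.3.
BACKWARD_COMPATIBLE = {}

# IgnorableBlocks (D): inclusive ranges (assumed from Unicode Blocks.txt).
IGNORABLE_BLOCKS = (
    (0x20D0, 0x20FF),    # Combining Diacritical Marks for Symbols
    (0x1D100, 0x1D1FF),  # Musical Symbols
    (0x1D200, 0x1D24F),  # Ancient Greek Musical Notation
    (0xE000, 0xF8FF),    # Private Use Area
)

# JoinControl (H): ZERO WIDTH NON-JOINER and ZERO WIDTH JOINER.
JOIN_CONTROL = frozenset({0x200C, 0x200D})


def in_ignorable_blocks(cp: int) -> bool:
    return any(lo <= cp <= hi for lo, hi in IGNORABLE_BLOCKS)


def is_ldh(cp: int) -> bool:
    """LDH (E): cp is in {002D, 0030..0039, 0061..007A}."""
    return cp == 0x002D or 0x0030 <= cp <= 0x0039 or 0x0061 <= cp <= 0x007A


def is_join_control(cp: int) -> bool:
    return cp in JOIN_CONTROL


def is_unassigned(cp: int) -> bool:
    """Unassigned (J), approximated as General Category Cn."""
    return unicodedata.category(chr(cp)) == "Cn"
```

   Keeping Exceptions as a mapping from codepoint to value, rather than
   as a plain set, mirrors the fact that this category assigns the
   derived value directly.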
3.  Calculation of the Derived Property

   As described above (Section 1) and in more detail in the "IDNA
   Protocol" document [IDNA2008-protocol], possible values of the IDN
   property are:

   o  PVALID
   o  CONTEXTJ
   o  CONTEXTO
   o  DISALLOWED
   o  UNASSIGNED

   The algorithm to calculate the value of the derived property is as
   follows.

   o  If the codepoint is in Exceptions (Section 2.2.2), the value is
      according to the table in Section 2.2.2.

   o  If the codepoint is in BackwardCompatible (Section 2.2.3), the
      value is according to the table in Section 2.2.3.

   o  If the codepoint is in LDH (Section 2.2.1), the value is PVALID.

   o  If the codepoint is in JoinControl (Section 2.2.4), the value is
      CONTEXTJ.

   o  If the codepoint is in Unstable (Section 2.1.2), the value is
      DISALLOWED.

   o  If the codepoint is in IgnorableProperties (Section 2.1.3), the
      value is DISALLOWED.

   o  If the codepoint is in IgnorableBlocks (Section 2.1.4), the value
      is DISALLOWED.

   o  If the codepoint is in Unassigned (Section 2.2.5), the value is
      UNASSIGNED.

   o  If the codepoint is in LetterDigits (Section 2.1.1), the value is
      PVALID.

   o  If the codepoint is not in LetterDigits (Section 2.1.1), the value
      is DISALLOWED.

4.  Codepoints

   The Categories and Rules defined in Section 2 and Section 3 apply to
   all assigned Unicode characters.  The table in Appendix A shows, for
   illustrative purposes, the consequences of the categories and
   classification rules, and the resulting property values.

   The list of codepoints that can be found in Appendix A is non-
   normative.  Section 2 and Section 3 are normative.
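   The ordered rules of Section 3 amount to a first-match-wins chain.
   The Python sketch below shows only that ordering; it is not a
   conforming implementation.  The function derived_property and the
   DEMO_* names are hypothetical.  Category membership is passed in as
   predicates, and the demonstration predicates for IgnorableProperties
   and IgnorableBlocks are placeholders that always return False,
   because the Python standard library does not expose those Unicode
   properties.  The remaining predicates rely on the Unicode version
   bundled with the interpreter rather than Unicode 5.1.

```python
import unicodedata
from typing import Callable, Mapping

Predicate = Callable[[int], bool]


def derived_property(
    cp: int,
    exceptions: Mapping[int, str],
    backward_compatible: Mapping[int, str],
    in_category: Mapping[str, Predicate],
) -> str:
    """Apply the Section 3 rules in order; the first matching rule wins."""
    if cp in exceptions:
        return exceptions[cp]
    if cp in backward_compatible:
        return backward_compatible[cp]
    if in_category["LDH"](cp):
        return "PVALID"
    if in_category["JoinControl"](cp):
        return "CONTEXTJ"
    for name in ("Unstable", "IgnorableProperties", "IgnorableBlocks"):
        if in_category[name](cp):
            return "DISALLOWED"
    if in_category["Unassigned"](cp):
        return "UNASSIGNED"
    if in_category["LetterDigits"](cp):
        return "PVALID"
    return "DISALLOWED"


def _unstable(cp: int) -> bool:
    # Assumption: str.casefold() stands in for toCaseFolded.
    s = chr(cp)
    folded = unicodedata.normalize("NFKC", s).casefold()
    return unicodedata.normalize("NFKC", folded) != s


# Illustrative predicates only; two of them are placeholders.
DEMO_CATEGORIES: Mapping[str, Predicate] = {
    "LDH": lambda cp: cp == 0x2D or 0x30 <= cp <= 0x39 or 0x61 <= cp <= 0x7A,
    "JoinControl": lambda cp: cp in (0x200C, 0x200D),
    "Unstable": _unstable,
    "IgnorableProperties": lambda cp: False,  # placeholder, needs UCD data
    "IgnorableBlocks": lambda cp: False,      # placeholder, needs UCD data
    "Unassigned": lambda cp: unicodedata.category(chr(cp)) == "Cn",
    "LetterDigits": lambda cp: unicodedata.category(chr(cp))
    in {"Ll", "Lu", "Lo", "Nd", "Lm", "Mn", "Mc"},
}

# Two entries copied from the Exceptions table in Section 2.2.2.
DEMO_EXCEPTIONS = {0x002D: "CONTEXTO", 0x3007: "PVALID"}

if __name__ == "__main__":
    for cp in (0x002D, 0x0061, 0x0041, 0x200D, 0x00F7):
        value = derived_property(cp, DEMO_EXCEPTIONS, {}, DEMO_CATEGORIES)
        print(f"{cp:04X} ; {value}")
```

   Because Exceptions is tested first, HYPHEN-MINUS (002D) receives the
   value from the Exceptions table even though it is also a member of
   LDH.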
5.  IANA Considerations

   IANA is to keep a list of the derived property for the versions of
   Unicode that are released after (and including) version 5.0.  The
   derived property value is to be calculated according to the
   specifications in Section 2 and Section 3 and not by copying the
   non-normative table found in Appendix A.

   If needed, IANA should (with the help of an appointed expert) suggest
   updates of this RFC where BackwardCompatible (Section 2.2.3) is
   updated; at the release of this document, that set is empty.

6.  Security Considerations

   The security issues associated with this work are discussed in
   [IDNA2008-rationale] and [IDNA2008-protocol].

7.  Contributors

   While the listed editor held the pen, this document represents the
   joint work and conclusions of an ad hoc design team.  In addition to
   the editor, this consisted of Harald Alvestrand, Tina Dam, Cary Karp,
   and John Klensin.

8.  Acknowledgements

   This document would not have been possible to produce without input
   from many people.  The main contributors are (in alphabetical order)
   Harald Alvestrand, Vint Cerf, Tina Dam, Mark Davis, Mouhammet Diop,
   Michael Everson, Asmus Freytag, Debbie Garside, Paul Hoffman, Cary
   Karp, John Klensin, Olaf Kolkman, Lisa Moore, Yngve Pettersen, Erik
   van der Poel, Hualin Qian, Rick Reed, Michel Suignard and Kenneth
   Whistler.

Appendix A.  Codepoints 0x0000 - 0x10FFFD

   If one applies the rules (Section 3) to the codepoints 0x0000 to
   0x10FFFD, the result is as follows.

   This list is non-normative, as explained in Section 4, and only
   included for illustrative purposes.

Appendix A.1.  Codepoints in Unicode Character Database (UCD) format

0000..002C ; DISALLOWED  # <control>..COMMA
002D       ; CONTEXTO    # HYPHEN-MINUS
002E..002F ; DISALLOWED  # FULL STOP..SOLIDUS
0030..0039 ; PVALID      # DIGIT ZERO..DIGIT NINE
003A..0060 ; DISALLOWED  # COLON..GRAVE ACCENT
0061..007A ; PVALID      # LATIN SMALL LETTER A..LATIN SMALL LETTER Z
007B..00B6 ; DISALLOWED  # LEFT CURLY BRACKET..PILCROW SIGN
00B7       ; CONTEXTO    # MIDDLE DOT
00B8..00DF ; DISALLOWED  # CEDILLA..LATIN SMALL LETTER SHARP S
00E0..00F6 ; PVALID      # LATIN SMALL LETTER A WITH GRAVE..LATIN SMALL
00F7       ; DISALLOWED  # DIVISION SIGN
00F8..00FF ; PVALID      # LATIN SMALL LETTER O WITH STROKE..LATIN SMAL
0100       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH MACRON
0101       ; PVALID      # LATIN SMALL LETTER A WITH MACRON
0102       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH BREVE
0103       ; PVALID      # LATIN SMALL LETTER A WITH BREVE
0104       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH OGONEK
0105       ; PVALID      # LATIN SMALL LETTER A WITH OGONEK
0106       ; DISALLOWED  # LATIN CAPITAL LETTER C WITH ACUTE
0107       ; PVALID      # LATIN SMALL LETTER C WITH ACUTE
0108       ; DISALLOWED  # LATIN CAPITAL LETTER C WITH CIRCUMFLEX
0109       ; PVALID      # LATIN SMALL LETTER C WITH CIRCUMFLEX
010A       ; DISALLOWED  # LATIN CAPITAL LETTER C WITH DOT ABOVE
010B       ; PVALID      # LATIN SMALL LETTER C WITH DOT ABOVE
010C       ; DISALLOWED  # LATIN CAPITAL LETTER C WITH CARON
010D       ; PVALID      # LATIN SMALL LETTER C WITH CARON
010E       ; DISALLOWED  # LATIN CAPITAL LETTER D WITH CARON
010F       ; PVALID      # LATIN SMALL LETTER D WITH CARON
0110       ; DISALLOWED  # LATIN CAPITAL LETTER D WITH STROKE
0111       ; PVALID      # LATIN SMALL LETTER D WITH STROKE
0112       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH MACRON
0113       ; PVALID      # LATIN SMALL LETTER E WITH MACRON
0114       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH BREVE
0115       ; PVALID      # LATIN SMALL LETTER E WITH BREVE
0116       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH DOT ABOVE
0117       ; PVALID      # LATIN SMALL LETTER E WITH DOT ABOVE
0118       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH OGONEK
0119       ; PVALID      # LATIN SMALL LETTER E WITH OGONEK
011A       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH CARON
011B       ; PVALID      # LATIN SMALL LETTER E WITH CARON
011C       ; DISALLOWED  # LATIN CAPITAL LETTER G WITH CIRCUMFLEX
011D       ; PVALID      # LATIN SMALL LETTER G WITH CIRCUMFLEX
011E       ; DISALLOWED  # LATIN CAPITAL LETTER G WITH BREVE
011F       ; PVALID      # LATIN SMALL LETTER G WITH BREVE
0120       ; DISALLOWED  # LATIN CAPITAL LETTER G WITH DOT ABOVE
0121       ; PVALID      # LATIN SMALL LETTER G WITH DOT ABOVE
0122       ; DISALLOWED  # LATIN CAPITAL LETTER G WITH CEDILLA
0123       ; PVALID      # LATIN SMALL LETTER G WITH CEDILLA
0124       ; DISALLOWED  # LATIN CAPITAL LETTER H WITH CIRCUMFLEX
0125       ; PVALID      # LATIN SMALL LETTER H WITH CIRCUMFLEX
0126       ; DISALLOWED  # LATIN CAPITAL LETTER H WITH STROKE
0127       ; PVALID      # LATIN SMALL LETTER H WITH STROKE
0128       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH TILDE
0129       ; PVALID      # LATIN SMALL LETTER I WITH TILDE
012A       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH MACRON
012B       ; PVALID      # LATIN SMALL LETTER I WITH MACRON
012C       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH BREVE
012D       ; PVALID      # LATIN SMALL LETTER I WITH BREVE
012E       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH OGONEK
012F       ; PVALID      # LATIN SMALL LETTER I WITH OGONEK
0130       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH DOT ABOVE
0131       ; PVALID      # LATIN SMALL LETTER DOTLESS I
0132..0134 ; DISALLOWED  # LATIN CAPITAL LIGATURE IJ..LATIN CAPITAL LET
0135       ; PVALID      # LATIN SMALL LETTER J WITH CIRCUMFLEX
0136       ; DISALLOWED  # LATIN CAPITAL LETTER K WITH CEDILLA
0137..0138 ; PVALID      # LATIN SMALL LETTER K WITH CEDILLA..LATIN SMA
0139       ; DISALLOWED  # LATIN CAPITAL LETTER L WITH ACUTE
013A       ; PVALID      # LATIN SMALL LETTER L WITH ACUTE
013B       ; DISALLOWED  # LATIN CAPITAL LETTER L WITH CEDILLA
013C       ; PVALID      # LATIN SMALL LETTER L WITH CEDILLA
013D       ; DISALLOWED  # LATIN CAPITAL LETTER L WITH CARON
013E       ; PVALID      # LATIN SMALL LETTER L WITH CARON
013F..0141 ; DISALLOWED  # LATIN CAPITAL LETTER L WITH MIDDLE DOT..LATI
0142       ; PVALID      # LATIN SMALL LETTER L WITH STROKE
0143       ; DISALLOWED  # LATIN CAPITAL LETTER N WITH ACUTE
0144       ; PVALID      # LATIN SMALL LETTER N WITH ACUTE
0145       ; DISALLOWED  # LATIN CAPITAL LETTER N WITH CEDILLA
0146       ; PVALID      # LATIN SMALL LETTER N WITH CEDILLA
0147       ; DISALLOWED  # LATIN CAPITAL LETTER N WITH CARON
0148       ; PVALID      # LATIN SMALL LETTER N WITH CARON
0149..014A ; DISALLOWED  # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE.
014B       ; PVALID      # LATIN SMALL LETTER ENG
014C       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH MACRON
014D       ; PVALID      # LATIN SMALL LETTER O WITH MACRON
014E       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH BREVE
014F       ; PVALID      # LATIN SMALL LETTER O WITH BREVE
0150       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
0151       ; PVALID      # LATIN SMALL LETTER O WITH DOUBLE ACUTE
0152       ; DISALLOWED  # LATIN CAPITAL LIGATURE OE
0153       ; PVALID      # LATIN SMALL LIGATURE OE
0154       ; DISALLOWED  # LATIN CAPITAL LETTER R WITH ACUTE
0155       ; PVALID      # LATIN SMALL LETTER R WITH ACUTE
0156       ; DISALLOWED  # LATIN CAPITAL LETTER R WITH CEDILLA
0157       ; PVALID      # LATIN SMALL LETTER R WITH CEDILLA
0158       ; DISALLOWED  # LATIN CAPITAL LETTER R WITH CARON
0159       ; PVALID      # LATIN SMALL LETTER R WITH CARON
015A       ; DISALLOWED  # LATIN CAPITAL LETTER S WITH ACUTE
015B       ; PVALID      # LATIN SMALL LETTER S WITH ACUTE
015C       ; DISALLOWED  # LATIN CAPITAL LETTER S WITH CIRCUMFLEX
015D       ; PVALID      # LATIN SMALL LETTER S WITH CIRCUMFLEX
015E       ; DISALLOWED  # LATIN CAPITAL LETTER S WITH CEDILLA
015F       ; PVALID      # LATIN SMALL LETTER S WITH CEDILLA
0160       ; DISALLOWED  # LATIN CAPITAL LETTER S WITH CARON
0161       ; PVALID      # LATIN SMALL LETTER S WITH CARON
0162       ; DISALLOWED  # LATIN CAPITAL LETTER T WITH CEDILLA
0163       ; PVALID      # LATIN SMALL LETTER T WITH CEDILLA
0164       ; DISALLOWED  # LATIN CAPITAL LETTER T WITH CARON
0165       ; PVALID      # LATIN SMALL LETTER T WITH CARON
0166       ; DISALLOWED  # LATIN CAPITAL LETTER T WITH STROKE
0167       ; PVALID      # LATIN SMALL LETTER T WITH STROKE
0168       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH TILDE
0169       ; PVALID      # LATIN SMALL LETTER U WITH TILDE
016A       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH MACRON
016B       ; PVALID      # LATIN SMALL LETTER U WITH MACRON
016C       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH BREVE
016D       ; PVALID      # LATIN SMALL LETTER U WITH BREVE
016E       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH RING ABOVE
016F       ; PVALID      # LATIN SMALL LETTER U WITH RING ABOVE
0170       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
0171       ; PVALID      # LATIN SMALL LETTER U WITH DOUBLE ACUTE
0172       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH OGONEK
0173       ; PVALID      # LATIN SMALL LETTER U WITH OGONEK
0174       ; DISALLOWED  # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
0175       ; PVALID      # LATIN SMALL LETTER W WITH CIRCUMFLEX
0176       ; DISALLOWED  # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
0177       ; PVALID      # LATIN SMALL LETTER Y WITH CIRCUMFLEX
0178..0179 ; DISALLOWED  # LATIN CAPITAL LETTER Y WITH DIAERESIS..LATIN
017A       ; PVALID      # LATIN SMALL LETTER Z WITH ACUTE
017B       ; DISALLOWED  # LATIN CAPITAL LETTER Z WITH DOT ABOVE
017C       ; PVALID      # LATIN SMALL LETTER Z WITH DOT ABOVE
017D       ; DISALLOWED  # LATIN CAPITAL LETTER Z WITH CARON
017E       ; PVALID      # LATIN SMALL LETTER Z WITH CARON
017F       ; DISALLOWED  # LATIN SMALL LETTER LONG S
0180       ; PVALID      # LATIN SMALL LETTER B WITH STROKE
0181..0182 ; DISALLOWED  # LATIN CAPITAL LETTER B WITH HOOK..LATIN CAPI
0183       ; PVALID      # LATIN SMALL LETTER B WITH TOPBAR
0184       ; DISALLOWED  # LATIN CAPITAL LETTER TONE SIX
0185       ; PVALID      # LATIN SMALL LETTER TONE SIX
0186..0187 ; DISALLOWED  # LATIN CAPITAL LETTER OPEN O..LATIN CAPITAL L
0188       ; PVALID      # LATIN SMALL LETTER C WITH HOOK
0189..018B ; DISALLOWED  # LATIN CAPITAL LETTER AFRICAN D..LATIN CAPITA
018C..018D ; PVALID      # LATIN SMALL LETTER D WITH TOPBAR..LATIN SMAL
018E..0191 ; DISALLOWED  # LATIN CAPITAL LETTER REVERSED E..LATIN CAPIT
0192       ; PVALID      # LATIN SMALL LETTER F WITH HOOK
0193..0194 ; DISALLOWED  # LATIN CAPITAL LETTER G WITH HOOK..LATIN CAPI
0195       ; PVALID      # LATIN SMALL LETTER HV
0196..0198 ; DISALLOWED  # LATIN CAPITAL LETTER IOTA..LATIN CAPITAL LET
0199..019B ; PVALID      # LATIN SMALL LETTER K WITH HOOK..LATIN SMALL
019C..019D ; DISALLOWED  # LATIN CAPITAL LETTER TURNED M..LATIN CAPITAL
019E       ; PVALID      # LATIN SMALL LETTER N WITH LONG RIGHT LEG
019F..01A0 ; DISALLOWED  # LATIN CAPITAL LETTER O WITH MIDDLE TILDE..LA
01A1       ; PVALID      # LATIN SMALL LETTER O WITH HORN
01A2       ; DISALLOWED  # LATIN CAPITAL LETTER OI
01A3       ; PVALID      # LATIN SMALL LETTER OI
01A4       ; DISALLOWED  # LATIN CAPITAL LETTER P WITH HOOK
01A5       ; PVALID      # LATIN SMALL LETTER P WITH HOOK
01A6..01A7 ; DISALLOWED  # LATIN LETTER YR..LATIN CAPITAL LETTER TONE T
01A8       ; PVALID      # LATIN SMALL LETTER TONE TWO
01A9       ; DISALLOWED  # LATIN CAPITAL LETTER ESH
01AA..01AB ; PVALID      # LATIN LETTER REVERSED ESH LOOP..LATIN SMALL
01AC       ; DISALLOWED  # LATIN CAPITAL LETTER T WITH HOOK
01AD       ; PVALID      # LATIN SMALL LETTER T WITH HOOK
01AE..01AF ; DISALLOWED  # LATIN CAPITAL LETTER T WITH RETROFLEX HOOK..
01B0       ; PVALID      # LATIN SMALL LETTER U WITH HORN
01B1..01B3 ; DISALLOWED  # LATIN CAPITAL LETTER UPSILON..LATIN CAPITAL
01B4       ; PVALID      # LATIN SMALL LETTER Y WITH HOOK
01B5       ; DISALLOWED  # LATIN CAPITAL LETTER Z WITH STROKE
01B6       ; PVALID      # LATIN SMALL LETTER Z WITH STROKE
01B7..01B8 ; DISALLOWED  # LATIN CAPITAL LETTER EZH..LATIN CAPITAL LETT
01B9..01BB ; PVALID      # LATIN SMALL LETTER EZH REVERSED..LATIN LETTE
01BC       ; DISALLOWED  # LATIN CAPITAL LETTER TONE FIVE
01BD..01C3 ; PVALID      # LATIN SMALL LETTER TONE FIVE..LATIN LETTER R
01C4..01CD ; DISALLOWED  # LATIN CAPITAL LETTER DZ WITH CARON..LATIN CA
01CE       ; PVALID      # LATIN SMALL LETTER A WITH CARON
01CF       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH CARON
01D0       ; PVALID      # LATIN SMALL LETTER I WITH CARON
01D1       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH CARON
01D2       ; PVALID      # LATIN SMALL LETTER O WITH CARON
01D3       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH CARON
01D4       ; PVALID      # LATIN SMALL LETTER U WITH CARON
01D5       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH DIAERESIS AND MA
01D6       ; PVALID      # LATIN SMALL LETTER U WITH DIAERESIS AND MACR
01D7       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH DIAERESIS AND AC
01D8       ; PVALID      # LATIN SMALL LETTER U WITH DIAERESIS AND ACUT
01D9       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH DIAERESIS AND CA
01DA       ; PVALID      # LATIN SMALL LETTER U WITH DIAERESIS AND CARO
01DB       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH DIAERESIS AND GR
01DC..01DD ; PVALID      # LATIN SMALL LETTER U WITH DIAERESIS AND GRAV
01DE       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH DIAERESIS AND MA
01DF       ; PVALID      # LATIN SMALL LETTER A WITH DIAERESIS AND MACR
01E0       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH DOT ABOVE AND MA
01E1       ; PVALID      # LATIN SMALL LETTER A WITH DOT ABOVE AND MACR
01E2       ; DISALLOWED  # LATIN CAPITAL LETTER AE WITH MACRON
01E3       ; PVALID      # LATIN SMALL LETTER AE WITH MACRON
01E4       ; DISALLOWED  # LATIN CAPITAL LETTER G WITH STROKE
01E5       ; PVALID      # LATIN SMALL LETTER G WITH STROKE
01E6       ; DISALLOWED  # LATIN CAPITAL LETTER G WITH CARON
01E7       ; PVALID      # LATIN SMALL LETTER G WITH CARON
01E8       ; DISALLOWED  # LATIN CAPITAL LETTER K WITH CARON
01E9       ; PVALID      # LATIN SMALL LETTER K WITH CARON
01EA       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH OGONEK
01EB       ; PVALID      # LATIN SMALL LETTER O WITH OGONEK
01EC       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH OGONEK AND MACRO
01ED       ; PVALID      # LATIN SMALL LETTER O WITH OGONEK AND MACRON
01EE       ; DISALLOWED  # LATIN CAPITAL LETTER EZH WITH CARON
01EF..01F0 ; PVALID      # LATIN SMALL LETTER EZH WITH CARON..LATIN SMA
01F1..01F4 ; DISALLOWED  # LATIN CAPITAL LETTER DZ..LATIN CAPITAL LETTE
01F5       ; PVALID      # LATIN SMALL LETTER G WITH ACUTE
01F6..01F8 ; DISALLOWED  # LATIN CAPITAL LETTER HWAIR..LATIN CAPITAL LE
01F9       ; PVALID      # LATIN SMALL LETTER N WITH GRAVE
01FA       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH RING ABOVE AND A
01FB       ; PVALID      # LATIN SMALL LETTER A WITH RING ABOVE AND ACU
01FC       ; DISALLOWED  # LATIN CAPITAL LETTER AE WITH ACUTE
01FD       ; PVALID      # LATIN SMALL LETTER AE WITH ACUTE
01FE       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH STROKE AND ACUTE
01FF       ; PVALID      # LATIN SMALL LETTER O WITH STROKE AND ACUTE
0200       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH DOUBLE GRAVE
0201       ; PVALID      # LATIN SMALL LETTER A WITH DOUBLE GRAVE
0202       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH INVERTED BREVE
0203       ; PVALID      # LATIN SMALL LETTER A WITH INVERTED BREVE
0204       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH DOUBLE GRAVE
0205       ; PVALID      # LATIN SMALL LETTER E WITH DOUBLE GRAVE
0206       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH INVERTED BREVE
0207       ; PVALID      # LATIN SMALL LETTER E WITH INVERTED BREVE
0208       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH DOUBLE GRAVE
0209       ; PVALID      # LATIN SMALL LETTER I WITH DOUBLE GRAVE
020A       ; DISALLOWED  # LATIN CAPITAL LETTER I WITH INVERTED BREVE
020B       ; PVALID      # LATIN SMALL LETTER I WITH INVERTED BREVE
020C       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH DOUBLE GRAVE
020D       ; PVALID      # LATIN SMALL LETTER O WITH DOUBLE GRAVE
020E       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH INVERTED BREVE
020F       ; PVALID      # LATIN SMALL LETTER O WITH INVERTED BREVE
0210       ; DISALLOWED  # LATIN CAPITAL LETTER R WITH DOUBLE GRAVE
0211       ; PVALID      # LATIN SMALL LETTER R WITH DOUBLE GRAVE
0212       ; DISALLOWED  # LATIN CAPITAL LETTER R WITH INVERTED BREVE
0213       ; PVALID      # LATIN SMALL LETTER R WITH INVERTED BREVE
0214       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH DOUBLE GRAVE
0215       ; PVALID      # LATIN SMALL LETTER U WITH DOUBLE GRAVE
0216       ; DISALLOWED  # LATIN CAPITAL LETTER U WITH INVERTED BREVE
0217       ; PVALID      # LATIN SMALL LETTER U WITH INVERTED BREVE
0218       ; DISALLOWED  # LATIN CAPITAL LETTER S WITH COMMA BELOW
0219       ; PVALID      # LATIN SMALL LETTER S WITH COMMA BELOW
021A       ; DISALLOWED  # LATIN CAPITAL LETTER T WITH COMMA BELOW
021B       ; PVALID      # LATIN SMALL LETTER T WITH COMMA BELOW
021C       ; DISALLOWED  # LATIN CAPITAL LETTER YOGH
021D       ; PVALID      # LATIN SMALL LETTER YOGH
021E       ; DISALLOWED  # LATIN CAPITAL LETTER H WITH CARON
021F       ; PVALID      # LATIN SMALL LETTER H WITH CARON
0220       ; DISALLOWED  # LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
0221       ; PVALID      # LATIN SMALL LETTER D WITH CURL
0222       ; DISALLOWED  # LATIN CAPITAL LETTER OU
0223       ; PVALID      # LATIN SMALL LETTER OU
0224       ; DISALLOWED  # LATIN CAPITAL LETTER Z WITH HOOK
0225       ; PVALID      # LATIN SMALL LETTER Z WITH HOOK
0226       ; DISALLOWED  # LATIN CAPITAL LETTER A WITH DOT ABOVE
0227       ; PVALID      # LATIN SMALL LETTER A WITH DOT ABOVE
0228       ; DISALLOWED  # LATIN CAPITAL LETTER E WITH CEDILLA
0229       ; PVALID      # LATIN SMALL LETTER E WITH CEDILLA
022A       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH DIAERESIS AND MA
022B       ; PVALID      # LATIN SMALL LETTER O WITH DIAERESIS AND MACR
022C       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH TILDE AND MACRON
022D       ; PVALID      # LATIN SMALL LETTER O WITH TILDE AND MACRON
022E       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH DOT ABOVE
022F       ; PVALID      # LATIN SMALL LETTER O WITH DOT ABOVE
0230       ; DISALLOWED  # LATIN CAPITAL LETTER O WITH DOT ABOVE AND MA
0231       ; PVALID      # LATIN SMALL LETTER O WITH DOT ABOVE AND MACR
0232       ; DISALLOWED  # LATIN CAPITAL LETTER Y WITH MACRON
0233..0239 ; PVALID      # LATIN SMALL LETTER Y WITH MACRON..LATIN SMAL
023A..023B ; DISALLOWED  # LATIN CAPITAL LETTER A WITH STROKE..LATIN CA
023C       ; PVALID      # LATIN SMALL LETTER C WITH STROKE
023D..023E ; DISALLOWED  # LATIN CAPITAL LETTER L WITH BAR..LATIN CAPIT
023F..0240 ; PVALID      # LATIN SMALL LETTER S WITH SWASH TAIL..LATIN
0241       ; DISALLOWED  # LATIN CAPITAL LETTER GLOTTAL STOP
0242       ; PVALID      # LATIN SMALL LETTER GLOTTAL STOP
0243..0246 ; DISALLOWED  # LATIN CAPITAL LETTER B WITH STROKE..LATIN CA
0247       ; PVALID      # LATIN SMALL LETTER E WITH STROKE
0248       ; DISALLOWED  # LATIN CAPITAL LETTER J WITH STROKE
0249       ; PVALID      # LATIN SMALL LETTER J WITH STROKE
024A       ; DISALLOWED  # LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL
024B       ; PVALID      # LATIN SMALL LETTER Q WITH HOOK TAIL
024C       ; DISALLOWED  # LATIN CAPITAL LETTER R WITH STROKE
024D       ; PVALID      # LATIN SMALL LETTER R WITH STROKE
024E       ; DISALLOWED  # LATIN CAPITAL LETTER Y WITH STROKE
024F..02AF ; PVALID      # LATIN SMALL LETTER Y WITH STROKE..LATIN SMAL
02B0..02B8 ; DISALLOWED  # MODIFIER LETTER SMALL H..MODIFIER LETTER SMA
02B9       ; CONTEXTO    # MODIFIER LETTER PRIME
02BA..02C1 ; PVALID      # MODIFIER LETTER DOUBLE PRIME..MODIFIER LETTE
02C2..02C5 ; DISALLOWED  # MODIFIER LETTER LEFT ARROWHEAD..MODIFIER LET
02C6..02D1 ; PVALID      # MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER
02D2..02EB  ; DISALLOWED  # MODIFIER LETTER CENTRED RIGHT HALF RING..MOD
02EC        ; PVALID      # MODIFIER LETTER VOICING
02ED        ; DISALLOWED  # MODIFIER LETTER UNASPIRATED
02EE        ; PVALID      # MODIFIER LETTER DOUBLE APOSTROPHE
02EF..02FF  ; DISALLOWED  # MODIFIER LETTER LOW DOWN ARROWHEAD..MODIFIER
0300..033F  ; PVALID      # COMBINING GRAVE ACCENT..COMBINING DOUBLE OVE
0340..0341  ; DISALLOWED  # COMBINING GRAVE TONE MARK..COMBINING ACUTE T
0342        ; PVALID      # COMBINING GREEK PERISPOMENI
0343..0345  ; DISALLOWED  # COMBINING GREEK KORONIS..COMBINING GREEK YPO
0346..034E  ; PVALID      # COMBINING BRIDGE ABOVE..COMBINING UPWARDS AR
034F        ; DISALLOWED  # COMBINING GRAPHEME JOINER
0350..036F  ; PVALID      # COMBINING RIGHT ARROWHEAD ABOVE..COMBINING L
0370        ; DISALLOWED  # GREEK CAPITAL LETTER HETA
0371        ; PVALID      # GREEK SMALL LETTER HETA
0372        ; DISALLOWED  # GREEK CAPITAL LETTER ARCHAIC SAMPI
0373        ; PVALID      # GREEK SMALL LETTER ARCHAIC SAMPI
0374        ; DISALLOWED  # GREEK NUMERAL SIGN
0375        ; CONTEXTO    # GREEK LOWER NUMERAL SIGN
0376        ; DISALLOWED  # GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA
0377        ; PVALID      # GREEK SMALL LETTER PAMPHYLIAN DIGAMMA
0378..0379  ; UNASSIGNED  # <reserved>..<reserved>
037A        ; DISALLOWED  # GREEK YPOGEGRAMMENI
037B..037D  ; PVALID      # GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GR
037E        ; DISALLOWED  # GREEK QUESTION MARK
037F..0383  ; UNASSIGNED  # <reserved>..<reserved>
0384..038A  ; DISALLOWED  # GREEK TONOS..GREEK CAPITAL LETTER IOTA WITH
038B        ; UNASSIGNED  # <reserved>
038C        ; DISALLOWED  # GREEK CAPITAL LETTER OMICRON WITH TONOS
038D        ; UNASSIGNED  # <reserved>
038E..038F  ; DISALLOWED  # GREEK CAPITAL LETTER UPSILON WITH TONOS..GRE
0390        ; PVALID      # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND T
0391..03A1  ; DISALLOWED  # GREEK CAPITAL LETTER ALPHA..GREEK CAPITAL LE
03A2        ; UNASSIGNED  # <reserved>
03A3..03AB  ; DISALLOWED  # GREEK CAPITAL LETTER SIGMA..GREEK CAPITAL LE
03AC..03C1  ; PVALID      # GREEK SMALL LETTER ALPHA WITH TONOS..GREEK S
03C2        ; DISALLOWED  # GREEK SMALL LETTER FINAL SIGMA
03C3..03CE  ; PVALID      # GREEK SMALL LETTER SIGMA..GREEK SMALL LETTER
03CF..03D6  ; DISALLOWED  # GREEK CAPITAL KAI SYMBOL..GREEK PI SYMBOL
03D7        ; PVALID      # GREEK KAI SYMBOL
03D8        ; DISALLOWED  # GREEK LETTER ARCHAIC KOPPA
03D9        ; PVALID      # GREEK SMALL LETTER ARCHAIC KOPPA
03DA        ; DISALLOWED  # GREEK LETTER STIGMA
03DB        ; PVALID      # GREEK SMALL LETTER STIGMA
03DC        ; DISALLOWED  # GREEK LETTER DIGAMMA
03DD        ; PVALID      # GREEK SMALL LETTER DIGAMMA
03DE        ; DISALLOWED  # GREEK LETTER KOPPA
03DF        ; PVALID      # GREEK SMALL LETTER KOPPA
03E0        ; DISALLOWED  # GREEK LETTER SAMPI
03E1        ; PVALID      # GREEK SMALL LETTER SAMPI
03E2        ; DISALLOWED  # COPTIC CAPITAL LETTER SHEI
03E3        ; PVALID      # COPTIC SMALL LETTER SHEI
03E4        ; DISALLOWED  # COPTIC CAPITAL LETTER FEI
03E5        ; PVALID      # COPTIC SMALL LETTER FEI
03E6        ; DISALLOWED  # COPTIC CAPITAL LETTER KHEI
03E7        ; PVALID      # COPTIC SMALL LETTER KHEI
03E8        ; DISALLOWED  # COPTIC CAPITAL LETTER HORI
03E9        ; PVALID      # COPTIC SMALL LETTER HORI
03EA        ; DISALLOWED  # COPTIC CAPITAL LETTER GANGIA
03EB        ; PVALID      # COPTIC SMALL LETTER GANGIA
03EC        ; DISALLOWED  # COPTIC CAPITAL LETTER SHIMA
03ED        ; PVALID      # COPTIC SMALL LETTER SHIMA
03EE        ; DISALLOWED  # COPTIC CAPITAL LETTER DEI
03EF        ; PVALID      # COPTIC SMALL LETTER DEI
03F0..03F2  ; DISALLOWED  # GREEK KAPPA SYMBOL..GREEK LUNATE SIGMA SYMBO
03F3        ; PVALID      # GREEK LETTER YOT
03F4..03F7  ; DISALLOWED  # GREEK CAPITAL THETA SYMBOL..GREEK CAPITAL LE
03F8        ; PVALID      # GREEK SMALL LETTER SHO
03F9..03FA  ; DISALLOWED  # GREEK CAPITAL LUNATE SIGMA SYMBOL..GREEK CAP
03FB..03FC  ; PVALID      # GREEK SMALL LETTER SAN..GREEK RHO WITH STROK
03FD..042F  ; DISALLOWED  # GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL..
0430..045F  ; PVALID      # CYRILLIC SMALL LETTER A..CYRILLIC SMALL LETT
0460        ; DISALLOWED  # CYRILLIC CAPITAL LETTER OMEGA
0461        ; PVALID      # CYRILLIC SMALL LETTER OMEGA
0462        ; DISALLOWED  # CYRILLIC CAPITAL LETTER YAT
0463        ; PVALID      # CYRILLIC SMALL LETTER YAT
0464        ; DISALLOWED  # CYRILLIC CAPITAL LETTER IOTIFIED E
0465        ; PVALID      # CYRILLIC SMALL LETTER IOTIFIED E
0466        ; DISALLOWED  # CYRILLIC CAPITAL LETTER LITTLE YUS
0467        ; PVALID      # CYRILLIC SMALL LETTER LITTLE YUS
0468        ; DISALLOWED  # CYRILLIC CAPITAL LETTER IOTIFIED LITTLE YUS
0469        ; PVALID      # CYRILLIC SMALL LETTER IOTIFIED LITTLE YUS
046A        ; DISALLOWED  # CYRILLIC CAPITAL LETTER BIG YUS
046B        ; PVALID      # CYRILLIC SMALL LETTER BIG YUS
046C        ; DISALLOWED  # CYRILLIC CAPITAL LETTER IOTIFIED BIG YUS
046D        ; PVALID      # CYRILLIC SMALL LETTER IOTIFIED BIG YUS
046E        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KSI
046F        ; PVALID      # CYRILLIC SMALL LETTER KSI
0470        ; DISALLOWED  # CYRILLIC CAPITAL LETTER PSI
0471        ; PVALID      # CYRILLIC SMALL LETTER PSI
0472        ; DISALLOWED  # CYRILLIC CAPITAL LETTER FITA
0473        ; PVALID      # CYRILLIC SMALL LETTER FITA
0474        ; DISALLOWED  # CYRILLIC CAPITAL LETTER IZHITSA
0475        ; PVALID      # CYRILLIC SMALL LETTER IZHITSA
0476        ; DISALLOWED  # CYRILLIC CAPITAL LETTER IZHITSA WITH DOUBLE
0477        ; PVALID      # CYRILLIC SMALL LETTER IZHITSA WITH DOUBLE GR
0478        ; DISALLOWED  # CYRILLIC CAPITAL LETTER UK
0479        ; PVALID      # CYRILLIC SMALL LETTER UK
047A        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ROUND OMEGA
047B        ; PVALID      # CYRILLIC SMALL LETTER ROUND OMEGA
047C        ; DISALLOWED  # CYRILLIC CAPITAL LETTER OMEGA WITH TITLO
047D        ; PVALID      # CYRILLIC SMALL LETTER OMEGA WITH TITLO
047E        ; DISALLOWED  # CYRILLIC CAPITAL LETTER OT
047F        ; PVALID      # CYRILLIC SMALL LETTER OT
0480        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOPPA
0481        ; PVALID      # CYRILLIC SMALL LETTER KOPPA
0482        ; DISALLOWED  # CYRILLIC THOUSANDS SIGN
0483        ; CONTEXTO    # COMBINING CYRILLIC TITLO
0484..0487  ; PVALID      # COMBINING CYRILLIC PALATALIZATION..COMBINING
0488..048A  ; DISALLOWED  # COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..C
048B        ; PVALID      # CYRILLIC SMALL LETTER SHORT I WITH TAIL
048C        ; DISALLOWED  # CYRILLIC CAPITAL LETTER SEMISOFT SIGN
048D        ; PVALID      # CYRILLIC SMALL LETTER SEMISOFT SIGN
048E        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ER WITH TICK
048F        ; PVALID      # CYRILLIC SMALL LETTER ER WITH TICK
0490        ; DISALLOWED  # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
0491        ; PVALID      # CYRILLIC SMALL LETTER GHE WITH UPTURN
0492        ; DISALLOWED  # CYRILLIC CAPITAL LETTER GHE WITH STROKE
0493        ; PVALID      # CYRILLIC SMALL LETTER GHE WITH STROKE
0494        ; DISALLOWED  # CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK
0495        ; PVALID      # CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK
0496        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER
0497        ; PVALID      # CYRILLIC SMALL LETTER ZHE WITH DESCENDER
0498        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ZE WITH DESCENDER
0499        ; PVALID      # CYRILLIC SMALL LETTER ZE WITH DESCENDER
049A        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KA WITH DESCENDER
049B        ; PVALID      # CYRILLIC SMALL LETTER KA WITH DESCENDER
049C        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KA WITH VERTICAL STR
049D        ; PVALID      # CYRILLIC SMALL LETTER KA WITH VERTICAL STROK
049E        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KA WITH STROKE
049F        ; PVALID      # CYRILLIC SMALL LETTER KA WITH STROKE
04A0        ; DISALLOWED  # CYRILLIC CAPITAL LETTER BASHKIR KA
04A1        ; PVALID      # CYRILLIC SMALL LETTER BASHKIR KA
04A2        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EN WITH DESCENDER
04A3        ; PVALID      # CYRILLIC SMALL LETTER EN WITH DESCENDER
04A4        ; DISALLOWED  # CYRILLIC CAPITAL LIGATURE EN GHE
04A5        ; PVALID      # CYRILLIC SMALL LIGATURE EN GHE
04A6        ; DISALLOWED  # CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK
04A7        ; PVALID      # CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK
04A8        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ABKHASIAN HA
04A9        ; PVALID      # CYRILLIC SMALL LETTER ABKHASIAN HA
04AA        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ES WITH DESCENDER
04AB        ; PVALID      # CYRILLIC SMALL LETTER ES WITH DESCENDER
04AC        ; DISALLOWED  # CYRILLIC CAPITAL LETTER TE WITH DESCENDER
04AD        ; PVALID      # CYRILLIC SMALL LETTER TE WITH DESCENDER
04AE        ; DISALLOWED  # CYRILLIC CAPITAL LETTER STRAIGHT U
04AF        ; PVALID      # CYRILLIC SMALL LETTER STRAIGHT U
04B0        ; DISALLOWED  # CYRILLIC CAPITAL LETTER STRAIGHT U WITH STRO
04B1        ; PVALID      # CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE
04B2        ; DISALLOWED  # CYRILLIC CAPITAL LETTER HA WITH DESCENDER
04B3        ; PVALID      # CYRILLIC SMALL LETTER HA WITH DESCENDER
04B4        ; DISALLOWED  # CYRILLIC CAPITAL LIGATURE TE TSE
04B5        ; PVALID      # CYRILLIC SMALL LIGATURE TE TSE
04B6        ; DISALLOWED  # CYRILLIC CAPITAL LETTER CHE WITH DESCENDER
04B7        ; PVALID      # CYRILLIC SMALL LETTER CHE WITH DESCENDER
04B8        ; DISALLOWED  # CYRILLIC CAPITAL LETTER CHE WITH VERTICAL ST
04B9        ; PVALID      # CYRILLIC SMALL LETTER CHE WITH VERTICAL STRO
04BA        ; DISALLOWED  # CYRILLIC CAPITAL LETTER SHHA
04BB        ; PVALID      # CYRILLIC SMALL LETTER SHHA
04BC        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ABKHASIAN CHE
04BD        ; PVALID      # CYRILLIC SMALL LETTER ABKHASIAN CHE
04BE        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH D
04BF        ; PVALID      # CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DES
04C0..04C1  ; DISALLOWED  # CYRILLIC LETTER PALOCHKA..CYRILLIC CAPITAL L
04C2        ; PVALID      # CYRILLIC SMALL LETTER ZHE WITH BREVE
04C3        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KA WITH HOOK
04C4        ; PVALID      # CYRILLIC SMALL LETTER KA WITH HOOK
04C5        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EL WITH TAIL
04C6        ; PVALID      # CYRILLIC SMALL LETTER EL WITH TAIL
04C7        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EN WITH HOOK
04C8        ; PVALID      # CYRILLIC SMALL LETTER EN WITH HOOK
04C9        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EN WITH TAIL
04CA        ; PVALID      # CYRILLIC SMALL LETTER EN WITH TAIL
04CB        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KHAKASSIAN CHE
04CC        ; PVALID      # CYRILLIC SMALL LETTER KHAKASSIAN CHE
04CD        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EM WITH TAIL
04CE..04CF  ; PVALID      # CYRILLIC SMALL LETTER EM WITH TAIL..CYRILLIC
04D0        ; DISALLOWED  # CYRILLIC CAPITAL LETTER A WITH BREVE
04D1        ; PVALID      # CYRILLIC SMALL LETTER A WITH BREVE
04D2        ; DISALLOWED  # CYRILLIC CAPITAL LETTER A WITH DIAERESIS
04D3        ; PVALID      # CYRILLIC SMALL LETTER A WITH DIAERESIS
04D4        ; DISALLOWED  # CYRILLIC CAPITAL LIGATURE A IE
04D5        ; PVALID      # CYRILLIC SMALL LIGATURE A IE
04D6        ; DISALLOWED  # CYRILLIC CAPITAL LETTER IE WITH BREVE
04D7        ; PVALID      # CYRILLIC SMALL LETTER IE WITH BREVE
04D8        ; DISALLOWED  # CYRILLIC CAPITAL LETTER SCHWA
04D9        ; PVALID      # CYRILLIC SMALL LETTER SCHWA
04DA        ; DISALLOWED  # CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS
04DB        ; PVALID      # CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS
04DC        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS
04DD        ; PVALID      # CYRILLIC SMALL LETTER ZHE WITH DIAERESIS
04DE        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS
04DF        ; PVALID      # CYRILLIC SMALL LETTER ZE WITH DIAERESIS
04E0        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ABKHASIAN DZE
04E1        ; PVALID      # CYRILLIC SMALL LETTER ABKHASIAN DZE
04E2        ; DISALLOWED  # CYRILLIC CAPITAL LETTER I WITH MACRON
04E3        ; PVALID      # CYRILLIC SMALL LETTER I WITH MACRON
04E4        ; DISALLOWED  # CYRILLIC CAPITAL LETTER I WITH DIAERESIS
04E5        ; PVALID      # CYRILLIC SMALL LETTER I WITH DIAERESIS
04E6        ; DISALLOWED  # CYRILLIC CAPITAL LETTER O WITH DIAERESIS
04E7        ; PVALID      # CYRILLIC SMALL LETTER O WITH DIAERESIS
04E8        ; DISALLOWED  # CYRILLIC CAPITAL LETTER BARRED O
04E9        ; PVALID      # CYRILLIC SMALL LETTER BARRED O
04EA        ; DISALLOWED  # CYRILLIC CAPITAL LETTER BARRED O WITH DIAERE
04EB        ; PVALID      # CYRILLIC SMALL LETTER BARRED O WITH DIAERESI
04EC        ; DISALLOWED  # CYRILLIC CAPITAL LETTER E WITH DIAERESIS
04ED        ; PVALID      # CYRILLIC SMALL LETTER E WITH DIAERESIS
04EE        ; DISALLOWED  # CYRILLIC CAPITAL LETTER U WITH MACRON
04EF        ; PVALID      # CYRILLIC SMALL LETTER U WITH MACRON
04F0        ; DISALLOWED  # CYRILLIC CAPITAL LETTER U WITH DIAERESIS
04F1        ; PVALID      # CYRILLIC SMALL LETTER U WITH DIAERESIS
04F2        ; DISALLOWED  # CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE
04F3        ; PVALID      # CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE
04F4        ; DISALLOWED  # CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS
04F5        ; PVALID      # CYRILLIC SMALL LETTER CHE WITH DIAERESIS
04F6        ; DISALLOWED  # CYRILLIC CAPITAL LETTER GHE WITH DESCENDER
04F7        ; PVALID      # CYRILLIC SMALL LETTER GHE WITH DESCENDER
04F8        ; DISALLOWED  # CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS
04F9        ; PVALID      # CYRILLIC SMALL LETTER YERU WITH DIAERESIS
04FA        ; DISALLOWED  # CYRILLIC CAPITAL LETTER GHE WITH STROKE AND
04FB        ; PVALID      # CYRILLIC SMALL LETTER GHE WITH STROKE AND HO
04FC        ; DISALLOWED  # CYRILLIC CAPITAL LETTER HA WITH HOOK
04FD        ; PVALID      # CYRILLIC SMALL LETTER HA WITH HOOK
04FE        ; DISALLOWED  # CYRILLIC CAPITAL LETTER HA WITH STROKE
04FF        ; PVALID      # CYRILLIC SMALL LETTER HA WITH STROKE
0500        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI DE
0501        ; PVALID      # CYRILLIC SMALL LETTER KOMI DE
0502        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI DJE
0503        ; PVALID      # CYRILLIC SMALL LETTER KOMI DJE
0504        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI ZJE
0505        ; PVALID      # CYRILLIC SMALL LETTER KOMI ZJE
0506        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI DZJE
0507        ; PVALID      # CYRILLIC SMALL LETTER KOMI DZJE
0508        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI LJE
0509        ; PVALID      # CYRILLIC SMALL LETTER KOMI LJE
050A        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI NJE
050B        ; PVALID      # CYRILLIC SMALL LETTER KOMI NJE
050C        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI SJE
050D        ; PVALID      # CYRILLIC SMALL LETTER KOMI SJE
050E        ; DISALLOWED  # CYRILLIC CAPITAL LETTER KOMI TJE
050F        ; PVALID      # CYRILLIC SMALL LETTER KOMI TJE
0510        ; DISALLOWED  # CYRILLIC CAPITAL LETTER REVERSED ZE
0511        ; PVALID      # CYRILLIC SMALL LETTER REVERSED ZE
0512        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EL WITH HOOK
0513        ; PVALID      # CYRILLIC SMALL LETTER EL WITH HOOK
0514        ; DISALLOWED  # CYRILLIC CAPITAL LETTER LHA
0515        ; PVALID      # CYRILLIC SMALL LETTER LHA
0516        ; DISALLOWED  # CYRILLIC CAPITAL LETTER RHA
0517        ; PVALID      # CYRILLIC SMALL LETTER RHA
0518        ; DISALLOWED  # CYRILLIC CAPITAL LETTER YAE
0519        ; PVALID      # CYRILLIC SMALL LETTER YAE
051A        ; DISALLOWED  # CYRILLIC CAPITAL LETTER QA
051B        ; PVALID      # CYRILLIC SMALL LETTER QA
051C        ; DISALLOWED  # CYRILLIC CAPITAL LETTER WE
051D        ; PVALID      # CYRILLIC SMALL LETTER WE
051E        ; DISALLOWED  # CYRILLIC CAPITAL LETTER ALEUT KA
051F        ; PVALID      # CYRILLIC SMALL LETTER ALEUT KA
0520        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK
0521        ; PVALID      # CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK
0522        ; DISALLOWED  # CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK
0523        ; PVALID      # CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK
0524..0530  ; UNASSIGNED  # <reserved>..<reserved>
0531..0556  ; DISALLOWED  # ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITA
0557..0558  ; UNASSIGNED  # <reserved>..<reserved>
0559        ; PVALID      # ARMENIAN MODIFIER LETTER LEFT HALF RING
055A..055F  ; DISALLOWED  # ARMENIAN APOSTROPHE..ARMENIAN ABBREVIATION M
0560        ; UNASSIGNED  # <reserved>
0561..0586  ; PVALID      # ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LE
0587        ; DISALLOWED  # ARMENIAN SMALL LIGATURE ECH YIWN
0588        ; UNASSIGNED  # <reserved>
0589..058A  ; DISALLOWED  # ARMENIAN FULL STOP..ARMENIAN HYPHEN
058B..0590  ; UNASSIGNED  # <reserved>..<reserved>
0591..05BD  ; PVALID      # HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
05BE        ; DISALLOWED  # HEBREW PUNCTUATION MAQAF
05BF        ; PVALID      # HEBREW POINT RAFE
05C0        ; DISALLOWED  # HEBREW PUNCTUATION PASEQ
05C1..05C2  ; PVALID      # HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
05C3        ; DISALLOWED  # HEBREW PUNCTUATION SOF PASUQ
05C4..05C5  ; PVALID      # HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
05C6        ; DISALLOWED  # HEBREW PUNCTUATION NUN HAFUKHA
05C7        ; PVALID      # HEBREW POINT QAMATS QATAN
05C8..05CF  ; UNASSIGNED  # <reserved>..<reserved>
05D0..05EA  ; PVALID      # HEBREW LETTER ALEF..HEBREW LETTER TAV
05EB..05EF  ; UNASSIGNED  # <reserved>..<reserved>
05F0..05F2  ; PVALID      # HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW L
05F3..05F4  ; CONTEXTO    # HEBREW PUNCTUATION GERESH..HEBREW PUNCTUATIO
05F5..05FF  ; UNASSIGNED  # <reserved>..<reserved>
0600..0603  ; DISALLOWED  # ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
0604..0605  ; UNASSIGNED  # <reserved>..<reserved>
0606..060F  ; DISALLOWED  # ARABIC-INDIC CUBE ROOT..ARABIC SIGN MISRA
0610..061A  ; PVALID      # ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..AR
061B        ; DISALLOWED  # ARABIC SEMICOLON
061C..061D  ; UNASSIGNED  # <reserved>..<reserved>
061E..061F  ; DISALLOWED  # ARABIC TRIPLE DOT PUNCTUATION MARK..ARABIC Q
0620        ; UNASSIGNED  # <reserved>
0621..065E  ; PVALID      # ARABIC LETTER HAMZA..ARABIC FATHA WITH TWO D
065F        ; UNASSIGNED  # <reserved>
0660..0669  ; PVALID      # ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT
066A..066D  ; DISALLOWED  # ARABIC PERCENT SIGN..ARABIC FIVE POINTED STA
066E..0674  ; PVALID      # ARABIC LETTER DOTLESS BEH..ARABIC LETTER HIG
0675..0678  ; DISALLOWED  # ARABIC LETTER HIGH HAMZA ALEF..ARABIC LETTER
0679..06D3  ; PVALID      # ARABIC LETTER TTEH..ARABIC LETTER YEH BARREE
06D4        ; DISALLOWED  # ARABIC FULL STOP
06D5..06DC  ; PVALID      # ARABIC LETTER AE..ARABIC SMALL HIGH SEEN
06DD..06DE  ; DISALLOWED  # ARABIC END OF AYAH..ARABIC START OF RUB EL H
06DF..06E8  ; PVALID      # ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL
06E9        ; DISALLOWED  # ARABIC PLACE OF SAJDAH
06EA..06FF  ; PVALID      # ARABIC EMPTY CENTRE LOW STOP..ARABIC LETTER
0700..070D  ; DISALLOWED  # SYRIAC END OF PARAGRAPH..SYRIAC HARKLEAN AST
070E        ; UNASSIGNED  # <reserved>
070F        ; DISALLOWED  # SYRIAC ABBREVIATION MARK
0710..074A  ; PVALID      # SYRIAC LETTER ALAPH..SYRIAC BARREKH
074B..074C  ; UNASSIGNED  # <reserved>..<reserved>
074D..07B1  ; PVALID      # SYRIAC LETTER SOGDIAN ZHAIN..THAANA LETTER N
07B2..07BF  ; UNASSIGNED  # <reserved>..<reserved>
07C0..07F5  ; PVALID      # NKO DIGIT ZERO..NKO LOW TONE APOSTROPHE
07F6..07F9  ; DISALLOWED  # NKO SYMBOL OO DENNEN..NKO EXCLAMATION MARK
07FA        ; PVALID      # NKO LAJANYALAN
07FB..0900  ; UNASSIGNED  # <reserved>..<reserved>
0901..0939  ; PVALID      # DEVANAGARI SIGN CANDRABINDU..DEVANAGARI LETT
093A..093B  ; UNASSIGNED  # <reserved>..<reserved>
093C..094D  ; PVALID      # DEVANAGARI SIGN NUKTA..DEVANAGARI SIGN VIRAM
094E..094F  ; UNASSIGNED  # <reserved>..<reserved>
0950..0954  ; PVALID      # DEVANAGARI OM..DEVANAGARI ACUTE ACCENT
0955..0957  ; UNASSIGNED  # <reserved>..<reserved>
0958..095F  ; DISALLOWED  # DEVANAGARI LETTER QA..DEVANAGARI LETTER YYA
0960..0963  ; PVALID      # DEVANAGARI LETTER VOCALIC RR..DEVANAGARI VOW
0964..0965  ; DISALLOWED  # DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
0966..096F  ; PVALID      # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
0970        ; DISALLOWED  # DEVANAGARI ABBREVIATION SIGN
0971..0972  ; PVALID      # DEVANAGARI SIGN HIGH SPACING DOT..DEVANAGARI
0973..097A  ; UNASSIGNED  # <reserved>..<reserved>
097B..097F  ; PVALID      # DEVANAGARI LETTER GGA..DEVANAGARI LETTER BBA
0980        ; UNASSIGNED  # <reserved>
0981..0983  ; PVALID      # BENGALI SIGN CANDRABINDU..BENGALI SIGN VISAR
0984        ; UNASSIGNED  # <reserved>
0985..098C  ; PVALID      # BENGALI LETTER A..BENGALI LETTER VOCALIC L
098D..098E  ; UNASSIGNED  # <reserved>..<reserved>
098F..0990  ; PVALID      # BENGALI LETTER E..BENGALI LETTER AI
0991..0992  ; UNASSIGNED  # <reserved>..<reserved>
0993..09A8  ; PVALID      # BENGALI LETTER O..BENGALI LETTER NA
09A9        ; UNASSIGNED  # <reserved>
09AA..09B0  ; PVALID      # BENGALI LETTER PA..BENGALI LETTER RA
09B1        ; UNASSIGNED  # <reserved>
09B2        ; PVALID      # BENGALI LETTER LA
09B3..09B5  ; UNASSIGNED  # <reserved>..<reserved>
09B6..09B9  ; PVALID      # BENGALI LETTER SHA..BENGALI LETTER HA
09BA..09BB  ; UNASSIGNED  # <reserved>..<reserved>
09BC..09C4  ; PVALID      # BENGALI SIGN NUKTA..BENGALI VOWEL SIGN VOCAL
09C5..09C6  ; UNASSIGNED  # <reserved>..<reserved>
09C7..09C8  ; PVALID      # BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
09C9..09CA  ; UNASSIGNED  # <reserved>..<reserved>
09CB..09CE  ; PVALID      # BENGALI VOWEL SIGN O..BENGALI LETTER KHANDA
09CF..09D6  ; UNASSIGNED  # <reserved>..<reserved>
09D7        ; PVALID      # BENGALI AU LENGTH MARK
09D8..09DB  ; UNASSIGNED  # <reserved>..<reserved>
09DC..09DD  ; DISALLOWED  # BENGALI LETTER RRA..BENGALI LETTER RHA
09DE        ; UNASSIGNED  # <reserved>
09DF        ; DISALLOWED  # BENGALI LETTER YYA
09E0..09E3  ; PVALID      # BENGALI LETTER VOCALIC RR..BENGALI VOWEL SIG
09E4..09E5  ; UNASSIGNED  # <reserved>..<reserved>
09E6..09F1  ; PVALID      # BENGALI DIGIT ZERO..BENGALI LETTER RA WITH L
09F2..09FA  ; DISALLOWED  # BENGALI RUPEE MARK..BENGALI ISSHAR
09FB..0A00  ; UNASSIGNED  # <reserved>..<reserved>
0A01..0A03  ; PVALID      # GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN VISA
0A04        ; UNASSIGNED  # <reserved>
0A05..0A0A  ; PVALID      # GURMUKHI LETTER A..GURMUKHI LETTER UU
0A0B..0A0E  ; UNASSIGNED  # <reserved>..<reserved>
0A0F..0A10  ; PVALID      # GURMUKHI LETTER EE..GURMUKHI LETTER AI
0A11..0A12  ; UNASSIGNED  # <reserved>..<reserved>
0A13..0A28  ; PVALID      # GURMUKHI LETTER OO..GURMUKHI LETTER NA
0A29        ; UNASSIGNED  # <reserved>
0A2A..0A30  ; PVALID      # GURMUKHI LETTER PA..GURMUKHI LETTER RA
0A31        ; UNASSIGNED  # <reserved>
0A32        ; PVALID      # GURMUKHI LETTER LA
0A33        ; DISALLOWED  # GURMUKHI LETTER LLA
0A34        ; UNASSIGNED  # <reserved>
0A35        ; PVALID      # GURMUKHI LETTER VA
0A36        ; DISALLOWED  # GURMUKHI LETTER SHA
0A37        ; UNASSIGNED  # <reserved>
0A38..0A39  ; PVALID      # GURMUKHI LETTER SA..GURMUKHI LETTER HA
0A3A..0A3B  ; UNASSIGNED  # <reserved>..<reserved>
0A3C        ; PVALID      # GURMUKHI SIGN NUKTA
0A3D        ; UNASSIGNED  # <reserved>
0A3E..0A42  ; PVALID      # GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN
0A43..0A46  ; UNASSIGNED  # <reserved>..<reserved>
0A47..0A48  ; PVALID      # GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN
0A49..0A4A  ; UNASSIGNED  # <reserved>..<reserved>
0A4B..0A4D  ; PVALID      # GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA
0A4E..0A50  ; UNASSIGNED  # <reserved>..<reserved>
0A51        ; PVALID      # GURMUKHI SIGN UDAAT
0A52..0A58  ; UNASSIGNED  # <reserved>..<reserved>
0A59..0A5B  ; DISALLOWED  # GURMUKHI LETTER KHHA..GURMUKHI LETTER ZA
0A5C        ; PVALID      # GURMUKHI LETTER RRA
0A5D        ; UNASSIGNED  # <reserved>
0A5E        ; DISALLOWED  # GURMUKHI LETTER FA
0A5F..0A65  ; UNASSIGNED  # <reserved>..<reserved>
0A66..0A75  ; PVALID      # GURMUKHI DIGIT ZERO..GURMUKHI SIGN YAKASH
0A76..0A80  ; UNASSIGNED  # <reserved>..<reserved>
0A81..0A83  ; PVALID      # GUJARATI SIGN CANDRABINDU..GUJARATI SIGN VIS
0A84        ; UNASSIGNED  # <reserved>
0A85..0A8D  ; PVALID      # GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
0A8E        ; UNASSIGNED  # <reserved>
0A8F..0A91  ; PVALID      # GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
0A92        ; UNASSIGNED  # <reserved>
0A93..0AA8  ; PVALID      # GUJARATI LETTER O..GUJARATI LETTER NA
0AA9        ; UNASSIGNED  # <reserved>
0AAA..0AB0  ; PVALID      # GUJARATI LETTER PA..GUJARATI LETTER RA
0AB1        ; UNASSIGNED  # <reserved>
0AB2..0AB3  ; PVALID      # GUJARATI LETTER LA..GUJARATI LETTER LLA
0AB4        ; UNASSIGNED  # <reserved>
0AB5..0AB9  ; PVALID      # GUJARATI LETTER VA..GUJARATI LETTER HA
0ABA..0ABB  ; UNASSIGNED  # <reserved>..<reserved>
0ABC..0AC5  ; PVALID      # GUJARATI SIGN NUKTA..GUJARATI VOWEL SIGN CAN
0AC6        ; UNASSIGNED  # <reserved>
0AC7..0AC9  ; PVALID      # GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN C
0ACA        ; UNASSIGNED  # <reserved>
0ACB..0ACD  ; PVALID      # GUJARATI VOWEL SIGN O..GUJARATI SIGN VIRAMA
0ACE..0ACF  ; UNASSIGNED  # <reserved>..<reserved>
0AD0        ; PVALID      # GUJARATI OM
0AD1..0ADF  ; UNASSIGNED  # <reserved>..<reserved>
0AE0..0AE3  ; PVALID      # GUJARATI LETTER VOCALIC RR..GUJARATI VOWEL S
0AE4..0AE5  ; UNASSIGNED  # <reserved>..<reserved>
0AE6..0AEF  ; PVALID      # GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE
0AF0        ; UNASSIGNED  # <reserved>
0AF1        ; DISALLOWED  # GUJARATI RUPEE SIGN
0AF2..0B00  ; UNASSIGNED  # <reserved>..<reserved>
0B01..0B03  ; PVALID      # ORIYA SIGN CANDRABINDU..ORIYA SIGN VISARGA
0B04        ; UNASSIGNED  # <reserved>
0B05..0B0C  ; PVALID      # ORIYA LETTER A..ORIYA LETTER VOCALIC L
0B0D..0B0E  ; UNASSIGNED  # <reserved>..<reserved>
0B0F..0B10  ; PVALID      # ORIYA LETTER E..ORIYA LETTER AI
0B11..0B12  ; UNASSIGNED  # <reserved>..<reserved>
0B13..0B28  ; PVALID      # ORIYA LETTER O..ORIYA LETTER NA
0B29        ; UNASSIGNED  # <reserved>
0B2A..0B30  ; PVALID      # ORIYA LETTER PA..ORIYA LETTER RA
0B31        ; UNASSIGNED  # <reserved>
0B32..0B33  ; PVALID      # ORIYA LETTER LA..ORIYA LETTER LLA
0B34        ; UNASSIGNED  # <reserved>
0B35..0B39  ; PVALID      # ORIYA LETTER VA..ORIYA LETTER HA
0B3A..0B3B  ; UNASSIGNED  # <reserved>..<reserved>
0B3C..0B44  ; PVALID      # ORIYA SIGN NUKTA..ORIYA VOWEL SIGN VOCALIC R
0B45..0B46  ; UNASSIGNED  # <reserved>..<reserved>
0B47..0B48  ; PVALID      # ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
0B49..0B4A  ; UNASSIGNED  # <reserved>..<reserved>
0B4B..0B4D  ; PVALID      # ORIYA VOWEL SIGN O..ORIYA SIGN VIRAMA
0B4E..0B55  ; UNASSIGNED  # <reserved>..<reserved>
0B56..0B57  ; PVALID      # ORIYA AI LENGTH MARK..ORIYA AU LENGTH MARK
0B58..0B5B  ; UNASSIGNED  # <reserved>..<reserved>
0B5C..0B5D  ; DISALLOWED  # ORIYA LETTER RRA..ORIYA LETTER RHA
0B5E        ; UNASSIGNED  # <reserved>
0B5F..0B63  ; PVALID      # ORIYA LETTER YYA..ORIYA VOWEL SIGN VOCALIC L
0B64..0B65  ; UNASSIGNED  # <reserved>..<reserved>
0B66..0B6F  ; PVALID      # ORIYA DIGIT ZERO..ORIYA DIGIT NINE
0B70        ; DISALLOWED  # ORIYA ISSHAR
0B71        ; PVALID      # ORIYA LETTER WA
0B72..0B81  ; UNASSIGNED  # <reserved>..<reserved>
0B82..0B83  ; PVALID      # TAMIL SIGN ANUSVARA..TAMIL SIGN VISARGA
0B84        ; UNASSIGNED  # <reserved>
0B85..0B8A  ; PVALID      # TAMIL LETTER A..TAMIL LETTER UU
0B8B..0B8D  ; UNASSIGNED  # <reserved>..<reserved>
0B8E..0B90  ; PVALID      # TAMIL LETTER E..TAMIL LETTER AI
0B91        ; UNASSIGNED  # <reserved>
0B92..0B95  ; PVALID      # TAMIL LETTER O..TAMIL LETTER KA
0B96..0B98  ; UNASSIGNED  # <reserved>..<reserved>
0B99..0B9A  ; PVALID      # TAMIL LETTER NGA..TAMIL LETTER CA
0B9B        ; UNASSIGNED  # <reserved>
0B9C        ; PVALID      # TAMIL LETTER JA
0B9D        ; UNASSIGNED  # <reserved>
0B9E..0B9F  ; PVALID      # TAMIL LETTER NYA..TAMIL LETTER TTA
0BA0..0BA2  ; UNASSIGNED  # <reserved>..<reserved>
0BA3..0BA4  ; PVALID      # TAMIL LETTER NNA..TAMIL LETTER TA
0BA5..0BA7  ; UNASSIGNED  # <reserved>..<reserved>
0BA8..0BAA  ; PVALID      # TAMIL LETTER NA..TAMIL LETTER PA
0BAB..0BAD  ; UNASSIGNED  # <reserved>..<reserved>
0BAE..0BB9  ; PVALID      # TAMIL LETTER MA..TAMIL LETTER HA
0BBA..0BBD  ; UNASSIGNED  # <reserved>..<reserved>
0BBE..0BC2  ; PVALID      # TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN UU
0BC3..0BC5  ; UNASSIGNED  # <reserved>..<reserved>
0BC6..0BC8  ; PVALID      # TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
0BC9        ; UNASSIGNED  # <reserved>
0BCA..0BCD  ; PVALID      # TAMIL VOWEL SIGN O..TAMIL SIGN VIRAMA
0BCE..0BCF  ; UNASSIGNED  # <reserved>..<reserved>
0BD0        ; PVALID      # TAMIL OM
0BD1..0BD6  ; UNASSIGNED  # <reserved>..<reserved>
0BD7        ; PVALID      # TAMIL AU LENGTH MARK
0BD8..0BE5  ; UNASSIGNED  # <reserved>..<reserved>
0BE6..0BEF  ; PVALID      # TAMIL DIGIT ZERO..TAMIL DIGIT NINE
0BF0..0BFA  ; DISALLOWED  # TAMIL NUMBER TEN..TAMIL NUMBER SIGN
0BFB..0C00  ; UNASSIGNED  # <reserved>..<reserved>
0C01..0C03  ; PVALID      # TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
0C04        ; UNASSIGNED  # <reserved>
0C05..0C0C  ; PVALID      # TELUGU LETTER A..TELUGU LETTER VOCALIC L
0C0D        ; UNASSIGNED  # <reserved>
0C0E..0C10  ; PVALID      # TELUGU LETTER E..TELUGU LETTER AI
0C11        ; UNASSIGNED  # <reserved>
0C12..0C28  ; PVALID      # TELUGU LETTER O..TELUGU LETTER NA
0C29        ; UNASSIGNED  # <reserved>
0C2A..0C33  ; PVALID      # TELUGU LETTER PA..TELUGU LETTER LLA
0C34        ; UNASSIGNED  # <reserved>
0C35..0C39  ; PVALID      # TELUGU LETTER VA..TELUGU LETTER HA
0C3A..0C3C  ; UNASSIGNED  # <reserved>..<reserved>
0C3D..0C44  ; PVALID      # TELUGU SIGN AVAGRAHA..TELUGU VOWEL SIGN VOCA
0C45        ; UNASSIGNED  # <reserved>
0C46..0C48  ; PVALID      # TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
0C49        ; UNASSIGNED  # <reserved>
0C4A..0C4D  ; PVALID      # TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA
0C4E..0C54  ; UNASSIGNED  # <reserved>..<reserved>
0C55..0C56  ; PVALID      # TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
0C57        ; UNASSIGNED  # <reserved>
0C58..0C59  ; PVALID      # TELUGU LETTER TSA..TELUGU LETTER DZA
0C5A..0C5F  ; UNASSIGNED  # <reserved>..<reserved>
0C60..0C63  ; PVALID      # TELUGU LETTER VOCALIC RR..TELUGU VOWEL SIGN
0C64..0C65  ; UNASSIGNED  # <reserved>..<reserved>
0C66..0C6F  ; PVALID      # TELUGU DIGIT ZERO..TELUGU DIGIT NINE
0C70..0C77  ; UNASSIGNED  # <reserved>..<reserved>
0C78..0C7F  ; DISALLOWED  # TELUGU FRACTION DIGIT ZERO FOR ODD POWERS OF
0C80..0C81  ; UNASSIGNED  # <reserved>..<reserved>
0C82..0C83  ; PVALID      # KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
0C84        ; UNASSIGNED  # <reserved>
0C85..0C8C  ; PVALID      # KANNADA LETTER A..KANNADA LETTER VOCALIC L
0C8D        ; UNASSIGNED  # <reserved>
0C8E..0C90  ; PVALID      # KANNADA LETTER E..KANNADA LETTER AI
0C91        ; UNASSIGNED  # <reserved>
0C92..0CA8  ; PVALID      # KANNADA LETTER O..KANNADA LETTER NA
0CA9        ; UNASSIGNED  # <reserved>
0CAA..0CB3  ; PVALID      # KANNADA LETTER PA..KANNADA LETTER LLA
0CB4        ; UNASSIGNED  # <reserved>
0CB5..0CB9  ; PVALID      # KANNADA LETTER VA..KANNADA LETTER HA
0CBA..0CBB  ; UNASSIGNED  # <reserved>..<reserved>
0CBC..0CC4  ; PVALID      # KANNADA SIGN NUKTA..KANNADA VOWEL SIGN VOCAL
0CC5        ; UNASSIGNED  # <reserved>
0CC6..0CC8  ; PVALID      # KANNADA VOWEL SIGN E..KANNADA VOWEL SIGN AI
0CC9        ; UNASSIGNED  # <reserved>
0CCA..0CCD  ; PVALID      # KANNADA VOWEL SIGN O..KANNADA SIGN VIRAMA
0CCE..0CD4  ; UNASSIGNED  # <reserved>..<reserved>
0CD5..0CD6  ; PVALID      # KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
0CD7..0CDD  ; UNASSIGNED  # <reserved>..<reserved>
0CDE        ; PVALID      # KANNADA LETTER FA
0CDF        ; UNASSIGNED  # <reserved>
0CE0..0CE3  ; PVALID      # KANNADA LETTER VOCALIC RR..KANNADA VOWEL SIG
0CE4..0CE5  ; UNASSIGNED  # <reserved>..<reserved>
0CE6..0CEF  ; PVALID      # KANNADA DIGIT ZERO..KANNADA DIGIT NINE
0CF0        ; UNASSIGNED  # <reserved>
0CF1..0CF2  ; DISALLOWED  # KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADH
0CF3..0D01  ; UNASSIGNED  # <reserved>..<reserved>
0D02..0D03  ; PVALID      # MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISA
0D04        ; UNASSIGNED  # <reserved>
0D05..0D0C  ; PVALID      # MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC
0D0D        ; UNASSIGNED  # <reserved>
0D0E..0D10  ; PVALID      # MALAYALAM LETTER E..MALAYALAM LETTER AI
0D11        ; UNASSIGNED  # <reserved>
0D12..0D28  ; PVALID      # MALAYALAM LETTER O..MALAYALAM LETTER NA
0D29        ; UNASSIGNED  # <reserved>
0D2A..0D39  ; PVALID      # MALAYALAM LETTER PA..MALAYALAM LETTER HA
0D3A..0D3C  ; UNASSIGNED  # <reserved>..<reserved>
0D3D..0D44  ; PVALID      # MALAYALAM SIGN AVAGRAHA..MALAYALAM VOWEL SIG
0D45        ; UNASSIGNED  # <reserved>
0D46..0D48  ; PVALID      # MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN
0D49        ; UNASSIGNED  # <reserved>
0D4A..0D4D  ; PVALID      # MALAYALAM VOWEL SIGN O..MALAYALAM SIGN VIRAM
0D4E..0D56  ; UNASSIGNED  # <reserved>..<reserved>
0D57        ; PVALID      # MALAYALAM AU LENGTH MARK
0D58..0D5F  ; UNASSIGNED  # <reserved>..<reserved>
0D60..0D63  ; PVALID      # MALAYALAM LETTER VOCALIC RR..MALAYALAM VOWEL
0D64..0D65  ; UNASSIGNED  # <reserved>..<reserved>
0D66..0D6F  ; PVALID      # MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE
0D70..0D75  ; DISALLOWED  # MALAYALAM NUMBER TEN..MALAYALAM FRACTION THR
0D76..0D78  ; UNASSIGNED  # <reserved>..<reserved>
0D79        ; DISALLOWED  # MALAYALAM DATE MARK
0D7A..0D7F  ; PVALID      # MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER
0D80..0D81  ; UNASSIGNED  # <reserved>..<reserved>
0D82..0D83  ; PVALID      # SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARG
0D84        ; UNASSIGNED  # <reserved>
0D85..0D96  ; PVALID      # SINHALA LETTER AYANNA..SINHALA LETTER AUYANN
0D97..0D99  ; UNASSIGNED  # <reserved>..<reserved>
0D9A..0DB1  ; PVALID      # SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA L
Internet-Draft IDNA Codepoints April 2008 0DB2 ; UNASSIGNED # <reserved> 0DB3..0DBB ; PVALID # SINHALA LETTER SANYAKA DAYANNA..SINHALA LETT 0DBC ; UNASSIGNED # <reserved> 0DBD ; PVALID # SINHALA LETTER DANTAJA LAYANNA 0DBE..0DBF ; UNASSIGNED # <reserved>..<reserved> 0DC0..0DC6 ; PVALID # SINHALA LETTER VAYANNA..SINHALA LETTER FAYAN 0DC7..0DC9 ; UNASSIGNED # <reserved>..<reserved> 0DCA ; PVALID # SINHALA SIGN AL-LAKUNA 0DCB..0DCE ; UNASSIGNED # <reserved>..<reserved> 0DCF..0DD4 ; PVALID # SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL 0DD5 ; UNASSIGNED # <reserved> 0DD6 ; PVALID # SINHALA VOWEL SIGN DIGA PAA-PILLA 0DD7 ; UNASSIGNED # <reserved> 0DD8..0DDF ; PVALID # SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOW 0DE0..0DF1 ; UNASSIGNED # <reserved>..<reserved> 0DF2..0DF3 ; PVALID # SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHAL 0DF4 ; DISALLOWED # SINHALA PUNCTUATION KUNDDALIYA 0DF5..0E00 ; UNASSIGNED # <reserved>..<reserved> 0E01..0E32 ; PVALID # THAI CHARACTER KO KAI..THAI CHARACTER SARA A 0E33 ; DISALLOWED # THAI CHARACTER SARA AM 0E34..0E3A ; PVALID # THAI CHARACTER SARA I..THAI CHARACTER PHINTH 0E3B..0E3E ; UNASSIGNED # <reserved>..<reserved> 0E3F ; DISALLOWED # THAI CURRENCY SYMBOL BAHT 0E40..0E4E ; PVALID # THAI CHARACTER SARA E..THAI CHARACTER YAMAKK 0E4F ; DISALLOWED # THAI CHARACTER FONGMAN 0E50..0E59 ; PVALID # THAI DIGIT ZERO..THAI DIGIT NINE 0E5A..0E5B ; DISALLOWED # THAI CHARACTER ANGKHANKHU..THAI CHARACTER KH 0E5C..0E80 ; UNASSIGNED # <reserved>..<reserved> 0E81..0E82 ; PVALID # LAO LETTER KO..LAO LETTER KHO SUNG 0E83 ; UNASSIGNED # <reserved> 0E84 ; PVALID # LAO LETTER KHO TAM 0E85..0E86 ; UNASSIGNED # <reserved>..<reserved> 0E87..0E88 ; PVALID # LAO LETTER NGO..LAO LETTER CO 0E89 ; UNASSIGNED # <reserved> 0E8A ; PVALID # LAO LETTER SO TAM 0E8B..0E8C ; UNASSIGNED # <reserved>..<reserved> 0E8D ; PVALID # LAO LETTER NYO 0E8E..0E93 ; UNASSIGNED # <reserved>..<reserved> 0E94..0E97 ; PVALID # LAO LETTER DO..LAO LETTER THO TAM 0E98 ; UNASSIGNED # <reserved> 
0E99..0E9F ; PVALID # LAO LETTER NO..LAO LETTER FO SUNG
0EA0 ; UNASSIGNED # <reserved>
0EA1..0EA3 ; PVALID # LAO LETTER MO..LAO LETTER LO LING
0EA4 ; UNASSIGNED # <reserved>
0EA5 ; PVALID # LAO LETTER LO LOOT
0EA6 ; UNASSIGNED # <reserved>
0EA7 ; PVALID # LAO LETTER WO
0EA8..0EA9 ; UNASSIGNED # <reserved>..<reserved>
0EAA..0EAB ; PVALID # LAO LETTER SO SUNG..LAO LETTER HO SUNG
0EAC ; UNASSIGNED # <reserved>
0EAD..0EB2 ; PVALID # LAO LETTER O..LAO VOWEL SIGN AA
0EB3 ; DISALLOWED # LAO VOWEL SIGN AM
0EB4..0EB9 ; PVALID # LAO VOWEL SIGN I..LAO VOWEL SIGN UU
0EBA ; UNASSIGNED # <reserved>
0EBB..0EBD ; PVALID # LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN N
0EBE..0EBF ; UNASSIGNED # <reserved>..<reserved>
0EC0..0EC4 ; PVALID # LAO VOWEL SIGN E..LAO VOWEL SIGN AI
0EC5 ; UNASSIGNED # <reserved>
0EC6 ; PVALID # LAO KO LA
0EC7 ; UNASSIGNED # <reserved>
0EC8..0ECD ; PVALID # LAO TONE MAI EK..LAO NIGGAHITA
0ECE..0ECF ; UNASSIGNED # <reserved>..<reserved>
0ED0..0ED9 ; PVALID # LAO DIGIT ZERO..LAO DIGIT NINE
0EDA..0EDB ; UNASSIGNED # <reserved>..<reserved>
0EDC..0EDD ; DISALLOWED # LAO HO NO..LAO HO MO
0EDE..0EFF ; UNASSIGNED # <reserved>..<reserved>
0F00 ; PVALID # TIBETAN SYLLABLE OM
0F01..0F0A ; DISALLOWED # TIBETAN MARK GTER YIG MGO TRUNCATED A..TIBET
0F0B ; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG
0F0C..0F17 ; DISALLOWED # TIBETAN MARK DELIMITER TSHEG BSTAR..TIBETAN
0F18..0F19 ; PVALID # TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN
0F1A..0F1F ; DISALLOWED # TIBETAN SIGN RDEL DKAR GCIG..TIBETAN SIGN RD
0F20..0F29 ; PVALID # TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
0F2A..0F34 ; DISALLOWED # TIBETAN DIGIT HALF ONE..TIBETAN MARK BSDUS R
0F35 ; PVALID # TIBETAN MARK NGAS BZUNG NYI ZLA
0F36 ; DISALLOWED # TIBETAN MARK CARET -DZUD RTAGS BZHI MIG CAN
0F37 ; PVALID # TIBETAN MARK NGAS BZUNG SGOR RTAGS
0F38 ; DISALLOWED # TIBETAN MARK CHE MGO
0F39 ; PVALID # TIBETAN MARK TSA -PHRU
0F3A..0F3D ; DISALLOWED # TIBETAN MARK GUG RTAGS GYON..TIBETAN MARK AN
0F3E..0F42 ; PVALID # TIBETAN SIGN YAR TSHES..TIBETAN LETTER GA
0F43 ; DISALLOWED # TIBETAN LETTER GHA
0F44..0F47 ; PVALID # TIBETAN LETTER NGA..TIBETAN LETTER JA
0F48 ; UNASSIGNED # <reserved>
0F49..0F4C ; PVALID # TIBETAN LETTER NYA..TIBETAN LETTER DDA
0F4D ; DISALLOWED # TIBETAN LETTER DDHA
0F4E..0F51 ; PVALID # TIBETAN LETTER NNA..TIBETAN LETTER DA
0F52 ; DISALLOWED # TIBETAN LETTER DHA
0F53..0F56 ; PVALID # TIBETAN LETTER NA..TIBETAN LETTER BA
0F57 ; DISALLOWED # TIBETAN LETTER BHA
0F58..0F5B ; PVALID # TIBETAN LETTER MA..TIBETAN LETTER DZA
0F5C ; DISALLOWED # TIBETAN LETTER DZHA
0F5D..0F68 ; PVALID # TIBETAN LETTER WA..TIBETAN LETTER A
0F69 ; DISALLOWED # TIBETAN LETTER KSSA
0F6A..0F6C ; PVALID # TIBETAN LETTER FIXED-FORM RA..TIBETAN LETTER
0F6D..0F70 ; UNASSIGNED # <reserved>..<reserved>
0F71..0F72 ; PVALID # TIBETAN VOWEL SIGN AA..TIBETAN VOWEL SIGN I
0F73 ; DISALLOWED # TIBETAN VOWEL SIGN II
0F74 ; PVALID # TIBETAN VOWEL SIGN U
0F75..0F79 ; DISALLOWED # TIBETAN VOWEL SIGN UU..TIBETAN VOWEL SIGN VO
0F7A..0F80 ; PVALID # TIBETAN VOWEL SIGN E..TIBETAN VOWEL SIGN REV
0F81 ; DISALLOWED # TIBETAN VOWEL SIGN REVERSED II
0F82..0F84 ; PVALID # TIBETAN SIGN NYI ZLA NAA DA..TIBETAN MARK HA
0F85 ; DISALLOWED # TIBETAN MARK PALUTA
0F86..0F8B ; PVALID # TIBETAN SIGN LCI RTAGS..TIBETAN SIGN GRU MED
0F8C..0F8F ; UNASSIGNED # <reserved>..<reserved>
0F90..0F92 ; PVALID # TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOIN
0F93 ; DISALLOWED # TIBETAN SUBJOINED LETTER GHA
0F94..0F97 ; PVALID # TIBETAN SUBJOINED LETTER NGA..TIBETAN SUBJOI
0F98 ; UNASSIGNED # <reserved>
0F99..0F9C ; PVALID # TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOI
0F9D ; DISALLOWED # TIBETAN SUBJOINED LETTER DDHA
0F9E..0FA1 ; PVALID # TIBETAN SUBJOINED LETTER NNA..TIBETAN SUBJOI
0FA2 ; DISALLOWED # TIBETAN SUBJOINED LETTER DHA
0FA3..0FA6 ; PVALID # TIBETAN SUBJOINED LETTER NA..TIBETAN SUBJOIN
0FA7 ; DISALLOWED # TIBETAN SUBJOINED LETTER BHA
0FA8..0FAB ; PVALID # TIBETAN SUBJOINED LETTER MA..TIBETAN SUBJOIN
0FAC ; DISALLOWED # TIBETAN SUBJOINED LETTER DZHA
0FAD..0FB8 ; PVALID # TIBETAN SUBJOINED LETTER WA..TIBETAN SUBJOIN
0FB9 ; DISALLOWED # TIBETAN SUBJOINED LETTER KSSA
0FBA..0FBC ; PVALID # TIBETAN SUBJOINED LETTER FIXED-FORM WA..TIBE
0FBD ; UNASSIGNED # <reserved>
0FBE..0FC5 ; DISALLOWED # TIBETAN KU RU KHA..TIBETAN SYMBOL RDO RJE
0FC6 ; PVALID # TIBETAN SYMBOL PADMA GDAN
0FC7..0FCC ; DISALLOWED # TIBETAN SYMBOL RDO RJE RGYA GRAM..TIBETAN SY
0FCD ; UNASSIGNED # <reserved>
0FCE..0FD4 ; DISALLOWED # TIBETAN SIGN RDEL NAG RDEL DKAR..TIBETAN MAR
0FD5..0FFF ; UNASSIGNED # <reserved>..<reserved>
1000..1049 ; PVALID # MYANMAR LETTER KA..MYANMAR DIGIT NINE
104A..104F ; DISALLOWED # MYANMAR SIGN LITTLE SECTION..MYANMAR SYMBOL
1050..1099 ; PVALID # MYANMAR LETTER SHA..MYANMAR SHAN DIGIT NINE
109A..109D ; UNASSIGNED # <reserved>..<reserved>
109E..10C5 ; DISALLOWED # MYANMAR SYMBOL SHAN ONE..GEORGIAN CAPITAL LE
10C6..10CF ; UNASSIGNED # <reserved>..<reserved>
10D0..10FA ; PVALID # GEORGIAN LETTER AN..GEORGIAN LETTER AIN
10FB..10FC ; DISALLOWED # GEORGIAN PARAGRAPH SEPARATOR..MODIFIER LETTE
10FD..10FF ; UNASSIGNED # <reserved>..<reserved>
1100..1159 ; PVALID # HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG YEOR
115A..115E ; UNASSIGNED # <reserved>..<reserved>
115F..1160 ; DISALLOWED # HANGUL CHOSEONG FILLER..HANGUL JUNGSEONG FIL
1161..11A2 ; PVALID # HANGUL JUNGSEONG A..HANGUL JUNGSEONG SSANGAR
11A3..11A7 ; UNASSIGNED # <reserved>..<reserved>
11A8..11F9 ; PVALID # HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG YE
11FA..11FF ; UNASSIGNED # <reserved>..<reserved>
1200..1248 ; PVALID # ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
1249 ; UNASSIGNED # <reserved>
124A..124D ; PVALID # ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
124E..124F ; UNASSIGNED # <reserved>..<reserved>
1250..1256 ; PVALID # ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO
1257 ; UNASSIGNED # <reserved>
1258 ; PVALID # ETHIOPIC SYLLABLE QHWA
1259 ; UNASSIGNED # <reserved>
125A..125D ; PVALID # ETHIOPIC SYLLABLE QHWI..ETHIOPIC SYLLABLE QH
125E..125F ; UNASSIGNED # <reserved>..<reserved>
1260..1288 ; PVALID # ETHIOPIC SYLLABLE BA..ETHIOPIC SYLLABLE XWA
1289 ; UNASSIGNED # <reserved>
128A..128D ; PVALID # ETHIOPIC SYLLABLE XWI..ETHIOPIC SYLLABLE XWE
128E..128F ; UNASSIGNED # <reserved>..<reserved>
1290..12B0 ; PVALID # ETHIOPIC SYLLABLE NA..ETHIOPIC SYLLABLE KWA
12B1 ; UNASSIGNED # <reserved>
12B2..12B5 ; PVALID # ETHIOPIC SYLLABLE KWI..ETHIOPIC SYLLABLE KWE
12B6..12B7 ; UNASSIGNED # <reserved>..<reserved>
12B8..12BE ; PVALID # ETHIOPIC SYLLABLE KXA..ETHIOPIC SYLLABLE KXO
12BF ; UNASSIGNED # <reserved>
12C0 ; PVALID # ETHIOPIC SYLLABLE KXWA
12C1 ; UNASSIGNED # <reserved>
12C2..12C5 ; PVALID # ETHIOPIC SYLLABLE KXWI..ETHIOPIC SYLLABLE KX
12C6..12C7 ; UNASSIGNED # <reserved>..<reserved>
12C8..12D6 ; PVALID # ETHIOPIC SYLLABLE WA..ETHIOPIC SYLLABLE PHAR
12D7 ; UNASSIGNED # <reserved>
12D8..1310 ; PVALID # ETHIOPIC SYLLABLE ZA..ETHIOPIC SYLLABLE GWA
1311 ; UNASSIGNED # <reserved>
1312..1315 ; PVALID # ETHIOPIC SYLLABLE GWI..ETHIOPIC SYLLABLE GWE
1316..1317 ; UNASSIGNED # <reserved>..<reserved>
1318..135A ; PVALID # ETHIOPIC SYLLABLE GGA..ETHIOPIC SYLLABLE FYA
135B..135E ; UNASSIGNED # <reserved>..<reserved>
135F ; PVALID # ETHIOPIC COMBINING GEMINATION MARK
1360..137C ; DISALLOWED # ETHIOPIC SECTION MARK..ETHIOPIC NUMBER TEN T
137D..137F ; UNASSIGNED # <reserved>..<reserved>
1380..138F ; PVALID # ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SY
1390..1399 ; DISALLOWED # ETHIOPIC TONAL MARK YIZET..ETHIOPIC TONAL MA
139A..139F ; UNASSIGNED # <reserved>..<reserved>
13A0..13F4 ; PVALID # CHEROKEE LETTER A..CHEROKEE LETTER YV
13F5..1400 ; UNASSIGNED # <reserved>..<reserved>
1401..166C ; PVALID # CANADIAN SYLLABICS E..CANADIAN SYLLABICS CAR
166D..166E ; DISALLOWED # CANADIAN SYLLABICS CHI SIGN..CANADIAN SYLLAB
166F..1676 ; PVALID # CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS N
1677..167F ; UNASSIGNED # <reserved>..<reserved>
1680 ; DISALLOWED # OGHAM SPACE MARK
1681..169A ; PVALID # OGHAM LETTER BEITH..OGHAM LETTER PEITH
169B..169C ; DISALLOWED # OGHAM FEATHER MARK..OGHAM REVERSED FEATHER M
169D..169F ; UNASSIGNED # <reserved>..<reserved>
16A0..16EA ; PVALID # RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X
16EB..16F0 ; DISALLOWED # RUNIC SINGLE PUNCTUATION..RUNIC BELGTHOR SYM
16F1..16FF ; UNASSIGNED # <reserved>..<reserved>
1700..170C ; PVALID # TAGALOG LETTER A..TAGALOG LETTER YA
170D ; UNASSIGNED # <reserved>
170E..1714 ; PVALID # TAGALOG LETTER LA..TAGALOG SIGN VIRAMA
1715..171F ; UNASSIGNED # <reserved>..<reserved>
1720..1734 ; PVALID # HANUNOO LETTER A..HANUNOO SIGN PAMUDPOD
1735..1736 ; DISALLOWED # PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DO
1737..173F ; UNASSIGNED # <reserved>..<reserved>
1740..1753 ; PVALID # BUHID LETTER A..BUHID VOWEL SIGN U
1754..175F ; UNASSIGNED # <reserved>..<reserved>
1760..176C ; PVALID # TAGBANWA LETTER A..TAGBANWA LETTER YA
176D ; UNASSIGNED # <reserved>
176E..1770 ; PVALID # TAGBANWA LETTER LA..TAGBANWA LETTER SA
1771 ; UNASSIGNED # <reserved>
1772..1773 ; PVALID # TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
1774..177F ; UNASSIGNED # <reserved>..<reserved>
1780..17B3 ; PVALID # KHMER LETTER KA..KHMER INDEPENDENT VOWEL QAU
17B4..17B5 ; DISALLOWED # KHMER VOWEL INHERENT AQ..KHMER VOWEL INHEREN
17B6..17D3 ; PVALID # KHMER VOWEL SIGN AA..KHMER SIGN BATHAMASAT
17D4..17D6 ; DISALLOWED # KHMER SIGN KHAN..KHMER SIGN CAMNUC PII KUUH
17D7 ; PVALID # KHMER SIGN LEK TOO
17D8..17DB ; DISALLOWED # KHMER SIGN BEYYAL..KHMER CURRENCY SYMBOL RIE
17DC..17DD ; PVALID # KHMER SIGN AVAKRAHASANYA..KHMER SIGN ATTHACA
17DE..17DF ; UNASSIGNED # <reserved>..<reserved>
17E0..17E9 ; PVALID # KHMER DIGIT ZERO..KHMER DIGIT NINE
17EA..17EF ; UNASSIGNED # <reserved>..<reserved>
17F0..17F9 ; DISALLOWED # KHMER SYMBOL LEK ATTAK SON..KHMER SYMBOL LEK
17FA..17FF ; UNASSIGNED # <reserved>..<reserved>
1800..180E ; DISALLOWED # MONGOLIAN BIRGA..MONGOLIAN VOWEL SEPARATOR
180F ; UNASSIGNED # <reserved>
1810..1819 ; PVALID # MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
181A..181F ; UNASSIGNED # <reserved>..<reserved>
1820..1877 ; PVALID # MONGOLIAN LETTER A..MONGOLIAN LETTER MANCHU
1878..187F ; UNASSIGNED # <reserved>..<reserved>
1880..18AA ; PVALID # MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONG
18AB..18FF ; UNASSIGNED # <reserved>..<reserved>
1900..191C ; PVALID # LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA
191D..191F ; UNASSIGNED # <reserved>..<reserved>
1920..192B ; PVALID # LIMBU VOWEL SIGN A..LIMBU SUBJOINED LETTER W
192C..192F ; UNASSIGNED # <reserved>..<reserved>
1930..193B ; PVALID # LIMBU SMALL LETTER KA..LIMBU SIGN SA-I
193C..193F ; UNASSIGNED # <reserved>..<reserved>
1940 ; DISALLOWED # LIMBU SIGN LOO
1941..1943 ; UNASSIGNED # <reserved>..<reserved>
1944..1945 ; DISALLOWED # LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
1946..196D ; PVALID # LIMBU DIGIT ZERO..TAI LE LETTER AI
196E..196F ; UNASSIGNED # <reserved>..<reserved>
1970..1974 ; PVALID # TAI LE LETTER TONE-2..TAI LE LETTER TONE-6
1975..197F ; UNASSIGNED # <reserved>..<reserved>
1980..19A9 ; PVALID # NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETT
19AA..19AF ; UNASSIGNED # <reserved>..<reserved>
19B0..19C9 ; PVALID # NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW
19CA..19CF ; UNASSIGNED # <reserved>..<reserved>
19D0..19D9 ; PVALID # NEW TAI LUE DIGIT ZERO..NEW TAI LUE DIGIT NI
19DA..19DD ; UNASSIGNED # <reserved>..<reserved>
19DE..19FF ; DISALLOWED # NEW TAI LUE SIGN LAE..KHMER SYMBOL DAP-PRAM
1A00..1A1B ; PVALID # BUGINESE LETTER KA..BUGINESE VOWEL SIGN AE
1A1C..1A1D ; UNASSIGNED # <reserved>..<reserved>
1A1E..1A1F ; DISALLOWED # BUGINESE PALLAWA..BUGINESE END OF SECTION
1A20..1AFF ; UNASSIGNED # <reserved>..<reserved>
1B00..1B4B ; PVALID # BALINESE SIGN ULU RICEM..BALINESE LETTER ASY
1B4C..1B4F ; UNASSIGNED # <reserved>..<reserved>
1B50..1B59 ; PVALID # BALINESE DIGIT ZERO..BALINESE DIGIT NINE
1B5A..1B6A ; DISALLOWED # BALINESE PANTI..BALINESE MUSICAL SYMBOL DANG
1B6B..1B73 ; PVALID # BALINESE MUSICAL SYMBOL COMBINING TEGEH..BAL
1B74..1B7C ; DISALLOWED # BALINESE MUSICAL SYMBOL RIGHT-HAND OPEN DUG.
1B7D..1B7F ; UNASSIGNED # <reserved>..<reserved>
1B80..1BAA ; PVALID # SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PAMA
1BAB..1BAD ; UNASSIGNED # <reserved>..<reserved>
1BAE..1BB9 ; PVALID # SUNDANESE LETTER KHA..SUNDANESE DIGIT NINE
1BBA..1BFF ; UNASSIGNED # <reserved>..<reserved>
1C00..1C37 ; PVALID # LEPCHA LETTER KA..LEPCHA SIGN NUKTA
1C38..1C3A ; UNASSIGNED # <reserved>..<reserved>
1C3B..1C3F ; DISALLOWED # LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATIO
1C40..1C49 ; PVALID # LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE
1C4A..1C4C ; UNASSIGNED # <reserved>..<reserved>
1C4D..1C7D ; PVALID # LEPCHA LETTER TTA..OL CHIKI AHAD
1C7E..1C7F ; DISALLOWED # OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTU
1C80..1CFF ; UNASSIGNED # <reserved>..<reserved>
1D00..1D2B ; PVALID # LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTE
1D2C..1D2E ; DISALLOWED # MODIFIER LETTER CAPITAL A..MODIFIER LETTER C
1D2F ; PVALID # MODIFIER LETTER CAPITAL BARRED B
1D30..1D3A ; DISALLOWED # MODIFIER LETTER CAPITAL D..MODIFIER LETTER C
1D3B ; PVALID # MODIFIER LETTER CAPITAL REVERSED N
1D3C..1D4D ; DISALLOWED # MODIFIER LETTER CAPITAL O..MODIFIER LETTER S
1D4E ; PVALID # MODIFIER LETTER SMALL TURNED I
1D4F..1D6A ; DISALLOWED # MODIFIER LETTER SMALL K..GREEK SUBSCRIPT SMA
1D6B..1D77 ; PVALID # LATIN SMALL LETTER UE..LATIN SMALL LETTER TU
1D78 ; DISALLOWED # MODIFIER LETTER CYRILLIC EN
1D79..1D9A ; PVALID # LATIN SMALL LETTER INSULAR G..LATIN SMALL LE
1D9B..1DBF ; DISALLOWED # MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER
1DC0..1DE6 ; PVALID # COMBINING DOTTED GRAVE ACCENT..COMBINING LAT
1DE7..1DFD ; UNASSIGNED # <reserved>..<reserved>
1DFE..1DFF ; PVALID # COMBINING LEFT ARROWHEAD ABOVE..COMBINING RI
1E00 ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING BELOW
1E01 ; PVALID # LATIN SMALL LETTER A WITH RING BELOW
1E02 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT ABOVE
1E03 ; PVALID # LATIN SMALL LETTER B WITH DOT ABOVE
1E04 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT BELOW
1E05 ; PVALID # LATIN SMALL LETTER B WITH DOT BELOW
1E06 ; DISALLOWED # LATIN CAPITAL LETTER B WITH LINE BELOW
1E07 ; PVALID # LATIN SMALL LETTER B WITH LINE BELOW
1E08 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CEDILLA AND ACUT
1E09 ; PVALID # LATIN SMALL LETTER C WITH CEDILLA AND ACUTE
1E0A ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT ABOVE
1E0B ; PVALID # LATIN SMALL LETTER D WITH DOT ABOVE
1E0C ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT BELOW
1E0D ; PVALID # LATIN SMALL LETTER D WITH DOT BELOW
1E0E ; DISALLOWED # LATIN CAPITAL LETTER D WITH LINE BELOW
1E0F ; PVALID # LATIN SMALL LETTER D WITH LINE BELOW
1E10 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CEDILLA
1E11 ; PVALID # LATIN SMALL LETTER D WITH CEDILLA
1E12 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CIRCUMFLEX BELOW
1E13 ; PVALID # LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW
1E14 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND GRAVE
1E15 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND GRAVE
1E16 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND ACUTE
1E17 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND ACUTE
1E18 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW
1E19 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW
1E1A ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE BELOW
1E1B ; PVALID # LATIN SMALL LETTER E WITH TILDE BELOW
1E1C ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA AND BREV
1E1D ; PVALID # LATIN SMALL LETTER E WITH CEDILLA AND BREVE
1E1E ; DISALLOWED # LATIN CAPITAL LETTER F WITH DOT ABOVE
1E1F ; PVALID # LATIN SMALL LETTER F WITH DOT ABOVE
1E20 ; DISALLOWED # LATIN CAPITAL LETTER G WITH MACRON
1E21 ; PVALID # LATIN SMALL LETTER G WITH MACRON
1E22 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT ABOVE
1E23 ; PVALID # LATIN SMALL LETTER H WITH DOT ABOVE
1E24 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT BELOW
1E25 ; PVALID # LATIN SMALL LETTER H WITH DOT BELOW
1E26 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DIAERESIS
1E27 ; PVALID # LATIN SMALL LETTER H WITH DIAERESIS
1E28 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CEDILLA
1E29 ; PVALID # LATIN SMALL LETTER H WITH CEDILLA
1E2A ; DISALLOWED # LATIN CAPITAL LETTER H WITH BREVE BELOW
1E2B ; PVALID # LATIN SMALL LETTER H WITH BREVE BELOW
1E2C ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE BELOW
1E2D ; PVALID # LATIN SMALL LETTER I WITH TILDE BELOW
1E2E ; DISALLOWED # LATIN CAPITAL LETTER I WITH DIAERESIS AND AC
1E2F ; PVALID # LATIN SMALL LETTER I WITH DIAERESIS AND ACUT
1E30 ; DISALLOWED # LATIN CAPITAL LETTER K WITH ACUTE
1E31 ; PVALID # LATIN SMALL LETTER K WITH ACUTE
1E32 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DOT BELOW
1E33 ; PVALID # LATIN SMALL LETTER K WITH DOT BELOW
1E34 ; DISALLOWED # LATIN CAPITAL LETTER K WITH LINE BELOW
1E35 ; PVALID # LATIN SMALL LETTER K WITH LINE BELOW
1E36 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW
1E37 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW
1E38 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW AND MA
1E39 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW AND MACR
1E3A ; DISALLOWED # LATIN CAPITAL LETTER L WITH LINE BELOW
1E3B ; PVALID # LATIN SMALL LETTER L WITH LINE BELOW
1E3C ; DISALLOWED # LATIN CAPITAL LETTER L WITH CIRCUMFLEX BELOW
1E3D ; PVALID # LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW
1E3E ; DISALLOWED # LATIN CAPITAL LETTER M WITH ACUTE
1E3F ; PVALID # LATIN SMALL LETTER M WITH ACUTE
1E40 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT ABOVE
1E41 ; PVALID # LATIN SMALL LETTER M WITH DOT ABOVE
1E42 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT BELOW
1E43 ; PVALID # LATIN SMALL LETTER M WITH DOT BELOW
1E44 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT ABOVE
1E45 ; PVALID # LATIN SMALL LETTER N WITH DOT ABOVE
1E46 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT BELOW
1E47 ; PVALID # LATIN SMALL LETTER N WITH DOT BELOW
1E48 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LINE BELOW
1E49 ; PVALID # LATIN SMALL LETTER N WITH LINE BELOW
1E4A ; DISALLOWED # LATIN CAPITAL LETTER N WITH CIRCUMFLEX BELOW
1E4B ; PVALID # LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW
1E4C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND ACUTE
1E4D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND ACUTE
1E4E ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND DIAERE
1E4F ; PVALID # LATIN SMALL LETTER O WITH TILDE AND DIAERESI
1E50 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND GRAVE
1E51 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND GRAVE
1E52 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND ACUTE
1E53 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND ACUTE
1E54 ; DISALLOWED # LATIN CAPITAL LETTER P WITH ACUTE
1E55 ; PVALID # LATIN SMALL LETTER P WITH ACUTE
1E56 ; DISALLOWED # LATIN CAPITAL LETTER P WITH DOT ABOVE
1E57 ; PVALID # LATIN SMALL LETTER P WITH DOT ABOVE
1E58 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT ABOVE
1E59 ; PVALID # LATIN SMALL LETTER R WITH DOT ABOVE
1E5A ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW
1E5B ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW
1E5C ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW AND MA
1E5D ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW AND MACR
1E5E ; DISALLOWED # LATIN CAPITAL LETTER R WITH LINE BELOW
1E5F ; PVALID # LATIN SMALL LETTER R WITH LINE BELOW
1E60 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT ABOVE
1E61 ; PVALID # LATIN SMALL LETTER S WITH DOT ABOVE
1E62 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW
1E63 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW
1E64 ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE AND DOT AB
1E65 ; PVALID # LATIN SMALL LETTER S WITH ACUTE AND DOT ABOV
1E66 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON AND DOT AB
1E67 ; PVALID # LATIN SMALL LETTER S WITH CARON AND DOT ABOV
1E68 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW AND DO
1E69 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW AND DOT
1E6A ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT ABOVE
1E6B ; PVALID # LATIN SMALL LETTER T WITH DOT ABOVE
1E6C ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT BELOW
1E6D ; PVALID # LATIN SMALL LETTER T WITH DOT BELOW
1E6E ; DISALLOWED # LATIN CAPITAL LETTER T WITH LINE BELOW
1E6F ; PVALID # LATIN SMALL LETTER T WITH LINE BELOW
1E70 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CIRCUMFLEX BELOW
1E71 ; PVALID # LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW
1E72 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS BELOW
1E73 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS BELOW
1E74 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE BELOW
1E75 ; PVALID # LATIN SMALL LETTER U WITH TILDE BELOW
1E76 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW
1E77 ; PVALID # LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW
1E78 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE AND ACUTE
1E79 ; PVALID # LATIN SMALL LETTER U WITH TILDE AND ACUTE
1E7A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON AND DIAER
1E7B ; PVALID # LATIN SMALL LETTER U WITH MACRON AND DIAERES
1E7C ; DISALLOWED # LATIN CAPITAL LETTER V WITH TILDE
1E7D ; PVALID # LATIN SMALL LETTER V WITH TILDE
1E7E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DOT BELOW
1E7F ; PVALID # LATIN SMALL LETTER V WITH DOT BELOW
1E80 ; DISALLOWED # LATIN CAPITAL LETTER W WITH GRAVE
1E81 ; PVALID # LATIN SMALL LETTER W WITH GRAVE
1E82 ; DISALLOWED # LATIN CAPITAL LETTER W WITH ACUTE
1E83 ; PVALID # LATIN SMALL LETTER W WITH ACUTE
1E84 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DIAERESIS
1E85 ; PVALID # LATIN SMALL LETTER W WITH DIAERESIS
1E86 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT ABOVE
1E87 ; PVALID # LATIN SMALL LETTER W WITH DOT ABOVE
1E88 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT BELOW
1E89 ; PVALID # LATIN SMALL LETTER W WITH DOT BELOW
1E8A ; DISALLOWED # LATIN CAPITAL LETTER X WITH DOT ABOVE
1E8B ; PVALID # LATIN SMALL LETTER X WITH DOT ABOVE
1E8C ; DISALLOWED # LATIN CAPITAL LETTER X WITH DIAERESIS
1E8D ; PVALID # LATIN SMALL LETTER X WITH DIAERESIS
1E8E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT ABOVE
1E8F ; PVALID # LATIN SMALL LETTER Y WITH DOT ABOVE
1E90 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CIRCUMFLEX
1E91 ; PVALID # LATIN SMALL LETTER Z WITH CIRCUMFLEX
1E92 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT BELOW
1E93 ; PVALID # LATIN SMALL LETTER Z WITH DOT BELOW
1E94 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH LINE BELOW
1E95..1E99 ; PVALID # LATIN SMALL LETTER Z WITH LINE BELOW..LATIN
1E9A..1E9B ; DISALLOWED # LATIN SMALL LETTER A WITH RIGHT HALF RING..L
1E9C..1E9D ; PVALID # LATIN SMALL LETTER LONG S WITH DIAGONAL STRO
1E9E ; DISALLOWED # LATIN CAPITAL LETTER SHARP S
1E9F ; PVALID # LATIN SMALL LETTER DELTA
1EA0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT BELOW
1EA1 ; PVALID # LATIN SMALL LETTER A WITH DOT BELOW
1EA2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH HOOK ABOVE
1EA3 ; PVALID # LATIN SMALL LETTER A WITH HOOK ABOVE
1EA4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND A
1EA5 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACU
1EA6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND G
1EA7 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRA
1EA8 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND H
1EA9 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOO
1EAA ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND T
1EAB ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND TIL
1EAC ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND D
1EAD ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT
1EAE ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND ACUTE
1EAF ; PVALID # LATIN SMALL LETTER A WITH BREVE AND ACUTE
1EB0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND GRAVE
1EB1 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND GRAVE
1EB2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND HOOK A
1EB3 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND HOOK ABO
1EB4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND TILDE
1EB5 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND TILDE
1EB6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND DOT BE
1EB7 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND DOT BELO
1EB8 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT BELOW
1EB9 ; PVALID # LATIN SMALL LETTER E WITH DOT BELOW
1EBA ; DISALLOWED # LATIN CAPITAL LETTER E WITH HOOK ABOVE
1EBB ; PVALID # LATIN SMALL LETTER E WITH HOOK ABOVE
1EBC ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE
1EBD ; PVALID # LATIN SMALL LETTER E WITH TILDE
1EBE ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND A
1EBF ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACU
1EC0 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND G
1EC1 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRA
1EC2 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND H
1EC3 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOO
1EC4 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND T
1EC5 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND TIL
1EC6 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND D
1EC7 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT
1EC8 ; DISALLOWED # LATIN CAPITAL LETTER I WITH HOOK ABOVE
1EC9 ; PVALID # LATIN SMALL LETTER I WITH HOOK ABOVE
1ECA ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT BELOW
1ECB ; PVALID # LATIN SMALL LETTER I WITH DOT BELOW
1ECC ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT BELOW
1ECD ; PVALID # LATIN SMALL LETTER O WITH DOT BELOW
1ECE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HOOK ABOVE
1ECF ; PVALID # LATIN SMALL LETTER O WITH HOOK ABOVE
1ED0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND A
1ED1 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACU
1ED2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND G
1ED3 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRA
1ED4 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND H
1ED5 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOO
1ED6 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND T
1ED7 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND TIL
1ED8 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND D
1ED9 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT
1EDA ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND ACUTE
1EDB ; PVALID # LATIN SMALL LETTER O WITH HORN AND ACUTE
1EDC ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND GRAVE
1EDD ; PVALID # LATIN SMALL LETTER O WITH HORN AND GRAVE
1EDE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND HOOK AB
1EDF ; PVALID # LATIN SMALL LETTER O WITH HORN AND HOOK ABOV
1EE0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND TILDE
1EE1 ; PVALID # LATIN SMALL LETTER O WITH HORN AND TILDE
1EE2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND DOT BEL
1EE3 ; PVALID # LATIN SMALL LETTER O WITH HORN AND DOT BELOW
1EE4 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOT BELOW
1EE5 ; PVALID # LATIN SMALL LETTER U WITH DOT BELOW
1EE6 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HOOK ABOVE
1EE7 ; PVALID # LATIN SMALL LETTER U WITH HOOK ABOVE
1EE8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND ACUTE
1EE9 ; PVALID # LATIN SMALL LETTER U WITH HORN AND ACUTE
1EEA ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND GRAVE
1EEB ; PVALID # LATIN SMALL LETTER U WITH HORN AND GRAVE
1EEC ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND HOOK AB
1EED ; PVALID # LATIN SMALL LETTER U WITH HORN AND HOOK ABOV
1EEE ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND TILDE
1EEF ; PVALID # LATIN SMALL LETTER U WITH HORN AND TILDE
1EF0 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND DOT BEL
1EF1 ; PVALID # LATIN SMALL LETTER U WITH HORN AND DOT BELOW
1EF2 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH GRAVE
1EF3 ; PVALID # LATIN SMALL LETTER Y WITH GRAVE
1EF4 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT BELOW
1EF5 ; PVALID # LATIN SMALL LETTER Y WITH DOT BELOW
1EF6 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH HOOK ABOVE
1EF7 ; PVALID # LATIN SMALL LETTER Y WITH HOOK ABOVE
1EF8 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH TILDE
1EF9 ; PVALID # LATIN SMALL LETTER Y WITH TILDE
1EFA ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH LL
1EFB ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH LL
1EFC ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH V
1EFD ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH V
1EFE ; DISALLOWED # LATIN CAPITAL LETTER Y WITH LOOP
1EFF..1F07 ; PVALID # LATIN SMALL LETTER Y WITH LOOP..GREEK SMALL
1F08..1F0F ; DISALLOWED # GREEK CAPITAL LETTER ALPHA WITH PSILI..GREEK
1F10..1F15 ; PVALID # GREEK SMALL LETTER EPSILON WITH PSILI..GREEK
1F16..1F17 ; UNASSIGNED # <reserved>..<reserved>
1F18..1F1D ; DISALLOWED # GREEK CAPITAL LETTER EPSILON WITH PSILI..GRE
1F1E..1F1F ; UNASSIGNED # <reserved>..<reserved>
1F20..1F27 ; PVALID # GREEK SMALL LETTER ETA WITH PSILI..GREEK SMA
1F28..1F2F ; DISALLOWED # GREEK CAPITAL LETTER ETA WITH PSILI..GREEK C
1F30..1F37 ; PVALID # GREEK SMALL LETTER IOTA WITH PSILI..GREEK SM
1F38..1F3F ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH PSILI..GREEK
1F40..1F45 ; PVALID # GREEK SMALL LETTER OMICRON WITH PSILI..GREEK
1F46..1F47 ; UNASSIGNED # <reserved>..<reserved>
1F48..1F4D ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH PSILI..GRE
1F4E..1F4F ; UNASSIGNED # <reserved>..<reserved>
1F50..1F57 ; PVALID # GREEK SMALL LETTER UPSILON WITH PSILI..GREEK
1F58 ; UNASSIGNED # <reserved>
1F59 ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA
1F5A ; UNASSIGNED # <reserved>
1F5B ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
1F5C ; UNASSIGNED # <reserved>
1F5D ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
1F5E ; UNASSIGNED # <reserved>
1F5F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
1F60..1F67 ; PVALID # GREEK SMALL LETTER OMEGA WITH PSILI..GREEK S
1F68..1F6F ; DISALLOWED # GREEK CAPITAL LETTER OMEGA WITH PSILI..GREEK
1F70 ; PVALID # GREEK SMALL LETTER ALPHA WITH VARIA
1F71 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH OXIA
1F72 ; PVALID # GREEK SMALL LETTER EPSILON WITH VARIA
1F73 ; DISALLOWED # GREEK SMALL LETTER EPSILON WITH OXIA
1F74 ; PVALID # GREEK SMALL LETTER ETA WITH VARIA
1F75 ; DISALLOWED # GREEK SMALL LETTER ETA WITH OXIA
1F76 ; PVALID # GREEK SMALL LETTER IOTA WITH VARIA
1F77 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH OXIA
1F78 ; PVALID # GREEK SMALL LETTER OMICRON WITH VARIA
1F79 ; DISALLOWED # GREEK SMALL LETTER OMICRON WITH OXIA
1F7A ; PVALID # GREEK SMALL LETTER UPSILON WITH VARIA
1F7B ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH OXIA
1F7C ; PVALID # GREEK SMALL LETTER OMEGA WITH VARIA
1F7D ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH OXIA
1F7E..1F7F ; UNASSIGNED # <reserved>..<reserved>
1F80..1FAF ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOG
1FB0..1FB1 ; PVALID # GREEK SMALL LETTER ALPHA WITH VRACHY..GREEK
1FB2..1FB4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH VARIA AND YPOG
1FB5 ; UNASSIGNED # <reserved>
1FB6 ; PVALID # GREEK SMALL LETTER ALPHA WITH PERISPOMENI
1FB7..1FC4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PERISPOMENI AN
1FC5 ; UNASSIGNED # <reserved>
1FC6 ; PVALID # GREEK SMALL LETTER ETA WITH PERISPOMENI
1FC7..1FCF ; DISALLOWED # GREEK SMALL LETTER ETA WITH PERISPOMENI AND
1FD0..1FD2 ; PVALID # GREEK SMALL LETTER IOTA WITH VRACHY..GREEK S
1FD3 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND O
1FD4..1FD5 ; UNASSIGNED # <reserved>..<reserved>
1FD6..1FD7 ; PVALID # GREEK SMALL LETTER IOTA WITH PERISPOMENI..GR
1FD8..1FDB ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH VRACHY..GREEK
1FDC ; UNASSIGNED # <reserved>
1FDD..1FDF ; DISALLOWED # GREEK DASIA AND VARIA..GREEK DASIA AND PERIS
1FE0..1FE2 ; PVALID # GREEK SMALL LETTER UPSILON WITH VRACHY..GREE
1FE3 ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AN
1FE4..1FE7 ; PVALID # GREEK SMALL LETTER RHO WITH PSILI..GREEK SMA
1FE8..1FEF ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH VRACHY..GR
1FF0..1FF1 ; UNASSIGNED # <reserved>..<reserved>
1FF2..1FF4 ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH VARIA AND YPOG
1FF5 ; UNASSIGNED # <reserved>
1FF6 ; PVALID # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
1FF7..1FFE ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH PERISPOMENI AN
1FFF ; UNASSIGNED # <reserved>
2000..200B ; DISALLOWED # EN QUAD..ZERO WIDTH SPACE
200C..200D ; CONTEXTJ # ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
200E..2071 ; DISALLOWED # LEFT-TO-RIGHT MARK..SUPERSCRIPT LATIN SMALL
2072..2073 ; UNASSIGNED # <reserved>..<reserved>
2074..208E ; DISALLOWED # SUPERSCRIPT FOUR..SUBSCRIPT RIGHT PARENTHESI
208F ; UNASSIGNED # <reserved>
2090..2094 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCR
2095..209F ; UNASSIGNED # <reserved>..<reserved>
20A0..20B5 ; DISALLOWED # EURO-CURRENCY SIGN..CEDI SIGN
20B6..20CF ; UNASSIGNED # <reserved>..<reserved>
20D0..214D ; DISALLOWED # COMBINING LEFT HARPOON ABOVE..AKTIESELSKAB
214E ; PVALID # TURNED SMALL F
214F ; DISALLOWED # SYMBOL FOR SAMARITAN SOURCE
2150..2152 ; UNASSIGNED # <reserved>..<reserved>
2153..2183 ; DISALLOWED # VULGAR FRACTION ONE THIRD..ROMAN NUMERAL REV
2184 ; PVALID # LATIN SMALL LETTER REVERSED C
2185..2188 ; DISALLOWED # ROMAN NUMERAL SIX LATE FORM..ROMAN NUMERAL O
2189..218F ; UNASSIGNED # <reserved>..<reserved>
2190..23E7 ; DISALLOWED # LEFTWARDS ARROW..ELECTRICAL INTERSECTION
23E8..23FF ; UNASSIGNED # <reserved>..<reserved>
2400..2426 ; DISALLOWED # SYMBOL FOR NULL..SYMBOL FOR SUBSTITUTE FORM
2427..243F ; UNASSIGNED # <reserved>..<reserved>
2440..244A ; DISALLOWED # OCR HOOK..OCR DOUBLE BACKSLASH
244B..245F ; UNASSIGNED # <reserved>..<reserved>
2460..269D ; DISALLOWED # CIRCLED DIGIT ONE..OUTLINED WHITE STAR
269E..269F ; UNASSIGNED # <reserved>..<reserved>
26A0..26BC ; DISALLOWED # WARNING SIGN..SESQUIQUADRATE
26BD..26BF ; UNASSIGNED # <reserved>..<reserved>
26C0..26C3 ; DISALLOWED # WHITE DRAUGHTS MAN..BLACK DRAUGHTS KING
26C4..2700 ; UNASSIGNED # <reserved>..<reserved>
2701..2704 ; DISALLOWED # UPPER BLADE SCISSORS..WHITE SCISSORS
2705 ; UNASSIGNED # <reserved>
2706..2709 ; DISALLOWED # TELEPHONE LOCATION SIGN..ENVELOPE
270A..270B ; UNASSIGNED # <reserved>..<reserved>
270C..2727 ; DISALLOWED # VICTORY HAND..WHITE FOUR POINTED STAR
2728 ; UNASSIGNED # <reserved>
2729..274B ; DISALLOWED # STRESS OUTLINED WHITE STAR..HEAVY EIGHT TEAR
274C ; UNASSIGNED # <reserved>
274D ; DISALLOWED # SHADOWED WHITE CIRCLE
274E ; UNASSIGNED # <reserved>
274F..2752 ; DISALLOWED # LOWER RIGHT DROP-SHADOWED WHITE SQUARE..UPPE
2753..2755 ; UNASSIGNED # <reserved>..<reserved>
2756 ; DISALLOWED # BLACK DIAMOND MINUS WHITE X
2757 ; UNASSIGNED # <reserved>
2758..275E ; DISALLOWED # LIGHT VERTICAL BAR..HEAVY DOUBLE COMMA QUOTA
275F..2760 ; UNASSIGNED # <reserved>..<reserved>
2761..2794 ; DISALLOWED # CURVED STEM PARAGRAPH SIGN ORNAMENT..HEAVY W
2795..2797 ; UNASSIGNED # <reserved>..<reserved>
2798..27AF ; DISALLOWED # HEAVY SOUTH EAST ARROW..NOTCHED LOWER RIGHT-
27B0 ; UNASSIGNED # <reserved>
27B1..27BE ; DISALLOWED # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHTWARD
27BF ; UNASSIGNED # <reserved>
27C0..27CA ; DISALLOWED # THREE DIMENSIONAL ANGLE..VERTICAL BAR WITH H
27CB ; UNASSIGNED # <reserved>
27CC ; DISALLOWED # LONG DIVISION
27CD..27CF ; UNASSIGNED # <reserved>..<reserved>
27D0..2B4C ; DISALLOWED # WHITE DIAMOND WITH CENTRED DOT..RIGHTWARDS A
2B4D..2B4F ; UNASSIGNED # <reserved>..<reserved>
2B50..2B54 ; DISALLOWED # WHITE MEDIUM STAR..WHITE RIGHT-POINTING PENT
2B55..2BFF ; UNASSIGNED # <reserved>..<reserved>
2C00..2C2E ; DISALLOWED # GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CA
2C2F ; UNASSIGNED # <reserved>
2C30..2C5E ; PVALID # GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMAL
2C5F ; UNASSIGNED # <reserved>
2C60 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOUBLE BAR
2C61 ; PVALID # LATIN SMALL LETTER L WITH DOUBLE BAR
2C62..2C64 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE TILDE..LA
2C65..2C66 ; PVALID # LATIN SMALL LETTER A WITH STROKE..LATIN SMAL
2C67 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DESCENDER
2C68 ; PVALID # LATIN SMALL LETTER H WITH DESCENDER
2C69 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DESCENDER
2C6A ; PVALID # LATIN SMALL LETTER K WITH DESCENDER
2C6B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DESCENDER
2C6C ; PVALID # LATIN SMALL LETTER Z WITH DESCENDER
2C6D..2C6F ; DISALLOWED # LATIN CAPITAL LETTER ALPHA..LATIN CAPITAL LE
2C70 ; UNASSIGNED # <reserved>
2C71 ; PVALID # LATIN SMALL LETTER V WITH RIGHT HOOK
2C72 ; DISALLOWED # LATIN CAPITAL LETTER W WITH HOOK
2C73..2C74 ; PVALID # LATIN SMALL LETTER W WITH HOOK..LATIN SMALL
2C75 ; DISALLOWED # LATIN CAPITAL LETTER HALF H
2C76..2C7B ; PVALID # LATIN SMALL LETTER HALF H..LATIN LETTER SMAL
2C7C..2C7D ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER J..MODIFIER LET
2C7E..2C7F ; UNASSIGNED # <reserved>..<reserved>
2C80 ; DISALLOWED # COPTIC CAPITAL LETTER ALFA
2C81 ; PVALID # COPTIC SMALL LETTER ALFA
2C82 ; DISALLOWED # COPTIC CAPITAL LETTER VIDA
2C83 ; PVALID # COPTIC SMALL LETTER VIDA
2C84 ; DISALLOWED # COPTIC CAPITAL LETTER GAMMA
2C85 ; PVALID # COPTIC SMALL LETTER GAMMA
2C86 ; DISALLOWED # COPTIC CAPITAL LETTER DALDA
2C87 ; PVALID # COPTIC SMALL LETTER DALDA
2C88 ; DISALLOWED # COPTIC CAPITAL LETTER EIE
2C89 ; PVALID # COPTIC SMALL LETTER EIE
2C8A ; DISALLOWED # COPTIC CAPITAL LETTER SOU
2C8B ; PVALID # COPTIC SMALL LETTER SOU
2C8C ; DISALLOWED # COPTIC CAPITAL LETTER ZATA
2C8D ; PVALID # COPTIC SMALL LETTER ZATA
2C8E ; DISALLOWED # COPTIC CAPITAL LETTER HATE
2C8F ; PVALID # COPTIC SMALL LETTER HATE
2C90 ; DISALLOWED # COPTIC CAPITAL LETTER THETHE
2C91 ; PVALID # COPTIC SMALL LETTER THETHE
2C92 ; DISALLOWED # COPTIC CAPITAL LETTER IAUDA
2C93 ; PVALID # COPTIC SMALL LETTER IAUDA
2C94 ; DISALLOWED # COPTIC CAPITAL LETTER KAPA
2C95 ; PVALID # COPTIC SMALL LETTER KAPA
2C96 ; DISALLOWED # COPTIC CAPITAL LETTER LAULA
2C97 ; PVALID # COPTIC SMALL LETTER LAULA
2C98 ; DISALLOWED # COPTIC CAPITAL LETTER MI
2C99 ; PVALID # COPTIC SMALL LETTER MI
2C9A ; DISALLOWED # COPTIC CAPITAL LETTER NI
2C9B ; PVALID # COPTIC SMALL LETTER NI
2C9C ; DISALLOWED # COPTIC CAPITAL LETTER KSI
2C9D ; PVALID # COPTIC SMALL LETTER KSI
2C9E ; DISALLOWED # COPTIC CAPITAL LETTER O
2C9F ; PVALID # COPTIC SMALL LETTER O
2CA0 ; DISALLOWED # COPTIC CAPITAL LETTER PI
2CA1 ; PVALID # COPTIC SMALL LETTER PI
2CA2 ; DISALLOWED # COPTIC CAPITAL LETTER RO
2CA3 ; PVALID # COPTIC SMALL LETTER RO
2CA4 ; DISALLOWED # COPTIC CAPITAL LETTER SIMA
2CA5 ; PVALID # COPTIC SMALL LETTER SIMA
2CA6 ; DISALLOWED # COPTIC CAPITAL LETTER TAU
2CA7 ; PVALID # COPTIC SMALL LETTER TAU
2CA8 ; DISALLOWED # COPTIC CAPITAL LETTER UA
2CA9 ; PVALID # COPTIC SMALL LETTER UA
2CAA ; DISALLOWED # COPTIC CAPITAL LETTER FI
2CAB ; PVALID # COPTIC SMALL LETTER FI
2CAC ; DISALLOWED # COPTIC CAPITAL LETTER KHI
2CAD ; PVALID # COPTIC SMALL LETTER KHI
2CAE ; DISALLOWED # COPTIC CAPITAL LETTER PSI
2CAF ; PVALID # COPTIC SMALL LETTER PSI
2CB0 ; DISALLOWED # COPTIC CAPITAL LETTER OOU
2CB1 ; PVALID # COPTIC SMALL LETTER OOU
2CB2 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P ALEF
2CB3 ; PVALID # COPTIC SMALL LETTER DIALECT-P ALEF
2CB4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC AIN
2CB5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC AIN
2CB6 ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC EIE
2CB7 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC EIE
2CB8 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P KAPA
2CB9 ; PVALID # COPTIC SMALL LETTER DIALECT-P KAPA
2CBA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P NI
2CBB ; PVALID # COPTIC SMALL LETTER DIALECT-P NI
2CBC ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC NI
2CBD ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC NI
2CBE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC OOU
2CBF ; PVALID # COPTIC SMALL LETTER OLD COPTIC OOU
2CC0 ; DISALLOWED # COPTIC CAPITAL LETTER SAMPI
2CC1 ; PVALID # COPTIC SMALL LETTER SAMPI
2CC2 ; DISALLOWED # COPTIC CAPITAL LETTER CROSSED SHEI
2CC3 ; PVALID # COPTIC SMALL LETTER CROSSED SHEI
2CC4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHEI
2CC5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHEI
2CC6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC ESH
2CC7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC ESH
2CC8 ; DISALLOWED # COPTIC CAPITAL LETTER AKHMIMIC KHEI
2CC9 ; PVALID # COPTIC SMALL LETTER AKHMIMIC KHEI
2CCA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P HORI
2CCB ; PVALID # COPTIC SMALL LETTER DIALECT-P HORI
2CCC ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HORI
2CCD ; PVALID # COPTIC SMALL LETTER OLD COPTIC HORI
2CCE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HA
2CCF ; PVALID # COPTIC SMALL LETTER OLD COPTIC HA
2CD0 ; DISALLOWED # COPTIC CAPITAL LETTER L-SHAPED HA
2CD1 ; PVALID # COPTIC SMALL LETTER L-SHAPED HA
2CD2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HEI
2CD3 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HEI
2CD4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HAT
2CD5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HAT
2CD6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC GANGIA
2CD7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC GANGIA
2CD8 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC DJA
2CD9 ; PVALID # COPTIC SMALL LETTER OLD COPTIC DJA
2CDA ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHIMA
2CDB ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHIMA
2CDC ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN SHIMA
2CDD ; PVALID # COPTIC SMALL LETTER OLD NUBIAN SHIMA
2CDE ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NGI
2CDF ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NGI
2CE0 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NYI
2CE1 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NYI
2CE2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN WAU
2CE3..2CE4 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN WAU..COPTIC S
2CE5..2CEA ; DISALLOWED # COPTIC SYMBOL MI RO..COPTIC SYMBOL SHIMA SIM
2CEB..2CF8 ; UNASSIGNED # <reserved>..<reserved>
2CF9..2CFF ; DISALLOWED # COPTIC OLD NUBIAN FULL STOP..COPTIC MORPHOLO
2D00..2D25 ; PVALID # GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LET
2D26..2D2F ; UNASSIGNED # <reserved>..<reserved>
2D30..2D65 ; PVALID # TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ
2D66..2D6E ; UNASSIGNED # <reserved>..<reserved>
2D6F ; DISALLOWED # TIFINAGH MODIFIER LETTER LABIALIZATION MARK
2D70..2D7F ; UNASSIGNED # <reserved>..<reserved>
2D80..2D96 ; PVALID # ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGW
2D97..2D9F ; UNASSIGNED # <reserved>..<reserved>
2DA0..2DA6 ; PVALID # ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO
2DA7 ; UNASSIGNED # <reserved>
2DA8..2DAE ; PVALID # ETHIOPIC SYLLABLE CCA..ETHIOPIC SYLLABLE CCO
2DAF ; UNASSIGNED # <reserved>
2DB0..2DB6 ; PVALID # ETHIOPIC SYLLABLE ZZA..ETHIOPIC SYLLABLE ZZO
2DB7 ; UNASSIGNED # <reserved>
2DB8..2DBE ; PVALID # ETHIOPIC SYLLABLE CCHA..ETHIOPIC SYLLABLE CC
2DBF ; UNASSIGNED # <reserved>
2DC0..2DC6 ; PVALID # ETHIOPIC SYLLABLE QYA..ETHIOPIC SYLLABLE QYO
2DC7 ; UNASSIGNED # <reserved>
2DC8..2DCE ; PVALID # ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO
2DCF ; UNASSIGNED # <reserved>
2DD0..2DD6 ; PVALID # ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO
2DD7 ; UNASSIGNED # <reserved>
2DD8..2DDE ; PVALID # ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO
2DDF ; UNASSIGNED # <reserved>
2DE0..2DFF ; PVALID # COMBINING CYRILLIC LETTER BE..COMBINING CYRI
2E00..2E2E ; DISALLOWED # RIGHT ANGLE SUBSTITUTION MARKER..REVERSED QU
2E2F ; PVALID # VERTICAL TILDE
2E30 ; DISALLOWED # RING POINT
2E31..2E7F ; UNASSIGNED # <reserved>..<reserved>
2E80..2E99 ; DISALLOWED # CJK RADICAL REPEAT..CJK RADICAL RAP
2E9A ; UNASSIGNED # <reserved>
2E9B..2EF3 ; DISALLOWED # CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED
2EF4..2EFF ; UNASSIGNED # <reserved>..<reserved>
2F00..2FD5 ; DISALLOWED # KANGXI RADICAL ONE..KANGXI RADICAL FLUTE
2FD6..2FEF ; UNASSIGNED # <reserved>..<reserved>
2FF0..2FFB ; DISALLOWED # IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RI
2FFC..2FFF ; UNASSIGNED # <reserved>..<reserved>
3000..3004 ; DISALLOWED # IDEOGRAPHIC SPACE..JAPANESE INDUSTRIAL STAND
3005 ; CONTEXTO # IDEOGRAPHIC ITERATION MARK
3006..3007 ; PVALID # IDEOGRAPHIC CLOSING MARK..IDEOGRAPHIC NUMBER
3008..3029 ; DISALLOWED # LEFT ANGLE BRACKET..HANGZHOU NUMERAL NINE
302A..302F ; PVALID # IDEOGRAPHIC LEVEL TONE MARK..HANGUL DOUBLE D
3030 ; DISALLOWED # WAVY DASH
3031..3035 ; PVALID # VERTICAL KANA REPEAT MARK..VERTICAL KANA REP
3036..303A ; DISALLOWED # CIRCLED POSTAL MARK..HANGZHOU NUMERAL THIRTY
303B ; CONTEXTO # VERTICAL IDEOGRAPHIC ITERATION MARK
303C ; PVALID # MASU MARK
303D..303F ; DISALLOWED # PART ALTERNATION MARK..IDEOGRAPHIC HALF FILL
3040 ; UNASSIGNED # <reserved>
3041..3096 ; PVALID # HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMA
3097..3098 ; UNASSIGNED # <reserved>..<reserved>
3099..309A ; PVALID # COMBINING KATAKANA-HIRAGANA VOICED SOUND MAR
309B..309C ; DISALLOWED # KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKAN
309D..309E ; PVALID # HIRAGANA ITERATION MARK..HIRAGANA VOICED ITE
309F..30A0 ; DISALLOWED # HIRAGANA DIGRAPH YORI..KATAKANA-HIRAGANA DOU
30A1..30FA ; PVALID # KATAKANA LETTER SMALL A..KATAKANA LETTER VO
30FB ; CONTEXTO # KATAKANA MIDDLE DOT
30FC..30FE ; PVALID # KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATA
30FF ; DISALLOWED # KATAKANA DIGRAPH KOTO
3100..3104 ; UNASSIGNED # <reserved>..<reserved>
3105..312D ; PVALID # BOPOMOFO LETTER B..BOPOMOFO LETTER IH
312E..3130 ; UNASSIGNED # <reserved>..<reserved>
3131..318E ; DISALLOWED # HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
318F ; UNASSIGNED # <reserved>
3190..319F ; DISALLOWED # IDEOGRAPHIC ANNOTATION LINKING MARK..IDEOGRA
31A0..31B7 ; PVALID # BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H
31B8..31BF ; UNASSIGNED # <reserved>..<reserved>
31C0..31E3 ; DISALLOWED # CJK STROKE T..CJK STROKE Q
31E4..31EF ; UNASSIGNED # <reserved>..<reserved>
31F0..31FF ; PVALID # KATAKANA LETTER SMALL KU..KATAKANA LETTER SM
3200..321E ; DISALLOWED # PARENTHESIZED HANGUL KIYEOK..PARENTHESIZED K
321F ; UNASSIGNED # <reserved>
3220..3243 ; DISALLOWED # PARENTHESIZED IDEOGRAPH ONE..PARENTHESIZED I
3244..324F ; UNASSIGNED # <reserved>..<reserved>
3250..32FE ; DISALLOWED # PARTNERSHIP SIGN..CIRCLED KATAKANA WO
32FF ; UNASSIGNED # <reserved>
3300..33FF ; DISALLOWED # SQUARE APAATO..SQUARE GAL
3400..4DB5 ; PVALID # <CJK Ideograph Extension A>..<CJK Ideograph
4DB6..4DBF ; UNASSIGNED # <reserved>..<reserved>
4DC0..4DFF ; DISALLOWED # HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM F
4E00..9FC3 ; PVALID # <CJK Ideograph>..<CJK Ideograph>
9FC4..9FFF ; UNASSIGNED # <reserved>..<reserved>
A000..A48C ; PVALID # YI SYLLABLE IT..YI SYLLABLE YYR
A48D..A48F ; UNASSIGNED # <reserved>..<reserved>
A490..A4C6 ; DISALLOWED # YI RADICAL QOT..YI RADICAL KE
A4C7..A4FF ; UNASSIGNED # <reserved>..<reserved>
A500..A60C ; PVALID # VAI SYLLABLE EE..VAI SYLLABLE LENGTHENER
A60D..A60F ; DISALLOWED # VAI COMMA..VAI QUESTION MARK
A610..A62B ; PVALID # VAI SYLLABLE NDOLE FA..VAI SYLLABLE NDOLE DO
A62C..A63F ; UNASSIGNED # <reserved>..<reserved>
A640 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZEMLYA
A641 ; PVALID # CYRILLIC SMALL LETTER ZEMLYA
A642 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZELO
A643 ; PVALID # CYRILLIC SMALL LETTER DZELO
A644 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED DZE
A645 ; PVALID # CYRILLIC SMALL LETTER REVERSED DZE
A646 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTA
A647 ; PVALID # CYRILLIC SMALL LETTER IOTA
A648 ; DISALLOWED # CYRILLIC CAPITAL LETTER DJERV
A649 ; PVALID # CYRILLIC SMALL LETTER DJERV
A64A ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOGRAPH UK
A64B ; PVALID # CYRILLIC SMALL LETTER MONOGRAPH UK
A64C ; DISALLOWED # CYRILLIC CAPITAL LETTER BROAD OMEGA
A64D ; PVALID # CYRILLIC SMALL LETTER BROAD OMEGA
A64E ; DISALLOWED # CYRILLIC CAPITAL LETTER NEUTRAL YER
A64F ; PVALID # CYRILLIC SMALL LETTER NEUTRAL YER
A650 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH BACK YER
A651 ; PVALID # CYRILLIC SMALL LETTER YERU WITH BACK YER
A652 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED YAT
A653 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED YAT
A654 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED YU
A655 ; PVALID # CYRILLIC SMALL LETTER REVERSED YU
A656 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED A
A657 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED A
A658 ; DISALLOWED # CYRILLIC CAPITAL LETTER CLOSED LITTLE YUS
A659 ; PVALID # CYRILLIC SMALL LETTER CLOSED LITTLE YUS
A65A ; DISALLOWED # CYRILLIC CAPITAL LETTER BLENDED YUS
A65B ; PVALID # CYRILLIC SMALL LETTER BLENDED YUS
A65C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED CLOSED LITT
A65D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED CLOSED LITTLE
A65E ; DISALLOWED # CYRILLIC CAPITAL LETTER YN
A65F ; PVALID # CYRILLIC SMALL LETTER YN
A660..A661 ; UNASSIGNED # <reserved>..<reserved>
A662 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT DE
A663 ; PVALID # CYRILLIC SMALL LETTER SOFT DE
A664 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EL
A665 ; PVALID # CYRILLIC SMALL LETTER SOFT EL
A666 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EM
A667 ; PVALID # CYRILLIC SMALL LETTER SOFT EM
A668 ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOCULAR O
A669 ; PVALID # CYRILLIC SMALL LETTER MONOCULAR O
A66A ; DISALLOWED # CYRILLIC CAPITAL LETTER BINOCULAR O
A66B ; PVALID # CYRILLIC SMALL LETTER BINOCULAR O
A66C ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE MONOCULAR O
A66D..A66F ; PVALID # CYRILLIC SMALL LETTER DOUBLE MONOCULAR O..CO
A670..A673 ; DISALLOWED # COMBINING CYRILLIC TEN MILLIONS SIGN..SLAVON
A674..A67B ; UNASSIGNED # <reserved>..<reserved>
A67C..A67D ; PVALID # COMBINING CYRILLIC KAVYKA..COMBINING CYRILLI
A67E ; DISALLOWED # CYRILLIC KAVYKA
A67F ; PVALID # CYRILLIC PAYEROK
A680 ; DISALLOWED # CYRILLIC CAPITAL LETTER DWE
A681 ; PVALID # CYRILLIC SMALL LETTER DWE
A682 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZWE
A683 ; PVALID # CYRILLIC SMALL LETTER DZWE
A684 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHWE
A685 ; PVALID # CYRILLIC SMALL LETTER ZHWE
A686 ; DISALLOWED # CYRILLIC CAPITAL LETTER CCHE
A687 ; PVALID # CYRILLIC SMALL LETTER CCHE
A688 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZE
A689 ; PVALID # CYRILLIC SMALL LETTER DZZE
A68A ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH MIDDLE HOOK
A68B ; PVALID # CYRILLIC SMALL LETTER TE WITH MIDDLE HOOK
A68C ; DISALLOWED # CYRILLIC CAPITAL LETTER TWE
A68D ; PVALID # CYRILLIC SMALL LETTER TWE
A68E ; DISALLOWED # CYRILLIC CAPITAL LETTER TSWE
A68F ; PVALID # CYRILLIC SMALL LETTER TSWE
A690 ; DISALLOWED # CYRILLIC CAPITAL LETTER TSSE
A691 ; PVALID # CYRILLIC SMALL LETTER TSSE
A692 ; DISALLOWED # CYRILLIC CAPITAL LETTER TCHE
A693 ; PVALID # CYRILLIC SMALL LETTER TCHE
A694 ; DISALLOWED # CYRILLIC CAPITAL LETTER HWE
A695 ; PVALID # CYRILLIC SMALL LETTER HWE
A696 ; DISALLOWED # CYRILLIC CAPITAL LETTER SHWE
A697 ; PVALID # CYRILLIC SMALL LETTER SHWE
A698..A6FF ; UNASSIGNED # <reserved>..<reserved>
A700..A716 ; DISALLOWED # MODIFIER LETTER CHINESE TONE YIN PING..MODIF
A717..A71F ; PVALID # MODIFIER LETTER DOT VERTICAL BAR..MODIFIER L
A720..A722 ; DISALLOWED # MODIFIER LETTER STRESS AND HIGH TONE..LATIN
A723 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL ALEF
A724 ; DISALLOWED # LATIN CAPITAL LETTER EGYPTOLOGICAL AIN
A725 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL AIN
A726 ; DISALLOWED # LATIN CAPITAL LETTER HENG
A727 ; PVALID # LATIN SMALL LETTER HENG
A728 ; DISALLOWED # LATIN CAPITAL LETTER TZ
A729 ; PVALID # LATIN SMALL LETTER TZ
A72A ; DISALLOWED # LATIN CAPITAL LETTER TRESILLO
A72B ; PVALID # LATIN SMALL LETTER TRESILLO
A72C ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO
A72D ; PVALID # LATIN SMALL LETTER CUATRILLO
A72E ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO WITH COMMA
A72F..A731 ; PVALID # LATIN SMALL LETTER CUATRILLO WITH COMMA..LAT
A732 ; DISALLOWED # LATIN CAPITAL LETTER AA
A733 ; PVALID # LATIN SMALL LETTER AA
A734 ; DISALLOWED # LATIN CAPITAL LETTER AO
A735 ; PVALID # LATIN SMALL LETTER AO
A736 ; DISALLOWED # LATIN CAPITAL LETTER AU
A737 ; PVALID # LATIN SMALL LETTER AU
A738 ; DISALLOWED # LATIN CAPITAL LETTER AV
A739 ; PVALID # LATIN SMALL LETTER AV
A73A ; DISALLOWED # LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR
A73B ; PVALID # LATIN SMALL LETTER AV WITH HORIZONTAL BAR
A73C ; DISALLOWED # LATIN CAPITAL LETTER AY
A73D ; PVALID # LATIN SMALL LETTER AY
A73E ; DISALLOWED # LATIN CAPITAL LETTER REVERSED C WITH DOT
A73F ; PVALID # LATIN SMALL LETTER REVERSED C WITH DOT
A740 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE
A741 ; PVALID # LATIN SMALL LETTER K WITH STROKE
A742 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DIAGONAL STROKE
A743 ; PVALID # LATIN SMALL LETTER K WITH DIAGONAL STROKE
A744 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE AND DIAGO
A745 ; PVALID # LATIN SMALL LETTER K WITH STROKE AND DIAGONA
A746 ; DISALLOWED # LATIN CAPITAL LETTER BROKEN L
A747 ; PVALID # LATIN SMALL LETTER BROKEN L
A748 ; DISALLOWED # LATIN CAPITAL LETTER L WITH HIGH STROKE
A749 ; PVALID # LATIN SMALL LETTER L WITH HIGH STROKE
A74A ; DISALLOWED # LATIN CAPITAL LETTER O WITH LONG STROKE OVER
A74B ; PVALID # LATIN SMALL LETTER O WITH LONG STROKE OVERLA
A74C ; DISALLOWED # LATIN CAPITAL LETTER O WITH LOOP
A74D ; PVALID # LATIN SMALL LETTER O WITH LOOP
A74E ; DISALLOWED # LATIN CAPITAL LETTER OO
A74F ; PVALID # LATIN SMALL LETTER OO
A750 ; DISALLOWED # LATIN CAPITAL LETTER P WITH STROKE THROUGH D
A751 ; PVALID # LATIN SMALL LETTER P WITH STROKE THROUGH DES
A752 ; DISALLOWED # LATIN CAPITAL LETTER P WITH FLOURISH
A753 ; PVALID # LATIN SMALL LETTER P WITH FLOURISH
A754 ; DISALLOWED # LATIN CAPITAL LETTER P WITH SQUIRREL TAIL
A755 ; PVALID # LATIN SMALL LETTER P WITH SQUIRREL TAIL
A756 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH STROKE THROUGH D
A757 ; PVALID # LATIN SMALL LETTER Q WITH STROKE THROUGH DES
A758 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE
A759 ; PVALID # LATIN SMALL LETTER Q WITH DIAGONAL STROKE
A75A ; DISALLOWED # LATIN CAPITAL LETTER R ROTUNDA
A75B ; PVALID # LATIN SMALL LETTER R ROTUNDA
A75C ; DISALLOWED # LATIN CAPITAL LETTER RUM ROTUNDA
A75D ; PVALID # LATIN SMALL LETTER RUM ROTUNDA
A75E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DIAGONAL STROKE
A75F ; PVALID # LATIN SMALL LETTER V WITH DIAGONAL STROKE
A760 ; DISALLOWED # LATIN CAPITAL LETTER VY
A761 ; PVALID # LATIN SMALL LETTER VY
A762 ; DISALLOWED # LATIN CAPITAL LETTER VISIGOTHIC Z
A763 ; PVALID # LATIN SMALL LETTER VISIGOTHIC Z
A764 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE
A765 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE
A766 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE THROU
A767 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE THROUGH
A768 ; DISALLOWED # LATIN CAPITAL LETTER VEND
A769 ; PVALID # LATIN SMALL LETTER VEND
A76A ; DISALLOWED # LATIN CAPITAL LETTER ET
A76B ; PVALID # LATIN SMALL LETTER ET
A76C ; DISALLOWED # LATIN CAPITAL LETTER IS
A76D ; PVALID # LATIN SMALL LETTER IS
A76E ; DISALLOWED # LATIN CAPITAL LETTER CON
A76F ; PVALID # LATIN SMALL LETTER CON
A770 ; DISALLOWED # MODIFIER LETTER US
A771..A778 ; PVALID # LATIN SMALL LETTER DUM..LATIN SMALL LETTER U
A779 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR D
A77A ; PVALID # LATIN SMALL LETTER INSULAR D
A77B ; DISALLOWED # LATIN CAPITAL LETTER INSULAR F
A77C ; PVALID # LATIN SMALL LETTER INSULAR F
A77D..A77E ; DISALLOWED # LATIN CAPITAL LETTER INSULAR G..LATIN CAPITA
A77F ; PVALID # LATIN SMALL LETTER TURNED INSULAR G
A780 ; DISALLOWED # LATIN CAPITAL LETTER TURNED L
A781 ; PVALID # LATIN SMALL LETTER TURNED L
A782 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR R
A783 ; PVALID # LATIN SMALL LETTER INSULAR R
A784 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR S
A785 ; PVALID # LATIN SMALL LETTER INSULAR S
A786 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR T
A787..A788 ; PVALID # LATIN SMALL LETTER INSULAR T..MODIFIER LETTE
A789..A78B ; DISALLOWED # MODIFIER LETTER COLON..LATIN CAPITAL LETTER
A78C ; PVALID # LATIN SMALL LETTER SALTILLO
A78D..A7FA ; UNASSIGNED # <reserved>..<reserved>
A7FB..A827 ; PVALID # LATIN EPIGRAPHIC LETTER REVERSED F..SYLOTI N
A828..A82B ; DISALLOWED # SYLOTI NAGRI POETRY MARK-1..SYLOTI NAGRI POE
A82C..A83F ; UNASSIGNED # <reserved>..<reserved>
A840..A873 ; PVALID # PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABI
A874..A877 ; DISALLOWED # PHAGS-PA SINGLE HEAD MARK..PHAGS-PA MARK DOU
A878..A87F ; UNASSIGNED # <reserved>..<reserved>
A880..A8C4 ; PVALID # SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VI
A8C5..A8CD ; UNASSIGNED # <reserved>..<reserved>
A8CE..A8CF ; DISALLOWED # SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
A8D0..A8D9 ; PVALID # SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE
A8DA..A8FF ; UNASSIGNED # <reserved>..<reserved>
A900..A92D ; PVALID # KAYAH LI DIGIT ZERO..KAYAH LI TONE CALYA PLO
A92E..A92F ; DISALLOWED # KAYAH LI SIGN CWI..KAYAH LI SIGN SHYA
A930..A953 ; PVALID # REJANG LETTER KA..REJANG VIRAMA
A954..A95E ; UNASSIGNED # <reserved>..<reserved>
A95F ; DISALLOWED # REJANG SECTION MARK
A960..A9FF ; UNASSIGNED # <reserved>..<reserved>
AA00..AA36 ; PVALID # CHAM LETTER A..CHAM CONSONANT SIGN WA
AA37..AA3F ; UNASSIGNED # <reserved>..<reserved>
AA40..AA4D ; PVALID # CHAM LETTER FINAL K..CHAM CONSONANT SIGN FIN
AA4E..AA4F ; UNASSIGNED # <reserved>..<reserved>
AA50..AA59 ; PVALID # CHAM DIGIT ZERO..CHAM DIGIT NINE
AA5A..AA5B ; UNASSIGNED # <reserved>..<reserved>
AA5C..AA5F ; DISALLOWED # CHAM PUNCTUATION SPIRAL..CHAM PUNCTUATION TR
AA60..ABFF ; UNASSIGNED # <reserved>..<reserved>
AC00..D7A3 ; PVALID # <Hangul Syllable>..<Hangul Syllable>
D7A4..D7FF ; UNASSIGNED # <reserved>..<reserved>
D800..FA0D ; DISALLOWED # <Non Private Use High Surrogate>..CJK COMPAT
FA0E..FA0F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPAT
FA10 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA10
FA11 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA11
FA12 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA12
FA13..FA14 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPAT
FA15..FA1E ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA15..CJK COMPAT
FA1F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA1F
FA20 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA20
FA21 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA21
FA22 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA22
FA23..FA24 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA23..CJK COMPAT
FA25..FA26 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA25..CJK COMPAT
FA27..FA29 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA27..CJK COMPAT
FA2A..FA2D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA2A..CJK COMPAT
FA2E..FA2F ; UNASSIGNED # <reserved>..<reserved>
FA30..FA6A ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPAT
FA6B..FA6F ; UNASSIGNED # <reserved>..<reserved>
FA70..FAD9 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPAT
FADA..FAFF ; UNASSIGNED # <reserved>..<reserved>
FB00..FB06 ; DISALLOWED # LATIN SMALL LIGATURE FF..LATIN SMALL LIGATUR
FB07..FB12 ; UNASSIGNED # <reserved>..<reserved>
FB13..FB17 ; DISALLOWED # ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SM
FB18..FB1C ; UNASSIGNED # <reserved>..<reserved>
FB1D ; DISALLOWED # HEBREW LETTER YOD WITH HIRIQ
FB1E ; PVALID # HEBREW POINT JUDEO-SPANISH VARIKA
FB1F..FB36 ; DISALLOWED # HEBREW LIGATURE YIDDISH YOD YOD PATAH..HEBRE
FB37 ; UNASSIGNED # <reserved>
FB38..FB3C ; DISALLOWED # HEBREW LETTER TET WITH DAGESH..HEBREW LETTER
FB3D ; UNASSIGNED # <reserved>
FB3E ; DISALLOWED # HEBREW LETTER MEM WITH DAGESH
FB3F ; UNASSIGNED # <reserved>
FB40..FB41 ; DISALLOWED # HEBREW LETTER NUN WITH DAGESH..HEBREW LETTER
FB42 ; UNASSIGNED # <reserved>
FB43..FB44 ; DISALLOWED # HEBREW LETTER FINAL PE WITH DAGESH..HEBREW L
FB45 ; UNASSIGNED # <reserved>
FB46..FBB1 ; DISALLOWED # HEBREW LETTER TSADI WITH DAGESH..ARABIC LETT
FBB2..FBD2 ; UNASSIGNED # <reserved>..<reserved>
FBD3..FD3F ; DISALLOWED # ARABIC LETTER NG ISOLATED FORM..ORNATE RIGHT
FD40..FD4F ; UNASSIGNED # <reserved>..<reserved>
FD50..FD8F ; DISALLOWED # ARABIC LIGATURE TEH WITH JEEM WITH MEEM INIT
FD90..FD91 ; UNASSIGNED # <reserved>..<reserved>
FD92..FDC7 ; DISALLOWED # ARABIC LIGATURE MEEM WITH JEEM WITH KHAH INI
FDC8..FDCF ; UNASSIGNED # <reserved>..<reserved>
FDD0..FDFD ; DISALLOWED # <reserved>..ARABIC LIGATURE BISMILLAH AR-RAH
FDFE..FDFF ; UNASSIGNED # <reserved>..<reserved>
FE00..FE19 ; DISALLOWED # VARIATION SELECTOR-1..PRESENTATION FORM FOR
FE1A..FE1F ; UNASSIGNED # <reserved>..<reserved>
FE20..FE26 ; PVALID # COMBINING LIGATURE LEFT HALF..COMBINING CONJ
FE27..FE2F ; UNASSIGNED # <reserved>..<reserved>
FE30..FE52 ; DISALLOWED # PRESENTATION FORM FOR VERTICAL TWO DOT LEADE
FE53 ; UNASSIGNED # <reserved>
FE54..FE66 ; DISALLOWED # SMALL SEMICOLON..SMALL EQUALS SIGN
FE67 ; UNASSIGNED # <reserved>
FE68..FE6B ; DISALLOWED # SMALL REVERSE SOLIDUS..SMALL COMMERCIAL AT
FE6C..FE6F ; UNASSIGNED # <reserved>..<reserved>
FE70..FE72 ; DISALLOWED # ARABIC FATHATAN ISOLATED FORM..ARABIC DAMMAT
FE73 ; PVALID # ARABIC TAIL FRAGMENT
FE74 ; DISALLOWED # ARABIC KASRATAN ISOLATED FORM
FE75 ; UNASSIGNED # <reserved>
FE76..FEFC ; DISALLOWED # ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE
FEFD..FEFE ; UNASSIGNED # <reserved>..<reserved>
FEFF ; DISALLOWED # ZERO WIDTH NO-BREAK SPACE
FF00 ; UNASSIGNED # <reserved>
FF01..FFBE ; DISALLOWED # FULLWIDTH EXCLAMATION MARK..HALFWIDTH HANGUL
FFBF..FFC1 ; UNASSIGNED # <reserved>..<reserved>
FFC2..FFC7 ; DISALLOWED # HALFWIDTH HANGUL LETTER A..HALFWIDTH HANGUL
FFC8..FFC9 ; UNASSIGNED # <reserved>..<reserved>
FFCA..FFCF ; DISALLOWED # HALFWIDTH HANGUL LETTER YEO..HALFWIDTH HANGU
FFD0..FFD1 ; UNASSIGNED # <reserved>..<reserved>
FFD2..FFD7 ; DISALLOWED # HALFWIDTH HANGUL LETTER YO..HALFWIDTH HANGUL
FFD8..FFD9 ; UNASSIGNED # <reserved>..<reserved>
FFDA..FFDC ; DISALLOWED # HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
FFDD..FFDF ; UNASSIGNED # <reserved>..<reserved>
FFE0..FFE6 ; DISALLOWED # FULLWIDTH CENT SIGN..FULLWIDTH WON SIGN
FFE7 ; UNASSIGNED # <reserved>
FFE8..FFEE ; DISALLOWED # HALFWIDTH FORMS LIGHT VERTICAL..HALFWIDTH WH
FFEF ; UNASSIGNED # <reserved>
FFF0..FFFF ; DISALLOWED # <reserved>..<reserved>
10000..1000B; PVALID # LINEAR B SYLLABLE B008 A..LINEAR B SYLLABLE
1000C ; UNASSIGNED # <reserved>
1000D..10026; PVALID # LINEAR B SYLLABLE B036 JO..LINEAR B SYLLABLE
10027 ; UNASSIGNED # <reserved>
10028..1003A; PVALID # LINEAR B SYLLABLE B060 RA..LINEAR B SYLLABLE
1003B ; UNASSIGNED # <reserved>
1003C..1003D; PVALID # LINEAR B SYLLABLE B017 ZA..LINEAR B SYLLABLE
1003E ; UNASSIGNED # <reserved>
1003F..1004D; PVALID # LINEAR B SYLLABLE B020 ZO..LINEAR B SYLLABLE
1004E..1004F; UNASSIGNED # <reserved>..<reserved>
10050..1005D; PVALID # LINEAR B SYMBOL B018..LINEAR B SYMBOL B089
1005E..1007F; UNASSIGNED # <reserved>..<reserved>
10080..100FA; PVALID # LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRA
100FB..100FF; UNASSIGNED # <reserved>..<reserved>
10100..10102; DISALLOWED # AEGEAN WORD SEPARATOR LINE..AEGEAN CHECK MAR
10103..10106; UNASSIGNED # <reserved>..<reserved>
10107..10133; DISALLOWED # AEGEAN NUMBER ONE..AEGEAN NUMBER NINETY THOU
10134..10136; UNASSIGNED # <reserved>..<reserved>
10137..1018A; DISALLOWED # AEGEAN WEIGHT BASE UNIT..GREEK ZERO SIGN
1018B..1018F; UNASSIGNED # <reserved>..<reserved>
10190..1019B; DISALLOWED # ROMAN SEXTANS SIGN..ROMAN CENTURIAL SIGN
1019C..101CF; UNASSIGNED # <reserved>..<reserved>
101D0..101FC; DISALLOWED # PHAISTOS DISC SIGN PEDESTRIAN..PHAISTOS DISC
101FD ; PVALID # PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE
101FE..1027F; UNASSIGNED # <reserved>..<reserved>
10280..1029C; PVALID # LYCIAN LETTER A..LYCIAN LETTER X
1029D..1029F; UNASSIGNED # <reserved>..<reserved>
102A0..102D0; PVALID # CARIAN LETTER A..CARIAN LETTER UUU3
102D1..102FF; UNASSIGNED # <reserved>..<reserved>
10300..1031E; PVALID # OLD ITALIC LETTER A..OLD ITALIC LETTER UU
1031F ; UNASSIGNED # <reserved>
10320..10323; DISALLOWED # OLD ITALIC NUMERAL ONE..OLD ITALIC NUMERAL F
10324..1032F; UNASSIGNED # <reserved>..<reserved>
10330..10340; PVALID # GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA
10341 ; DISALLOWED # GOTHIC LETTER NINETY
10342..10349; PVALID # GOTHIC LETTER RAIDA..GOTHIC LETTER OTHAL
1034A ; DISALLOWED # GOTHIC LETTER NINE HUNDRED
1034B..1037F; UNASSIGNED # <reserved>..<reserved>
10380..1039D; PVALID # UGARITIC LETTER ALPA..UGARITIC LETTER SSU
1039E ; UNASSIGNED # <reserved>
1039F ; DISALLOWED # UGARITIC WORD DIVIDER
103A0..103C3; PVALID # OLD PERSIAN SIGN A..OLD PERSIAN SIGN HA
103C4..103C7; UNASSIGNED # <reserved>..<reserved>
103C8..103CF; PVALID      # OLD PERSIAN SIGN AURAMAZDAA..OLD PERSIAN SIG
103D0..103D5; DISALLOWED  # OLD PERSIAN WORD DIVIDER..OLD PERSIAN NUMBER
103D6..103FF; UNASSIGNED  # <reserved>..<reserved>
10400..10427; DISALLOWED  # DESERET CAPITAL LETTER LONG I..DESERET CAPIT
10428..1049D; PVALID      # DESERET SMALL LETTER LONG I..OSMANYA LETTER
1049E..1049F; UNASSIGNED  # <reserved>..<reserved>
104A0..104A9; PVALID      # OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE
104AA..107FF; UNASSIGNED  # <reserved>..<reserved>
10800..10805; PVALID      # CYPRIOT SYLLABLE A..CYPRIOT SYLLABLE JA
10806..10807; UNASSIGNED  # <reserved>..<reserved>
10808       ; PVALID      # CYPRIOT SYLLABLE JO
10809       ; UNASSIGNED  # <reserved>
1080A..10835; PVALID      # CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO
10836       ; UNASSIGNED  # <reserved>
10837..10838; PVALID      # CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE
10839..1083B; UNASSIGNED  # <reserved>..<reserved>
1083C       ; PVALID      # CYPRIOT SYLLABLE ZA
1083D..1083E; UNASSIGNED  # <reserved>..<reserved>
1083F       ; PVALID      # CYPRIOT SYLLABLE ZO
10840..108FF; UNASSIGNED  # <reserved>..<reserved>
10900..10915; PVALID      # PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU
10916..10919; DISALLOWED  # PHOENICIAN NUMBER ONE..PHOENICIAN NUMBER ONE
1091A..1091E; UNASSIGNED  # <reserved>..<reserved>
1091F       ; DISALLOWED  # PHOENICIAN WORD SEPARATOR
10920..10939; PVALID      # LYDIAN LETTER A..LYDIAN LETTER C
1093A..1093E; UNASSIGNED  # <reserved>..<reserved>
1093F       ; DISALLOWED  # LYDIAN TRIANGULAR MARK
10940..109FF; UNASSIGNED  # <reserved>..<reserved>
10A00..10A03; PVALID      # KHAROSHTHI LETTER A..KHAROSHTHI VOWEL SIGN V
10A04       ; UNASSIGNED  # <reserved>
10A05..10A06; PVALID      # KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SI
10A07..10A0B; UNASSIGNED  # <reserved>..<reserved>
10A0C..10A13; PVALID      # KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI LET
10A14       ; UNASSIGNED  # <reserved>
10A15..10A17; PVALID      # KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
10A18       ; UNASSIGNED  # <reserved>
10A19..10A33; PVALID      # KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTT
10A34..10A37; UNASSIGNED  # <reserved>..<reserved>
10A38..10A3A; PVALID      # KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN D
10A3B..10A3E; UNASSIGNED  # <reserved>..<reserved>
10A3F       ; PVALID      # KHAROSHTHI VIRAMA
10A40..10A47; DISALLOWED  # KHAROSHTHI DIGIT ONE..KHAROSHTHI NUMBER ONE
10A48..10A4F; UNASSIGNED  # <reserved>..<reserved>
10A50..10A58; DISALLOWED  # KHAROSHTHI PUNCTUATION DOT..KHAROSHTHI PUNCT
10A59..11FFF; UNASSIGNED  # <reserved>..<reserved>
12000..1236E; PVALID      # CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM
1236F..123FF; UNASSIGNED  # <reserved>..<reserved>
12400..12462; DISALLOWED  # CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NU
12463..1246F; UNASSIGNED  # <reserved>..<reserved>
12470..12473; DISALLOWED  # CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD
12474..1CFFF; UNASSIGNED  # <reserved>..<reserved>
1D000..1D0F5; DISALLOWED  # BYZANTINE MUSICAL SYMBOL PSILI..BYZANTINE MU
1D0F6..1D0FF; UNASSIGNED  # <reserved>..<reserved>
1D100..1D24F; DISALLOWED  # MUSICAL SYMBOL SINGLE BARLINE..<reserved>
1D250..1D2FF; UNASSIGNED  # <reserved>..<reserved>
1D300..1D356; DISALLOWED  # MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
1D357..1D35F; UNASSIGNED  # <reserved>..<reserved>
1D360..1D371; DISALLOWED  # COUNTING ROD UNIT DIGIT ONE..COUNTING ROD TE
1D372..1D3FF; UNASSIGNED  # <reserved>..<reserved>
1D400..1D454; DISALLOWED  # MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL IT
1D455       ; UNASSIGNED  # <reserved>
1D456..1D49C; DISALLOWED  # MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SC
1D49D       ; UNASSIGNED  # <reserved>
1D49E..1D49F; DISALLOWED  # MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL
1D4A0..1D4A1; UNASSIGNED  # <reserved>..<reserved>
1D4A2       ; DISALLOWED  # MATHEMATICAL SCRIPT CAPITAL G
1D4A3..1D4A4; UNASSIGNED  # <reserved>..<reserved>
1D4A5..1D4A6; DISALLOWED  # MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL
1D4A7..1D4A8; UNASSIGNED  # <reserved>..<reserved>
1D4A9..1D4AC; DISALLOWED  # MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL
1D4AD       ; UNASSIGNED  # <reserved>
1D4AE..1D4B9; DISALLOWED  # MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL
1D4BA       ; UNASSIGNED  # <reserved>
1D4BB       ; DISALLOWED  # MATHEMATICAL SCRIPT SMALL F
1D4BC       ; UNASSIGNED  # <reserved>
1D4BD..1D4C3; DISALLOWED  # MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SC
1D4C4       ; UNASSIGNED  # <reserved>
1D4C5..1D505; DISALLOWED  # MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FR
1D506       ; UNASSIGNED  # <reserved>
1D507..1D50A; DISALLOWED  # MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL
1D50B..1D50C; UNASSIGNED  # <reserved>..<reserved>
1D50D..1D514; DISALLOWED  # MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL
1D515       ; UNASSIGNED  # <reserved>
1D516..1D51C; DISALLOWED  # MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL
1D51D       ; UNASSIGNED  # <reserved>
1D51E..1D539; DISALLOWED  # MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL D
1D53A       ; UNASSIGNED  # <reserved>
1D53B..1D53E; DISALLOWED  # MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEM
1D53F       ; UNASSIGNED  # <reserved>
1D540..1D544; DISALLOWED  # MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEM
1D545       ; UNASSIGNED  # <reserved>
1D546       ; DISALLOWED  # MATHEMATICAL DOUBLE-STRUCK CAPITAL O
1D547..1D549; UNASSIGNED  # <reserved>..<reserved>
1D54A..1D550; DISALLOWED  # MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEM
1D551       ; UNASSIGNED  # <reserved>
1D552..1D6A5; DISALLOWED  # MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMAT
1D6A6..1D6A7; UNASSIGNED  # <reserved>..<reserved>
1D6A8..1D7CB; DISALLOWED  # MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICA
1D7CC..1D7CD; UNASSIGNED  # <reserved>..<reserved>
1D7CE..1D7FF; DISALLOWED  # MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL M
1D800..1EFFF; UNASSIGNED  # <reserved>..<reserved>
1F000..1F02B; DISALLOWED  # MAHJONG TILE EAST WIND..MAHJONG TILE BACK
1F02C..1F02F; UNASSIGNED  # <reserved>..<reserved>
1F030..1F093; DISALLOWED  # DOMINO TILE HORIZONTAL BACK..DOMINO TILE VER
1F094..1FFFD; UNASSIGNED  # <reserved>..<reserved>
1FFFE..1FFFF; DISALLOWED  # <reserved>..<reserved>
20000..2A6D6; PVALID      # <CJK Ideograph Extension B>..<CJK Ideograph
2A6D7..2F7FF; UNASSIGNED  # <reserved>..<reserved>
2F800..2FA1D; DISALLOWED  # CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPA
2FA1E..2FFFD; UNASSIGNED  # <reserved>..<reserved>
2FFFE..2FFFF; DISALLOWED  # <reserved>..<reserved>
30000..3FFFD; UNASSIGNED  # <reserved>..<reserved>
3FFFE..3FFFF; DISALLOWED  # <reserved>..<reserved>
40000..4FFFD; UNASSIGNED  # <reserved>..<reserved>
4FFFE..4FFFF; DISALLOWED  # <reserved>..<reserved>
50000..5FFFD; UNASSIGNED  # <reserved>..<reserved>
5FFFE..5FFFF; DISALLOWED  # <reserved>..<reserved>
60000..6FFFD; UNASSIGNED  # <reserved>..<reserved>
6FFFE..6FFFF; DISALLOWED  # <reserved>..<reserved>
70000..7FFFD; UNASSIGNED  # <reserved>..<reserved>
7FFFE..7FFFF; DISALLOWED  # <reserved>..<reserved>
80000..8FFFD; UNASSIGNED  # <reserved>..<reserved>
8FFFE..8FFFF; DISALLOWED  # <reserved>..<reserved>
90000..9FFFD; UNASSIGNED  # <reserved>..<reserved>
9FFFE..9FFFF; DISALLOWED  # <reserved>..<reserved>
A0000..AFFFD; UNASSIGNED  # <reserved>..<reserved>
AFFFE..AFFFF; DISALLOWED  # <reserved>..<reserved>
B0000..BFFFD; UNASSIGNED  # <reserved>..<reserved>
BFFFE..BFFFF; DISALLOWED  # <reserved>..<reserved>
C0000..CFFFD; UNASSIGNED  # <reserved>..<reserved>
CFFFE..CFFFF; DISALLOWED  # <reserved>..<reserved>
D0000..DFFFD; UNASSIGNED  # <reserved>..<reserved>
DFFFE..E0FFF; DISALLOWED  # <reserved>..<reserved>
E1000..EFFFD; UNASSIGNED  # <reserved>..<reserved>
EFFFE..10FFFE; DISALLOWED # <reserved>..<reserved>

9.  References

9.1.  Normative References

[RFC4690]  Klensin, J., Faltstrom, P., Karp, C., and IAB, "Review and
           Recommendations for Internationalized Domain Names
           (IDNs)", RFC 4690, September 2006.
[TR15]     Davis, M. and M. Durst, "Unicode Standard Annex #15,
           Unicode Normalization Forms, an integral part of the
           Unicode Standard",
           <http://unicode.org/unicode/reports/tr15/>.

[Unicode5]
           The Unicode Consortium, "The Unicode Standard, Version
           5.0.0", Boston, MA, Addison-Wesley ISBN 0-321-48091-0,
           2007.

[Unicode51]
           The Unicode Consortium, "The Unicode Standard, Version
           5.1.0", Unicode 5.0.0, Boston, MA, Addison-Wesley ISBN
           0-321-48091-0, as amended by Unicode 5.1.0, 2008,
           <http://www.unicode.org/versions/Unicode5.1.0/>.

9.2.  Informative References

[IDNA2008-Bidi]
           Alvestrand, H. and C. Karp, "An IDNA problem in
           right-to-left scripts", April 2008, <http://www.ietf.org/
           internet-drafts/draft-ietf-idnabis-bidi-00.txt>.

[IDNA2008-protocol]
           Klensin, J., "Internationalizing Domain Names in
           Applications (IDNA): Protocol", April 2007, <http://
           www.ietf.org/internet-drafts/
           draft-ietf-idnabis-protocol-00.txt>.

[IDNA2008-rationale]
           Klensin, J., Ed., "Internationalizing Domain Names for
           Applications (IDNA): Issues, Explanation, and Rationale",
           April 2008, <http://www.ietf.org/internet-drafts/
           draft-ietf-idnabis-issues-00.txt>.

[RFC1035]  Mockapetris, P., "Domain names - implementation and
           specification", STD 13, RFC 1035, November 1987.

[RFC3454]  Hoffman, P. and M. Blanchet, "Preparation of
           Internationalized Strings ("stringprep")", RFC 3454,
           December 2002.

[RFC3491]  Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep
           Profile for Internationalized Domain Names (IDN)",
           RFC 3491, March 2003.

[RFC4713]  Lee, X., Mao, W., Chen, E., Hsu, N., and J. Klensin,
           "Registration and Administration Recommendations for
           Chinese Domain Names", RFC 4713, October 2006.

URIs

[1]  <http://unicode.org/Public/UNIDATA/DerivedCoreProperties.txt>

[2]  <http://unicode.org/errata/>

Author's Address

Patrik Faltstrom (editor)
Cisco Systems

Email: paf@cisco.com
Full Copyright Statement

Copyright (C) The IETF Trust (2008).

This document is subject to the rights, licenses and restrictions
contained in BCP 78, and except as set forth therein, the authors
retain all their rights.

This document and the information contained herein are provided on an
"AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY, THE IETF TRUST AND
THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF
THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Intellectual Property

The IETF takes no position regarding the validity or scope of any
Intellectual Property Rights or other rights that might be claimed to
pertain to the implementation or use of the technology described in
this document or the extent to which any license under such rights
might or might not be available; nor does it represent that it has
made any independent effort to identify any such rights.  Information
on the procedures with respect to rights in RFC documents can be
found in BCP 78 and BCP 79.

Copies of IPR disclosures made to the IETF Secretariat and any
assurances of licenses to be made available, or the result of an
attempt made to obtain a general license or permission for the use of
such proprietary rights by implementers or users of this
specification can be obtained from the IETF on-line IPR repository at
http://www.ietf.org/ipr.

The IETF invites any interested party to bring to its attention any
copyrights, patents or patent applications, or other proprietary
rights that may cover technology that may be required to implement
this standard.  Please address the information to the IETF at
ietf-ipr@ietf.org.