Commit | Line | Data |
---|---|---|
ac71d2a0 UC |
1 | # IndicSyllabicCategory-8.0.0.txt |
2 | # Date: 2015-05-12, 10:00:00 GMT [RP, KW, LI] | |
bd84d130 KW |
3 | # |
4 | # Unicode Character Database | |
ac71d2a0 | 5 | # Copyright (c) 1991-2015 Unicode, Inc. |
bd84d130 | 6 | # For terms of use, see http://www.unicode.org/terms_of_use.html |
09edd811 KW |
7 | # For documentation, see UAX #44: Unicode Character Database, |
8 | # at http://www.unicode.org/reports/tr44/ | |
bd84d130 | 9 | # |
ac71d2a0 | 10 | # This file defines the following property: |
bd84d130 KW |
11 | # |
12 | # Indic_Syllabic_Category enumerated property | |
13 | # | |
ac71d2a0 | 14 | # Scope: This property is aimed at two general problem |
bd84d130 KW |
15 | # areas involving the analysis and processing of Indic scripts: |
16 | # | |
17 | # 1. Specification of syllabic structure. | |
18 | # 2. Specification of segmentation rules. | |
19 | # | |
20 | # Both of these problem areas may benefit from having defined subtypes | |
21 | # of Indic script characters which are relevant to how Indic | |
22 | # syllables (or aksaras) are constructed. Note that rules for | |
23 | # syllabic structure in Indic scripts may differ significantly | |
24 | # from how phonological syllables are defined. | |
25 | # | |
26 | # Format: | |
09edd811 KW |
27 | # Field 0 Unicode code point value or range of code point values |
28 | # Field 1 Indic_Syllabic_Category property value | |
bd84d130 | 29 | # |
09edd811 KW |
30 | # Field 1 is followed by a comment field, starting with the number sign '#', |
31 | # which shows the General_Category property value, the Unicode character name | |
32 | # or names, and, in lines with ranges of code points, the code point count in | |
33 | # square brackets. | |
bd84d130 | 34 | # |
09edd811 KW |
35 | # The scripts assessed as Indic in the structural sense used for the |
36 | # Indic_Syllabic_Category are the following: | |
bd84d130 | 37 | # |
ac71d2a0 UC |
38 | # Ahom, Balinese, Batak, Bengali, Brahmi, Buginese, Buhid, Chakma, |
39 | # Cham, Devanagari, Grantha, Gujarati, Gurmukhi, Hanunoo, Javanese, | |
40 | # Kaithi, Kannada, Kayah Li, Kharoshthi, Khmer, Khojki, Khudawadi, | |
41 | # Lao, Lepcha, Limbu, Mahajani, Malayalam, Meetei Mayek, Modi, | |
42 | # Multani, Myanmar, New Tai Lue, Oriya, Phags-pa, Rejang, Saurashtra, | |
43 | # Sharada, Siddham, Sinhala, Sundanese, Syloti Nagri, Tagalog, | |
44 | # Tagbanwa, Tai Le, Tai Tham, Tai Viet, Takri, Tamil, Telugu, Thai, | |
45 | # Tibetan, and Tirhuta. | |
bd84d130 KW |
46 | # |
47 | # All characters for all other scripts not in that list | |
48 | # take the default value for this property, unless they | |
49 | # are individually listed in this data file. | |
50 | # | |
51 | ||
52 | # ================================================ | |
53 | ||
54 | # Property: Indic_Syllabic_Category | |
55 | # | |
56 | # All code points not explicitly listed for Indic_Syllabic_Category | |
57 | # have the value Other. | |
58 | # | |
59 | # @missing: 0000..10FFFF; Other | |
60 | ||
61 | # ================================================ | |
62 | ||
63 | # Indic_Syllabic_Category=Bindu | |
64 | ||
65 | # Bindu/Anusvara (nasalization or -n) | |
66 | # Excludes various Vedic nasalization signs. | |
67 | ||
68 | # [Not derivable] | |
69 | ||
70 | 0900..0902 ; Bindu # Mn [3] DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANAGARI SIGN ANUSVARA | |
71 | 0981 ; Bindu # Mn BENGALI SIGN CANDRABINDU | |
72 | 0982 ; Bindu # Mc BENGALI SIGN ANUSVARA | |
73 | 0A01..0A02 ; Bindu # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI | |
74 | 0A70 ; Bindu # Mn GURMUKHI TIPPI | |
75 | 0A81..0A82 ; Bindu # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA | |
76 | 0B01 ; Bindu # Mn ORIYA SIGN CANDRABINDU | |
77 | 0B02 ; Bindu # Mc ORIYA SIGN ANUSVARA | |
78 | 0B82 ; Bindu # Mn TAMIL SIGN ANUSVARA | |
09edd811 | 79 | 0C00 ; Bindu # Mn TELUGU SIGN COMBINING CANDRABINDU ABOVE |
bd84d130 | 80 | 0C01..0C02 ; Bindu # Mc [2] TELUGU SIGN CANDRABINDU..TELUGU SIGN ANUSVARA |
09edd811 | 81 | 0C81 ; Bindu # Mn KANNADA SIGN CANDRABINDU |
bd84d130 | 82 | 0C82 ; Bindu # Mc KANNADA SIGN ANUSVARA |
09edd811 | 83 | 0D01 ; Bindu # Mn MALAYALAM SIGN CANDRABINDU |
bd84d130 KW |
84 | 0D02 ; Bindu # Mc MALAYALAM SIGN ANUSVARA |
85 | 0D82 ; Bindu # Mc SINHALA SIGN ANUSVARAYA | |
86 | 0E4D ; Bindu # Mn THAI CHARACTER NIKHAHIT | |
87 | 0ECD ; Bindu # Mn LAO NIGGAHITA | |
88 | 0F7E ; Bindu # Mn TIBETAN SIGN RJES SU NGA RO | |
89 | 0F82..0F83 ; Bindu # Mn [2] TIBETAN SIGN NYI ZLA NAA DA..TIBETAN SIGN SNA LDAN | |
90 | 1036 ; Bindu # Mn MYANMAR SIGN ANUSVARA | |
91 | 17C6 ; Bindu # Mn KHMER SIGN NIKAHIT | |
92 | 1932 ; Bindu # Mn LIMBU SMALL LETTER ANUSVARA | |
93 | 1B00..1B02 ; Bindu # Mn [3] BALINESE SIGN ULU RICEM..BALINESE SIGN CECEK | |
bd84d130 KW |
94 | 1B80 ; Bindu # Mn SUNDANESE SIGN PANYECEK |
95 | 1C34..1C35 ; Bindu # Mc [2] LEPCHA CONSONANT SIGN NYIN-DO..LEPCHA CONSONANT SIGN KANG | |
a9c9e371 KW |
96 | A80B ; Bindu # Mn SYLOTI NAGRI SIGN ANUSVARA |
97 | A873 ; Bindu # Lo PHAGS-PA LETTER CANDRABINDU | |
bd84d130 | 98 | A880 ; Bindu # Mc SAURASHTRA SIGN ANUSVARA |
a9c9e371 | 99 | A980..A981 ; Bindu # Mn [2] JAVANESE SIGN PANYANGGA..JAVANESE SIGN CECAK |
bd84d130 KW |
100 | 10A0E ; Bindu # Mn KHAROSHTHI SIGN ANUSVARA |
101 | 11000 ; Bindu # Mc BRAHMI SIGN CANDRABINDU | |
102 | 11001 ; Bindu # Mn BRAHMI SIGN ANUSVARA | |
09edd811 | 103 | 11080..11081 ; Bindu # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA |
a9c9e371 KW |
104 | 11100..11101 ; Bindu # Mn [2] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN ANUSVARA |
105 | 11180..11181 ; Bindu # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA | |
09edd811 KW |
106 | 11234 ; Bindu # Mn KHOJKI SIGN ANUSVARA |
107 | 112DF ; Bindu # Mn KHUDAWADI SIGN ANUSVARA | |
ac71d2a0 | 108 | 11300..11301 ; Bindu # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU |
09edd811 KW |
109 | 11302 ; Bindu # Mc GRANTHA SIGN ANUSVARA |
110 | 114BF..114C0 ; Bindu # Mn [2] TIRHUTA SIGN CANDRABINDU..TIRHUTA SIGN ANUSVARA | |
111 | 115BC..115BD ; Bindu # Mn [2] SIDDHAM SIGN CANDRABINDU..SIDDHAM SIGN ANUSVARA | |
112 | 1163D ; Bindu # Mn MODI SIGN ANUSVARA | |
7620cb10 | 113 | 116AB ; Bindu # Mn TAKRI SIGN ANUSVARA |
bd84d130 KW |
114 | |
115 | # ================================================ | |
116 | ||
117 | # Indic_Syllabic_Category=Visarga | |
118 | ||
119 | # Visarga (-h) | |
7620cb10 KW |
120 | # Includes specialized case for Sanskrit: ardhavisarga |
121 | # Excludes letters for jihvamuliya and upadhmaniya, which are | |
122 | # related, but structured somewhat differently. | |
bd84d130 KW |
123 | |
124 | # [Not derivable] | |
125 | ||
126 | 0903 ; Visarga # Mc DEVANAGARI SIGN VISARGA | |
127 | 0983 ; Visarga # Mc BENGALI SIGN VISARGA | |
128 | 0A03 ; Visarga # Mc GURMUKHI SIGN VISARGA | |
129 | 0A83 ; Visarga # Mc GUJARATI SIGN VISARGA | |
130 | 0B03 ; Visarga # Mc ORIYA SIGN VISARGA | |
131 | 0C03 ; Visarga # Mc TELUGU SIGN VISARGA | |
132 | 0C83 ; Visarga # Mc KANNADA SIGN VISARGA | |
133 | 0D03 ; Visarga # Mc MALAYALAM SIGN VISARGA | |
134 | 0D83 ; Visarga # Mc SINHALA SIGN VISARGAYA | |
135 | 0F7F ; Visarga # Mc TIBETAN SIGN RNAM BCAD | |
136 | 1038 ; Visarga # Mc MYANMAR SIGN VISARGA | |
137 | 17C7 ; Visarga # Mc KHMER SIGN REAHMUK | |
138 | 1B04 ; Visarga # Mc BALINESE SIGN BISAH | |
139 | 1B82 ; Visarga # Mc SUNDANESE SIGN PANGWISAD | |
09edd811 | 140 | 1CF2..1CF3 ; Visarga # Mc [2] VEDIC SIGN ARDHAVISARGA..VEDIC SIGN ROTATED ARDHAVISARGA |
bd84d130 KW |
141 | A881 ; Visarga # Mc SAURASHTRA SIGN VISARGA |
142 | A983 ; Visarga # Mc JAVANESE SIGN WIGNYAN | |
7620cb10 | 143 | AAF5 ; Visarga # Mc MEETEI MAYEK VOWEL SIGN VISARGA |
bd84d130 KW |
144 | 10A0F ; Visarga # Mn KHAROSHTHI SIGN VISARGA |
145 | 11002 ; Visarga # Mc BRAHMI SIGN VISARGA | |
146 | 11082 ; Visarga # Mc KAITHI SIGN VISARGA | |
7620cb10 | 147 | 11102 ; Visarga # Mn CHAKMA SIGN VISARGA |
09edd811 KW |
148 | 11182 ; Visarga # Mc SHARADA SIGN VISARGA |
149 | 11303 ; Visarga # Mc GRANTHA SIGN VISARGA | |
150 | 114C1 ; Visarga # Mc TIRHUTA SIGN VISARGA | |
151 | 115BE ; Visarga # Mc SIDDHAM SIGN VISARGA | |
152 | 1163E ; Visarga # Mc MODI SIGN VISARGA | |
7620cb10 | 153 | 116AC ; Visarga # Mc TAKRI SIGN VISARGA |
bd84d130 KW |
154 | |
155 | # ================================================ | |
156 | ||
157 | # Indic_Syllabic_Category=Avagraha | |
158 | ||
159 | # Avagraha (elision of initial a- in sandhi) | |
160 | ||
161 | # [Not derivable] | |
162 | ||
163 | 093D ; Avagraha # Lo DEVANAGARI SIGN AVAGRAHA | |
164 | 09BD ; Avagraha # Lo BENGALI SIGN AVAGRAHA | |
165 | 0ABD ; Avagraha # Lo GUJARATI SIGN AVAGRAHA | |
166 | 0B3D ; Avagraha # Lo ORIYA SIGN AVAGRAHA | |
167 | 0C3D ; Avagraha # Lo TELUGU SIGN AVAGRAHA | |
168 | 0CBD ; Avagraha # Lo KANNADA SIGN AVAGRAHA | |
169 | 0D3D ; Avagraha # Lo MALAYALAM SIGN AVAGRAHA | |
170 | 0F85 ; Avagraha # Po TIBETAN MARK PALUTA | |
171 | 17DC ; Avagraha # Lo KHMER SIGN AVAKRAHASANYA | |
7620cb10 KW |
172 | 1BBA ; Avagraha # Lo SUNDANESE AVAGRAHA |
173 | 111C1 ; Avagraha # Lo SHARADA SIGN AVAGRAHA | |
09edd811 KW |
174 | 1133D ; Avagraha # Lo GRANTHA SIGN AVAGRAHA |
175 | 114C4 ; Avagraha # Lo TIRHUTA SIGN AVAGRAHA | |
bd84d130 KW |
176 | |
177 | # ================================================ | |
178 | ||
179 | # Indic_Syllabic_Category=Nukta | |
180 | ||
ac71d2a0 UC |
181 | # Nukta (diacritic for borrowed consonants or other consonant |
182 | # modifications) | |
bd84d130 | 183 | |
ac71d2a0 | 184 | # [Derivation: (ccc=7) + 0F39 + 10A38..10A3A - 1037] |
bd84d130 KW |
185 | |
186 | 093C ; Nukta # Mn DEVANAGARI SIGN NUKTA | |
187 | 09BC ; Nukta # Mn BENGALI SIGN NUKTA | |
188 | 0A3C ; Nukta # Mn GURMUKHI SIGN NUKTA | |
189 | 0ABC ; Nukta # Mn GUJARATI SIGN NUKTA | |
190 | 0B3C ; Nukta # Mn ORIYA SIGN NUKTA | |
191 | 0CBC ; Nukta # Mn KANNADA SIGN NUKTA | |
ac71d2a0 | 192 | 0F39 ; Nukta # Mn TIBETAN MARK TSA -PHRU |
bd84d130 KW |
193 | 1B34 ; Nukta # Mn BALINESE SIGN REREKAN |
194 | 1BE6 ; Nukta # Mn BATAK SIGN TOMPI | |
195 | 1C37 ; Nukta # Mn LEPCHA SIGN NUKTA | |
196 | A9B3 ; Nukta # Mn JAVANESE SIGN CECAK TELU | |
ac71d2a0 | 197 | 10A38..10A3A ; Nukta # Mn [3] KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN DOT BELOW |
bd84d130 | 198 | 110BA ; Nukta # Mn KAITHI SIGN NUKTA |
09edd811 | 199 | 11173 ; Nukta # Mn MAHAJANI SIGN NUKTA |
ac71d2a0 | 200 | 111CA ; Nukta # Mn SHARADA SIGN NUKTA |
09edd811 KW |
201 | 11236 ; Nukta # Mn KHOJKI SIGN NUKTA |
202 | 112E9 ; Nukta # Mn KHUDAWADI SIGN NUKTA | |
203 | 1133C ; Nukta # Mn GRANTHA SIGN NUKTA | |
204 | 114C3 ; Nukta # Mn TIRHUTA SIGN NUKTA | |
205 | 115C0 ; Nukta # Mn SIDDHAM SIGN NUKTA | |
7620cb10 | 206 | 116B7 ; Nukta # Mn TAKRI SIGN NUKTA |
bd84d130 KW |
207 | |
208 | # ================================================ | |
209 | ||
210 | # Indic_Syllabic_Category=Virama | |
211 | ||
09edd811 KW |
212 | # Virama (killing of inherent vowel in consonant sequence |
213 | # or consonant stacker) | |
214 | # Only includes characters that can act both as visible killer viramas | |
215 | # and consonant stackers. Separate property values exist for characters | |
216 | # that can only act as pure killers or only as consonant stackers. | |
bd84d130 | 217 | |
ac71d2a0 UC |
218 | # [Derivation: (ccc=9) - (InSC=Pure_Killer) - (InSC=Invisible_Stacker) |
219 | # - (InSC=Number_Joiner) - 2D7F] | |
bd84d130 KW |
220 | |
221 | 094D ; Virama # Mn DEVANAGARI SIGN VIRAMA | |
222 | 09CD ; Virama # Mn BENGALI SIGN VIRAMA | |
223 | 0A4D ; Virama # Mn GURMUKHI SIGN VIRAMA | |
224 | 0ACD ; Virama # Mn GUJARATI SIGN VIRAMA | |
225 | 0B4D ; Virama # Mn ORIYA SIGN VIRAMA | |
226 | 0BCD ; Virama # Mn TAMIL SIGN VIRAMA | |
227 | 0C4D ; Virama # Mn TELUGU SIGN VIRAMA | |
228 | 0CCD ; Virama # Mn KANNADA SIGN VIRAMA | |
229 | 0D4D ; Virama # Mn MALAYALAM SIGN VIRAMA | |
230 | 0DCA ; Virama # Mn SINHALA SIGN AL-LAKUNA | |
bd84d130 | 231 | 1B44 ; Virama # Mc BALINESE ADEG ADEG |
bd84d130 | 232 | A8C4 ; Virama # Mn SAURASHTRA SIGN VIRAMA |
bd84d130 | 233 | A9C0 ; Virama # Mc JAVANESE PANGKON |
bd84d130 KW |
234 | 11046 ; Virama # Mn BRAHMI VIRAMA |
235 | 110B9 ; Virama # Mn KAITHI SIGN VIRAMA | |
7620cb10 | 236 | 111C0 ; Virama # Mc SHARADA SIGN VIRAMA |
09edd811 KW |
237 | 11235 ; Virama # Mc KHOJKI SIGN VIRAMA |
238 | 1134D ; Virama # Mc GRANTHA SIGN VIRAMA | |
239 | 114C2 ; Virama # Mn TIRHUTA SIGN VIRAMA | |
240 | 115BF ; Virama # Mn SIDDHAM SIGN VIRAMA | |
241 | 1163F ; Virama # Mn MODI SIGN VIRAMA | |
242 | 116B6 ; Virama # Mc TAKRI SIGN VIRAMA | |
243 | ||
244 | # ================================================ | |
245 | ||
246 | # Indic_Syllabic_Category=Pure_Killer | |
247 | ||
248 | # Pure killer (killing of inherent vowel in consonant sequence, | |
249 | # with no consonant stacking behavior) | |
250 | ||
251 | # [Not derivable] | |
252 | ||
253 | 0E3A ; Pure_Killer # Mn THAI CHARACTER PHINTHU | |
254 | 0E4E ; Pure_Killer # Mn THAI CHARACTER YAMAKKAN | |
255 | 0F84 ; Pure_Killer # Mn TIBETAN MARK HALANTA | |
256 | 103A ; Pure_Killer # Mn MYANMAR SIGN ASAT | |
257 | 1714 ; Pure_Killer # Mn TAGALOG SIGN VIRAMA | |
258 | 1734 ; Pure_Killer # Mn HANUNOO SIGN PAMUDPOD | |
259 | 17D1 ; Pure_Killer # Mn KHMER SIGN VIRIAM | |
260 | 1BAA ; Pure_Killer # Mc SUNDANESE SIGN PAMAAEH | |
261 | 1BF2..1BF3 ; Pure_Killer # Mc [2] BATAK PANGOLAT..BATAK PANONGONAN | |
262 | A806 ; Pure_Killer # Mn SYLOTI NAGRI SIGN HASANTA | |
263 | A953 ; Pure_Killer # Mc REJANG VIRAMA | |
264 | ABED ; Pure_Killer # Mn MEETEI MAYEK APUN IYEK | |
265 | 11134 ; Pure_Killer # Mn CHAKMA MAAYYAA | |
266 | 112EA ; Pure_Killer # Mn KHUDAWADI SIGN VIRAMA | |
ac71d2a0 | 267 | 1172B ; Pure_Killer # Mn AHOM SIGN KILLER |
09edd811 KW |
268 | |
269 | # ================================================ | |
270 | ||
271 | # Indic_Syllabic_Category=Invisible_Stacker | |
272 | ||
273 | # Invisible stacker (invisible consonant stacker virama) | |
274 | ||
275 | # [Not derivable] | |
276 | ||
277 | 1039 ; Invisible_Stacker # Mn MYANMAR SIGN VIRAMA | |
278 | 17D2 ; Invisible_Stacker # Mn KHMER SIGN COENG | |
279 | 1A60 ; Invisible_Stacker # Mn TAI THAM SIGN SAKOT | |
280 | 1BAB ; Invisible_Stacker # Mn SUNDANESE SIGN VIRAMA | |
281 | AAF6 ; Invisible_Stacker # Mn MEETEI MAYEK VIRAMA | |
282 | 10A3F ; Invisible_Stacker # Mn KHAROSHTHI VIRAMA | |
283 | 11133 ; Invisible_Stacker # Mn CHAKMA VIRAMA | |
bd84d130 KW |
284 | |
285 | # ================================================ | |
286 | ||
287 | # Indic_Syllabic_Category=Vowel_Independent | |
288 | ||
289 | # Independent Vowels (contrasted with matras) | |
290 | ||
291 | # [Not derivable] | |
292 | ||
293 | 0904..0914 ; Vowel_Independent # Lo [17] DEVANAGARI LETTER SHORT A..DEVANAGARI LETTER AU | |
294 | 0960..0961 ; Vowel_Independent # Lo [2] DEVANAGARI LETTER VOCALIC RR..DEVANAGARI LETTER VOCALIC LL | |
295 | 0972..0977 ; Vowel_Independent # Lo [6] DEVANAGARI LETTER CANDRA A..DEVANAGARI LETTER UUE | |
296 | 0985..098C ; Vowel_Independent # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L | |
297 | 098F..0990 ; Vowel_Independent # Lo [2] BENGALI LETTER E..BENGALI LETTER AI | |
298 | 0993..0994 ; Vowel_Independent # Lo [2] BENGALI LETTER O..BENGALI LETTER AU | |
299 | 09E0..09E1 ; Vowel_Independent # Lo [2] BENGALI LETTER VOCALIC RR..BENGALI LETTER VOCALIC LL | |
300 | 0A05..0A0A ; Vowel_Independent # Lo [6] GURMUKHI LETTER A..GURMUKHI LETTER UU | |
301 | 0A0F..0A10 ; Vowel_Independent # Lo [2] GURMUKHI LETTER EE..GURMUKHI LETTER AI | |
302 | 0A13..0A14 ; Vowel_Independent # Lo [2] GURMUKHI LETTER OO..GURMUKHI LETTER AU | |
303 | 0A85..0A8D ; Vowel_Independent # Lo [9] GUJARATI LETTER A..GUJARATI VOWEL CANDRA E | |
304 | 0A8F..0A91 ; Vowel_Independent # Lo [3] GUJARATI LETTER E..GUJARATI VOWEL CANDRA O | |
305 | 0A93..0A94 ; Vowel_Independent # Lo [2] GUJARATI LETTER O..GUJARATI LETTER AU | |
306 | 0AE0..0AE1 ; Vowel_Independent # Lo [2] GUJARATI LETTER VOCALIC RR..GUJARATI LETTER VOCALIC LL | |
307 | 0B05..0B0C ; Vowel_Independent # Lo [8] ORIYA LETTER A..ORIYA LETTER VOCALIC L | |
308 | 0B0F..0B10 ; Vowel_Independent # Lo [2] ORIYA LETTER E..ORIYA LETTER AI | |
309 | 0B13..0B14 ; Vowel_Independent # Lo [2] ORIYA LETTER O..ORIYA LETTER AU | |
310 | 0B60..0B61 ; Vowel_Independent # Lo [2] ORIYA LETTER VOCALIC RR..ORIYA LETTER VOCALIC LL | |
311 | 0B85..0B8A ; Vowel_Independent # Lo [6] TAMIL LETTER A..TAMIL LETTER UU | |
312 | 0B8E..0B90 ; Vowel_Independent # Lo [3] TAMIL LETTER E..TAMIL LETTER AI | |
313 | 0B92..0B94 ; Vowel_Independent # Lo [3] TAMIL LETTER O..TAMIL LETTER AU | |
314 | 0C05..0C0C ; Vowel_Independent # Lo [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L | |
315 | 0C0E..0C10 ; Vowel_Independent # Lo [3] TELUGU LETTER E..TELUGU LETTER AI | |
316 | 0C12..0C14 ; Vowel_Independent # Lo [3] TELUGU LETTER O..TELUGU LETTER AU | |
317 | 0C60..0C61 ; Vowel_Independent # Lo [2] TELUGU LETTER VOCALIC RR..TELUGU LETTER VOCALIC LL | |
318 | 0C85..0C8C ; Vowel_Independent # Lo [8] KANNADA LETTER A..KANNADA LETTER VOCALIC L | |
319 | 0C8E..0C90 ; Vowel_Independent # Lo [3] KANNADA LETTER E..KANNADA LETTER AI | |
320 | 0C92..0C94 ; Vowel_Independent # Lo [3] KANNADA LETTER O..KANNADA LETTER AU | |
321 | 0CE0..0CE1 ; Vowel_Independent # Lo [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL | |
322 | 0D05..0D0C ; Vowel_Independent # Lo [8] MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC L | |
323 | 0D0E..0D10 ; Vowel_Independent # Lo [3] MALAYALAM LETTER E..MALAYALAM LETTER AI | |
324 | 0D12..0D14 ; Vowel_Independent # Lo [3] MALAYALAM LETTER O..MALAYALAM LETTER AU | |
ac71d2a0 | 325 | 0D5F..0D61 ; Vowel_Independent # Lo [3] MALAYALAM LETTER ARCHAIC II..MALAYALAM LETTER VOCALIC LL |
bd84d130 KW |
326 | 0D85..0D96 ; Vowel_Independent # Lo [18] SINHALA LETTER AYANNA..SINHALA LETTER AUYANNA |
327 | 1021..102A ; Vowel_Independent # Lo [10] MYANMAR LETTER A..MYANMAR LETTER AU | |
328 | 1052..1055 ; Vowel_Independent # Lo [4] MYANMAR LETTER VOCALIC R..MYANMAR LETTER VOCALIC LL | |
329 | 1700..1702 ; Vowel_Independent # Lo [3] TAGALOG LETTER A..TAGALOG LETTER U | |
330 | 1720..1722 ; Vowel_Independent # Lo [3] HANUNOO LETTER A..HANUNOO LETTER U | |
331 | 1740..1742 ; Vowel_Independent # Lo [3] BUHID LETTER A..BUHID LETTER U | |
332 | 1760..1762 ; Vowel_Independent # Lo [3] TAGBANWA LETTER A..TAGBANWA LETTER U | |
333 | 17A3..17B3 ; Vowel_Independent # Lo [17] KHMER INDEPENDENT VOWEL QAQ..KHMER INDEPENDENT VOWEL QAU | |
334 | 1A4D..1A52 ; Vowel_Independent # Lo [6] TAI THAM LETTER I..TAI THAM LETTER OO | |
335 | 1B05..1B12 ; Vowel_Independent # Lo [14] BALINESE LETTER AKARA..BALINESE LETTER OKARA TEDUNG | |
336 | 1B83..1B89 ; Vowel_Independent # Lo [7] SUNDANESE LETTER A..SUNDANESE LETTER EU | |
337 | 1BE4..1BE5 ; Vowel_Independent # Lo [2] BATAK LETTER I..BATAK LETTER U | |
338 | A800..A801 ; Vowel_Independent # Lo [2] SYLOTI NAGRI LETTER A..SYLOTI NAGRI LETTER I | |
339 | A803..A805 ; Vowel_Independent # Lo [3] SYLOTI NAGRI LETTER U..SYLOTI NAGRI LETTER O | |
340 | A882..A891 ; Vowel_Independent # Lo [16] SAURASHTRA LETTER A..SAURASHTRA LETTER AU | |
341 | A984..A988 ; Vowel_Independent # Lo [5] JAVANESE LETTER A..JAVANESE LETTER U | |
342 | A98C..A98E ; Vowel_Independent # Lo [3] JAVANESE LETTER E..JAVANESE LETTER O | |
343 | AA00..AA05 ; Vowel_Independent # Lo [6] CHAM LETTER A..CHAM LETTER O | |
7620cb10 KW |
344 | AAE0..AAE1 ; Vowel_Independent # Lo [2] MEETEI MAYEK LETTER E..MEETEI MAYEK LETTER O |
345 | ABCE..ABCF ; Vowel_Independent # Lo [2] MEETEI MAYEK LETTER UN..MEETEI MAYEK LETTER I | |
346 | ABD1 ; Vowel_Independent # Lo MEETEI MAYEK LETTER ATIYA | |
bd84d130 KW |
347 | 11005..11012 ; Vowel_Independent # Lo [14] BRAHMI LETTER A..BRAHMI LETTER AU |
348 | 11083..1108C ; Vowel_Independent # Lo [10] KAITHI LETTER A..KAITHI LETTER AU | |
7620cb10 KW |
349 | 11103..11106 ; Vowel_Independent # Lo [4] CHAKMA LETTER AA..CHAKMA LETTER E |
350 | 11183..11190 ; Vowel_Independent # Lo [14] SHARADA LETTER A..SHARADA LETTER AU | |
09edd811 | 351 | 11200..11207 ; Vowel_Independent # Lo [8] KHOJKI LETTER A..KHOJKI LETTER AU |
ac71d2a0 | 352 | 11280..11283 ; Vowel_Independent # Lo [4] MULTANI LETTER A..MULTANI LETTER E |
09edd811 KW |
353 | 112B0..112B9 ; Vowel_Independent # Lo [10] KHUDAWADI LETTER A..KHUDAWADI LETTER AU |
354 | 11305..1130C ; Vowel_Independent # Lo [8] GRANTHA LETTER A..GRANTHA LETTER VOCALIC L | |
355 | 1130F..11310 ; Vowel_Independent # Lo [2] GRANTHA LETTER EE..GRANTHA LETTER AI | |
356 | 11313..11314 ; Vowel_Independent # Lo [2] GRANTHA LETTER OO..GRANTHA LETTER AU | |
357 | 11360..11361 ; Vowel_Independent # Lo [2] GRANTHA LETTER VOCALIC RR..GRANTHA LETTER VOCALIC LL | |
358 | 11481..1148E ; Vowel_Independent # Lo [14] TIRHUTA LETTER A..TIRHUTA LETTER AU | |
359 | 11580..1158D ; Vowel_Independent # Lo [14] SIDDHAM LETTER A..SIDDHAM LETTER AU | |
ac71d2a0 | 360 | 115D8..115DB ; Vowel_Independent # Lo [4] SIDDHAM LETTER THREE-CIRCLE ALTERNATE I..SIDDHAM LETTER ALTERNATE U |
09edd811 | 361 | 11600..1160D ; Vowel_Independent # Lo [14] MODI LETTER A..MODI LETTER AU |
7620cb10 | 362 | 11680..11689 ; Vowel_Independent # Lo [10] TAKRI LETTER A..TAKRI LETTER AU |
bd84d130 KW |
363 | |
364 | # ================================================ | |
365 | ||
366 | # Indic_Syllabic_Category=Vowel_Dependent | |
367 | ||
368 | # Dependent Vowels (contrasted with independent vowels and/or with complex placement) | |
369 | # Matras (in Indic scripts) | |
370 | ||
371 | # [Not derivable] | |
372 | ||
373 | 093A ; Vowel_Dependent # Mn DEVANAGARI VOWEL SIGN OE | |
374 | 093B ; Vowel_Dependent # Mc DEVANAGARI VOWEL SIGN OOE | |
375 | 093E..0940 ; Vowel_Dependent # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II | |
376 | 0941..0948 ; Vowel_Dependent # Mn [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI | |
377 | 0949..094C ; Vowel_Dependent # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU | |
378 | 094E..094F ; Vowel_Dependent # Mc [2] DEVANAGARI VOWEL SIGN PRISHTHAMATRA E..DEVANAGARI VOWEL SIGN AW | |
a9c9e371 | 379 | 0955..0957 ; Vowel_Dependent # Mn [3] DEVANAGARI VOWEL SIGN CANDRA LONG E..DEVANAGARI VOWEL SIGN UUE |
bd84d130 KW |
380 | 0962..0963 ; Vowel_Dependent # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL |
381 | 09BE..09C0 ; Vowel_Dependent # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II | |
382 | 09C1..09C4 ; Vowel_Dependent # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR | |
383 | 09C7..09C8 ; Vowel_Dependent # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI | |
384 | 09CB..09CC ; Vowel_Dependent # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU | |
385 | 09D7 ; Vowel_Dependent # Mc BENGALI AU LENGTH MARK | |
386 | 09E2..09E3 ; Vowel_Dependent # Mn [2] BENGALI VOWEL SIGN VOCALIC L..BENGALI VOWEL SIGN VOCALIC LL | |
387 | 0A3E..0A40 ; Vowel_Dependent # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II | |
388 | 0A41..0A42 ; Vowel_Dependent # Mn [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU | |
389 | 0A47..0A48 ; Vowel_Dependent # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI | |
390 | 0A4B..0A4C ; Vowel_Dependent # Mn [2] GURMUKHI VOWEL SIGN OO..GURMUKHI VOWEL SIGN AU | |
391 | 0ABE..0AC0 ; Vowel_Dependent # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II | |
392 | 0AC1..0AC5 ; Vowel_Dependent # Mn [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E | |
393 | 0AC7..0AC8 ; Vowel_Dependent # Mn [2] GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN AI | |
394 | 0AC9 ; Vowel_Dependent # Mc GUJARATI VOWEL SIGN CANDRA O | |
395 | 0ACB..0ACC ; Vowel_Dependent # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU | |
396 | 0AE2..0AE3 ; Vowel_Dependent # Mn [2] GUJARATI VOWEL SIGN VOCALIC L..GUJARATI VOWEL SIGN VOCALIC LL | |
397 | 0B3E ; Vowel_Dependent # Mc ORIYA VOWEL SIGN AA | |
398 | 0B3F ; Vowel_Dependent # Mn ORIYA VOWEL SIGN I | |
399 | 0B40 ; Vowel_Dependent # Mc ORIYA VOWEL SIGN II | |
400 | 0B41..0B44 ; Vowel_Dependent # Mn [4] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC RR | |
401 | 0B47..0B48 ; Vowel_Dependent # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI | |
402 | 0B4B..0B4C ; Vowel_Dependent # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU | |
403 | 0B56 ; Vowel_Dependent # Mn ORIYA AI LENGTH MARK | |
404 | 0B57 ; Vowel_Dependent # Mc ORIYA AU LENGTH MARK | |
405 | 0B62..0B63 ; Vowel_Dependent # Mn [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL | |
406 | 0BBE..0BBF ; Vowel_Dependent # Mc [2] TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN I | |
407 | 0BC0 ; Vowel_Dependent # Mn TAMIL VOWEL SIGN II | |
408 | 0BC1..0BC2 ; Vowel_Dependent # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU | |
409 | 0BC6..0BC8 ; Vowel_Dependent # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI | |
410 | 0BCA..0BCC ; Vowel_Dependent # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU | |
411 | 0BD7 ; Vowel_Dependent # Mc TAMIL AU LENGTH MARK | |
412 | 0C3E..0C40 ; Vowel_Dependent # Mn [3] TELUGU VOWEL SIGN AA..TELUGU VOWEL SIGN II | |
413 | 0C41..0C44 ; Vowel_Dependent # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR | |
414 | 0C46..0C48 ; Vowel_Dependent # Mn [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI | |
415 | 0C4A..0C4C ; Vowel_Dependent # Mn [3] TELUGU VOWEL SIGN O..TELUGU VOWEL SIGN AU | |
416 | 0C55..0C56 ; Vowel_Dependent # Mn [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK | |
417 | 0C62..0C63 ; Vowel_Dependent # Mn [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL | |
418 | 0CBE ; Vowel_Dependent # Mc KANNADA VOWEL SIGN AA | |
419 | 0CBF ; Vowel_Dependent # Mn KANNADA VOWEL SIGN I | |
420 | 0CC0..0CC4 ; Vowel_Dependent # Mc [5] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN VOCALIC RR | |
421 | 0CC6 ; Vowel_Dependent # Mn KANNADA VOWEL SIGN E | |
422 | 0CC7..0CC8 ; Vowel_Dependent # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI | |
423 | 0CCA..0CCB ; Vowel_Dependent # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO | |
424 | 0CCC ; Vowel_Dependent # Mn KANNADA VOWEL SIGN AU | |
425 | 0CD5..0CD6 ; Vowel_Dependent # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK | |
426 | 0CE2..0CE3 ; Vowel_Dependent # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL | |
427 | 0D3E..0D40 ; Vowel_Dependent # Mc [3] MALAYALAM VOWEL SIGN AA..MALAYALAM VOWEL SIGN II | |
428 | 0D41..0D44 ; Vowel_Dependent # Mn [4] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC RR | |
429 | 0D46..0D48 ; Vowel_Dependent # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI | |
430 | 0D4A..0D4C ; Vowel_Dependent # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU | |
431 | 0D57 ; Vowel_Dependent # Mc MALAYALAM AU LENGTH MARK | |
432 | 0D62..0D63 ; Vowel_Dependent # Mn [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL | |
433 | 0DCF..0DD1 ; Vowel_Dependent # Mc [3] SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA | |
434 | 0DD2..0DD4 ; Vowel_Dependent # Mn [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA | |
435 | 0DD6 ; Vowel_Dependent # Mn SINHALA VOWEL SIGN DIGA PAA-PILLA | |
436 | 0DD8..0DDF ; Vowel_Dependent # Mc [8] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN GAYANUKITTA | |
437 | 0DF2..0DF3 ; Vowel_Dependent # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA | |
438 | 0E30 ; Vowel_Dependent # Lo THAI CHARACTER SARA A | |
439 | 0E31 ; Vowel_Dependent # Mn THAI CHARACTER MAI HAN-AKAT | |
440 | 0E32..0E33 ; Vowel_Dependent # Lo [2] THAI CHARACTER SARA AA..THAI CHARACTER SARA AM | |
441 | 0E34..0E39 ; Vowel_Dependent # Mn [6] THAI CHARACTER SARA I..THAI CHARACTER SARA UU | |
442 | 0E40..0E45 ; Vowel_Dependent # Lo [6] THAI CHARACTER SARA E..THAI CHARACTER LAKKHANGYAO | |
443 | 0E47 ; Vowel_Dependent # Mn THAI CHARACTER MAITAIKHU | |
444 | 0EB0 ; Vowel_Dependent # Lo LAO VOWEL SIGN A | |
445 | 0EB1 ; Vowel_Dependent # Mn LAO VOWEL SIGN MAI KAN | |
446 | 0EB2..0EB3 ; Vowel_Dependent # Lo [2] LAO VOWEL SIGN AA..LAO VOWEL SIGN AM | |
447 | 0EB4..0EB9 ; Vowel_Dependent # Mn [6] LAO VOWEL SIGN I..LAO VOWEL SIGN UU | |
448 | 0EBB ; Vowel_Dependent # Mn LAO VOWEL SIGN MAI KON | |
449 | 0EC0..0EC4 ; Vowel_Dependent # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI | |
450 | 0F71..0F7D ; Vowel_Dependent # Mn [13] TIBETAN VOWEL SIGN AA..TIBETAN VOWEL SIGN OO | |
451 | 0F80..0F81 ; Vowel_Dependent # Mn [2] TIBETAN VOWEL SIGN REVERSED I..TIBETAN VOWEL SIGN REVERSED II | |
452 | 102B..102C ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN TALL AA..MYANMAR VOWEL SIGN AA | |
453 | 102D..1030 ; Vowel_Dependent # Mn [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU | |
454 | 1031 ; Vowel_Dependent # Mc MYANMAR VOWEL SIGN E | |
455 | 1032..1035 ; Vowel_Dependent # Mn [4] MYANMAR VOWEL SIGN AI..MYANMAR VOWEL SIGN E ABOVE | |
456 | 1056..1057 ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR | |
457 | 1058..1059 ; Vowel_Dependent # Mn [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL | |
458 | 1062 ; Vowel_Dependent # Mc MYANMAR VOWEL SIGN SGAW KAREN EU | |
459 | 1067..1068 ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN WESTERN PWO KAREN EU..MYANMAR VOWEL SIGN WESTERN PWO KAREN UE | |
460 | 1071..1074 ; Vowel_Dependent # Mn [4] MYANMAR VOWEL SIGN GEBA KAREN I..MYANMAR VOWEL SIGN KAYAH EE | |
461 | 1083..1084 ; Vowel_Dependent # Mc [2] MYANMAR VOWEL SIGN SHAN AA..MYANMAR VOWEL SIGN SHAN E | |
462 | 1085..1086 ; Vowel_Dependent # Mn [2] MYANMAR VOWEL SIGN SHAN E ABOVE..MYANMAR VOWEL SIGN SHAN FINAL Y | |
463 | 109C ; Vowel_Dependent # Mc MYANMAR VOWEL SIGN AITON A | |
464 | 109D ; Vowel_Dependent # Mn MYANMAR VOWEL SIGN AITON AI | |
465 | 1712..1713 ; Vowel_Dependent # Mn [2] TAGALOG VOWEL SIGN I..TAGALOG VOWEL SIGN U | |
466 | 1732..1733 ; Vowel_Dependent # Mn [2] HANUNOO VOWEL SIGN I..HANUNOO VOWEL SIGN U | |
467 | 1752..1753 ; Vowel_Dependent # Mn [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U | |
468 | 1772..1773 ; Vowel_Dependent # Mn [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U | |
469 | 17B6 ; Vowel_Dependent # Mc KHMER VOWEL SIGN AA | |
470 | 17B7..17BD ; Vowel_Dependent # Mn [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA | |
471 | 17BE..17C5 ; Vowel_Dependent # Mc [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU | |
472 | 17C8 ; Vowel_Dependent # Mc KHMER SIGN YUUKALEAPINTU | |
473 | 1920..1922 ; Vowel_Dependent # Mn [3] LIMBU VOWEL SIGN A..LIMBU VOWEL SIGN U | |
474 | 1923..1926 ; Vowel_Dependent # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU | |
475 | 1927..1928 ; Vowel_Dependent # Mn [2] LIMBU VOWEL SIGN E..LIMBU VOWEL SIGN O | |
ac71d2a0 UC |
476 | 193A ; Vowel_Dependent # Mn LIMBU SIGN KEMPHRENG |
477 | 19B0..19C0 ; Vowel_Dependent # Lo [17] NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW TAI LUE VOWEL SIGN IY | |
bd84d130 | 478 | 1A17..1A18 ; Vowel_Dependent # Mn [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U |
09edd811 KW |
479 | 1A19..1A1A ; Vowel_Dependent # Mc [2] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN O |
480 | 1A1B ; Vowel_Dependent # Mn BUGINESE VOWEL SIGN AE | |
bd84d130 KW |
481 | 1A61 ; Vowel_Dependent # Mc TAI THAM VOWEL SIGN A |
482 | 1A62 ; Vowel_Dependent # Mn TAI THAM VOWEL SIGN MAI SAT | |
483 | 1A63..1A64 ; Vowel_Dependent # Mc [2] TAI THAM VOWEL SIGN AA..TAI THAM VOWEL SIGN TALL AA | |
484 | 1A65..1A6C ; Vowel_Dependent # Mn [8] TAI THAM VOWEL SIGN I..TAI THAM VOWEL SIGN OA BELOW | |
485 | 1A6D..1A72 ; Vowel_Dependent # Mc [6] TAI THAM VOWEL SIGN OY..TAI THAM VOWEL SIGN THAM AI | |
486 | 1A73..1A74 ; Vowel_Dependent # Mn [2] TAI THAM VOWEL SIGN OA ABOVE..TAI THAM SIGN MAI KANG | |
487 | 1B35 ; Vowel_Dependent # Mc BALINESE VOWEL SIGN TEDUNG | |
488 | 1B36..1B3A ; Vowel_Dependent # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA | |
489 | 1B3B ; Vowel_Dependent # Mc BALINESE VOWEL SIGN RA REPA TEDUNG | |
490 | 1B3C ; Vowel_Dependent # Mn BALINESE VOWEL SIGN LA LENGA | |
491 | 1B3D..1B41 ; Vowel_Dependent # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG | |
492 | 1B42 ; Vowel_Dependent # Mn BALINESE VOWEL SIGN PEPET | |
493 | 1B43 ; Vowel_Dependent # Mc BALINESE VOWEL SIGN PEPET TEDUNG | |
494 | 1BA4..1BA5 ; Vowel_Dependent # Mn [2] SUNDANESE VOWEL SIGN PANGHULU..SUNDANESE VOWEL SIGN PANYUKU | |
495 | 1BA6..1BA7 ; Vowel_Dependent # Mc [2] SUNDANESE VOWEL SIGN PANAELAENG..SUNDANESE VOWEL SIGN PANOLONG | |
496 | 1BA8..1BA9 ; Vowel_Dependent # Mn [2] SUNDANESE VOWEL SIGN PAMEPET..SUNDANESE VOWEL SIGN PANEULEUNG | |
497 | 1BE7 ; Vowel_Dependent # Mc BATAK VOWEL SIGN E | |
498 | 1BE8..1BE9 ; Vowel_Dependent # Mn [2] BATAK VOWEL SIGN PAKPAK E..BATAK VOWEL SIGN EE | |
499 | 1BEA..1BEC ; Vowel_Dependent # Mc [3] BATAK VOWEL SIGN I..BATAK VOWEL SIGN O | |
500 | 1BED ; Vowel_Dependent # Mn BATAK VOWEL SIGN KARO O | |
501 | 1BEE ; Vowel_Dependent # Mc BATAK VOWEL SIGN U | |
502 | 1BEF ; Vowel_Dependent # Mn BATAK VOWEL SIGN U FOR SIMALUNGUN SA | |
503 | 1C26..1C2B ; Vowel_Dependent # Mc [6] LEPCHA VOWEL SIGN AA..LEPCHA VOWEL SIGN UU | |
504 | 1C2C ; Vowel_Dependent # Mn LEPCHA VOWEL SIGN E | |
505 | A823..A824 ; Vowel_Dependent # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I | |
506 | A825..A826 ; Vowel_Dependent # Mn [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E | |
507 | A827 ; Vowel_Dependent # Mc SYLOTI NAGRI VOWEL SIGN OO | |
508 | A8B5..A8C3 ; Vowel_Dependent # Mc [15] SAURASHTRA VOWEL SIGN AA..SAURASHTRA VOWEL SIGN AU | |
509 | A947..A94E ; Vowel_Dependent # Mn [8] REJANG VOWEL SIGN I..REJANG VOWEL SIGN EA | |
510 | A9B4..A9B5 ; Vowel_Dependent # Mc [2] JAVANESE VOWEL SIGN TARUNG..JAVANESE VOWEL SIGN TOLONG | |
511 | A9B6..A9B9 ; Vowel_Dependent # Mn [4] JAVANESE VOWEL SIGN WULU..JAVANESE VOWEL SIGN SUKU MENDUT | |
512 | A9BA..A9BB ; Vowel_Dependent # Mc [2] JAVANESE VOWEL SIGN TALING..JAVANESE VOWEL SIGN DIRGA MURE | |
513 | A9BC ; Vowel_Dependent # Mn JAVANESE VOWEL SIGN PEPET | |
ac71d2a0 | 514 | A9E5 ; Vowel_Dependent # Mn MYANMAR SIGN SHAN SAW |
bd84d130 KW |
515 | AA29..AA2E ; Vowel_Dependent # Mn [6] CHAM VOWEL SIGN AA..CHAM VOWEL SIGN OE |
516 | AA2F..AA30 ; Vowel_Dependent # Mc [2] CHAM VOWEL SIGN O..CHAM VOWEL SIGN AI | |
517 | AA31..AA32 ; Vowel_Dependent # Mn [2] CHAM VOWEL SIGN AU..CHAM VOWEL SIGN UE | |
518 | AAB0 ; Vowel_Dependent # Mn TAI VIET MAI KANG | |
519 | AAB1 ; Vowel_Dependent # Lo TAI VIET VOWEL AA | |
520 | AAB2..AAB4 ; Vowel_Dependent # Mn [3] TAI VIET VOWEL I..TAI VIET VOWEL U | |
521 | AAB5..AAB6 ; Vowel_Dependent # Lo [2] TAI VIET VOWEL E..TAI VIET VOWEL O | |
522 | AAB7..AAB8 ; Vowel_Dependent # Mn [2] TAI VIET MAI KHIT..TAI VIET VOWEL IA | |
523 | AAB9..AABD ; Vowel_Dependent # Lo [5] TAI VIET VOWEL UEA..TAI VIET VOWEL AN | |
524 | AABE ; Vowel_Dependent # Mn TAI VIET VOWEL AM | |
09edd811 KW |
525 | AAEB ; Vowel_Dependent # Mc MEETEI MAYEK VOWEL SIGN II |
526 | AAEC..AAED ; Vowel_Dependent # Mn [2] MEETEI MAYEK VOWEL SIGN UU..MEETEI MAYEK VOWEL SIGN AAI | |
527 | AAEE..AAEF ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN AU..MEETEI MAYEK VOWEL SIGN AAU | |
bd84d130 KW |
528 | ABE3..ABE4 ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN ONAP..MEETEI MAYEK VOWEL SIGN INAP |
529 | ABE5 ; Vowel_Dependent # Mn MEETEI MAYEK VOWEL SIGN ANAP | |
530 | ABE6..ABE7 ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN YENAP..MEETEI MAYEK VOWEL SIGN SOUNAP | |
531 | ABE8 ; Vowel_Dependent # Mn MEETEI MAYEK VOWEL SIGN UNAP | |
532 | ABE9..ABEA ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEETEI MAYEK VOWEL SIGN NUNG | |
533 | 10A01..10A03 ; Vowel_Dependent # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R | |
534 | 10A05..10A06 ; Vowel_Dependent # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O | |
ac71d2a0 | 535 | 10A0C..10A0D ; Vowel_Dependent # Mn [2] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN DOUBLE RING BELOW |
bd84d130 KW |
536 | 11038..11045 ; Vowel_Dependent # Mn [14] BRAHMI VOWEL SIGN AA..BRAHMI VOWEL SIGN AU |
537 | 110B0..110B2 ; Vowel_Dependent # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II | |
538 | 110B3..110B6 ; Vowel_Dependent # Mn [4] KAITHI VOWEL SIGN U..KAITHI VOWEL SIGN AI | |
539 | 110B7..110B8 ; Vowel_Dependent # Mc [2] KAITHI VOWEL SIGN O..KAITHI VOWEL SIGN AU | |
09edd811 KW |
540 | 11127..1112B ; Vowel_Dependent # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU |
541 | 1112C ; Vowel_Dependent # Mc CHAKMA VOWEL SIGN E | |
542 | 1112D..11132 ; Vowel_Dependent # Mn [6] CHAKMA VOWEL SIGN AI..CHAKMA AU MARK | |
543 | 111B3..111B5 ; Vowel_Dependent # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II | |
544 | 111B6..111BE ; Vowel_Dependent # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O | |
545 | 111BF ; Vowel_Dependent # Mc SHARADA VOWEL SIGN AU | |
ac71d2a0 | 546 | 111CB..111CC ; Vowel_Dependent # Mn [2] SHARADA VOWEL MODIFIER MARK..SHARADA EXTRA SHORT VOWEL MARK |
09edd811 KW |
547 | 1122C..1122E ; Vowel_Dependent # Mc [3] KHOJKI VOWEL SIGN AA..KHOJKI VOWEL SIGN II |
548 | 1122F..11231 ; Vowel_Dependent # Mn [3] KHOJKI VOWEL SIGN U..KHOJKI VOWEL SIGN AI | |
549 | 11232..11233 ; Vowel_Dependent # Mc [2] KHOJKI VOWEL SIGN O..KHOJKI VOWEL SIGN AU | |
550 | 112E0..112E2 ; Vowel_Dependent # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II | |
551 | 112E3..112E8 ; Vowel_Dependent # Mn [6] KHUDAWADI VOWEL SIGN U..KHUDAWADI VOWEL SIGN AU | |
552 | 1133E..1133F ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN AA..GRANTHA VOWEL SIGN I | |
553 | 11340 ; Vowel_Dependent # Mn GRANTHA VOWEL SIGN II | |
554 | 11341..11344 ; Vowel_Dependent # Mc [4] GRANTHA VOWEL SIGN U..GRANTHA VOWEL SIGN VOCALIC RR | |
555 | 11347..11348 ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI | |
556 | 1134B..1134C ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN OO..GRANTHA VOWEL SIGN AU | |
557 | 11357 ; Vowel_Dependent # Mc GRANTHA AU LENGTH MARK | |
558 | 11362..11363 ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL | |
559 | 114B0..114B2 ; Vowel_Dependent # Mc [3] TIRHUTA VOWEL SIGN AA..TIRHUTA VOWEL SIGN II | |
560 | 114B3..114B8 ; Vowel_Dependent # Mn [6] TIRHUTA VOWEL SIGN U..TIRHUTA VOWEL SIGN VOCALIC LL | |
561 | 114B9 ; Vowel_Dependent # Mc TIRHUTA VOWEL SIGN E | |
562 | 114BA ; Vowel_Dependent # Mn TIRHUTA VOWEL SIGN SHORT E | |
563 | 114BB..114BE ; Vowel_Dependent # Mc [4] TIRHUTA VOWEL SIGN AI..TIRHUTA VOWEL SIGN AU | |
564 | 115AF..115B1 ; Vowel_Dependent # Mc [3] SIDDHAM VOWEL SIGN AA..SIDDHAM VOWEL SIGN II | |
565 | 115B2..115B5 ; Vowel_Dependent # Mn [4] SIDDHAM VOWEL SIGN U..SIDDHAM VOWEL SIGN VOCALIC RR | |
566 | 115B8..115BB ; Vowel_Dependent # Mc [4] SIDDHAM VOWEL SIGN E..SIDDHAM VOWEL SIGN AU | |
ac71d2a0 | 567 | 115DC..115DD ; Vowel_Dependent # Mn [2] SIDDHAM VOWEL SIGN ALTERNATE U..SIDDHAM VOWEL SIGN ALTERNATE UU |
09edd811 KW |
568 | 11630..11632 ; Vowel_Dependent # Mc [3] MODI VOWEL SIGN AA..MODI VOWEL SIGN II |
569 | 11633..1163A ; Vowel_Dependent # Mn [8] MODI VOWEL SIGN U..MODI VOWEL SIGN AI | |
570 | 1163B..1163C ; Vowel_Dependent # Mc [2] MODI VOWEL SIGN O..MODI VOWEL SIGN AU | |
ac71d2a0 | 571 | 11640 ; Vowel_Dependent # Mn MODI SIGN ARDHACANDRA |
09edd811 KW |
572 | 116AD ; Vowel_Dependent # Mn TAKRI VOWEL SIGN AA |
573 | 116AE..116AF ; Vowel_Dependent # Mc [2] TAKRI VOWEL SIGN I..TAKRI VOWEL SIGN II | |
574 | 116B0..116B5 ; Vowel_Dependent # Mn [6] TAKRI VOWEL SIGN U..TAKRI VOWEL SIGN AU | |
ac71d2a0 UC |
575 | 11720..11721 ; Vowel_Dependent # Mc [2] AHOM VOWEL SIGN A..AHOM VOWEL SIGN AA |
576 | 11722..11725 ; Vowel_Dependent # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU | |
577 | 11726 ; Vowel_Dependent # Mc AHOM VOWEL SIGN E | |
578 | 11727..1172A ; Vowel_Dependent # Mn [4] AHOM VOWEL SIGN AW..AHOM VOWEL SIGN AM | |
bd84d130 KW |
579 | |
580 | # ================================================ | |
581 | ||
582 | # Indic_Syllabic_Category=Vowel | |
583 | ||
584 | # (Other) Vowels (reanalyzed as ordinary alphabetic letters or marks) | |
585 | ||
586 | # [Not derivable] | |
587 | ||
588 | 1963..196D ; Vowel # Lo [11] TAI LE LETTER A..TAI LE LETTER AI | |
589 | A85E..A861 ; Vowel # Lo [4] PHAGS-PA LETTER I..PHAGS-PA LETTER O | |
09edd811 | 590 | A866 ; Vowel # Lo PHAGS-PA LETTER EE |
bd84d130 KW |
591 | A922..A925 ; Vowel # Lo [4] KAYAH LI LETTER A..KAYAH LI LETTER OO |
592 | A926..A92A ; Vowel # Mn [5] KAYAH LI VOWEL UE..KAYAH LI VOWEL O | |
09edd811 | 593 | 11150..11154 ; Vowel # Lo [5] MAHAJANI LETTER A..MAHAJANI LETTER O |
bd84d130 KW |
594 | |
595 | # ================================================ | |
596 | ||
597 | # Indic_Syllabic_Category=Consonant_Placeholder | |
598 | ||
599 | # Consonant Placeholder | |
600 | # This includes generic placeholders used for | |
601 | # Indic script layout (NBSP and dotted circle), as well as a few script- | |
602 | # specific vowel-holder characters which are not technically | |
603 | # consonants, but serve instead as bases for placement of vowel marks. | |
604 | ||
605 | # [Not derivable] | |
606 | ||
09edd811 | 607 | 002D ; Consonant_Placeholder # Pd HYPHEN-MINUS |
bd84d130 | 608 | 00A0 ; Consonant_Placeholder # Zs NO-BREAK SPACE |
09edd811 | 609 | 00D7 ; Consonant_Placeholder # Sm MULTIPLICATION SIGN |
bd84d130 | 610 | 0A72..0A73 ; Consonant_Placeholder # Lo [2] GURMUKHI IRI..GURMUKHI URA |
09edd811 | 611 | 104E ; Consonant_Placeholder # Po MYANMAR SYMBOL AFOREMENTIONED |
bd84d130 | 612 | 1900 ; Consonant_Placeholder # Lo LIMBU VOWEL-CARRIER LETTER |
ac71d2a0 | 613 | 2010..2014 ; Consonant_Placeholder # Pd [5] HYPHEN..EM DASH |
bd84d130 KW |
614 | 25CC ; Consonant_Placeholder # So DOTTED CIRCLE |
615 | ||
616 | # ================================================ | |
617 | ||
618 | # Indic_Syllabic_Category=Consonant | |
619 | ||
620 | # Consonant (ordinary abugida consonants, with inherent vowels) | |
621 | ||
622 | # [Not derivable] | |
623 | ||
a9c9e371 | 624 | 0915..0939 ; Consonant # Lo [37] DEVANAGARI LETTER KA..DEVANAGARI LETTER HA |
bd84d130 | 625 | 0958..095F ; Consonant # Lo [8] DEVANAGARI LETTER QA..DEVANAGARI LETTER YYA |
09edd811 | 626 | 0978..097F ; Consonant # Lo [8] DEVANAGARI LETTER MARWARI DDA..DEVANAGARI LETTER BBA |
bd84d130 KW |
627 | 0995..09A8 ; Consonant # Lo [20] BENGALI LETTER KA..BENGALI LETTER NA |
628 | 09AA..09B0 ; Consonant # Lo [7] BENGALI LETTER PA..BENGALI LETTER RA | |
629 | 09B2 ; Consonant # Lo BENGALI LETTER LA | |
630 | 09B6..09B9 ; Consonant # Lo [4] BENGALI LETTER SHA..BENGALI LETTER HA | |
631 | 09DC..09DD ; Consonant # Lo [2] BENGALI LETTER RRA..BENGALI LETTER RHA | |
632 | 09DF ; Consonant # Lo BENGALI LETTER YYA | |
633 | 09F0..09F1 ; Consonant # Lo [2] BENGALI LETTER RA WITH MIDDLE DIAGONAL..BENGALI LETTER RA WITH LOWER DIAGONAL | |
634 | 0A15..0A28 ; Consonant # Lo [20] GURMUKHI LETTER KA..GURMUKHI LETTER NA | |
635 | 0A2A..0A30 ; Consonant # Lo [7] GURMUKHI LETTER PA..GURMUKHI LETTER RA | |
636 | 0A32..0A33 ; Consonant # Lo [2] GURMUKHI LETTER LA..GURMUKHI LETTER LLA | |
637 | 0A35..0A36 ; Consonant # Lo [2] GURMUKHI LETTER VA..GURMUKHI LETTER SHA | |
638 | 0A38..0A39 ; Consonant # Lo [2] GURMUKHI LETTER SA..GURMUKHI LETTER HA | |
639 | 0A59..0A5C ; Consonant # Lo [4] GURMUKHI LETTER KHHA..GURMUKHI LETTER RRA | |
640 | 0A5E ; Consonant # Lo GURMUKHI LETTER FA | |
641 | 0A95..0AA8 ; Consonant # Lo [20] GUJARATI LETTER KA..GUJARATI LETTER NA | |
642 | 0AAA..0AB0 ; Consonant # Lo [7] GUJARATI LETTER PA..GUJARATI LETTER RA | |
643 | 0AB2..0AB3 ; Consonant # Lo [2] GUJARATI LETTER LA..GUJARATI LETTER LLA | |
644 | 0AB5..0AB9 ; Consonant # Lo [5] GUJARATI LETTER VA..GUJARATI LETTER HA | |
ac71d2a0 | 645 | 0AF9 ; Consonant # Lo GUJARATI LETTER ZHA |
bd84d130 KW |
646 | 0B15..0B28 ; Consonant # Lo [20] ORIYA LETTER KA..ORIYA LETTER NA |
647 | 0B2A..0B30 ; Consonant # Lo [7] ORIYA LETTER PA..ORIYA LETTER RA | |
648 | 0B32..0B33 ; Consonant # Lo [2] ORIYA LETTER LA..ORIYA LETTER LLA | |
649 | 0B35..0B39 ; Consonant # Lo [5] ORIYA LETTER VA..ORIYA LETTER HA | |
650 | 0B5C..0B5D ; Consonant # Lo [2] ORIYA LETTER RRA..ORIYA LETTER RHA | |
651 | 0B5F ; Consonant # Lo ORIYA LETTER YYA | |
652 | 0B71 ; Consonant # Lo ORIYA LETTER WA | |
653 | 0B95 ; Consonant # Lo TAMIL LETTER KA | |
654 | 0B99..0B9A ; Consonant # Lo [2] TAMIL LETTER NGA..TAMIL LETTER CA | |
655 | 0B9C ; Consonant # Lo TAMIL LETTER JA | |
656 | 0B9E..0B9F ; Consonant # Lo [2] TAMIL LETTER NYA..TAMIL LETTER TTA | |
657 | 0BA3..0BA4 ; Consonant # Lo [2] TAMIL LETTER NNA..TAMIL LETTER TA | |
658 | 0BA8..0BAA ; Consonant # Lo [3] TAMIL LETTER NA..TAMIL LETTER PA | |
659 | 0BAE..0BB9 ; Consonant # Lo [12] TAMIL LETTER MA..TAMIL LETTER HA | |
660 | 0C15..0C28 ; Consonant # Lo [20] TELUGU LETTER KA..TELUGU LETTER NA | |
09edd811 | 661 | 0C2A..0C39 ; Consonant # Lo [16] TELUGU LETTER PA..TELUGU LETTER HA |
ac71d2a0 | 662 | 0C58..0C5A ; Consonant # Lo [3] TELUGU LETTER TSA..TELUGU LETTER RRRA |
bd84d130 KW |
663 | 0C95..0CA8 ; Consonant # Lo [20] KANNADA LETTER KA..KANNADA LETTER NA |
664 | 0CAA..0CB3 ; Consonant # Lo [10] KANNADA LETTER PA..KANNADA LETTER LLA | |
665 | 0CB5..0CB9 ; Consonant # Lo [5] KANNADA LETTER VA..KANNADA LETTER HA | |
666 | 0CDE ; Consonant # Lo KANNADA LETTER FA | |
667 | 0D15..0D3A ; Consonant # Lo [38] MALAYALAM LETTER KA..MALAYALAM LETTER TTTA | |
668 | 0D9A..0DB1 ; Consonant # Lo [24] SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA LETTER DANTAJA NAYANNA | |
669 | 0DB3..0DBB ; Consonant # Lo [9] SINHALA LETTER SANYAKA DAYANNA..SINHALA LETTER RAYANNA | |
670 | 0DBD ; Consonant # Lo SINHALA LETTER DANTAJA LAYANNA | |
671 | 0DC0..0DC6 ; Consonant # Lo [7] SINHALA LETTER VAYANNA..SINHALA LETTER FAYANNA | |
09edd811 | 672 | 0E01..0E2E ; Consonant # Lo [46] THAI CHARACTER KO KAI..THAI CHARACTER HO NOKHUK |
bd84d130 KW |
673 | 0E81..0E82 ; Consonant # Lo [2] LAO LETTER KO..LAO LETTER KHO SUNG |
674 | 0E84 ; Consonant # Lo LAO LETTER KHO TAM | |
675 | 0E87..0E88 ; Consonant # Lo [2] LAO LETTER NGO..LAO LETTER CO | |
676 | 0E8A ; Consonant # Lo LAO LETTER SO TAM | |
677 | 0E8D ; Consonant # Lo LAO LETTER NYO | |
678 | 0E94..0E97 ; Consonant # Lo [4] LAO LETTER DO..LAO LETTER THO TAM | |
679 | 0E99..0E9F ; Consonant # Lo [7] LAO LETTER NO..LAO LETTER FO SUNG | |
680 | 0EA1..0EA3 ; Consonant # Lo [3] LAO LETTER MO..LAO LETTER LO LING | |
681 | 0EA5 ; Consonant # Lo LAO LETTER LO LOOT | |
682 | 0EA7 ; Consonant # Lo LAO LETTER WO | |
683 | 0EAA..0EAB ; Consonant # Lo [2] LAO LETTER SO SUNG..LAO LETTER HO SUNG | |
684 | 0EAD..0EAE ; Consonant # Lo [2] LAO LETTER O..LAO LETTER HO TAM | |
09edd811 | 685 | 0EDC..0EDF ; Consonant # Lo [4] LAO HO NO..LAO LETTER KHMU NYO |
bd84d130 KW |
686 | 0F40..0F47 ; Consonant # Lo [8] TIBETAN LETTER KA..TIBETAN LETTER JA |
687 | 0F49..0F6C ; Consonant # Lo [36] TIBETAN LETTER NYA..TIBETAN LETTER RRA | |
688 | 1000..1020 ; Consonant # Lo [33] MYANMAR LETTER KA..MYANMAR LETTER LLA | |
689 | 103F ; Consonant # Lo MYANMAR LETTER GREAT SA | |
690 | 1050..1051 ; Consonant # Lo [2] MYANMAR LETTER SHA..MYANMAR LETTER SSA | |
691 | 105A..105D ; Consonant # Lo [4] MYANMAR LETTER MON NGA..MYANMAR LETTER MON BBE | |
692 | 1061 ; Consonant # Lo MYANMAR LETTER SGAW KAREN SHA | |
693 | 1065..1066 ; Consonant # Lo [2] MYANMAR LETTER WESTERN PWO KAREN THA..MYANMAR LETTER WESTERN PWO KAREN PWA | |
694 | 106E..1070 ; Consonant # Lo [3] MYANMAR LETTER EASTERN PWO KAREN NNA..MYANMAR LETTER EASTERN PWO KAREN GHWA | |
695 | 1075..1081 ; Consonant # Lo [13] MYANMAR LETTER SHAN KA..MYANMAR LETTER SHAN HA | |
696 | 108E ; Consonant # Lo MYANMAR LETTER RUMAI PALAUNG FA | |
697 | 1703..170C ; Consonant # Lo [10] TAGALOG LETTER KA..TAGALOG LETTER YA | |
698 | 170E..1711 ; Consonant # Lo [4] TAGALOG LETTER LA..TAGALOG LETTER HA | |
699 | 1723..1731 ; Consonant # Lo [15] HANUNOO LETTER KA..HANUNOO LETTER HA | |
700 | 1743..1751 ; Consonant # Lo [15] BUHID LETTER KA..BUHID LETTER HA | |
701 | 1763..176C ; Consonant # Lo [10] TAGBANWA LETTER KA..TAGBANWA LETTER YA | |
702 | 176E..1770 ; Consonant # Lo [3] TAGBANWA LETTER LA..TAGBANWA LETTER SA | |
703 | 1780..17A2 ; Consonant # Lo [35] KHMER LETTER KA..KHMER LETTER QA | |
09edd811 | 704 | 1901..191E ; Consonant # Lo [30] LIMBU LETTER KA..LIMBU LETTER TRA |
bd84d130 KW |
705 | 1950..1962 ; Consonant # Lo [19] TAI LE LETTER KA..TAI LE LETTER NA |
706 | 1980..19AB ; Consonant # Lo [44] NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETTER LOW SUA | |
707 | 1A00..1A16 ; Consonant # Lo [23] BUGINESE LETTER KA..BUGINESE LETTER HA | |
708 | 1A20..1A4C ; Consonant # Lo [45] TAI THAM LETTER HIGH KA..TAI THAM LETTER LOW HA | |
709 | 1A53..1A54 ; Consonant # Lo [2] TAI THAM LETTER LAE..TAI THAM LETTER GREAT SA | |
710 | 1B13..1B33 ; Consonant # Lo [33] BALINESE LETTER KA..BALINESE LETTER HA | |
711 | 1B45..1B4B ; Consonant # Lo [7] BALINESE LETTER KAF SASAK..BALINESE LETTER ASYURA SASAK | |
712 | 1B8A..1BA0 ; Consonant # Lo [23] SUNDANESE LETTER KA..SUNDANESE LETTER HA | |
713 | 1BAE..1BAF ; Consonant # Lo [2] SUNDANESE LETTER KHA..SUNDANESE LETTER SYA | |
7620cb10 | 714 | 1BBB..1BBD ; Consonant # Lo [3] SUNDANESE LETTER REU..SUNDANESE LETTER BHA |
bd84d130 KW |
715 | 1BC0..1BE3 ; Consonant # Lo [36] BATAK LETTER A..BATAK LETTER MBA |
716 | 1C00..1C23 ; Consonant # Lo [36] LEPCHA LETTER KA..LEPCHA LETTER A | |
717 | 1C4D..1C4F ; Consonant # Lo [3] LEPCHA LETTER TTA..LEPCHA LETTER DDA | |
718 | A807..A80A ; Consonant # Lo [4] SYLOTI NAGRI LETTER KO..SYLOTI NAGRI LETTER GHO | |
719 | A80C..A822 ; Consonant # Lo [23] SYLOTI NAGRI LETTER CO..SYLOTI NAGRI LETTER HO | |
720 | A840..A85D ; Consonant # Lo [30] PHAGS-PA LETTER KA..PHAGS-PA LETTER A | |
721 | A862..A865 ; Consonant # Lo [4] PHAGS-PA LETTER QA..PHAGS-PA LETTER GGA | |
722 | A869..A870 ; Consonant # Lo [8] PHAGS-PA LETTER TTA..PHAGS-PA LETTER ASPIRATED FA | |
723 | A872 ; Consonant # Lo PHAGS-PA SUPERFIXED LETTER RA | |
724 | A892..A8B3 ; Consonant # Lo [34] SAURASHTRA LETTER KA..SAURASHTRA LETTER LLA | |
725 | A90A..A921 ; Consonant # Lo [24] KAYAH LI LETTER KA..KAYAH LI LETTER CA | |
726 | A930..A946 ; Consonant # Lo [23] REJANG LETTER KA..REJANG LETTER A | |
727 | A989..A98B ; Consonant # Lo [3] JAVANESE LETTER PA CEREK..JAVANESE LETTER NGA LELET RASWADI | |
a9c9e371 | 728 | A98F..A9B2 ; Consonant # Lo [36] JAVANESE LETTER KA..JAVANESE LETTER HA |
09edd811 KW |
729 | A9E0..A9E4 ; Consonant # Lo [5] MYANMAR LETTER SHAN GHA..MYANMAR LETTER SHAN BHA |
730 | A9E7..A9EF ; Consonant # Lo [9] MYANMAR LETTER TAI LAING NYA..MYANMAR LETTER TAI LAING NNA | |
731 | A9FA..A9FE ; Consonant # Lo [5] MYANMAR LETTER TAI LAING LLA..MYANMAR LETTER TAI LAING BHA | |
bd84d130 KW |
732 | AA06..AA28 ; Consonant # Lo [35] CHAM LETTER KA..CHAM LETTER HA |
733 | AA60..AA6F ; Consonant # Lo [16] MYANMAR LETTER KHAMTI GA..MYANMAR LETTER KHAMTI FA | |
09edd811 | 734 | AA71..AA73 ; Consonant # Lo [3] MYANMAR LETTER KHAMTI XA..MYANMAR LETTER KHAMTI RA |
bd84d130 | 735 | AA7A ; Consonant # Lo MYANMAR LETTER AITON RA |
09edd811 | 736 | AA7E..AA7F ; Consonant # Lo [2] MYANMAR LETTER SHWE PALAUNG CHA..MYANMAR LETTER SHWE PALAUNG SHA |
bd84d130 | 737 | AA80..AAAF ; Consonant # Lo [48] TAI VIET LETTER LOW KO..TAI VIET LETTER HIGH O |
7620cb10 KW |
738 | AAE2..AAEA ; Consonant # Lo [9] MEETEI MAYEK LETTER CHA..MEETEI MAYEK LETTER SSA |
739 | ABC0..ABCD ; Consonant # Lo [14] MEETEI MAYEK LETTER KOK..MEETEI MAYEK LETTER HUK | |
740 | ABD0 ; Consonant # Lo MEETEI MAYEK LETTER PHAM | |
741 | ABD2..ABDA ; Consonant # Lo [9] MEETEI MAYEK LETTER GOK..MEETEI MAYEK LETTER BHAM | |
bd84d130 KW |
742 | 10A00 ; Consonant # Lo KHAROSHTHI LETTER A |
743 | 10A10..10A13 ; Consonant # Lo [4] KHAROSHTHI LETTER KA..KHAROSHTHI LETTER GHA | |
744 | 10A15..10A17 ; Consonant # Lo [3] KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA | |
745 | 10A19..10A33 ; Consonant # Lo [27] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTTHA | |
746 | 11013..11037 ; Consonant # Lo [37] BRAHMI LETTER KA..BRAHMI LETTER OLD TAMIL NNNA | |
747 | 1108D..110AF ; Consonant # Lo [35] KAITHI LETTER KA..KAITHI LETTER HA | |
7620cb10 | 748 | 11107..11126 ; Consonant # Lo [32] CHAKMA LETTER KAA..CHAKMA LETTER HAA |
09edd811 | 749 | 11155..11172 ; Consonant # Lo [30] MAHAJANI LETTER KA..MAHAJANI LETTER RRA |
7620cb10 | 750 | 11191..111B2 ; Consonant # Lo [34] SHARADA LETTER KA..SHARADA LETTER HA |
09edd811 KW |
751 | 11208..11211 ; Consonant # Lo [10] KHOJKI LETTER KA..KHOJKI LETTER JJA |
752 | 11213..1122B ; Consonant # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA | |
ac71d2a0 UC |
753 | 11284..11286 ; Consonant # Lo [3] MULTANI LETTER KA..MULTANI LETTER GA |
754 | 11288 ; Consonant # Lo MULTANI LETTER GHA | |
755 | 1128A..1128D ; Consonant # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA | |
756 | 1128F..1129D ; Consonant # Lo [15] MULTANI LETTER NYA..MULTANI LETTER BA | |
757 | 1129F..112A8 ; Consonant # Lo [10] MULTANI LETTER BHA..MULTANI LETTER RHA | |
09edd811 KW |
758 | 112BA..112DE ; Consonant # Lo [37] KHUDAWADI LETTER KA..KHUDAWADI LETTER HA |
759 | 11315..11328 ; Consonant # Lo [20] GRANTHA LETTER KA..GRANTHA LETTER NA | |
760 | 1132A..11330 ; Consonant # Lo [7] GRANTHA LETTER PA..GRANTHA LETTER RA | |
761 | 11332..11333 ; Consonant # Lo [2] GRANTHA LETTER LA..GRANTHA LETTER LLA | |
762 | 11335..11339 ; Consonant # Lo [5] GRANTHA LETTER VA..GRANTHA LETTER HA | |
763 | 1148F..114AF ; Consonant # Lo [33] TIRHUTA LETTER KA..TIRHUTA LETTER HA | |
764 | 1158E..115AE ; Consonant # Lo [33] SIDDHAM LETTER KA..SIDDHAM LETTER HA | |
765 | 1160E..1162F ; Consonant # Lo [34] MODI LETTER KA..MODI LETTER LLA | |
766 | 1168A..116AA ; Consonant # Lo [33] TAKRI LETTER KA..TAKRI LETTER RRA | |
ac71d2a0 | 767 | 11700..11719 ; Consonant # Lo [26] AHOM LETTER KA..AHOM LETTER JHA |
bd84d130 KW |
768 | |
769 | # ================================================ | |
770 | ||
771 | # Indic_Syllabic_Category=Consonant_Dead | |
772 | ||
773 | # Dead Consonant (special consonant with killed vowel) | |
774 | ||
775 | # [Not derivable] | |
776 | ||
777 | 09CE ; Consonant_Dead # Lo BENGALI LETTER KHANDA TA | |
778 | 0D7A..0D7F ; Consonant_Dead # Lo [6] MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER CHILLU K | |
779 | ||
780 | # ================================================ | |
781 | ||
ac71d2a0 UC |
782 | # Indic_Syllabic_Category=Consonant_With_Stacker |
783 | ||
784 | # Consonants that may make stacked ligatures with the next consonant | |
785 | # without the use of a virama | |
786 | ||
787 | # [Not derivable] | |
788 | ||
789 | 0CF1..0CF2 ; Consonant_With_Stacker # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA | |
790 | 11003..11004 ; Consonant_With_Stacker # Lo [2] BRAHMI SIGN JIHVAMULIYA..BRAHMI SIGN UPADHMANIYA | |
791 | ||
792 | # ================================================ | |
793 | ||
794 | # Indic_Syllabic_Category=Consonant_Prefixed | |
795 | ||
796 | # Cluster-intial consonants | |
797 | ||
798 | # [Not derivable] | |
799 | ||
800 | 111C2..111C3 ; Consonant_Prefixed # Lo [2] SHARADA SIGN JIHVAMULIYA..SHARADA SIGN UPADHMANIYA | |
801 | ||
802 | # ================================================ | |
803 | ||
09edd811 | 804 | # Indic_Syllabic_Category=Consonant_Preceding_Repha |
bd84d130 | 805 | |
09edd811 | 806 | # Repha Form of RA (reanalyzed in some scripts), when preceding the main consonant |
bd84d130 KW |
807 | |
808 | # [Not derivable] | |
809 | ||
09edd811 KW |
810 | 0D4E ; Consonant_Preceding_Repha # Lo MALAYALAM LETTER DOT REPH |
811 | ||
812 | # ================================================ | |
813 | ||
814 | # Indic_Syllabic_Category=Consonant_Succeeding_Repha | |
815 | ||
816 | # Repha Form of RA (reanalyzed in some scripts), when succeeding the main consonant | |
817 | ||
818 | # [Not derivable] | |
819 | ||
820 | 17CC ; Consonant_Succeeding_Repha # Mn KHMER SIGN ROBAT | |
821 | 1B03 ; Consonant_Succeeding_Repha # Mn BALINESE SIGN SURANG | |
822 | 1B81 ; Consonant_Succeeding_Repha # Mn SUNDANESE SIGN PANGLAYAR | |
823 | A982 ; Consonant_Succeeding_Repha # Mn JAVANESE SIGN LAYAR | |
bd84d130 KW |
824 | |
825 | # ================================================ | |
826 | ||
827 | # Indic_Syllabic_Category=Consonant_Subjoined | |
828 | ||
829 | # Subjoined Consonant (C2 form subtending a base consonant in Tibetan, etc.) | |
830 | ||
831 | # [Not derivable] | |
832 | ||
833 | 0F8D..0F97 ; Consonant_Subjoined # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA | |
834 | 0F99..0FBC ; Consonant_Subjoined # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA | |
835 | 1929..192B ; Consonant_Subjoined # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA | |
836 | 1BA1 ; Consonant_Subjoined # Mc SUNDANESE CONSONANT SIGN PAMINGKAL | |
837 | 1BA2..1BA3 ; Consonant_Subjoined # Mn [2] SUNDANESE CONSONANT SIGN PANYAKRA..SUNDANESE CONSONANT SIGN PANYIKU | |
09edd811 | 838 | 1BAC..1BAD ; Consonant_Subjoined # Mn [2] SUNDANESE CONSONANT SIGN PASANGAN MA..SUNDANESE CONSONANT SIGN PASANGAN WA |
bd84d130 KW |
839 | 1C24..1C25 ; Consonant_Subjoined # Mc [2] LEPCHA SUBJOINED LETTER YA..LEPCHA SUBJOINED LETTER RA |
840 | A867..A868 ; Consonant_Subjoined # Lo [2] PHAGS-PA SUBJOINED LETTER WA..PHAGS-PA SUBJOINED LETTER YA | |
841 | A871 ; Consonant_Subjoined # Lo PHAGS-PA SUBJOINED LETTER RA | |
842 | A9BD ; Consonant_Subjoined # Mc JAVANESE CONSONANT SIGN KERET | |
843 | ||
844 | # ================================================ | |
845 | ||
846 | # Indic_Syllabic_Category=Consonant_Medial | |
847 | ||
848 | # Medial Consonant (medial liquid, occurring in clusters) | |
849 | ||
850 | # [Not derivable] | |
851 | ||
852 | 0A75 ; Consonant_Medial # Mn GURMUKHI SIGN YAKASH | |
853 | 0EBC ; Consonant_Medial # Mn LAO SEMIVOWEL SIGN LO | |
854 | 0EBD ; Consonant_Medial # Lo LAO SEMIVOWEL SIGN NYO | |
855 | 103B..103C ; Consonant_Medial # Mc [2] MYANMAR CONSONANT SIGN MEDIAL YA..MYANMAR CONSONANT SIGN MEDIAL RA | |
856 | 103D..103E ; Consonant_Medial # Mn [2] MYANMAR CONSONANT SIGN MEDIAL WA..MYANMAR CONSONANT SIGN MEDIAL HA | |
857 | 105E..1060 ; Consonant_Medial # Mn [3] MYANMAR CONSONANT SIGN MON MEDIAL NA..MYANMAR CONSONANT SIGN MON MEDIAL LA | |
858 | 1082 ; Consonant_Medial # Mn MYANMAR CONSONANT SIGN SHAN MEDIAL WA | |
859 | 1A55 ; Consonant_Medial # Mc TAI THAM CONSONANT SIGN MEDIAL RA | |
860 | 1A56 ; Consonant_Medial # Mn TAI THAM CONSONANT SIGN MEDIAL LA | |
a9c9e371 | 861 | A9BE..A9BF ; Consonant_Medial # Mc [2] JAVANESE CONSONANT SIGN PENGKAL..JAVANESE CONSONANT SIGN CAKRA |
bd84d130 KW |
862 | AA33..AA34 ; Consonant_Medial # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA |
863 | AA35..AA36 ; Consonant_Medial # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA | |
ac71d2a0 | 864 | 1171D..1171F ; Consonant_Medial # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA |
bd84d130 KW |
865 | |
866 | # ================================================ | |
867 | ||
868 | # Indic_Syllabic_Category=Consonant_Final | |
869 | ||
870 | # Final Consonant (special final forms which do not take vowels) | |
871 | ||
872 | # [Not derivable] | |
873 | ||
874 | 1930..1931 ; Consonant_Final # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA | |
875 | 1933..1938 ; Consonant_Final # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA | |
ac71d2a0 | 876 | 1939 ; Consonant_Final # Mn LIMBU SIGN MUKPHRENG |
bd84d130 KW |
877 | 19C1..19C7 ; Consonant_Final # Lo [7] NEW TAI LUE LETTER FINAL V..NEW TAI LUE LETTER FINAL B |
878 | 1A57 ; Consonant_Final # Mc TAI THAM CONSONANT SIGN LA TANG LAI | |
879 | 1A58..1A5E ; Consonant_Final # Mn [7] TAI THAM SIGN MAI KANG LAI..TAI THAM CONSONANT SIGN SA | |
7620cb10 | 880 | 1BBE..1BBF ; Consonant_Final # Lo [2] SUNDANESE LETTER FINAL K..SUNDANESE LETTER FINAL M |
bd84d130 KW |
881 | 1BF0..1BF1 ; Consonant_Final # Mn [2] BATAK CONSONANT SIGN NG..BATAK CONSONANT SIGN H |
882 | 1C2D..1C33 ; Consonant_Final # Mn [7] LEPCHA CONSONANT SIGN K..LEPCHA CONSONANT SIGN T | |
883 | A8B4 ; Consonant_Final # Mc SAURASHTRA CONSONANT SIGN HAARU | |
884 | A94F..A951 ; Consonant_Final # Mn [3] REJANG CONSONANT SIGN NG..REJANG CONSONANT SIGN R | |
885 | A952 ; Consonant_Final # Mc REJANG CONSONANT SIGN H | |
886 | AA40..AA42 ; Consonant_Final # Lo [3] CHAM LETTER FINAL K..CHAM LETTER FINAL NG | |
887 | AA43 ; Consonant_Final # Mn CHAM CONSONANT SIGN FINAL NG | |
888 | AA44..AA4B ; Consonant_Final # Lo [8] CHAM LETTER FINAL CH..CHAM LETTER FINAL SS | |
889 | AA4C ; Consonant_Final # Mn CHAM CONSONANT SIGN FINAL M | |
890 | AA4D ; Consonant_Final # Mc CHAM CONSONANT SIGN FINAL H | |
891 | ABDB..ABE2 ; Consonant_Final # Lo [8] MEETEI MAYEK LETTER KOK LONSUM..MEETEI MAYEK LETTER I LONSUM | |
892 | ||
893 | # ================================================ | |
894 | ||
895 | # Indic_Syllabic_Category=Consonant_Head_Letter | |
896 | ||
897 | # Head Letter (Tibetan) | |
898 | ||
899 | # [Not derivable] | |
900 | ||
901 | 0F88..0F8C ; Consonant_Head_Letter # Lo [5] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN INVERTED MCHU CAN | |
902 | ||
903 | # ================================================ | |
904 | ||
905 | # Indic_Syllabic_Category=Modifying_Letter | |
906 | ||
907 | # Reanalyzed letters not participating in the abugida structure, but | |
908 | # serving to modify the sound of an adjacent vowel or consonant. | |
909 | # Note that this is not the same as General_Category=Modifier_Letter. | |
910 | ||
911 | # [Not derivable] | |
912 | ||
09edd811 | 913 | 0B83 ; Modifying_Letter # Lo TAMIL SIGN VISARGA |
bd84d130 KW |
914 | |
915 | # ================================================ | |
916 | ||
917 | # Indic_Syllabic_Category=Tone_Letter | |
918 | ||
919 | # Tone Letter (spacing lexical tone mark with status as a letter) | |
920 | ||
921 | # [Not derivable] | |
922 | ||
923 | 1970..1974 ; Tone_Letter # Lo [5] TAI LE LETTER TONE-2..TAI LE LETTER TONE-6 | |
924 | AAC0 ; Tone_Letter # Lo TAI VIET TONE MAI NUENG | |
925 | AAC2 ; Tone_Letter # Lo TAI VIET TONE MAI SONG | |
926 | ||
927 | # ================================================ | |
928 | ||
929 | # Indic_Syllabic_Category=Tone_Mark | |
930 | ||
931 | # Tone Mark (nonspacing or spacing lexical tone mark) | |
bd84d130 KW |
932 | |
933 | # [Not derivable] | |
934 | ||
935 | 0E48..0E4B ; Tone_Mark # Mn [4] THAI CHARACTER MAI EK..THAI CHARACTER MAI CHATTAWA | |
936 | 0EC8..0ECB ; Tone_Mark # Mn [4] LAO TONE MAI EK..LAO TONE MAI CATAWA | |
937 | 1037 ; Tone_Mark # Mn MYANMAR SIGN DOT BELOW | |
938 | 1063..1064 ; Tone_Mark # Mc [2] MYANMAR TONE MARK SGAW KAREN HATHI..MYANMAR TONE MARK SGAW KAREN KE PHO | |
939 | 1069..106D ; Tone_Mark # Mc [5] MYANMAR SIGN WESTERN PWO KAREN TONE-1..MYANMAR SIGN WESTERN PWO KAREN TONE-5 | |
940 | 1087..108C ; Tone_Mark # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3 | |
941 | 108D ; Tone_Mark # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE | |
942 | 108F ; Tone_Mark # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5 | |
943 | 109A..109B ; Tone_Mark # Mc [2] MYANMAR SIGN KHAMTI TONE-1..MYANMAR SIGN KHAMTI TONE-3 | |
ac71d2a0 | 944 | 19C8..19C9 ; Tone_Mark # Lo [2] NEW TAI LUE TONE MARK-1..NEW TAI LUE TONE MARK-2 |
bd84d130 KW |
945 | 1A75..1A79 ; Tone_Mark # Mn [5] TAI THAM SIGN TONE-1..TAI THAM SIGN KHUEN TONE-5 |
946 | A92B..A92D ; Tone_Mark # Mn [3] KAYAH LI TONE PLOPHU..KAYAH LI TONE CALYA PLOPHU | |
947 | AA7B ; Tone_Mark # Mc MYANMAR SIGN PAO KAREN TONE | |
09edd811 KW |
948 | AA7C ; Tone_Mark # Mn MYANMAR SIGN TAI LAING TONE-2 |
949 | AA7D ; Tone_Mark # Mc MYANMAR SIGN TAI LAING TONE-5 | |
bd84d130 KW |
950 | AABF ; Tone_Mark # Mn TAI VIET TONE MAI EK |
951 | AAC1 ; Tone_Mark # Mn TAI VIET TONE MAI THO | |
952 | ABEC ; Tone_Mark # Mc MEETEI MAYEK LUM IYEK | |
953 | ||
954 | # ================================================ | |
955 | ||
09edd811 KW |
956 | # Indic_Syllabic_Category=Gemination_Mark |
957 | ||
958 | # Gemination Mark (doubling of the preceding or following consonant) | |
959 | ||
960 | # [Not derivable] | |
961 | ||
962 | 0A71 ; Gemination_Mark # Mn GURMUKHI ADDAK | |
963 | 11237 ; Gemination_Mark # Mn KHOJKI SIGN SHADDA | |
964 | ||
965 | # ================================================ | |
966 | ||
967 | # Indic_Syllabic_Category=Cantillation_Mark | |
968 | ||
969 | # Cantillation Mark (recitation marks, such as svara markers for the Samaveda) | |
970 | ||
971 | # [Not derivable] | |
972 | ||
ac71d2a0 UC |
973 | 0951..0952 ; Cantillation_Mark # Mn [2] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI STRESS SIGN ANUDATTA |
974 | 1CD0..1CD2 ; Cantillation_Mark # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA | |
975 | 1CD4..1CE0 ; Cantillation_Mark # Mn [13] VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC TONE RIGVEDIC KASHMIRI INDEPENDENT SVARITA | |
976 | 1CE1 ; Cantillation_Mark # Mc VEDIC TONE ATHARVAVEDIC INDEPENDENT SVARITA | |
977 | 1CF4 ; Cantillation_Mark # Mn VEDIC TONE CANDRA ABOVE | |
978 | 1CF8..1CF9 ; Cantillation_Mark # Mn [2] VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING ABOVE | |
09edd811 KW |
979 | A8E0..A8F1 ; Cantillation_Mark # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..COMBINING DEVANAGARI SIGN AVAGRAHA |
980 | 11366..1136C ; Cantillation_Mark # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX | |
981 | 11370..11374 ; Cantillation_Mark # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA | |
982 | ||
983 | # ================================================ | |
984 | ||
bd84d130 KW |
985 | # Indic_Syllabic_Category=Register_Shifter |
986 | ||
987 | # Register Shifter (shifts register for consonants, akin to a tone mark) | |
988 | ||
989 | # [Not derivable] | |
990 | ||
ac71d2a0 UC |
991 | 17C9..17CA ; Register_Shifter # Mn [2] KHMER SIGN MUUSIKATOAN..KHMER SIGN TRIISAP |
992 | ||
993 | # ================================================ | |
994 | ||
995 | # Indic_Syllabic_Category=Syllable_Modifier | |
996 | ||
997 | # Syllable Modifier (miscellaneous combining characters that modify | |
998 | # something in the orthographic syllable they succeed) | |
999 | ||
1000 | # [Not derivable] | |
1001 | ||
1002 | 00B2..00B3 ; Syllable_Modifier # No [2] SUPERSCRIPT TWO..SUPERSCRIPT THREE | |
1003 | 0F35 ; Syllable_Modifier # Mn TIBETAN MARK NGAS BZUNG NYI ZLA | |
1004 | 0F37 ; Syllable_Modifier # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS | |
1005 | 0FC6 ; Syllable_Modifier # Mn TIBETAN SYMBOL PADMA GDAN | |
1006 | 17CB ; Syllable_Modifier # Mn KHMER SIGN BANTOC | |
1007 | 17CE..17D0 ; Syllable_Modifier # Mn [3] KHMER SIGN KAKABAT..KHMER SIGN SAMYOK SANNYA | |
1008 | 17D3 ; Syllable_Modifier # Mn KHMER SIGN BATHAMASAT | |
1009 | 193B ; Syllable_Modifier # Mn LIMBU SIGN SA-I | |
1010 | 1A7A..1A7C ; Syllable_Modifier # Mn [3] TAI THAM SIGN RA HAAM..TAI THAM SIGN KHUEN-LUE KARAN | |
1011 | 1A7F ; Syllable_Modifier # Mn TAI THAM COMBINING CRYPTOGRAMMIC DOT | |
1012 | 1C36 ; Syllable_Modifier # Mn LEPCHA SIGN RAN | |
1013 | 2074 ; Syllable_Modifier # No SUPERSCRIPT FOUR | |
1014 | 2082..2084 ; Syllable_Modifier # No [3] SUBSCRIPT TWO..SUBSCRIPT FOUR | |
1015 | ||
1016 | # ================================================ | |
1017 | ||
1018 | # Indic_Syllabic_Category=Consonant_Killer | |
1019 | ||
1020 | # Consonant Killer (signifies that the previous consonant or consonants are | |
1021 | # not pronounced) | |
1022 | ||
1023 | # [Not derivable] | |
1024 | ||
1025 | 0E4C ; Consonant_Killer # Mn THAI CHARACTER THANTHAKHAT | |
1026 | 17CD ; Consonant_Killer # Mn KHMER SIGN TOANDAKHIAT | |
09edd811 KW |
1027 | |
1028 | # ================================================ | |
1029 | ||
1030 | # Indic_Syllabic_Category=Non_Joiner | |
1031 | ||
1032 | # Non_Joiner (Zero Width Non-Joiner) | |
1033 | ||
1034 | # [Not derivable] | |
1035 | ||
1036 | 200C ; Non_Joiner # Cf ZERO WIDTH NON-JOINER | |
1037 | ||
1038 | # ================================================ | |
1039 | ||
1040 | # Indic_Syllabic_Category=Joiner | |
1041 | ||
1042 | # Joiner (Zero Width Joiner) | |
1043 | ||
1044 | # [Not derivable] | |
1045 | ||
1046 | 200D ; Joiner # Cf ZERO WIDTH JOINER | |
1047 | ||
1048 | # ================================================ | |
1049 | ||
1050 | # Indic_Syllabic_Category=Number_Joiner | |
1051 | ||
1052 | # Number_Joiner (forms ligatures between numbers for multiplication) | |
1053 | ||
1054 | # [Not derivable] | |
1055 | ||
1056 | 1107F ; Number_Joiner # Mn BRAHMI NUMBER JOINER | |
1057 | ||
1058 | # ================================================ | |
1059 | ||
1060 | # Indic_Syllabic_Category=Number | |
1061 | ||
ac71d2a0 | 1062 | # Number (can be used as vowel-holders like consonant placeholders) |
09edd811 KW |
1063 | |
1064 | # [Not derivable] | |
1065 | ||
1066 | 0030..0039 ; Number # Nd [10] DIGIT ZERO..DIGIT NINE | |
1067 | 0966..096F ; Number # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE | |
1068 | 09E6..09EF ; Number # Nd [10] BENGALI DIGIT ZERO..BENGALI DIGIT NINE | |
1069 | 0A66..0A6F ; Number # Nd [10] GURMUKHI DIGIT ZERO..GURMUKHI DIGIT NINE | |
1070 | 0AE6..0AEF ; Number # Nd [10] GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE | |
1071 | 0B66..0B6F ; Number # Nd [10] ORIYA DIGIT ZERO..ORIYA DIGIT NINE | |
1072 | 0BE6..0BEF ; Number # Nd [10] TAMIL DIGIT ZERO..TAMIL DIGIT NINE | |
1073 | 0C66..0C6F ; Number # Nd [10] TELUGU DIGIT ZERO..TELUGU DIGIT NINE | |
1074 | 0CE6..0CEF ; Number # Nd [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE | |
1075 | 0D66..0D6F ; Number # Nd [10] MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE | |
1076 | 0DE6..0DEF ; Number # Nd [10] SINHALA LITH DIGIT ZERO..SINHALA LITH DIGIT NINE | |
1077 | 0E50..0E59 ; Number # Nd [10] THAI DIGIT ZERO..THAI DIGIT NINE | |
1078 | 0ED0..0ED9 ; Number # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE | |
1079 | 0F20..0F29 ; Number # Nd [10] TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE | |
1080 | 0F2A..0F33 ; Number # No [10] TIBETAN DIGIT HALF ONE..TIBETAN DIGIT HALF ZERO | |
1081 | 1040..1049 ; Number # Nd [10] MYANMAR DIGIT ZERO..MYANMAR DIGIT NINE | |
1082 | 1090..1099 ; Number # Nd [10] MYANMAR SHAN DIGIT ZERO..MYANMAR SHAN DIGIT NINE | |
1083 | 17E0..17E9 ; Number # Nd [10] KHMER DIGIT ZERO..KHMER DIGIT NINE | |
1084 | 1946..194F ; Number # Nd [10] LIMBU DIGIT ZERO..LIMBU DIGIT NINE | |
1085 | 19D0..19D9 ; Number # Nd [10] NEW TAI LUE DIGIT ZERO..NEW TAI LUE DIGIT NINE | |
1086 | 1A80..1A89 ; Number # Nd [10] TAI THAM HORA DIGIT ZERO..TAI THAM HORA DIGIT NINE | |
1087 | 1A90..1A99 ; Number # Nd [10] TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGIT NINE | |
1088 | 1B50..1B59 ; Number # Nd [10] BALINESE DIGIT ZERO..BALINESE DIGIT NINE | |
1089 | 1BB0..1BB9 ; Number # Nd [10] SUNDANESE DIGIT ZERO..SUNDANESE DIGIT NINE | |
1090 | 1C40..1C49 ; Number # Nd [10] LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE | |
1091 | A8D0..A8D9 ; Number # Nd [10] SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE | |
1092 | A900..A909 ; Number # Nd [10] KAYAH LI DIGIT ZERO..KAYAH LI DIGIT NINE | |
1093 | A9D0..A9D9 ; Number # Nd [10] JAVANESE DIGIT ZERO..JAVANESE DIGIT NINE | |
1094 | A9F0..A9F9 ; Number # Nd [10] MYANMAR TAI LAING DIGIT ZERO..MYANMAR TAI LAING DIGIT NINE | |
1095 | AA50..AA59 ; Number # Nd [10] CHAM DIGIT ZERO..CHAM DIGIT NINE | |
1096 | ABF0..ABF9 ; Number # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE | |
1097 | 10A40..10A47 ; Number # No [8] KHAROSHTHI DIGIT ONE..KHAROSHTHI NUMBER ONE THOUSAND | |
1098 | 11066..1106F ; Number # Nd [10] BRAHMI DIGIT ZERO..BRAHMI DIGIT NINE | |
1099 | 11136..1113F ; Number # Nd [10] CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE | |
1100 | 111D0..111D9 ; Number # Nd [10] SHARADA DIGIT ZERO..SHARADA DIGIT NINE | |
1101 | 111E1..111F4 ; Number # No [20] SINHALA ARCHAIC DIGIT ONE..SINHALA ARCHAIC NUMBER ONE THOUSAND | |
1102 | 112F0..112F9 ; Number # Nd [10] KHUDAWADI DIGIT ZERO..KHUDAWADI DIGIT NINE | |
1103 | 114D0..114D9 ; Number # Nd [10] TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE | |
1104 | 11650..11659 ; Number # Nd [10] MODI DIGIT ZERO..MODI DIGIT NINE | |
1105 | 116C0..116C9 ; Number # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE | |
ac71d2a0 UC |
1106 | 11730..11739 ; Number # Nd [10] AHOM DIGIT ZERO..AHOM DIGIT NINE |
1107 | 1173A..1173B ; Number # No [2] AHOM NUMBER TEN..AHOM NUMBER TWENTY | |
09edd811 KW |
1108 | |
1109 | # ================================================ | |
1110 | ||
1111 | # Indic_Syllabic_Category=Brahmi_Joining_Number | |
1112 | ||
ac71d2a0 UC |
1113 | # Brahmi Joining Number (similar to Number in that in can be used as |
1114 | # vowel-holders like Consonant_Placeholder, but may also be joined by | |
1115 | # a Number_Joiner of the same script, e.g. in Brahmi) | |
09edd811 KW |
1116 | |
1117 | # [Not derivable] | |
1118 | ||
1119 | 11052..11065 ; Brahmi_Joining_Number # No [20] BRAHMI NUMBER ONE..BRAHMI NUMBER ONE THOUSAND | |
bd84d130 KW |
1120 | |
1121 | # EOF |