# ArabicShaping-5.0.0.txt
# Date: 2006-07-14, 11:23:00 PST [KW]
#
# This file is a normative contributory data file in the
# Unicode Character Database.
#
# Copyright (c) 1991-2006 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# This file defines the shaping classes for Arabic and Syriac
# positional shaping, repeating in machine readable form the
# information printed in Tables 8-3, 8-7, 8-8, 8-11, 8-12, and
# 8-13 of The Unicode Standard, Version 4.0.
#
# See sections 8.2 and 8.3 of The Unicode Standard, Version 4.0
# for more information.
#
# Each line contains four fields, separated by a semicolon.
#
# Field 0: the code point, in 4-digit hexadecimal
#          form, of an Arabic or Syriac character.
# Field 1: gives a short schematic name for that character,
#          abbreviated from the normative Unicode character name.
# Field 2: defines the joining type (property name: Joining_Type)
#              R Right_Joining
#              L Left_Joining
#              D Dual_Joining
#              C Join_Causing
#              U Non_Joining
#              T Transparent
#          See the Arabic block description for more information on these types.
# Field 3: defines the joining group (property name: Joining_Group)
#
# The values of the joining group are based schematically on character
# names. Where a schematic character name consists of two or more parts separated
# by spaces, the formal Joining_Group property value, as specified in
# PropertyValueAliases.txt, consists of the same name parts joined by
# underscores. Hence, the entry:
#
#     0629; TEH MARBUTA; R; TEH MARBUTA
#
# corresponds to [Joining_Group = Teh_Marbuta].
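
The naming rule above can be sketched in a few lines of Python. This is only an illustration: the function name `joining_group_value` is hypothetical, and the capitalization of each part is inferred from the Teh_Marbuta example rather than stated by the file. The authoritative spellings are the ones in PropertyValueAliases.txt.

```python
def joining_group_value(field: str) -> str:
    """Turn a field-3 schematic name into a Joining_Group-style value.

    Assumption: each space-separated part is written with an initial
    capital, as in the file's own example (TEH MARBUTA -> Teh_Marbuta).
    """
    field = field.strip()
    if "_" in field:
        # Already in property-value form, e.g. "No_Joining_Group".
        return field
    return "_".join(part.capitalize() for part in field.split())


print(joining_group_value("TEH MARBUTA"))       # Teh_Marbuta
print(joining_group_value("No_Joining_Group"))  # No_Joining_Group
```

A lookup against PropertyValueAliases.txt would be the safer choice in real code, since string manipulation cannot detect a misspelled or unknown group name.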
#
# Note: For historical reasons, the property value [Joining_Group = Hamza_On_Heh_Goal]
# is anachronistically named. It used to apply to both of the following characters
# in earlier versions of the standard:
#
#     U+06C2 ARABIC LETTER HEH GOAL WITH HAMZA ABOVE
#     U+06C3 ARABIC LETTER TEH MARBUTA GOAL
#
# However, it currently applies only to U+06C3, and *not* to U+06C2.
# To avoid destabilizing existing Joining_Group property aliases, the
# value Hamza_On_Heh_Goal has not been changed, despite the fact that it
# no longer applies to Hamza On Heh Goal, but only to Teh Marbuta Goal.
#
# Note: Code points that are not explicitly listed in this file are
# either of joining type T or U:
#
# - Those not explicitly listed that are of General Category Mn, Me, or Cf
#   have joining type T.
# - All others not explicitly listed have type U.
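
The default rule for unlisted code points might be applied as follows. This is a sketch under assumptions: the function names and the `listed` mapping are hypothetical, and Python's `unicodedata` module reports General Category values from whatever Unicode version the interpreter ships with, which is not necessarily the 5.0.0 data this file belongs to.

```python
import unicodedata
from typing import Mapping


def default_joining_type(code_point: int) -> str:
    """Joining type for a code point that is NOT listed in this file."""
    category = unicodedata.category(chr(code_point))
    # Mn, Me and Cf default to Transparent; everything else is Non_Joining.
    return "T" if category in {"Mn", "Me", "Cf"} else "U"


def joining_type(code_point: int, listed: Mapping[int, str]) -> str:
    """Explicit entries take precedence; the default applies otherwise."""
    if code_point in listed:
        return listed[code_point]
    return default_joining_type(code_point)


# Two entries copied from this file; both characters are General Category Cf.
listed = {0x200D: "C", 0x200C: "U"}

print(joining_type(0x064B, listed))  # T  (ARABIC FATHATAN, Mn, not listed)
print(joining_type(0x0041, listed))  # U  (LATIN CAPITAL LETTER A, not listed)
print(joining_type(0x200C, listed))  # U  (listed explicitly, overrides the Cf default)
```

Checking the explicit table first matters: ZERO WIDTH NON-JOINER is of category Cf and would otherwise be classified as T.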
#
# For an explicit listing of characters of joining type T, see
# the derived property file DerivedJoiningType.txt.
#
# There are currently no characters of type L defined in Unicode.
#
# #############################################################

# Unicode; Schematic Name; Joining Type; Joining Group
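
A minimal reader for the four-field record layout described in the header could look like this. The names `ShapingEntry` and `parse_arabic_shaping` are hypothetical, and the sketch assumes that everything from a `#` to the end of a line is a comment and that blank lines carry no data.

```python
from typing import Iterator, NamedTuple


class ShapingEntry(NamedTuple):
    code_point: int     # field 0, parsed from hexadecimal
    name: str           # field 1, schematic name
    joining_type: str   # field 2, one of R, L, D, C, U, T
    joining_group: str  # field 3, schematic joining group


def parse_arabic_shaping(text: str) -> Iterator[ShapingEntry]:
    """Yield one entry per data line, skipping comments and blank lines."""
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        if len(fields) != 4:
            raise ValueError(f"expected 4 fields, got {len(fields)}: {raw!r}")
        code, name, jtype, group = fields
        yield ShapingEntry(int(code, 16), name, jtype, group)


# Three records copied from the data below, preceded by the column header.
SAMPLE = """\
# Unicode; Schematic Name; Joining Type; Joining Group

0627; ALEF; R; ALEF
0628; BEH; D; BEH
0640; TATWEEL; C; No_Joining_Group
"""

entries = list(parse_arabic_shaping(SAMPLE))
for entry in entries:
    print(f"U+{entry.code_point:04X} {entry.joining_type} {entry.joining_group}")
```

Raising on a malformed line, rather than skipping it, makes a damaged or truncated copy of the file fail loudly instead of silently losing records.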

# Arabic characters

0600; ARABIC NUMBER SIGN; U; No_Joining_Group
0601; ARABIC SIGN SANAH; U; No_Joining_Group
0602; ARABIC FOOTNOTE MARKER; U; No_Joining_Group
0603; ARABIC SIGN SAFHA; U; No_Joining_Group
060B; AFGHANI SIGN; U; No_Joining_Group
0621; HAMZA; U; No_Joining_Group
0622; MADDA ON ALEF; R; ALEF
0623; HAMZA ON ALEF; R; ALEF
0624; HAMZA ON WAW; R; WAW
0625; HAMZA UNDER ALEF; R; ALEF
0626; HAMZA ON YEH; D; YEH
0627; ALEF; R; ALEF
0628; BEH; D; BEH
0629; TEH MARBUTA; R; TEH MARBUTA
062A; TEH; D; BEH
062B; THEH; D; BEH
062C; JEEM; D; HAH
062D; HAH; D; HAH
062E; KHAH; D; HAH
062F; DAL; R; DAL
0630; THAL; R; DAL
0631; REH; R; REH
0632; ZAIN; R; REH
0633; SEEN; D; SEEN
0634; SHEEN; D; SEEN
0635; SAD; D; SAD
0636; DAD; D; SAD
0637; TAH; D; TAH
0638; ZAH; D; TAH
0639; AIN; D; AIN
063A; GHAIN; D; AIN
0640; TATWEEL; C; No_Joining_Group
0641; FEH; D; FEH
0642; QAF; D; QAF
0643; KAF; D; KAF
0644; LAM; D; LAM
0645; MEEM; D; MEEM
0646; NOON; D; NOON
0647; HEH; D; HEH
0648; WAW; R; WAW
0649; ALEF MAKSURA; D; YEH
064A; YEH; D; YEH
066E; DOTLESS BEH; D; BEH
066F; DOTLESS QAF; D; QAF
0671; HAMZAT WASL ON ALEF; R; ALEF
0672; WAVY HAMZA ON ALEF; R; ALEF
0673; WAVY HAMZA UNDER ALEF; R; ALEF
0674; HIGH HAMZA; U; No_Joining_Group
0675; HIGH HAMZA ALEF; R; ALEF
0676; HIGH HAMZA WAW; R; WAW
0677; HIGH HAMZA WAW WITH DAMMA; R; WAW
0678; HIGH HAMZA YEH; D; YEH
0679; TEH WITH SMALL TAH; D; BEH
067A; TEH WITH 2 DOTS VERTICAL ABOVE; D; BEH
067B; BEH WITH 2 DOTS VERTICAL BELOW; D; BEH
067C; TEH WITH RING; D; BEH
067D; TEH WITH 3 DOTS ABOVE DOWNWARD; D; BEH
067E; TEH WITH 3 DOTS BELOW; D; BEH
067F; TEH WITH 4 DOTS ABOVE; D; BEH
0680; BEH WITH 4 DOTS BELOW; D; BEH
0681; HAMZA ON HAH; D; HAH
0682; HAH WITH 2 DOTS VERTICAL ABOVE; D; HAH
0683; HAH WITH MIDDLE 2 DOTS; D; HAH
0684; HAH WITH MIDDLE 2 DOTS VERTICAL; D; HAH
0685; HAH WITH 3 DOTS ABOVE; D; HAH
0686; HAH WITH MIDDLE 3 DOTS DOWNWARD; D; HAH
0687; HAH WITH MIDDLE 4 DOTS; D; HAH
0688; DAL WITH SMALL TAH; R; DAL
0689; DAL WITH RING; R; DAL
068A; DAL WITH DOT BELOW; R; DAL
068B; DAL WITH DOT BELOW AND SMALL TAH; R; DAL
068C; DAL WITH 2 DOTS ABOVE; R; DAL
068D; DAL WITH 2 DOTS BELOW; R; DAL
068E; DAL WITH 3 DOTS ABOVE; R; DAL
068F; DAL WITH 3 DOTS ABOVE DOWNWARD; R; DAL
0690; DAL WITH 4 DOTS ABOVE; R; DAL
0691; REH WITH SMALL TAH; R; REH
0692; REH WITH SMALL V; R; REH
0693; REH WITH RING; R; REH
0694; REH WITH DOT BELOW; R; REH
0695; REH WITH SMALL V BELOW; R; REH
0696; REH WITH DOT BELOW AND DOT ABOVE; R; REH
0697; REH WITH 2 DOTS ABOVE; R; REH
0698; REH WITH 3 DOTS ABOVE; R; REH
0699; REH WITH 4 DOTS ABOVE; R; REH
069A; SEEN WITH DOT BELOW AND DOT ABOVE; D; SEEN
069B; SEEN WITH 3 DOTS BELOW; D; SEEN
069C; SEEN WITH 3 DOTS BELOW AND 3 DOTS ABOVE; D; SEEN
069D; SAD WITH 2 DOTS BELOW; D; SAD
069E; SAD WITH 3 DOTS ABOVE; D; SAD
069F; TAH WITH 3 DOTS ABOVE; D; TAH
06A0; AIN WITH 3 DOTS ABOVE; D; AIN
06A1; DOTLESS FEH; D; FEH
06A2; FEH WITH DOT MOVED BELOW; D; FEH
06A3; FEH WITH DOT BELOW; D; FEH
06A4; FEH WITH 3 DOTS ABOVE; D; FEH
06A5; FEH WITH 3 DOTS BELOW; D; FEH
06A6; FEH WITH 4 DOTS ABOVE; D; FEH
06A7; QAF WITH DOT ABOVE; D; QAF
06A8; QAF WITH 3 DOTS ABOVE; D; QAF
06A9; KEHEH; D; GAF
06AA; SWASH KAF; D; SWASH KAF
06AB; KAF WITH RING; D; GAF
06AC; KAF WITH DOT ABOVE; D; KAF
06AD; KAF WITH 3 DOTS ABOVE; D; KAF
06AE; KAF WITH 3 DOTS BELOW; D; KAF
06AF; GAF; D; GAF
06B0; GAF WITH RING; D; GAF
06B1; GAF WITH 2 DOTS ABOVE; D; GAF
06B2; GAF WITH 2 DOTS BELOW; D; GAF
06B3; GAF WITH 2 DOTS VERTICAL BELOW; D; GAF
06B4; GAF WITH 3 DOTS ABOVE; D; GAF
06B5; LAM WITH SMALL V; D; LAM
06B6; LAM WITH DOT ABOVE; D; LAM
06B7; LAM WITH 3 DOTS ABOVE; D; LAM
06B8; LAM WITH 3 DOTS BELOW; D; LAM
06B9; NOON WITH DOT BELOW; D; NOON
06BA; DOTLESS NOON; D; NOON
06BB; DOTLESS NOON WITH SMALL TAH; D; NOON
06BC; NOON WITH RING; D; NOON
06BD; NOON WITH 3 DOTS ABOVE; D; NOON
06BE; KNOTTED HEH; D; KNOTTED HEH
06BF; HAH WITH MIDDLE 3 DOTS DOWNWARD AND DOT ABOVE; D; HAH
06C0; HAMZA ON HEH; R; TEH MARBUTA
06C1; HEH GOAL; D; HEH GOAL
06C2; HAMZA ON HEH GOAL; D; HEH GOAL
06C3; TEH MARBUTA GOAL; R; HAMZA ON HEH GOAL
06C4; WAW WITH RING; R; WAW
06C5; WAW WITH BAR; R; WAW
06C6; WAW WITH SMALL V; R; WAW
06C7; WAW WITH DAMMA; R; WAW
06C8; WAW WITH ALEF ABOVE; R; WAW
06C9; WAW WITH INVERTED SMALL V; R; WAW
06CA; WAW WITH 2 DOTS ABOVE; R; WAW
06CB; WAW WITH 3 DOTS ABOVE; R; WAW
06CC; DOTLESS YEH; D; YEH
06CD; YEH WITH TAIL; R; YEH WITH TAIL
06CE; YEH WITH SMALL V; D; YEH
06CF; WAW WITH DOT ABOVE; R; WAW
06D0; YEH WITH 2 DOTS VERTICAL BELOW; D; YEH
06D1; YEH WITH 3 DOTS BELOW; D; YEH
06D2; YEH BARREE; R; YEH BARREE
06D3; HAMZA ON YEH BARREE; R; YEH BARREE
06D5; AE; R; TEH MARBUTA
06DD; ARABIC END OF AYAH; U; No_Joining_Group
06EE; DAL WITH INVERTED V; R; DAL
06EF; REH WITH INVERTED V; R; REH
06FA; SEEN WITH DOT BELOW AND 3 DOTS ABOVE; D; SEEN
06FB; DAD WITH DOT BELOW; D; SAD
06FC; GHAIN WITH DOT BELOW; D; AIN
06FF; HEH WITH INVERTED V; D; KNOTTED HEH

# Syriac characters

0710; ALAPH; R; ALAPH
0712; BETH; D; BETH
0713; GAMAL; D; GAMAL
0714; GAMAL GARSHUNI; D; GAMAL
0715; DALATH; R; DALATH RISH
0716; DOTLESS DALATH RISH; R; DALATH RISH
0717; HE; R; HE
0718; WAW; R; SYRIAC WAW
0719; ZAIN; R; ZAIN
071A; HETH; D; HETH
071B; TETH; D; TETH
071C; TETH GARSHUNI; D; TETH
071D; YUDH; D; YUDH
071E; YUDH HE; R; YUDH HE
071F; KAPH; D; KAPH
0720; LAMADH; D; LAMADH
0721; MIM; D; MIM
0722; NUN; D; NUN
0723; SEMKATH; D; SEMKATH
0724; FINAL SEMKATH; D; FINAL SEMKATH
0725; E; D; E
0726; PE; D; PE
0727; REVERSED PE; D; REVERSED PE
0728; SADHE; R; SADHE
0729; QAPH; D; QAPH
072A; RISH; R; DALATH RISH
072B; SHIN; D; SHIN
072C; TAW; R; TAW
072D; PERSIAN BHETH; D; BETH
072E; PERSIAN GHAMAL; D; GAMAL
072F; PERSIAN DHALATH; R; DALATH RISH
074D; SOGDIAN ZHAIN; R; ZHAIN
074E; SOGDIAN KHAPH; D; KHAPH
074F; SOGDIAN FE; D; FE

# Arabic supplement characters

0750; BEH WITH 3 DOTS HORIZONTALLY BELOW; D; BEH
0751; BEH WITH DOT BELOW AND 3 DOTS ABOVE; D; BEH
0752; BEH WITH 3 DOTS POINTING UPWARDS BELOW; D; BEH
0753; BEH WITH 3 DOTS POINTING UPWARDS BELOW AND 2 DOTS ABOVE; D; BEH
0754; BEH WITH 2 DOTS BELOW AND DOT ABOVE; D; BEH
0755; BEH WITH INVERTED SMALL V BELOW; D; BEH
0756; BEH WITH SMALL V; D; BEH
0757; HAH WITH 2 DOTS ABOVE; D; HAH
0758; HAH WITH 3 DOTS POINTING UPWARDS BELOW; D; HAH
0759; DAL WITH 2 DOTS VERTICALLY BELOW AND SMALL TAH; R; DAL
075A; DAL WITH INVERTED SMALL V BELOW; R; DAL
075B; REH WITH STROKE; R; REH
075C; SEEN WITH 4 DOTS ABOVE; D; SEEN
075D; AIN WITH 2 DOTS ABOVE; D; AIN
075E; AIN WITH 3 DOTS POINTING DOWNWARDS ABOVE; D; AIN
075F; AIN WITH 2 DOTS VERTICALLY ABOVE; D; AIN
0760; FEH WITH 2 DOTS BELOW; D; FEH
0761; FEH WITH 3 DOTS POINTING UPWARDS BELOW; D; FEH
0762; KEHEH WITH DOT ABOVE; D; GAF
0763; KEHEH WITH 3 DOTS ABOVE; D; GAF
0764; KEHEH WITH 3 DOTS POINTING UPWARDS BELOW; D; GAF
0765; MEEM WITH DOT ABOVE; D; MEEM
0766; MEEM WITH DOT BELOW; D; MEEM
0767; NOON WITH 2 DOTS BELOW; D; NOON
0768; NOON WITH SMALL TAH; D; NOON
0769; NOON WITH SMALL V; D; NOON
076A; LAM WITH BAR; D; LAM
076B; REH WITH 2 DOTS VERTICALLY ABOVE; R; REH
076C; REH WITH HAMZA ABOVE; R; REH
076D; SEEN WITH 2 DOTS VERTICALLY ABOVE; D; SEEN

# N'Ko Characters

07CA; NKO A; D; No_Joining_Group
07CB; NKO EE; D; No_Joining_Group
07CC; NKO I; D; No_Joining_Group
07CD; NKO E; D; No_Joining_Group
07CE; NKO U; D; No_Joining_Group
07CF; NKO OO; D; No_Joining_Group
07D0; NKO O; D; No_Joining_Group
07D1; NKO DAGBASINNA; D; No_Joining_Group
07D2; NKO N; D; No_Joining_Group
07D3; NKO BA; D; No_Joining_Group
07D4; NKO PA; D; No_Joining_Group
07D5; NKO TA; D; No_Joining_Group
07D6; NKO JA; D; No_Joining_Group
07D7; NKO CHA; D; No_Joining_Group
07D8; NKO DA; D; No_Joining_Group
07D9; NKO RA; D; No_Joining_Group
07DA; NKO RRA; D; No_Joining_Group
07DB; NKO SA; D; No_Joining_Group
07DC; NKO GBA; D; No_Joining_Group
07DD; NKO FA; D; No_Joining_Group
07DE; NKO KA; D; No_Joining_Group
07DF; NKO LA; D; No_Joining_Group
07E0; NKO NA WOLOSO; D; No_Joining_Group
07E1; NKO MA; D; No_Joining_Group
07E2; NKO NYA; D; No_Joining_Group
07E3; NKO NA; D; No_Joining_Group
07E4; NKO HA; D; No_Joining_Group
07E5; NKO WA; D; No_Joining_Group
07E6; NKO YA; D; No_Joining_Group
07E7; NKO NYA WOLOSO; D; No_Joining_Group
07E8; NKO JONA JA; D; No_Joining_Group
07E9; NKO JONA CHA; D; No_Joining_Group
07EA; NKO JONA RA; D; No_Joining_Group
07FA; NKO LAJANYALAN; C; No_Joining_Group

# Other

200D; ZERO WIDTH JOINER; C; No_Joining_Group
200C; ZERO WIDTH NON-JOINER; U; No_Joining_Group

# EOF