This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Synchronise Math::BigInt{,::FastCalc} with CPAN
[perl5.git] / cpan / Unicode-Collate / Changes
CommitLineData
ae6aa562
JH
1Revision history for Perl module Unicode::Collate.
2
028d3bfa
CBW
30.65 Wed Nov 3 13:10:20 2010
4 - U::C::Locale newly supports locale: zh and its some variants.
5 (zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke)
6 - added Unicode::Collate::CJK::Big5 for zh__big5han.
7 - added Unicode::Collate::CJK::GB2312 for zh__gb2312han.
8 - added Unicode::Collate::CJK::Pinyin for zh__pinyin.
9 - added Unicode::Collate::CJK::Stroke for zh__stroke.
10
539ce3d8
CBW
110.64 Sun Oct 31 14:17:29 2010
12 - U::C::Locale newly supports locale: ja.
13 - added Unicode::Collate::CJK::JISX0208 for ja.
14 - a subroutine specified in 'overrideCJK' or 'overrideHangul' is allowed
15 to return an integer or undef value.
16 - fix: Ideographs Ext.A are assigned since UCA Version 9 (Unicode 3.1).
17 This fix should affect only the case of (UCA_Version => 8).
18
aa7758f7
CBW
190.63 Sun Oct 10 22:13:21 2010
20 - supported suppress contractions (see 'suppress' in POD).
028d3bfa 21 - internal for 'hangul_terminator' in getSortKey().
aa7758f7
CBW
22 - U::C::Locale newly supports locales: be, bg, kk, mk, ru, sr.
23 - added tailoring with U+0340 or U+0341 instead of U+0300 or U+0301.
24 (affected locales: hr, is, pl, se, to, wo)
25
6709de88
CBW
260.62 Wed Oct 6 21:35:54 2010
27 - U::C::Locale newly supports locales: ar, hu, hy, se, to, uk.
28 - Vietnamese (vi): added tailoring for U+0340 and U+0341.
29
c02ee425
CBW
300.61 Sat Oct 2 11:41:29 2010
31 - U::C::Locale newly supports locales: hr, ig, sq.
32 - precomposites of e-dot-below, o-dot-below, o-tilde are tailored as well.
33 (affected locales: et, yo)
34 - Vietnamese (vi): added contractions for non-blocked decompositions
aa7758f7 35 * base + dot-below + mark such as a\x{323}\x{306}, \x{1EA1}\x{306} etc.
6709de88 36 * base + tone + horn such as o\x{309}\x{31B}, \x{1ECF}\x{31B} etc.
c02ee425 37
1393fe00
CBW
380.60 Thu Sep 23 21:37:36 2010
39 - bug fix: index() [and its friends including gmatch()] didn't remove
40 ignorable characters in the substring correctly.
41 Thanks for the bug report:
aa7758f7 42 http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2010-09/msg00014.html
1393fe00
CBW
43
44 - U::C::Locale newly supports locales: de__phonebook, nso, om, tn, vi.
45 - precomposites of a-breve, a-circ, e-circ, o-circ are tailored as well.
46 (affected locales: ro, sk, sv)
47
f1a7422f
CBW
480.59 Sun Sep 5 17:03:52 2010
49 - U::C::Locale newly supports locales: az, fil, ha, lt, mt, tr, wo, yo.
50 - precomposites of a-uml, o-uml, and u-uml are tailored as well.
51 (affected locales: da, et, fi, fo, is, kl, nb, nn, sk, sv)
52
6484f676
CBW
530.58 Sun Aug 29 19:56:50 2010
54 - U::C::Locale newly supports locales: af, cy, da, fo, haw, is, kl, sw.
55
64dc7822 560.57 Sun Aug 22 22:39:58 2010
6484f676 57 - U::C::Locale newly supports locales: ca, et, fi, lv, sk, sl.
64dc7822 58
456a1446
CBW
590.56 Sun Aug 8 20:24:03 2010
60 - Unicode::Collate::Locale newly supports locales: eo, nb, ro, sv.
61 ! renamed t/locale_{xy}.t to t/loc_{xy}.t (for safer 8.3 names)
62
00e00351 630.55 Sun Aug 1 21:21:23 2010
aa7758f7
CBW
64 - incorporated Unicode::Collate::Locale with some changes. see:
65 http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2004-03/msg00030.html
456a1446 66 - supported locales: cs, es, es__traditional, fr, nn, pl.
00e00351
CBW
67 ! added t/locale*.t that uses DUCET.
68
690.54 Sun Jul 25 21:37:04 2010
70 - Now UCA Revision 20 (based on Unicode 5.2.0).
71 - DUCET is also updated (for Unicode 5.2.0) as Collate/allkeys.txt,
72 which *is required* to test this module.
73 ! Please notice that allkeys.txt will be overwritten if you have had
74 other allkeys.txt already.
75 - U+9FC4..U+9FCB and U+2A700..U+2B734 are new CJK Unified Ideographs.
76 - Many hangul jamo are assigned (affecting hangul_terminator).
77
78 ! DUCET will be compiled when XS is used. Explicit saying
79 <table => 'allkeys.txt'> (or using another table) will prevent
1393fe00 80 this module from using the compiled DUCET.
00e00351
CBW
81
82 ! added t/default.t that uses DUCET.
83
74b94a79
CBW
840.53 Sun Feb 14 20:46:27 2010
85 - Now UCA Revision 18 (based on Unicode 5.1.0).
00e00351 86 - DUCET is also updated (for Unicode 5.1.0) as Collate/allkeys.txt,
74b94a79
CBW
87 which is not required to test this module.
88 ! Please notice that allkeys.txt will be overwritten if you have had
89 other allkeys.txt already.
90 - U+9FBC..U+9FC3 are new CJK Unified Ideographs.
91
6d24ed10
SP
920.52 Thu Oct 13 21:51:09 2005
93 - The Unicode::Collate->new method does not destroy user's $_ any longer.
94 (thanks to Jon Warbrick for bug report)
95
0d50d293
RGS
960.51 Sun May 29 20:21:19 2005
97 - Added the latest DUCET (for Unicode 4.1.0) as Collate/allkeys.txt,
98 which is not required to test this module.
74b94a79 99 ! Please notice that allkeys.txt will be overwritten if you have had
0d50d293
RGS
100 other allkeys.txt already.
101 - Added INSTALL section in POD.
102
3756e7ca
RGS
1030.50 Sun May 8 20:26:39 2005
104 - Now UCA Revision 14 (based on Unicode 4.1.0).
105 - Some tests are modified.
106 - Added cjkrange.t, ignor.t, override.t.
107 - Added META.yml.
108
1090.40 Sat Apr 24 06:54:40 2004
110 - Now a table file is searched in @INC.
111
abd1ec54
NC
1120.33 Sat Dec 13 14:07:27 2003
113 - documentation improvement: in "entry", "overrideHangul", etc.
114
1150.32 Wed Dec 3 23:38:18 2003
116 - A matching part from index(), match() etc. will include illegal
117 code points (as well as ignorable characters) following a grapheme.
118 - Contraction with illegal code point will be invalid.
3756e7ca
RGS
119 - Added view.t.
120 - Added some tests in illegal.t.
abd1ec54
NC
121 - Some tests are separated from test.t into altern.t and rearrang.t.
122 - modified XSUB internals.
123
10d7ec48
NC
1240.31 Sun Nov 16 15:40:15 2003
125 - Illegal code points (surrogate and noncharacter; they are definitely
126 ignorable) will be distinguished from NULL ("\0");
127 but porting is not successful in the case of ((Pure Perl) and
128 (Perl 5.7.3 or before)). If perl 5.6.X is used, XSUB may help it
129 in place of broken CORE::unpack('U*') in older perl.
130 - added illegal.t and illegalp.t.
abd1ec54
NC
131 - added XSUB (EXPERIMENTAL!) where some functions are implemented
132 in XSUB. Pure Perl is also supported.
10d7ec48 133
91ae00cb
NC
1340.30 Mon Oct 13 21:26:37 2003
135 - fix: Completely ignorable in table should be able to be overrided
136 by non-ignorable in entry.
137 - fix: Maximum length for contraction must not be shortened
10d7ec48 138 by a shorter contraction following in table and/or entry.
91ae00cb
NC
139 - added normal.t.
140 - some doc fixes
141
1420.29 Mon Oct 13 12:18:23 2003
abd1ec54 143 - now UCA Version 11 (but no functionality is different from Version 9).
91ae00cb
NC
144 - supported hangul_terminator.
145 - fix: Base_Unicode_Version falsely returns Perl's Unicode version.
146 C4 in UTS #10 requires UTS's Unicode version.
147 - For variable weighting, 'variable' is recommended
148 and 'alternate' is deprecated.
149 - added version() method.
150 - added hangtype.t, trailwt.t, variable.t, and version.t.
151
06c8fc8f
RGS
1520.28 Sat Sep 06 20:16:01 2003
153 - Fixed another inconsistency under (normalization => undef):
154 Non-contiguous contraction is always neglected.
155 - Fixed: according to S2.1 in UTS #10, a blocked combining character
156 should not be contracted. One test in test.t was wrong, then removed.
157 - Added contract.t.
158 - (normalization => "prenormalized") is able to be used.
159
1d2654e1
JH
1600.27 Sun Aug 31 22:23:17 2003
161 some improvements:
06c8fc8f 162 - The maximum length of contracted CE was not checked (v0.22 to v0.26).
1d2654e1
JH
163 Collation of a large string including a first letter of a contraction
164 that is not a part of that contraction (say, 'c' of 'ca'
165 where 'ch' is defined) was too slow, inefficient.
91ae00cb
NC
166 - A form name for 'normalization', no longer restricted to
167 /^(?:NF)?K?[CD]\z/, will be allowed as long as
168 Unicode::Normalize::normalize() accepts it, since Unicode::Normalize
169 or UAX #15 may be changed/enhanced in future.
1d2654e1
JH
170 - When Hangul syllables are decomposed under <normalization => undef>,
171 contraction among jamo (LV, VT, LVT) derived from the same
172 Hangul syllable is allowed. Added hangul.t.
173
4c843366
JH
1740.26 Sun Aug 03 22:23:17 2003
175 - fix: an expansion in which a CE is level 3 ignorable and others are not
1d2654e1 176 was wrongly made level 3 ignorable as a whole entry.
4c843366
JH
177 (In DUCET, some precomposites in Musical Symbols are so)
178
ae6aa562
JH
1790.25 Mon Jun 06 23:20:17 2003
180 - fix Makefile.PL.
181 - internal tweak (again): pack_U() and unpack_U().
45394607 182
9f1f04a1
RGS
1830.24 Thu Apr 02 23:12:54 2003
184 - internal tweak for (?un)pack 'U'.
185
4d36a948
ST
1860.23 Wed Sep 04 19:25:20 2002
187 - fix: scalar match() no longer returns an lvalue substr ref.
188 - fix: "Ignorable after variable" should be made level 3 ignorable
189 even if alternate => 'blanked'.
190 - Now a grapheme may contain trailing level 2, level 3,
191 and completely ignorable characters.
192
1930.22 Mon Sep 02 23:15:14 2002
194 - New File: index.t.
195 (The new test.t excludes tests for index.)
196 - tweak on index(). POSITION is supported.
197 - add match, gmatch, subst, gsubst methods.
198 - fix: ignorable after variable in 'shift'-variable weight.
199
caffd4cf
ST
2000.21 Sat Aug 03 10:24:00 2002
201 - upgrade keys.txt and t/test.t for UCA Version 9.
202
0116f5dc
JH
2030.20 Fri Jul 26 02:15:25 2002
204 - now UCA Version 9.
205 - U+FDD0..U+FDEF are new non-characters.
206 - fix: whitespace characters before @backwards etc. in a table file.
207 - now values for 'alternate', 'backwards', etc.,
208 which are explicitly specified via new(),
209 are preferred to those specified in a table file.
210
327745dc
ST
2110.12 Sun May 05 09:43:10 2002
212 - add new methods, ->UCA_Version and ->Base_Unicode_Version.
213 - test fix: removed the needless requirement of Unicode::Normalize.
214 [reported by David Hand]
215
809c7673
ST
2160.11 Fri May 03 02:28:10 2002
217 - fix: now derived collation elements can be used for Hangul Jamo
218 when their weights are not defined.
327745dc 219 [reported by Andreas J. Koenig]
809c7673
ST
220 - fix: rearrangements had not worked.
221 - mentioned pleblem on index() in BUGS.
222 - more documents, more tests.
223 - tag names for 'alternate' are case-insensitive (i.e. 'SHIFTed' etc.).
224 - The <undef> value for the keys "overrideCJK", "overrideHangul",
225 "rearrange" has a special behavior (different from default).
226
905aa9f0
ST
2270.10 Tue Dec 11 23:26:42 2001
228 - now you are allowed to use no table file.
229 - fix: fetching CE with two or more combining characters.
230
5398038e 2310.09 Sun Nov 11 17:02:40:18 2001
232 - add the following methods: eq, ne, lt, le, gt, le.
233 - relies on &Unicode::Normalize::getCombinClass()
234 in place of %Unicode::Normalize::Combin
235 (the hash is not defined in the XS version of Unicode::Normalize).
236 then you should install Unicode::Normalize 0.10 or later.
237 - now independent of Lingua::KO::Hangul::Util
238 (this module does decomposition of Hangul syllables for itself)
239
d16e9e3d
JH
2400.08 Mon Aug 20 22:40:18 2001
241 - add the index method.
242
45394607
JH
2430.07 Thu Aug 16 23:42:02 2001
244 - rename the module name to Unicode::Collate.
245
2460.06 Thu Aug 16 23:18:36 2001
247 - add description of the getSortKey method.
248
2490.05 Mon Aug 13 22:23:11 2001
250 - bug fix: on the things of 4.2.1, UTR #10
251 - getSortKey returns a string, but not an arrayref.
252
2530.04 Mon Aug 13 22:23:11 2001
254 - some bugs are fixed.
255 - some tailoring parameters are added.
256
2570.03 Mon Aug 06 06:26:35 2001
258 - modify README
259
2600.02 Sun Aug 05 20:20:01 2001
261 - some fix
262
2630.01 Sun Jul 29 16:16:15 2001
264 - original version; created by h2xs 1.21
265 with options -A -X -n Sort::UCA