This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Update ExtUtils::CBuilder in Maintainers.pl
[perl5.git] / cpan / Unicode-Collate / Changes
CommitLineData
ae6aa562
JH
1Revision history for Perl module Unicode::Collate.
2
68adb2b0
CBW
30.68 Tue Nov 23 20:17:22 2010
4 - doc: clarified about (backwards => [ ]) and (backwards => undef).
5 - separated t/backwds.t from t/test.t.
6 - added cjk_b5.t, cjk_gb.t, cjk_ja.t, cjk_ko.t, cjk_py.t, cjk_st.t in t
7 for CJK/*.pm without Locale.pm.
8
b5d9a953
CBW
90.67 Sun Nov 14 11:38:59 2010
10 - supported UCA_Version 22 for Unicode 6.0.0.
11 * 2B740..2B81D are new CJK unified ideographs.
12 * noncharacters (e.g. U+FFFF) should be overridable, not be ignored.
13 ! DUCET is NOT updated, as no maint perl supports Unicode 6.0.0.
14 Thus the default UCA_Version is still 20.
15 - added t/nonchar.t.
16 - improved discontiguous contractions of 3 or more characters.
17 (e.g. 0FB2 0F71 0F80 and 0FB3 0F71 0F80)
18 - auxiliary: now 'mklocale' also copes with Korean.pm according to DUCET.
19
584e761d
CBW
200.66 Sun Nov 7 10:47:30 2010
21 - U::C::Locale newly supports locale: ko.
22 - added Unicode::Collate::CJK::Korean for ko.
23 - added t/loc_ko.t.
24 - 12 compat. ideographs (e.g. U+FA0E) are treated as unified ideographs.
25 (though DUCET also does it, now Unicode::Collate does it without DUCET.)
26 - added t/compatui.t.
5a449a8e 27 ! Ideographs Ext.B (U+20000..U+2A6D6) can be overrided with UCA_Version 8.
584e761d
CBW
28 This is a long-standing behavior from Unicode::Collate 0.11 to 0.63.
29 A wrong fix at 0.64 should be abandoned.
30
028d3bfa
CBW
310.65 Wed Nov 3 13:10:20 2010
32 - U::C::Locale newly supports locale: zh and its some variants.
584e761d 33 (zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke)
028d3bfa
CBW
34 - added Unicode::Collate::CJK::Big5 for zh__big5han.
35 - added Unicode::Collate::CJK::GB2312 for zh__gb2312han.
36 - added Unicode::Collate::CJK::Pinyin for zh__pinyin.
37 - added Unicode::Collate::CJK::Stroke for zh__stroke.
584e761d 38 - added loc_zh.t, loc_zhb5.t, loc_zhgb.t, loc_zhpy.t, loc_zhst.t in t.
028d3bfa 39
539ce3d8
CBW
400.64 Sun Oct 31 14:17:29 2010
41 - U::C::Locale newly supports locale: ja.
42 - added Unicode::Collate::CJK::JISX0208 for ja.
584e761d 43 - added loc_ja.t, loc_jait.t, loc_japr.t in t.
539ce3d8
CBW
44 - a subroutine specified in 'overrideCJK' or 'overrideHangul' is allowed
45 to return an integer or undef value.
584e761d
CBW
46 - fix: Ideographs Ext.B (U+20000..U+2A6D6) are assigned in Unicode 3.1,
47 then 'overrideCJK' should not override them with UCA_Version 8.
48 !! sorry, this fix is based on a wrong idea. reverted at 0.66. !!
49 - separated t/overcjk0.t and t/overcjk1.t from t/override.t.
539ce3d8 50
aa7758f7
CBW
510.63 Sun Oct 10 22:13:21 2010
52 - supported suppress contractions (see 'suppress' in POD).
028d3bfa 53 - internal for 'hangul_terminator' in getSortKey().
aa7758f7 54 - U::C::Locale newly supports locales: be, bg, kk, mk, ru, sr.
584e761d
CBW
55 - added loc_be.t, loc_bg.t, loc_cyrl.t, loc_kk.t, loc_mk.t, loc_ru.t,
56 loc_sr.t in t.
aa7758f7
CBW
57 - added tailoring with U+0340 or U+0341 instead of U+0300 or U+0301.
58 (affected locales: hr, is, pl, se, to, wo)
59
6709de88
CBW
600.62 Wed Oct 6 21:35:54 2010
61 - U::C::Locale newly supports locales: ar, hu, hy, se, to, uk.
584e761d 62 - added loc_ar.t, loc_hu.t, loc_hy.t, loc_se.t, loc_to.t, loc_uk.t in t.
6709de88
CBW
63 - Vietnamese (vi): added tailoring for U+0340 and U+0341.
64
c02ee425
CBW
650.61 Sat Oct 2 11:41:29 2010
66 - U::C::Locale newly supports locales: hr, ig, sq.
584e761d 67 - added loc_hr.t, loc_ig.t, loc_sq.t in t.
c02ee425
CBW
68 - precomposites of e-dot-below, o-dot-below, o-tilde are tailored as well.
69 (affected locales: et, yo)
70 - Vietnamese (vi): added contractions for non-blocked decompositions
aa7758f7 71 * base + dot-below + mark such as a\x{323}\x{306}, \x{1EA1}\x{306} etc.
6709de88 72 * base + tone + horn such as o\x{309}\x{31B}, \x{1ECF}\x{31B} etc.
c02ee425 73
1393fe00
CBW
740.60 Thu Sep 23 21:37:36 2010
75 - bug fix: index() [and its friends including gmatch()] didn't remove
76 ignorable characters in the substring correctly.
77 Thanks for the bug report:
aa7758f7 78 http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2010-09/msg00014.html
1393fe00
CBW
79
80 - U::C::Locale newly supports locales: de__phonebook, nso, om, tn, vi.
584e761d 81 - added loc_de.t, loc_deph.t, loc_nso.t, loc_om.t, loc_tn.t, loc_vi.t in t.
1393fe00
CBW
82 - precomposites of a-breve, a-circ, e-circ, o-circ are tailored as well.
83 (affected locales: ro, sk, sv)
84
f1a7422f
CBW
850.59 Sun Sep 5 17:03:52 2010
86 - U::C::Locale newly supports locales: az, fil, ha, lt, mt, tr, wo, yo.
584e761d
CBW
87 - added loc_az.t, loc_fil.t, loc_ha.t, loc_lt.t, loc_mt.t, loc_tr.t,
88 loc_wo.t, loc_yo.t in t.
f1a7422f
CBW
89 - precomposites of a-uml, o-uml, and u-uml are tailored as well.
90 (affected locales: da, et, fi, fo, is, kl, nb, nn, sk, sv)
91
6484f676
CBW
920.58 Sun Aug 29 19:56:50 2010
93 - U::C::Locale newly supports locales: af, cy, da, fo, haw, is, kl, sw.
584e761d
CBW
94 - added loc_af.t, loc_cy.t, loc_da.t, loc_fo.t, loc_haw.t, loc_is.t,
95 loc_kl.t, loc_sw.t in t.
6484f676 96
64dc7822 970.57 Sun Aug 22 22:39:58 2010
6484f676 98 - U::C::Locale newly supports locales: ca, et, fi, lv, sk, sl.
584e761d 99 - added loc_ca.t, loc_et.t, loc_fi.t, loc_lv.t, loc_sk.t, loc_sl.t in t.
64dc7822 100
456a1446
CBW
1010.56 Sun Aug 8 20:24:03 2010
102 - Unicode::Collate::Locale newly supports locales: eo, nb, ro, sv.
584e761d 103 - added loc_eo.t, loc_es.t, loc_estr.t, loc_nb.t, loc_ro.t, loc_sv.t in t.
456a1446 104 ! renamed t/locale_{xy}.t to t/loc_{xy}.t (for safer 8.3 names)
584e761d 105 (loc_cs.t, loc_fr.t, loc_nn.t, loc_pl.t, loc_test.t)
456a1446 106
00e00351 1070.55 Sun Aug 1 21:21:23 2010
aa7758f7
CBW
108 - incorporated Unicode::Collate::Locale with some changes. see:
109 http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2004-03/msg00030.html
456a1446 110 - supported locales: cs, es, es__traditional, fr, nn, pl.
00e00351 111 ! added t/locale*.t that uses DUCET.
584e761d 112 (locale_cs.t, locale_fr.t, locale_nn.t, locale_pl.t, locale_test.t)
b5d9a953 113 - data/*.txt and mklocale for preparation of Locale/*.pl from DUCET.
00e00351
CBW
114
1150.54 Sun Jul 25 21:37:04 2010
116 - Now UCA Revision 20 (based on Unicode 5.2.0).
117 - DUCET is also updated (for Unicode 5.2.0) as Collate/allkeys.txt,
118 which *is required* to test this module.
119 ! Please notice that allkeys.txt will be overwritten if you have had
120 other allkeys.txt already.
b5d9a953 121 - U+9FC4..U+9FCB and U+2A700..U+2B734 are new CJK unified ideographs.
00e00351
CBW
122 - Many hangul jamo are assigned (affecting hangul_terminator).
123
124 ! DUCET will be compiled when XS is used. Explicit saying
125 <table => 'allkeys.txt'> (or using another table) will prevent
1393fe00 126 this module from using the compiled DUCET.
00e00351
CBW
127
128 ! added t/default.t that uses DUCET.
129
74b94a79
CBW
1300.53 Sun Feb 14 20:46:27 2010
131 - Now UCA Revision 18 (based on Unicode 5.1.0).
00e00351 132 - DUCET is also updated (for Unicode 5.1.0) as Collate/allkeys.txt,
74b94a79
CBW
133 which is not required to test this module.
134 ! Please notice that allkeys.txt will be overwritten if you have had
135 other allkeys.txt already.
b5d9a953 136 - U+9FBC..U+9FC3 are new CJK unified ideographs.
74b94a79 137
6d24ed10
SP
1380.52 Thu Oct 13 21:51:09 2005
139 - The Unicode::Collate->new method does not destroy user's $_ any longer.
140 (thanks to Jon Warbrick for bug report)
141
0d50d293
RGS
1420.51 Sun May 29 20:21:19 2005
143 - Added the latest DUCET (for Unicode 4.1.0) as Collate/allkeys.txt,
144 which is not required to test this module.
74b94a79 145 ! Please notice that allkeys.txt will be overwritten if you have had
0d50d293
RGS
146 other allkeys.txt already.
147 - Added INSTALL section in POD.
148
3756e7ca
RGS
1490.50 Sun May 8 20:26:39 2005
150 - Now UCA Revision 14 (based on Unicode 4.1.0).
151 - Some tests are modified.
584e761d 152 - Added cjkrange.t, ignor.t, override.t in t.
3756e7ca
RGS
153 - Added META.yml.
154
1550.40 Sat Apr 24 06:54:40 2004
156 - Now a table file is searched in @INC.
157
abd1ec54
NC
1580.33 Sat Dec 13 14:07:27 2003
159 - documentation improvement: in "entry", "overrideHangul", etc.
160
1610.32 Wed Dec 3 23:38:18 2003
162 - A matching part from index(), match() etc. will include illegal
163 code points (as well as ignorable characters) following a grapheme.
164 - Contraction with illegal code point will be invalid.
584e761d
CBW
165 - Added t/view.t.
166 - Added some tests in t/illegal.t.
167 - Separated t/altern.t and t/rearrang.t from t/test.t.
abd1ec54
NC
168 - modified XSUB internals.
169
10d7ec48
NC
1700.31 Sun Nov 16 15:40:15 2003
171 - Illegal code points (surrogate and noncharacter; they are definitely
172 ignorable) will be distinguished from NULL ("\0");
173 but porting is not successful in the case of ((Pure Perl) and
174 (Perl 5.7.3 or before)). If perl 5.6.X is used, XSUB may help it
175 in place of broken CORE::unpack('U*') in older perl.
584e761d 176 - added illegal.t and illegalp.t in t.
5a449a8e
CBW
177 - added XSUB (EXPERIMENTAL!) where some functions are implemented
178 in XSUB. Pure Perl is also supported.
10d7ec48 179
91ae00cb 1800.30 Mon Oct 13 21:26:37 2003
5a449a8e 181 - fix: Completely ignorable in table should be able to be overrided
91ae00cb
NC
182 by non-ignorable in entry.
183 - fix: Maximum length for contraction must not be shortened
10d7ec48 184 by a shorter contraction following in table and/or entry.
584e761d 185 - added t/normal.t.
91ae00cb
NC
186 - some doc fixes
187
1880.29 Mon Oct 13 12:18:23 2003
abd1ec54 189 - now UCA Version 11 (but no functionality is different from Version 9).
91ae00cb
NC
190 - supported hangul_terminator.
191 - fix: Base_Unicode_Version falsely returns Perl's Unicode version.
192 C4 in UTS #10 requires UTS's Unicode version.
193 - For variable weighting, 'variable' is recommended
194 and 'alternate' is deprecated.
195 - added version() method.
584e761d 196 - added hangtype.t, trailwt.t, variable.t, and version.t in t.
91ae00cb 197
06c8fc8f
RGS
1980.28 Sat Sep 06 20:16:01 2003
199 - Fixed another inconsistency under (normalization => undef):
200 Non-contiguous contraction is always neglected.
201 - Fixed: according to S2.1 in UTS #10, a blocked combining character
584e761d
CBW
202 should not be contracted. One test in t/test.t was wrong, then removed.
203 - Added t/contract.t.
06c8fc8f
RGS
204 - (normalization => "prenormalized") is able to be used.
205
1d2654e1
JH
2060.27 Sun Aug 31 22:23:17 2003
207 some improvements:
06c8fc8f 208 - The maximum length of contracted CE was not checked (v0.22 to v0.26).
1d2654e1
JH
209 Collation of a large string including a first letter of a contraction
210 that is not a part of that contraction (say, 'c' of 'ca'
211 where 'ch' is defined) was too slow, inefficient.
91ae00cb
NC
212 - A form name for 'normalization', no longer restricted to
213 /^(?:NF)?K?[CD]\z/, will be allowed as long as
214 Unicode::Normalize::normalize() accepts it, since Unicode::Normalize
215 or UAX #15 may be changed/enhanced in future.
1d2654e1
JH
216 - When Hangul syllables are decomposed under <normalization => undef>,
217 contraction among jamo (LV, VT, LVT) derived from the same
584e761d
CBW
218 Hangul syllable is allowed.
219 - Added t/hangul.t.
1d2654e1 220
4c843366
JH
2210.26 Sun Aug 03 22:23:17 2003
222 - fix: an expansion in which a CE is level 3 ignorable and others are not
1d2654e1 223 was wrongly made level 3 ignorable as a whole entry.
4c843366
JH
224 (In DUCET, some precomposites in Musical Symbols are so)
225
ae6aa562
JH
2260.25 Mon Jun 06 23:20:17 2003
227 - fix Makefile.PL.
228 - internal tweak (again): pack_U() and unpack_U().
45394607 229
9f1f04a1
RGS
2300.24 Thu Apr 02 23:12:54 2003
231 - internal tweak for (?un)pack 'U'.
232
4d36a948
ST
2330.23 Wed Sep 04 19:25:20 2002
234 - fix: scalar match() no longer returns an lvalue substr ref.
235 - fix: "Ignorable after variable" should be made level 3 ignorable
236 even if alternate => 'blanked'.
237 - Now a grapheme may contain trailing level 2, level 3,
238 and completely ignorable characters.
239
2400.22 Mon Sep 02 23:15:14 2002
584e761d
CBW
241 - New File: t/index.t.
242 (The new t/test.t excludes tests for index.)
4d36a948
ST
243 - tweak on index(). POSITION is supported.
244 - add match, gmatch, subst, gsubst methods.
245 - fix: ignorable after variable in 'shift'-variable weight.
246
caffd4cf
ST
2470.21 Sat Aug 03 10:24:00 2002
248 - upgrade keys.txt and t/test.t for UCA Version 9.
249
0116f5dc
JH
2500.20 Fri Jul 26 02:15:25 2002
251 - now UCA Version 9.
252 - U+FDD0..U+FDEF are new non-characters.
253 - fix: whitespace characters before @backwards etc. in a table file.
254 - now values for 'alternate', 'backwards', etc.,
255 which are explicitly specified via new(),
256 are preferred to those specified in a table file.
257
327745dc
ST
2580.12 Sun May 05 09:43:10 2002
259 - add new methods, ->UCA_Version and ->Base_Unicode_Version.
260 - test fix: removed the needless requirement of Unicode::Normalize.
261 [reported by David Hand]
262
809c7673
ST
2630.11 Fri May 03 02:28:10 2002
264 - fix: now derived collation elements can be used for Hangul Jamo
265 when their weights are not defined.
327745dc 266 [reported by Andreas J. Koenig]
809c7673
ST
267 - fix: rearrangements had not worked.
268 - mentioned pleblem on index() in BUGS.
269 - more documents, more tests.
270 - tag names for 'alternate' are case-insensitive (i.e. 'SHIFTed' etc.).
271 - The <undef> value for the keys "overrideCJK", "overrideHangul",
272 "rearrange" has a special behavior (different from default).
273
905aa9f0
ST
2740.10 Tue Dec 11 23:26:42 2001
275 - now you are allowed to use no table file.
276 - fix: fetching CE with two or more combining characters.
277
5398038e 2780.09 Sun Nov 11 17:02:40:18 2001
279 - add the following methods: eq, ne, lt, le, gt, le.
280 - relies on &Unicode::Normalize::getCombinClass()
281 in place of %Unicode::Normalize::Combin
282 (the hash is not defined in the XS version of Unicode::Normalize).
283 then you should install Unicode::Normalize 0.10 or later.
284 - now independent of Lingua::KO::Hangul::Util
285 (this module does decomposition of Hangul syllables for itself)
286
d16e9e3d
JH
2870.08 Mon Aug 20 22:40:18 2001
288 - add the index method.
289
45394607
JH
2900.07 Thu Aug 16 23:42:02 2001
291 - rename the module name to Unicode::Collate.
292
2930.06 Thu Aug 16 23:18:36 2001
294 - add description of the getSortKey method.
295
2960.05 Mon Aug 13 22:23:11 2001
297 - bug fix: on the things of 4.2.1, UTR #10
298 - getSortKey returns a string, but not an arrayref.
299
3000.04 Mon Aug 13 22:23:11 2001
301 - some bugs are fixed.
302 - some tailoring parameters are added.
303
3040.03 Mon Aug 06 06:26:35 2001
305 - modify README
306
3070.02 Sun Aug 05 20:20:01 2001
308 - some fix
309
3100.01 Sun Jul 29 16:16:15 2001
311 - original version; created by h2xs 1.21
312 with options -A -X -n Sort::UCA