This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Document the refcount of version functions’ retval
[perl5.git] / cpan / Unicode-Collate / Changes
CommitLineData
ae6aa562
JH
1Revision history for Perl module Unicode::Collate.
2
b5d9a953
CBW
30.67 Sun Nov 14 11:38:59 2010
4 - supported UCA_Version 22 for Unicode 6.0.0.
5 * 2B740..2B81D are new CJK unified ideographs.
6 * noncharacters (e.g. U+FFFF) should be overridable, not be ignored.
7 ! DUCET is NOT updated, as no maint perl supports Unicode 6.0.0.
8 Thus the default UCA_Version is still 20.
9 - added t/nonchar.t.
10 - improved discontiguous contractions of 3 or more characters.
11 (e.g. 0FB2 0F71 0F80 and 0FB3 0F71 0F80)
12 - auxiliary: now 'mklocale' also copes with Korean.pm according to DUCET.
13
584e761d
CBW
140.66 Sun Nov 7 10:47:30 2010
15 - U::C::Locale newly supports locale: ko.
16 - added Unicode::Collate::CJK::Korean for ko.
17 - added t/loc_ko.t.
18 - 12 compat. ideographs (e.g. U+FA0E) are treated as unified ideographs.
19 (though DUCET also does it, now Unicode::Collate does it without DUCET.)
20 - added t/compatui.t.
21 ! Ideographs Ext.B (U+20000..U+2A6D6) can be overrided with UCA_Version 8.
22 This is a long-standing behavior from Unicode::Collate 0.11 to 0.63.
23 A wrong fix at 0.64 should be abandoned.
24
028d3bfa
CBW
250.65 Wed Nov 3 13:10:20 2010
26 - U::C::Locale newly supports locale: zh and its some variants.
584e761d 27 (zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke)
028d3bfa
CBW
28 - added Unicode::Collate::CJK::Big5 for zh__big5han.
29 - added Unicode::Collate::CJK::GB2312 for zh__gb2312han.
30 - added Unicode::Collate::CJK::Pinyin for zh__pinyin.
31 - added Unicode::Collate::CJK::Stroke for zh__stroke.
584e761d 32 - added loc_zh.t, loc_zhb5.t, loc_zhgb.t, loc_zhpy.t, loc_zhst.t in t.
028d3bfa 33
539ce3d8
CBW
340.64 Sun Oct 31 14:17:29 2010
35 - U::C::Locale newly supports locale: ja.
36 - added Unicode::Collate::CJK::JISX0208 for ja.
584e761d 37 - added loc_ja.t, loc_jait.t, loc_japr.t in t.
539ce3d8
CBW
38 - a subroutine specified in 'overrideCJK' or 'overrideHangul' is allowed
39 to return an integer or undef value.
584e761d
CBW
40 - fix: Ideographs Ext.B (U+20000..U+2A6D6) are assigned in Unicode 3.1,
41 then 'overrideCJK' should not override them with UCA_Version 8.
42 !! sorry, this fix is based on a wrong idea. reverted at 0.66. !!
43 - separated t/overcjk0.t and t/overcjk1.t from t/override.t.
539ce3d8 44
aa7758f7
CBW
450.63 Sun Oct 10 22:13:21 2010
46 - supported suppress contractions (see 'suppress' in POD).
028d3bfa 47 - internal for 'hangul_terminator' in getSortKey().
aa7758f7 48 - U::C::Locale newly supports locales: be, bg, kk, mk, ru, sr.
584e761d
CBW
49 - added loc_be.t, loc_bg.t, loc_cyrl.t, loc_kk.t, loc_mk.t, loc_ru.t,
50 loc_sr.t in t.
aa7758f7
CBW
51 - added tailoring with U+0340 or U+0341 instead of U+0300 or U+0301.
52 (affected locales: hr, is, pl, se, to, wo)
53
6709de88
CBW
540.62 Wed Oct 6 21:35:54 2010
55 - U::C::Locale newly supports locales: ar, hu, hy, se, to, uk.
584e761d 56 - added loc_ar.t, loc_hu.t, loc_hy.t, loc_se.t, loc_to.t, loc_uk.t in t.
6709de88
CBW
57 - Vietnamese (vi): added tailoring for U+0340 and U+0341.
58
c02ee425
CBW
590.61 Sat Oct 2 11:41:29 2010
60 - U::C::Locale newly supports locales: hr, ig, sq.
584e761d 61 - added loc_hr.t, loc_ig.t, loc_sq.t in t.
c02ee425
CBW
62 - precomposites of e-dot-below, o-dot-below, o-tilde are tailored as well.
63 (affected locales: et, yo)
64 - Vietnamese (vi): added contractions for non-blocked decompositions
aa7758f7 65 * base + dot-below + mark such as a\x{323}\x{306}, \x{1EA1}\x{306} etc.
6709de88 66 * base + tone + horn such as o\x{309}\x{31B}, \x{1ECF}\x{31B} etc.
c02ee425 67
1393fe00
CBW
680.60 Thu Sep 23 21:37:36 2010
69 - bug fix: index() [and its friends including gmatch()] didn't remove
70 ignorable characters in the substring correctly.
71 Thanks for the bug report:
aa7758f7 72 http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2010-09/msg00014.html
1393fe00
CBW
73
74 - U::C::Locale newly supports locales: de__phonebook, nso, om, tn, vi.
584e761d 75 - added loc_de.t, loc_deph.t, loc_nso.t, loc_om.t, loc_tn.t, loc_vi.t in t.
1393fe00
CBW
76 - precomposites of a-breve, a-circ, e-circ, o-circ are tailored as well.
77 (affected locales: ro, sk, sv)
78
f1a7422f
CBW
790.59 Sun Sep 5 17:03:52 2010
80 - U::C::Locale newly supports locales: az, fil, ha, lt, mt, tr, wo, yo.
584e761d
CBW
81 - added loc_az.t, loc_fil.t, loc_ha.t, loc_lt.t, loc_mt.t, loc_tr.t,
82 loc_wo.t, loc_yo.t in t.
f1a7422f
CBW
83 - precomposites of a-uml, o-uml, and u-uml are tailored as well.
84 (affected locales: da, et, fi, fo, is, kl, nb, nn, sk, sv)
85
6484f676
CBW
860.58 Sun Aug 29 19:56:50 2010
87 - U::C::Locale newly supports locales: af, cy, da, fo, haw, is, kl, sw.
584e761d
CBW
88 - added loc_af.t, loc_cy.t, loc_da.t, loc_fo.t, loc_haw.t, loc_is.t,
89 loc_kl.t, loc_sw.t in t.
6484f676 90
64dc7822 910.57 Sun Aug 22 22:39:58 2010
6484f676 92 - U::C::Locale newly supports locales: ca, et, fi, lv, sk, sl.
584e761d 93 - added loc_ca.t, loc_et.t, loc_fi.t, loc_lv.t, loc_sk.t, loc_sl.t in t.
64dc7822 94
456a1446
CBW
950.56 Sun Aug 8 20:24:03 2010
96 - Unicode::Collate::Locale newly supports locales: eo, nb, ro, sv.
584e761d 97 - added loc_eo.t, loc_es.t, loc_estr.t, loc_nb.t, loc_ro.t, loc_sv.t in t.
456a1446 98 ! renamed t/locale_{xy}.t to t/loc_{xy}.t (for safer 8.3 names)
584e761d 99 (loc_cs.t, loc_fr.t, loc_nn.t, loc_pl.t, loc_test.t)
456a1446 100
00e00351 1010.55 Sun Aug 1 21:21:23 2010
aa7758f7
CBW
102 - incorporated Unicode::Collate::Locale with some changes. see:
103 http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2004-03/msg00030.html
456a1446 104 - supported locales: cs, es, es__traditional, fr, nn, pl.
00e00351 105 ! added t/locale*.t that uses DUCET.
584e761d 106 (locale_cs.t, locale_fr.t, locale_nn.t, locale_pl.t, locale_test.t)
b5d9a953 107 - data/*.txt and mklocale for preparation of Locale/*.pl from DUCET.
00e00351
CBW
108
1090.54 Sun Jul 25 21:37:04 2010
110 - Now UCA Revision 20 (based on Unicode 5.2.0).
111 - DUCET is also updated (for Unicode 5.2.0) as Collate/allkeys.txt,
112 which *is required* to test this module.
113 ! Please notice that allkeys.txt will be overwritten if you have had
114 other allkeys.txt already.
b5d9a953 115 - U+9FC4..U+9FCB and U+2A700..U+2B734 are new CJK unified ideographs.
00e00351
CBW
116 - Many hangul jamo are assigned (affecting hangul_terminator).
117
118 ! DUCET will be compiled when XS is used. Explicit saying
119 <table => 'allkeys.txt'> (or using another table) will prevent
1393fe00 120 this module from using the compiled DUCET.
00e00351
CBW
121
122 ! added t/default.t that uses DUCET.
123
74b94a79
CBW
1240.53 Sun Feb 14 20:46:27 2010
125 - Now UCA Revision 18 (based on Unicode 5.1.0).
00e00351 126 - DUCET is also updated (for Unicode 5.1.0) as Collate/allkeys.txt,
74b94a79
CBW
127 which is not required to test this module.
128 ! Please notice that allkeys.txt will be overwritten if you have had
129 other allkeys.txt already.
b5d9a953 130 - U+9FBC..U+9FC3 are new CJK unified ideographs.
74b94a79 131
6d24ed10
SP
1320.52 Thu Oct 13 21:51:09 2005
133 - The Unicode::Collate->new method does not destroy user's $_ any longer.
134 (thanks to Jon Warbrick for bug report)
135
0d50d293
RGS
1360.51 Sun May 29 20:21:19 2005
137 - Added the latest DUCET (for Unicode 4.1.0) as Collate/allkeys.txt,
138 which is not required to test this module.
74b94a79 139 ! Please notice that allkeys.txt will be overwritten if you have had
0d50d293
RGS
140 other allkeys.txt already.
141 - Added INSTALL section in POD.
142
3756e7ca
RGS
1430.50 Sun May 8 20:26:39 2005
144 - Now UCA Revision 14 (based on Unicode 4.1.0).
145 - Some tests are modified.
584e761d 146 - Added cjkrange.t, ignor.t, override.t in t.
3756e7ca
RGS
147 - Added META.yml.
148
1490.40 Sat Apr 24 06:54:40 2004
150 - Now a table file is searched in @INC.
151
abd1ec54
NC
1520.33 Sat Dec 13 14:07:27 2003
153 - documentation improvement: in "entry", "overrideHangul", etc.
154
1550.32 Wed Dec 3 23:38:18 2003
156 - A matching part from index(), match() etc. will include illegal
157 code points (as well as ignorable characters) following a grapheme.
158 - Contraction with illegal code point will be invalid.
584e761d
CBW
159 - Added t/view.t.
160 - Added some tests in t/illegal.t.
161 - Separated t/altern.t and t/rearrang.t from t/test.t.
abd1ec54
NC
162 - modified XSUB internals.
163
10d7ec48
NC
1640.31 Sun Nov 16 15:40:15 2003
165 - Illegal code points (surrogate and noncharacter; they are definitely
166 ignorable) will be distinguished from NULL ("\0");
167 but porting is not successful in the case of ((Pure Perl) and
168 (Perl 5.7.3 or before)). If perl 5.6.X is used, XSUB may help it
169 in place of broken CORE::unpack('U*') in older perl.
584e761d 170 - added illegal.t and illegalp.t in t.
abd1ec54
NC
171 - added XSUB (EXPERIMENTAL!) where some functions are implemented
172 in XSUB. Pure Perl is also supported.
10d7ec48 173
91ae00cb
NC
1740.30 Mon Oct 13 21:26:37 2003
175 - fix: Completely ignorable in table should be able to be overrided
176 by non-ignorable in entry.
177 - fix: Maximum length for contraction must not be shortened
10d7ec48 178 by a shorter contraction following in table and/or entry.
584e761d 179 - added t/normal.t.
91ae00cb
NC
180 - some doc fixes
181
1820.29 Mon Oct 13 12:18:23 2003
abd1ec54 183 - now UCA Version 11 (but no functionality is different from Version 9).
91ae00cb
NC
184 - supported hangul_terminator.
185 - fix: Base_Unicode_Version falsely returns Perl's Unicode version.
186 C4 in UTS #10 requires UTS's Unicode version.
187 - For variable weighting, 'variable' is recommended
188 and 'alternate' is deprecated.
189 - added version() method.
584e761d 190 - added hangtype.t, trailwt.t, variable.t, and version.t in t.
91ae00cb 191
06c8fc8f
RGS
1920.28 Sat Sep 06 20:16:01 2003
193 - Fixed another inconsistency under (normalization => undef):
194 Non-contiguous contraction is always neglected.
195 - Fixed: according to S2.1 in UTS #10, a blocked combining character
584e761d
CBW
196 should not be contracted. One test in t/test.t was wrong, then removed.
197 - Added t/contract.t.
06c8fc8f
RGS
198 - (normalization => "prenormalized") is able to be used.
199
1d2654e1
JH
2000.27 Sun Aug 31 22:23:17 2003
201 some improvements:
06c8fc8f 202 - The maximum length of contracted CE was not checked (v0.22 to v0.26).
1d2654e1
JH
203 Collation of a large string including a first letter of a contraction
204 that is not a part of that contraction (say, 'c' of 'ca'
205 where 'ch' is defined) was too slow, inefficient.
91ae00cb
NC
206 - A form name for 'normalization', no longer restricted to
207 /^(?:NF)?K?[CD]\z/, will be allowed as long as
208 Unicode::Normalize::normalize() accepts it, since Unicode::Normalize
209 or UAX #15 may be changed/enhanced in future.
1d2654e1
JH
210 - When Hangul syllables are decomposed under <normalization => undef>,
211 contraction among jamo (LV, VT, LVT) derived from the same
584e761d
CBW
212 Hangul syllable is allowed.
213 - Added t/hangul.t.
1d2654e1 214
4c843366
JH
2150.26 Sun Aug 03 22:23:17 2003
216 - fix: an expansion in which a CE is level 3 ignorable and others are not
1d2654e1 217 was wrongly made level 3 ignorable as a whole entry.
4c843366
JH
218 (In DUCET, some precomposites in Musical Symbols are so)
219
ae6aa562
JH
2200.25 Mon Jun 06 23:20:17 2003
221 - fix Makefile.PL.
222 - internal tweak (again): pack_U() and unpack_U().
45394607 223
9f1f04a1
RGS
2240.24 Thu Apr 02 23:12:54 2003
225 - internal tweak for (?un)pack 'U'.
226
4d36a948
ST
2270.23 Wed Sep 04 19:25:20 2002
228 - fix: scalar match() no longer returns an lvalue substr ref.
229 - fix: "Ignorable after variable" should be made level 3 ignorable
230 even if alternate => 'blanked'.
231 - Now a grapheme may contain trailing level 2, level 3,
232 and completely ignorable characters.
233
2340.22 Mon Sep 02 23:15:14 2002
584e761d
CBW
235 - New File: t/index.t.
236 (The new t/test.t excludes tests for index.)
4d36a948
ST
237 - tweak on index(). POSITION is supported.
238 - add match, gmatch, subst, gsubst methods.
239 - fix: ignorable after variable in 'shift'-variable weight.
240
caffd4cf
ST
2410.21 Sat Aug 03 10:24:00 2002
242 - upgrade keys.txt and t/test.t for UCA Version 9.
243
0116f5dc
JH
2440.20 Fri Jul 26 02:15:25 2002
245 - now UCA Version 9.
246 - U+FDD0..U+FDEF are new non-characters.
247 - fix: whitespace characters before @backwards etc. in a table file.
248 - now values for 'alternate', 'backwards', etc.,
249 which are explicitly specified via new(),
250 are preferred to those specified in a table file.
251
327745dc
ST
2520.12 Sun May 05 09:43:10 2002
253 - add new methods, ->UCA_Version and ->Base_Unicode_Version.
254 - test fix: removed the needless requirement of Unicode::Normalize.
255 [reported by David Hand]
256
809c7673
ST
2570.11 Fri May 03 02:28:10 2002
258 - fix: now derived collation elements can be used for Hangul Jamo
259 when their weights are not defined.
327745dc 260 [reported by Andreas J. Koenig]
809c7673
ST
261 - fix: rearrangements had not worked.
262 - mentioned pleblem on index() in BUGS.
263 - more documents, more tests.
264 - tag names for 'alternate' are case-insensitive (i.e. 'SHIFTed' etc.).
265 - The <undef> value for the keys "overrideCJK", "overrideHangul",
266 "rearrange" has a special behavior (different from default).
267
905aa9f0
ST
2680.10 Tue Dec 11 23:26:42 2001
269 - now you are allowed to use no table file.
270 - fix: fetching CE with two or more combining characters.
271
5398038e 2720.09 Sun Nov 11 17:02:40:18 2001
273 - add the following methods: eq, ne, lt, le, gt, le.
274 - relies on &Unicode::Normalize::getCombinClass()
275 in place of %Unicode::Normalize::Combin
276 (the hash is not defined in the XS version of Unicode::Normalize).
277 then you should install Unicode::Normalize 0.10 or later.
278 - now independent of Lingua::KO::Hangul::Util
279 (this module does decomposition of Hangul syllables for itself)
280
d16e9e3d
JH
2810.08 Mon Aug 20 22:40:18 2001
282 - add the index method.
283
45394607
JH
2840.07 Thu Aug 16 23:42:02 2001
285 - rename the module name to Unicode::Collate.
286
2870.06 Thu Aug 16 23:18:36 2001
288 - add description of the getSortKey method.
289
2900.05 Mon Aug 13 22:23:11 2001
291 - bug fix: on the things of 4.2.1, UTR #10
292 - getSortKey returns a string, but not an arrayref.
293
2940.04 Mon Aug 13 22:23:11 2001
295 - some bugs are fixed.
296 - some tailoring parameters are added.
297
2980.03 Mon Aug 06 06:26:35 2001
299 - modify README
300
3010.02 Sun Aug 05 20:20:01 2001
302 - some fix
303
3040.01 Sun Jul 29 16:16:15 2001
305 - original version; created by h2xs 1.21
306 with options -A -X -n Sort::UCA