1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.0 2004/05/16 20:55:15 dankogai Exp $
6 $Revision: 2.0 $ $Date: 2004/05/16 20:55:15 $
7 * version updated to 2.00
8 -- sorry, no big feature change. I just hate version 1.100 :)
11 addressed UTF-(8|32LE) + BOM misguessing
12 https://rt.cpan.org/Ticket/Display.html?id=6279
14 s/is_utif8/is_utf8/ in POD
15 ! Encode/lib/Encode/CN/HZ.pm
16 Fixes "make test" failure after the patch to pp_hot.c
18 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
20 From: autrijus@autrijus.org
21 Subject: [PATCH] "piconv -C 512" badly broken
22 Message-Id: <1072870210.769.5.camel@localhost>
24 1.99 2003/12/29 02:47:16
26 find_encoding("UTF-16BE")->encode("abc") now null terminates
27 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
29 prototype bug in decode_utf8() fixed
30 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
31 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
32 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
34 Merged from maintperl@21987
36 1.98 2003/08/20 11:15:31
37 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
38 Dave Evans has found and corrected a bug in Encode::MIME::Header.
39 Test suite added by Dan Kogai.
40 Message-Id: <3F43440B.7060606@rudolf.org.uk>
42 Typo fixes rolled back in from bleedperl
44 v-strings, now deprecated in perl 5.8.1, are replaced by sadahiro
45 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
47 argv case nit for VMS by Craig
48 Message-ID: <3F2B02DE.10207@mac.com>
49 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
50 Encode test fixes for VMS by Peter Prymmer
51 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
52 ! lib/Encode/Alias.pm t/Aliases.t
53 koi-8 aliases bug detected and patched by sadahiro.
54 Further fix and test suite by dankogai
55 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
57 1.97 2003/07/08 21:52:14
58 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
59 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
60 $DEBUG replaced with DEBUG() so perl optimizes better,
61 by Rafael with further fixes by dankogai
62 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
63 ! lib/Encode/Aliases.pm
64 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
65 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
66 So new hash seeding introduced in bleedperl works.
67 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
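The difference between the two patterns above can be checked directly. This is only a sketch: the sample names are chosen for illustration, and the reading that the old pattern's overlap with "gb2312-raw" made alias resolution order-dependent is an assumption, not something the entry states.

```perl
use strict;
use warnings;

# The two patterns quoted in the entry above.
my $old = qr/\bGB[-_ ]?2312(?:\D.*$|$)/i;
my $new = qr/\bGB[-_ ]?2312(?!-?raw)/i;

# Illustrative names only; the last one is the raw table, not euc-cn.
my %hit;
for my $name (qw(GB2312 gb_2312-80 gb2312-raw)) {
    $hit{$name} = {
        old => ($name =~ $old ? 1 : 0),
        new => ($name =~ $new ? 1 : 0),
    };
    printf "%-12s old=%d new=%d\n", $name, $hit{$name}{old}, $hit{$name}{new};
}
```

The old pattern also matches "gb2312-raw", while the new one rejects it through the negative lookahead.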
69 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
70 automatic utf(8|16|32) guessing -- originally by Autrijus
71 Message-Id: <20030626162731.GA2077@not.autrijus.org>
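A minimal sketch of the automatic guessing this switch controls, assuming a current Encode::Guess. The input is built with encode("UTF-16", ...), which prepends a BOM; the variable names are illustrative.

```perl
use strict;
use warnings;
use Encode qw(encode);
use Encode::Guess;    # exports guess_encoding()

# Setting $Encode::Guess::NoUTFAutoGuess = 1 would turn the BOM-based
# shortcut off; it is left at its default here.
my $octets = encode("UTF-16", "abc");    # big-endian with a leading BOM
my $guess  = guess_encoding($octets);

# guess_encoding() returns an encoding object on success and a plain
# error string otherwise, so check with ref() before calling methods.
my $name = ref $guess ? $guess->name : "no unique guess: $guess";
print "$name\n";
```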
73 Addressed the following;
74 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
75 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
77 1.96 2003/06/18 09:29:02
78 ! lib/Encode/JP/JP.pm t/guess.t
79 m/(...)/ in void context then $1 is considered a Bad Thing
80 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
82 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
85 More typecast from maintperl@19739
86 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
88 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
89 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
91 Now skips for EBCDIC platform.
92 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
94 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
95 have more than one C<.> in their filenames." -- perlport.
96 Message-Id: <3ED79E01.8050401@mac.com>
98 Found and fixed the bug that -p,--perlqq does not work.
99 Induced by the change from Getopt::Std to Getopt::Long.
101 Addressed [cpan #2629] Wrong assumption in numeric comparison
102 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
103 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
104 lib/Encode/Encoding.pm t/perlio.t
105 ! API Change: ->new_sequence() => ->renew()
106 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
107 + Encode::XS and Encode::utf8 now support ->renew()
108 + Encode::Encoding now documents this with examples
109 - Non-XS (en|de)code stripped out of Encode::Unicode
110 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
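A short sketch of how ->renew() might be called, assuming a current Encode. The octets are a hand-built UTF-16 sample with a BOM; the variable names are illustrative.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

my $shared  = find_encoding("UTF-16");
# renew() hands back an object that is safe to keep per stream; for
# encodings without per-stream state it may simply return the same object.
my $private = $shared->renew;

my $octets = pack("n*", 0xFEFF, 0x41, 0x42);    # BOM, "A", "B"
my $text   = $private->decode($octets);
print "$text\n";
```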
112 1.95 2003/05/21 08:41:11
114 Since bogus entries were found in iso-8859-6, all entries are
115 re-generated once again out of
116 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
117 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
118 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
119 + lib/Encode/Unicode/UTF7.pm
120 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
121 lib/Encode/Supported.pod
122 UTF-7 support is now added. With this Encode now has all transcoding
123 methods in Unicode::String.
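A minimal round-trip sketch for the UTF-7 support, assuming the encoding is registered under the name "UTF-7" as in current releases. The sample text is arbitrary.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

my $text = "Hi \x{3042}";              # ASCII plus HIRAGANA LETTER A
my $utf7 = encode("UTF-7", $text);     # 7-bit safe octets
my $back = decode("UTF-7", $utf7);     # back to the original characters

print "$utf7\n";
```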
125 1.94 2003/05/10 18:13:59
126 ! lib/Encode/MIME/Header.pm
127 A more sophisticated solution for double-encoding by dankogai
128 ! lib/Encode/MIME/Header.pm AUTHORS
129 Two bugs fixed by Bjoern Jacke
130 * "Double Encoding" was not possible
131 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
132 * encode("MIME-Q") had UTF-8 flag on
133 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
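A small sketch around the encoded word quoted above, assuming a current Encode::MIME::Header. It decodes the sample, encodes the result again, and checks that the text survives; it does not try to reproduce the original bug.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

my $word  = "=?UTF-8?B?w4RwZmVs?=";          # sample from the entry above
my $text  = decode("MIME-Header", $word);    # characters, not octets
my $again = encode("MIME-B", $text);         # back to an encoded word
my $round = decode("MIME-Header", $again);

print "$again\n";
```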
134 ! lib/Encode/MIME/Header.pm AUTHORS
135 Two occurrences of "croak ()" fixed as "croak qq()".
136 Simon Cozens is added to AUTHORS as a result.
137 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
139 POD fixes that reflect enhancements by jhi
141 Two enhancements by jhi.
142 + Now uses Getopt::Long so it accepts long name options
143 (--from for -f, for example)
144 + New option: -r,--resolve
145 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
147 META.yml added upon request of Schwern
148 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
150 Enache Adrian removed upon request -- to live longer than Encode
151 and/or FreeBSD (toy-)?thread :)
152 Message-Id: <20030425015701.GA2069@ratsnest.hole>
154 "close STDOUT unless $^O eq 'freebsd';" once again relocated
155 to keep VMS happy in which case "$^O eq 'freebsd'" is required
156 to keep FreeBSD+thread happy. Sigh.
157 Message-Id: <3EA88ADC.3000300@mac.com>
159 1.93 2003/04/24 17:43:16
161 added "no warnings 'pack'" in for loop to keep bleedperl from
162 complaining "Character in 'C' format wrapped in pack".
164 More elegant perl core detection inspired by Ilya Zakharevich
165 (but further elaborated for general cases).
166 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
169 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
170 are less likely to be confused w/ ISO-8859-*
172 RT tests added (vendor encodings are exemplified)
173 -- that successfully found a flaw on iso-2022-kr before the patch.
174 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
175 decode("ISO-2022-KR") has been buggy but no one ever cited
176 that since no one seems to be using it. Bugs discovered by
178 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
179 ! lib/Encode/CN/HZ.pm t/perlio.t
180 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
181 so it adds test for HZ.
182 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
183 ! lib/Encode/Guess.pm
184 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
185 So long as the string contains \x{00}-\x{ff} it does not fail.
186 See perldoc for details.
187 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
189 1.92 2003/03/31 03:27:27
190 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
191 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
192 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
193 but glibc-BIG5-2.1.2.ucm does.
194 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
195 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
196 Maps regenerated again but this time based upon
197 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
198 (But where is THE DOCUMENT by MICROSOFT?)
199 ! t/enc_module.t AUTHORS
200 failure with threaded Perl on FreeBSD addressed.
201 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
202 Message-Id: <20030322230131.GA813@ratsnest.hole>
203 ! lib/Encode/Guess.pm
206 Change 18989: Make the :bytes conditional on PerlIO.
207 Further modified by Dan Kogai
208 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
210 Change 18966: another fix for failing test on windows ("use encoding"
211 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
212 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
214 Change 18970: Hopefully this works also in Win32, by jhi
215 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
216 Change 18965: fix CJKT.t failures on windows due to incorrect
218 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
220 1.91 2003/03/09 20:07:37
222 even more proofread by jhi.
223 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
226 +use lib qw(t ext/Encode/t ../ext/Encode/t);
227 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
229 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
230 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
233 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
235 1.90 2003/03/09 17:32:43
238 Inaba-san has added a patch for perl 5.8.1 or later that makes
239 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
240 that. POD is further revised.
241 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
242 ! encoding.pm t/enc_module.t
243 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
244 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
248 signed vs. unsigned issue discovered by Craig on OpenVMS
249 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
250 ! encoding.pm AUTHORS
251 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
252 Because binmode() stacks layers instead of overwrite, you have to
253 ":raw :encoding()" in encoding.pm or you are in trouble when you
254 call encoding.pm multiple times. There are several workarounds
255 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
256 who was the first to address this problem was added to AUTHORS.
257 A test suite was added for this, which is a modified version
258 of SUGAWARA-san's scripts
259 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
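The layer-stacking point can be sketched with a temporary file. This assumes a perl built with PerlIO; the file handling and names are illustrative and are not taken from encoding.pm itself.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

my ($fh, $path) = tempfile(UNLINK => 1);

# ":raw" pops any encoding layer left from an earlier call before
# ":encoding()" is pushed, so repeating the call does not stack two.
for (1 .. 2) {
    binmode($fh, ":raw :encoding(UTF-8)") or die "binmode failed: $!";
}
print {$fh} "\x{3042}";                 # HIRAGANA LETTER A
close $fh or die "close failed: $!";

# Read the file back as raw bytes to see what was actually written.
open my $in, '<:raw', $path or die "open failed: $!";
my $bytes = do { local $/; <$in> };
close $in;

printf "%d bytes written\n", length $bytes;
```

Without the leading ":raw", the second call would push another encoding layer on top of the first and the character would be encoded twice.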
261 1.88 2003/02/20 14:42:34
263 one signedness nit for Encode by jhi
264 <200302161933.h1GJX876018710@kosh.hut.fi>
266 VISCII map was incorrect; fixed by Sadahiro-san
267 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
268 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
269 You can't unlink files that are opened in cygwin but the last
270 file handle opened in t/enc_*.t left open. Patch submitted
271 by Yitzchak and he was added to AUTHORS.
272 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
274 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
275 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
277 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
278 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
280 1.87 2003/02/06 01:52:11
282 * Inaba "Sensei" Hirohito added (I thought I had done so long
283 ago but apparently I did not).
284 * SUZUKI Norio added for various and useful bug reports.
285 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
286 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
287 podchecked so all warnings are gone except for L<http://>.
288 ! encoding.pm t/enc_eucjp.t
289 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
290 * Filter option overhaul
292 ! Encode.pm Encode.xs encengine.c Encode/encode.h
293 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
294 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
295 without filter. podchecked by Dan Kogai.
296 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
297 ! lib/Encode/Alias.pm
298 decode('alias', $1) went wild because of local $_ in find_alias()
299 the evil local $_ is eradicated but that changes find_alias()
300 format for coderef aliasing. See Encode::Alias for details
301 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
303 1.86 2003/01/22 03:29:07
305 * Don't forget to canonize when you attempt an exact match!
306 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
307 * ${^ENCODING} exception is off for $] > 5.008
308 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
310 $] check commented out so it runs on 5.8.0
312 1.85 2003/01/21 22:19:14
314 ${^ENCODING} exception is now explicit rather than handled by regex.
315 + t/enc_eucjp.t t/enc_utf8.t
316 Test suite for the better "encoding" pragma support for bleedperl.
317 On 5.8.0, they will just be skipped.
319 1.84 2003/01/10 12:00:16
321 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
322 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
323 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
325 %_ fixes saves the resulting .so .05% smaller, by NC
326 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
328 Silence Encode on undef, by Andreas
329 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
330 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
332 s/regognised/recognised/ . British spelling left intact to pay
333 respect to two British Nicks :)
334 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
336 1.83 2002/11/18 17:28:49
337 ! Encode.xs lib/Encode/JIS7.pm
338 Even more patches from Inaba-san have been applied. With this
339 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
340 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
342 1.82 2002/11/14 23:06:12
344 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
345 later found in perls w/ -Dusemymalloc was fixed by NC.
346 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
348 1.81 2002/11/08 18:29:27
349 ! Encode.pm Encode.xs
350 Non-XS version of Encode::utf8 is back (with XS being default).
351 Encode::predefine_encodings(0) to turn off XS.
352 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
353 found that it was Test::More causing the problem, not Encode.
354 But I have already made it configurable so it may be useful in
356 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
358 The ingenious patch by Nicholas Clark that reduces shlib sizes by
359 50% with no penalty and backward compatibility preserved, is in.
360 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
362 1.80 2002/10/21 20:39:09
363 ! Encode.xs t/mime-header.t
364 Even more patches from NI-XS regarding Encode::utf8->decode().
365 And one more test to t/mime-header.t to prove it
366 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
368 1.79 2002/10/21 06:05:37
370 Further patches from NI-XS. Encode::utf8->decode() now checks the
371 value of utf8 flag of the argument. As a result, the fix to
372 lib/Encode/MIME/Header.pm is no longer necessary but since it did
373 no harm (even speedwise) I'll leave it unreverted.
374 ! ucm/cp949.ucm ucm/cp950.ucm
376 U+00AE REGISTERED SIGN
377 were missing as a result of 1.78. Discovered by Moriyama-san.
378 Moriyama-san has also developed a test script that compares
379 (en|de)coded results to the corresponding Win32 API result and
380 all cp9?? maps are now verified.
381 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
383 1.78 2002/10/20 15:44:00
384 ! lib/Encode/MIME/Header.pm
385 fixed so that it works with new Encode::utf8
386 ! Encode.pm Encode.xs
387 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
388 :encoding(UTF-8) to handle partial chars at end of buffers
390 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
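Partial characters at the end of a buffer can also be handled by hand with decode() and FB_QUIET, along the lines of the buffering idiom in the Encode documentation. A minimal sketch with a made-up split of one three-byte character:

```perl
use strict;
use warnings;
use Encode qw(decode);

my $buffer = "\xE3\x81";    # first two bytes of U+3042, third not yet read
my $string = decode("UTF-8", $buffer, Encode::FB_QUIET);
# With FB_QUIET the unprocessed octets stay in $buffer for the next round.
my $pending = length $buffer;

$buffer .= "\x82";          # the missing byte arrives
$string .= decode("UTF-8", $buffer, Encode::FB_QUIET);

printf "U+%04X, %d octets left\n", ord $string, length $buffer;
```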
391 ! lib/Encode/Supported.pod
392 More nitpickings applied.
395 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
396 tests were completely useless. To redeem that and get the peace of
397 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
398 ucm/*.ucm Since this script takes too long to finish (30 seconds on
399 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
400 but you can easily run that by either renaming it or:
402 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
403 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
404 Other CJKT cp9?? also updated according to the URI below;
405 http://www.microsoft.com/typography/unicode/cscp.htm
406 + bin/ucmsort MANIFEST
407 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
408 to proper order. Intended for hardcore developers only.
409 ! ucm/cp932.ucm JP/JP.pm AUTHORS
410 CP932 mapping which was based upon the mapping file at unicode.org
411 was found obsolete by MORIYAMA Masayuki <msyk@mtg.biglobe.ne.jp>. He
412 has also supplied the patch so he was added to AUTHORS.
413 ! lib/Encode/Supported.pod
414 ISO-8859-11 != TIS 620
415 == TIS 620 + \xA0 ( )
417 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
419 1.77 2002/10/06 03:27:02
421 * Modified to accommodate an upcoming patch by Inaba-san that
422 will fix tr/// needing eval qq{}
423 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
425 * pod fixes/enhancements to reflect the changes above
426 ! lib/Encode/Alias.pm
427 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
428 Message-Id: <20021001015648.GB18710@not.autrijus.org>
430 1.76 2002/08/25 15:09:51
432 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
433 but naturally identical to previous version -- dankogai
435 Codepoint fixes -- autrijus
436 Message-Id: <20020805040236.GC5220@not.autrijus.org>
438 copied everything under perl-5.8.0/ext/Encode to make sure Encode
439 is in sync w/ perl core
441 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
442 Now that binmode(FH) does implicit ":bytes" revisit
443 the failing tests. The worrisome one is the Digest::MD5
444 test-- how will it fare in CRLF lands now?
446 From: Radu Greab <radu@netsoft.ro>
447 Date: Mon, 10 Jun 2002 00:40:34 +0300
448 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
449 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
450 ones which I cannot figure out.
451 ! lib/Encode/Alias.pm
452 Subject: [Encode PATCH] spurious warning
453 From: Nicholas Clark <nick@unfortu.net>
454 Date: Sun, 2 Jun 2002 20:26:22 +0100
455 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
457 1.75 2002/06/01 18:07:49
458 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
459 glibc compliance cited by Autrijus.
460 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
461 ! bin/enc2xs bin/piconv
462 Subject: Re: forewarning: usedevel and versiononly
463 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
465 1.74 2002/05/28 18:33:15
466 + ucm/null.ucm ucm/ctrl.ucm
467 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
468 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
469 Subject: Re: unicode -> &# notation
470 Message-ID: <20020518193704.GB40272@not.autrijus.org>
472 1.73 2002/05/28 17:26:18
473 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
474 Chris Nandor has fixed Encode so that it works w/ MacPerl --
475 at least w/ PPC (68k need static linking which does not work due to
476 64k limit). pudge is added to AUTHORS (I'm surprised he was not
477 there in the list). Encode/Makefile_PL.e2x was additionally fixed
478 by dankogai to reflect changes in other Makefile.PL
479 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
481 Subject: Change 16746: -Mutf8 cleanup.
482 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
484 1.72 2002/05/20 15:49:56
486 Subject: [PATCH] Encode should be in perl-core library path
487 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
488 Message-Id: <20020520161201.A11019@alpha.hut.fi>
489 ! lib/Encode/MIME/Header.pm
490 Subject: [PATCH] Encode::MIME::Header
491 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
492 ! Encode/Makefile_PL.e2x
493 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
494 Message-Id: <20020519201031.GA1603@not.autrijus.org>
495 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
496 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
498 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
499 Subject: [PATCH] good day for WinCE port of perl.
500 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
503 Even more linting by Robin via @16532
505 Even more typecast by Sarathy in @16460
507 1.71 2002/05/07 16:22:42
509 even more typecasts by Robin
510 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
512 A very strange bug that was causing a bogus ucm -> C table
513 generation that was revealed by a UCM file that Andreas was
514 working on. This is the king of the weirdest bugs I've encountered
515 in the course of Encode maintenance.
516 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
518 1.70 2002/05/06 10:26:48
520 Made more 'module-safe' in conjunction w/ 'no encoding'.
521 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
522 ! lib/Encode/Encoding.pm
523 'require Encode' because ->Define uses Encode::define_encoding();
524 problem and solution addressed by Miyagawa-kun
525 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
527 Cuts the frill to make djgpp happier, as suggested by Laszlo
528 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
530 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
531 Message-Id: <m3bsbug48n.fsf@anima.de>
533 1.69 2002/05/04 16:41:18
534 ! lib/Encode/MIME/Header
535 Floating-point coerced for UNICOS (in integer arithmetics it folds
536 line one character too early). Verification by Mark is pending.
537 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
539 more doc patch from Elizabeth
540 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
541 ! Encode/Makefile_PL.e2x
542 More platform-independent patch from Benjamin
543 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
544 ! lib/Encode/Guess AUTHORS
545 split regex fix by Graham Barr. Adds him to AUTHORS.
546 Message-Id: <20020504085419.E95940@valueclick.com>
547 ! Encode/Makefile_PL.e2x
548 enc2xs script discovery made smarter and more sensible, first cited
549 by Miyagawa-kun and further suggestions by Rafael and Andreas
550 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
551 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
553 1.68 2002/05/03 12:20:13
554 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
555 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
556 and Supported.pod modified to reflect the change. Elizabeth added
557 to Authors. And H.M. is also added for forwarding her patch among
558 other contributions (I was rather surprised to find his name was not
560 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
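The alias can be checked with find_encoding(). This is a sketch assuming a current Encode, where the lookup is expected to resolve to the canonical UTF-32 name.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# find_encoding() returns undef for unknown names, so guard the call.
my $enc  = find_encoding("UCS-4");
my $name = $enc ? $enc->name : "(not found)";
print "UCS-4 resolves to $name\n";
```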
562 1.67 2002/05/02 07:33:09
564 Error message now consistent w/ perlqq (\N{U+} -> \x{})
565 done in perl@16308 but Philip linted me further. Now the error
566 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
567 ! lib/Encode/Guess.pm
568 Sanity check for happier -w by Autrijus
570 1.66 2002/05/01 05:41:06
571 ! Encode.xs t/fallback.t
572 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
573 while fallback is in effect. This even came with a welcome side-effect
574 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
575 also modified to test this.
576 And of course, the corresponding variables to UV[Xx]f are appropriately
577 cast. This should've concluded NI-XS homework.
579 encode(undef) does warn again! Repented upon suggestion by NI-XS.
580 Document for unless vs. '' added
581 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
583 1.65 2002/04/30 16:13:37
585 encode(undef) no longer warns for C<Use of uninitialized value in
586 subroutine entry>. Suggested by Paul.
587 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
588 ! lib/Encode/Supported.pod
589 Encode::MIME::Header and Encode::Guess mentioned
590 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
591 ! lib/Encode/Guess.pm
592 POD fix by Miyagawa-kun
593 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
595 1.64 2002/04/29 06:54:06
597 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
598 encode("euc-jisx0213") are still DIFFERENT.
599 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
601 A few white spaces corrected by NI-XS via PerlIO integration to
603 Subject: Change 16247: Integrate perlio;
605 Document fixes by Andreas
606 Message-Id: <m3k7qsf1we.fsf@anima.de>
608 1.63 2002/04/27 18:59:50
609 ! lib/Encode/Encoding.pm
610 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
611 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
612 Make use of the Encode::Encoding base class!
613 And other cleanups in Encode.xs upon NI-XS suggestions
614 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
616 1.62 2002/04/27 11:17:39
618 encodings() now just checks %ExtModule instead of eval{require}
619 all of them for ":all" to conserve more memory.
621 more "%x" -> "%" UVxf stuff.
623 s/=over2/=over 2/g # oops.
625 1.61 2002/04/26 03:02:04
627 Now does decent tests besides use_ok()
628 ! lib/Encode/Guess.pm t/guess.t
629 UI streamlined, document added
631 various signed/unsigned mismatch nits (#16173)
632 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
634 POD: utf8-flag-related caveats added. A few sections completely
638 Thou shalt not assume %d works, either!
639 Robin Baker added to AUTHORS for this
640 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
642 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
644 1.60 2002/04/24 20:06:52
646 "Thou shalt not assume %x works." -- jhi
647 Message-Id: <20020424210618.E24347@alpha.hut.fi>
648 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
649 low-memory build machines happy, now *.c is created for each *.ucm
650 (no table aggregation). You can still override this by setting
651 $ENV{AGGREGATE_TABLES}.
652 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
653 + lib/Encode/Guess.pm
654 + lib/Encode/JP/JIS7.pm
655 Encoding-autodetect (mainly for Japanese encoding) added. In a
656 course of development, JIS7.pm was improved.
657 + lib/Encode/MIME/Header.pm
658 + lib/Encode/Config.pm
659 MIME B/Q Header Encoding Added!
660 ! Encode.pm Encode.xs t/fallback.t
661 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
662 Message-Id: <20020424130709.GA14211@tanglefoot>
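A short sketch of the new fallbacks next to the existing perlqq one, assuming a current Encode. The target encoding and sample text are arbitrary.

```perl
use strict;
use warnings;
use Encode qw(encode);

my $text = "A\x{3042}";    # "A" plus a character ASCII cannot represent

# Each fallback decides how the unmappable character is written out.
my $xml  = encode("ascii", $text, Encode::FB_XMLCREF);     # hex reference
my $html = encode("ascii", $text, Encode::FB_HTMLCREF);    # decimal reference
my $qq   = encode("ascii", $text, Encode::FB_PERLQQ);      # perl escape

print "$xml\n$html\n$qq\n";
```

The source is passed as a variable rather than a literal, since encode() may modify its argument for some CHECK values.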
664 1.59 $ 2002/04/22 23:54:22
665 ! Encode.pm Encode.xs
666 needs_lines() and perlio_ok() are added to Internal encodings such
667 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
669 1.58 2002/04/22 23:54:22
671 s/MacChineseSimp/MacChineseTrad/ # ... oops.
676 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
677 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
678 t/(JP|KR).t are aggregated to t/CJKT.t
679 test data are all remade via bin/ucm2text.
680 And .... They are no longer skipped for -Uuseperlio !
682 1.57 2002/04/22 20:27:30
683 ! t/JP.t t/KR.t t/perlio.t
684 unless (find PerlIO::Layer 'perlio') ... line is back again.
685 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
686 need PerlIO. Sigh....
687 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
688 ->perlio_ok now does eval{ require PerlIO::encoding } there so
689 it correctly returns 1 when PerlIO::encoding is yet loaded.
691 perl-current patch #16072 reflected
693 1.56 2002/04/22 09:48:07
694 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
695 New PerlIO::encoding 0.04 compliance met
697 1.55 2002/04/22 03:43:05
698 ! Encode.pm Encode.xs Unicode/Unicode.pm
699 needs_lines() defined so Encode::Encoding is no longer needed
702 1.54 2002/04/22 02:50:01
703 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
704 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
705 now perlio_ok is true by default if PerlIO::encoding->VERSION is
706 0.03 or larger. POD in Encode::Encoding revised to reflect this.
707 Encode::XS and Encode::Unicode now have perlio_ok() method.
708 ! lib/Encode/Supported.pod
709 s/UP-UX/HP-UX/ by jhi
710 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
711 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
712 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
713 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
714 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
715 ! lib/Encode/Supported.pod
716 Huge document fixes by Philip.
719 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
720 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
722 binmode() after "<:encoding" to make Win32 happy, by Mattia.
723 Mattia added to AUTHORS file
724 Message-Id: <3CC3150F.5798.22A05AE@localhost>
726 1.52 2002/04/20 23:43:47
728 TODO: is now SKIP:, as NI-XS requested. Also adds more
729 elaborate failure analysis.
731 A note on how to make sure of round-trip safety added to POD
732 section (so Autrijus is happier)
733 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
734 big5-(eten|hkscs) is round-trip safe again!
735 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
737 Typo fixes by Andreas
738 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
739 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
740 PerlIO coordination patches from NI-XS.
741 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
743 1.51 2002/04/20 09:58:23
745 Updated test suite by Autrijus so "make test" is happy again
746 Message-Id: <20020420082104.GA25037@not.autrijus.org>
748 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
750 TW/TW.pm TW/Makefile.PL
751 Updates by Autrijus. 'big5' is no longer a canonical but an
752 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
753 Message-Id: <20020419195346.GA19597@not.autrijus.org>
755 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
756 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
758 PerlIO detection a little bit smarter; no longer uses eval qq{}
761 1.50 2002/04/19 06:13:02
762 ! Encode.pm Encode.xs Encode/encoding.h
764 New Fallback API implemented and documented. See "perldoc Encode"
766 ! lib/Encode/JP/JIS7.pm Encode.pm
767 + lib/Encode/PerlIO.pod t/perlio.t
768 API compliance met. However, it still does not work unless perlio
769 implements line buffer. See BUGS section in perldoc Encode::PerlIO
770 As a sensible workaround, perlio_ok() added to Encode.
772 ! lib/Encode/Supported.pod
774 Message-Id: <20020418174647.J8466@alpha.hut.fi>
776 Doc fixes from Autrijus
777 Message-Id: <20020418144131.GA10987@not.autrijus.org>
779 perlqq mode documented
781 + t/jisx0201.euc t/jisx0201.ref
782 ! t/jisx0208.euc t/jisx0208.ref
783 t/JP.t tests more rigorously and with other encodings
784 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
785 in jis0208 (used to contain jisx0201 part).
786 ! Encode/Makefile_PL.e2x
787 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
788 enc2xs and encode.h rather than hard-coding them. This makes the resulting
789 module fully CPANizable.
790 ! encoding.pm t/JP.t t/KR.t
791 PerlIO detection simplified (checks %INC instead of eval{})
792 ! Encode.xs Encode/encode.h
793 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
794 - lib/Encode/Unicode.pm
795 (en|de)code_xs relocated to where it belongs. Source reindented
798 Additional (U8 *) cast added as suggested by jhi
799 Message-Id: <20020417165916.A28599@alpha.hut.fi>
801 1.42 Date: 2002/04/17
803 no-op module; Thought of adding a pod there but enc2xs has
808 correct mechanism to detect PerlIO::encoding layer installed.
810 PerlIO Layer detached.
812 1.41 2002/04/16 23:35:00
814 binmode(STDIN|STDOUT ...) done iff PerlIO is available
816 Cleaned up PerlIO skip conditions to prepare for the upcoming
817 Encode - PerlIO forking.
819 exported functions are now prototyped.
820 ! lib/Encode/CN/HZ.pm
823 fallback implemented # was /* FIXME */
824 affected programs revised to fit (only HZ was using the try-catch
825 approach which needed to be fixed for API-compliance).
827 ! Encode/KR/2022_KR.pm
829 can find =head1 NAME now, jhi
830 Message-Id: <20020416083059.V30639@alpha.hut.fi>
834 now compiles with fewer warnings with the pickiest compilers.
835 Suggested by Craig, fixed by Dan.
836 ! Encode/Makefile_PL.e2x
838 A bug that fails to find *.e2x in certain conditions fixed
840 1.40 2002/04/14 22:27:14
841 + Encode/ConfigLocal_PM.e2x
842 ! lib/Encode/Config.pm
844 "enc2xs -C" now generates/updates Encode::ConfigLocal.
845 ConfigLocal_PM.e2x is a skeleton thereof.
846 ! lib/Encode/Config.pm
848 "use Encode::CN::HZ;" was missing.
851 More rigorous tests added to test XS, especially on memory allocation.
853 ! lib/Encode/Unicode.pm
854 NI-S implemented an XS version -- merged
855 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
858 Source filter option added. With this option on, you can write
859 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
860 encodings. t/jperl.t enhanced to test this feature.
862 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
863 apply his suggestion, this degree of nitting is enough to add him
865 Message-Id: <3CB93223.291E5E2E@earthlink.net>
867 + lib/Encode/JP/JIS7.pm
868 - lib/Encode/JP/JIS.pm
869 - lib/Encode/JP/2022_JP.pm
870 - lib/Encode/JP/2022_JP1.pm
871 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
872 JIS7.pm for better maintainability and performance
874 Added caveat for non-ascii identifiers.
876 fixes by jhi, the original author of this pragmatic module.
877 Message-Id: <20020413231527.V1826@alpha.hut.fi>
879 1.34 2002/04/12 20:23:05 (Unreleased)
882 EBCDIC fixes addressed by jhi.
883 Message-Id: <20020412161844.D9383@alpha.hut.fi>
884 ! lib/Encode/Encoder.pm
885 POD fix by Miyagawa-kun
886 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
888 1.33 2002/04/10 22:28:40
890 Philip's mail address corrected.
893 ! lib/Encode/Encoder.pm
894 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
895 This adds Miyagawa-kun to AUTHORS.
896 * encoding() no longer exported by default but on demand
897 * t/Encoder.t updated to test all these
898 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
899 ! lib/Encode/Unicode.pm
900 ! lib/Encode/Supported.pm
901 Further doc fixes by Anton
903 1.32 2002/04/09 20:06:15
906 - ucm/macDevanaga.ucm Unicode Character Map
907 - ucm/macGujarati.ucm Unicode Character Map
908 - ucm/macGurmukhi.ucm Unicode Character Map
909 A utility to check integrity of .ucm files. t/bogus.ucm is a
910 ucm that is deliberately bogus. unused Indic mappings are removed
913 resolve_alias() added as suggested by jhi. Same as
914 find_encoding("alias")->name. For convenience. This one is
915 defined in Encode.pm instead of Alias.pm.
916 Message-Id: <20020409215846.H17022@alpha.hut.fi>
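The equivalence described for resolve_alias() can be checked directly. A small sketch, assuming a current Encode release where resolve_alias() is importable on request; "latin1" and the unknown name are just example inputs.

```perl
use strict;
use warnings;
use Encode qw(find_encoding resolve_alias);

my $alias = "latin1";

# Both routes should yield the same canonical name.
my $via_resolve = resolve_alias($alias);
my $via_find    = find_encoding($alias)->name;

# An unknown name yields a false value instead of an exception.
my $unknown = resolve_alias("no-such-encoding-xyz");

print "$alias resolves to $via_resolve\n";
print "unknown name resolved\n" if $unknown;
```

resolve_alias() saves the explicit object lookup when only the canonical name is wanted.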
918 Memory allocation bug detected during the development of ucmlint -- fixed.
919 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
920 ! lib/Encode/Unicode.pm
921 valid_ucs2(0) is false but must be true.
922 3 patches from NI-S as follows. This also has fixed the incident
924 ! lib/Encode/Alias.pm
925 find_alias() recursion prevention
927 Checks for the patch above
928 ! t/Encode/Unicode.pm
929 An extra "F" that caused valid_ucs2() to return a bogus value fixed
930 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
931 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
932 2 Small Patches from jhi as follows:
934 Encode->encodings() lists in case-insensitive order (as it was)
936 -l option prints available encodings to STDOUT instead of STDERR
937 ! lib/Encode/Aliases.pm
938 s/defintion/definition/
939 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
941 ! lib/Encode/Supported.pod
942 ! lib/Encode/Unicode.pm
943 POD revise by Philip Newton. This adds Philip to AUTHORS list.
944 Thank you for the exact quote of Douglas Adams :)
945 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
947 1.31 2002/04/08 18:08:07
948 ! lib/Encode/Encoder.pm
950 Encode::Encoder, once just a placeholder of an idea, is now much more
951 practical. See t/Encode.t to find how practical it can be.
952 + lib/Encode/Config.pm
954 my false laziness at Encode.pm is fixed. Now %ExtModules are set
955 in Encode::Config and they are all set literally, not programmatically.
956 My false laziness resulted in many encodings missing from
958 ! lib/Encode/Unicode.pm
960 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
961 so that it does not rely on Encode::Unicode for BOM values
962 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
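For reference, the UTF-32 byte order marks can be derived by encoding U+FEFF with an explicit endianness. A small sketch, assuming a current Encode release where the UTF-32LE and UTF-32BE encodings are available; hex_bytes() is a hypothetical helper introduced here only for display.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Hypothetical helper: render a byte string as space-separated hex.
sub hex_bytes { join " ", map { sprintf "%02X", ord } split //, shift }

# Encoding U+FEFF with a fixed byte order yields the corresponding BOM.
my $bom_32le = encode("UTF-32LE", "\x{FEFF}");
my $bom_32be = encode("UTF-32BE", "\x{FEFF}");

print "UTF-32LE BOM: ", hex_bytes($bom_32le), "\n";
print "UTF-32BE BOM: ", hex_bytes($bom_32be), "\n";
```

A test can compare against such literal byte strings instead of asking the module under test for its own BOM values.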
964 1.30 2002/04/08 02:34:51
965 + lib/Encode/Encoder.pm
966 Object Oriented Encoder. I reckon something like this is in need.
969 ! lib/Encode/Supported.pod
970 * autoloading bug that prevented upper-case canonicals such as UTF-16
971 is fixed. Now even UTF/UCS are autoloaded!
972 * encodings() is now more intuitive.
973 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
975 * Obligatory fixes to the POD.
976 ! lib/Encode/Supported.pod
977 Patch from Anton applied.
978 Message-Id: <66641479.20020408033300@motor.ru>
980 ! lib/Encode/Unicode.pm
981 Cosmetic changes: "bless $obj, $class" => "bless $obj => $class"
983 1.28 2002/04/07 18:58:42
987 Just a MANIFEST for those missing files.
989 1.26 Date: 2002/04/07 15:22:04
992 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
993 And t/Aliases.t fixed to make sure they all exist.
995 1.25 2002/04/07 15:01:25 (Unreleased)
997 ! lib/Encode/Unicode.pm
1000 - lib/Encode/UTF_EBCDIC.pm
1001 - lib/Encode/Internal.pm
1002 - lib/Encode/utf8.pm
1003 Integrated into Encode.pm as closures. That way "one package, one file"
1004 rule is preserved yet fewer files to require.
1006 commented out binmode(STDERR ...
1010 ! EBCDIC/Makefile.PL
1013 ! Symbol/Makefile.PL
1015 ! Encode/Makefile_PL.e2x
1016 Schwern's MM-compliance patch merged
1017 Message-Id: <20020406082609.GA28758@blackrider>
1019 ! lib/Encode/Unicode.pm
1020 + lib/Encode/UTF_EBCDIC.pm
1022 - lib/Encode/10646_1.pm
1023 - lib/Encode/ucs2_le.pm
1024 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1025 per-module based (en|de)code, I saved a number of .pm by
1026 reorganizing it as per-object base (Well, this is what Encode::XS
1027 does under the hood). See Encode::Unicode for details.
1028 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1029 This module is used only on EBCDIC environments.
1031 1.21 2002/04/05 14:46:34 (Not Released)
1037 Are back to make Perl/Tk happy. Smile, NI-S.
1039 ! lib/Encode/Alias.pm
1040 ! lib/Encode/Supported.pm
1041 ! lib/Encode/10646_1.pm
1042 ! lib/Encode/ucs2_le.pm
1043 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1044 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1045 updated to reflect changes.
1046 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1047 ! lib/Encode/Alias.pm
1048 ! lib/Encode/Supported.pm
1049 Anton's revision committed. Added Dan's own fixes as well.
1050 Message-Id: <159103166906.20020405161134@motor.ru>
1051 ! lib/Encode/Alias.pm
1053 < qr/^UCS2-le$/i => '"UCS-2"', );
1055 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1056 Sigh. Thank you, Anton.
1057 Message-Id: <14567692196.20020405062020@motor.ru>
1058 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1060 1.20 2002/04/04 19:50:52
1062 A last-minute addition. Just give it a try. Docs remain to be done.
1063 Not installed by default.
1064 ! lib/Encode/Supported.pod
1067 ! lib/Encode/Alias.pm
1068 ! lib/Encode/utf8.pm
1069 ! lib/Encode/10646_1.pm
1070 ! lib/Encode/ucs2_le.pm
1071 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
1072 unchanged but UTF-16BE is added as an alias. Implicit aliases
1073 move to Encode::Alias so init_alias() works more as expected.
1074 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1075 Though pedantically wrong, this should make perl mongers happier.
1076 t/Alias.t is enhanced to test all these.
1077 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1079 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1080 of the codepoints are common. CJKT left as is because the saving is
1084 ! EBCDIC/Makefile.PL
1086 ! Encode/Makefile_PL.e2x
1090 ! Symbol/Makefile.PL
1094 All occurrences of _def.h replaced with .exh so djgpp works happily
1095 ever after! To credit this amazing discovery, Laszlo is now in
1097 Message-Id: <20020403181424.GA8778@freemail.hu>
1098 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1101 ! Encode/Makefile_PL.skel
1103 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1104 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1106 Two more tests added by jhi
1107 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1110 The showstopper fixed -- Memory reallocation bug was causing
1111 Encode::XS to fall into infinite loop on certain conditions.
1112 t/grow.t tests that.
1113 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1118 ! lib/Encode/Supported.pod
1119 Vendor encodings rebuilt out of original map files at unicode.org.
1120 Indic languages such as MacDevanagali remain unsupported due to the
1121 shortcoming of encengine capabilities (they need algorithmic
1122 conversion and I have no knowledge on that!). Pods fixed for added
1124 Oh, macJapan.ucm renamed to macJapanese.ucm.
1125 macROMnn is macRomanian and macRUMnn is macRumanian.
1126 txt2ucm is a crude script that is used to convert them.
1128 Unicode Compound Characters (used extensively on Mac) supported
1130 Typo fixes and improvements by jhi
1131 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1133 1.11 2002/03/31 22:12:13
1137 Missing files from the MANIFEST fixed.
1138 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1139 Version incremented just to make CPAN happy.
1141 1.10 2002/03/31 21:32:42
1144 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1145 if you want. This should make Autrijus happy. Also, piconv
1146 is added to default install.
1149 Here-documented files that enc2xs generates are now exported
1150 to *.e2x. Much cleaner and easier to debug.
1152 encoding enhanced so you can make it act more like such
1153 (now prehistoric) "localized" variations of perl like Jperl.
1155 Further test for encoding.pm. Written in euc-jp
1158 Taken over from jhi.
1159 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1164 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1171 ! Encode/macIceland.ucm
1172 ! lib/Encode/Alias.pm
1173 ! lib/Encode/Supported.pod
1174 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1175 Message-Id: <m3lmcavhjt.fsf@anima.de>
1177 1.01 2002/03/29 20:59:39
1180 s/USE_SCRIPTS/MORE_SCRIPTS/
1182 installs enc2xs by default for external Encode:: modules in CPAN,
1183 such as Encode::HanExtra
1185 More sensible perl core detection via $ENV{PERL_CORE}
1187 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1189 Perl core detection via $^X =~ m/\bminiperl$/o
1190 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1192 1.00 Wed Mar 29 2002
1194 The version of all files is updated to 1.00 via "ci -f -l1.00",
1195 commemorating version 1.00. All files, including *.ucm are now
1196 under version control.
1199 encode.h moved to Encode/ so it will be installed for the later
1202 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1207 compile renamed to enc2xs.
1208 Affected Makefile.PL updated
1210 "Punt it. HanExtra can take care of that later." -- Autrijus
1211 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1214 ! Encode/ksc5601.ucm
1215 ! lib/Encode/CJKConstants.pm
1216 ! lib/Encode/KR/2022_KR.pm
1217 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1218 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1222 bin/ added for example scripts. They are not installed by default.
1223 To install them, "perl Makefile.PL USE_SCRIPTS".
1224 piconv is iconv reinvented in perl. In addition to all features
1225 of iconv, it also adds perlish features. See L<piconv/1> for more
1227 ! lib/Encode/Alias.pm
1228 qr/^ replaced with qr/\b so it directly matches locale names
1229 such as en_US.US-ASCII
1232 Patch by MJD to fix the following problem applied.
1233 Subject: [PATCH 5.7.3 Encode]
1234 Aliases.t not properly skipped when Encode extension not built
1235 Message-Id: <20020328091850.18677.qmail@plover.com>
1236 ! lib/Encode/KR/2022_KR.pm
1237 ! lib/Encode/CJKConstants.pm
1238 Another patch from Jungshik to make iso-2022-kr actually work
1239 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1240 ! Encode/Encode/euc-kr.ucm
1241 + Encode/Encode/johab.ucm
1242 ! Encode/Encode/ksc5601.ucm
1244 ! Encode/KR/Makefile.PL
1245 ! Encode/lib/Encode/Alias.pm
1247 Johab support and complete revision of Korean Encoding by Jungshik
1248 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1250 Revised to make up for the now-dropped Encode::Details.
1251 - lib/Encode/Details.pod
1252 Dropped. Besides being obsolete, the topics are now covered in
1253 the respective pods.
1258 Korean aliases fixed thanks to Jungshik Shin
1259 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1260 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1262 =head1 NAME added to all modules to make buildtoc happy
1263 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1264 - lib/Encode/CJKguide.pod
1265 Too controversial and dropped from the dist. Will be available
1266 separately on the web.
1268 RCS tags added so table debugging gets easier (should that be
1269 needed! I hope they all stay 1.00!)
1270 + lib/Encode/CJKguide.pod
1271 A detailed guide to mainly, but not limited to, CJK multibyte
1274 + Encode/hp-roman8.ucm
1276 ! Encode/Supported.pod
1277 All occurrences of "roman8" replaced with "hp-roman8" to avoid
1279 ! Encode/Supported.pod
1282 Mac Encodings now comply with Inside Macintosh
1284 Test for '-raw' conventions added.
1286 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1287 ! Encode/gb12345.ucm
1289 ! Encode/ksc5601.ucm
1290 "-raw" appended to canonical names.
1291 File names stay unchanged thanks to UCM format.
1292 ! lib/Encode/CN/HZ.pm
1293 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1294 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1296 0.99 Tue Mar 26 2002
1297 - lib/Encode/JP/Const.pm
1298 + lib/Encode/CJKConstants.pm
1299 + lib/Encode/CN/2022_CN.pm
1300 + lib/Encode/KR/2022_KR.pm
1309 * Support for ISO-2022-KR and ISO-2022-CN added.
1311 * more t/*.{euc,ref} added, which were autogenerated from ucm2table
1312 * ucm2table autogenerates character table out of UCM files.
1315 - lib/Encode/Supports.pod
1316 + lib/Encode/Supported.pod
1317 Names reverted due to popular demand.
1318 8.3 rule applies only when there is a conflict.
1319 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1324 - lib/Encode/Format/Enc.pod
1326 * Character tables are now 100% ucm.
1327 * All files under Encode/ are now 8.3-compliant
1328 * some of missing encodings added (i.e. gsm0338 and nextstep)
1329 * Vendor mappings aggregated with appropriate national std in
1330 Makefile.PL, resulting in smaller *.so especially for CJK.
1331 The following is the result on Dan's FreeBSD box.
1333 ---------------------------------------------------------------
1334 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1335 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1336 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1337 blib/arch/auto/Encode/Encode.so 27,791 27,791
1338 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1339 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1340 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1341 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1342 ---------------------------------------------------------------
1343 Total 5,375,297 6,343,819
1345 * As a result of ucm-transition, Encode::Tcl dropped because
1346 Encode::Tcl demands *.enc.
1347 Encode::Tcl will be supplied in a separate tarball with *.enc.
1348 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1353 -lib/Encode/Supported.pod
1354 +lib/Encode/Supports.pod
1355 -lib/Encode/iso10646_1.pm
1356 +lib/Encode/10646_1.pm
1357 -lib/Encode/EncFormat.pod
1358 +lib/Encode/Format/Enc.pod
1359 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
1360 - lib/Encode/JP/Constants.pm
1361 + lib/Encode/JP/Consts.pm
1362 ! lib/Encode/JP/JIS.pm
1363 ! lib/Encode/JP/H2Z.pm
1364 Version nit problem and 8.3 rule fix.
1365 > Package namespace installed latest in CPAN file
1366 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1367 was noted by jhi; Dan then discovered that "Constants.pm" does not comply with the 8.3
1368 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
1369 accordingly. In addition, legacy "use vars qw()..." are replaced with
1371 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1372 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1374 - lib/Encode/JP/ISO_2022_JP.pm
1375 - lib/Encode/JP/ISO_2022_JP_1.pm
1376 + lib/Encode/JP/2022_JP.pm
1377 + lib/Encode/JP/2022_JP1.pm
1379 8.3 naming conflict for vanilla fat addressed by jhi
1380 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1383 Typecast fix addressed by jhi
1384 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1386 0.98 Mon Mar 25 2002
1387 ! lib/Encode/Supported.pod
1389 + lib/Encode/JP/ISO_2022_JP_1.pm
1390 ! lib/Encode/JP/ISO_2022_JP.pm
1391 ! lib/Encode/JP/JIS.pm
1393 Now Encode::JP is more strict on the difference between ISO-2022-JP
1394 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1395 makes Anton happier :) FYI the previous version implements
1396 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1397 ! lib/Encode/Supported.pod
1400 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1401 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1406 ! lib/Encode/Supported.pod
1407 pod fixes to replace F<http://...> with L<http://...>,
1408 as suggested by Autrijus in:
1409 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1410 ! lib/Encode/Supported.pod
1411 fixes and enhancements by Anton
1412 Message-Id: <10632060120.20020324103753@motor.ru>
1413 ! lib/Encode/Alias.pm
1414 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1415 added. Suggested by Anton then deobfuscated by Autrijus
1416 Message-Id: <20020324064455.GA3667@not.autrijus.org>
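A regex alias of this shape maps a whole family of names at once: the right-hand side is a string that is evaluated with the capture groups in scope. The sketch below uses a made-up "MyLatin-N" family rather than the GB alias above, and assumes a current Encode::Alias that exports define_alias().

```perl
use strict;
use warnings;
use Encode qw(find_encoding);
use Encode::Alias;    # exports define_alias()

# Hypothetical alias family: "MyLatin-2", "mylatin2", ... => iso-8859-2 etc.
# The replacement is a quoted string so that $1 is interpolated at match time.
define_alias( qr/^MyLatin[- ]?(\d+)$/i => '"iso-8859-$1"' );

my $enc  = find_encoding("MyLatin-2");
my $name = $enc ? $enc->name : "(not found)";

print "MyLatin-2 resolves to $name\n";
```

The inner double quotes are part of the alias definition; without them the replacement would not be evaluated as a string containing $1.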
1418 Further fix by Nicholas Clark
1419 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1420 - lib/EncodeFormat.pod
1421 + lib/Encode/EncFormat.pod
1423 File renamed as suggested by Autrijus
1425 ! lib/Encode/Details.pod
1426 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1427 ! Encode.pm Sun Mar 24 13:43:47 2002
1428 pod fixes by Autrijus.
1429 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1430 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1432 ! lib/Encode/Alias.pm
1434 now more EBCDIC conscious;
1435 %ExtModules on EBCDIC system excludes CJK so that you don't
1436 have to worry about the matched alias resulting cloaking.
1437 t/Alias.t also revised to reflect changes. Verified by jhi
1438 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1440 0.97 Sun Mar 24 2002
1444 EBCDIC detection mechanism installed as in JP/JP.pm
1445 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1448 ! EBCDIC/Makefile.PL
1451 ! Symbol/Makefile.PL
1453 Now all table files used by compile are postfixed '_t' to avoid
1454 namespace collisions in case insensitive file systems once and for all!
1456 Message-ID: <58290227735.20020323195659@familiehaase.de>
1458 Since the Encode::JP is unsupported under EBCDIC we
1459 cannot run this test (aliases as such should work fine) -- jhi
1460 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1462 duplicate occurrence of ascii.ucm and 8859-1.ucm
1463 causes MacOS X dyld to croak
1469 < chdir 't' if -d 't';
1471 > if (! -d 'blib' and -d 't'){ chdir 't' };
1472 When you are "make test"-ing on Encode/ directory, you must not
1473 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1474 remain unchanged. Also the situation detection was made simpler
1475 in t/JP.t, which was originally:
1476 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1478 "Use of uninitialized value in string eq at Encode.pm line 96."
1479 ! Symbol/Makefile.PL
1480 ! EBCDIC/Makefile.PL
1482 -- Problem on case insensitive file systems
1483 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1484 Message-ID: <88254111953.20020323095503@familiehaase.de>
1487 "So I think it's a bug in gcc, not perl. But it still needs to be
1489 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1490 Message-Id: <20020323170509.C96475@plum.flirble.org>
1492 0.96 Sat Mar 23 2002
1494 ! lib/Encode/Encoding.pm
1495 ! lib/Encode/Alias.pm
1496 ! lib/Encode/Supported.pod
1498 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1499 Message-ID: <20020322073908.GB10539@blackrider>
1502 "...I think we should include ISO 8859-1 as well." -- NI-S
1503 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1508 ! lib/Encode/Alias.pm
1509 alias definitions relocated to Encode::Alias so module autoloading
1510 works for aliases also.
1512 encodings() now accepts args to check ExtModules.
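The argument forms can be compared side by side. A minimal sketch, assuming a current Encode release, where encodings() with no argument lists loaded encodings, ":all" adds everything known from external modules, and a module name adds that module's encodings; Encode::JP is used only as an example.

```perl
use strict;
use warnings;
use Encode;

# Encodings that are already loaded.
my @loaded = Encode->encodings();

# Everything Encode knows about, including demand-loaded modules.
my @all = Encode->encodings(":all");

# Loaded encodings plus those provided by one external module.
my @with_jp = Encode->encodings("Encode::JP");

printf "loaded: %d, all: %d, with JP: %d\n",
    scalar @loaded, scalar @all, scalar @with_jp;
```

Listing by module name is a cheap way to see what a demand-loaded module provides without decoding anything.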
1516 + EBCDIC/Makefile.PL
1517 + Symbol/Makefile.PL
1521 Latin and single byte encodings are reorganized so they are
1522 demand-loaded like Encode::XX. Now only ascii is compiled into
1524 ! lib/Encode/Alias.pm
1525 for my $k (keys %hash){ delete $hash{$k}; }
1526 is deprecated; fixed.
1528 0.95 Fri Mar 22 2002
1529 In this update, pod rewrites and alias fixes are the main issues
1530 + lib/Encode/Supported.pod
1531 Describes supported encodings
1533 streamlined compiled-in encodings.
1534 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1536 + Encode/ibm-125?.ucm
1537 Added from the icu distribution with every occurrence of
1538 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
1539 some respect to icu staff, however.
1540 + lib/Encode/Alias.pm
1542 Alias definitions in Encode.pm relocated.
1545 packWARN patch from Paul Marquess via jhi
1546 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1547 Paul added to AUTHORS as a result.
1548 ! t/CJKalias.t -> t/Aliases.t
1549 Renamed. Checks even more aliases and alias overloading
1552 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1553 gbk => cp936 relocated to CN.pm
1555 Test::More with plans (by jhi)
1557 0.94 Thu Mar 21 2002
1558 + lib/Encode/Description.pod
1559 ! lib/Encode/Encoding.pm
1560 Now the pod in Encode.pm is abridged as programming references.
1561 lib/Encode/Description.pod contains the original, detailed description
1562 and Encode::Encoding explains how to write your own module to
1563 add new encodings. So far, lib/Encode/Description.pod contains
1564 the whole pod once in Encode.pm. This is intentional.
1566 Pod revisions by Anton Tagunov
1567 Message-Id: <517178431.20020320174824@motor.ru>
1569 all occurrences of Encode::Tcl::Extended removed including pod
1571 test now checks $encoding->name only; $encoding->{name} is
1572 no longer checked to find the canonical name.
1573 ! lib/Encode/JP/JIS.pm
1574 ! lib/Encode/JP/ISO_2022_JP.pm
1575 ->name() added to be more compliant with API
1581 Patch by Autrijus to add aliases to TW and fixes to POD
1582 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1584 SADAHIRO Tomoyuki added as should. My apologies.
1586 0.93 Wed Mar 20 2002
1587 * First release to be uploaded to CPAN. For prehistoric changes,
1588 please see the Changes file of the perl distribution as well as
1589 perl-unicode@perl.org archive, available at:
1590 http://archive.develooper.com/perl-unicode@perl.org/
1592 Changes since 0.92 include:
1597 + Mention of perl-unicode@perl.org added
1599 + Encoding aliases added so you can feed locale names
1600 and MIME Charset="" directly.
1601 - Mention of JISX0212 removed because it's fixed
1604 + Encoding aliases added. Note TW is left untouched because
1605 euc-tw is not implemented in TW but in Encode::HanExtra.
1606 Autrijus, you may fix Encode::HanExtra.
1608 + to test encode aliases added