5 [ this is a template for a new perldelta file. Any text flagged as
6 XXX needs to be processed before release. ]
8 perldelta - what is new for perl v5.13.9
12 This document describes differences between the 5.13.8 release and
15 If you are upgrading from an earlier release such as 5.13.7, first read
16 L<perl5138delta>, which describes differences between 5.13.7 and
21 XXX Any important notices here
23 =head1 Core Enhancements
25 XXX New core language features go here. Summarise user-visible core language
26 enhancements. Particularly prominent performance optimisations could go
27 here, but most should go in the L</Performance Enhancements> section.
29 [ List each enhancement as a =head2 entry ]
31 =head2 New regular expression modifier C</a>
33 The C</a> regular expression modifier restricts C<\s> to match precisely
34 the five characters C<[ \f\n\r\t]>, C<\d> to match precisely the 10
35 characters C<[0-9]>, C<\w> to match precisely the 63 characters
36 C<[A-Za-z0-9_]>, and the Posix (C<[[:posix:]]>) character classes to
37 match only the appropriate ASCII characters. The complements, of
38 course, match everything but; and C<\b> and C<\B> are correspondingly
39 affected. Otherwise, C</a> behaves like the C</u> modifier, in that
40 case-insensitive matching uses Unicode semantics; for example, "k" will
41 match the Unicode C<\N{KELVIN SIGN}> under C</i> matching, and code
42 points in the Latin1 range, above ASCII will have Unicode semantics when
43 it comes to case-insensitive matching. Like its cousins (C</u>, C</l>,
44 and C</d>), and in spite of the terminology, C</a> in 5.14 will not
45 actually be able to be used as a suffix at the end of a regular
46 expression (this restriction is planned to be lifted in 5.16). It must
47 occur either as an infix modifier, such as C<(?a:...)> or (C<(?a)...>,
48 or it can be turned on within the lexical scope of C<use re '/a'>.
49 Turning on C</a> turns off the other "character set" modifiers.
51 =head2 Any unsigned value can be encoded as a character
53 With this release, Perl is adopting a model that any unsigned value can
54 be treated as a code point and encoded internally (as utf8) without
55 warnings -- not just the code points that are legal in Unicode.
56 However, unless utf8 warnings have been
57 explicitly lexically turned off, outputting or performing a
58 Unicode-defined operation (such as upper-casing) on such a code point
59 will generate a warning. Attempting to input these using strict rules
60 (such as with the C<:encoding('UTF-8')> layer) will continue to fail.
61 Prior to this release the handling was very inconsistent, and incorrect
62 in places. Also, the Unicode non-characters, some of which previously were
63 erroneously considered illegal in places by Perl, contrary to the Unicode
64 standard, are now always legal internally. But inputting or outputting
65 them will work the same as for the non-legal Unicode code points, as the
66 Unicode standard says they are illegal for "open interchange".
68 =head2 Regular expression debugging output improvement
70 Regular expression debugging output (turned on by C<use re 'debug';>) now
71 uses hexadecimal when escaping non-ASCII characters, instead of octal.
75 XXX Any security-related notices go here. In particular, any security
76 vulnerabilities closed should be noted here rather than in the
77 L</Selected Bug Fixes> section.
79 [ List each security issue as a =head2 entry ]
81 =head1 Incompatible Changes
83 XXX For a release on a stable branch, this section aspires to be:
85 There are no changes intentionally incompatible with 5.XXX.XXX. If any
86 exist, they are bugs and reports are welcome.
88 [ List each incompatible change as a =head2 entry ]
90 =head2 All objects are destroyed
92 It used to be possible to prevent a destructor from being called during
93 global destruction by artificially increasing the reference count of an
96 Now such objects I<will> will be destroyed, as a result of a bug fix
97 L<[perl #81230]|http://rt.perl.org/rt3/Public/Bug/Display.html?id=81230>.
99 This has the potential to break some XS modules. (In fact, it break some.
100 See L</Known Problems>, below.)
104 XXX Any deprecated features, syntax, modules etc. should be listed here.
105 In particular, deprecated modules should be listed here even if they are
106 listed as an updated module in the L</Modules and Pragmata> section.
108 [ List each deprecation as a =head2 entry ]
110 =head1 Performance Enhancements
112 XXX Changes which enhance performance without changing behaviour go here. There
113 may well be none in a stable release.
115 [ List each enhancement as a =item entry ]
125 =head1 Modules and Pragmata
127 XXX All changes to installed files in F<cpan/>, F<dist/>, F<ext/> and F<lib/>
128 go here. If Module::CoreList is updated, generate an initial draft of the
129 following sections using F<Porting/corelist-perldelta.pl>, which prints stub
130 entries to STDOUT. Results can be pasted in place of the '=head2' entries
131 below. A paragraph summary for important changes should then be added by hand.
132 In an ideal world, dual-life modules would have a F<Changes> file that could be
135 [ Within each section, list entries as a =item entry ]
137 =head2 New Modules and Pragmata
143 L<CPAN::Meta::YAML> 0.003 has been added as a dual-life module. It supports a
144 subset of YAML sufficient for reading and writing META.yml and MYMETA.yml files
145 included with CPAN distributions or generated by the module installation
146 toolchain. It should not be used for any other general YAML parsing or
151 L<HTTP::Tiny> 0.009 has been added as a dual-life module. It is a very
152 small, simple HTTP/1.1 client designed for simple GET requests and file
153 mirroring. It has has been added to enable CPAN.pm and CPANPLUS to
154 "bootstrap" HTTP access to CPAN using pure Perl without relying on external
155 binaries like F<curl> or F<wget>.
159 L<JSON::PP> 2.27103 has been added as a dual-life module, for the sake of
160 reading F<META.json> files in CPAN distributions.
164 L<Module::Metadata> 1.000003 has been added as a dual-life module. It gathers
165 package and POD information from Perl module files. It is a standalone module
166 based on Module::Build::ModuleInfo for use by other module installation
167 toolchain components. Module::Build::ModuleInfo has been deprecated in
168 favor of this module instead.
172 L<Perl::OSType> 1.002 has been added as a dual-life module. It maps Perl
173 operating system names (e.g. 'dragonfly' or 'MSWin32') to more generic types
174 with standardized names (e.g. "Unix" or "Windows"). It has been refactored
175 out of Module::Build and ExtUtils::CBuilder and consolidates such mappings into
176 a single location for easier maintenance.
180 =head2 Updated Modules and Pragmata
186 C<Archive::Extract> has been upgraded from version 0.46 to 0.48
190 C<Archive::Tar> has been upgraded from version 1.74 to 1.76
194 C<CGI> has been upgraded from version 3.50 to 3.51
196 Further improvements have been made to guard against newline injections
201 C<Compress::Raw::Bzip2> has been upgraded from version 2.031 to 2.033
205 C<Compress::Raw::Zlib> has been upgraded from version 2.030 to 2.033
209 C<CPAN> has been upgraded from version 1.94_62 to 1.94_63
213 C<CPANPLUS> has been upgraded from version 0.9010 to 0.9011
217 C<CPANPLUS::Dist::Build> has been upgraded from version 0.50 to 0.52
221 C<DB_File> has been upgraded from version 1.820 to 1.821
225 C<Encode> has been upgraded from version 2.40 to 2.42.
226 Now, all 66 Unicode non-characters are treated the same way U+FFFF has
227 always been treated; if it was disallowed, all 66 are disallowed; if it
232 C<File::Fetch> has been upgraded from version 0.28 to 0.32
236 C<IO::Compress> has been upgraded from version 2.030 to 2.033
240 C<IPC::Cmd> has been upgraded from version 0.66 to 0.68
244 C<Log::Message> has been upgraded from version 0.02 to 0.04
248 C<Log::Message::Simple> has been upgraded from version 0.06 to 0.08
252 C<Module::Load::Conditional> has been upgraded from version 0.38 to 0.40
256 C<Object::Accessor> has been upgraded from version 0.36 to 0.38
260 C<Params::Check> has been upgraded from version 0.26 to 0.28
264 C<Pod::LaTeX> has been upgraded from version 0.58 to 0.59
268 C<Term::UI> has been upgraded from version 0.20 to 0.24
272 C<Thread::Queue> has been upgraded from version 2.11 to 2.12.
276 C<Thread::Semaphore> has been upgraded from version 2.11 to 2.12.
280 C<threads> has been upgraded from version 1.81_03 to 1.82
284 C<threads::shared> has been upgraded from version 1.35 to 1.36
288 C<Time::Local> has been upgraded from version 1.1901_01 to 1.2000.
292 C<Unicode::Normalize> has been upgraded from version 1.07 to 1.10
296 =head2 Removed Modules and Pragmata
308 XXX Changes to files in F<pod/> go here. Consider grouping entries by
309 file and be sure to link to the appropriate page, e.g. L<perlfunc>.
311 =head2 New Documentation
313 XXX Changes which create B<new> files in F<pod/> go here.
317 XXX Description of the purpose of the new file here
319 =head2 Changes to Existing Documentation
321 XXX Changes which significantly change existing files in F<pod/> go here.
322 However, any changes to F<pod/perldiag.pod> should go in the L</Diagnostics>
331 It has now been documented that C<ord> returns 0 for an empty string.
337 The following additions or changes have been made to diagnostic output,
338 including warnings and fatal error messages. For the complete list of
339 diagnostic messages, see L<perldiag>.
341 XXX New or changed warnings emitted by the core's C<C> code go here. Also
342 include any changes in L<perldiag> that reconcile it to the C<C> code.
344 [ Within each section, list entries as a =item entry ]
346 =head2 New Diagnostics
348 XXX Newly added diagnostic messages go here
358 =head2 Changes to Existing Diagnostics
360 XXX Changes (i.e. rewording) of diagnostic messages go here
370 =head1 Utility Changes
372 XXX Changes to installed programs such as F<perlbug> and F<xsubpp> go
373 here. Most of these are built within the directories F<utils> and F<x2p>.
375 [ List utility changes as a =head3 entry for each utility and =item
376 entries for each change
377 Use L<XXX> with program names to get proper documentation linking. ]
385 C<perlbug> did not previously generate a From: header, potentially
386 resulting in dropped mail. Now it does include that header.
390 =head1 Configuration and Compilation
392 XXX Changes to F<Configure>, F<installperl>, F<installman>, and analogous tools
393 go here. Any other changes to the Perl build process should be listed here.
394 However, any platform-specific changes should be listed in the
395 L</Platform Support> section, instead.
397 [ List changes as a =item entry ].
409 XXX Any significant changes to the testing of a freshly built perl should be
410 listed here. Changes which create B<new> files in F<t/> go here as do any
411 large changes to the testing harness (e.g. when parallel testing was added).
412 Changes to existing files in F<t/> aren't worth summarising, although the bugs
413 that they represent may be covered elsewhere.
415 [ List each test improvement as a =item entry ]
421 A new test script, C<t/porting/filenames.t>, makes sure that filenames and
422 paths are reasonably portable.
426 =head1 Platform Support
428 XXX Any changes to platform support should be listed in the sections below.
430 [ Within the sections, list each platform as a =item entry with specific
431 changes as paragraphs below it. ]
435 XXX List any platforms that this version of perl compiles on, that previous
436 versions did not. These will either be enabled by new files in the F<hints/>
437 directories, or new subdirectories and F<README> files at the top level of the
442 =item XXX-some-platform
448 =head2 Discontinued Platforms
450 XXX List any platforms that this version of perl no longer compiles on.
454 =item Apollo DomainOS
456 The last vestiges of support for this platform have been excised from the
457 Perl distribution. It was officially discontinued in version 5.12.0. It had
458 not worked for years before that.
462 =head2 Platform-Specific Notes
464 XXX List any changes for specific platforms. This could include configuration
465 and compilation changes or changes in portability/compatibility. However,
466 changes within modules for platforms should generally be listed in the
467 L</Modules and Pragmata> section.
473 DTrace is now supported on Solaris. There used to be build failures, but
474 these have been fixed
475 L<[perl #73630]|http://rt.perl.org/rt3/Public/Bug/Display.html?id=73630>.
479 =head1 Internal Changes
481 XXX Changes which affect the interface available to C<XS> code go here.
482 Other significant internal changes for future core maintainers should
485 [ List each test improvement as a =item entry ]
491 The opcode bodies for C<chop> and C<chomp> and for C<schop> and C<schomp> have
492 been merged. The implementation functions C<Perl_do_chop()> and
493 C<Perl_do_chomp()>, never part of the public API, have been merged and moved to
494 a static function in F<pp.c>. This shrinks the perl binary slightly, and should
495 not affect any code outside the core (unless it is relying on the order of side
496 effects when C<chomp> is passed a I<list> of values).
500 Some of the flags parameters to the uvuni_to_utf8_flags() and
501 utf8n_to_uvuni() have changed. This is a result of Perl now allowing
502 internal storage and manipulation of code points that are problematic
503 in some situations. Hence, the default actions for these functions has
504 been complemented to allow these code points. The new flags are
505 documented in L<perlapi>. Code that requires the problematic code
506 points to be rejected needs to change to use these flags. Some flag
507 names are retained for backward source compatibility, though they do
508 nothing, as they are now the default. However the flags
509 C<UNICODE_ALLOW_FDD0>, C<UNICODE_ALLOW_FFFF>, C<UNICODE_ILLEGAL>, and
510 C<UNICODE_IS_ILLEGAL> have been removed, as they stem from a
511 fundamentally broken model of how the Unicode non-character code points
512 should be handled, which is now described in
513 L<perlunicode/Non-character code points>. See also L</Selected Bug Fixes>.
517 Certain shared flags in the C<pmop.op_pmflags> and C<regexp.extflags>
518 structures have been removed. These are: C<Rxf_Pmf_LOCALE>,
519 C<Rxf_Pmf_UNICODE>, and C<PMf_LOCALE>. Instead there are encodes and
520 three static in-line functions for accessing the information:
521 C<get_regex_charset()>, C<set_regex_charset()>, and C<get_regex_charset_name()>,
522 which are defined in the places where the orginal flags were.
526 A new option has been added to C<pv_escape> to dump all characters above
527 ASCII in hexadecimal. Before, one could get all characters as hexadecimal
528 or the Latin1 non-ASCII as octal
532 =head1 Selected Bug Fixes
534 XXX Important bug fixes in the core language are summarised here.
535 Bug fixes in files in F<ext/> and F<lib/> are best summarised in
536 L</Modules and Pragmata>.
538 [ List each fix as a =item entry ]
544 The handling of Unicode non-characters has changed.
545 Previously they were mostly considered illegal, except that only one of
546 the 66 of them was known about in places. The Unicode standard
547 considers them legal, but forbids the "open interchange" of them.
548 This is part of the change to allow the internal use of any code point
549 (see L</Core Enhancements>). Together, these changes resolve
550 L<# 38722|https://rt.perl.org/rt3/Ticket/Display.html?id=38722>,
551 L<# 51918|http://rt.perl.org/rt3/Ticket/Display.html?id=51918>,
552 L<# 51936|http://rt.perl.org/rt3/Ticket/Display.html?id=51936>,
553 L<# 63446|http://rt.perl.org/rt3/Ticket/Display.html?id=63446>
557 Sometimes magic (ties, tainted, etc.) attached to variables could cause an
558 object to last longer than it should, or cause a crash if a tied variable
559 were freed from within a tie method. These have been fixed
560 L<[perl #81230]|http://rt.perl.org/rt3/Public/Bug/Display.html?id=81230>.
564 Most I/O functions were not warning for unopened handles unless the
565 'closed' and 'unopened' warnings categories were both enabled. Now only
566 C<use warnings 'unopened'> is necessary to trigger these warnings (as was
567 always meant to be the case.
571 C<< <expr> >> always respects overloading now if the expression is
574 Due to the way that '<> as glob' was parsed differently from
575 '<> as filehandle' from 5.6 onwards, something like C<< <$foo[0]> >> did
576 not handle overloading, even if C<$foo[0]> was an overloaded object. This
577 was contrary to the documentation for overload, and meant that C<< <> >>
578 could not be used as a general overloaded iterator operator.
582 Destructors on objects were not called during global destruction on objects
583 that were not referenced by any scalars. This could happen if an array
584 element were blessed (e.g., C<bless \$a[0]>) or if a closure referenced a
585 blessed variable (C<bless \my @a; sub foo { @a }>).
587 Now there is an extra pass during global destruction to fire destructors on
588 any objects that might be left after the usual passes that check for
589 objects referenced by scalars
590 L<[perl #36347]|http://rt.perl.org/rt3/Public/Bug/Display.html?id=36347>.
594 =head1 Known Problems
596 XXX Descriptions of platform agnostic bugs we know we can't fix go here. Any
597 tests that had to be C<TODO>ed for the release would be noted here, unless
598 they were specific to a particular platform (see below).
600 This is a list of some significant unfixed bugs, which are regressions
601 from either 5.XXX.XXX or 5.XXX.XXX.
603 [ List each fix as a =item entry ]
609 The fix for [perl #81230] causes test failures for C<Tk> version 804.029.
610 This is still being investigated.
616 XXX If any significant core contributor has died, we've added a short obituary
619 =head1 Acknowledgements
621 XXX The list of people to thank goes here.
623 =head1 Reporting Bugs
625 If you find what you think is a bug, you might check the articles
626 recently posted to the comp.lang.perl.misc newsgroup and the perl
627 bug database at http://rt.perl.org/perlbug/ . There may also be
628 information at http://www.perl.org/ , the Perl Home Page.
630 If you believe you have an unreported bug, please run the L<perlbug>
631 program included with your release. Be sure to trim your bug down
632 to a tiny but sufficient test case. Your bug report, along with the
633 output of C<perl -V>, will be sent off to perlbug@perl.org to be
634 analysed by the Perl porting team.
636 If the bug you are reporting has security implications, which make it
637 inappropriate to send to a publicly archived mailing list, then please send
638 it to perl5-security-report@perl.org. This points to a closed subscription
639 unarchived mailing list, which includes all the core committers, who be able
640 to help assess the impact of issues, figure out a resolution, and help
641 co-ordinate the release of patches to mitigate or fix the problem across all
642 platforms on which Perl is supported. Please only use this address for
643 security issues in the Perl core, not for modules independently
648 The F<Changes> file for an explanation of how to view exhaustive details
651 The F<INSTALL> file for how to build Perl.
653 The F<README> file for general stuff.
655 The F<Artistic> and F<Copying> files for copyright information.
661 Below, you'll find a cut-down version of the git log from 5.13.8 to 1b9043bb.
662 Jesse went through and took a first pass at cutting out non-changelog items. He may have been over-(or under-)zealous.
664 It hasn't yet been deduped with the entries that _are_ in the perldelta.
667 Major spell-checking pass throughout the core
669 Remove references to compat3.sym and interp.sym, deleted over 10 years ago.
671 Remove Mac OS classic code from numerous places throughout the core
673 commit 7fe50b8b8a4dc38fc341e3b403545aaca937f50e
674 Author: Leon Timmermans <fawaka@gmail.com>
675 Date: Tue Jan 18 16:40:07 2011 +0100
677 Also unblock signal handlers throwing an exception
679 Also handle and test the edge case of a signal handler throwing an
685 Numberous POD warnings fixed
689 Update Unicode-Collate to CPAN version 0.71
695 regexec.c: Remove break statements from macros
697 This is so future coders won't be tempted to rely on them.
699 regexec.c: Don't rely on break stmts in macros
701 It is safer and clearer to have the break statement in each case statement at
704 commit b57e41186b2ceb48bef4f0588dcd19e105cc8a38
705 Author: Karl Williamson <public@khwilliamson.com>
706 Date: Tue Jan 18 15:03:41 2011 -0700
708 regcomp: Disallow multi-char folds in lookbehind
710 The addition of the ANYOFV regnode to treat multi-char folds in a bracketed
711 character class has exposed a bug, in which those classes have long been able
712 to be varying length (due to the multi-char fold), but the compiler wasn't
713 aware of it. Now it is, and hence won't allow those which have multi-char
714 folds to be part of a lookbehind pattern, which requires a constant length.
716 This patch disallows multi-char folds in a lookbehind bracketed character
719 Author: Nicholas Clark <nick@ccl4.org>
720 Date: Tue Jan 18 16:14:43 2011 +0000
722 Remove Mac OS classic code from scripts in utils/
725 Convert DosGlob.t to Test::More.
727 commit d6dc8a6dc95226ddff7719cd1e0bd053c4e5725e
728 Author: Nicholas Clark <nick@ccl4.org>
729 Date: Tue Jan 18 13:08:11 2011 +0000
731 Remove Mac OS Classic docs from DirHandle and File::{Copy,DosGlob,Find}
733 The documentation for the different behaviour on Mac OS Classic was not
734 removed when the relevant code was removed in 862f843bac3434c2. That commit
735 also remove all callers to several Mac OS classic support functions, but not
736 the functions themselves. Rectify this.
738 commit 8254cbf193c939338449097a80163197fc755150
739 Author: Nicholas Clark <nick@ccl4.org>
740 Date: Tue Jan 18 11:10:41 2011 +0000
742 Build perltoc.pod with pragmata sorted by name.
744 Previously they were actually sorted by full pathname, which isn't logical.
745 I presume that this is an artefact of all pragmata being in lib/ when
746 pod/buildtoc was originally written.
748 commit 1d45ec279e4e105512a2803e3d0bd974a151a0f6
749 Author: Father Chrysostomos <sprout@cpan.org>
750 Date: Mon Jan 17 22:32:52 2011 -0800
752 perlcall: Fixes for various grammatical errors
754 commit faaf68361923e4bb95d1eb919bc724a0dcc5a4ce
755 Author: Leon Timmermans <fawaka@gmail.com>
756 Date: Mon Jan 17 17:59:33 2011 +0100
758 Clarify limitation in safe signals.
760 commit 0c1bf4c7d433bb0ad80bfe5511b1301db32b7b95
761 Author: Leon Timmermans <fawaka@gmail.com>
762 Date: Mon Jan 17 16:29:11 2011 +0100
764 Added tests for conditional unblocking
766 commit 555344425f04e96a72e4d29eab96b34bff8f96ae
767 Author: Leon Timmermans <fawaka@gmail.com>
768 Date: Thu Jan 13 18:30:29 2011 +0100
770 Conditionally unblock after signal handler[#82040]
772 Only unblock signal after a safe-signal handler is executed if that signal was
773 also unblocked before the handler.
774 commit 435aa301127ed481169903cb35187bde1ea44928
775 Author: David Golden <dagolden@cpan.org>
776 Date: Mon Jan 17 20:39:14 2011 -0500
778 Update HTTP::Tiny to CPAN version 0.009
780 commit 78cd53afbb1923bf0a68f361040ad8fe93a7d0d5
781 Author: David Mitchell <davem@iabyn.com>
782 Date: Tue Jan 18 00:46:30 2011 +0000
784 vastly speed up t/porting/diag.t
786 This used to take about 3 minutes of CPU. Reduce this to around
787 6 seconds (!!) by coalescing and pre-compiling various patterns
788 that get applied to nearly every line of every source file.
790 commit cfaf538b6276c6a8ef80ff6c66e106c6a4f1caaa
791 Author: Karl Williamson <public@khwilliamson.com>
792 Date: Mon Jan 17 08:58:53 2011 -0700
794 Add /a regex modifier
796 This restricts certain constructs, like \w, to matching in the ASCII range only.
798 commit 56ae17b45d2513d65903d13468e8f6a16b20f916
799 Author: Karl Williamson <public@khwilliamson.com>
800 Date: Sun Jan 16 17:41:30 2011 -0700
802 regcomp.c: Convert \d \D to a switch{}
804 commit eee47ba661717bca7751443be1b6ac8f8f64585a
805 Author: Chris 'BinGOs' Williams <chris@bingosnet.co.uk>
806 Date: Mon Jan 17 12:35:54 2011 +0000
808 Update File-Fetch to CPAN version 0.32
810 commit e66820012d29519f903709f005e56a2c334ae183
811 Author: Tony Cook <tony@develop-help.com>
812 Date: Mon Jan 17 19:22:08 2011 +1100
814 test_prep now depends on the pods needed for porting/buildtoc.t
816 In the other makefiles test_prep (or test-prep) depends on all, so
817 they shouldn't need updating.
820 commit a52237f3a547cdefddd4c4be6224bfdf67c84263
821 Author: David Golden <dagolden@cpan.org>
822 Date: Sun Jan 16 21:32:21 2011 -0500
824 Update CPAN to CPAN version 1.94_63
826 commit 63ac0dadb1aafcf0c171d3c1422c1923b611b2fc
827 Author: Karl Williamson <public@khwilliamson.com>
828 Date: Tue Dec 28 16:13:49 2010 -0700
830 regex: Use BOUNDU regnodes
832 This refactors one area in regexec.c to use BOUNDU, NBOUNDU for
833 efficiciency, and easier adding of the future BOUNDA.
835 commit 980866de2cf8ecdb4bb72b7f9294763057008f50
836 Author: Karl Williamson <public@khwilliamson.com>
837 Date: Mon Dec 27 12:04:58 2010 -0700
839 regex: Separate nodes for Unicode semantics \s \w
841 This patch converts the \s, \w and complements Unicode semantics to
842 instead of using the flags field of their nodes to instead use separate
843 nodes. This gains some efficiency, especially useful in tight loops and
844 backtracking of regexec.c, and prepares the way for easily adding other
845 semantic variations, such as /a.
847 It refactors the CCC_TRY... macros. I tried to break this piece up into
848 smaller chunks, but found it much easier to get to this in one step.
849 Further patches will do some more refactoring of these.
851 As part of the CCC_TRY macro refactoring, the lines that include the
852 test if (! nextchr) are changed to just look for the end-of-string by
853 position instead of it being NUL. In locales, it could be (however
854 unlikely), that NUL is a real alphabetic, digit, or space character.
855 commit 50e911483ad5c29e25c54c9f81f92df974dd2cc0
856 Author: Karl Williamson <public@khwilliamson.com>
857 Date: Sun Dec 26 10:35:58 2010 -0700
859 Change name of /d to DEPENDS
861 I much prefer David Golden's name for /d whose meaning 'depends' on
862 circumstances, instead of 'dual' meaning it could be one or another.
863 Change it before this gets out in a stable release, and we're stuck with
866 commit 73134a2eb4055c76fe5b154da95e09118f716fd8
867 Author: Karl Williamson <public@khwilliamson.com>
868 Date: Sun Dec 26 10:35:20 2010 -0700
870 CH] Change usage of regex/op common to common names
872 This patch changes the core functions to use the common names for the
873 fields that are shared between op.c and regcomp.c, just for consistency
874 of using one name throughout the core for the same thing.
876 A grep of cpan shows that both names are used in various modules; so
877 both names must be retained.
879 commit a3ab329f3fc9494e700f51c38cef42021c130b6e
880 Author: David Golden <dagolden@cpan.org>
881 Date: Sun Jan 16 20:57:02 2011 -0500
883 Add HTTP::Tiny as a dual-life core module
885 HTTP::Tiny has been added as a dual-life module. It is a very
886 small, simple HTTP/1.1 client designed for simple GET requests and file
887 mirroring. It has has been added to enable CPAN.pm and CPANPLUS to
888 "bootstrap" HTTP access to CPAN using pure Perl without relying on external
889 binaries like F<curl> or F<wget>.
891 commit 211cc5012284f4bd900fcaa630adbcac69ca6112
892 Author: Chris 'BinGOs' Williams <chris@bingosnet.co.uk>
893 Date: Sun Jan 16 23:23:03 2011 +0000
895 Update Unicode-Collate to CPAN version 0.70 and enable XS version
897 commit a62b1201c068dc7b099bcb7182e188c4d2fbf34c
898 Author: Karl Williamson <public@khwilliamson.com>
899 Date: Sun Dec 26 10:31:16 2010 -0700
901 Use multi-bit field for regex character set
903 The /d, /l, and /u regex modifiers are mutually exclusive. This patch
904 changes the field that stores the character set to use more than one bit
905 with an enum determining which one. This data structure more
906 closely follows the semantics of their being mutually exclusive, and
907 conserves bits as well, and is better expandable.
909 A small API is added to set and query the bit field.
911 This patch is not .xs source backwards compatible. A handful of cpan
912 programs are affected.
917 Significant updates to buildtoc
919 Update Unicode-Normalize to CPAN version 1.10
921 commit 11454c594f22abc5945e69a46fc965363dbf326e
922 Author: Karl Williamson <public@khwilliamson.com>
923 Date: Sat Jan 15 13:42:58 2011 -0700
925 Fix \xa0 matching both [\s] [\S], et.al.
927 This bug stemmed from Latin1 characters not matching any (non-complemented)
928 character class in /d semantics when the target string is no utf8; but having
929 unicode semantics when it isn't. The solution here is to add a special flag.
931 There were several tests that relied on the broken behavior, specifically they
932 tested that \xff isn't a printable word character even in utf8. I changed the
933 deparse test to instead use a non-printable code point, and I changed the ones
934 in re_tests to be TODOs, and will change them back using /a when that is
940 restrict \p{IsUserDefined} to In\w+ and In\w+
942 In L<perlunicode/"User-Defined Character Properties">, it says you can
943 create custom properties by defining subroutines whose names begin with
944 "In" or "Is". However, perl doesn't actually enforce that naming
945 restriction, so \p{foo::bar} will call foo::Bar() if it exists.
947 This commit finally enforces this convention. Note that this broke a
948 number of existing tests for properties, since they didn't always use an
953 test that perl.pod, pod.lst, MANIFEST and the file system are consistent
955 commit f56b6394f7cf57733135f56e4e4ac49abe9ac9cc
956 Author: Karl Williamson <public@khwilliamson.com>
957 Date: Thu Jan 13 22:36:36 2011 -0700
961 This patch restructures the regex ANYOF code to generate ANYOFV nodes instead
962 when there is a possibility that it could match more than one character. Note
963 that this doesn't affect the optimizer, as it essentially ignores things that
964 fit into this category. (But it means that the optimizer will no longer reject
965 these when it shouldn't have.)
967 The handling of the LATIN SHARP s is modified to correspond with this new node
970 The initial handling of ANYOFV is placed in regexec.c. More analysis will come
971 on that. But there was significant change to the part that handles matching
972 multi-char strings. This has long been buggy, with it previously comparing a
973 folded-version on one side with a non-folded version on the other.
975 This patch fixes about 60% of the problems that my undelivered test suite gives
976 for multi-char folds. But there are still 17K test failures left, so I'm still
977 not delivering that. The TODOs that this fixes will be cleaned up in a later commit
980 Update Pod-LaTeX to CPAN version 0.59
982 commit 680818c0361b180bb6f09d4bb11c4d5cd467fe62
983 Author: Nicholas Clark <nick@ccl4.org>
984 Date: Thu Jan 13 16:24:52 2011 +0000
986 ithread_create() was relying on the stack not moving. Fix this.
988 4cf5eae5e58faebb changed S_ithread_create() to avoid creating an AV, by
989 passing the thread creation arguments as pointers to a block of memory
990 holding SVs. Unfortunately, this inadvertently introduced a subtle bug,
991 because the block of memory is on the Perl stack, which can move as a side
992 effect of being reallocated to extend it. Hence pass in the offset on the
993 stack instead, read the current value of the relevant interpreter's stack
994 at the point of access, and copy all the SVs away before making any further
995 calls which might cause reallocation.
997 Update to Win32-0.44 from CPAN
998 Update Term-UI to CPAN version 0.24
999 Update IO-Compress to CPAN version 2.033
1000 Update Compress-Raw-Zlib to CPAN version 2.033
1001 Update Compress-Raw-Bzip2 to CPAN version 2.033
1002 Update DB_File to CPAN version 1.821
1005 Correct the "unimplemented" message for get{host,net,proto,serv}ent aliases.
1007 Previously, if all of gethost{byaddr,byname,ent} were unimplemented on a
1008 platform, they would all return 'Unsupported socket function "gethostent"
1009 called', with the analogous results for getnet{byaddr,byname,ent},
1010 getproto{byname,bynumber,ent} and getserv{byname,byport,ent}. This bug was
1011 introduced by change af51a00e97d5c559 - prior to this, all 12 functions would
1012 report their own name when unimplemented.
1014 commit 9ae3ac1a84c63b0eadf5baf47ce7096482280f32
1015 Author: Karl Williamson <public@khwilliamson.com>
1016 Date: Sun Jan 9 15:33:28 2011 -0700
1018 Add warnings for use of problematic code points
1020 The non-Unicode code points have no Unicode semantics, so applying operations
1021 such as casing on them warns.
1023 This patch also includes the changes to test the warnings added by recent
1024 commits for handling the surrogates and above-Unicode code points
1026 commit 949cf4983af707fbd15e422845f4f3df20505f97
1027 Author: Karl Williamson <public@khwilliamson.com>
1028 Date: Sun Jan 9 13:50:18 2011 -0700
1030 utf8.c(): Default to allow problematic code points
1032 Surrogates, non-character code points, and code points that aren't in Unicode
1033 are now allowed by default, instead of having to specify a flag to allow them.
1034 (Most code did specify those flags anyway.)
1036 This affects uvuni_to_utf8_flags(), utf8n_to_uvuni() and various routines that
1037 are specialized interfaces to them.
1039 Now there is a new set of flags to disallow those code points. Further, all 66
1040 of the non-character code points are known about and handled consistently,
1041 instead of just U+FFFF.
1043 Code that requires these code points to be forbidden will have to change to use
1044 the new flags. I have looked at all the (few) instances in CPAN where these
1045 routines are used, and the only one I found that appears to have need to do
1046 this, Encode, has already been patched to accommodate this change. Of course,
1047 I may have overlooked some subtleties.
1050 commit 7627e6d0fe772ac90fce9e03fea273109521e261
1051 Author: Nicholas Clark <nick@ccl4.org>
1052 Date: Sat Jan 8 15:56:22 2011 +0000
1054 Generate "Unsupported socket function" stubs using PL_ppaddr.
1056 Instead of having each socket op conditionally compile as either the
1057 implementation or a DIE() depending on #HAS_SOCKET
1059 1: remove the conditional code from the ops themselves
1060 2: only compile the ops if HAS_SOCKET is defined
1061 3: general conditional code for the intialisation of PL_ppaddr - as appropriate
1062 either the ops, or Perl_unimplemented_op
1063 4: Amend Perl_unimplemented_op to generate the appropriate DIE() for socket
1064 ops (ie not the "panic"... message)
1066 Whilst this complicates the support code in regen/opcode.pl, it's already a
1067 net saving of 5 lines in the C code.
1069 commit 897d398936dd2fc088a265fba2a7b62fa97ed458
1070 Author: Nicholas Clark <nick@ccl4.org>
1071 Date: Sun Jan 9 10:54:58 2011 +0000
1073 Generate pp_* prototypes in pp_proto.h, and remove pp.sym
1075 Eliminate the #define pp_foo Perl_pp_foo(pTHX) macros, and update the 13
1076 locations that relied on them.
1078 regen/opcode.pl now generates prototypes for the PP functions directly, into
1079 pp_proto.h. It no longer writes pp.sym, and regen/embed.pl no longer reads
1080 this, removing the only ordering dependency in the regen scripts. opcode.pl
1081 is now responsible for prototypes for pp_* functions. (embed.pl remains
1082 responsible for ck_* functions, reading from regen/opcodes)
1084 commit f1aaabf80b19a3cc6dc435510732b561ab46361e
1085 Author: Chris 'BinGOs' Williams <chris@bingosnet.co.uk>
1086 Date: Sun Jan 9 00:16:46 2011 +0000
1088 Update CPANPLUS to CPAN version 0.9011
1089 Update CPANPLUS-Dist-Build to CPAN version 0.52
1090 Update Term-UI to CPAN version 0.22
1091 Update Params-Check to CPAN version 0.28
1093 commit a4a4c9e2c086dd5f7b7b05789161614dbbe8385b
1094 Author: Karl Williamson <public@khwilliamson.com>
1095 Date: Sat Jan 8 14:44:05 2011 -0700
1097 perldiag.pod: Add missing message severities
1099 commit 0019012ad86d597fb507f71577d70ecd4c416bba
1100 Author: Chris 'BinGOs' Williams <chris@bingosnet.co.uk>
1101 Date: Sat Jan 8 23:09:04 2011 +0000
1103 Update Object-Accessor to CPAN version 0.38
1104 Update Module-Load-Conditional to CPAN version 0.40
1105 Update Log-Message to CPAN version 0.04
1106 Update IPC-Cmd to CPAN version 0.68
1110 add test for split without a pattern
1112 Add some while tests, about the context of the last statement in a block and about reinitializaiton of lexical variables.
1114 modernise t/cmd/while.t
1116 Add t/base/while.t testing the basic of a while loop with minimal dependencies. Change t/cmd/while.t into a non-base test using "test.pl".
1118 commit ac066c2ab5bc31260104aeee778921b186894769
1120 Update File-Fetch to CPAN version 0.30
1121 Update Archive-Tar to CPAN version 1.76
1122 Update Archive-Extract to CPAN version 0.48
1124 commit b86b68b4eefa1069dabc8ea0401d712b24a67857
1125 Author: Jesse Vincent <jesse@bestpractical.com>
1126 Date: Sat Jan 8 00:14:29 2011 +0800
1128 Update the policy on doc patches to maint
1131 commit e8b333e679eb9a7a62c1d86b647515f01821eb60
1132 Author: David Golden <dagolden@cpan.org>
1133 Date: Wed Jan 5 22:47:41 2011 -0500
1135 Add Module::Metadata as a dual-life core module
1137 This commit adds Module::Metadata 1.000002 as a dual-life module. It
1138 gathers package and POD information from Perl module files. It is a
1139 standalone module based on Module::Build::ModuleInfo for use by other
1140 module installation toolchain components. Module::Build::ModuleInfo
1141 has been deprecated in favor of this module instead.
1143 commit a8fb8d791b74ff90850140d94407aad99ec86fe3
1144 Author: John Peacock <jpeacock@jpeacock-hp.doesntexist.org>
1145 Date: Thu Jan 6 20:02:37 2011 -0500
1147 Sync with version 0.88 on CPAN
1149 commit c1b879e57aecbfe520b3f44dd28472d1862f037d
1150 Author: Craig A. Berry <craigberry@mac.com>
1151 Date: Wed Jan 5 19:14:06 2011 -0600
1153 Make newline on last record explicit.
1155 On VMS, the last line written to a file will get a trailing newline
1156 willy nilly. This has its advantages insofar as you never get the
1157 "no newline at end of file" warnings from various utilities, but
1158 reality conflicts with expectations when you explicitly test for
1159 the last (or only) line *not* ending with newline, which is what
1160 the recent addtion to ref.t (5e3072707906cc4cb8a364c4cf7c487df0300caa)
1161 was doing. Adding an explicit newline makes everyone happy.
1163 commit 0b5e625bc99f5cb78697faf03b297b6cacadf60b
1164 Author: Reini Urban <rurban@x-ray.at>
1165 Date: Tue Sep 14 18:04:22 2010 +0200
1167 build man pages on cygwin too
1169 commit 172830635ea7813c85e51e4ae2b4bed56ddbab83
1170 Author: Reini Urban <rurban@x-ray.at>
1171 Date: Tue Sep 14 17:54:15 2010 +0200
1173 Improve cygwin rebase behaviour
1175 If a dll is updated on cygwin reuse the old imagebase address.
1176 This solves most rebase errors, esp when updating on core dll's.
1177 See http://www.tishler.net/jason/software/rebase/rebase-2.4.2.README
1182 commit cc7e77fd5a0ee9f1498e54dddf566117da62754b
1183 Author: Reini Urban <rurban@x-ray.at>
1184 Date: Tue Sep 14 17:48:32 2010 +0200
1186 CYG14 Dynaloader without USEIMPORTLIB, and search cyg prefix
1188 part1: Support the standard cyg dll prefix, which is e.g. needed for FFI's.
1189 Ctypes and C::DynaLib use DynaLoader to find dlls.
1191 part2: With -DUSEIMPORTLIB DynaLoader symbols link against the prefixed
1192 symbol names for the .dll.a importlib, but we need to link against the
1193 symbols directly. We don't link Dynaloader against libperl.dll.a.
1196 $ g++-4 -o cygperl5_13_4.dll --shared perlsrc.o cygwin.o DynaLoader.o -ldl -lcrypt
1197 Creating library file: libperl.dll.a
1198 DynaLoader.o: In function `XS_DynaLoader_dl_undef_symbols':
1199 ext/DynaLoader/DynaLoader.c:346: undefined reference to `__imp__PL_stack_sp'
1200 ext/DynaLoader/DynaLoader.c:346: undefined reference to `__imp__PL_markstack_ptr'
1201 ext/DynaLoader/DynaLoader.c:346: undefined reference to `__imp__PL_stack_base'
1203 commit c0a149a90b8b6e1c18de7294ca974265fb559cf5
1204 Author: Reini Urban <rurban@x-ray.at>
1205 Date: Tue Sep 14 18:06:38 2010 +0200
1208 do not use usemymalloc (double size + slow)
1209 remove deprecated libcygipc info
1210 remove overlarge stack size
1212 commit 2831a86cee065b53b74fd19ddcc6a4257484646d
1213 Author: Zsbán Ambrus <ambrus@math.bme.hu>
1214 Date: Sun Jan 2 20:25:55 2011 -0800
1216 [perl #81032] Overhaul Porting/epigraphs.pod
1218 This patch makes multiple changes to Porting/epigraphs.pod and
1221 For those that don't know, Porting/epigraphs.pod is a new document that
1222 collects the quotes (chosen by Pumpkins) in perl release announcements.
1224 The changes are the following.
1226 1. Add a link pointing to each release announcement in the mailing
1227 list archives. These are from ysth's list, the source from which
1228 Porting/epigraphs.pod was originally compiled, but they weren't in
1229 Porting/epigraphs.pod so far.
1230 2. Reorder Porting/epigraphs.pod chronologically, because I believe
1231 that makes more sense -- pod/perlhist.pod is still sorted by version numbers.
1232 3. Incidentally, some missing releases are added to pod/perlhist.pod too.
1233 4. Fix a mistake where Porting/epigraphs.pod gives the wrong version number.
1234 5. Add some epigraphs that appear in ysth's list but not in
1235 Porting/epigraphs.pod.
1236 6. I did some research in perl history before the part that ysth's list
1237 covers, and added older perl announcements I found. This work is not
1238 complete: I stopped somewhere in 2000.
1240 commit eccda089fc3dcaafc1ae0aac6b428f799231b824
1241 Author: Paul "LeoNerd" Evans <leonerd@leonerd.org.uk>
1242 Date: Mon Dec 13 17:50:06 2010 +0000
1244 Implement Socket::getaddrinfo() and Socket::getnameinfo(), with related constants
1246 commit edcf105d70e5423fd928c776e086fe31a4a543f4
1247 Author: Jesse Vincent <jesse@bestpractical.com>
1248 Date: Sat Jan 1 18:46:20 2011 +0800
1250 Document 'test_porting' and start a section on how committing to blead
1252 commit cce04bebd8af026c2a6731940ddb895d3c1fc3e4
1253 Author: David Golden <dagolden@cpan.org>
1254 Date: Mon Dec 13 17:36:33 2010 -0500
1256 Reorganize perlhack.pod
1258 Following on an IRC conversation, I've attempted to reorganize
1259 perlhack for greater clarity. I have only cut and paste blocks
1260 of text and amended section titles and levels. (I have not addressed
1261 any of the numerous factual issues which remain.)
1263 The resulting guide should be clearer for those trying to skim the
1264 table of contents to understand what is covered in perlhack and
1265 whether it is worth an in-depth read.
1267 I see this change as the first step towards future improvements.