This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Upgrade to ExtUtils::MakeMaker 6.40.
[perl5.git] / pod / perl5100delta.pod
CommitLineData
81663c3a
RGS
1=encoding utf8
2
cf6c151c
RGS
3=head1 NAME
4
5perldelta - what is new for perl 5.10.0
6
7=head1 DESCRIPTION
8
9This document describes the differences between the 5.8.8 release and
10the 5.10.0 release.
11
12Many of the bug fixes in 5.10.0 were already seen in the 5.8.X maintenance
13releases; they are not duplicated here and are documented in the set of
14man pages named perl58[1-8]?delta.
15
cf6c151c
RGS
16=head1 Core Enhancements
17
18=head2 The C<feature> pragma
19
20The C<feature> pragma is used to enable new syntax that would break Perl's
21backwards-compatibility with older releases of the language. It's a lexical
22pragma, like C<strict> or C<warnings>.
23
24Currently the following new features are available: C<switch> (adds a
25switch statement), C<say> (adds a C<say> built-in function), and C<state>
292c2b28 26(adds a C<state> keyword for declaring "static" variables). Those
cf6c151c
RGS
27features are described in their own sections of this document.
28
29The C<feature> pragma is also implicitly loaded when you require a minimal
30perl version (with the C<use VERSION> construct) greater than, or equal
31to, 5.9.5. See L<feature> for details.
32
33=head2 New B<-E> command-line switch
34
35B<-E> is equivalent to B<-e>, but it implicitly enables all
36optional features (like C<use feature ":5.10">).
37
38=head2 Defined-or operator
39
40A new operator C<//> (defined-or) has been implemented.
dbef3c66 41The following expression:
cf6c151c
RGS
42
43 $a // $b
44
45is merely equivalent to
46
47 defined $a ? $a : $b
48
dbef3c66 49and the statement
cf6c151c
RGS
50
51 $c //= $d;
52
53can now be used instead of
54
55 $c = $d unless defined $c;
56
57The C<//> operator has the same precedence and associativity as C<||>.
58Special care has been taken to ensure that this operator Do What You Mean
59while not breaking old code, but some edge cases involving the empty
60regular expression may now parse differently. See L<perlop> for
61details.
62
63=head2 Switch and Smart Match operator
64
65Perl 5 now has a switch statement. It's available when C<use feature
66'switch'> is in effect. This feature introduces three new keywords,
67C<given>, C<when>, and C<default>:
68
69 given ($foo) {
70 when (/^abc/) { $abc = 1; }
71 when (/^def/) { $def = 1; }
72 when (/^xyz/) { $xyz = 1; }
73 default { $nothing = 1; }
74 }
75
76A more complete description of how Perl matches the switch variable
77against the C<when> conditions is given in L<perlsyn/"Switch statements">.
78
79This kind of match is called I<smart match>, and it's also possible to use
80it outside of switch statements, via the new C<~~> operator. See
81L<perlsyn/"Smart matching in detail">.
82
83This feature was contributed by Robin Houston.
84
85=head2 Regular expressions
86
87=over 4
88
89=item Recursive Patterns
90
91It is now possible to write recursive patterns without using the C<(??{})>
92construct. This new way is more efficient, and in many cases easier to
93read.
94
95Each capturing parenthesis can now be treated as an independent pattern
96that can be entered by using the C<(?PARNO)> syntax (C<PARNO> standing for
97"parenthesis number"). For example, the following pattern will match
98nested balanced angle brackets:
99
100 /
101 ^ # start of line
102 ( # start capture buffer 1
103 < # match an opening angle bracket
104 (?: # match one of:
105 (?> # don't backtrack over the inside of this group
106 [^<>]+ # one or more non angle brackets
107 ) # end non backtracking group
108 | # ... or ...
109 (?1) # recurse to bracket 1 and try it again
110 )* # 0 or more times.
111 > # match a closing angle bracket
112 ) # end capture buffer one
113 $ # end of line
114 /x
115
e15dad31
JC
116PCRE users should note that Perl's recursive regex feature allows
117backtracking into a recursed pattern, whereas in PCRE the recursion is
118atomic or "possessive" in nature. As in the example above, you can
119add (?>) to control this selectively. (Yves Orton)
cf6c151c
RGS
120
121=item Named Capture Buffers
122
123It is now possible to name capturing parenthesis in a pattern and refer to
124the captured contents by name. The naming syntax is C<< (?<NAME>....) >>.
125It's possible to backreference to a named buffer with the C<< \k<NAME> >>
126syntax. In code, the new magical hashes C<%+> and C<%-> can be used to
127access the contents of the capture buffers.
128
e15dad31 129Thus, to replace all doubled chars with a single copy, one could write
cf6c151c
RGS
130
131 s/(?<letter>.)\k<letter>/$+{letter}/g
132
133Only buffers with defined contents will be "visible" in the C<%+> hash, so
134it's possible to do something like
135
136 foreach my $name (keys %+) {
137 print "content of buffer '$name' is $+{$name}\n";
138 }
139
140The C<%-> hash is a bit more complete, since it will contain array refs
141holding values from all capture buffers similarly named, if there should
142be many of them.
143
144C<%+> and C<%-> are implemented as tied hashes through the new module
145C<Tie::Hash::NamedCapture>.
146
147Users exposed to the .NET regex engine will find that the perl
148implementation differs in that the numerical ordering of the buffers
149is sequential, and not "unnamed first, then named". Thus in the pattern
150
151 /(A)(?<B>B)(C)(?<D>D)/
152
153$1 will be 'A', $2 will be 'B', $3 will be 'C' and $4 will be 'D' and not
154$1 is 'A', $2 is 'C' and $3 is 'B' and $4 is 'D' that a .NET programmer
155would expect. This is considered a feature. :-) (Yves Orton)
156
157=item Possessive Quantifiers
158
159Perl now supports the "possessive quantifier" syntax of the "atomic match"
160pattern. Basically a possessive quantifier matches as much as it can and never
161gives any back. Thus it can be used to control backtracking. The syntax is
162similar to non-greedy matching, except instead of using a '?' as the modifier
163the '+' is used. Thus C<?+>, C<*+>, C<++>, C<{min,max}+> are now legal
164quantifiers. (Yves Orton)
165
166=item Backtracking control verbs
167
168The regex engine now supports a number of special-purpose backtrack
169control verbs: (*THEN), (*PRUNE), (*MARK), (*SKIP), (*COMMIT), (*FAIL)
170and (*ACCEPT). See L<perlre> for their descriptions. (Yves Orton)
171
172=item Relative backreferences
173
174A new syntax C<\g{N}> or C<\gN> where "N" is a decimal integer allows a
175safer form of back-reference notation as well as allowing relative
176backreferences. This should make it easier to generate and embed patterns
177that contain backreferences. See L<perlre/"Capture buffers">. (Yves Orton)
178
179=item C<\K> escape
180
181The functionality of Jeff Pinyan's module Regexp::Keep has been added to
254a8700 182the core. In regular expressions you can now use the special escape C<\K>
cf6c151c
RGS
183as a way to do something like floating length positive lookbehind. It is
184also useful in substitutions like:
185
186 s/(foo)bar/$1/g
187
188that can now be converted to
189
190 s/foo\Kbar//g
191
192which is much more efficient. (Yves Orton)
193
194=item Vertical and horizontal whitespace, and linebreak
195
292c2b28 196Regular expressions now recognize the C<\v> and C<\h> escapes that match
cf6c151c
RGS
197vertical and horizontal whitespace, respectively. C<\V> and C<\H>
198logically match their complements.
199
200C<\R> matches a generic linebreak, that is, vertical whitespace, plus
201the multi-character sequence C<"\x0D\x0A">.
202
203=back
204
205=head2 C<say()>
206
207say() is a new built-in, only available when C<use feature 'say'> is in
208effect, that is similar to print(), but that implicitly appends a newline
209to the printed string. See L<perlfunc/say>. (Robin Houston)
210
211=head2 Lexical C<$_>
212
213The default variable C<$_> can now be lexicalized, by declaring it like
214any other lexical variable, with a simple
215
216 my $_;
217
218The operations that default on C<$_> will use the lexically-scoped
219version of C<$_> when it exists, instead of the global C<$_>.
220
221In a C<map> or a C<grep> block, if C<$_> was previously my'ed, then the
222C<$_> inside the block is lexical as well (and scoped to the block).
223
224In a scope where C<$_> has been lexicalized, you can still have access to
225the global version of C<$_> by using C<$::_>, or, more simply, by
597bb945 226overriding the lexical declaration with C<our $_>. (Rafael Garcia-Suarez)
cf6c151c
RGS
227
228=head2 The C<_> prototype
229
254a8700
NC
230A new prototype character has been added. C<_> is equivalent to C<$> but
231defaults to C<$_> if the corresponding argument isn't supplied. (both C<$>
3d9f6fa1 232and C<_> denote a scalar). Due to the optional nature of the argument, you
254a8700 233can only use it at the end of a prototype, or before a semicolon.
cf6c151c
RGS
234
235This has a small incompatible consequence: the prototype() function has
236been adjusted to return C<_> for some built-ins in appropriate cases (for
237example, C<prototype('CORE::rmdir')>). (Rafael Garcia-Suarez)
238
239=head2 UNITCHECK blocks
240
241C<UNITCHECK>, a new special code block has been introduced, in addition to
242C<BEGIN>, C<CHECK>, C<INIT> and C<END>.
243
244C<CHECK> and C<INIT> blocks, while useful for some specialized purposes,
245are always executed at the transition between the compilation and the
246execution of the main program, and thus are useless whenever code is
247loaded at runtime. On the other hand, C<UNITCHECK> blocks are executed
248just after the unit which defined them has been compiled. See L<perlmod>
249for more information. (Alex Gough)
250
251=head2 New Pragma, C<mro>
252
253A new pragma, C<mro> (for Method Resolution Order) has been added. It
254permits to switch, on a per-class basis, the algorithm that perl uses to
dbef3c66 255find inherited methods in case of a multiple inheritance hierarchy. The
cf6c151c
RGS
256default MRO hasn't changed (DFS, for Depth First Search). Another MRO is
257available: the C3 algorithm. See L<mro> for more information.
258(Brandon Black)
259
dbef3c66 260Note that, due to changes in the implementation of class hierarchy search,
cf6c151c
RGS
261code that used to undef the C<*ISA> glob will most probably break. Anyway,
262undef'ing C<*ISA> had the side-effect of removing the magic on the @ISA
263array and should not have been done in the first place.
264
3de67921
JD
265=head2 readdir() may return a "short filename" on Windows
266
267The readdir() function may return a "short filename" when the long
268filename contains characters outside the ANSI codepage. Similarly
269Cwd::cwd() may return a short directory name, and glob() may return short
270names as well. On the NTFS file system these short names can always be
271represented in the ANSI codepage. This will not be true for all other file
272system drivers; e.g. the FAT filesystem stores short filenames in the OEM
273codepage, so some files on FAT volumes remain unaccessible through the
274ANSI APIs.
275
276Similarly, $^X, @INC, and $ENV{PATH} are preprocessed at startup to make
277sure all paths are valid in the ANSI codepage (if possible).
278
279The Win32::GetLongPathName() function now returns the UTF-8 encoded
280correct long file name instead of using replacement characters to force
281the name into the ANSI codepage. The new Win32::GetANSIPathName()
282function can be used to turn a long pathname into a short one only if the
283long one cannot be represented in the ANSI codepage.
284
285Many other functions in the C<Win32> module have been improved to accept
286UTF-8 encoded arguments. Please see L<Win32> for details.
287
cf6c151c
RGS
288=head2 readpipe() is now overridable
289
290The built-in function readpipe() is now overridable. Overriding it permits
291also to override its operator counterpart, C<qx//> (a.k.a. C<``>).
292Moreover, it now defaults to C<$_> if no argument is provided. (Rafael
293Garcia-Suarez)
294
597bb945 295=head2 Default argument for readline()
cf6c151c
RGS
296
297readline() now defaults to C<*ARGV> if no argument is provided. (Rafael
298Garcia-Suarez)
299
300=head2 state() variables
301
302A new class of variables has been introduced. State variables are similar
303to C<my> variables, but are declared with the C<state> keyword in place of
304C<my>. They're visible only in their lexical scope, but their value is
305persistent: unlike C<my> variables, they're not undefined at scope entry,
306but retain their previous value. (Rafael Garcia-Suarez, Nicholas Clark)
307
308To use state variables, one needs to enable them by using
309
254a8700 310 use feature 'state';
cf6c151c
RGS
311
312or by using the C<-E> command-line switch in one-liners.
313See L<perlsub/"Persistent variables via state()">.
314
315=head2 Stacked filetest operators
316
317As a new form of syntactic sugar, it's now possible to stack up filetest
318operators. You can now write C<-f -w -x $file> in a row to mean
319C<-x $file && -w _ && -f _>. See L<perlfunc/-X>.
320
321=head2 UNIVERSAL::DOES()
322
323The C<UNIVERSAL> class has a new method, C<DOES()>. It has been added to
324solve semantic problems with the C<isa()> method. C<isa()> checks for
325inheritance, while C<DOES()> has been designed to be overridden when
326module authors use other types of relations between classes (in addition
327to inheritance). (chromatic)
328
329See L<< UNIVERSAL/"$obj->DOES( ROLE )" >>.
330
cf6c151c
RGS
331=head2 Formats
332
333Formats were improved in several ways. A new field, C<^*>, can be used for
334variable-width, one-line-at-a-time text. Null characters are now handled
335correctly in picture lines. Using C<@#> and C<~~> together will now
336produce a compile-time error, as those format fields are incompatible.
337L<perlform> has been improved, and miscellaneous bugs fixed.
338
339=head2 Byte-order modifiers for pack() and unpack()
340
341There are two new byte-order modifiers, C<E<gt>> (big-endian) and C<E<lt>>
342(little-endian), that can be appended to most pack() and unpack() template
343characters and groups to force a certain byte-order for that type or group.
344See L<perlfunc/pack> and L<perlpacktut> for details.
345
cf6c151c
RGS
346=head2 C<no VERSION>
347
348You can now use C<no> followed by a version number to specify that you
349want to use a version of perl older than the specified one.
350
351=head2 C<chdir>, C<chmod> and C<chown> on filehandles
352
353C<chdir>, C<chmod> and C<chown> can now work on filehandles as well as
354filenames, if the system supports respectively C<fchdir>, C<fchmod> and
355C<fchown>, thanks to a patch provided by Gisle Aas.
356
357=head2 OS groups
358
359C<$(> and C<$)> now return groups in the order where the OS returns them,
360thanks to Gisle Aas. This wasn't previously the case.
361
362=head2 Recursive sort subs
363
364You can now use recursive subroutines with sort(), thanks to Robin Houston.
365
366=head2 Exceptions in constant folding
367
368The constant folding routine is now wrapped in an exception handler, and
369if folding throws an exception (such as attempting to evaluate 0/0), perl
370now retains the current optree, rather than aborting the whole program.
254a8700
NC
371Without this change, programs would not compile if they had expressions that
372happened to generate exceptions, even though those expressions were in code
373that could never be reached at runtime. (Nicholas Clark, Dave Mitchell)
cf6c151c
RGS
374
375=head2 Source filters in @INC
376
377It's possible to enhance the mechanism of subroutine hooks in @INC by
378adding a source filter on top of the filehandle opened and returned by the
379hook. This feature was planned a long time ago, but wasn't quite working
380until now. See L<perlfunc/require> for details. (Nicholas Clark)
381
382=head2 New internal variables
383
384=over 4
385
386=item C<${^RE_DEBUG_FLAGS}>
387
388This variable controls what debug flags are in effect for the regular
389expression engine when running under C<use re "debug">. See L<re> for
390details.
391
392=item C<${^CHILD_ERROR_NATIVE}>
393
394This variable gives the native status returned by the last pipe close,
395backtick command, successful call to wait() or waitpid(), or from the
396system() operator. See L<perlrun> for details. (Contributed by Gisle Aas.)
397
597bb945
RGS
398=item C<${^RE_TRIE_MAXBUF}>
399
400See L</"Trie optimisation of literal string alternations">.
401
402=item C<${^WIN32_SLOPPY_STAT}>
403
404See L</"Sloppy stat on Windows">.
405
cf6c151c
RGS
406=back
407
408=head2 Miscellaneous
409
410C<unpack()> now defaults to unpacking the C<$_> variable.
411
412C<mkdir()> without arguments now defaults to C<$_>.
413
414The internal dump output has been improved, so that non-printable characters
415such as newline and backspace are output in C<\x> notation, rather than
416octal.
417
418The B<-C> option can no longer be used on the C<#!> line. It wasn't
cba8bf60
RGS
419working there anyway, since the standard streams are already set up
420at this point in the execution of the perl interpreter. You can use
421binmode() instead to get the desired behaviour.
cf6c151c
RGS
422
423=head2 UCD 5.0.0
424
425The copy of the Unicode Character Database included in Perl 5 has
426been updated to version 5.0.0.
427
cf6c151c
RGS
428=head2 MAD
429
254a8700 430MAD, which stands for I<Miscellaneous Attribute Decoration>, is a
cf6c151c
RGS
431still-in-development work leading to a Perl 5 to Perl 6 converter. To
432enable it, it's necessary to pass the argument C<-Dmad> to Configure. The
254a8700 433obtained perl isn't binary compatible with a regular perl 5.10, and has
cf6c151c
RGS
434space and speed penalties; moreover not all regression tests still pass
435with it. (Larry Wall, Nicholas Clark)
436
c7d332a5
RGS
437=head2 kill() on Windows
438
439On Windows platforms, C<kill(-9, $pid)> now kills a process tree.
440(On UNIX, this delivers the signal to all processes in the same process
441group.)
442
597bb945
RGS
443=head1 Incompatible Changes
444
445=head2 Packing and UTF-8 strings
446
447=for XXX update this
448
449The semantics of pack() and unpack() regarding UTF-8-encoded data has been
450changed. Processing is now by default character per character instead of
451byte per byte on the underlying encoding. Notably, code that used things
452like C<pack("a*", $string)> to see through the encoding of string will now
453simply get back the original $string. Packed strings can also get upgraded
454during processing when you store upgraded characters. You can get the old
455behaviour by using C<use bytes>.
456
457To be consistent with pack(), the C<C0> in unpack() templates indicates
458that the data is to be processed in character mode, i.e. character by
459character; on the contrary, C<U0> in unpack() indicates UTF-8 mode, where
460the packed string is processed in its UTF-8-encoded Unicode form on a byte
254a8700
NC
461by byte basis. This is reversed with regard to perl 5.8.X, but now consistent
462between pack() and unpack().
597bb945
RGS
463
464Moreover, C<C0> and C<U0> can also be used in pack() templates to specify
465respectively character and byte modes.
466
467C<C0> and C<U0> in the middle of a pack or unpack format now switch to the
468specified encoding mode, honoring parens grouping. Previously, parens were
469ignored.
470
471Also, there is a new pack() character format, C<W>, which is intended to
472replace the old C<C>. C<C> is kept for unsigned chars coded as bytes in
473the strings internal representation. C<W> represents unsigned (logical)
474character values, which can be greater than 255. It is therefore more
475robust when dealing with potentially UTF-8-encoded data (as C<C> will wrap
476values outside the range 0..255, and not respect the string encoding).
477
478In practice, that means that pack formats are now encoding-neutral, except
479C<C>.
480
481For consistency, C<A> in unpack() format now trims all Unicode whitespace
482from the end of the string. Before perl 5.9.2, it used to strip only the
483classical ASCII space characters.
484
485=head2 Byte/character count feature in unpack()
486
487A new unpack() template character, C<".">, returns the number of bytes or
488characters (depending on the selected encoding mode, see above) read so far.
489
490=head2 The C<$*> and C<$#> variables have been removed
491
492C<$*>, which was deprecated in favor of the C</s> and C</m> regexp
493modifiers, has been removed.
494
495The deprecated C<$#> variable (output format for numbers) has been
496removed.
497
f00638a2 498Two new severe warnings, C<$#/$* is no longer supported>, have been added.
597bb945
RGS
499
500=head2 substr() lvalues are no longer fixed-length
501
502The lvalues returned by the three argument form of substr() used to be a
503"fixed length window" on the original string. In some cases this could
504cause surprising action at distance or other undefined behaviour. Now the
505length of the window adjusts itself to the length of the string assigned to
506it.
507
508=head2 Parsing of C<-f _>
509
510The identifier C<_> is now forced to be a bareword after a filetest
511operator. This solves a number of misparsing issues when a global C<_>
512subroutine is defined.
513
514=head2 C<:unique>
515
516The C<:unique> attribute has been made a no-op, since its current
517implementation was fundamentally flawed and not threadsafe.
518
597bb945
RGS
519=head2 Effect of pragmas in eval
520
521The compile-time value of the C<%^H> hint variable can now propagate into
522eval("")uated code. This makes it more useful to implement lexical
523pragmas.
524
525As a side-effect of this, the overloaded-ness of constants now propagates
526into eval("").
527
528=head2 chdir FOO
529
530A bareword argument to chdir() is now recognized as a file handle.
531Earlier releases interpreted the bareword as a directory name.
532(Gisle Aas)
533
534=head2 Handling of .pmc files
535
536An old feature of perl was that before C<require> or C<use> look for a
537file with a F<.pm> extension, they will first look for a similar filename
538with a F<.pmc> extension. If this file is found, it will be loaded in
539place of any potentially existing file ending in a F<.pm> extension.
540
541Previously, F<.pmc> files were loaded only if more recent than the
542matching F<.pm> file. Starting with 5.9.4, they'll be always loaded if
543they exist.
544
545=head2 @- and @+ in patterns
546
547The special arrays C<@-> and C<@+> are no longer interpolated in regular
548expressions. (Sadahiro Tomoyuki)
549
550=head2 $AUTOLOAD can now be tainted
551
552If you call a subroutine by a tainted name, and if it defers to an
553AUTOLOAD function, then $AUTOLOAD will be (correctly) tainted.
554(Rick Delaney)
555
556=head2 Tainting and printf
557
558When perl is run under taint mode, C<printf()> and C<sprintf()> will now
559reject any tainted format argument. (Rafael Garcia-Suarez)
560
561=head2 undef and signal handlers
562
563Undefining or deleting a signal handler via C<undef $SIG{FOO}> is now
564equivalent to setting it to C<'DEFAULT'>. (Rafael Garcia-Suarez)
565
566=head2 strictures and dereferencing in defined()
567
254a8700 568C<use strict 'refs'> was ignoring taking a hard reference in an argument
597bb945
RGS
569to defined(), as in :
570
254a8700
NC
571 use strict 'refs';
572 my $x = 'foo';
597bb945
RGS
573 if (defined $$x) {...}
574
575This now correctly produces the run-time error C<Can't use string as a
576SCALAR ref while "strict refs" in use>.
577
578C<defined @$foo> and C<defined %$bar> are now also subject to C<strict
579'refs'> (that is, C<$foo> and C<$bar> shall be proper references there.)
580(C<defined(@foo)> and C<defined(%bar)> are discouraged constructs anyway.)
581(Nicholas Clark)
582
583=head2 C<(?p{})> has been removed
584
585The regular expression construct C<(?p{})>, which was deprecated in perl
5865.8, has been removed. Use C<(??{})> instead. (Rafael Garcia-Suarez)
587
588=head2 Pseudo-hashes have been removed
589
590Support for pseudo-hashes has been removed from Perl 5.9. (The C<fields>
591pragma remains here, but uses an alternate implementation.)
592
593=head2 Removal of the bytecode compiler and of perlcc
594
595C<perlcc>, the byteloader and the supporting modules (B::C, B::CC,
596B::Bytecode, etc.) are no longer distributed with the perl sources. Those
597experimental tools have never worked reliably, and, due to the lack of
598volunteers to keep them in line with the perl interpreter developments, it
599was decided to remove them instead of shipping a broken version of those.
600The last version of those modules can be found with perl 5.9.4.
601
602However the B compiler framework stays supported in the perl core, as with
603the more useful modules it has permitted (among others, B::Deparse and
604B::Concise).
605
606=head2 Removal of the JPL
607
ed8ea1b6 608The JPL (Java-Perl Lingo) has been removed from the perl sources tarball.
597bb945
RGS
609
610=head2 Recursive inheritance detected earlier
611
612Perl will now immediately throw an exception if you modify any package's
613C<@ISA> in such a way that it would cause recursive inheritance.
614
615Previously, the exception would not occur until Perl attempted to make
616use of the recursive inheritance while resolving a method or doing a
617C<$foo-E<gt>isa($bar)> lookup.
618
cf6c151c 619=head1 Modules and Pragmata
c0c97549 620
f0e260b8
RGS
621=head2 Pragmata Changes
622
623=over 4
624
625=item C<feature>
626
627The new pragma C<feature> is used to enable new features that might break
628old code. See L</"The C<feature> pragma"> above.
629
630=item C<mro>
631
632This new pragma enables to change the algorithm used to resolve inherited
633methods. See L</"New Pragma, C<mro>"> above.
634
635=item Scoping of the C<sort> pragma
636
637The C<sort> pragma is now lexically scoped. Its effect used to be global.
638
639=item Scoping of C<bignum>, C<bigint>, C<bigrat>
640
641The three numeric pragmas C<bignum>, C<bigint> and C<bigrat> are now
642lexically scoped. (Tels)
643
644=item C<base>
645
646The C<base> pragma now warns if a class tries to inherit from itself.
647(Curtis "Ovid" Poe)
648
649=item C<strict> and C<warnings>
650
651C<strict> and C<warnings> will now complain loudly if they are loaded via
652incorrect casing (as in C<use Strict;>). (Johan Vromans)
653
6601a838
RGS
654=item C<version>
655
656The C<version> module provides support for version objects.
657
f0e260b8
RGS
658=item C<warnings>
659
660The C<warnings> pragma doesn't load C<Carp> anymore. That means that code
661that used C<Carp> routines without having loaded it at compile time might
662need to be adjusted; typically, the following (faulty) code won't work
663anymore, and will require parentheses to be added after the function name:
664
665 use warnings;
666 require Carp;
254a8700 667 Carp::confess 'argh';
f0e260b8
RGS
668
669=item C<less>
670
671C<less> now does something useful (or at least it tries to). In fact, it
672has been turned into a lexical pragma. So, in your modules, you can now
673test whether your users have requested to use less CPU, or less memory,
674less magic, or maybe even less fat. See L<less> for more. (Joshua ben
675Jore)
676
677=back
678
0eece9c0
RGS
679=head2 New modules
680
681=over 4
682
683=item *
684
685C<encoding::warnings>, by Audrey Tang, is a module to emit warnings
686whenever an ASCII character string containing high-bit bytes is implicitly
597bb945
RGS
687converted into UTF-8. It's a lexical pragma since Perl 5.9.4; on older
688perls, its effect is global.
0eece9c0
RGS
689
690=item *
691
692C<Module::CoreList>, by Richard Clamp, is a small handy module that tells
693you what versions of core modules ship with any versions of Perl 5. It
694comes with a command-line frontend, C<corelist>.
695
bd3831ee
RGS
696=item *
697
698C<Math::BigInt::FastCalc> is an XS-enabled, and thus faster, version of
699C<Math::BigInt::Calc>.
700
701=item *
702
703C<Compress::Zlib> is an interface to the zlib compression library. It
704comes with a bundled version of zlib, so having a working zlib is not a
705prerequisite to install it. It's used by C<Archive::Tar> (see below).
706
707=item *
708
709C<IO::Zlib> is an C<IO::>-style interface to C<Compress::Zlib>.
710
711=item *
712
713C<Archive::Tar> is a module to manipulate C<tar> archives.
714
715=item *
716
717C<Digest::SHA> is a module used to calculate many types of SHA digests,
718has been included for SHA support in the CPAN module.
719
720=item *
721
722C<ExtUtils::CBuilder> and C<ExtUtils::ParseXS> have been added.
723
597bb945
RGS
724=item *
725
726C<Hash::Util::FieldHash>, by Anno Siegel, has been added. This module
727provides support for I<field hashes>: hashes that maintain an association
728of a reference with a value, in a thread-safe garbage-collected way.
729Such hashes are useful to implement inside-out objects.
730
731=item *
732
733C<Module::Build>, by Ken Williams, has been added. It's an alternative to
734C<ExtUtils::MakeMaker> to build and install perl modules.
735
736=item *
737
738C<Module::Load>, by Jos Boumans, has been added. It provides a single
739interface to load Perl modules and F<.pl> files.
740
741=item *
742
743C<Module::Loaded>, by Jos Boumans, has been added. It's used to mark
744modules as loaded or unloaded.
745
746=item *
747
748C<Package::Constants>, by Jos Boumans, has been added. It's a simple
749helper to list all constants declared in a given package.
750
751=item *
752
753C<Win32API::File>, by Tye McQueen, has been added (for Windows builds).
754This module provides low-level access to Win32 system API calls for
755files/dirs.
756
f0e260b8
RGS
757=item *
758
759C<Locale::Maketext::Simple>, needed by CPANPLUS, is a simple wrapper around
760C<Locale::Maketext::Lexicon>. Note that C<Locale::Maketext::Lexicon> isn't
761included in the perl core; the behaviour of C<Locale::Maketext::Simple>
762gracefully degrades when the later isn't present.
763
764=item *
765
766C<Params::Check> implements a generic input parsing/checking mechanism. It
767is used by CPANPLUS.
768
769=item *
770
771C<Term::UI> simplifies the task to ask questions at a terminal prompt.
772
773=item *
774
775C<Object::Accessor> provides an interface to create per-object accessors.
776
777=item *
778
779C<Module::Pluggable> is a simple framework to create modules that accept
780pluggable sub-modules.
781
782=item *
783
784C<Module::Load::Conditional> provides simple ways to query and possibly
785load installed modules.
786
787=item *
788
789C<Time::Piece> provides an object oriented interface to time functions,
790overriding the built-ins localtime() and gmtime().
791
792=item *
793
794C<IPC::Cmd> helps to find and run external commands, possibly
795interactively.
796
797=item *
798
799C<File::Fetch> provide a simple generic file fetching mechanism.
800
801=item *
802
803C<Log::Message> and C<Log::Message::Simple> are used by the log facility
804of C<CPANPLUS>.
805
806=item *
807
808C<Archive::Extract> is a generic archive extraction mechanism
809for F<.tar> (plain, gziped or bzipped) or F<.zip> files.
810
811=item *
812
813C<CPANPLUS> provides an API and a command-line tool to access the CPAN
814mirrors.
815
e6746346
SP
816=item *
817
818C<Pod::Escapes> provides utilities that are useful in decoding Pod
819EE<lt>...E<gt> sequences.
820
821=item *
822
823C<Pod::Simple> is now the backend for several of the Pod-related modules
824included with Perl.
825
f0e260b8
RGS
826=back
827
828=head2 Selected Changes to Core Modules
829
830=over 4
831
832=item C<Attribute::Handlers>
833
834C<Attribute::Handlers> can now report the caller's file and line number.
835(David Feldman)
836
837=item C<B::Lint>
838
839C<B::Lint> is now based on C<Module::Pluggable>, and so can be extended
840with plugins. (Joshua ben Jore)
841
842=item C<B>
843
844It's now possible to access the lexical pragma hints (C<%^H>) by using the
845method B::COP::hints_hash(). It returns a C<B::RHE> object, which in turn
846can be used to get a hash reference via the method B::RHE::HASH(). (Joshua
847ben Jore)
848
849=item C<Thread>
850
851As the old 5005thread threading model has been removed, in favor of the
852ithreads scheme, the C<Thread> module is now a compatibility wrapper, to
853be used in old code only. It has been removed from the default list of
854dynamic extensions.
855
0eece9c0
RGS
856=back
857
cf6c151c 858=head1 Utility Changes
c0c97549
RGS
859
860=over 4
861
bd3831ee 862=item perl -d
c0c97549
RGS
863
864The Perl debugger can now save all debugger commands for sourcing later;
865notably, it can now emulate stepping backwards, by restarting and
866rerunning all bar the last command from a saved command history.
867
868It can also display the parent inheritance tree of a given class, with the
869C<i> command.
870
bd3831ee
RGS
871=item ptar
872
292c2b28 873C<ptar> is a pure perl implementation of C<tar> that comes with
bd3831ee
RGS
874C<Archive::Tar>.
875
876=item ptardiff
877
254a8700 878C<ptardiff> is a small utility used to generate a diff between the contents
bd3831ee
RGS
879of a tar archive and a directory tree. Like C<ptar>, it comes with
880C<Archive::Tar>.
881
882=item shasum
883
884C<shasum> is a command-line utility, used to print or to check SHA
885digests. It comes with the new C<Digest::SHA> module.
886
887=item corelist
0eece9c0
RGS
888
889The C<corelist> utility is now installed with perl (see L</"New modules">
890above).
891
bd3831ee 892=item h2ph and h2xs
0eece9c0 893
254a8700 894C<h2ph> and C<h2xs> have been made more robust with regard to
0eece9c0
RGS
895"modern" C code.
896
bd3831ee
RGS
897C<h2xs> implements a new option C<--use-xsloader> to force use of
898C<XSLoader> even in backwards compatible modules.
899
900The handling of authors' names that had apostrophes has been fixed.
901
902Any enums with negative values are now skipped.
903
904=item perlivp
905
906C<perlivp> no longer checks for F<*.ph> files by default. Use the new C<-a>
907option to run I<all> tests.
908
909=item find2perl
0eece9c0
RGS
910
911C<find2perl> now assumes C<-print> as a default action. Previously, it
912needed to be specified explicitly.
913
914Several bugs have been fixed in C<find2perl>, regarding C<-exec> and
915C<-eval>. Also the options C<-path>, C<-ipath> and C<-iname> have been
916added.
917
597bb945
RGS
918=item config_data
919
920C<config_data> is a new utility that comes with C<Module::Build>. It
921provides a command-line interface to the configuration of Perl modules
922that use Module::Build's framework of configurability (that is,
923C<*::ConfigData> modules that contain local configuration information for
924their parent modules.)
925
f00638a2 926=item cpanp
f0e260b8 927
254a8700 928C<cpanp>, the CPANPLUS shell, has been added. (C<cpanp-run-perl>, a
f0e260b8
RGS
929helper for CPANPLUS operation, has been added too, but isn't intended for
930direct use).
931
f00638a2 932=item cpan2dist
f0e260b8 933
292c2b28 934C<cpan2dist> is a new utility that comes with CPANPLUS. It's a tool to
f0e260b8
RGS
935create distributions (or packages) from CPAN modules.
936
f00638a2 937=item pod2html
f0e260b8
RGS
938
939The output of C<pod2html> has been enhanced to be more customizable via
940CSS. Some formatting problems were also corrected. (Jari Aalto)
941
c0c97549
RGS
942=back
943
cf6c151c 944=head1 New Documentation
c0c97549 945
597bb945
RGS
946The L<perlpragma> manpage documents how to write one's own lexical
947pragmas in pure Perl (something that is possible starting with 5.9.4).
948
bd3831ee
RGS
949The new L<perlglossary> manpage is a glossary of terms used in the Perl
950documentation, technical and otherwise, kindly provided by O'Reilly Media,
951Inc.
952
597bb945
RGS
953The L<perlreguts> manpage, courtesy of Yves Orton, describes internals of the
954Perl regular expression engine.
955
62c26f88
RGS
956The L<perlreapi> manpage describes the interface to the perl interpreter
957used to write pluggable regular expression engines (by Ævar Arnfjörð
958Bjarmason).
959
597bb945
RGS
960The L<perlunitut> manpage is an tutorial for programming with Unicode and
961string encodings in Perl, courtesy of Juerd Waalboer.
962
f0e260b8
RGS
963A new manual page, L<perlunifaq> (the Perl Unicode FAQ), has been added
964(Juerd Waalboer).
965
dbef3c66
RGS
966The L<perlcommunity> manpage gives a description of the Perl community
967on the Internet and in real life. (Edgar "Trizor" Bering)
968
f00638a2
RGS
969The L<CORE> manual page documents the C<CORE::> namespace. (Tels)
970
c0c97549
RGS
971The long-existing feature of C</(?{...})/> regexps setting C<$_> and pos()
972is now documented.
973
cf6c151c 974=head1 Performance Enhancements
c0c97549 975
597bb945 976=head2 In-place sorting
0eece9c0 977
c0c97549
RGS
978Sorting arrays in place (C<@a = sort @a>) is now optimized to avoid
979making a temporary copy of the array.
980
0eece9c0
RGS
981Likewise, C<reverse sort ...> is now optimized to sort in reverse,
982avoiding the generation of a temporary intermediate list.
983
597bb945 984=head2 Lexical array access
0eece9c0 985
c0c97549
RGS
986Access to elements of lexical arrays via a numeric constant between 0 and
987255 is now faster. (This used to be only the case for global arrays.)
988
597bb945 989=head2 XS-assisted SWASHGET
bd3831ee
RGS
990
991Some pure-perl code that perl was using to retrieve Unicode properties and
992transliteration mappings has been reimplemented in XS.
993
597bb945 994=head2 Constant subroutines
bd3831ee
RGS
995
996The interpreter internals now support a far more memory efficient form of
997inlineable constants. Storing a reference to a constant value in a symbol
998table is equivalent to a full typeglob referencing a constant subroutine,
999but using about 400 bytes less memory. This proxy constant subroutine is
1000automatically upgraded to a real typeglob with subroutine if necessary.
1001The approach taken is analogous to the existing space optimisation for
1002subroutine stub declarations, which are stored as plain scalars in place
1003of the full typeglob.
1004
1005Several of the core modules have been converted to use this feature for
1006their system dependent constants - as a result C<use POSIX;> now takes about
1007200K less memory.
1008
597bb945 1009=head2 C<PERL_DONT_CREATE_GVSV>
bd3831ee
RGS
1010
1011The new compilation flag C<PERL_DONT_CREATE_GVSV>, introduced as an option
1012in perl 5.8.8, is turned on by default in perl 5.9.3. It prevents perl
1013from creating an empty scalar with every new typeglob. See L<perl588delta>
1014for details.
1015
597bb945 1016=head2 Weak references are cheaper
bd3831ee
RGS
1017
1018Weak reference creation is now I<O(1)> rather than I<O(n)>, courtesy of
1019Nicholas Clark. Weak reference deletion remains I<O(n)>, but if deletion only
1020happens at program exit, it may be skipped completely.
1021
597bb945 1022=head2 sort() enhancements
bd3831ee
RGS
1023
1024Salvador Fandiño provided improvements to reduce the memory usage of C<sort>
1025and to speed up some cases.
1026
597bb945
RGS
1027=head2 Memory optimisations
1028
1029Several internal data structures (typeglobs, GVs, CVs, formats) have been
1030restructured to use less memory. (Nicholas Clark)
1031
1032=head2 UTF-8 cache optimisation
1033
1034The UTF-8 caching code is now more efficient, and used more often.
1035(Nicholas Clark)
1036
1037=head2 Sloppy stat on Windows
1038
1039On Windows, perl's stat() function normally opens the file to determine
1040the link count and update attributes that may have been changed through
1041hard links. Setting ${^WIN32_SLOPPY_STAT} to a true value speeds up
1042stat() by not performing this operation. (Jan Dubois)
1043
597bb945
RGS
1044=head2 Regular expressions optimisations
1045
1046=over 4
1047
1048=item Engine de-recursivised
1049
1050The regular expression engine is no longer recursive, meaning that
1051patterns that used to overflow the stack will either die with useful
1052explanations, or run to completion, which, since they were able to blow
1053the stack before, will likely take a very long time to happen. If you were
1054experiencing the occasional stack overflow (or segfault) and upgrade to
1055discover that now perl apparently hangs instead, look for a degenerate
1056regex. (Dave Mitchell)
1057
1058=item Single char char-classes treated as literals
1059
1060Classes of a single character are now treated the same as if the character
1061had been used as a literal, meaning that code that uses char-classes as an
1062escaping mechanism will see a speedup. (Yves Orton)
1063
1064=item Trie optimisation of literal string alternations
1065
1066Alternations, where possible, are optimised into more efficient matching
1067structures. String literal alternations are merged into a trie and are
1068matched simultaneously. This means that instead of O(N) time for matching
1069N alternations at a given point, the new code performs in O(1) time.
1070A new special variable, ${^RE_TRIE_MAXBUF}, has been added to fine-tune
1071this optimization. (Yves Orton)
1072
1073B<Note:> Much code exists that works around perl's historic poor
1074performance on alternations. Often the tricks used to do so will disable
1075the new optimisations. Hopefully the utility modules used for this purpose
99d59c4d 1076will be educated about these new optimisations.
597bb945
RGS
1077
1078=item Aho-Corasick start-point optimisation
1079
1080When a pattern starts with a trie-able alternation and there aren't
e15dad31 1081better optimisations available, the regex engine will use Aho-Corasick
597bb945
RGS
1082matching to find the start point. (Yves Orton)
1083
0eece9c0
RGS
1084=back
1085
cf6c151c 1086=head1 Installation and Configuration Improvements
c0c97549 1087
597bb945
RGS
1088=head2 Configuration improvements
1089
1090=over 4
1091
1092=item C<-Dusesitecustomize>
bd3831ee 1093
0eece9c0 1094Run-time customization of @INC can be enabled by passing the
597bb945 1095C<-Dusesitecustomize> flag to Configure. When enabled, this will make perl
0eece9c0
RGS
1096run F<$sitelibexp/sitecustomize.pl> before anything else. This script can
1097then be set up to add additional entries to @INC.
1098
597bb945
RGS
1099=item Relocatable installations
1100
1101There is now Configure support for creating a relocatable perl tree. If
1102you Configure with C<-Duserelocatableinc>, then the paths in @INC (and
1103everything else in %Config) can be optionally located via the path of the
1104perl executable.
1105
1106That means that, if the string C<".../"> is found at the start of any
1107path, it's substituted with the directory of $^X. So, the relocation can
1108be configured on a per-directory basis, although the default with
1109C<-Duserelocatableinc> is that everything is relocated. The initial
1110install is done to the original configured prefix.
1111
1112=item strlcat() and strlcpy()
1113
1114The configuration process now detects whether strlcat() and strlcpy() are
1115available. When they are not available, perl's own version is used (from
1116Russ Allbery's public domain implementation). Various places in the perl
1117interpreter now use them. (Steve Peters)
1118
f0e260b8
RGS
1119=item C<d_pseudofork> and C<d_printf_format_null>
1120
1121A new configuration variable, available as C<$Config{d_pseudofork}> in
1122the L<Config> module, has been added, to distinguish real fork() support
1123from fake pseudofork used on Windows platforms.
1124
1125A new configuration variable, C<d_printf_format_null>, has been added,
1126to see if printf-like formats are allowed to be NULL.
1127
1128=item Configure help
1129
1130C<Configure -h> has been extended with the most commonly used options.
1131
597bb945
RGS
1132=back
1133
1134=head2 Compilation improvements
1135
1136=over 4
1137
1138=item Parallel build
0eece9c0 1139
bd3831ee
RGS
1140Parallel makes should work properly now, although there may still be problems
1141if C<make test> is instructed to run in parallel.
1142
597bb945
RGS
1143=item Borland's compilers support
1144
bd3831ee
RGS
1145Building with Borland's compilers on Win32 should work more smoothly. In
1146particular Steve Hay has worked to side step many warnings emitted by their
1147compilers and at least one C compiler internal error.
1148
597bb945
RGS
1149=item Static build on Windows
1150
f0e260b8
RGS
1151Perl extensions on Windows now can be statically built into the Perl DLL.
1152
1153Also, it's now possible to build a C<perl-static.exe> that doesn't depend
1154on the Perl DLL on Win32. See the Win32 makefiles for details.
1155(Vadim Konovalov)
bd3831ee 1156
69d2c521 1157=item ppport.h files
597bb945
RGS
1158
1159All F<ppport.h> files in the XS modules bundled with perl are now
1160autogenerated at build time. (Marcus Holland-Moritz)
1161
f0e260b8
RGS
1162=item C++ compatibility
1163
1164Efforts have been made to make perl and the core XS modules compilable
1165with various C++ compilers (although the situation is not perfect with
1166some of the compilers on some of the platforms tested.)
1167
597bb945
RGS
1168=item Support for Microsoft 64-bit compiler
1169
1170Support for building perl with Microsoft's 64-bit compiler has been
1171improved. (ActiveState)
1172
f0e260b8
RGS
1173=item Visual C++
1174
c01f0d41 1175Perl can now be compiled with Microsoft Visual C++ 2005 (and 2008 Beta 2).
f0e260b8
RGS
1176
1177=item Win32 builds
1178
1179All win32 builds (MS-Win, WinCE) have been merged and cleaned up.
1180
597bb945
RGS
1181=back
1182
1183=head2 Installation improvements
1184
1185=over 4
1186
1187=item Module auxiliary files
1188
1189README files and changelogs for CPAN modules bundled with perl are no
1190longer installed.
1191
1192=back
1193
bd3831ee
RGS
1194=head2 New Or Improved Platforms
1195
597bb945 1196Perl has been reported to work on Symbian OS. See L<perlsymbian> for more
bd3831ee
RGS
1197information.
1198
597bb945
RGS
1199Many improvements have been made towards making Perl work correctly on
1200z/OS.
1201
f0e260b8 1202Perl has been reported to work on DragonFlyBSD and MidnightBSD.
597bb945 1203
bd3831ee
RGS
1204The VMS port has been improved. See L<perlvms>.
1205
d43695a1
RGS
1206Support for Cray XT4 Catamount/Qk has been added. See
1207F<hints/catamount.sh> in the source code distribution for more
1208information.
bd3831ee 1209
f0e260b8
RGS
1210Vendor patches have been merged for RedHat and Gentoo.
1211
1212DynaLoader::dl_unload_file() now works on Windows.
bd3831ee 1213
cf6c151c 1214=head1 Selected Bug Fixes
c0c97549 1215
bd3831ee
RGS
1216=over 4
1217
1218=item strictures in regexp-eval blocks
1219
c0c97549
RGS
1220C<strict> wasn't in effect in regexp-eval blocks (C</(?{...})/>).
1221
bd3831ee
RGS
1222=item Calling CORE::require()
1223
1224CORE::require() and CORE::do() were always parsed as require() and do()
1225when they were overridden. This is now fixed.
1226
1227=item Subscripts of slices
1228
1229You can now use a non-arrowed form for chained subscripts after a list
1230slice, like in:
1231
1232 ({foo => "bar"})[0]{foo}
1233
1234This used to be a syntax error; a C<< -> >> was required.
1235
1236=item C<no warnings 'category'> works correctly with -w
1237
1238Previously when running with warnings enabled globally via C<-w>, selective
1239disabling of specific warning categories would actually turn off all warnings.
1240This is now fixed; now C<no warnings 'io';> will only turn off warnings in the
1241C<io> class. Previously it would erroneously turn off all warnings.
1242
597bb945 1243=item threads improvements
bd3831ee
RGS
1244
1245Several memory leaks in ithreads were closed. Also, ithreads were made
1246less memory-intensive.
1247
597bb945
RGS
1248C<threads> is now a dual-life module, also available on CPAN. It has been
1249expanded in many ways. A kill() method is available for thread signalling.
1250One can get thread status, or the list of running or joinable threads.
1251
1252A new C<< threads->exit() >> method is used to exit from the application
1253(this is the default for the main thread) or from the current thread only
1254(this is the default for all other threads). On the other hand, the exit()
1255built-in now always causes the whole application to terminate. (Jerry
1256D. Hedden)
1257
bd3831ee
RGS
1258=item chr() and negative values
1259
1260chr() on a negative value now gives C<\x{FFFD}>, the Unicode replacement
1261character, unless when the C<bytes> pragma is in effect, where the low
1262eight bytes of the value are used.
1263
597bb945
RGS
1264=item PERL5SHELL and tainting
1265
1266On Windows, the PERL5SHELL environment variable is now checked for
1267taintedness. (Rafael Garcia-Suarez)
1268
1269=item Using *FILE{IO}
1270
1271C<stat()> and C<-X> filetests now treat *FILE{IO} filehandles like *FILE
1272filehandles. (Steve Peters)
1273
1274=item Overloading and reblessing
1275
1276Overloading now works when references are reblessed into another class.
1277Internally, this has been implemented by moving the flag for "overloading"
1278from the reference to the referent, which logically is where it should
1279always have been. (Nicholas Clark)
1280
1281=item Overloading and UTF-8
1282
1283A few bugs related to UTF-8 handling with objects that have
1284stringification overloaded have been fixed. (Nicholas Clark)
1285
1286=item eval memory leaks fixed
1287
1288Traditionally, C<eval 'syntax error'> has leaked badly. Many (but not all)
1289of these leaks have now been eliminated or reduced. (Dave Mitchell)
1290
1291=item Random device on Windows
1292
1293In previous versions, perl would read the file F</dev/urandom> if it
1294existed when seeding its random number generator. That file is unlikely
1295to exist on Windows, and if it did would probably not contain appropriate
1296data, so perl no longer tries to read it on Windows. (Alex Davies)
1297
1298=item PERLIO_DEBUG
1299
254a8700 1300The C<PERLIO_DEBUG> environment variable no longer has any effect for
597bb945
RGS
1301setuid scripts and for scripts run with B<-T>.
1302
1303Moreover, with a thread-enabled perl, using C<PERLIO_DEBUG> could lead to
1304an internal buffer overflow. This has been fixed.
1305
f0e260b8
RGS
1306=item PerlIO::scalar and read-only scalars
1307
1308PerlIO::scalar will now prevent writing to read-only scalars. Moreover,
1309seek() is now supported with PerlIO::scalar-based filehandles, the
1310underlying string being zero-filled as needed. (Rafael, Jarkko Hietaniemi)
1311
1312=item study() and UTF-8
1313
1314study() never worked for UTF-8 strings, but could lead to false results.
1315It's now a no-op on UTF-8 data. (Yves Orton)
1316
1317=item Critical signals
1318
1319The signals SIGILL, SIGBUS and SIGSEGV are now always delivered in an
1320"unsafe" manner (contrary to other signals, that are deferred until the
1321perl interpreter reaches a reasonably stable state; see
1322L<perlipc/"Deferred Signals (Safe Signals)">). (Rafael)
1323
1324=item @INC-hook fix
1325
1326When a module or a file is loaded through an @INC-hook, and when this hook
1327has set a filename entry in %INC, __FILE__ is now set for this module
1328accordingly to the contents of that %INC entry. (Rafael)
1329
1330=item C<-t> switch fix
1331
1332The C<-w> and C<-t> switches can now be used together without messing
254a8700 1333up which categories of warnings are activated. (Rafael)
f0e260b8
RGS
1334
1335=item Duping UTF-8 filehandles
1336
1337Duping a filehandle which has the C<:utf8> PerlIO layer set will now
1338properly carry that layer on the duped filehandle. (Rafael)
1339
1340=item Localisation of hash elements
1341
292c2b28 1342Localizing a hash element whose key was given as a variable didn't work
f0e260b8
RGS
1343correctly if the variable was changed while the local() was in effect (as
1344in C<local $h{$x}; ++$x>). (Bo Lindbergh)
1345
bd3831ee 1346=back
0eece9c0 1347
cf6c151c 1348=head1 New or Changed Diagnostics
c0c97549 1349
bd3831ee
RGS
1350=over 4
1351
d43695a1
RGS
1352=item Use of uninitialized value
1353
1354Perl will now try to tell you the name of the variable (if any) that was
1355undefined.
1356
bd3831ee
RGS
1357=item Deprecated use of my() in false conditional
1358
c0c97549
RGS
1359A new deprecation warning, I<Deprecated use of my() in false conditional>,
1360has been added, to warn against the use of the dubious and deprecated
1361construct
1362
1363 my $x if 0;
1364
1365See L<perldiag>. Use C<state> variables instead.
1366
bd3831ee
RGS
1367=item !=~ should be !~
1368
0eece9c0
RGS
1369A new warning, C<!=~ should be !~>, is emitted to prevent this misspelling
1370of the non-matching operator.
1371
bd3831ee
RGS
1372=item Newline in left-justified string
1373
0eece9c0
RGS
1374The warning I<Newline in left-justified string> has been removed.
1375
bd3831ee
RGS
1376=item Too late for "-T" option
1377
0eece9c0
RGS
1378The error I<Too late for "-T" option> has been reformulated to be more
1379descriptive.
1380
bd3831ee
RGS
1381=item "%s" variable %s masks earlier declaration
1382
1383This warning is now emitted in more consistent cases; in short, when one
1384of the declarations involved is a C<my> variable:
1385
1386 my $x; my $x; # warns
1387 my $x; our $x; # warns
1388 our $x; my $x; # warns
1389
1390On the other hand, the following:
1391
1392 our $x; our $x;
1393
1394now gives a C<"our" variable %s redeclared> warning.
1395
1396=item readdir()/closedir()/etc. attempted on invalid dirhandle
1397
1398These new warnings are now emitted when a dirhandle is used but is
1399either closed or not really a dirhandle.
1400
f0e260b8
RGS
1401=item Opening dirhandle/filehandle %s also as a file/directory
1402
1403Two deprecation warnings have been added: (Rafael)
1404
1405 Opening dirhandle %s also as a file
1406 Opening filehandle %s also as a directory
1407
f00638a2
RGS
1408=item Use of -P is deprecated
1409
1410Perl's command-line switch C<-P> is now deprecated.
1411
6601a838
RGS
1412=item v-string in use/require is non-portable
1413
1414Perl will warn you against potential backwards compatibility problems with
1415the C<use VERSION> syntax.
1416
bd3831ee
RGS
1417=item perl -V
1418
0eece9c0
RGS
1419C<perl -V> has several improvements, making it more useable from shell
1420scripts to get the value of configuration variables. See L<perlrun> for
1421details.
1422
bd3831ee
RGS
1423=back
1424
cf6c151c 1425=head1 Changed Internals
c0c97549 1426
16993b2e
JH
1427In general, the source code of perl has been refactored, tidied up,
1428and optimized in many places. Also, memory management and allocation
1429has been improved in several points.
1430
1431When compiling the perl core with gcc, as many gcc warning flags are
1432turned on as is possible on the platform. (This quest for cleanliness
1433doesn't extend to XS code because we cannot guarantee the tidiness of
1434code we didn't write.) Similar strictness flags have been added or
1435tightened for various other C compilers.
bd3831ee 1436
c0c97549
RGS
1437=head2 Reordering of SVt_* constants
1438
1439The relative ordering of constants that define the various types of C<SV>
1440have changed; in particular, C<SVt_PVGV> has been moved before C<SVt_PVLV>,
1441C<SVt_PVAV>, C<SVt_PVHV> and C<SVt_PVCV>. This is unlikely to make any
1442difference unless you have code that explicitly makes assumptions about that
1443ordering. (The inheritance hierarchy of C<B::*> objects has been changed
1444to reflect this.)
1445
254a8700
NC
1446=head2 Elimination of SVt_PVBM
1447
1448Related to this, the internal type C<SVt_PVBM> has been been removed. This
1449dedicated type of C<SV> was used by the C<index> operator and parts of the
1450regexp engine to facilitate fast Boyer-Moore matches. Its use internally has
1451been replaced by C<SV>s of type C<SVt_PVGV>.
1452
1453=head2 New type SVt_BIND
1454
1455A new type C<SVt_BIND> has been added, in readiness for the project to
1456implement Perl 6 on 5. There deliberately is no implementation yet, and
1457they cannot yet be created or destroyed.
1458
c0c97549
RGS
1459=head2 Removal of CPP symbols
1460
1461The C preprocessor symbols C<PERL_PM_APIVERSION> and
1462C<PERL_XS_APIVERSION>, which were supposed to give the version number of
1463the oldest perl binary-compatible (resp. source-compatible) with the
1464present one, were not used, and sometimes had misleading values. They have
1465been removed.
1466
1467=head2 Less space is used by ops
1468
1469The C<BASEOP> structure now uses less space. The C<op_seq> field has been
254a8700 1470removed and replaced by a single bit bit-field C<op_opt>. C<op_type> is now 9
c0c97549
RGS
1471bits long. (Consequently, the C<B::OP> class doesn't provide an C<seq>
1472method anymore.)
1473
1474=head2 New parser
1475
1476perl's parser is now generated by bison (it used to be generated by
1477byacc.) As a result, it seems to be a bit more robust.
1478
bd3831ee
RGS
1479Also, Dave Mitchell improved the lexer debugging output under C<-DT>.
1480
1481=head2 Use of C<const>
1482
1483Andy Lester supplied many improvements to determine which function
1484parameters and local variables could actually be declared C<const> to the C
1485compiler. Steve Peters provided new C<*_set> macros and reworked the core to
1486use these rather than assigning to macros in LVALUE context.
1487
1488=head2 Mathoms
1489
1490A new file, F<mathoms.c>, has been added. It contains functions that are
1491no longer used in the perl core, but that remain available for binary or
1492source compatibility reasons. However, those functions will not be
1493compiled in if you add C<-DNO_MATHOMS> in the compiler flags.
1494
1495=head2 C<AvFLAGS> has been removed
1496
1497The C<AvFLAGS> macro has been removed.
1498
1499=head2 C<av_*> changes
1500
1501The C<av_*()> functions, used to manipulate arrays, no longer accept null
1502C<AV*> parameters.
1503
597bb945
RGS
1504=head2 $^H and %^H
1505
1506The implementation of the special variables $^H and %^H has changed, to
254a8700 1507allow implementing lexical pragmas in pure Perl.
597bb945 1508
bd3831ee
RGS
1509=head2 B:: modules inheritance changed
1510
1511The inheritance hierarchy of C<B::> modules has changed; C<B::NV> now
1512inherits from C<B::SV> (it used to inherit from C<B::IV>).
1513
f0e260b8
RGS
1514=head2 Anonymous hash and array constructors
1515
1516The anonymous hash and array constructors now take 1 op in the optree
1517instead of 3, now that pp_anonhash and pp_anonlist return a reference to
1518an hash/array when the op is flagged with OPf_SPECIAL (Nicholas Clark).
1519
cf6c151c 1520=head1 Known Problems
c0c97549
RGS
1521
1522There's still a remaining problem in the implementation of the lexical
1523C<$_>: it doesn't work inside C</(?{...})/> blocks. (See the TODO test in
1524F<t/op/mydef.t>.)
1525
cf6c151c 1526=head1 Platform Specific Problems
c0c97549 1527
cf6c151c
RGS
1528=head1 Reporting Bugs
1529
1530=head1 SEE ALSO
1531
1532The F<Changes> file and the perl590delta to perl595delta man pages for
1533exhaustive details on what changed.
1534
1535The F<INSTALL> file for how to build Perl.
1536
1537The F<README> file for general stuff.
1538
1539The F<Artistic> and F<Copying> files for copyright information.
1540
1541=cut