This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
PerlFAQ sync
[perl5.git] / pod / perl5100delta.pod
CommitLineData
81663c3a
RGS
1=encoding utf8
2
cf6c151c
RGS
3=head1 NAME
4
5perldelta - what is new for perl 5.10.0
6
7=head1 DESCRIPTION
8
9This document describes the differences between the 5.8.8 release and
10the 5.10.0 release.
11
12Many of the bug fixes in 5.10.0 were already seen in the 5.8.X maintenance
13releases; they are not duplicated here and are documented in the set of
14man pages named perl58[1-8]?delta.
15
cf6c151c
RGS
16=head1 Core Enhancements
17
18=head2 The C<feature> pragma
19
20The C<feature> pragma is used to enable new syntax that would break Perl's
21backwards-compatibility with older releases of the language. It's a lexical
22pragma, like C<strict> or C<warnings>.
23
24Currently the following new features are available: C<switch> (adds a
25switch statement), C<say> (adds a C<say> built-in function), and C<state>
292c2b28 26(adds a C<state> keyword for declaring "static" variables). Those
cf6c151c
RGS
27features are described in their own sections of this document.
28
29The C<feature> pragma is also implicitly loaded when you require a minimal
30perl version (with the C<use VERSION> construct) greater than, or equal
31to, 5.9.5. See L<feature> for details.
32
33=head2 New B<-E> command-line switch
34
35B<-E> is equivalent to B<-e>, but it implicitly enables all
36optional features (like C<use feature ":5.10">).
37
38=head2 Defined-or operator
39
40A new operator C<//> (defined-or) has been implemented.
dbef3c66 41The following expression:
cf6c151c
RGS
42
43 $a // $b
44
45is merely equivalent to
46
47 defined $a ? $a : $b
48
dbef3c66 49and the statement
cf6c151c
RGS
50
51 $c //= $d;
52
53can now be used instead of
54
55 $c = $d unless defined $c;
56
57The C<//> operator has the same precedence and associativity as C<||>.
58Special care has been taken to ensure that this operator Do What You Mean
59while not breaking old code, but some edge cases involving the empty
60regular expression may now parse differently. See L<perlop> for
61details.
62
63=head2 Switch and Smart Match operator
64
65Perl 5 now has a switch statement. It's available when C<use feature
66'switch'> is in effect. This feature introduces three new keywords,
67C<given>, C<when>, and C<default>:
68
69 given ($foo) {
70 when (/^abc/) { $abc = 1; }
71 when (/^def/) { $def = 1; }
72 when (/^xyz/) { $xyz = 1; }
73 default { $nothing = 1; }
74 }
75
76A more complete description of how Perl matches the switch variable
77against the C<when> conditions is given in L<perlsyn/"Switch statements">.
78
79This kind of match is called I<smart match>, and it's also possible to use
80it outside of switch statements, via the new C<~~> operator. See
81L<perlsyn/"Smart matching in detail">.
82
83This feature was contributed by Robin Houston.
84
85=head2 Regular expressions
86
87=over 4
88
89=item Recursive Patterns
90
91It is now possible to write recursive patterns without using the C<(??{})>
92construct. This new way is more efficient, and in many cases easier to
93read.
94
95Each capturing parenthesis can now be treated as an independent pattern
96that can be entered by using the C<(?PARNO)> syntax (C<PARNO> standing for
97"parenthesis number"). For example, the following pattern will match
98nested balanced angle brackets:
99
100 /
101 ^ # start of line
102 ( # start capture buffer 1
103 < # match an opening angle bracket
104 (?: # match one of:
105 (?> # don't backtrack over the inside of this group
106 [^<>]+ # one or more non angle brackets
107 ) # end non backtracking group
108 | # ... or ...
109 (?1) # recurse to bracket 1 and try it again
110 )* # 0 or more times.
111 > # match a closing angle bracket
112 ) # end capture buffer one
113 $ # end of line
114 /x
115
e15dad31
JC
116PCRE users should note that Perl's recursive regex feature allows
117backtracking into a recursed pattern, whereas in PCRE the recursion is
118atomic or "possessive" in nature. As in the example above, you can
119add (?>) to control this selectively. (Yves Orton)
cf6c151c
RGS
120
121=item Named Capture Buffers
122
123It is now possible to name capturing parenthesis in a pattern and refer to
124the captured contents by name. The naming syntax is C<< (?<NAME>....) >>.
125It's possible to backreference to a named buffer with the C<< \k<NAME> >>
126syntax. In code, the new magical hashes C<%+> and C<%-> can be used to
127access the contents of the capture buffers.
128
e15dad31 129Thus, to replace all doubled chars with a single copy, one could write
cf6c151c
RGS
130
131 s/(?<letter>.)\k<letter>/$+{letter}/g
132
133Only buffers with defined contents will be "visible" in the C<%+> hash, so
134it's possible to do something like
135
136 foreach my $name (keys %+) {
137 print "content of buffer '$name' is $+{$name}\n";
138 }
139
140The C<%-> hash is a bit more complete, since it will contain array refs
141holding values from all capture buffers similarly named, if there should
142be many of them.
143
144C<%+> and C<%-> are implemented as tied hashes through the new module
145C<Tie::Hash::NamedCapture>.
146
147Users exposed to the .NET regex engine will find that the perl
148implementation differs in that the numerical ordering of the buffers
149is sequential, and not "unnamed first, then named". Thus in the pattern
150
151 /(A)(?<B>B)(C)(?<D>D)/
152
153$1 will be 'A', $2 will be 'B', $3 will be 'C' and $4 will be 'D' and not
154$1 is 'A', $2 is 'C' and $3 is 'B' and $4 is 'D' that a .NET programmer
155would expect. This is considered a feature. :-) (Yves Orton)
156
157=item Possessive Quantifiers
158
159Perl now supports the "possessive quantifier" syntax of the "atomic match"
160pattern. Basically a possessive quantifier matches as much as it can and never
161gives any back. Thus it can be used to control backtracking. The syntax is
162similar to non-greedy matching, except instead of using a '?' as the modifier
163the '+' is used. Thus C<?+>, C<*+>, C<++>, C<{min,max}+> are now legal
164quantifiers. (Yves Orton)
165
166=item Backtracking control verbs
167
168The regex engine now supports a number of special-purpose backtrack
169control verbs: (*THEN), (*PRUNE), (*MARK), (*SKIP), (*COMMIT), (*FAIL)
170and (*ACCEPT). See L<perlre> for their descriptions. (Yves Orton)
171
172=item Relative backreferences
173
174A new syntax C<\g{N}> or C<\gN> where "N" is a decimal integer allows a
175safer form of back-reference notation as well as allowing relative
176backreferences. This should make it easier to generate and embed patterns
177that contain backreferences. See L<perlre/"Capture buffers">. (Yves Orton)
178
179=item C<\K> escape
180
181The functionality of Jeff Pinyan's module Regexp::Keep has been added to
254a8700 182the core. In regular expressions you can now use the special escape C<\K>
cf6c151c
RGS
183as a way to do something like floating length positive lookbehind. It is
184also useful in substitutions like:
185
186 s/(foo)bar/$1/g
187
188that can now be converted to
189
190 s/foo\Kbar//g
191
192which is much more efficient. (Yves Orton)
193
194=item Vertical and horizontal whitespace, and linebreak
195
292c2b28 196Regular expressions now recognize the C<\v> and C<\h> escapes that match
cf6c151c
RGS
197vertical and horizontal whitespace, respectively. C<\V> and C<\H>
198logically match their complements.
199
200C<\R> matches a generic linebreak, that is, vertical whitespace, plus
201the multi-character sequence C<"\x0D\x0A">.
202
203=back
204
205=head2 C<say()>
206
207say() is a new built-in, only available when C<use feature 'say'> is in
208effect, that is similar to print(), but that implicitly appends a newline
209to the printed string. See L<perlfunc/say>. (Robin Houston)
210
211=head2 Lexical C<$_>
212
213The default variable C<$_> can now be lexicalized, by declaring it like
214any other lexical variable, with a simple
215
216 my $_;
217
218The operations that default on C<$_> will use the lexically-scoped
219version of C<$_> when it exists, instead of the global C<$_>.
220
221In a C<map> or a C<grep> block, if C<$_> was previously my'ed, then the
222C<$_> inside the block is lexical as well (and scoped to the block).
223
224In a scope where C<$_> has been lexicalized, you can still have access to
225the global version of C<$_> by using C<$::_>, or, more simply, by
597bb945 226overriding the lexical declaration with C<our $_>. (Rafael Garcia-Suarez)
cf6c151c
RGS
227
228=head2 The C<_> prototype
229
254a8700
NC
230A new prototype character has been added. C<_> is equivalent to C<$> but
231defaults to C<$_> if the corresponding argument isn't supplied. (both C<$>
3d9f6fa1 232and C<_> denote a scalar). Due to the optional nature of the argument, you
254a8700 233can only use it at the end of a prototype, or before a semicolon.
cf6c151c
RGS
234
235This has a small incompatible consequence: the prototype() function has
236been adjusted to return C<_> for some built-ins in appropriate cases (for
237example, C<prototype('CORE::rmdir')>). (Rafael Garcia-Suarez)
238
239=head2 UNITCHECK blocks
240
241C<UNITCHECK>, a new special code block has been introduced, in addition to
242C<BEGIN>, C<CHECK>, C<INIT> and C<END>.
243
244C<CHECK> and C<INIT> blocks, while useful for some specialized purposes,
245are always executed at the transition between the compilation and the
246execution of the main program, and thus are useless whenever code is
247loaded at runtime. On the other hand, C<UNITCHECK> blocks are executed
248just after the unit which defined them has been compiled. See L<perlmod>
249for more information. (Alex Gough)
250
251=head2 New Pragma, C<mro>
252
253A new pragma, C<mro> (for Method Resolution Order) has been added. It
254permits to switch, on a per-class basis, the algorithm that perl uses to
dbef3c66 255find inherited methods in case of a multiple inheritance hierarchy. The
cf6c151c
RGS
256default MRO hasn't changed (DFS, for Depth First Search). Another MRO is
257available: the C3 algorithm. See L<mro> for more information.
258(Brandon Black)
259
dbef3c66 260Note that, due to changes in the implementation of class hierarchy search,
cf6c151c
RGS
261code that used to undef the C<*ISA> glob will most probably break. Anyway,
262undef'ing C<*ISA> had the side-effect of removing the magic on the @ISA
263array and should not have been done in the first place.
264
3de67921
JD
265=head2 readdir() may return a "short filename" on Windows
266
267The readdir() function may return a "short filename" when the long
268filename contains characters outside the ANSI codepage. Similarly
269Cwd::cwd() may return a short directory name, and glob() may return short
270names as well. On the NTFS file system these short names can always be
271represented in the ANSI codepage. This will not be true for all other file
272system drivers; e.g. the FAT filesystem stores short filenames in the OEM
273codepage, so some files on FAT volumes remain unaccessible through the
274ANSI APIs.
275
276Similarly, $^X, @INC, and $ENV{PATH} are preprocessed at startup to make
277sure all paths are valid in the ANSI codepage (if possible).
278
279The Win32::GetLongPathName() function now returns the UTF-8 encoded
280correct long file name instead of using replacement characters to force
281the name into the ANSI codepage. The new Win32::GetANSIPathName()
282function can be used to turn a long pathname into a short one only if the
283long one cannot be represented in the ANSI codepage.
284
285Many other functions in the C<Win32> module have been improved to accept
286UTF-8 encoded arguments. Please see L<Win32> for details.
287
cf6c151c
RGS
288=head2 readpipe() is now overridable
289
290The built-in function readpipe() is now overridable. Overriding it permits
291also to override its operator counterpart, C<qx//> (a.k.a. C<``>).
292Moreover, it now defaults to C<$_> if no argument is provided. (Rafael
293Garcia-Suarez)
294
597bb945 295=head2 Default argument for readline()
cf6c151c
RGS
296
297readline() now defaults to C<*ARGV> if no argument is provided. (Rafael
298Garcia-Suarez)
299
300=head2 state() variables
301
302A new class of variables has been introduced. State variables are similar
303to C<my> variables, but are declared with the C<state> keyword in place of
304C<my>. They're visible only in their lexical scope, but their value is
305persistent: unlike C<my> variables, they're not undefined at scope entry,
306but retain their previous value. (Rafael Garcia-Suarez, Nicholas Clark)
307
308To use state variables, one needs to enable them by using
309
254a8700 310 use feature 'state';
cf6c151c
RGS
311
312or by using the C<-E> command-line switch in one-liners.
313See L<perlsub/"Persistent variables via state()">.
314
315=head2 Stacked filetest operators
316
317As a new form of syntactic sugar, it's now possible to stack up filetest
318operators. You can now write C<-f -w -x $file> in a row to mean
319C<-x $file && -w _ && -f _>. See L<perlfunc/-X>.
320
321=head2 UNIVERSAL::DOES()
322
323The C<UNIVERSAL> class has a new method, C<DOES()>. It has been added to
324solve semantic problems with the C<isa()> method. C<isa()> checks for
325inheritance, while C<DOES()> has been designed to be overridden when
326module authors use other types of relations between classes (in addition
327to inheritance). (chromatic)
328
329See L<< UNIVERSAL/"$obj->DOES( ROLE )" >>.
330
cf6c151c
RGS
331=head2 Formats
332
333Formats were improved in several ways. A new field, C<^*>, can be used for
334variable-width, one-line-at-a-time text. Null characters are now handled
335correctly in picture lines. Using C<@#> and C<~~> together will now
336produce a compile-time error, as those format fields are incompatible.
337L<perlform> has been improved, and miscellaneous bugs fixed.
338
339=head2 Byte-order modifiers for pack() and unpack()
340
341There are two new byte-order modifiers, C<E<gt>> (big-endian) and C<E<lt>>
342(little-endian), that can be appended to most pack() and unpack() template
343characters and groups to force a certain byte-order for that type or group.
344See L<perlfunc/pack> and L<perlpacktut> for details.
345
cf6c151c
RGS
346=head2 C<no VERSION>
347
348You can now use C<no> followed by a version number to specify that you
349want to use a version of perl older than the specified one.
350
351=head2 C<chdir>, C<chmod> and C<chown> on filehandles
352
353C<chdir>, C<chmod> and C<chown> can now work on filehandles as well as
354filenames, if the system supports respectively C<fchdir>, C<fchmod> and
355C<fchown>, thanks to a patch provided by Gisle Aas.
356
357=head2 OS groups
358
359C<$(> and C<$)> now return groups in the order where the OS returns them,
360thanks to Gisle Aas. This wasn't previously the case.
361
362=head2 Recursive sort subs
363
364You can now use recursive subroutines with sort(), thanks to Robin Houston.
365
366=head2 Exceptions in constant folding
367
368The constant folding routine is now wrapped in an exception handler, and
369if folding throws an exception (such as attempting to evaluate 0/0), perl
370now retains the current optree, rather than aborting the whole program.
254a8700
NC
371Without this change, programs would not compile if they had expressions that
372happened to generate exceptions, even though those expressions were in code
373that could never be reached at runtime. (Nicholas Clark, Dave Mitchell)
cf6c151c
RGS
374
375=head2 Source filters in @INC
376
377It's possible to enhance the mechanism of subroutine hooks in @INC by
378adding a source filter on top of the filehandle opened and returned by the
379hook. This feature was planned a long time ago, but wasn't quite working
380until now. See L<perlfunc/require> for details. (Nicholas Clark)
381
382=head2 New internal variables
383
384=over 4
385
386=item C<${^RE_DEBUG_FLAGS}>
387
388This variable controls what debug flags are in effect for the regular
389expression engine when running under C<use re "debug">. See L<re> for
390details.
391
392=item C<${^CHILD_ERROR_NATIVE}>
393
394This variable gives the native status returned by the last pipe close,
395backtick command, successful call to wait() or waitpid(), or from the
396system() operator. See L<perlrun> for details. (Contributed by Gisle Aas.)
397
597bb945
RGS
398=item C<${^RE_TRIE_MAXBUF}>
399
400See L</"Trie optimisation of literal string alternations">.
401
402=item C<${^WIN32_SLOPPY_STAT}>
403
404See L</"Sloppy stat on Windows">.
405
cf6c151c
RGS
406=back
407
408=head2 Miscellaneous
409
410C<unpack()> now defaults to unpacking the C<$_> variable.
411
412C<mkdir()> without arguments now defaults to C<$_>.
413
414The internal dump output has been improved, so that non-printable characters
415such as newline and backspace are output in C<\x> notation, rather than
416octal.
417
418The B<-C> option can no longer be used on the C<#!> line. It wasn't
cba8bf60
RGS
419working there anyway, since the standard streams are already set up
420at this point in the execution of the perl interpreter. You can use
421binmode() instead to get the desired behaviour.
cf6c151c
RGS
422
423=head2 UCD 5.0.0
424
425The copy of the Unicode Character Database included in Perl 5 has
426been updated to version 5.0.0.
427
cf6c151c
RGS
428=head2 MAD
429
254a8700 430MAD, which stands for I<Miscellaneous Attribute Decoration>, is a
cf6c151c
RGS
431still-in-development work leading to a Perl 5 to Perl 6 converter. To
432enable it, it's necessary to pass the argument C<-Dmad> to Configure. The
254a8700 433obtained perl isn't binary compatible with a regular perl 5.10, and has
cf6c151c
RGS
434space and speed penalties; moreover not all regression tests still pass
435with it. (Larry Wall, Nicholas Clark)
436
c7d332a5
RGS
437=head2 kill() on Windows
438
439On Windows platforms, C<kill(-9, $pid)> now kills a process tree.
440(On UNIX, this delivers the signal to all processes in the same process
441group.)
442
597bb945
RGS
443=head1 Incompatible Changes
444
445=head2 Packing and UTF-8 strings
446
597bb945
RGS
447The semantics of pack() and unpack() regarding UTF-8-encoded data has been
448changed. Processing is now by default character per character instead of
449byte per byte on the underlying encoding. Notably, code that used things
450like C<pack("a*", $string)> to see through the encoding of string will now
451simply get back the original $string. Packed strings can also get upgraded
452during processing when you store upgraded characters. You can get the old
453behaviour by using C<use bytes>.
454
455To be consistent with pack(), the C<C0> in unpack() templates indicates
456that the data is to be processed in character mode, i.e. character by
457character; on the contrary, C<U0> in unpack() indicates UTF-8 mode, where
458the packed string is processed in its UTF-8-encoded Unicode form on a byte
254a8700
NC
459by byte basis. This is reversed with regard to perl 5.8.X, but now consistent
460between pack() and unpack().
597bb945
RGS
461
462Moreover, C<C0> and C<U0> can also be used in pack() templates to specify
463respectively character and byte modes.
464
465C<C0> and C<U0> in the middle of a pack or unpack format now switch to the
466specified encoding mode, honoring parens grouping. Previously, parens were
467ignored.
468
469Also, there is a new pack() character format, C<W>, which is intended to
470replace the old C<C>. C<C> is kept for unsigned chars coded as bytes in
471the strings internal representation. C<W> represents unsigned (logical)
472character values, which can be greater than 255. It is therefore more
473robust when dealing with potentially UTF-8-encoded data (as C<C> will wrap
474values outside the range 0..255, and not respect the string encoding).
475
476In practice, that means that pack formats are now encoding-neutral, except
477C<C>.
478
479For consistency, C<A> in unpack() format now trims all Unicode whitespace
480from the end of the string. Before perl 5.9.2, it used to strip only the
481classical ASCII space characters.
482
483=head2 Byte/character count feature in unpack()
484
485A new unpack() template character, C<".">, returns the number of bytes or
486characters (depending on the selected encoding mode, see above) read so far.
487
488=head2 The C<$*> and C<$#> variables have been removed
489
490C<$*>, which was deprecated in favor of the C</s> and C</m> regexp
491modifiers, has been removed.
492
493The deprecated C<$#> variable (output format for numbers) has been
494removed.
495
f00638a2 496Two new severe warnings, C<$#/$* is no longer supported>, have been added.
597bb945
RGS
497
498=head2 substr() lvalues are no longer fixed-length
499
500The lvalues returned by the three argument form of substr() used to be a
501"fixed length window" on the original string. In some cases this could
502cause surprising action at distance or other undefined behaviour. Now the
503length of the window adjusts itself to the length of the string assigned to
504it.
505
506=head2 Parsing of C<-f _>
507
508The identifier C<_> is now forced to be a bareword after a filetest
509operator. This solves a number of misparsing issues when a global C<_>
510subroutine is defined.
511
512=head2 C<:unique>
513
514The C<:unique> attribute has been made a no-op, since its current
515implementation was fundamentally flawed and not threadsafe.
516
597bb945
RGS
517=head2 Effect of pragmas in eval
518
519The compile-time value of the C<%^H> hint variable can now propagate into
520eval("")uated code. This makes it more useful to implement lexical
521pragmas.
522
523As a side-effect of this, the overloaded-ness of constants now propagates
524into eval("").
525
526=head2 chdir FOO
527
528A bareword argument to chdir() is now recognized as a file handle.
529Earlier releases interpreted the bareword as a directory name.
530(Gisle Aas)
531
532=head2 Handling of .pmc files
533
534An old feature of perl was that before C<require> or C<use> look for a
535file with a F<.pm> extension, they will first look for a similar filename
536with a F<.pmc> extension. If this file is found, it will be loaded in
537place of any potentially existing file ending in a F<.pm> extension.
538
539Previously, F<.pmc> files were loaded only if more recent than the
540matching F<.pm> file. Starting with 5.9.4, they'll be always loaded if
541they exist.
542
543=head2 @- and @+ in patterns
544
545The special arrays C<@-> and C<@+> are no longer interpolated in regular
546expressions. (Sadahiro Tomoyuki)
547
548=head2 $AUTOLOAD can now be tainted
549
550If you call a subroutine by a tainted name, and if it defers to an
551AUTOLOAD function, then $AUTOLOAD will be (correctly) tainted.
552(Rick Delaney)
553
554=head2 Tainting and printf
555
556When perl is run under taint mode, C<printf()> and C<sprintf()> will now
557reject any tainted format argument. (Rafael Garcia-Suarez)
558
559=head2 undef and signal handlers
560
561Undefining or deleting a signal handler via C<undef $SIG{FOO}> is now
562equivalent to setting it to C<'DEFAULT'>. (Rafael Garcia-Suarez)
563
564=head2 strictures and dereferencing in defined()
565
254a8700 566C<use strict 'refs'> was ignoring taking a hard reference in an argument
597bb945
RGS
567to defined(), as in :
568
254a8700
NC
569 use strict 'refs';
570 my $x = 'foo';
597bb945
RGS
571 if (defined $$x) {...}
572
573This now correctly produces the run-time error C<Can't use string as a
574SCALAR ref while "strict refs" in use>.
575
576C<defined @$foo> and C<defined %$bar> are now also subject to C<strict
577'refs'> (that is, C<$foo> and C<$bar> shall be proper references there.)
578(C<defined(@foo)> and C<defined(%bar)> are discouraged constructs anyway.)
579(Nicholas Clark)
580
581=head2 C<(?p{})> has been removed
582
583The regular expression construct C<(?p{})>, which was deprecated in perl
5845.8, has been removed. Use C<(??{})> instead. (Rafael Garcia-Suarez)
585
586=head2 Pseudo-hashes have been removed
587
588Support for pseudo-hashes has been removed from Perl 5.9. (The C<fields>
589pragma remains here, but uses an alternate implementation.)
590
591=head2 Removal of the bytecode compiler and of perlcc
592
593C<perlcc>, the byteloader and the supporting modules (B::C, B::CC,
594B::Bytecode, etc.) are no longer distributed with the perl sources. Those
595experimental tools have never worked reliably, and, due to the lack of
596volunteers to keep them in line with the perl interpreter developments, it
597was decided to remove them instead of shipping a broken version of those.
598The last version of those modules can be found with perl 5.9.4.
599
600However the B compiler framework stays supported in the perl core, as with
601the more useful modules it has permitted (among others, B::Deparse and
602B::Concise).
603
604=head2 Removal of the JPL
605
ed8ea1b6 606The JPL (Java-Perl Lingo) has been removed from the perl sources tarball.
597bb945
RGS
607
608=head2 Recursive inheritance detected earlier
609
610Perl will now immediately throw an exception if you modify any package's
611C<@ISA> in such a way that it would cause recursive inheritance.
612
613Previously, the exception would not occur until Perl attempted to make
614use of the recursive inheritance while resolving a method or doing a
615C<$foo-E<gt>isa($bar)> lookup.
616
cf6c151c 617=head1 Modules and Pragmata
c0c97549 618
187a0176
AD
619=head2 Upgrading individual core modules
620
621Even more core modules are now also available separately through the
622CPAN. If you wish to update one of these modules, you don't need to
623wait for a new perl release. From within the cpan shell, running the
624'r' command will report on modules with upgrades available. See
625C<perldoc CPAN> for more information.
626
f0e260b8
RGS
627=head2 Pragmata Changes
628
629=over 4
630
631=item C<feature>
632
633The new pragma C<feature> is used to enable new features that might break
634old code. See L</"The C<feature> pragma"> above.
635
636=item C<mro>
637
638This new pragma enables to change the algorithm used to resolve inherited
639methods. See L</"New Pragma, C<mro>"> above.
640
641=item Scoping of the C<sort> pragma
642
643The C<sort> pragma is now lexically scoped. Its effect used to be global.
644
645=item Scoping of C<bignum>, C<bigint>, C<bigrat>
646
647The three numeric pragmas C<bignum>, C<bigint> and C<bigrat> are now
648lexically scoped. (Tels)
649
650=item C<base>
651
652The C<base> pragma now warns if a class tries to inherit from itself.
653(Curtis "Ovid" Poe)
654
655=item C<strict> and C<warnings>
656
657C<strict> and C<warnings> will now complain loudly if they are loaded via
658incorrect casing (as in C<use Strict;>). (Johan Vromans)
659
6601a838
RGS
660=item C<version>
661
662The C<version> module provides support for version objects.
663
f0e260b8
RGS
664=item C<warnings>
665
666The C<warnings> pragma doesn't load C<Carp> anymore. That means that code
667that used C<Carp> routines without having loaded it at compile time might
668need to be adjusted; typically, the following (faulty) code won't work
669anymore, and will require parentheses to be added after the function name:
670
671 use warnings;
672 require Carp;
254a8700 673 Carp::confess 'argh';
f0e260b8
RGS
674
675=item C<less>
676
677C<less> now does something useful (or at least it tries to). In fact, it
678has been turned into a lexical pragma. So, in your modules, you can now
679test whether your users have requested to use less CPU, or less memory,
680less magic, or maybe even less fat. See L<less> for more. (Joshua ben
681Jore)
682
683=back
684
0eece9c0
RGS
685=head2 New modules
686
687=over 4
688
689=item *
690
691C<encoding::warnings>, by Audrey Tang, is a module to emit warnings
692whenever an ASCII character string containing high-bit bytes is implicitly
597bb945
RGS
693converted into UTF-8. It's a lexical pragma since Perl 5.9.4; on older
694perls, its effect is global.
0eece9c0
RGS
695
696=item *
697
698C<Module::CoreList>, by Richard Clamp, is a small handy module that tells
699you what versions of core modules ship with any versions of Perl 5. It
700comes with a command-line frontend, C<corelist>.
701
bd3831ee
RGS
702=item *
703
704C<Math::BigInt::FastCalc> is an XS-enabled, and thus faster, version of
705C<Math::BigInt::Calc>.
706
707=item *
708
709C<Compress::Zlib> is an interface to the zlib compression library. It
710comes with a bundled version of zlib, so having a working zlib is not a
711prerequisite to install it. It's used by C<Archive::Tar> (see below).
712
713=item *
714
715C<IO::Zlib> is an C<IO::>-style interface to C<Compress::Zlib>.
716
717=item *
718
719C<Archive::Tar> is a module to manipulate C<tar> archives.
720
721=item *
722
723C<Digest::SHA> is a module used to calculate many types of SHA digests,
724has been included for SHA support in the CPAN module.
725
726=item *
727
728C<ExtUtils::CBuilder> and C<ExtUtils::ParseXS> have been added.
729
597bb945
RGS
730=item *
731
732C<Hash::Util::FieldHash>, by Anno Siegel, has been added. This module
733provides support for I<field hashes>: hashes that maintain an association
734of a reference with a value, in a thread-safe garbage-collected way.
735Such hashes are useful to implement inside-out objects.
736
737=item *
738
739C<Module::Build>, by Ken Williams, has been added. It's an alternative to
740C<ExtUtils::MakeMaker> to build and install perl modules.
741
742=item *
743
744C<Module::Load>, by Jos Boumans, has been added. It provides a single
745interface to load Perl modules and F<.pl> files.
746
747=item *
748
749C<Module::Loaded>, by Jos Boumans, has been added. It's used to mark
750modules as loaded or unloaded.
751
752=item *
753
754C<Package::Constants>, by Jos Boumans, has been added. It's a simple
755helper to list all constants declared in a given package.
756
757=item *
758
759C<Win32API::File>, by Tye McQueen, has been added (for Windows builds).
760This module provides low-level access to Win32 system API calls for
761files/dirs.
762
f0e260b8
RGS
763=item *
764
765C<Locale::Maketext::Simple>, needed by CPANPLUS, is a simple wrapper around
766C<Locale::Maketext::Lexicon>. Note that C<Locale::Maketext::Lexicon> isn't
767included in the perl core; the behaviour of C<Locale::Maketext::Simple>
768gracefully degrades when the later isn't present.
769
770=item *
771
772C<Params::Check> implements a generic input parsing/checking mechanism. It
773is used by CPANPLUS.
774
775=item *
776
777C<Term::UI> simplifies the task to ask questions at a terminal prompt.
778
779=item *
780
781C<Object::Accessor> provides an interface to create per-object accessors.
782
783=item *
784
785C<Module::Pluggable> is a simple framework to create modules that accept
786pluggable sub-modules.
787
788=item *
789
790C<Module::Load::Conditional> provides simple ways to query and possibly
791load installed modules.
792
793=item *
794
795C<Time::Piece> provides an object oriented interface to time functions,
796overriding the built-ins localtime() and gmtime().
797
798=item *
799
800C<IPC::Cmd> helps to find and run external commands, possibly
801interactively.
802
803=item *
804
805C<File::Fetch> provide a simple generic file fetching mechanism.
806
807=item *
808
809C<Log::Message> and C<Log::Message::Simple> are used by the log facility
810of C<CPANPLUS>.
811
812=item *
813
814C<Archive::Extract> is a generic archive extraction mechanism
815for F<.tar> (plain, gziped or bzipped) or F<.zip> files.
816
817=item *
818
819C<CPANPLUS> provides an API and a command-line tool to access the CPAN
820mirrors.
821
e6746346
SP
822=item *
823
824C<Pod::Escapes> provides utilities that are useful in decoding Pod
825EE<lt>...E<gt> sequences.
826
827=item *
828
829C<Pod::Simple> is now the backend for several of the Pod-related modules
830included with Perl.
831
f0e260b8
RGS
832=back
833
834=head2 Selected Changes to Core Modules
835
836=over 4
837
838=item C<Attribute::Handlers>
839
840C<Attribute::Handlers> can now report the caller's file and line number.
841(David Feldman)
842
6cdf4617
RGS
843All interpreted attributes are now passed as array references. (Damian
844Conway)
845
f0e260b8
RGS
846=item C<B::Lint>
847
848C<B::Lint> is now based on C<Module::Pluggable>, and so can be extended
849with plugins. (Joshua ben Jore)
850
851=item C<B>
852
853It's now possible to access the lexical pragma hints (C<%^H>) by using the
854method B::COP::hints_hash(). It returns a C<B::RHE> object, which in turn
855can be used to get a hash reference via the method B::RHE::HASH(). (Joshua
856ben Jore)
857
858=item C<Thread>
859
860As the old 5005thread threading model has been removed, in favor of the
861ithreads scheme, the C<Thread> module is now a compatibility wrapper, to
862be used in old code only. It has been removed from the default list of
863dynamic extensions.
864
0eece9c0
RGS
865=back
866
cf6c151c 867=head1 Utility Changes
c0c97549
RGS
868
869=over 4
870
bd3831ee 871=item perl -d
c0c97549
RGS
872
873The Perl debugger can now save all debugger commands for sourcing later;
874notably, it can now emulate stepping backwards, by restarting and
875rerunning all bar the last command from a saved command history.
876
877It can also display the parent inheritance tree of a given class, with the
878C<i> command.
879
bd3831ee
RGS
880=item ptar
881
292c2b28 882C<ptar> is a pure perl implementation of C<tar> that comes with
bd3831ee
RGS
883C<Archive::Tar>.
884
885=item ptardiff
886
254a8700 887C<ptardiff> is a small utility used to generate a diff between the contents
bd3831ee
RGS
888of a tar archive and a directory tree. Like C<ptar>, it comes with
889C<Archive::Tar>.
890
891=item shasum
892
893C<shasum> is a command-line utility, used to print or to check SHA
894digests. It comes with the new C<Digest::SHA> module.
895
896=item corelist
0eece9c0
RGS
897
898The C<corelist> utility is now installed with perl (see L</"New modules">
899above).
900
bd3831ee 901=item h2ph and h2xs
0eece9c0 902
254a8700 903C<h2ph> and C<h2xs> have been made more robust with regard to
0eece9c0
RGS
904"modern" C code.
905
bd3831ee
RGS
906C<h2xs> implements a new option C<--use-xsloader> to force use of
907C<XSLoader> even in backwards compatible modules.
908
909The handling of authors' names that had apostrophes has been fixed.
910
911Any enums with negative values are now skipped.
912
913=item perlivp
914
915C<perlivp> no longer checks for F<*.ph> files by default. Use the new C<-a>
916option to run I<all> tests.
917
918=item find2perl
0eece9c0
RGS
919
920C<find2perl> now assumes C<-print> as a default action. Previously, it
921needed to be specified explicitly.
922
923Several bugs have been fixed in C<find2perl>, regarding C<-exec> and
924C<-eval>. Also the options C<-path>, C<-ipath> and C<-iname> have been
925added.
926
597bb945
RGS
927=item config_data
928
929C<config_data> is a new utility that comes with C<Module::Build>. It
930provides a command-line interface to the configuration of Perl modules
931that use Module::Build's framework of configurability (that is,
932C<*::ConfigData> modules that contain local configuration information for
933their parent modules.)
934
f00638a2 935=item cpanp
f0e260b8 936
254a8700 937C<cpanp>, the CPANPLUS shell, has been added. (C<cpanp-run-perl>, a
f0e260b8
RGS
938helper for CPANPLUS operation, has been added too, but isn't intended for
939direct use).
940
f00638a2 941=item cpan2dist
f0e260b8 942
292c2b28 943C<cpan2dist> is a new utility that comes with CPANPLUS. It's a tool to
f0e260b8
RGS
944create distributions (or packages) from CPAN modules.
945
f00638a2 946=item pod2html
f0e260b8
RGS
947
948The output of C<pod2html> has been enhanced to be more customizable via
949CSS. Some formatting problems were also corrected. (Jari Aalto)
950
c0c97549
RGS
951=back
952
cf6c151c 953=head1 New Documentation
c0c97549 954
597bb945
RGS
955The L<perlpragma> manpage documents how to write one's own lexical
956pragmas in pure Perl (something that is possible starting with 5.9.4).
957
bd3831ee
RGS
958The new L<perlglossary> manpage is a glossary of terms used in the Perl
959documentation, technical and otherwise, kindly provided by O'Reilly Media,
960Inc.
961
597bb945
RGS
962The L<perlreguts> manpage, courtesy of Yves Orton, describes internals of the
963Perl regular expression engine.
964
62c26f88
RGS
965The L<perlreapi> manpage describes the interface to the perl interpreter
966used to write pluggable regular expression engines (by Ævar Arnfjörð
967Bjarmason).
968
597bb945
RGS
969The L<perlunitut> manpage is an tutorial for programming with Unicode and
970string encodings in Perl, courtesy of Juerd Waalboer.
971
f0e260b8
RGS
972A new manual page, L<perlunifaq> (the Perl Unicode FAQ), has been added
973(Juerd Waalboer).
974
dbef3c66
RGS
975The L<perlcommunity> manpage gives a description of the Perl community
976on the Internet and in real life. (Edgar "Trizor" Bering)
977
f00638a2
RGS
978The L<CORE> manual page documents the C<CORE::> namespace. (Tels)
979
c0c97549
RGS
980The long-existing feature of C</(?{...})/> regexps setting C<$_> and pos()
981is now documented.
982
cf6c151c 983=head1 Performance Enhancements
c0c97549 984
597bb945 985=head2 In-place sorting
0eece9c0 986
c0c97549
RGS
987Sorting arrays in place (C<@a = sort @a>) is now optimized to avoid
988making a temporary copy of the array.
989
0eece9c0
RGS
990Likewise, C<reverse sort ...> is now optimized to sort in reverse,
991avoiding the generation of a temporary intermediate list.
992
597bb945 993=head2 Lexical array access
0eece9c0 994
c0c97549
RGS
995Access to elements of lexical arrays via a numeric constant between 0 and
996255 is now faster. (This used to be only the case for global arrays.)
997
597bb945 998=head2 XS-assisted SWASHGET
bd3831ee
RGS
999
1000Some pure-perl code that perl was using to retrieve Unicode properties and
1001transliteration mappings has been reimplemented in XS.
1002
597bb945 1003=head2 Constant subroutines
bd3831ee
RGS
1004
1005The interpreter internals now support a far more memory efficient form of
1006inlineable constants. Storing a reference to a constant value in a symbol
1007table is equivalent to a full typeglob referencing a constant subroutine,
1008but using about 400 bytes less memory. This proxy constant subroutine is
1009automatically upgraded to a real typeglob with subroutine if necessary.
1010The approach taken is analogous to the existing space optimisation for
1011subroutine stub declarations, which are stored as plain scalars in place
1012of the full typeglob.
1013
1014Several of the core modules have been converted to use this feature for
1015their system dependent constants - as a result C<use POSIX;> now takes about
1016200K less memory.
1017
597bb945 1018=head2 C<PERL_DONT_CREATE_GVSV>
bd3831ee
RGS
1019
1020The new compilation flag C<PERL_DONT_CREATE_GVSV>, introduced as an option
1021in perl 5.8.8, is turned on by default in perl 5.9.3. It prevents perl
1022from creating an empty scalar with every new typeglob. See L<perl588delta>
1023for details.
1024
597bb945 1025=head2 Weak references are cheaper
bd3831ee
RGS
1026
1027Weak reference creation is now I<O(1)> rather than I<O(n)>, courtesy of
1028Nicholas Clark. Weak reference deletion remains I<O(n)>, but if deletion only
1029happens at program exit, it may be skipped completely.
1030
597bb945 1031=head2 sort() enhancements
bd3831ee
RGS
1032
1033Salvador Fandiño provided improvements to reduce the memory usage of C<sort>
1034and to speed up some cases.
1035
597bb945
RGS
1036=head2 Memory optimisations
1037
1038Several internal data structures (typeglobs, GVs, CVs, formats) have been
1039restructured to use less memory. (Nicholas Clark)
1040
1041=head2 UTF-8 cache optimisation
1042
1043The UTF-8 caching code is now more efficient, and used more often.
1044(Nicholas Clark)
1045
1046=head2 Sloppy stat on Windows
1047
1048On Windows, perl's stat() function normally opens the file to determine
1049the link count and update attributes that may have been changed through
1050hard links. Setting ${^WIN32_SLOPPY_STAT} to a true value speeds up
1051stat() by not performing this operation. (Jan Dubois)
1052
597bb945
RGS
1053=head2 Regular expressions optimisations
1054
1055=over 4
1056
1057=item Engine de-recursivised
1058
1059The regular expression engine is no longer recursive, meaning that
1060patterns that used to overflow the stack will either die with useful
1061explanations, or run to completion, which, since they were able to blow
1062the stack before, will likely take a very long time to happen. If you were
1063experiencing the occasional stack overflow (or segfault) and upgrade to
1064discover that now perl apparently hangs instead, look for a degenerate
1065regex. (Dave Mitchell)
1066
1067=item Single char char-classes treated as literals
1068
1069Classes of a single character are now treated the same as if the character
1070had been used as a literal, meaning that code that uses char-classes as an
1071escaping mechanism will see a speedup. (Yves Orton)
1072
1073=item Trie optimisation of literal string alternations
1074
1075Alternations, where possible, are optimised into more efficient matching
1076structures. String literal alternations are merged into a trie and are
1077matched simultaneously. This means that instead of O(N) time for matching
1078N alternations at a given point, the new code performs in O(1) time.
1079A new special variable, ${^RE_TRIE_MAXBUF}, has been added to fine-tune
1080this optimization. (Yves Orton)
1081
1082B<Note:> Much code exists that works around perl's historic poor
1083performance on alternations. Often the tricks used to do so will disable
1084the new optimisations. Hopefully the utility modules used for this purpose
99d59c4d 1085will be educated about these new optimisations.
597bb945
RGS
1086
1087=item Aho-Corasick start-point optimisation
1088
1089When a pattern starts with a trie-able alternation and there aren't
e15dad31 1090better optimisations available, the regex engine will use Aho-Corasick
597bb945
RGS
1091matching to find the start point. (Yves Orton)
1092
0eece9c0
RGS
1093=back
1094
cf6c151c 1095=head1 Installation and Configuration Improvements
c0c97549 1096
597bb945
RGS
1097=head2 Configuration improvements
1098
1099=over 4
1100
1101=item C<-Dusesitecustomize>
bd3831ee 1102
0eece9c0 1103Run-time customization of @INC can be enabled by passing the
597bb945 1104C<-Dusesitecustomize> flag to Configure. When enabled, this will make perl
0eece9c0
RGS
1105run F<$sitelibexp/sitecustomize.pl> before anything else. This script can
1106then be set up to add additional entries to @INC.
1107
597bb945
RGS
1108=item Relocatable installations
1109
1110There is now Configure support for creating a relocatable perl tree. If
1111you Configure with C<-Duserelocatableinc>, then the paths in @INC (and
1112everything else in %Config) can be optionally located via the path of the
1113perl executable.
1114
1115That means that, if the string C<".../"> is found at the start of any
1116path, it's substituted with the directory of $^X. So, the relocation can
1117be configured on a per-directory basis, although the default with
1118C<-Duserelocatableinc> is that everything is relocated. The initial
1119install is done to the original configured prefix.
1120
1121=item strlcat() and strlcpy()
1122
1123The configuration process now detects whether strlcat() and strlcpy() are
1124available. When they are not available, perl's own version is used (from
1125Russ Allbery's public domain implementation). Various places in the perl
1126interpreter now use them. (Steve Peters)
1127
f0e260b8
RGS
1128=item C<d_pseudofork> and C<d_printf_format_null>
1129
1130A new configuration variable, available as C<$Config{d_pseudofork}> in
1131the L<Config> module, has been added, to distinguish real fork() support
1132from fake pseudofork used on Windows platforms.
1133
1134A new configuration variable, C<d_printf_format_null>, has been added,
1135to see if printf-like formats are allowed to be NULL.
1136
1137=item Configure help
1138
1139C<Configure -h> has been extended with the most commonly used options.
1140
597bb945
RGS
1141=back
1142
1143=head2 Compilation improvements
1144
1145=over 4
1146
1147=item Parallel build
0eece9c0 1148
bd3831ee
RGS
1149Parallel makes should work properly now, although there may still be problems
1150if C<make test> is instructed to run in parallel.
1151
597bb945
RGS
1152=item Borland's compilers support
1153
bd3831ee
RGS
1154Building with Borland's compilers on Win32 should work more smoothly. In
1155particular Steve Hay has worked to side step many warnings emitted by their
1156compilers and at least one C compiler internal error.
1157
597bb945
RGS
1158=item Static build on Windows
1159
f0e260b8
RGS
1160Perl extensions on Windows now can be statically built into the Perl DLL.
1161
1162Also, it's now possible to build a C<perl-static.exe> that doesn't depend
1163on the Perl DLL on Win32. See the Win32 makefiles for details.
1164(Vadim Konovalov)
bd3831ee 1165
69d2c521 1166=item ppport.h files
597bb945
RGS
1167
1168All F<ppport.h> files in the XS modules bundled with perl are now
1169autogenerated at build time. (Marcus Holland-Moritz)
1170
f0e260b8
RGS
1171=item C++ compatibility
1172
1173Efforts have been made to make perl and the core XS modules compilable
1174with various C++ compilers (although the situation is not perfect with
1175some of the compilers on some of the platforms tested.)
1176
597bb945
RGS
1177=item Support for Microsoft 64-bit compiler
1178
1179Support for building perl with Microsoft's 64-bit compiler has been
1180improved. (ActiveState)
1181
f0e260b8
RGS
1182=item Visual C++
1183
c01f0d41 1184Perl can now be compiled with Microsoft Visual C++ 2005 (and 2008 Beta 2).
f0e260b8
RGS
1185
1186=item Win32 builds
1187
1188All win32 builds (MS-Win, WinCE) have been merged and cleaned up.
1189
597bb945
RGS
1190=back
1191
1192=head2 Installation improvements
1193
1194=over 4
1195
1196=item Module auxiliary files
1197
1198README files and changelogs for CPAN modules bundled with perl are no
1199longer installed.
1200
1201=back
1202
bd3831ee
RGS
1203=head2 New Or Improved Platforms
1204
597bb945 1205Perl has been reported to work on Symbian OS. See L<perlsymbian> for more
bd3831ee
RGS
1206information.
1207
597bb945
RGS
1208Many improvements have been made towards making Perl work correctly on
1209z/OS.
1210
f0e260b8 1211Perl has been reported to work on DragonFlyBSD and MidnightBSD.
597bb945 1212
3af9ce7f
RGS
1213Perl has also been reported to work on NexentaOS
1214( http://www.gnusolaris.org/ ).
1215
bd3831ee
RGS
1216The VMS port has been improved. See L<perlvms>.
1217
d43695a1
RGS
1218Support for Cray XT4 Catamount/Qk has been added. See
1219F<hints/catamount.sh> in the source code distribution for more
1220information.
bd3831ee 1221
f0e260b8
RGS
1222Vendor patches have been merged for RedHat and Gentoo.
1223
1224DynaLoader::dl_unload_file() now works on Windows.
bd3831ee 1225
cf6c151c 1226=head1 Selected Bug Fixes
c0c97549 1227
bd3831ee
RGS
1228=over 4
1229
1230=item strictures in regexp-eval blocks
1231
c0c97549
RGS
1232C<strict> wasn't in effect in regexp-eval blocks (C</(?{...})/>).
1233
bd3831ee
RGS
1234=item Calling CORE::require()
1235
1236CORE::require() and CORE::do() were always parsed as require() and do()
1237when they were overridden. This is now fixed.
1238
1239=item Subscripts of slices
1240
1241You can now use a non-arrowed form for chained subscripts after a list
1242slice, like in:
1243
1244 ({foo => "bar"})[0]{foo}
1245
1246This used to be a syntax error; a C<< -> >> was required.
1247
1248=item C<no warnings 'category'> works correctly with -w
1249
1250Previously when running with warnings enabled globally via C<-w>, selective
1251disabling of specific warning categories would actually turn off all warnings.
1252This is now fixed; now C<no warnings 'io';> will only turn off warnings in the
1253C<io> class. Previously it would erroneously turn off all warnings.
1254
597bb945 1255=item threads improvements
bd3831ee
RGS
1256
1257Several memory leaks in ithreads were closed. Also, ithreads were made
1258less memory-intensive.
1259
597bb945
RGS
1260C<threads> is now a dual-life module, also available on CPAN. It has been
1261expanded in many ways. A kill() method is available for thread signalling.
1262One can get thread status, or the list of running or joinable threads.
1263
1264A new C<< threads->exit() >> method is used to exit from the application
1265(this is the default for the main thread) or from the current thread only
1266(this is the default for all other threads). On the other hand, the exit()
1267built-in now always causes the whole application to terminate. (Jerry
1268D. Hedden)
1269
bd3831ee
RGS
1270=item chr() and negative values
1271
1272chr() on a negative value now gives C<\x{FFFD}>, the Unicode replacement
1273character, unless when the C<bytes> pragma is in effect, where the low
1274eight bytes of the value are used.
1275
597bb945
RGS
1276=item PERL5SHELL and tainting
1277
1278On Windows, the PERL5SHELL environment variable is now checked for
1279taintedness. (Rafael Garcia-Suarez)
1280
1281=item Using *FILE{IO}
1282
1283C<stat()> and C<-X> filetests now treat *FILE{IO} filehandles like *FILE
1284filehandles. (Steve Peters)
1285
1286=item Overloading and reblessing
1287
1288Overloading now works when references are reblessed into another class.
1289Internally, this has been implemented by moving the flag for "overloading"
1290from the reference to the referent, which logically is where it should
1291always have been. (Nicholas Clark)
1292
1293=item Overloading and UTF-8
1294
1295A few bugs related to UTF-8 handling with objects that have
1296stringification overloaded have been fixed. (Nicholas Clark)
1297
1298=item eval memory leaks fixed
1299
1300Traditionally, C<eval 'syntax error'> has leaked badly. Many (but not all)
1301of these leaks have now been eliminated or reduced. (Dave Mitchell)
1302
1303=item Random device on Windows
1304
1305In previous versions, perl would read the file F</dev/urandom> if it
1306existed when seeding its random number generator. That file is unlikely
1307to exist on Windows, and if it did would probably not contain appropriate
1308data, so perl no longer tries to read it on Windows. (Alex Davies)
1309
1310=item PERLIO_DEBUG
1311
254a8700 1312The C<PERLIO_DEBUG> environment variable no longer has any effect for
597bb945
RGS
1313setuid scripts and for scripts run with B<-T>.
1314
1315Moreover, with a thread-enabled perl, using C<PERLIO_DEBUG> could lead to
1316an internal buffer overflow. This has been fixed.
1317
f0e260b8
RGS
1318=item PerlIO::scalar and read-only scalars
1319
1320PerlIO::scalar will now prevent writing to read-only scalars. Moreover,
1321seek() is now supported with PerlIO::scalar-based filehandles, the
1322underlying string being zero-filled as needed. (Rafael, Jarkko Hietaniemi)
1323
1324=item study() and UTF-8
1325
1326study() never worked for UTF-8 strings, but could lead to false results.
1327It's now a no-op on UTF-8 data. (Yves Orton)
1328
1329=item Critical signals
1330
1331The signals SIGILL, SIGBUS and SIGSEGV are now always delivered in an
1332"unsafe" manner (contrary to other signals, that are deferred until the
1333perl interpreter reaches a reasonably stable state; see
1334L<perlipc/"Deferred Signals (Safe Signals)">). (Rafael)
1335
1336=item @INC-hook fix
1337
1338When a module or a file is loaded through an @INC-hook, and when this hook
1339has set a filename entry in %INC, __FILE__ is now set for this module
1340accordingly to the contents of that %INC entry. (Rafael)
1341
1342=item C<-t> switch fix
1343
1344The C<-w> and C<-t> switches can now be used together without messing
254a8700 1345up which categories of warnings are activated. (Rafael)
f0e260b8
RGS
1346
1347=item Duping UTF-8 filehandles
1348
1349Duping a filehandle which has the C<:utf8> PerlIO layer set will now
1350properly carry that layer on the duped filehandle. (Rafael)
1351
1352=item Localisation of hash elements
1353
292c2b28 1354Localizing a hash element whose key was given as a variable didn't work
f0e260b8
RGS
1355correctly if the variable was changed while the local() was in effect (as
1356in C<local $h{$x}; ++$x>). (Bo Lindbergh)
1357
bd3831ee 1358=back
0eece9c0 1359
cf6c151c 1360=head1 New or Changed Diagnostics
c0c97549 1361
bd3831ee
RGS
1362=over 4
1363
d43695a1
RGS
1364=item Use of uninitialized value
1365
1366Perl will now try to tell you the name of the variable (if any) that was
1367undefined.
1368
bd3831ee
RGS
1369=item Deprecated use of my() in false conditional
1370
c0c97549
RGS
1371A new deprecation warning, I<Deprecated use of my() in false conditional>,
1372has been added, to warn against the use of the dubious and deprecated
1373construct
1374
1375 my $x if 0;
1376
1377See L<perldiag>. Use C<state> variables instead.
1378
bd3831ee
RGS
1379=item !=~ should be !~
1380
0eece9c0
RGS
1381A new warning, C<!=~ should be !~>, is emitted to prevent this misspelling
1382of the non-matching operator.
1383
bd3831ee
RGS
1384=item Newline in left-justified string
1385
0eece9c0
RGS
1386The warning I<Newline in left-justified string> has been removed.
1387
bd3831ee
RGS
1388=item Too late for "-T" option
1389
0eece9c0
RGS
1390The error I<Too late for "-T" option> has been reformulated to be more
1391descriptive.
1392
bd3831ee
RGS
1393=item "%s" variable %s masks earlier declaration
1394
1395This warning is now emitted in more consistent cases; in short, when one
1396of the declarations involved is a C<my> variable:
1397
1398 my $x; my $x; # warns
1399 my $x; our $x; # warns
1400 our $x; my $x; # warns
1401
1402On the other hand, the following:
1403
1404 our $x; our $x;
1405
1406now gives a C<"our" variable %s redeclared> warning.
1407
1408=item readdir()/closedir()/etc. attempted on invalid dirhandle
1409
1410These new warnings are now emitted when a dirhandle is used but is
1411either closed or not really a dirhandle.
1412
f0e260b8
RGS
1413=item Opening dirhandle/filehandle %s also as a file/directory
1414
1415Two deprecation warnings have been added: (Rafael)
1416
1417 Opening dirhandle %s also as a file
1418 Opening filehandle %s also as a directory
1419
f00638a2
RGS
1420=item Use of -P is deprecated
1421
1422Perl's command-line switch C<-P> is now deprecated.
1423
6601a838
RGS
1424=item v-string in use/require is non-portable
1425
1426Perl will warn you against potential backwards compatibility problems with
1427the C<use VERSION> syntax.
1428
bd3831ee
RGS
1429=item perl -V
1430
0eece9c0
RGS
1431C<perl -V> has several improvements, making it more useable from shell
1432scripts to get the value of configuration variables. See L<perlrun> for
1433details.
1434
bd3831ee
RGS
1435=back
1436
cf6c151c 1437=head1 Changed Internals
c0c97549 1438
16993b2e
JH
1439In general, the source code of perl has been refactored, tidied up,
1440and optimized in many places. Also, memory management and allocation
1441has been improved in several points.
1442
1443When compiling the perl core with gcc, as many gcc warning flags are
1444turned on as is possible on the platform. (This quest for cleanliness
1445doesn't extend to XS code because we cannot guarantee the tidiness of
1446code we didn't write.) Similar strictness flags have been added or
1447tightened for various other C compilers.
bd3831ee 1448
c0c97549
RGS
1449=head2 Reordering of SVt_* constants
1450
1451The relative ordering of constants that define the various types of C<SV>
1452have changed; in particular, C<SVt_PVGV> has been moved before C<SVt_PVLV>,
1453C<SVt_PVAV>, C<SVt_PVHV> and C<SVt_PVCV>. This is unlikely to make any
1454difference unless you have code that explicitly makes assumptions about that
1455ordering. (The inheritance hierarchy of C<B::*> objects has been changed
1456to reflect this.)
1457
254a8700
NC
1458=head2 Elimination of SVt_PVBM
1459
1460Related to this, the internal type C<SVt_PVBM> has been been removed. This
1461dedicated type of C<SV> was used by the C<index> operator and parts of the
1462regexp engine to facilitate fast Boyer-Moore matches. Its use internally has
1463been replaced by C<SV>s of type C<SVt_PVGV>.
1464
1465=head2 New type SVt_BIND
1466
1467A new type C<SVt_BIND> has been added, in readiness for the project to
1468implement Perl 6 on 5. There deliberately is no implementation yet, and
1469they cannot yet be created or destroyed.
1470
c0c97549
RGS
1471=head2 Removal of CPP symbols
1472
1473The C preprocessor symbols C<PERL_PM_APIVERSION> and
1474C<PERL_XS_APIVERSION>, which were supposed to give the version number of
1475the oldest perl binary-compatible (resp. source-compatible) with the
1476present one, were not used, and sometimes had misleading values. They have
1477been removed.
1478
1479=head2 Less space is used by ops
1480
1481The C<BASEOP> structure now uses less space. The C<op_seq> field has been
254a8700 1482removed and replaced by a single bit bit-field C<op_opt>. C<op_type> is now 9
c0c97549
RGS
1483bits long. (Consequently, the C<B::OP> class doesn't provide an C<seq>
1484method anymore.)
1485
1486=head2 New parser
1487
1488perl's parser is now generated by bison (it used to be generated by
1489byacc.) As a result, it seems to be a bit more robust.
1490
bd3831ee
RGS
1491Also, Dave Mitchell improved the lexer debugging output under C<-DT>.
1492
1493=head2 Use of C<const>
1494
1495Andy Lester supplied many improvements to determine which function
1496parameters and local variables could actually be declared C<const> to the C
1497compiler. Steve Peters provided new C<*_set> macros and reworked the core to
1498use these rather than assigning to macros in LVALUE context.
1499
1500=head2 Mathoms
1501
1502A new file, F<mathoms.c>, has been added. It contains functions that are
1503no longer used in the perl core, but that remain available for binary or
1504source compatibility reasons. However, those functions will not be
1505compiled in if you add C<-DNO_MATHOMS> in the compiler flags.
1506
1507=head2 C<AvFLAGS> has been removed
1508
1509The C<AvFLAGS> macro has been removed.
1510
1511=head2 C<av_*> changes
1512
1513The C<av_*()> functions, used to manipulate arrays, no longer accept null
1514C<AV*> parameters.
1515
597bb945
RGS
1516=head2 $^H and %^H
1517
1518The implementation of the special variables $^H and %^H has changed, to
254a8700 1519allow implementing lexical pragmas in pure Perl.
597bb945 1520
bd3831ee
RGS
1521=head2 B:: modules inheritance changed
1522
1523The inheritance hierarchy of C<B::> modules has changed; C<B::NV> now
1524inherits from C<B::SV> (it used to inherit from C<B::IV>).
1525
f0e260b8
RGS
1526=head2 Anonymous hash and array constructors
1527
1528The anonymous hash and array constructors now take 1 op in the optree
1529instead of 3, now that pp_anonhash and pp_anonlist return a reference to
1530an hash/array when the op is flagged with OPf_SPECIAL (Nicholas Clark).
1531
cf6c151c 1532=head1 Known Problems
c0c97549
RGS
1533
1534There's still a remaining problem in the implementation of the lexical
1535C<$_>: it doesn't work inside C</(?{...})/> blocks. (See the TODO test in
1536F<t/op/mydef.t>.)
1537
3af9ce7f
RGS
1538Stacked filetest operators won't work when the C<filetest> pragma is in
1539effect, because they rely on the stat() buffer C<_> being populated, and
1540filetest bypasses stat().
1541
48d64ccf
RGS
1542When compiled with g++ and thread support on Linux, it's reported that the
1543C<$!> stops working correctly. This is related to the fact that the glibc
1544provides two strerror_r(3) implementation, and perl selects the wrong
1545one.
1546
cf6c151c 1547=head1 Platform Specific Problems
c0c97549 1548
cf6c151c
RGS
1549=head1 Reporting Bugs
1550
1551=head1 SEE ALSO
1552
1553The F<Changes> file and the perl590delta to perl595delta man pages for
1554exhaustive details on what changed.
1555
1556The F<INSTALL> file for how to build Perl.
1557
1558The F<README> file for general stuff.
1559
1560The F<Artistic> and F<Copying> files for copyright information.
1561
1562=cut