This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Replace a few http urls with https
[perl5.git] / pod / perl5100delta.pod
CommitLineData
81663c3a
RGS
1=encoding utf8
2
cf6c151c
RGS
3=head1 NAME
4
7120b314 5perl5100delta - what is new for perl 5.10.0
cf6c151c
RGS
6
7=head1 DESCRIPTION
8
9This document describes the differences between the 5.8.8 release and
10the 5.10.0 release.
11
12Many of the bug fixes in 5.10.0 were already seen in the 5.8.X maintenance
13releases; they are not duplicated here and are documented in the set of
14man pages named perl58[1-8]?delta.
15
cf6c151c
RGS
16=head1 Core Enhancements
17
18=head2 The C<feature> pragma
19
20The C<feature> pragma is used to enable new syntax that would break Perl's
21backwards-compatibility with older releases of the language. It's a lexical
22pragma, like C<strict> or C<warnings>.
23
24Currently the following new features are available: C<switch> (adds a
25switch statement), C<say> (adds a C<say> built-in function), and C<state>
292c2b28 26(adds a C<state> keyword for declaring "static" variables). Those
cf6c151c
RGS
27features are described in their own sections of this document.
28
29The C<feature> pragma is also implicitly loaded when you require a minimal
30perl version (with the C<use VERSION> construct) greater than, or equal
31to, 5.9.5. See L<feature> for details.
32
33=head2 New B<-E> command-line switch
34
35B<-E> is equivalent to B<-e>, but it implicitly enables all
36optional features (like C<use feature ":5.10">).
37
38=head2 Defined-or operator
39
40A new operator C<//> (defined-or) has been implemented.
dbef3c66 41The following expression:
cf6c151c
RGS
42
43 $a // $b
44
45is merely equivalent to
46
47 defined $a ? $a : $b
48
dbef3c66 49and the statement
cf6c151c
RGS
50
51 $c //= $d;
52
53can now be used instead of
54
55 $c = $d unless defined $c;
56
57The C<//> operator has the same precedence and associativity as C<||>.
58Special care has been taken to ensure that this operator Do What You Mean
59while not breaking old code, but some edge cases involving the empty
60regular expression may now parse differently. See L<perlop> for
61details.
62
63=head2 Switch and Smart Match operator
64
65Perl 5 now has a switch statement. It's available when C<use feature
66'switch'> is in effect. This feature introduces three new keywords,
67C<given>, C<when>, and C<default>:
68
69 given ($foo) {
70 when (/^abc/) { $abc = 1; }
71 when (/^def/) { $def = 1; }
72 when (/^xyz/) { $xyz = 1; }
73 default { $nothing = 1; }
74 }
75
76A more complete description of how Perl matches the switch variable
77against the C<when> conditions is given in L<perlsyn/"Switch statements">.
78
79This kind of match is called I<smart match>, and it's also possible to use
80it outside of switch statements, via the new C<~~> operator. See
81L<perlsyn/"Smart matching in detail">.
82
83This feature was contributed by Robin Houston.
84
85=head2 Regular expressions
86
87=over 4
88
89=item Recursive Patterns
90
91It is now possible to write recursive patterns without using the C<(??{})>
92construct. This new way is more efficient, and in many cases easier to
93read.
94
95Each capturing parenthesis can now be treated as an independent pattern
96that can be entered by using the C<(?PARNO)> syntax (C<PARNO> standing for
97"parenthesis number"). For example, the following pattern will match
98nested balanced angle brackets:
99
100 /
101 ^ # start of line
102 ( # start capture buffer 1
103 < # match an opening angle bracket
104 (?: # match one of:
105 (?> # don't backtrack over the inside of this group
d991eed6 106 [^<>]+ # one or more non angle brackets
cf6c151c
RGS
107 ) # end non backtracking group
108 | # ... or ...
109 (?1) # recurse to bracket 1 and try it again
110 )* # 0 or more times.
111 > # match a closing angle bracket
112 ) # end capture buffer one
113 $ # end of line
114 /x
115
e15dad31
JC
116PCRE users should note that Perl's recursive regex feature allows
117backtracking into a recursed pattern, whereas in PCRE the recursion is
118atomic or "possessive" in nature. As in the example above, you can
119add (?>) to control this selectively. (Yves Orton)
cf6c151c
RGS
120
121=item Named Capture Buffers
122
123It is now possible to name capturing parenthesis in a pattern and refer to
124the captured contents by name. The naming syntax is C<< (?<NAME>....) >>.
125It's possible to backreference to a named buffer with the C<< \k<NAME> >>
126syntax. In code, the new magical hashes C<%+> and C<%-> can be used to
127access the contents of the capture buffers.
128
e15dad31 129Thus, to replace all doubled chars with a single copy, one could write
cf6c151c
RGS
130
131 s/(?<letter>.)\k<letter>/$+{letter}/g
132
133Only buffers with defined contents will be "visible" in the C<%+> hash, so
134it's possible to do something like
135
136 foreach my $name (keys %+) {
137 print "content of buffer '$name' is $+{$name}\n";
138 }
139
140The C<%-> hash is a bit more complete, since it will contain array refs
141holding values from all capture buffers similarly named, if there should
142be many of them.
143
144C<%+> and C<%-> are implemented as tied hashes through the new module
145C<Tie::Hash::NamedCapture>.
146
147Users exposed to the .NET regex engine will find that the perl
148implementation differs in that the numerical ordering of the buffers
149is sequential, and not "unnamed first, then named". Thus in the pattern
150
151 /(A)(?<B>B)(C)(?<D>D)/
152
153$1 will be 'A', $2 will be 'B', $3 will be 'C' and $4 will be 'D' and not
154$1 is 'A', $2 is 'C' and $3 is 'B' and $4 is 'D' that a .NET programmer
155would expect. This is considered a feature. :-) (Yves Orton)
156
157=item Possessive Quantifiers
158
159Perl now supports the "possessive quantifier" syntax of the "atomic match"
160pattern. Basically a possessive quantifier matches as much as it can and never
161gives any back. Thus it can be used to control backtracking. The syntax is
162similar to non-greedy matching, except instead of using a '?' as the modifier
163the '+' is used. Thus C<?+>, C<*+>, C<++>, C<{min,max}+> are now legal
164quantifiers. (Yves Orton)
165
166=item Backtracking control verbs
167
168The regex engine now supports a number of special-purpose backtrack
169control verbs: (*THEN), (*PRUNE), (*MARK), (*SKIP), (*COMMIT), (*FAIL)
170and (*ACCEPT). See L<perlre> for their descriptions. (Yves Orton)
171
172=item Relative backreferences
173
174A new syntax C<\g{N}> or C<\gN> where "N" is a decimal integer allows a
175safer form of back-reference notation as well as allowing relative
176backreferences. This should make it easier to generate and embed patterns
177that contain backreferences. See L<perlre/"Capture buffers">. (Yves Orton)
178
179=item C<\K> escape
180
181The functionality of Jeff Pinyan's module Regexp::Keep has been added to
254a8700 182the core. In regular expressions you can now use the special escape C<\K>
cf6c151c
RGS
183as a way to do something like floating length positive lookbehind. It is
184also useful in substitutions like:
185
186 s/(foo)bar/$1/g
187
188that can now be converted to
189
190 s/foo\Kbar//g
191
192which is much more efficient. (Yves Orton)
193
194=item Vertical and horizontal whitespace, and linebreak
195
292c2b28 196Regular expressions now recognize the C<\v> and C<\h> escapes that match
cf6c151c
RGS
197vertical and horizontal whitespace, respectively. C<\V> and C<\H>
198logically match their complements.
199
200C<\R> matches a generic linebreak, that is, vertical whitespace, plus
201the multi-character sequence C<"\x0D\x0A">.
202
9af0006b
ZA
203=item Optional pre-match and post-match captures with the /p flag
204
205There is a new flag C</p> for regular expressions. Using this
206makes the engine preserve a copy of the part of the matched string before
207the matching substring to the new special variable C<${^PREMATCH}>, the
208part after the matching substring to C<${^POSTMATCH}>, and the matched
209substring itself to C<${^MATCH}>.
210
211Perl is still able to store these substrings to the special variables
212C<$`>, C<$'>, C<$&>, but using these variables anywhere in the program
213adds a penalty to all regular expression matches, whereas if you use
214the C</p> flag and the new special variables instead, you pay only for
215the regular expressions where the flag is used.
216
217For more detail on the new variables, see L<perlvar>; for the use of
218the regular expression flag, see L<perlop> and L<perlre>.
219
cf6c151c
RGS
220=back
221
222=head2 C<say()>
223
224say() is a new built-in, only available when C<use feature 'say'> is in
225effect, that is similar to print(), but that implicitly appends a newline
226to the printed string. See L<perlfunc/say>. (Robin Houston)
227
228=head2 Lexical C<$_>
229
230The default variable C<$_> can now be lexicalized, by declaring it like
231any other lexical variable, with a simple
232
233 my $_;
234
235The operations that default on C<$_> will use the lexically-scoped
236version of C<$_> when it exists, instead of the global C<$_>.
237
238In a C<map> or a C<grep> block, if C<$_> was previously my'ed, then the
239C<$_> inside the block is lexical as well (and scoped to the block).
240
241In a scope where C<$_> has been lexicalized, you can still have access to
242the global version of C<$_> by using C<$::_>, or, more simply, by
597bb945 243overriding the lexical declaration with C<our $_>. (Rafael Garcia-Suarez)
cf6c151c
RGS
244
245=head2 The C<_> prototype
246
254a8700 247A new prototype character has been added. C<_> is equivalent to C<$> but
62ba2c93
RB
248defaults to C<$_> if the corresponding argument isn't supplied (both C<$>
249and C<_> denote a scalar). Due to the optional nature of the argument,
250you can only use it at the end of a prototype, or before a semicolon.
cf6c151c
RGS
251
252This has a small incompatible consequence: the prototype() function has
253been adjusted to return C<_> for some built-ins in appropriate cases (for
254example, C<prototype('CORE::rmdir')>). (Rafael Garcia-Suarez)
255
256=head2 UNITCHECK blocks
257
258C<UNITCHECK>, a new special code block has been introduced, in addition to
259C<BEGIN>, C<CHECK>, C<INIT> and C<END>.
260
261C<CHECK> and C<INIT> blocks, while useful for some specialized purposes,
262are always executed at the transition between the compilation and the
263execution of the main program, and thus are useless whenever code is
264loaded at runtime. On the other hand, C<UNITCHECK> blocks are executed
265just after the unit which defined them has been compiled. See L<perlmod>
266for more information. (Alex Gough)
267
268=head2 New Pragma, C<mro>
269
270A new pragma, C<mro> (for Method Resolution Order) has been added. It
271permits to switch, on a per-class basis, the algorithm that perl uses to
dbef3c66 272find inherited methods in case of a multiple inheritance hierarchy. The
cf6c151c
RGS
273default MRO hasn't changed (DFS, for Depth First Search). Another MRO is
274available: the C3 algorithm. See L<mro> for more information.
275(Brandon Black)
276
dbef3c66 277Note that, due to changes in the implementation of class hierarchy search,
cf6c151c
RGS
278code that used to undef the C<*ISA> glob will most probably break. Anyway,
279undef'ing C<*ISA> had the side-effect of removing the magic on the @ISA
0e44e2ed
RGS
280array and should not have been done in the first place. Also, the
281cache C<*::ISA::CACHE::> no longer exists; to force reset the @ISA cache,
282you now need to use the C<mro> API, or more simply to assign to @ISA
283(e.g. with C<@ISA = @ISA>).
cf6c151c 284
3de67921
JD
285=head2 readdir() may return a "short filename" on Windows
286
287The readdir() function may return a "short filename" when the long
288filename contains characters outside the ANSI codepage. Similarly
289Cwd::cwd() may return a short directory name, and glob() may return short
290names as well. On the NTFS file system these short names can always be
291represented in the ANSI codepage. This will not be true for all other file
292system drivers; e.g. the FAT filesystem stores short filenames in the OEM
293codepage, so some files on FAT volumes remain unaccessible through the
294ANSI APIs.
295
296Similarly, $^X, @INC, and $ENV{PATH} are preprocessed at startup to make
297sure all paths are valid in the ANSI codepage (if possible).
298
299The Win32::GetLongPathName() function now returns the UTF-8 encoded
300correct long file name instead of using replacement characters to force
301the name into the ANSI codepage. The new Win32::GetANSIPathName()
302function can be used to turn a long pathname into a short one only if the
303long one cannot be represented in the ANSI codepage.
304
305Many other functions in the C<Win32> module have been improved to accept
306UTF-8 encoded arguments. Please see L<Win32> for details.
307
cf6c151c
RGS
308=head2 readpipe() is now overridable
309
310The built-in function readpipe() is now overridable. Overriding it permits
311also to override its operator counterpart, C<qx//> (a.k.a. C<``>).
312Moreover, it now defaults to C<$_> if no argument is provided. (Rafael
313Garcia-Suarez)
314
597bb945 315=head2 Default argument for readline()
cf6c151c
RGS
316
317readline() now defaults to C<*ARGV> if no argument is provided. (Rafael
318Garcia-Suarez)
319
320=head2 state() variables
321
322A new class of variables has been introduced. State variables are similar
323to C<my> variables, but are declared with the C<state> keyword in place of
324C<my>. They're visible only in their lexical scope, but their value is
325persistent: unlike C<my> variables, they're not undefined at scope entry,
326but retain their previous value. (Rafael Garcia-Suarez, Nicholas Clark)
327
328To use state variables, one needs to enable them by using
329
254a8700 330 use feature 'state';
cf6c151c
RGS
331
332or by using the C<-E> command-line switch in one-liners.
d991eed6 333See L<perlsub/"Persistent Private Variables">.
cf6c151c
RGS
334
335=head2 Stacked filetest operators
336
337As a new form of syntactic sugar, it's now possible to stack up filetest
338operators. You can now write C<-f -w -x $file> in a row to mean
339C<-x $file && -w _ && -f _>. See L<perlfunc/-X>.
340
341=head2 UNIVERSAL::DOES()
342
343The C<UNIVERSAL> class has a new method, C<DOES()>. It has been added to
344solve semantic problems with the C<isa()> method. C<isa()> checks for
345inheritance, while C<DOES()> has been designed to be overridden when
346module authors use other types of relations between classes (in addition
347to inheritance). (chromatic)
348
349See L<< UNIVERSAL/"$obj->DOES( ROLE )" >>.
350
cf6c151c
RGS
351=head2 Formats
352
353Formats were improved in several ways. A new field, C<^*>, can be used for
354variable-width, one-line-at-a-time text. Null characters are now handled
355correctly in picture lines. Using C<@#> and C<~~> together will now
356produce a compile-time error, as those format fields are incompatible.
357L<perlform> has been improved, and miscellaneous bugs fixed.
358
359=head2 Byte-order modifiers for pack() and unpack()
360
361There are two new byte-order modifiers, C<E<gt>> (big-endian) and C<E<lt>>
362(little-endian), that can be appended to most pack() and unpack() template
363characters and groups to force a certain byte-order for that type or group.
364See L<perlfunc/pack> and L<perlpacktut> for details.
365
cf6c151c
RGS
366=head2 C<no VERSION>
367
368You can now use C<no> followed by a version number to specify that you
369want to use a version of perl older than the specified one.
370
371=head2 C<chdir>, C<chmod> and C<chown> on filehandles
372
373C<chdir>, C<chmod> and C<chown> can now work on filehandles as well as
374filenames, if the system supports respectively C<fchdir>, C<fchmod> and
375C<fchown>, thanks to a patch provided by Gisle Aas.
376
377=head2 OS groups
378
379C<$(> and C<$)> now return groups in the order where the OS returns them,
380thanks to Gisle Aas. This wasn't previously the case.
381
382=head2 Recursive sort subs
383
384You can now use recursive subroutines with sort(), thanks to Robin Houston.
385
386=head2 Exceptions in constant folding
387
388The constant folding routine is now wrapped in an exception handler, and
389if folding throws an exception (such as attempting to evaluate 0/0), perl
390now retains the current optree, rather than aborting the whole program.
254a8700
NC
391Without this change, programs would not compile if they had expressions that
392happened to generate exceptions, even though those expressions were in code
393that could never be reached at runtime. (Nicholas Clark, Dave Mitchell)
cf6c151c
RGS
394
395=head2 Source filters in @INC
396
397It's possible to enhance the mechanism of subroutine hooks in @INC by
398adding a source filter on top of the filehandle opened and returned by the
399hook. This feature was planned a long time ago, but wasn't quite working
400until now. See L<perlfunc/require> for details. (Nicholas Clark)
401
402=head2 New internal variables
403
404=over 4
405
406=item C<${^RE_DEBUG_FLAGS}>
407
408This variable controls what debug flags are in effect for the regular
409expression engine when running under C<use re "debug">. See L<re> for
410details.
411
412=item C<${^CHILD_ERROR_NATIVE}>
413
414This variable gives the native status returned by the last pipe close,
415backtick command, successful call to wait() or waitpid(), or from the
eaade56e 416system() operator. See L<perlvar> for details. (Contributed by Gisle Aas.)
cf6c151c 417
597bb945
RGS
418=item C<${^RE_TRIE_MAXBUF}>
419
420See L</"Trie optimisation of literal string alternations">.
421
422=item C<${^WIN32_SLOPPY_STAT}>
423
424See L</"Sloppy stat on Windows">.
425
cf6c151c
RGS
426=back
427
428=head2 Miscellaneous
429
430C<unpack()> now defaults to unpacking the C<$_> variable.
431
432C<mkdir()> without arguments now defaults to C<$_>.
433
434The internal dump output has been improved, so that non-printable characters
435such as newline and backspace are output in C<\x> notation, rather than
436octal.
437
438The B<-C> option can no longer be used on the C<#!> line. It wasn't
cba8bf60
RGS
439working there anyway, since the standard streams are already set up
440at this point in the execution of the perl interpreter. You can use
441binmode() instead to get the desired behaviour.
cf6c151c
RGS
442
443=head2 UCD 5.0.0
444
445The copy of the Unicode Character Database included in Perl 5 has
446been updated to version 5.0.0.
447
cf6c151c
RGS
448=head2 MAD
449
254a8700 450MAD, which stands for I<Miscellaneous Attribute Decoration>, is a
cf6c151c
RGS
451still-in-development work leading to a Perl 5 to Perl 6 converter. To
452enable it, it's necessary to pass the argument C<-Dmad> to Configure. The
254a8700 453obtained perl isn't binary compatible with a regular perl 5.10, and has
cf6c151c
RGS
454space and speed penalties; moreover not all regression tests still pass
455with it. (Larry Wall, Nicholas Clark)
456
c7d332a5
RGS
457=head2 kill() on Windows
458
459On Windows platforms, C<kill(-9, $pid)> now kills a process tree.
e1020413 460(On Unix, this delivers the signal to all processes in the same process
c7d332a5
RGS
461group.)
462
597bb945
RGS
463=head1 Incompatible Changes
464
465=head2 Packing and UTF-8 strings
466
597bb945
RGS
467The semantics of pack() and unpack() regarding UTF-8-encoded data has been
468changed. Processing is now by default character per character instead of
469byte per byte on the underlying encoding. Notably, code that used things
470like C<pack("a*", $string)> to see through the encoding of string will now
471simply get back the original $string. Packed strings can also get upgraded
472during processing when you store upgraded characters. You can get the old
473behaviour by using C<use bytes>.
474
475To be consistent with pack(), the C<C0> in unpack() templates indicates
476that the data is to be processed in character mode, i.e. character by
477character; on the contrary, C<U0> in unpack() indicates UTF-8 mode, where
478the packed string is processed in its UTF-8-encoded Unicode form on a byte
254a8700
NC
479by byte basis. This is reversed with regard to perl 5.8.X, but now consistent
480between pack() and unpack().
597bb945
RGS
481
482Moreover, C<C0> and C<U0> can also be used in pack() templates to specify
483respectively character and byte modes.
484
485C<C0> and C<U0> in the middle of a pack or unpack format now switch to the
486specified encoding mode, honoring parens grouping. Previously, parens were
487ignored.
488
489Also, there is a new pack() character format, C<W>, which is intended to
490replace the old C<C>. C<C> is kept for unsigned chars coded as bytes in
491the strings internal representation. C<W> represents unsigned (logical)
492character values, which can be greater than 255. It is therefore more
493robust when dealing with potentially UTF-8-encoded data (as C<C> will wrap
494values outside the range 0..255, and not respect the string encoding).
495
496In practice, that means that pack formats are now encoding-neutral, except
497C<C>.
498
499For consistency, C<A> in unpack() format now trims all Unicode whitespace
500from the end of the string. Before perl 5.9.2, it used to strip only the
501classical ASCII space characters.
502
503=head2 Byte/character count feature in unpack()
504
505A new unpack() template character, C<".">, returns the number of bytes or
506characters (depending on the selected encoding mode, see above) read so far.
507
508=head2 The C<$*> and C<$#> variables have been removed
509
510C<$*>, which was deprecated in favor of the C</s> and C</m> regexp
511modifiers, has been removed.
512
513The deprecated C<$#> variable (output format for numbers) has been
514removed.
515
f00638a2 516Two new severe warnings, C<$#/$* is no longer supported>, have been added.
597bb945
RGS
517
518=head2 substr() lvalues are no longer fixed-length
519
520The lvalues returned by the three argument form of substr() used to be a
521"fixed length window" on the original string. In some cases this could
522cause surprising action at distance or other undefined behaviour. Now the
523length of the window adjusts itself to the length of the string assigned to
524it.
525
526=head2 Parsing of C<-f _>
527
528The identifier C<_> is now forced to be a bareword after a filetest
529operator. This solves a number of misparsing issues when a global C<_>
530subroutine is defined.
531
532=head2 C<:unique>
533
534The C<:unique> attribute has been made a no-op, since its current
535implementation was fundamentally flawed and not threadsafe.
536
597bb945
RGS
537=head2 Effect of pragmas in eval
538
539The compile-time value of the C<%^H> hint variable can now propagate into
540eval("")uated code. This makes it more useful to implement lexical
541pragmas.
542
543As a side-effect of this, the overloaded-ness of constants now propagates
544into eval("").
545
546=head2 chdir FOO
547
548A bareword argument to chdir() is now recognized as a file handle.
549Earlier releases interpreted the bareword as a directory name.
550(Gisle Aas)
551
552=head2 Handling of .pmc files
553
554An old feature of perl was that before C<require> or C<use> look for a
555file with a F<.pm> extension, they will first look for a similar filename
556with a F<.pmc> extension. If this file is found, it will be loaded in
557place of any potentially existing file ending in a F<.pm> extension.
558
559Previously, F<.pmc> files were loaded only if more recent than the
560matching F<.pm> file. Starting with 5.9.4, they'll be always loaded if
561they exist.
562
a32521b7
JD
563=head2 $^V is now a C<version> object instead of a v-string
564
565$^V can still be used with the C<%vd> format in printf, but any
566character-level operations will now access the string representation
567of the C<version> object and not the ordinals of a v-string.
568Expressions like C<< substr($^V, 0, 2) >> or C<< split //, $^V >>
569no longer work and must be rewritten.
570
597bb945
RGS
571=head2 @- and @+ in patterns
572
573The special arrays C<@-> and C<@+> are no longer interpolated in regular
574expressions. (Sadahiro Tomoyuki)
575
576=head2 $AUTOLOAD can now be tainted
577
578If you call a subroutine by a tainted name, and if it defers to an
579AUTOLOAD function, then $AUTOLOAD will be (correctly) tainted.
580(Rick Delaney)
581
582=head2 Tainting and printf
583
584When perl is run under taint mode, C<printf()> and C<sprintf()> will now
585reject any tainted format argument. (Rafael Garcia-Suarez)
586
587=head2 undef and signal handlers
588
589Undefining or deleting a signal handler via C<undef $SIG{FOO}> is now
590equivalent to setting it to C<'DEFAULT'>. (Rafael Garcia-Suarez)
591
592=head2 strictures and dereferencing in defined()
593
254a8700 594C<use strict 'refs'> was ignoring taking a hard reference in an argument
597bb945
RGS
595to defined(), as in :
596
254a8700
NC
597 use strict 'refs';
598 my $x = 'foo';
597bb945
RGS
599 if (defined $$x) {...}
600
601This now correctly produces the run-time error C<Can't use string as a
602SCALAR ref while "strict refs" in use>.
603
604C<defined @$foo> and C<defined %$bar> are now also subject to C<strict
605'refs'> (that is, C<$foo> and C<$bar> shall be proper references there.)
606(C<defined(@foo)> and C<defined(%bar)> are discouraged constructs anyway.)
607(Nicholas Clark)
608
609=head2 C<(?p{})> has been removed
610
611The regular expression construct C<(?p{})>, which was deprecated in perl
6125.8, has been removed. Use C<(??{})> instead. (Rafael Garcia-Suarez)
613
614=head2 Pseudo-hashes have been removed
615
616Support for pseudo-hashes has been removed from Perl 5.9. (The C<fields>
617pragma remains here, but uses an alternate implementation.)
618
619=head2 Removal of the bytecode compiler and of perlcc
620
621C<perlcc>, the byteloader and the supporting modules (B::C, B::CC,
622B::Bytecode, etc.) are no longer distributed with the perl sources. Those
623experimental tools have never worked reliably, and, due to the lack of
624volunteers to keep them in line with the perl interpreter developments, it
625was decided to remove them instead of shipping a broken version of those.
626The last version of those modules can be found with perl 5.9.4.
627
628However the B compiler framework stays supported in the perl core, as with
629the more useful modules it has permitted (among others, B::Deparse and
630B::Concise).
631
632=head2 Removal of the JPL
633
ed8ea1b6 634The JPL (Java-Perl Lingo) has been removed from the perl sources tarball.
597bb945
RGS
635
636=head2 Recursive inheritance detected earlier
637
638Perl will now immediately throw an exception if you modify any package's
639C<@ISA> in such a way that it would cause recursive inheritance.
640
641Previously, the exception would not occur until Perl attempted to make
642use of the recursive inheritance while resolving a method or doing a
643C<$foo-E<gt>isa($bar)> lookup.
644
77d4bad0 645=head2 warnings::enabled and warnings::warnif changed to favor users of modules
d99e543b
JV
646
647The behaviour in 5.10.x favors the person using the module;
648The behaviour in 5.8.x favors the module writer;
649
650Assume the following code:
651
652 main calls Foo::Bar::baz()
653 Foo::Bar inherits from Foo::Base
654 Foo::Bar::baz() calls Foo::Base::_bazbaz()
655 Foo::Base::_bazbaz() calls: warnings::warnif('substr', 'some warning
656message');
657
658On 5.8.x, the code warns when Foo::Bar contains C<use warnings;>
659It does not matter if Foo::Base or main have warnings enabled
660to disable the warning one has to modify Foo::Bar.
661
662On 5.10.0 and newer, the code warns when main contains C<use warnings;>
663It does not matter if Foo::Base or Foo::Bar have warnings enabled
664to disable the warning one has to modify main.
665
cf6c151c 666=head1 Modules and Pragmata
c0c97549 667
187a0176
AD
668=head2 Upgrading individual core modules
669
670Even more core modules are now also available separately through the
671CPAN. If you wish to update one of these modules, you don't need to
672wait for a new perl release. From within the cpan shell, running the
673'r' command will report on modules with upgrades available. See
674C<perldoc CPAN> for more information.
675
f0e260b8
RGS
676=head2 Pragmata Changes
677
678=over 4
679
680=item C<feature>
681
682The new pragma C<feature> is used to enable new features that might break
683old code. See L</"The C<feature> pragma"> above.
684
685=item C<mro>
686
687This new pragma enables to change the algorithm used to resolve inherited
688methods. See L</"New Pragma, C<mro>"> above.
689
690=item Scoping of the C<sort> pragma
691
692The C<sort> pragma is now lexically scoped. Its effect used to be global.
693
694=item Scoping of C<bignum>, C<bigint>, C<bigrat>
695
696The three numeric pragmas C<bignum>, C<bigint> and C<bigrat> are now
697lexically scoped. (Tels)
698
699=item C<base>
700
701The C<base> pragma now warns if a class tries to inherit from itself.
702(Curtis "Ovid" Poe)
703
704=item C<strict> and C<warnings>
705
706C<strict> and C<warnings> will now complain loudly if they are loaded via
707incorrect casing (as in C<use Strict;>). (Johan Vromans)
708
6601a838
RGS
709=item C<version>
710
711The C<version> module provides support for version objects.
712
f0e260b8
RGS
713=item C<warnings>
714
715The C<warnings> pragma doesn't load C<Carp> anymore. That means that code
716that used C<Carp> routines without having loaded it at compile time might
717need to be adjusted; typically, the following (faulty) code won't work
718anymore, and will require parentheses to be added after the function name:
719
720 use warnings;
721 require Carp;
254a8700 722 Carp::confess 'argh';
f0e260b8
RGS
723
724=item C<less>
725
726C<less> now does something useful (or at least it tries to). In fact, it
727has been turned into a lexical pragma. So, in your modules, you can now
728test whether your users have requested to use less CPU, or less memory,
729less magic, or maybe even less fat. See L<less> for more. (Joshua ben
730Jore)
731
732=back
733
0eece9c0
RGS
734=head2 New modules
735
736=over 4
737
738=item *
739
740C<encoding::warnings>, by Audrey Tang, is a module to emit warnings
741whenever an ASCII character string containing high-bit bytes is implicitly
597bb945
RGS
742converted into UTF-8. It's a lexical pragma since Perl 5.9.4; on older
743perls, its effect is global.
0eece9c0
RGS
744
745=item *
746
747C<Module::CoreList>, by Richard Clamp, is a small handy module that tells
748you what versions of core modules ship with any versions of Perl 5. It
749comes with a command-line frontend, C<corelist>.
750
bd3831ee
RGS
751=item *
752
753C<Math::BigInt::FastCalc> is an XS-enabled, and thus faster, version of
754C<Math::BigInt::Calc>.
755
756=item *
757
758C<Compress::Zlib> is an interface to the zlib compression library. It
759comes with a bundled version of zlib, so having a working zlib is not a
760prerequisite to install it. It's used by C<Archive::Tar> (see below).
761
762=item *
763
764C<IO::Zlib> is an C<IO::>-style interface to C<Compress::Zlib>.
765
766=item *
767
768C<Archive::Tar> is a module to manipulate C<tar> archives.
769
770=item *
771
772C<Digest::SHA> is a module used to calculate many types of SHA digests,
773has been included for SHA support in the CPAN module.
774
775=item *
776
777C<ExtUtils::CBuilder> and C<ExtUtils::ParseXS> have been added.
778
597bb945
RGS
779=item *
780
781C<Hash::Util::FieldHash>, by Anno Siegel, has been added. This module
782provides support for I<field hashes>: hashes that maintain an association
783of a reference with a value, in a thread-safe garbage-collected way.
784Such hashes are useful to implement inside-out objects.
785
786=item *
787
788C<Module::Build>, by Ken Williams, has been added. It's an alternative to
789C<ExtUtils::MakeMaker> to build and install perl modules.
790
791=item *
792
793C<Module::Load>, by Jos Boumans, has been added. It provides a single
794interface to load Perl modules and F<.pl> files.
795
796=item *
797
798C<Module::Loaded>, by Jos Boumans, has been added. It's used to mark
799modules as loaded or unloaded.
800
801=item *
802
803C<Package::Constants>, by Jos Boumans, has been added. It's a simple
804helper to list all constants declared in a given package.
805
806=item *
807
808C<Win32API::File>, by Tye McQueen, has been added (for Windows builds).
809This module provides low-level access to Win32 system API calls for
810files/dirs.
811
f0e260b8
RGS
812=item *
813
814C<Locale::Maketext::Simple>, needed by CPANPLUS, is a simple wrapper around
815C<Locale::Maketext::Lexicon>. Note that C<Locale::Maketext::Lexicon> isn't
816included in the perl core; the behaviour of C<Locale::Maketext::Simple>
817gracefully degrades when the later isn't present.
818
819=item *
820
821C<Params::Check> implements a generic input parsing/checking mechanism. It
822is used by CPANPLUS.
823
824=item *
825
826C<Term::UI> simplifies the task to ask questions at a terminal prompt.
827
828=item *
829
830C<Object::Accessor> provides an interface to create per-object accessors.
831
832=item *
833
834C<Module::Pluggable> is a simple framework to create modules that accept
835pluggable sub-modules.
836
837=item *
838
839C<Module::Load::Conditional> provides simple ways to query and possibly
840load installed modules.
841
842=item *
843
844C<Time::Piece> provides an object oriented interface to time functions,
845overriding the built-ins localtime() and gmtime().
846
847=item *
848
849C<IPC::Cmd> helps to find and run external commands, possibly
850interactively.
851
852=item *
853
854C<File::Fetch> provide a simple generic file fetching mechanism.
855
856=item *
857
858C<Log::Message> and C<Log::Message::Simple> are used by the log facility
859of C<CPANPLUS>.
860
861=item *
862
863C<Archive::Extract> is a generic archive extraction mechanism
1cecf2c0 864for F<.tar> (plain, gzipped or bzipped) or F<.zip> files.
f0e260b8
RGS
865
866=item *
867
868C<CPANPLUS> provides an API and a command-line tool to access the CPAN
869mirrors.
870
e6746346
SP
871=item *
872
873C<Pod::Escapes> provides utilities that are useful in decoding Pod
874EE<lt>...E<gt> sequences.
875
876=item *
877
878C<Pod::Simple> is now the backend for several of the Pod-related modules
879included with Perl.
880
f0e260b8
RGS
881=back
882
883=head2 Selected Changes to Core Modules
884
885=over 4
886
887=item C<Attribute::Handlers>
888
889C<Attribute::Handlers> can now report the caller's file and line number.
890(David Feldman)
891
6cdf4617
RGS
892All interpreted attributes are now passed as array references. (Damian
893Conway)
894
f0e260b8
RGS
895=item C<B::Lint>
896
897C<B::Lint> is now based on C<Module::Pluggable>, and so can be extended
898with plugins. (Joshua ben Jore)
899
900=item C<B>
901
902It's now possible to access the lexical pragma hints (C<%^H>) by using the
903method B::COP::hints_hash(). It returns a C<B::RHE> object, which in turn
904can be used to get a hash reference via the method B::RHE::HASH(). (Joshua
905ben Jore)
906
907=item C<Thread>
908
909As the old 5005thread threading model has been removed, in favor of the
910ithreads scheme, the C<Thread> module is now a compatibility wrapper, to
911be used in old code only. It has been removed from the default list of
912dynamic extensions.
913
0eece9c0
RGS
914=back
915
cf6c151c 916=head1 Utility Changes
c0c97549
RGS
917
918=over 4
919
bd3831ee 920=item perl -d
c0c97549
RGS
921
922The Perl debugger can now save all debugger commands for sourcing later;
923notably, it can now emulate stepping backwards, by restarting and
924rerunning all bar the last command from a saved command history.
925
926It can also display the parent inheritance tree of a given class, with the
927C<i> command.
928
bd3831ee
RGS
929=item ptar
930
292c2b28 931C<ptar> is a pure perl implementation of C<tar> that comes with
bd3831ee
RGS
932C<Archive::Tar>.
933
934=item ptardiff
935
254a8700 936C<ptardiff> is a small utility used to generate a diff between the contents
bd3831ee
RGS
937of a tar archive and a directory tree. Like C<ptar>, it comes with
938C<Archive::Tar>.
939
940=item shasum
941
942C<shasum> is a command-line utility, used to print or to check SHA
943digests. It comes with the new C<Digest::SHA> module.
944
945=item corelist
0eece9c0
RGS
946
947The C<corelist> utility is now installed with perl (see L</"New modules">
948above).
949
bd3831ee 950=item h2ph and h2xs
0eece9c0 951
254a8700 952C<h2ph> and C<h2xs> have been made more robust with regard to
0eece9c0
RGS
953"modern" C code.
954
bd3831ee
RGS
955C<h2xs> implements a new option C<--use-xsloader> to force use of
956C<XSLoader> even in backwards compatible modules.
957
958The handling of authors' names that had apostrophes has been fixed.
959
960Any enums with negative values are now skipped.
961
962=item perlivp
963
964C<perlivp> no longer checks for F<*.ph> files by default. Use the new C<-a>
965option to run I<all> tests.
966
967=item find2perl
0eece9c0
RGS
968
969C<find2perl> now assumes C<-print> as a default action. Previously, it
970needed to be specified explicitly.
971
972Several bugs have been fixed in C<find2perl>, regarding C<-exec> and
973C<-eval>. Also the options C<-path>, C<-ipath> and C<-iname> have been
974added.
975
597bb945
RGS
976=item config_data
977
978C<config_data> is a new utility that comes with C<Module::Build>. It
979provides a command-line interface to the configuration of Perl modules
980that use Module::Build's framework of configurability (that is,
981C<*::ConfigData> modules that contain local configuration information for
982their parent modules.)
983
f00638a2 984=item cpanp
f0e260b8 985
254a8700 986C<cpanp>, the CPANPLUS shell, has been added. (C<cpanp-run-perl>, a
f0e260b8
RGS
987helper for CPANPLUS operation, has been added too, but isn't intended for
988direct use).
989
f00638a2 990=item cpan2dist
f0e260b8 991
292c2b28 992C<cpan2dist> is a new utility that comes with CPANPLUS. It's a tool to
f0e260b8
RGS
993create distributions (or packages) from CPAN modules.
994
f00638a2 995=item pod2html
f0e260b8
RGS
996
997The output of C<pod2html> has been enhanced to be more customizable via
998CSS. Some formatting problems were also corrected. (Jari Aalto)
999
c0c97549
RGS
1000=back
1001
cf6c151c 1002=head1 New Documentation
c0c97549 1003
597bb945
RGS
1004The L<perlpragma> manpage documents how to write one's own lexical
1005pragmas in pure Perl (something that is possible starting with 5.9.4).
1006
bd3831ee
RGS
1007The new L<perlglossary> manpage is a glossary of terms used in the Perl
1008documentation, technical and otherwise, kindly provided by O'Reilly Media,
1009Inc.
1010
597bb945
RGS
1011The L<perlreguts> manpage, courtesy of Yves Orton, describes internals of the
1012Perl regular expression engine.
1013
62c26f88
RGS
1014The L<perlreapi> manpage describes the interface to the perl interpreter
1015used to write pluggable regular expression engines (by Ævar Arnfjörð
1016Bjarmason).
1017
33f0d962 1018The L<perlunitut> manpage is a tutorial for programming with Unicode and
597bb945
RGS
1019string encodings in Perl, courtesy of Juerd Waalboer.
1020
f0e260b8
RGS
1021A new manual page, L<perlunifaq> (the Perl Unicode FAQ), has been added
1022(Juerd Waalboer).
1023
dbef3c66
RGS
1024The L<perlcommunity> manpage gives a description of the Perl community
1025on the Internet and in real life. (Edgar "Trizor" Bering)
1026
f00638a2
RGS
1027The L<CORE> manual page documents the C<CORE::> namespace. (Tels)
1028
c0c97549
RGS
1029The long-existing feature of C</(?{...})/> regexps setting C<$_> and pos()
1030is now documented.
1031
cf6c151c 1032=head1 Performance Enhancements
c0c97549 1033
597bb945 1034=head2 In-place sorting
0eece9c0 1035
c0c97549
RGS
1036Sorting arrays in place (C<@a = sort @a>) is now optimized to avoid
1037making a temporary copy of the array.
1038
0eece9c0
RGS
1039Likewise, C<reverse sort ...> is now optimized to sort in reverse,
1040avoiding the generation of a temporary intermediate list.
1041
597bb945 1042=head2 Lexical array access
0eece9c0 1043
c0c97549
RGS
1044Access to elements of lexical arrays via a numeric constant between 0 and
1045255 is now faster. (This used to be only the case for global arrays.)
1046
597bb945 1047=head2 XS-assisted SWASHGET
bd3831ee
RGS
1048
1049Some pure-perl code that perl was using to retrieve Unicode properties and
1050transliteration mappings has been reimplemented in XS.
1051
597bb945 1052=head2 Constant subroutines
bd3831ee
RGS
1053
1054The interpreter internals now support a far more memory efficient form of
1055inlineable constants. Storing a reference to a constant value in a symbol
1056table is equivalent to a full typeglob referencing a constant subroutine,
1057but using about 400 bytes less memory. This proxy constant subroutine is
1058automatically upgraded to a real typeglob with subroutine if necessary.
1059The approach taken is analogous to the existing space optimisation for
1060subroutine stub declarations, which are stored as plain scalars in place
1061of the full typeglob.
1062
1063Several of the core modules have been converted to use this feature for
1064their system dependent constants - as a result C<use POSIX;> now takes about
1065200K less memory.
1066
597bb945 1067=head2 C<PERL_DONT_CREATE_GVSV>
bd3831ee
RGS
1068
1069The new compilation flag C<PERL_DONT_CREATE_GVSV>, introduced as an option
1070in perl 5.8.8, is turned on by default in perl 5.9.3. It prevents perl
4cd37d19 1071from creating an empty scalar with every new typeglob. See L<perl589delta>
bd3831ee
RGS
1072for details.
1073
597bb945 1074=head2 Weak references are cheaper
bd3831ee
RGS
1075
1076Weak reference creation is now I<O(1)> rather than I<O(n)>, courtesy of
1077Nicholas Clark. Weak reference deletion remains I<O(n)>, but if deletion only
1078happens at program exit, it may be skipped completely.
1079
597bb945 1080=head2 sort() enhancements
bd3831ee
RGS
1081
1082Salvador Fandiño provided improvements to reduce the memory usage of C<sort>
1083and to speed up some cases.
1084
597bb945
RGS
1085=head2 Memory optimisations
1086
1087Several internal data structures (typeglobs, GVs, CVs, formats) have been
1088restructured to use less memory. (Nicholas Clark)
1089
1090=head2 UTF-8 cache optimisation
1091
1092The UTF-8 caching code is now more efficient, and used more often.
1093(Nicholas Clark)
1094
1095=head2 Sloppy stat on Windows
1096
1097On Windows, perl's stat() function normally opens the file to determine
1098the link count and update attributes that may have been changed through
1099hard links. Setting ${^WIN32_SLOPPY_STAT} to a true value speeds up
1100stat() by not performing this operation. (Jan Dubois)
1101
597bb945
RGS
1102=head2 Regular expressions optimisations
1103
1104=over 4
1105
1106=item Engine de-recursivised
1107
1108The regular expression engine is no longer recursive, meaning that
1109patterns that used to overflow the stack will either die with useful
1110explanations, or run to completion, which, since they were able to blow
1111the stack before, will likely take a very long time to happen. If you were
1112experiencing the occasional stack overflow (or segfault) and upgrade to
1113discover that now perl apparently hangs instead, look for a degenerate
1114regex. (Dave Mitchell)
1115
1116=item Single char char-classes treated as literals
1117
1118Classes of a single character are now treated the same as if the character
1119had been used as a literal, meaning that code that uses char-classes as an
1120escaping mechanism will see a speedup. (Yves Orton)
1121
1122=item Trie optimisation of literal string alternations
1123
1124Alternations, where possible, are optimised into more efficient matching
1125structures. String literal alternations are merged into a trie and are
1126matched simultaneously. This means that instead of O(N) time for matching
1127N alternations at a given point, the new code performs in O(1) time.
1128A new special variable, ${^RE_TRIE_MAXBUF}, has been added to fine-tune
1129this optimization. (Yves Orton)
1130
1131B<Note:> Much code exists that works around perl's historic poor
1132performance on alternations. Often the tricks used to do so will disable
1133the new optimisations. Hopefully the utility modules used for this purpose
99d59c4d 1134will be educated about these new optimisations.
597bb945
RGS
1135
1136=item Aho-Corasick start-point optimisation
1137
1138When a pattern starts with a trie-able alternation and there aren't
e15dad31 1139better optimisations available, the regex engine will use Aho-Corasick
597bb945
RGS
1140matching to find the start point. (Yves Orton)
1141
0eece9c0
RGS
1142=back
1143
cf6c151c 1144=head1 Installation and Configuration Improvements
c0c97549 1145
597bb945
RGS
1146=head2 Configuration improvements
1147
1148=over 4
1149
1150=item C<-Dusesitecustomize>
bd3831ee 1151
0eece9c0 1152Run-time customization of @INC can be enabled by passing the
597bb945 1153C<-Dusesitecustomize> flag to Configure. When enabled, this will make perl
0eece9c0
RGS
1154run F<$sitelibexp/sitecustomize.pl> before anything else. This script can
1155then be set up to add additional entries to @INC.
1156
597bb945
RGS
1157=item Relocatable installations
1158
1159There is now Configure support for creating a relocatable perl tree. If
1160you Configure with C<-Duserelocatableinc>, then the paths in @INC (and
1161everything else in %Config) can be optionally located via the path of the
1162perl executable.
1163
1164That means that, if the string C<".../"> is found at the start of any
1165path, it's substituted with the directory of $^X. So, the relocation can
1166be configured on a per-directory basis, although the default with
1167C<-Duserelocatableinc> is that everything is relocated. The initial
1168install is done to the original configured prefix.
1169
1170=item strlcat() and strlcpy()
1171
1172The configuration process now detects whether strlcat() and strlcpy() are
1173available. When they are not available, perl's own version is used (from
1174Russ Allbery's public domain implementation). Various places in the perl
1175interpreter now use them. (Steve Peters)
1176
f0e260b8
RGS
1177=item C<d_pseudofork> and C<d_printf_format_null>
1178
1179A new configuration variable, available as C<$Config{d_pseudofork}> in
1180the L<Config> module, has been added, to distinguish real fork() support
1181from fake pseudofork used on Windows platforms.
1182
1183A new configuration variable, C<d_printf_format_null>, has been added,
1184to see if printf-like formats are allowed to be NULL.
1185
1186=item Configure help
1187
1188C<Configure -h> has been extended with the most commonly used options.
1189
597bb945
RGS
1190=back
1191
1192=head2 Compilation improvements
1193
1194=over 4
1195
1196=item Parallel build
0eece9c0 1197
bd3831ee
RGS
1198Parallel makes should work properly now, although there may still be problems
1199if C<make test> is instructed to run in parallel.
1200
597bb945
RGS
1201=item Borland's compilers support
1202
bd3831ee
RGS
1203Building with Borland's compilers on Win32 should work more smoothly. In
1204particular Steve Hay has worked to side step many warnings emitted by their
1205compilers and at least one C compiler internal error.
1206
597bb945
RGS
1207=item Static build on Windows
1208
f0e260b8
RGS
1209Perl extensions on Windows now can be statically built into the Perl DLL.
1210
1211Also, it's now possible to build a C<perl-static.exe> that doesn't depend
1212on the Perl DLL on Win32. See the Win32 makefiles for details.
1213(Vadim Konovalov)
bd3831ee 1214
69d2c521 1215=item ppport.h files
597bb945
RGS
1216
1217All F<ppport.h> files in the XS modules bundled with perl are now
1218autogenerated at build time. (Marcus Holland-Moritz)
1219
f0e260b8
RGS
1220=item C++ compatibility
1221
1222Efforts have been made to make perl and the core XS modules compilable
1223with various C++ compilers (although the situation is not perfect with
1224some of the compilers on some of the platforms tested.)
1225
597bb945
RGS
1226=item Support for Microsoft 64-bit compiler
1227
1228Support for building perl with Microsoft's 64-bit compiler has been
1229improved. (ActiveState)
1230
f0e260b8
RGS
1231=item Visual C++
1232
c01f0d41 1233Perl can now be compiled with Microsoft Visual C++ 2005 (and 2008 Beta 2).
f0e260b8
RGS
1234
1235=item Win32 builds
1236
1237All win32 builds (MS-Win, WinCE) have been merged and cleaned up.
1238
597bb945
RGS
1239=back
1240
1241=head2 Installation improvements
1242
1243=over 4
1244
1245=item Module auxiliary files
1246
1247README files and changelogs for CPAN modules bundled with perl are no
1248longer installed.
1249
1250=back
1251
bd3831ee
RGS
1252=head2 New Or Improved Platforms
1253
597bb945 1254Perl has been reported to work on Symbian OS. See L<perlsymbian> for more
bd3831ee
RGS
1255information.
1256
597bb945
RGS
1257Many improvements have been made towards making Perl work correctly on
1258z/OS.
1259
f0e260b8 1260Perl has been reported to work on DragonFlyBSD and MidnightBSD.
597bb945 1261
3af9ce7f
RGS
1262Perl has also been reported to work on NexentaOS
1263( http://www.gnusolaris.org/ ).
1264
bd3831ee
RGS
1265The VMS port has been improved. See L<perlvms>.
1266
d43695a1
RGS
1267Support for Cray XT4 Catamount/Qk has been added. See
1268F<hints/catamount.sh> in the source code distribution for more
1269information.
bd3831ee 1270
f0e260b8
RGS
1271Vendor patches have been merged for RedHat and Gentoo.
1272
1273DynaLoader::dl_unload_file() now works on Windows.
bd3831ee 1274
cf6c151c 1275=head1 Selected Bug Fixes
c0c97549 1276
bd3831ee
RGS
1277=over 4
1278
1279=item strictures in regexp-eval blocks
1280
c0c97549
RGS
1281C<strict> wasn't in effect in regexp-eval blocks (C</(?{...})/>).
1282
bd3831ee
RGS
1283=item Calling CORE::require()
1284
1285CORE::require() and CORE::do() were always parsed as require() and do()
1286when they were overridden. This is now fixed.
1287
1288=item Subscripts of slices
1289
1290You can now use a non-arrowed form for chained subscripts after a list
1291slice, like in:
1292
1293 ({foo => "bar"})[0]{foo}
1294
1295This used to be a syntax error; a C<< -> >> was required.
1296
1297=item C<no warnings 'category'> works correctly with -w
1298
1299Previously when running with warnings enabled globally via C<-w>, selective
1300disabling of specific warning categories would actually turn off all warnings.
1301This is now fixed; now C<no warnings 'io';> will only turn off warnings in the
1302C<io> class. Previously it would erroneously turn off all warnings.
1303
597bb945 1304=item threads improvements
bd3831ee
RGS
1305
1306Several memory leaks in ithreads were closed. Also, ithreads were made
1307less memory-intensive.
1308
597bb945
RGS
1309C<threads> is now a dual-life module, also available on CPAN. It has been
1310expanded in many ways. A kill() method is available for thread signalling.
1311One can get thread status, or the list of running or joinable threads.
1312
1313A new C<< threads->exit() >> method is used to exit from the application
1314(this is the default for the main thread) or from the current thread only
1315(this is the default for all other threads). On the other hand, the exit()
1316built-in now always causes the whole application to terminate. (Jerry
1317D. Hedden)
1318
bd3831ee
RGS
1319=item chr() and negative values
1320
1321chr() on a negative value now gives C<\x{FFFD}>, the Unicode replacement
1322character, unless when the C<bytes> pragma is in effect, where the low
20d131f3 1323eight bits of the value are used.
bd3831ee 1324
597bb945
RGS
1325=item PERL5SHELL and tainting
1326
1327On Windows, the PERL5SHELL environment variable is now checked for
1328taintedness. (Rafael Garcia-Suarez)
1329
1330=item Using *FILE{IO}
1331
1332C<stat()> and C<-X> filetests now treat *FILE{IO} filehandles like *FILE
1333filehandles. (Steve Peters)
1334
1335=item Overloading and reblessing
1336
1337Overloading now works when references are reblessed into another class.
1338Internally, this has been implemented by moving the flag for "overloading"
1339from the reference to the referent, which logically is where it should
1340always have been. (Nicholas Clark)
1341
1342=item Overloading and UTF-8
1343
1344A few bugs related to UTF-8 handling with objects that have
1345stringification overloaded have been fixed. (Nicholas Clark)
1346
1347=item eval memory leaks fixed
1348
1349Traditionally, C<eval 'syntax error'> has leaked badly. Many (but not all)
1350of these leaks have now been eliminated or reduced. (Dave Mitchell)
1351
1352=item Random device on Windows
1353
1354In previous versions, perl would read the file F</dev/urandom> if it
1355existed when seeding its random number generator. That file is unlikely
1356to exist on Windows, and if it did would probably not contain appropriate
1357data, so perl no longer tries to read it on Windows. (Alex Davies)
1358
1359=item PERLIO_DEBUG
1360
254a8700 1361The C<PERLIO_DEBUG> environment variable no longer has any effect for
597bb945
RGS
1362setuid scripts and for scripts run with B<-T>.
1363
1364Moreover, with a thread-enabled perl, using C<PERLIO_DEBUG> could lead to
1365an internal buffer overflow. This has been fixed.
1366
f0e260b8
RGS
1367=item PerlIO::scalar and read-only scalars
1368
1369PerlIO::scalar will now prevent writing to read-only scalars. Moreover,
1370seek() is now supported with PerlIO::scalar-based filehandles, the
1371underlying string being zero-filled as needed. (Rafael, Jarkko Hietaniemi)
1372
1373=item study() and UTF-8
1374
1375study() never worked for UTF-8 strings, but could lead to false results.
1376It's now a no-op on UTF-8 data. (Yves Orton)
1377
1378=item Critical signals
1379
1380The signals SIGILL, SIGBUS and SIGSEGV are now always delivered in an
1381"unsafe" manner (contrary to other signals, that are deferred until the
1382perl interpreter reaches a reasonably stable state; see
1383L<perlipc/"Deferred Signals (Safe Signals)">). (Rafael)
1384
1385=item @INC-hook fix
1386
1387When a module or a file is loaded through an @INC-hook, and when this hook
1388has set a filename entry in %INC, __FILE__ is now set for this module
1389accordingly to the contents of that %INC entry. (Rafael)
1390
1391=item C<-t> switch fix
1392
1393The C<-w> and C<-t> switches can now be used together without messing
254a8700 1394up which categories of warnings are activated. (Rafael)
f0e260b8
RGS
1395
1396=item Duping UTF-8 filehandles
1397
1398Duping a filehandle which has the C<:utf8> PerlIO layer set will now
1399properly carry that layer on the duped filehandle. (Rafael)
1400
1401=item Localisation of hash elements
1402
292c2b28 1403Localizing a hash element whose key was given as a variable didn't work
f0e260b8
RGS
1404correctly if the variable was changed while the local() was in effect (as
1405in C<local $h{$x}; ++$x>). (Bo Lindbergh)
1406
bd3831ee 1407=back
0eece9c0 1408
cf6c151c 1409=head1 New or Changed Diagnostics
c0c97549 1410
bd3831ee
RGS
1411=over 4
1412
d43695a1
RGS
1413=item Use of uninitialized value
1414
1415Perl will now try to tell you the name of the variable (if any) that was
1416undefined.
1417
bd3831ee
RGS
1418=item Deprecated use of my() in false conditional
1419
c0c97549
RGS
1420A new deprecation warning, I<Deprecated use of my() in false conditional>,
1421has been added, to warn against the use of the dubious and deprecated
1422construct
1423
1424 my $x if 0;
1425
1426See L<perldiag>. Use C<state> variables instead.
1427
bd3831ee
RGS
1428=item !=~ should be !~
1429
0eece9c0
RGS
1430A new warning, C<!=~ should be !~>, is emitted to prevent this misspelling
1431of the non-matching operator.
1432
bd3831ee
RGS
1433=item Newline in left-justified string
1434
0eece9c0
RGS
1435The warning I<Newline in left-justified string> has been removed.
1436
bd3831ee
RGS
1437=item Too late for "-T" option
1438
0eece9c0
RGS
1439The error I<Too late for "-T" option> has been reformulated to be more
1440descriptive.
1441
bd3831ee
RGS
1442=item "%s" variable %s masks earlier declaration
1443
1444This warning is now emitted in more consistent cases; in short, when one
1445of the declarations involved is a C<my> variable:
1446
1447 my $x; my $x; # warns
1448 my $x; our $x; # warns
1449 our $x; my $x; # warns
1450
1451On the other hand, the following:
1452
1453 our $x; our $x;
1454
1455now gives a C<"our" variable %s redeclared> warning.
1456
1457=item readdir()/closedir()/etc. attempted on invalid dirhandle
1458
1459These new warnings are now emitted when a dirhandle is used but is
1460either closed or not really a dirhandle.
1461
f0e260b8
RGS
1462=item Opening dirhandle/filehandle %s also as a file/directory
1463
1464Two deprecation warnings have been added: (Rafael)
1465
1466 Opening dirhandle %s also as a file
1467 Opening filehandle %s also as a directory
1468
f00638a2
RGS
1469=item Use of -P is deprecated
1470
1471Perl's command-line switch C<-P> is now deprecated.
1472
6601a838
RGS
1473=item v-string in use/require is non-portable
1474
1475Perl will warn you against potential backwards compatibility problems with
1476the C<use VERSION> syntax.
1477
bd3831ee
RGS
1478=item perl -V
1479
0eece9c0
RGS
1480C<perl -V> has several improvements, making it more useable from shell
1481scripts to get the value of configuration variables. See L<perlrun> for
1482details.
1483
bd3831ee
RGS
1484=back
1485
cf6c151c 1486=head1 Changed Internals
c0c97549 1487
16993b2e
JH
1488In general, the source code of perl has been refactored, tidied up,
1489and optimized in many places. Also, memory management and allocation
1490has been improved in several points.
1491
1492When compiling the perl core with gcc, as many gcc warning flags are
1493turned on as is possible on the platform. (This quest for cleanliness
1494doesn't extend to XS code because we cannot guarantee the tidiness of
1495code we didn't write.) Similar strictness flags have been added or
1496tightened for various other C compilers.
bd3831ee 1497
c0c97549
RGS
1498=head2 Reordering of SVt_* constants
1499
1500The relative ordering of constants that define the various types of C<SV>
1501have changed; in particular, C<SVt_PVGV> has been moved before C<SVt_PVLV>,
1502C<SVt_PVAV>, C<SVt_PVHV> and C<SVt_PVCV>. This is unlikely to make any
1503difference unless you have code that explicitly makes assumptions about that
1504ordering. (The inheritance hierarchy of C<B::*> objects has been changed
1505to reflect this.)
1506
254a8700
NC
1507=head2 Elimination of SVt_PVBM
1508
e1020413 1509Related to this, the internal type C<SVt_PVBM> has been removed. This
254a8700
NC
1510dedicated type of C<SV> was used by the C<index> operator and parts of the
1511regexp engine to facilitate fast Boyer-Moore matches. Its use internally has
1512been replaced by C<SV>s of type C<SVt_PVGV>.
1513
1514=head2 New type SVt_BIND
1515
1516A new type C<SVt_BIND> has been added, in readiness for the project to
1517implement Perl 6 on 5. There deliberately is no implementation yet, and
1518they cannot yet be created or destroyed.
1519
c0c97549
RGS
1520=head2 Removal of CPP symbols
1521
1522The C preprocessor symbols C<PERL_PM_APIVERSION> and
1523C<PERL_XS_APIVERSION>, which were supposed to give the version number of
1524the oldest perl binary-compatible (resp. source-compatible) with the
1525present one, were not used, and sometimes had misleading values. They have
1526been removed.
1527
1528=head2 Less space is used by ops
1529
1530The C<BASEOP> structure now uses less space. The C<op_seq> field has been
254a8700 1531removed and replaced by a single bit bit-field C<op_opt>. C<op_type> is now 9
c0c97549
RGS
1532bits long. (Consequently, the C<B::OP> class doesn't provide an C<seq>
1533method anymore.)
1534
1535=head2 New parser
1536
1537perl's parser is now generated by bison (it used to be generated by
1538byacc.) As a result, it seems to be a bit more robust.
1539
bd3831ee
RGS
1540Also, Dave Mitchell improved the lexer debugging output under C<-DT>.
1541
1542=head2 Use of C<const>
1543
1544Andy Lester supplied many improvements to determine which function
1545parameters and local variables could actually be declared C<const> to the C
1546compiler. Steve Peters provided new C<*_set> macros and reworked the core to
1547use these rather than assigning to macros in LVALUE context.
1548
1549=head2 Mathoms
1550
1551A new file, F<mathoms.c>, has been added. It contains functions that are
1552no longer used in the perl core, but that remain available for binary or
1553source compatibility reasons. However, those functions will not be
1554compiled in if you add C<-DNO_MATHOMS> in the compiler flags.
1555
1556=head2 C<AvFLAGS> has been removed
1557
1558The C<AvFLAGS> macro has been removed.
1559
1560=head2 C<av_*> changes
1561
1562The C<av_*()> functions, used to manipulate arrays, no longer accept null
1563C<AV*> parameters.
1564
597bb945
RGS
1565=head2 $^H and %^H
1566
1567The implementation of the special variables $^H and %^H has changed, to
254a8700 1568allow implementing lexical pragmas in pure Perl.
597bb945 1569
bd3831ee
RGS
1570=head2 B:: modules inheritance changed
1571
1572The inheritance hierarchy of C<B::> modules has changed; C<B::NV> now
1573inherits from C<B::SV> (it used to inherit from C<B::IV>).
1574
f0e260b8
RGS
1575=head2 Anonymous hash and array constructors
1576
1577The anonymous hash and array constructors now take 1 op in the optree
1578instead of 3, now that pp_anonhash and pp_anonlist return a reference to
33f0d962 1579a hash/array when the op is flagged with OPf_SPECIAL. (Nicholas Clark)
f0e260b8 1580
cf6c151c 1581=head1 Known Problems
c0c97549
RGS
1582
1583There's still a remaining problem in the implementation of the lexical
1584C<$_>: it doesn't work inside C</(?{...})/> blocks. (See the TODO test in
1585F<t/op/mydef.t>.)
1586
3af9ce7f
RGS
1587Stacked filetest operators won't work when the C<filetest> pragma is in
1588effect, because they rely on the stat() buffer C<_> being populated, and
1589filetest bypasses stat().
1590
a3d15f9a
RGS
1591=head2 UTF-8 problems
1592
1593The handling of Unicode still is unclean in several places, where it's
1594dependent on whether a string is internally flagged as UTF-8. This will
1595be made more consistent in perl 5.12, but that won't be possible without
1596a certain amount of backwards incompatibility.
1597
1598=head1 Platform Specific Problems
1599
48d64ccf
RGS
1600When compiled with g++ and thread support on Linux, it's reported that the
1601C<$!> stops working correctly. This is related to the fact that the glibc
1602provides two strerror_r(3) implementation, and perl selects the wrong
1603one.
1604
cf6c151c
RGS
1605=head1 Reporting Bugs
1606
a3d15f9a
RGS
1607If you find what you think is a bug, you might check the articles
1608recently posted to the comp.lang.perl.misc newsgroup and the perl
1609bug database at http://rt.perl.org/rt3/ . There may also be
1610information at http://www.perl.org/ , the Perl Home Page.
1611
1612If you believe you have an unreported bug, please run the B<perlbug>
1613program included with your release. Be sure to trim your bug down
1614to a tiny but sufficient test case. Your bug report, along with the
1615output of C<perl -V>, will be sent off to perlbug@perl.org to be
1616analysed by the Perl porting team.
1617
cf6c151c
RGS
1618=head1 SEE ALSO
1619
1620The F<Changes> file and the perl590delta to perl595delta man pages for
1621exhaustive details on what changed.
1622
1623The F<INSTALL> file for how to build Perl.
1624
1625The F<README> file for general stuff.
1626
1627The F<Artistic> and F<Copying> files for copyright information.
1628
1629=cut