Commit | Line | Data |
---|---|---|
44691e6f AB |
1 | =encoding utf8 |
2 | ||
3 | =head1 NAME | |
4 | ||
5ed58cbd | 5 | perldelta - what is new for perl v5.18.0 |
e128ab2c | 6 | |
4eabcf70 | 7 | =head1 DESCRIPTION |
6db9054f | 8 | |
5ed58cbd | 9 | This document describes differences between the 5.16.0 release and the 5.18.0 |
e08634c5 | 10 | release. |
6db9054f | 11 | |
d5f315e8 | 12 | If you are upgrading from an earlier release such as 5.14.0, first read |
5ed58cbd | 13 | L<perl5160delta>, which describes differences between 5.14.0 and 5.16.0. |
3f01b192 | 14 | |
5ed58cbd | 15 | =head1 Core Enhancements |
3f01b192 | 16 | |
5ed58cbd | 17 | =head2 New mechanism for experimental features |
82d98f72 | 18 | |
5ed58cbd | 19 | Newly-added experimental features will now require this incantation: |
82d98f72 | 20 | |
5ed58cbd RS |
21 | no warnings "experimental::feature_name"; |
22 | use feature "feature_name"; # would warn without the prev line | |
23 | ||
24 | There is a new warnings category, called "experimental", containing | |
25 | warnings that the L<feature> pragma emits when enabling experimental | |
26 | features. | |
27 | ||
28 | Newly-added experimental features will also be given special warning IDs, | |
29 | which consist of "experimental::" followed by the name of the feature. (The | |
30 | plan is to extend this mechanism eventually to all warnings, to allow them | |
31 | to be enabled or disabled individually, and not just by category.) | |
32 | ||
33 | By saying | |
34 | ||
35 | no warnings "experimental::feature_name"; | |
36 | ||
37 | you are taking responsibility for any breakage that future changes to, or | |
38 | removal of, the feature may cause. | |
39 | ||
40 | Existing experimental features may begin emitting these warnings, too. Please | |
41 | consult L<perlexperiment> for information on which features are considered | |
42 | experimental. | |
43 | ||
44 | =head2 Hash overhaul | |
45 | ||
46 | Changes to the implementation of hashes in perl 5.18.0 will be one of the most | |
47 | visible changes to the behavior of existing code. For the most part, these | |
48 | changes will be visible as two distinct hash variables now providing their | |
49 | contents in a different order where it was previously identical. When | |
50 | encountering these changes, the key to cleaning up from them is to accept that | |
51 | B<hashes are unordered collections> and to act accordingly. | |
52 | ||
53 | =head3 Hash randomization | |
54 | ||
55 | The seed used by Perl's hash function is now random. This means that the | |
56 | order which keys/values will be returned from functions like C<keys()>, | |
57 | C<values()>, and C<each()> will differ from run to run. | |
58 | ||
59 | This change was introduced to make Perl's hashes more robust to algorithmic | |
60 | complexity attacks, and also because we discovered that it exposes hash | |
61 | ordering dependency bugs and makes them easier to track down. | |
62 | ||
63 | Toolchain maintainers might want to invest in additional infrastructure to | |
64 | test for things like this. Running tests several times in a row and then | |
65 | comparing results will make it easier to spot hash order dependencies in | |
66 | code. Authors are strongly encouraged not to expose the key order of | |
67 | Perl's hashes to insecure audiences. | |
68 | ||
69 | Further, every hash has its own iteration order, which should make it much | |
70 | more difficult to determine what the current hash seed is. | |
71 | ||
72 | =head3 New hash function: Murmurhash-32 | |
73 | ||
74 | We have switched Perl's hash function to use Murmurhash-32, and added build | |
75 | support for several other hash functions. This new function is expected to | |
76 | perform equivalently to the old one for shorter strings and is faster for | |
77 | hashing longer strings. | |
78 | ||
f105b7be | 79 | =head3 PERL_HASH_SEED environment variable now takes a hex value |
5ed58cbd | 80 | |
f105b7be | 81 | C<PERL_HASH_SEED> no longer accepts an integer as a parameter; instead the |
5ed58cbd RS |
82 | value is expected to be a binary string encoded in hex. This is to make |
83 | the infrastructure support hash seeds of arbitrary lengths which might | |
84 | exceed that of an integer. (SipHash uses a 16 byte seed). | |
85 | ||
c40c48bb RS |
86 | =head3 PERL_PERTURB_KEYS environment variable added |
87 | ||
f105b7be | 88 | The C<PERL_PERTURB_KEYS> environment variable allows one to control the level of |
c40c48bb RS |
89 | randomization applied to C<keys> and friends. |
90 | ||
f105b7be | 91 | When C<PERL_PERTURB_KEYS> is 0, perl will not randomize the key order at all. The |
c40c48bb RS |
92 | chance that C<keys> changes due to an insert will be the same as in previous |
93 | perls, basically only when the bucket size is changed. | |
94 | ||
f105b7be | 95 | When C<PERL_PERTURB_KEYS> is 1, perl will randomize keys in a non-repeatable |
c40c48bb RS |
96 | way. The chance that C<keys> changes due to an insert will be very high. This |
97 | is the most secure and default mode. | |
98 | ||
f105b7be KE |
99 | When C<PERL_PERTURB_KEYS> is 2, perl will randomize keys in a repeatable way. |
100 | Repeated runs of the same program should produce the same output every time. | |
c40c48bb RS |
101 | The chance that keys changes due to an insert will be very high. |
102 | ||
f105b7be KE |
103 | C<PERL_HASH_SEED> implies a non-default C<PERL_PERTURB_KEYS> setting. Setting |
104 | C<PERL_HASH_SEED=0> (exactly one 0) implies C<PERL_PERTURB_KEYS=0> (hash key | |
105 | randomization disabled); settng C<PERL_HASH_SEED> to any other value implies | |
106 | C<PERL_PERTURB_KEYS=2> (deterministic and repeatable hash key randomization). | |
107 | Specifying C<PERL_PERTURB_KEYS> explicitly to a different level overrides this | |
c40c48bb RS |
108 | behavior. |
109 | ||
5ed58cbd RS |
110 | =head3 Hash::Util::hash_seed() now returns a string |
111 | ||
112 | Hash::Util::hash_seed() now returns a string instead of an integer. This | |
113 | is to make the infrastructure support hash seeds of arbitrary lengths | |
114 | which might exceed that of an integer. (SipHash uses a 16 byte seed). | |
115 | ||
116 | =head3 Output of PERL_HASH_SEED_DEBUG has been changed | |
117 | ||
118 | The environment variable PERL_HASH_SEED_DEBUG now makes perl show both the | |
f105b7be | 119 | hash function perl was built with, I<and> the seed, in hex, in use for that |
5ed58cbd RS |
120 | process. Code parsing this output, should it exist, must change to accommodate |
121 | the new format. Example of the new format: | |
122 | ||
123 | $ PERL_HASH_SEED_DEBUG=1 ./perl -e1 | |
124 | HASH_FUNCTION = MURMUR3 HASH_SEED = 0x1476bb9f | |
125 | ||
126 | =head2 Upgrade to Unicode 6.2 | |
127 | ||
2e7bc647 | 128 | Perl now supports Unicode 6.2. A list of changes from Unicode |
5ed58cbd RS |
129 | 6.1 is at L<http://www.unicode.org/versions/Unicode6.2.0>. |
130 | ||
131 | =head2 Character name aliases may now include non-Latin1-range characters | |
132 | ||
133 | It is possible to define your own names for characters for use in | |
134 | C<\N{...}>, C<charnames::vianame()>, etc. These names can now be | |
135 | comprised of characters from the whole Unicode range. This allows for | |
136 | names to be in your native language, and not just English. Certain | |
137 | restrictions apply to the characters that may be used (you can't define | |
138 | a name that has punctuation in it, for example). See L<charnames/CUSTOM | |
139 | ALIASES>. | |
140 | ||
141 | =head2 New DTrace probes | |
142 | ||
143 | The following new DTrace probes have been added: | |
14731ad1 | 144 | |
337fb649 | 145 | =over 4 |
14731ad1 | 146 | |
82d98f72 | 147 | =item * |
14731ad1 | 148 | |
5ed58cbd RS |
149 | C<op-entry> |
150 | ||
151 | =item * | |
152 | ||
153 | C<loading-file> | |
154 | ||
155 | =item * | |
156 | ||
157 | C<loaded-file> | |
158 | ||
159 | =back | |
160 | ||
161 | =head2 C<${^LAST_FH}> | |
162 | ||
163 | This new variable provides access to the filehandle that was last read. | |
164 | This is the handle used by C<$.> and by C<tell> and C<eof> without | |
165 | arguments. | |
166 | ||
167 | =head2 Regular Expression Set Operations | |
168 | ||
169 | This is an B<experimental> feature to allow matching against the union, | |
170 | intersection, etc., of sets of code points, similar to | |
171 | L<Unicode::Regex::Set>. It can also be used to extend C</x> processing | |
172 | to [bracketed] character classes, and as a replacement of user-defined | |
173 | properties, allowing more complex expressions than they do. See | |
174 | L<perlrecharclass/Extended Bracketed Character Classes>. | |
175 | ||
176 | =head2 Lexical subroutines | |
177 | ||
178 | This new feature is still considered B<experimental>. To enable it: | |
179 | ||
180 | use 5.018; | |
181 | no warnings "experimental::lexical_subs"; | |
182 | use feature "lexical_subs"; | |
183 | ||
184 | You can now declare subroutines with C<state sub foo>, C<my sub foo>, and | |
185 | C<our sub foo>. (C<state sub> requires that the "state" feature be | |
186 | enabled, unless you write it as C<CORE::state sub foo>.) | |
187 | ||
188 | C<state sub> creates a subroutine visible within the lexical scope in which | |
189 | it is declared. The subroutine is shared between calls to the outer sub. | |
190 | ||
191 | C<my sub> declares a lexical subroutine that is created each time the | |
192 | enclosing block is entered. C<state sub> is generally slightly faster than | |
193 | C<my sub>. | |
194 | ||
195 | C<our sub> declares a lexical alias to the package subroutine of the same | |
196 | name. | |
197 | ||
198 | For more information, see L<perlsub/Lexical Subroutines>. | |
199 | ||
200 | =head2 Computed Labels | |
201 | ||
202 | The loop controls C<next>, C<last> and C<redo>, and the special C<dump> | |
203 | operator, now allow arbitrary expressions to be used to compute labels at run | |
204 | time. Previously, any argument that was not a constant was treated as the | |
205 | empty string. | |
206 | ||
207 | =head2 More CORE:: subs | |
208 | ||
209 | Several more built-in functions have been added as subroutines to the | |
210 | CORE:: namespace - namely, those non-overridable keywords that can be | |
211 | implemented without custom parsers: C<defined>, C<delete>, C<exists>, | |
212 | C<glob>, C<pos>, C<protoytpe>, C<scalar>, C<split>, C<study>, and C<undef>. | |
213 | ||
214 | As some of these have prototypes, C<prototype('CORE::...')> has been | |
215 | changed to not make a distinction between overridable and non-overridable | |
216 | keywords. This is to make C<prototype('CORE::pos')> consistent with | |
217 | C<prototype(&CORE::pos)>. | |
218 | ||
219 | =head2 C<kill> with negative signal names | |
220 | ||
221 | C<kill> has always allowed a negative signal number, which kills the | |
222 | process group instead of a single process. It has also allowed signal | |
223 | names. But it did not behave consistently, because negative signal names | |
224 | were treated as 0. Now negative signals names like C<-INT> are supported | |
225 | and treated the same way as -2 [perl #112990]. | |
226 | ||
227 | =head1 Security | |
228 | ||
229 | =head2 C<Storable> security warning in documentation | |
230 | ||
231 | The documentation for C<Storable> now includes a section which warns readers | |
232 | of the danger of accepting Storable documents from untrusted sources. The | |
233 | short version is that deserializing certain types of data can lead to loading | |
234 | modules and other code execution. This is documented behavior and wanted | |
235 | behavior, but this opens an attack vector for malicious entities. | |
236 | ||
237 | =head2 C<Locale::Maketext> allowed code injection via a malicious template | |
238 | ||
239 | If users could provide a translation string to Locale::Maketext, this could be | |
240 | used to invoke arbitrary Perl subroutines available in the current process. | |
241 | ||
242 | This has been fixed, but it is still possible to invoke any method provided by | |
243 | C<Locale::Maketext> itself or a subclass that you are using. One of these | |
244 | methods in turn will invoke the Perl core's C<sprintf> subroutine. | |
245 | ||
246 | In summary, allowing users to provide translation strings without auditing | |
247 | them is a bad idea. | |
248 | ||
249 | This vulnerability is documented in CVE-2012-6329. | |
250 | ||
251 | =head2 Avoid calling memset with a negative count | |
252 | ||
253 | Poorly written perl code that allows an attacker to specify the count to perl's | |
254 | C<x> string repeat operator can already cause a memory exhaustion | |
255 | denial-of-service attack. A flaw in versions of perl before 5.15.5 can escalate | |
256 | that into a heap buffer overrun; coupled with versions of glibc before 2.16, it | |
257 | possibly allows the execution of arbitrary code. | |
258 | ||
259 | The flaw addressed to this commit has been assigned identifier CVE-2012-5195 | |
260 | and was researched by Tim Brown. | |
261 | ||
262 | =head1 Incompatible Changes | |
263 | ||
264 | =head2 See also: hash overhaul | |
265 | ||
266 | Some of the changes in the L<hash overhaul|/"Hash overhaul"> are not fully | |
267 | compatible with previous versions of perl. Please read that section. | |
268 | ||
269 | =head2 An unknown character name in C<\N{...}> is now a syntax error | |
270 | ||
271 | Previously, it warned, and the Unicode REPLACEMENT CHARACTER was | |
272 | substituted. Unicode now recommends that this situation be a syntax | |
273 | error. Also, the previous behavior led to some confusing warnings and | |
274 | behaviors, and since the REPLACEMENT CHARACTER has no use other than as | |
275 | a stand-in for some unknown character, any code that has this problem is | |
276 | buggy. | |
277 | ||
278 | =head2 Formerly deprecated characters in C<\N{}> character name aliases are now errors. | |
279 | ||
280 | Since v5.12.0, it has been deprecated to use certain characters in | |
281 | user-defined C<\N{...}> character names. These now cause a syntax | |
282 | error. For example, it is now an error to begin a name with a digit, | |
283 | such as in | |
284 | ||
285 | my $undraftable = "\N{4F}"; # Syntax error! | |
286 | ||
287 | or to have commas anywhere in the name. See L<charnames/CUSTOM ALIASES> | |
288 | ||
289 | =head2 C<\N{BELL}> now refers to U+1F514 instead of U+0007 | |
290 | ||
291 | Unicode 6.0 reused the name "BELL" for a different code point than it | |
292 | traditionally had meant. Since Perl v5.14, use of this name still | |
293 | referred to U+0007, but would raise a deprecation warning. Now, "BELL" | |
294 | refers to U+1F514, and the name for U+0007 is "ALERT". All the | |
295 | functions in L<charnames> have been correspondingly updated. | |
296 | ||
297 | =head2 New Restrictions in Multi-Character Case-Insensitive Matching in Regular Expression Bracketed Character Classes | |
298 | ||
299 | Unicode has now withdrawn their previous recommendation for regular | |
300 | expressions to automatically handle cases where a single character can | |
301 | match multiple characters case-insensitively, for example the letter | |
302 | LATIN SMALL LETTER SHARP S and the sequence C<ss>. This is because | |
303 | it turns out to be impracticable to do this correctly in all | |
304 | circumstances. Because Perl has tried to do this as best it can, it | |
305 | will continue to do so. (We are considering an option to turn it off.) | |
306 | However, a new restriction is being added on such matches when they | |
307 | occur in [bracketed] character classes. People were specifying | |
308 | things such as C</[\0-\xff]/i>, and being surprised that it matches the | |
309 | two character sequence C<ss> (since LATIN SMALL LETTER SHARP S occurs in | |
310 | this range). This behavior is also inconsistent with using a | |
311 | property instead of a range: C<\p{Block=Latin1}> also includes LATIN | |
312 | SMALL LETTER SHARP S, but C</[\p{Block=Latin1}]/i> does not match C<ss>. | |
313 | The new rule is that for there to be a multi-character case-insensitive | |
314 | match within a bracketed character class, the character must be | |
315 | explicitly listed, and not as an end point of a range. This more | |
316 | closely obeys the Principle of Least Astonishment. See | |
317 | L<perlrecharclass/Bracketed Character Classes>. Note that a bug [perl | |
318 | #89774], now fixed as part of this change, prevented the previous | |
319 | behavior from working fully. | |
320 | ||
321 | =head2 Explicit rules for variable names and identifiers | |
322 | ||
323 | Due to an oversight, length-one variable names in 5.16 were completely | |
324 | unrestricted, and opened the door to several kinds of insanity. As of | |
325 | 5.18, these now follow the rules of other identifiers, in addition | |
326 | to accepting characters that match the C<\p{POSIX_Punct}> property. | |
327 | ||
328 | There are no longer any differences in the parsing of identifiers | |
329 | specified as C<$...> or C<${...}>; previously, they were dealt with in | |
330 | different parts of the core, and so had slightly different behavior. For | |
331 | instance, C<${foo:bar}> was a legal variable name. Since they are now | |
332 | both parsed by the same code, that is no longer the case. | |
333 | ||
334 | =head2 C<\s> in regular expressions now matches a Vertical Tab | |
335 | ||
336 | No one could recall why C<\s> didn't match C<\cK>, the vertical tab. | |
337 | Now it does. Given the extreme rarity of that character, very little | |
338 | breakage is expected. | |
339 | ||
340 | =head2 C</(?{})/> and C</(??{})/> have been heavily reworked | |
341 | ||
342 | The implementation of this feature has been almost completely rewritten. | |
343 | Although its main intent is to fix bugs, some behaviors, especially | |
344 | related to the scope of lexical variables, will have changed. This is | |
345 | described more fully in the L</Selected Bug Fixes> section. | |
346 | ||
347 | =head2 Stricter parsing of substitution replacement | |
348 | ||
349 | It is no longer possible to abuse the way the parser parses C<s///e> like | |
350 | this: | |
351 | ||
352 | %_=(_,"Just another "); | |
353 | $_="Perl hacker,\n"; | |
354 | s//_}->{_/e;print | |
355 | ||
356 | =head2 C<given> now aliases the global C<$_> | |
357 | ||
358 | Instead of assigning to an implicit lexical C<$_>, C<given> now makes the | |
359 | global C<$_> an alias for its argument, just like C<foreach>. However, it | |
360 | still uses lexical C<$_> if there is lexical C<$_> in scope (again, just like | |
361 | C<foreach>) [perl #114020]. | |
362 | ||
363 | =head2 Lexical C<$_> is now experimental | |
364 | ||
365 | Since it was introduced in Perl 5.10, it has caused much confusion with no | |
366 | obvious solution: | |
367 | ||
368 | =over | |
829397b9 TC |
369 | |
370 | =item * | |
371 | ||
5ed58cbd RS |
372 | Various modules (e.g., List::Util) expect callback routines to use the |
373 | global C<$_>. C<use List::Util 'first'; my $_; first { $_ == 1 } @list> | |
374 | does not work as one would expect. | |
71e6aba6 RS |
375 | |
376 | =item * | |
377 | ||
5ed58cbd RS |
378 | A C<my $_> declaration earlier in the same file can cause confusing closure |
379 | warnings. | |
71e6aba6 RS |
380 | |
381 | =item * | |
382 | ||
5ed58cbd RS |
383 | The "_" subroutine prototype character allows called subroutines to access |
384 | your lexical C<$_>, so it is not really private after all. | |
2426c394 | 385 | |
71e6aba6 RS |
386 | =item * |
387 | ||
5ed58cbd RS |
388 | Nevertheless, subroutines with a "(@)" prototype and methods cannot access |
389 | the caller's lexical C<$_>, unless they are written in XS. | |
71e6aba6 RS |
390 | |
391 | =item * | |
392 | ||
5ed58cbd RS |
393 | But even XS routines cannot access a lexical C<$_> declared, not in the |
394 | calling subroutine, but in an outer scope, iff that subroutine happened not | |
395 | to mention C<$_> or use any operators that default to C<$_>. | |
396 | ||
397 | =back | |
398 | ||
399 | It is our hope that lexical C<$_> can be rehabilitated, but this may | |
400 | cause changes in its behavior. Please use it with caution until it | |
401 | becomes stable. | |
402 | ||
403 | =head2 readline() with C<$/ = \N> now reads N characters, not N bytes | |
404 | ||
405 | Previously, when reading from a stream with I/O layers such as | |
406 | C<encoding>, the readline() function, otherwise known as the C<< <> >> | |
407 | operator, would read I<N> bytes from the top-most layer. [perl #79960] | |
408 | ||
409 | Now, I<N> characters are read instead. | |
410 | ||
411 | There is no change in behaviour when reading from streams with no | |
412 | extra layers, since bytes map exactly to characters. | |
413 | ||
414 | =head2 Overridden C<glob> is now passed one argument | |
415 | ||
416 | C<glob> overrides used to be passed a magical undocumented second argument | |
417 | that identified the caller. Nothing on CPAN was using this, and it got in | |
418 | the way of a bug fix, so it was removed. If you really need to identify | |
419 | the caller, see L<Devel::Callsite> on CPAN. | |
420 | ||
421 | =head2 Here-doc parsing | |
422 | ||
423 | The body of a here-document inside a quote-like operator now always begins | |
424 | on the line after the "<<foo" marker. Previously, it was documented to | |
425 | begin on the line following the containing quote-like operator, but that | |
426 | was only sometimes the case [perl #114040]. | |
427 | ||
428 | =head2 Alphanumeric operators must now be separated from the closing | |
429 | delimiter of regular expressions | |
430 | ||
431 | You may no longer write something like: | |
432 | ||
433 | m/a/and 1 | |
434 | ||
435 | Instead you must write | |
436 | ||
437 | m/a/ and 1 | |
438 | ||
439 | with whitespace separating the operator from the closing delimiter of | |
440 | the regular expression. Not having whitespace has resulted in a | |
441 | deprecation warning since Perl v5.14.0. | |
442 | ||
443 | =head2 qw(...) can no longer be used as parentheses | |
444 | ||
445 | C<qw> lists used to fool the parser into thinking they were always | |
446 | surrounded by parentheses. This permitted some surprising constructions | |
447 | such as C<foreach $x qw(a b c) {...}>, which should really be written | |
448 | C<foreach $x (qw(a b c)) {...}>. These would sometimes get the lexer into | |
449 | the wrong state, so they didn't fully work, and the similar C<foreach qw(a | |
450 | b c) {...}> that one might expect to be permitted never worked at all. | |
451 | ||
452 | This side effect of C<qw> has now been abolished. It has been deprecated | |
453 | since Perl 5.13.11. It is now necessary to use real parentheses | |
454 | everywhere that the grammar calls for them. | |
455 | ||
456 | =head2 Interaction of lexical and default warnings | |
457 | ||
458 | Turning on any lexical warnings used first to disable all default warnings | |
459 | if lexical warnings were not already enabled: | |
460 | ||
461 | $*; # deprecation warning | |
462 | use warnings "void"; | |
463 | $#; # void warning; no deprecation warning | |
464 | ||
f105b7be | 465 | Now, the C<debugging>, C<deprecated>, C<glob>, C<inplace> and C<malloc> warnings |
5ed58cbd RS |
466 | categories are left on when turning on lexical warnings (unless they are |
467 | turned off by C<no warnings>, of course). | |
468 | ||
469 | This may cause deprecation warnings to occur in code that used to be free | |
470 | of warnings. | |
471 | ||
472 | Those are the only categories consisting only of default warnings. Default | |
f105b7be KE |
473 | warnings in other categories are still disabled by C<< use warnings "category" >>, |
474 | as we do not yet have the infrastructure for controlling | |
5ed58cbd RS |
475 | individual warnings. |
476 | ||
477 | =head2 C<state sub> and C<our sub> | |
478 | ||
479 | Due to an accident of history, C<state sub> and C<our sub> were equivalent | |
480 | to a plain C<sub>, so one could even create an anonymous sub with | |
481 | C<our sub { ... }>. These are now disallowed outside of the "lexical_subs" | |
482 | feature. Under the "lexical_subs" feature they have new meanings described | |
483 | in L<perlsub/Lexical Subroutines>. | |
484 | ||
485 | =head2 Defined values stored in environment are forced to byte strings | |
486 | ||
487 | A value stored in an environment variable has always been stringified. In this | |
488 | release, it is converted to be only a byte string. First, it is forced to be a | |
489 | only a string. Then if the string is utf8 and the equivalent of | |
490 | C<utf8::downgrade()> works, that result is used; otherwise, the equivalent of | |
491 | C<utf8::encode()> is used, and a warning is issued about wide characters | |
492 | (L</Diagnostics>). | |
493 | ||
494 | =head2 C<require> dies for unreadable files | |
495 | ||
496 | When C<require> encounters an unreadable file, it now dies. It used to | |
497 | ignore the file and continue searching the directories in C<@INC> | |
498 | [perl #113422]. | |
499 | ||
500 | =head2 C<gv_fetchmeth_*> and SUPER | |
501 | ||
502 | The various C<gv_fetchmeth_*> XS functions used to treat a package whose | |
f105b7be KE |
503 | named ended with C<::SUPER> specially. A method lookup on the C<Foo::SUPER> |
504 | package would be treated as a C<SUPER> method lookup on the C<Foo> package. This | |
505 | is no longer the case. To do a C<SUPER> lookup, pass the C<Foo> stash and the | |
506 | C<GV_SUPER> flag. | |
5ed58cbd RS |
507 | |
508 | =head2 C<split>'s first argument is more consistently interpreted | |
509 | ||
510 | After some changes earlier in 5.17, C<split>'s behavior has been | |
511 | simplified: if the PATTERN argument evaluates to a literal string | |
512 | containing one space, it is treated the way that a I<literal> string | |
513 | containing one space once was. | |
514 | ||
515 | =head1 Deprecations | |
516 | ||
517 | =head2 Deprecated modules | |
518 | ||
519 | The following modules will be removed from the core distribution in a | |
520 | future release, and should be installed from CPAN instead. Distributions | |
521 | on CPAN which require these should add them to their prerequisites. | |
f105b7be | 522 | The core versions of these modules will issue C<"deprecated">-category |
5ed58cbd RS |
523 | warnings. |
524 | ||
525 | You can silence these deprecation warnings by installing the modules | |
526 | in question from CPAN. | |
527 | ||
528 | =over | |
529 | ||
530 | =item L<Archive::Extract> | |
531 | ||
532 | =item L<B::Lint> | |
533 | ||
534 | =item L<B::Lint::Debug> | |
535 | ||
536 | =item L<CPANPLUS> and all included C<CPANPLUS::*> modules | |
537 | ||
538 | =item L<Devel::InnerPackage> | |
539 | ||
540 | =item L<encoding> | |
541 | ||
542 | =item L<Log::Message> | |
543 | ||
544 | =item L<Log::Message::Config> | |
545 | ||
546 | =item L<Log::Message::Handlers> | |
547 | ||
548 | =item L<Log::Message::Item> | |
549 | ||
550 | =item L<Log::Message::Simple> | |
551 | ||
552 | =item L<Module::Pluggable> | |
553 | ||
554 | =item L<Module::Pluggable::Object> | |
555 | ||
556 | =item L<Object::Accessor> | |
557 | ||
558 | =item L<Pod::LaTeX> | |
559 | ||
560 | =item L<Term::UI> | |
561 | ||
562 | =item L<Term::UI::History> | |
563 | ||
564 | =back | |
565 | ||
566 | =head2 Deprecated Utilities | |
567 | ||
568 | The following utilities will be removed from the core distribution in a | |
569 | future release as their associated modules have been deprecated. They | |
570 | will remain available with the applicable CPAN distribution. | |
571 | ||
572 | =over | |
573 | ||
574 | =item L<cpanp> | |
575 | ||
576 | =item C<cpanp-run-perl> | |
577 | ||
578 | =item L<cpan2dist> | |
579 | ||
580 | These items are part of the C<CPANPLUS> distribution. | |
581 | ||
582 | =item L<pod2latex> | |
583 | ||
584 | This item is part of the C<Pod::LaTeX> distribution. | |
585 | ||
586 | =back | |
587 | ||
588 | =head2 PL_sv_objcount | |
589 | ||
590 | This interpreter-global variable used to track the total number of | |
591 | Perl objects in the interpreter. It is no longer maintained and will | |
592 | be removed altogether in Perl 5.20. | |
593 | ||
594 | =head2 Five additional characters should be escaped in patterns with C</x> | |
595 | ||
596 | When a regular expression pattern is compiled with C</x>, Perl treats 6 | |
597 | characters as white space to ignore, such as SPACE and TAB. However, | |
598 | Unicode recommends 11 characters be treated thusly. We will conform | |
599 | with this in a future Perl version. In the meantime, use of any of the | |
600 | missing characters will raise a deprecation warning, unless turned off. | |
601 | The five characters are: | |
f105b7be KE |
602 | |
603 | U+0085 NEXT LINE, | |
604 | U+200E LEFT-TO-RIGHT MARK, | |
605 | U+200F RIGHT-TO-LEFT MARK, | |
606 | U+2028 LINE SEPARATOR, | |
607 | ||
5ed58cbd | 608 | and |
f105b7be KE |
609 | |
610 | U+2029 PARAGRAPH SEPARATOR. | |
5ed58cbd RS |
611 | |
612 | =head2 User-defined charnames with surprising whitespace | |
613 | ||
614 | A user-defined character name with trailing or multiple spaces in a row is | |
615 | likely a typo. This now generates a warning when defined, on the assumption | |
616 | that uses of it will be unlikely to include the excess whitespace. | |
617 | ||
618 | =head2 Various XS-callable functions are now deprecated | |
619 | ||
620 | All the functions used to classify characters will be removed from a | |
621 | future version of Perl, and should not be used. With participating C | |
622 | compilers (e.g., gcc), compiling any file that uses any of these will | |
623 | generate a warning. These were not intended for public use; there are | |
624 | equivalent, faster, macros for most of them. | |
2e7bc647 | 625 | See L<perlapi/Character classes>. The complete list is: |
5ed58cbd RS |
626 | C<is_uni_alnum>, C<is_uni_alnumc>, C<is_uni_alnumc_lc>, |
627 | C<is_uni_alnum_lc>, C<is_uni_alpha>, C<is_uni_alpha_lc>, | |
628 | C<is_uni_ascii>, C<is_uni_ascii_lc>, C<is_uni_blank>, | |
629 | C<is_uni_blank_lc>, C<is_uni_cntrl>, C<is_uni_cntrl_lc>, | |
630 | C<is_uni_digit>, C<is_uni_digit_lc>, C<is_uni_graph>, | |
631 | C<is_uni_graph_lc>, C<is_uni_idfirst>, C<is_uni_idfirst_lc>, | |
632 | C<is_uni_lower>, C<is_uni_lower_lc>, C<is_uni_print>, | |
633 | C<is_uni_print_lc>, C<is_uni_punct>, C<is_uni_punct_lc>, | |
634 | C<is_uni_space>, C<is_uni_space_lc>, C<is_uni_upper>, | |
635 | C<is_uni_upper_lc>, C<is_uni_xdigit>, C<is_uni_xdigit_lc>, | |
636 | C<is_utf8_alnum>, C<is_utf8_alnumc>, C<is_utf8_alpha>, | |
637 | C<is_utf8_ascii>, C<is_utf8_blank>, C<is_utf8_char>, | |
638 | C<is_utf8_cntrl>, C<is_utf8_digit>, C<is_utf8_graph>, | |
639 | C<is_utf8_idcont>, C<is_utf8_idfirst>, C<is_utf8_lower>, | |
640 | C<is_utf8_mark>, C<is_utf8_perl_space>, C<is_utf8_perl_word>, | |
641 | C<is_utf8_posix_digit>, C<is_utf8_print>, C<is_utf8_punct>, | |
642 | C<is_utf8_space>, C<is_utf8_upper>, C<is_utf8_xdigit>, | |
643 | C<is_utf8_xidcont>, C<is_utf8_xidfirst>. | |
644 | ||
645 | In addition these three functions that have never worked properly are | |
646 | deprecated: | |
647 | C<to_uni_lower_lc>, C<to_uni_title_lc>, and C<to_uni_upper_lc>. | |
648 | ||
f105b7be | 649 | =head2 Certain rare uses of backslashes within regexes are now deprecated |
5ed58cbd RS |
650 | |
651 | There are three pairs of characters that Perl recognizes as | |
652 | metacharacters in regular expression patterns: C<{}>, C<[]>, and C<()>. | |
653 | These can be used as well to delimit patterns, as in: | |
654 | ||
655 | m{foo} | |
656 | s(foo)(bar) | |
657 | ||
658 | Since they are metacharacters, they have special meaning to regular | |
659 | expression patterns, and it turns out that you can't turn off that | |
660 | special meaning by the normal means of preceding them with a backslash, | |
f105b7be | 661 | if you use them, paired, within a pattern delimited by them. For |
5ed58cbd RS |
662 | example, in |
663 | ||
664 | m{foo\{1,3\}} | |
665 | ||
666 | the backslashes do not change the behavior, and this matches | |
667 | S<C<"f o">> followed by one to three more occurrences of C<"o">. | |
668 | ||
669 | Usages like this, where they are interpreted as metacharacters, are | |
670 | exceedingly rare; we think there are none, for example, in all of CPAN. | |
671 | Hence, this deprecation should affect very little code. It does give | |
672 | notice, however, that any such code needs to change, which will in turn | |
673 | allow us to change the behavior in future Perl versions so that the | |
674 | backslashes do have an effect, and without fear that we are silently | |
675 | breaking any existing code. | |
676 | ||
d5f315e8 KW |
677 | =head2 Splitting the tokens C<(?> and C<(*> in regular expressions |
678 | ||
679 | A deprecation warning is now raised if the C<(> and C<?> are separated | |
680 | by white space or comments in C<(?...)> regular expression constructs. | |
681 | Similarly, if the C<(> and C<*> are separated in C<(*VERB...)> | |
682 | constructs. | |
683 | ||
e0a1dec5 LT |
684 | =head2 Pre-PerlIO IO implementations |
685 | ||
686 | Perl supports being built without PerlIO proper, using a stdio or sfio | |
687 | wrapper instead. A perl build like this will not support IO layers and | |
688 | thus Unicode IO, making it rather handicapped. | |
689 | ||
690 | PerlIO supports a C<stdio> layer if stdio use is desired, and similarly a | |
691 | sfio layer could be produced. | |
692 | ||
5ed58cbd RS |
693 | =head1 Future Deprecations |
694 | ||
695 | =over | |
71e6aba6 RS |
696 | |
697 | =item * | |
698 | ||
4263dd11 | 699 | Platforms without support infrastructure |
5ed58cbd RS |
700 | |
701 | Both Windows CE and z/OS have been historically under-maintained, and are | |
702 | currently neither successfully building nor regularly being smoke tested. | |
703 | Efforts are underway to change this situation, but it should not be taken for | |
704 | granted that the platforms are safe and supported. If they do not become | |
705 | buildable and regularly smoked, support for them may be actively removed in | |
706 | future releases. If you have an interest in these platforms and you can lend | |
707 | your time, expertise, or hardware to help support these platforms, please let | |
708 | the perl development effort know by emailing C<perl5-porters@perl.org>. | |
709 | ||
710 | Some platforms that appear otherwise entirely dead are also on the short list | |
711 | for removal between now and 5.20.0: | |
712 | ||
713 | =over | |
714 | ||
715 | =item DG/UX | |
716 | ||
717 | =item NeXT | |
718 | ||
719 | =back | |
1993add8 RS |
720 | |
721 | =item * | |
722 | ||
5ed58cbd RS |
723 | Swapping of $< and $> |
724 | ||
725 | For more information about this future deprecation, see L<the relevant RT | |
726 | ticket|https://rt.perl.org/rt3/Ticket/Display.html?id=96212>. | |
71e6aba6 RS |
727 | |
728 | =item * | |
729 | ||
5ed58cbd | 730 | C<microperl>, long broken and of unclear present purpose, will be removed. |
71e6aba6 RS |
731 | |
732 | =item * | |
733 | ||
5ed58cbd RS |
734 | Revamping C<< "\Q" >> semantics in double-quotish strings when combined with |
735 | other escapes. | |
736 | ||
737 | There are several bugs and inconsistencies involving combinations | |
738 | of C<\Q> and escapes like C<\x>, C<\L>, etc., within a C<\Q...\E> pair. | |
739 | These need to be fixed, and doing so will necessarily change current | |
740 | behavior. The changes have not yet been settled. | |
71e6aba6 | 741 | |
d5f315e8 KW |
742 | =item * |
743 | ||
744 | Use of C<$^>, where C<^> stands for any actual (non-printing) C0 control | |
745 | character will be disallowed in a future Perl version. Use C<${^}> | |
746 | instead (where again C<^> stands for a control character), | |
747 | or better, C<$^A> , where C<^> this time is a caret (CIRCUMFLEX ACCENT), | |
748 | and C<A> stands for any of the characters listed at the end of | |
749 | L<perlebcdic/OPERATOR DIFFERENCES>. | |
750 | ||
337fb649 | 751 | =back |
2426c394 | 752 | |
5ed58cbd RS |
753 | =head1 Performance Enhancements |
754 | ||
755 | =over 4 | |
2426c394 | 756 | |
5ed58cbd | 757 | =item * |
2426c394 | 758 | |
5ed58cbd RS |
759 | Lists of lexical variable declarations (C<my($x, $y)>) are now optimised |
760 | down to a single op and are hence faster than before. | |
2426c394 | 761 | |
5ed58cbd | 762 | =item * |
2426c394 | 763 | |
5ed58cbd RS |
764 | A new C preprocessor define C<NO_TAINT_SUPPORT> was added that, if set, |
765 | disables Perl's taint support altogether. Using the -T or -t command | |
766 | line flags will cause a fatal error. Beware that both core tests as | |
767 | well as many a CPAN distribution's tests will fail with this change. On | |
768 | the upside, it provides a small performance benefit due to reduced | |
769 | branching. | |
2426c394 | 770 | |
5ed58cbd RS |
771 | B<Do not enable this unless you know exactly what you are getting yourself |
772 | into.> | |
773 | ||
774 | =item * | |
775 | ||
776 | C<pack> with constant arguments is now constant folded in most cases | |
777 | [perl #113470]. | |
778 | ||
779 | =item * | |
780 | ||
781 | Speed up in regular expression matching against Unicode properties. The | |
782 | largest gain is for C<\X>, the Unicode "extended grapheme cluster." The | |
783 | gain for it is about 35% - 40%. Bracketed character classes, e.g., | |
784 | C<[0-9\x{100}]> containing code points above 255 are also now faster. | |
785 | ||
786 | =item * | |
787 | ||
788 | On platforms supporting it, several former macros are now implemented as static | |
789 | inline functions. This should speed things up slightly on non-GCC platforms. | |
790 | ||
791 | =item * | |
792 | ||
66f62cf6 RS |
793 | The optimisation of hashes in boolean context has been extended to |
794 | affect C<scalar(%hash)>, C<%hash ? ... : ...>, and C<sub { %hash || ... }>. | |
5ed58cbd RS |
795 | |
796 | =item * | |
797 | ||
f105b7be | 798 | Filetest operators manage the stack in a fractionally more efficient manner. |
5ed58cbd RS |
799 | |
800 | =item * | |
801 | ||
802 | Globs used in a numeric context are now numified directly in most cases, | |
f105b7be | 803 | rather than being numified via stringification. |
5ed58cbd RS |
804 | |
805 | =item * | |
806 | ||
807 | The C<x> repetition operator is now folded to a single constant at compile | |
808 | time if called in scalar context with constant operands and no parentheses | |
809 | around the left operand. | |
810 | ||
811 | =back | |
812 | ||
813 | =head1 Modules and Pragmata | |
814 | ||
815 | =head2 New Modules and Pragmata | |
2426c394 | 816 | |
337fb649 | 817 | =over 4 |
982110e0 | 818 | |
82d98f72 | 819 | =item * |
2426c394 | 820 | |
5ed58cbd RS |
821 | L<Config::Perl::V> version 0.16 has been added as a dual-lifed module. |
822 | It provides structured data retrieval of C<perl -V> output including | |
823 | information only known to the C<perl> binary and not available via L<Config>. | |
824 | ||
825 | =back | |
826 | ||
827 | =head2 Updated Modules and Pragmata | |
828 | ||
829 | This is only an overview of selected module updates. For a complete | |
830 | list of updates, run: | |
831 | ||
f105b7be | 832 | $ corelist --diff 5.16.0 5.18.0 |
5ed58cbd | 833 | |
f105b7be | 834 | You can substitute your favorite version in place of 5.16.0, too. |
5ed58cbd RS |
835 | |
836 | =over 4 | |
33392251 BF |
837 | |
838 | =item * | |
839 | ||
5ed58cbd RS |
840 | L<XXX> has been upgraded from version A.xx to B.yy. |
841 | ||
842 | =back | |
843 | ||
844 | =head2 Removed Modules and Pragmata | |
845 | ||
846 | =over | |
33392251 BF |
847 | |
848 | =item * | |
849 | ||
5ed58cbd RS |
850 | L<Version::Requirements> has been removed from the core distribution. It is |
851 | available under a different name: L<CPAN::Meta::Requirements>. | |
2426c394 | 852 | |
337fb649 | 853 | =back |
2426c394 | 854 | |
5ed58cbd | 855 | =head1 Documentation |
19718730 | 856 | |
5ed58cbd RS |
857 | =head2 Changes to Existing Documentation |
858 | ||
859 | =head3 L<perlcheat> | |
82d98f72 | 860 | |
5a6a30f4 | 861 | =over 4 |
b7c7d786 | 862 | |
5ed58cbd RS |
863 | =item * |
864 | ||
865 | L<perlcheat> has been reorganized, and a few new sections were added. | |
866 | ||
867 | =back | |
868 | ||
869 | =head3 L<perldata> | |
870 | ||
871 | =over 4 | |
82d98f72 | 872 | |
5ed58cbd | 873 | =item * |
d2d1e842 | 874 | |
5ed58cbd RS |
875 | Now explicitly documents the behaviour of hash initializer lists that |
876 | contain duplicate keys. | |
f355e93d | 877 | |
5a6a30f4 | 878 | =back |
f355e93d | 879 | |
5ed58cbd | 880 | =head3 L<perldiag> |
19718730 | 881 | |
19718730 | 882 | =over 4 |
e14ac59b | 883 | |
5ed58cbd RS |
884 | =item * |
885 | ||
886 | The explanation of symbolic references being prevented by "strict refs" | |
887 | now doesn't assume that the reader knows what symbolic references are. | |
888 | ||
889 | =back | |
9f351b45 | 890 | |
5ed58cbd | 891 | =head3 L<perlfaq> |
9f351b45 | 892 | |
5ed58cbd | 893 | =over 4 |
9f351b45 | 894 | |
5ed58cbd | 895 | =item * |
7cf3104f | 896 | |
5ed58cbd | 897 | L<perlfaq> has been synchronized with version 5.0150040 from CPAN. |
12719193 | 898 | |
6253ee75 | 899 | =back |
216cf7fc | 900 | |
5ed58cbd | 901 | =head3 L<perlfunc> |
f5b73711 | 902 | |
5ed58cbd RS |
903 | =over 4 |
904 | ||
905 | =item * | |
a75569c0 | 906 | |
5ed58cbd | 907 | The return value of C<pipe> is now documented. |
a75569c0 | 908 | |
5ed58cbd | 909 | =item * |
a75569c0 | 910 | |
5ed58cbd RS |
911 | Clarified documentation of C<our>. |
912 | ||
913 | =back | |
914 | ||
915 | =head3 L<perlop> | |
916 | ||
917 | =over 4 | |
918 | ||
919 | =item * | |
920 | ||
921 | Loop control verbs (C<dump>, C<goto>, C<next>, C<last> and C<redo>) have always | |
922 | had the same precedence as assignment operators, but this was not documented | |
923 | until now. | |
924 | ||
925 | =back | |
926 | ||
927 | =head3 Diagnostics | |
928 | ||
929 | The following additions or changes have been made to diagnostic output, | |
930 | including warnings and fatal error messages. For the complete list of | |
931 | diagnostic messages, see L<perldiag>. | |
932 | ||
933 | XXX New or changed warnings emitted by the core's C<C> code go here. Also | |
934 | include any changes in L<perldiag> that reconcile it to the C<C> code. | |
935 | ||
936 | =head2 New Diagnostics | |
937 | ||
938 | XXX Newly added diagnostic messages go under here, separated into New Errors | |
939 | and New Warnings | |
940 | ||
941 | =head3 New Errors | |
942 | ||
943 | =over 4 | |
944 | ||
945 | =item * | |
946 | ||
947 | L<Unterminated delimiter for here document|perldiag/"Unterminated delimiter for here document"> | |
948 | ||
949 | This message now occurs when a here document label has an initial quotation | |
950 | mark but the final quotation mark is missing. | |
951 | ||
952 | This replaces a bogus and misleading error message about not finding the label | |
953 | itself [perl #114104]. | |
954 | ||
955 | =item * | |
956 | ||
957 | L<panic: child pseudo-process was never scheduled|perldiag/"panic: child pseudo-process was never scheduled"> | |
958 | ||
959 | This error is thrown when a child pseudo-process in the ithreads implementation | |
960 | on Windows was not scheduled within the time period allowed and therefore was | |
961 | not able to initialize properly [perl #88840]. | |
962 | ||
963 | =item * | |
964 | ||
965 | L<Group name must start with a non-digit word character in regex; marked by <-- HERE in mE<sol>%sE<sol>|perldiag/"Group name must start with a non-digit word character in regex; marked by <-- HERE in m/%s/"> | |
966 | ||
967 | This error has been added for C<(?&0)>, which is invalid. It used to | |
968 | produce an incomprehensible error message [perl #101666]. | |
969 | ||
970 | =item * | |
971 | ||
972 | L<Can't use an undefined value as a subroutine reference|perldiag/"Can't use an undefined value as %s reference"> | |
973 | ||
974 | Calling an undefined value as a subroutine now produces this error message. | |
975 | It used to, but was accidentally disabled, first in Perl 5.004 for | |
976 | non-magical variables, and then in Perl 5.14 for magical (e.g., tied) | |
977 | variables. It has now been restored. In the mean time, undef was treated | |
978 | as an empty string [perl #113576]. | |
979 | ||
980 | =item * | |
981 | ||
982 | L<Experimental "%s" subs not enabled|perldiag/"Experimental "%s" subs not enabled"> | |
983 | ||
984 | To use lexical subs, you must first enable them: | |
985 | ||
986 | no warnings 'experimental::lexical_subs'; | |
987 | use feature 'lexical_subs'; | |
988 | my sub foo { ... } | |
989 | ||
990 | =back | |
991 | ||
992 | =head3 New Warnings | |
993 | ||
994 | =over 4 | |
995 | ||
996 | =item * | |
997 | ||
998 | XXX: This needs more detail. | |
999 | ||
1000 | Strings with code points over 0xFF may not be mapped into in-memory file | |
1001 | handles | |
1002 | ||
1003 | =item * | |
1004 | ||
1005 | L<'%s' resolved to '\o{%s}%d'|perldiag/"'%s' resolved to '\o{%s}%d'"> | |
1006 | ||
1007 | =item * | |
1008 | ||
1009 | L<'Trailing white-space in a charnames alias definition is deprecated'|perldiag/"Trailing white-space in a charnames alias definition is deprecated"> | |
1010 | ||
1011 | =item * | |
1012 | ||
1013 | L<'A sequence of multiple spaces in a charnames alias definition is deprecated'|perldiag/"A sequence of multiple spaces in a charnames alias definition is deprecated"> | |
1014 | ||
1015 | =item * | |
1016 | ||
1017 | L<'Passing malformed UTF-8 to "%s" is deprecated'|perldiag/"Passing malformed UTF-8 to "%s" is deprecated"> | |
1018 | ||
1019 | =item * | |
1020 | ||
1021 | L<Subroutine "&%s" is not available|perldiag/"Subroutine "&%s" is not available"> | |
1022 | ||
1023 | (W closure) During compilation, an inner named subroutine or eval is | |
1024 | attempting to capture an outer lexical subroutine that is not currently | |
1025 | available. This can happen for one of two reasons. First, the lexical | |
1026 | subroutine may be declared in an outer anonymous subroutine that has not | |
1027 | yet been created. (Remember that named subs are created at compile time, | |
1028 | while anonymous subs are created at run-time.) For example, | |
1029 | ||
1030 | sub { my sub a {...} sub f { \&a } } | |
1031 | ||
1032 | At the time that f is created, it can't capture the current the "a" sub, | |
1033 | since the anonymous subroutine hasn't been created yet. Conversely, the | |
1034 | following won't give a warning since the anonymous subroutine has by now | |
1035 | been created and is live: | |
1036 | ||
1037 | sub { my sub a {...} eval 'sub f { \&a }' }->(); | |
1038 | ||
1039 | The second situation is caused by an eval accessing a variable that has | |
1040 | gone out of scope, for example, | |
1041 | ||
1042 | sub f { | |
1043 | my sub a {...} | |
1044 | sub { eval '\&a' } | |
1045 | } | |
1046 | f()->(); | |
1047 | ||
1048 | Here, when the '\&a' in the eval is being compiled, f() is not currently | |
1049 | being executed, so its &a is not available for capture. | |
1050 | ||
1051 | =item * | |
1052 | ||
1053 | L<"%s" subroutine &%s masks earlier declaration in same %s|perldiag/"%s" subroutine &%s masks earlier declaration in same %s> | |
1054 | ||
1055 | (W misc) A "my" or "state" subroutine has been redeclared in the | |
1056 | current scope or statement, effectively eliminating all access to | |
1057 | the previous instance. This is almost always a typographical error. | |
1058 | Note that the earlier subroutine will still exist until the end of | |
1059 | the scope or until all closure references to it are destroyed. | |
1060 | ||
1061 | =item * | |
1062 | ||
1063 | L<The %s feature is experimental|perldiag/"The %s feature is experimental"> | |
1064 | ||
1065 | (S experimental) This warning is emitted if you enable an experimental | |
1066 | feature via C<use feature>. Simply suppress the warning if you want | |
1067 | to use the feature, but know that in doing so you are taking the risk | |
1068 | of using an experimental feature which may change or be removed in a | |
1069 | future Perl version: | |
1070 | ||
1071 | no warnings "experimental::lexical_subs"; | |
1072 | use feature "lexical_subs"; | |
1073 | ||
1074 | =item * | |
1075 | ||
1076 | L<sleep(%u) too large|perldiag/"sleep(%u) too large"> | |
1077 | ||
1078 | (W overflow) You called C<sleep> with a number that was larger than it can | |
1079 | reliably handle and C<sleep> probably slept for less time than requested. | |
1080 | ||
1081 | =item * | |
1082 | ||
1083 | L<Wide character in setenv|perldiag/"Wide character in %s"> | |
1084 | ||
1085 | Attempts to put wide characters into environment variables via C<%ENV> now | |
1086 | provoke this warning. | |
1087 | ||
1088 | =item * | |
1089 | ||
1090 | "L<Invalid negative number (%s) in chr|perldiag/"Invalid negative number (%s) in chr">" | |
1091 | ||
1092 | C<chr()> now warns when passed a negative value [perl #83048]. | |
1093 | ||
1094 | =item * | |
1095 | ||
1096 | "L<Integer overflow in srand|perldiag/"Integer overflow in srand">" | |
1097 | ||
1098 | C<srand()> now warns when passed a value that doesn't fit in a C<UV> (since the | |
1099 | value will be truncated rather than overflowing) [perl #40605]. | |
1100 | ||
1101 | =item * | |
1102 | ||
1103 | "L<-i used with no filenames on the command line, reading from STDIN|perldiag/"-i used with no filenames on the command line, reading from STDIN">" | |
1104 | ||
1105 | Running perl with the C<-i> flag now warns if no input files are provided on | |
1106 | the command line [perl #113410]. | |
1107 | ||
1108 | =back | |
1109 | ||
1110 | =head2 Changes to Existing Diagnostics | |
1111 | ||
1112 | =over 4 | |
1113 | ||
1114 | =item * | |
1115 | ||
1116 | L<$* is no longer supported|perldiag/"$* is no longer supported"> | |
1117 | ||
1118 | The warning that use of C<$*> and C<$#> is no longer supported is now | |
1119 | generated for every location that references them. Previously it would fail | |
1120 | to be generated if another variable using the same typeglob was seen first | |
1121 | (e.g. C<@*> before C<$*>), and would not be generated for the second and | |
1122 | subsequent uses. (It's hard to fix the failure to generate warnings at all | |
1123 | without also generating them every time, and warning every time is | |
1124 | consistent with the warnings that C<$[> used to generate.) | |
1125 | ||
1126 | =item * | |
1127 | ||
1128 | The warnings for C<\b{> and C<\B{> were added. They are a deprecation | |
1129 | warning which should be turned off by that category. One should not | |
1130 | have to turn off regular regexp warnings as well to get rid of these. | |
1131 | ||
1132 | =item * | |
1133 | ||
1134 | L<Constant(%s): Call to &{$^H{%s}} did not return a defined value|perldiag/Constant(%s): Call to &{$^H{%s}} did not return a defined value> | |
1135 | ||
1136 | Constant overloading that returns C<undef> results in this error message. | |
1137 | For numeric constants, it used to say "Constant(undef)". "undef" has been | |
1138 | replaced with the number itself. | |
1139 | ||
1140 | =item * | |
1141 | ||
1142 | The error produced when a module cannot be loaded now includes a hint that | |
1143 | the module may need to be installed: "Can't locate hopping.pm in @INC (you | |
1144 | may need to install the hopping module) (@INC contains: ...)" | |
1145 | ||
1146 | =item * | |
1147 | ||
1148 | L<vector argument not supported with alpha versions|perldiag/vector argument not supported with alpha versions> | |
1149 | ||
1150 | This warning was not suppressable, even with C<no warnings>. Now it is | |
1151 | suppressible, and has been moved from the "internal" category to the | |
1152 | "printf" category. | |
1153 | ||
1154 | =item * | |
1155 | ||
1156 | C<< Can't do {n,m} with n > m in regex; marked by <-- HERE in m/%s/ >> | |
1157 | ||
1158 | This fatal error has been turned into a warning that reads: | |
1159 | ||
1160 | L<< Quantifier {n,m} with n > m can't match in regex | perldiag/Quantifier {n,m} with n > m can't match in regex >> | |
1161 | ||
1162 | (W regexp) Minima should be less than or equal to maxima. If you really want | |
1163 | your regexp to match something 0 times, just put {0}. | |
1164 | ||
1165 | =item * | |
1166 | ||
1167 | The "Runaway prototype" warning that occurs in bizarre cases has been | |
1168 | removed as being unhelpful and inconsistent. | |
1169 | ||
1170 | =item * | |
1171 | ||
1172 | The "Not a format reference" error has been removed, as the only case in | |
1173 | which it could be triggered was a bug. | |
1174 | ||
1175 | =item * | |
1176 | ||
1177 | The "Unable to create sub named %s" error has been removed for the same | |
1178 | reason. | |
1179 | ||
1180 | =item * | |
1181 | ||
1182 | The 'Can't use "my %s" in sort comparison' error has been downgraded to a | |
1183 | warning, '"my %s" used in sort comparison' (with 'state' instead of 'my' | |
1184 | for state variables). In addition, the heuristics for guessing whether | |
1185 | lexical $a or $b has been misused have been improved to generate fewer | |
1186 | false positives. Lexical $a and $b are no longer disallowed if they are | |
1187 | outside the sort block. Also, a named unary or list operator inside the | |
1188 | sort block no longer causes the $a or $b to be ignored [perl #86136]. | |
1189 | ||
1190 | =back | |
1191 | ||
1192 | =head1 Utility Changes | |
1193 | ||
1194 | =head3 L<h2xs> | |
1195 | ||
1196 | =over 4 | |
1197 | ||
1198 | =item * | |
1199 | ||
1200 | F<h2xs> no longer produces invalid code for empty defines. [perl #20636] | |
1201 | ||
1202 | =back | |
1203 | ||
1204 | =head1 Configuration and Compilation | |
1205 | ||
1206 | =over 4 | |
1207 | ||
1208 | =item * | |
1209 | ||
1210 | Added C<useversionedarchname> option to Configure | |
1211 | ||
1212 | When set, it includes 'api_versionstring' in 'archname'. E.g. | |
1213 | x86_64-linux-5.13.6-thread-multi. It is unset by default. | |
1214 | ||
1215 | This feature was requested by Tim Bunce, who observed that | |
f105b7be | 1216 | C<INSTALL_BASE> creates a library structure that does not |
5ed58cbd RS |
1217 | differentiate by perl version. Instead, it places architecture |
1218 | specific files in "$install_base/lib/perl5/$archname". This makes | |
f105b7be | 1219 | it difficult to use a common C<INSTALL_BASE> library path with |
5ed58cbd RS |
1220 | multiple versions of perl. |
1221 | ||
f105b7be | 1222 | By setting C<-Duseversionedarchname>, the $archname will be |
c2959982 | 1223 | distinct for architecture I<and> API version, allowing mixed use of |
f105b7be | 1224 | C<INSTALL_BASE>. |
5ed58cbd RS |
1225 | |
1226 | =item * | |
1227 | ||
ff772877 RS |
1228 | Add a C<PERL_NO_INLINE_FUNCTIONS> option |
1229 | ||
f105b7be | 1230 | If C<PERL_NO_INLINE_FUNCTIONS> is defined, don't include "inline.h" |
ff772877 RS |
1231 | |
1232 | This permits test code to include the perl headers for definitions without | |
1233 | creating a link dependency on the perl library (which may not exist yet). | |
1234 | ||
1235 | =item * | |
1236 | ||
5ed58cbd RS |
1237 | Configure will honour the external C<MAILDOMAIN> environment variable, if set. |
1238 | ||
1239 | =item * | |
1240 | ||
1241 | C<installman> no longer ignores the silent option | |
1242 | ||
1243 | =item * | |
1244 | ||
1245 | Both C<META.yml> and C<META.json> files are now included in the distribution. | |
1246 | ||
1247 | =item * | |
1248 | ||
1249 | F<Configure> will now correctly detect C<isblank()> when compiling with a C++ | |
1250 | compiler. | |
1251 | ||
1252 | =item * | |
1253 | ||
1254 | The pager detection in F<Configure> has been improved to allow responses which | |
1255 | specify options after the program name, e.g. B</usr/bin/less -R>, if the user | |
1256 | accepts the default value. This helps B<perldoc> when handling ANSI escapes | |
1257 | [perl #72156]. | |
1258 | ||
1259 | =back | |
1260 | ||
1261 | =head1 Testing | |
1262 | ||
1263 | =over 4 | |
1264 | ||
1265 | =item * | |
1266 | ||
1267 | The test suite now has a section for tests that require very large amounts | |
1268 | of memory. These tests won't run by default; they can be enabled by | |
1269 | setting the C<PERL_TEST_MEMORY> environment variable to the number of | |
1270 | gibibytes of memory that may be safely used. | |
1271 | ||
1272 | =back | |
1273 | ||
1274 | =head1 Platform Support | |
1275 | ||
1276 | =head2 Discontinued Platforms | |
1277 | ||
1278 | =over 4 | |
1279 | ||
1280 | =item BeOS | |
1281 | ||
1282 | BeOS was an operating system for personal computers developed by Be Inc, | |
1283 | initially for their BeBox hardware. The OS Haiku was written as an open | |
1284 | source replacement for/continuation of BeOS, and its perl port is current and | |
1285 | actively maintained. | |
1286 | ||
1287 | =item UTS Global | |
1288 | ||
1289 | Support code relating to UTS global has been removed. UTS was a mainframe | |
1290 | version of System V created by Amdahl, subsequently sold to UTS Global. The | |
1291 | port has not been touched since before Perl 5.8.0, and UTS Global is now | |
1292 | defunct. | |
1293 | ||
1294 | =item VM/ESA | |
1295 | ||
1296 | Support for VM/ESA has been removed. The port was tested on 2.3.0, which | |
1297 | IBM ended service on in March 2002. 2.4.0 ended service in June 2003, and | |
1298 | was superseded by Z/VM. The current version of Z/VM is V6.2.0, and scheduled | |
1299 | for end of service on 2015/04/30. | |
1300 | ||
1301 | =item MPE/IX | |
1302 | ||
1303 | Support for MPE/IX has been removed. | |
1304 | ||
1305 | =item EPOC | |
1306 | ||
1307 | Support code relating to EPOC has been removed. EPOC was a family of | |
1308 | operating systems developed by Psion for mobile devices. It was the | |
1309 | predecessor of Symbian. The port was last updated in April 2002. | |
1310 | ||
1311 | =item Rhapsody | |
1312 | ||
1313 | Support for Rhapsody has been removed. | |
1314 | ||
1315 | =back | |
1316 | ||
1317 | =head2 Platform-Specific Notes | |
1318 | ||
1319 | =head3 AIX | |
1320 | ||
1321 | Configure now always adds C<-qlanglvl=extc99> to the CC flags on AIX when | |
1322 | using xlC. This will make it easier to compile a number of XS-based modules | |
1323 | that assume C99 [perl #113778]. | |
1324 | ||
1325 | =head3 clang++ | |
1326 | ||
1327 | There is now a workaround for a compiler bug that prevented compiling | |
1328 | with clang++ since Perl 5.15.7 [perl #112786]. | |
1329 | ||
1330 | =head3 C++ | |
1331 | ||
1332 | When compiling the Perl core as C++ (which is only semi-supported), the | |
1333 | mathom functions are now compiled as C<extern "C">, to ensure proper | |
1334 | binary compatibility. (However, binary compatibility isn't generally | |
1335 | guaranteed anyway in the situations where this would matter.) | |
1336 | ||
1337 | =head3 Darwin | |
1338 | ||
1339 | Stop hardcoding an alignment on 8 byte boundaries to fix builds using | |
1340 | -Dusemorebits. | |
1341 | ||
1342 | =head3 Haiku | |
1343 | ||
1344 | Perl should now work out of the box on Haiku R1 Alpha 4. | |
1345 | ||
1346 | =head3 MidnightBSD | |
1347 | ||
1348 | C<libc_r> was removed from recent versions of MidnightBSD and older versions | |
1349 | work better with C<pthread>. Threading is now enabled using C<pthread> which | |
1350 | corrects build errors with threading enabled on 0.4-CURRENT. | |
1351 | ||
1352 | =head3 Solaris | |
1353 | ||
1354 | In Configure, avoid running sed commands with flags not supported on Solaris. | |
1355 | ||
1356 | =head3 VMS | |
1357 | ||
1358 | =over | |
1359 | ||
1360 | =item * | |
1361 | ||
1362 | Where possible, the case of filenames and command-line arguments is now | |
1363 | preserved by enabling the CRTL features C<DECC$EFS_CASE_PRESERVE> and | |
1364 | C<DECC$ARGV_PARSE_STYLE> at start-up time. The latter only takes effect | |
1365 | when extended parse is enabled in the process from which Perl is run. | |
1366 | ||
1367 | =item * | |
1368 | ||
1369 | The character set for Extended Filename Syntax (EFS) is now enabled by default | |
1370 | on VMS. Among other things, this provides better handling of dots in directory | |
05f5908f | 1371 | names, multiple dots in filenames, and spaces in filenames. To obtain the old |
5ed58cbd RS |
1372 | behavior, set the logical name C<DECC$EFS_CHARSET> to C<DISABLE>. |
1373 | ||
1374 | =item * | |
1375 | ||
05f5908f | 1376 | Fixed linking on builds configured with C<-Dusemymalloc=y>. |
5ed58cbd RS |
1377 | |
1378 | =item * | |
1379 | ||
05f5908f CB |
1380 | Experimental support for building Perl with the HP C++ compiler is available |
1381 | by configuring with C<-Dusecxx>. | |
5ed58cbd RS |
1382 | |
1383 | =item * | |
1384 | ||
1385 | All C header files from the top-level directory of the distribution are now | |
1386 | installed on VMS, providing consistency with a long-standing practice on other | |
1387 | platforms. Previously only a subset were installed, which broke non-core | |
1388 | extension builds for extensions that depended on the missing include files. | |
1389 | ||
1390 | =item * | |
1391 | ||
1392 | Quotes are now removed from the command verb (but not the parameters) for | |
1393 | commands spawned via C<system>, backticks, or a piped C<open>. Previously, | |
1394 | quotes on the verb were passed through to DCL, which would fail to recognize | |
1395 | the command. Also, if the verb is actually a path to an image or command | |
1396 | procedure on an ODS-5 volume, quoting it now allows the path to contain spaces. | |
1397 | ||
1398 | =item * | |
1399 | ||
1400 | The B<a2p> build has been fixed for the HP C++ compiler on OpenVMS. | |
1401 | ||
1402 | =back | |
1403 | ||
1404 | =head3 Win32 | |
1405 | ||
1406 | =over | |
1407 | ||
1408 | =item * | |
1409 | ||
1410 | Perl can now be built using Microsoft's Visual C++ 2012 compiler by specifying | |
1411 | CCTYPE=MSVC110 (or MSVC110FREE if you are using the free Express edition for | |
1412 | Windows Desktop) in F<win32/Makefile>. | |
1413 | ||
1414 | =item * | |
1415 | ||
f105b7be | 1416 | The option to build without C<USE_SOCKETS_AS_HANDLES> has been removed. |
5ed58cbd RS |
1417 | |
1418 | =item * | |
1419 | ||
1420 | Fixed a problem where perl could crash while cleaning up threads (including the | |
1421 | main thread) in threaded debugging builds on Win32 and possibly other platforms | |
1422 | [perl #114496]. | |
1423 | ||
1424 | =item * | |
1425 | ||
1426 | A rare race condition that would lead to L<sleep|perlfunc/sleep> taking more | |
1427 | time than requested, and possibly even hanging, has been fixed [perl #33096]. | |
1428 | ||
1429 | =item * | |
1430 | ||
1431 | C<link> on Win32 now attempts to set C<$!> to more appropriate values | |
1432 | based on the Win32 API error code. [perl #112272] | |
1433 | ||
1434 | Perl no longer mangles the environment block, e.g. when launching a new | |
1435 | sub-process, when the environment contains non-ASCII characters. Known | |
1436 | problems still remain, however, when the environment contains characters | |
1437 | outside of the current ANSI codepage (e.g. see the item about Unicode in | |
1438 | C<%ENV> in L<http://perl5.git.perl.org/perl.git/blob/HEAD:/Porting/todo.pod>). | |
1439 | [perl #113536] | |
1440 | ||
1441 | =item * | |
1442 | ||
1443 | Building perl with some Windows compilers used to fail due to a problem | |
1444 | with miniperl's C<glob> operator (which uses the C<perlglob> program) | |
1445 | deleting the PATH environment variable [perl #113798]. | |
1446 | ||
1447 | =item * | |
1448 | ||
f105b7be | 1449 | A new makefile option, C<USE_64_BIT_INT>, has been added to the Windows |
5ed58cbd RS |
1450 | makefiles. Set this to "define" when building a 32-bit perl if you want |
1451 | it to use 64-bit integers. | |
1452 | ||
1453 | Machine code size reductions, already made to the DLLs of XS modules in | |
1454 | Perl 5.17.2, have now been extended to the perl DLL itself. | |
1455 | ||
1456 | Building with VC++ 6.0 was inadvertently broken in Perl 5.17.2 but has | |
1457 | now been fixed again. | |
1458 | ||
1459 | =back | |
1460 | ||
1461 | =head3 WinCE | |
1462 | ||
1463 | Building on WinCE is now possible once again, although more work is required | |
1464 | to fully restore a clean build. | |
1465 | ||
1466 | =head1 Internal Changes | |
1467 | ||
1468 | =over | |
1469 | ||
1470 | =item * | |
1471 | ||
4263dd11 | 1472 | Synonyms for the misleadingly named C<av_len()> have been created: |
5ed58cbd RS |
1473 | C<av_top_index()> and C<av_tindex>. All three of these return the |
1474 | number of the highest index in the array, not the number of elements it | |
1475 | contains. | |
1476 | ||
1477 | =item * | |
1478 | ||
1479 | SvUPGRADE() is no longer an expression. Originally this macro (and its | |
1480 | underlying function, sv_upgrade()) were documented as boolean, although | |
1481 | in reality they always croaked on error and never returned false. In 2005 | |
1482 | the documentation was updated to specify a void return value, but | |
1483 | SvUPGRADE() was left always returning 1 for backwards compatibility. This | |
1484 | has now been removed, and SvUPGRADE() is now a statement with no return | |
1485 | value. | |
1486 | ||
1487 | So this is now a syntax error: | |
1488 | ||
1489 | if (!SvUPGRADE(sv)) { croak(...); } | |
1490 | ||
1491 | If you have code like that, simply replace it with | |
1492 | ||
1493 | SvUPGRADE(sv); | |
1494 | ||
1495 | or to to avoid compiler warnings with older perls, possibly | |
1496 | ||
1497 | (void)SvUPGRADE(sv); | |
1498 | ||
1499 | =item * | |
1500 | ||
1501 | Perl has a new copy-on-write mechanism that allows any SvPOK scalar to be | |
1502 | upgraded to a copy-on-write scalar. A reference count on the string buffer | |
1503 | is stored in the string buffer itself. | |
1504 | ||
1505 | This breaks a few XS modules by allowing copy-on-write scalars to go | |
1506 | through code paths that never encountered them before. | |
1507 | ||
1508 | This behaviour can still be disabled by running F<Configure> with | |
1509 | B<-Accflags=-DPERL_NO_COW>. This option will probably be removed in Perl | |
1510 | 5.20. | |
1511 | ||
1512 | =item * | |
1513 | ||
1514 | Copy-on-write no longer uses the SvFAKE and SvREADONLY flags. Hence, | |
1515 | SvREADONLY indicates a true read-only SV. | |
1516 | ||
1517 | Use the SvIsCOW macro (as before) to identify a copy-on-write scalar. | |
1518 | ||
1519 | =item * | |
1520 | ||
f105b7be | 1521 | C<PL_glob_index> is gone. |
5ed58cbd RS |
1522 | |
1523 | =item * | |
1524 | ||
1525 | The private Perl_croak_no_modify has had its context parameter removed. It is | |
1526 | now has a void prototype. Users of the public API croak_no_modify remain | |
1527 | unaffected. | |
1528 | ||
1529 | =item * | |
1530 | ||
1531 | Copy-on-write (shared hash key) scalars are no longer marked read-only. | |
1532 | C<SvREADONLY> returns false on such an SV, but C<SvIsCOW> still returns | |
1533 | true. | |
1534 | ||
1535 | =item * | |
1536 | ||
1537 | A new op type, C<OP_PADRANGE> has been introduced. The perl peephole | |
1538 | optimiser will, where possible, substitute a single padrange op for a | |
1539 | pushmark followed by one or more pad ops, and possibly also skipping list | |
1540 | and nextstate ops. In addition, the op can carry out the tasks associated | |
f105b7be | 1541 | with the RHS of a C<< my(...) = @_ >> assignment, so those ops may be optimised |
5ed58cbd RS |
1542 | away too. |
1543 | ||
1544 | =item * | |
1545 | ||
1546 | Case-insensitive matching inside a [bracketed] character class with a | |
1547 | multi-character fold no longer excludes one of the possibilities in the | |
1548 | circumstances that it used to. [perl #89774]. | |
1549 | ||
1550 | =item * | |
1551 | ||
1552 | C<PL_formfeed> has been removed. | |
1553 | ||
1554 | =item * | |
1555 | ||
1556 | The regular expression engine no longer reads one byte past the end of the | |
1557 | target string. While for all internally well-formed scalars this should | |
1558 | never have been a problem, this change facilitates clever tricks with | |
1559 | string buffers in CPAN modules. [perl #73542] | |
1560 | ||
1561 | =item * | |
1562 | ||
1563 | Inside a BEGIN block, C<PL_compcv> now points to the currently-compiling | |
1564 | subroutine, rather than the BEGIN block itself. | |
1565 | ||
1566 | =item * | |
1567 | ||
1568 | C<mg_length> has been deprecated. | |
1569 | ||
1570 | =item * | |
1571 | ||
1572 | C<sv_len> now always returns a byte count and C<sv_len_utf8> a character | |
1573 | count. Previously, C<sv_len> and C<sv_len_utf8> were both buggy and would | |
1574 | sometimes returns bytes and sometimes characters. C<sv_len_utf8> no longer | |
1575 | assumes that its argument is in UTF8. Neither of these creates UTF8 caches | |
1576 | for tied or overloaded values or for non-PVs any more. | |
1577 | ||
1578 | =item * | |
1579 | ||
1580 | C<sv_mortalcopy> now copies string buffers of shared hash key scalars when | |
1581 | called from XS modules [perl #79824]. | |
1582 | ||
1583 | =item * | |
1584 | ||
1585 | C<RXf_SPLIT> and C<RXf_SKIPWHITE> are no longer used. They are now | |
1586 | #defined as 0. | |
1587 | ||
1588 | =item * | |
1589 | ||
1590 | The new C<RXf_MODIFIES_VARS> flag can be set by custom regular expression | |
1591 | engines to indicate that the execution of the regular expression may cause | |
1592 | variables to be modified. This lets C<s///> know to skip certain | |
1593 | optimisations. Perl's own regular expression engine sets this flag for the | |
1594 | special backtracking verbs that set $REGMARK and $REGERROR. | |
1595 | ||
1596 | =item * | |
1597 | ||
1598 | The APIs for accessing lexical pads have changed considerably. | |
1599 | ||
1600 | C<PADLIST>s are now longer C<AV>s, but their own type instead. | |
1601 | C<PADLIST>s now contain a C<PAD> and a C<PADNAMELIST> of C<PADNAME>s, | |
1602 | rather than C<AV>s for the pad and the list of pad names. C<PAD>s, | |
1603 | C<PADNAMELIST>s, and C<PADNAME>s are to be accessed as such through the | |
1604 | newly added pad API instead of the plain C<AV> and C<SV> APIs. See | |
1605 | L<perlapi> for details. | |
1606 | ||
1607 | =item * | |
1608 | ||
1609 | In the regex API, the numbered capture callbacks are passed an index | |
1610 | indicating what match variable is being accessed. There are special | |
1611 | index values for the C<$`, $&, $&> variables. Previously the same three | |
1612 | values were used to retrieve C<${^PREMATCH}, ${^MATCH}, ${^POSTMATCH}> | |
1613 | too, but these have now been assigned three separate values. See | |
1614 | L<perlreapi/Numbered capture callbacks>. | |
1615 | ||
1616 | =item * | |
1617 | ||
1618 | C<PL_sawampersand> was previously a boolean indicating that any of | |
1619 | C<$`, $&, $&> had been seen; it now contains three one-bit flags | |
1620 | indicating the presence of each of the variables individually. | |
1621 | ||
1622 | =item * | |
1623 | ||
1624 | The C<CV *> typemap entry now supports C<&{}> overloading and typeglobs, | |
1625 | just like C<&{...}> [perl #96872]. | |
1626 | ||
1627 | =item * | |
1628 | ||
1629 | The C<SVf_AMAGIC> flag to indicate overloading is now on the stash, not the | |
1630 | object. It is now set automatically whenever a method or @ISA changes, so | |
1631 | its meaning has changed, too. It now means "potentially overloaded". When | |
1632 | the overload table is calculated, the flag is automatically turned off if | |
1633 | there is no overloading, so there should be no noticeable slowdown. | |
1634 | ||
1635 | The staleness of the overload tables is now checked when overload methods | |
1636 | are invoked, rather than during C<bless>. | |
1637 | ||
1638 | "A" magic is gone. The changes to the handling of the C<SVf_AMAGIC> flag | |
1639 | eliminate the need for it. | |
1640 | ||
1641 | C<PL_amagic_generation> has been removed as no longer necessary. For XS | |
1642 | modules, it is now a macro alias to C<PL_na>. | |
1643 | ||
1644 | The fallback overload setting is now stored in a stash entry separate from | |
1645 | overloadedness itself. | |
1646 | ||
1647 | =item * | |
1648 | ||
1649 | The character-processing code has been cleaned up in places. The changes | |
1650 | should be operationally invisible. | |
1651 | ||
1652 | =item * | |
1653 | ||
1654 | The C<study> function was made a no-op in 5.16. It was simply disabled via | |
1655 | a C<return> statement; the code was left in place. Now the code supporting | |
1656 | what C<study> used to do has been removed. | |
1657 | ||
1658 | =item * | |
1659 | ||
1660 | Under threaded perls, there is no longer a separate PV allocated for every | |
1661 | COP to store its package name (C<< cop->stashpv >>). Instead, there is an | |
1662 | offset (C<< cop->stashoff >>) into the new C<PL_stashpad> array, which | |
1663 | holds stash pointers. | |
1664 | ||
1665 | =item * | |
1666 | ||
1667 | In the pluggable regex API, the C<regexp_engine> struct has acquired a new | |
1668 | field C<op_comp>, which is currently just for perl's internal use, and | |
f105b7be | 1669 | should be initialized to NULL by other regex plugin modules. |
5ed58cbd RS |
1670 | |
1671 | =item * | |
1672 | ||
1673 | A new function C<alloccoptash> has been added to the API, but is considered | |
1674 | experimental. See L<perlapi>. | |
1675 | ||
1676 | =item * | |
1677 | ||
1678 | Perl used to implement get magic in a way that would sometimes hide bugs in | |
4263dd11 | 1679 | code that could call mg_get() too many times on magical values. This hiding of |
5ed58cbd RS |
1680 | errors no longer occurs, so long-standing bugs may become visible now. If |
1681 | you see magic-related errors in XS code, check to make sure it, together | |
1682 | with the Perl API functions it uses, calls mg_get() only once on SvGMAGICAL() | |
1683 | values. | |
1684 | ||
1685 | =item * | |
1686 | ||
1687 | OP allocation for CVs now uses a slab allocator. This simplifies | |
1688 | memory management for OPs allocated to a CV, so cleaning up after a | |
1689 | compilation error is simpler and safer [perl #111462][perl #112312]. | |
1690 | ||
1691 | =item * | |
1692 | ||
f105b7be | 1693 | C<PERL_DEBUG_READONLY_OPS> has been rewritten to work with the new slab |
5ed58cbd RS |
1694 | allocator, allowing it to catch more violations than before. |
1695 | ||
1696 | =item * | |
1697 | ||
f105b7be KE |
1698 | The old slab allocator for ops, which was only enabled for C<PERL_IMPLICIT_SYS> |
1699 | and C<PERL_DEBUG_READONLY_OPS>, has been retired. | |
5ed58cbd RS |
1700 | |
1701 | =back | |
1702 | ||
1703 | =head1 Selected Bug Fixes | |
1704 | ||
1705 | =over 4 | |
1706 | ||
1707 | =item * | |
1708 | ||
1709 | Here-doc terminators no longer require a terminating newline character when | |
1710 | they occur at the end of a file. This was already the case at the end of a | |
1711 | string eval [perl #65838]. | |
1712 | ||
1713 | =item * | |
1714 | ||
f105b7be | 1715 | C<-DPERL_GLOBAL_STRUCT> builds now free the global struct B<after> |
5ed58cbd RS |
1716 | they've finished using it. |
1717 | ||
1718 | =item * | |
1719 | ||
1720 | A trailing '/' on a path in @INC will no longer have an additional '/' | |
1721 | appended. | |
1722 | ||
1723 | =item * | |
1724 | ||
1725 | The C<:crlf> layer now works when unread data doesn't fit into its own | |
1726 | buffer. [perl #112244]. | |
1727 | ||
1728 | =item * | |
1729 | ||
1730 | C<ungetc()> now handles UTF-8 encoded data. [perl #116322]. | |
1731 | ||
1732 | =item * | |
1733 | ||
1734 | A bug in the core typemap caused any C types that map to the T_BOOL core | |
1735 | typemap entry to not be set, updated, or modified when the T_BOOL variable was | |
1736 | used in an OUTPUT: section with an exception for RETVAL. T_BOOL in an INPUT: | |
1737 | section was not affected. Using a T_BOOL return type for an XSUB (RETVAL) | |
1738 | was not affected. A side effect of fixing this bug is, if a T_BOOL is specified | |
1739 | in the OUTPUT: section (which previous did nothing to the SV), and a read only | |
1740 | SV (literal) is passed to the XSUB, croaks like "Modification of a read-only | |
1741 | value attempted" will happen. [perl #115796] | |
1742 | ||
1743 | =item * | |
1744 | ||
1745 | On many platforms, providing a directory name as the script name caused perl | |
1746 | to do nothing and report success. It should now universally report an error | |
1747 | and exit nonzero. [perl #61362] | |
1748 | ||
1749 | =item * | |
1750 | ||
1751 | C<sort {undef} ...> under fatal warnings no longer crashes. It had | |
1752 | begun crashing in Perl 5.16. | |
1753 | ||
1754 | =item * | |
1755 | ||
1756 | Stashes blessed into each other | |
1757 | (C<bless \%Foo::, 'Bar'; bless \%Bar::, 'Foo'>) no longer result in double | |
1758 | frees. This bug started happening in Perl 5.16. | |
1759 | ||
1760 | =item * | |
1761 | ||
1762 | Numerous memory leaks have been fixed, mostly involving fatal warnings and | |
1763 | syntax errors. | |
1764 | ||
1765 | =item * | |
1766 | ||
1767 | Some failed regular expression matches such as C<'f' =~ /../g> were not | |
1768 | resetting C<pos>. Also, "match-once" patterns (C<m?...?g>) failed to reset | |
1769 | it, too, when invoked a second time [perl #23180]. | |
1770 | ||
1771 | =item * | |
1772 | ||
1773 | Accessing C<$&> after a pattern match now works if it had not been seen | |
1774 | before the match. I.e., this applies to C<${'&'}> (under C<no strict>) and | |
1775 | C<eval '$&'>. The same applies to C<$'> and C<$`> [perl #4289]. | |
1776 | ||
1777 | =item * | |
1778 | ||
1779 | Several bugs involving C<local *ISA> and C<local *Foo::> causing stale | |
1780 | MRO caches have been fixed. | |
1781 | ||
1782 | =item * | |
1783 | ||
1784 | Defining a subroutine when its typeglob has been aliased no longer results | |
1785 | in stale method caches. This bug was introduced in Perl 5.10. | |
1786 | ||
1787 | =item * | |
1788 | ||
1789 | Localising a typeglob containing a subroutine when the typeglob's package | |
1790 | has been deleted from its parent stash no longer produces an error. This | |
1791 | bug was introduced in Perl 5.14. | |
1792 | ||
1793 | =item * | |
1794 | ||
1795 | Under some circumstances, C<local *method=...> would fail to reset method | |
1796 | caches upon scope exit. | |
1797 | ||
1798 | =item * | |
1799 | ||
1800 | C</[.foo.]/> is no longer an error, but produces a warning (as before) and | |
1801 | is treated as C</[.fo]/> [perl #115818]. | |
1802 | ||
1803 | =item * | |
1804 | ||
1805 | C<goto $tied_var> now calls FETCH before deciding what type of goto | |
1806 | (subroutine or label) this is. | |
1807 | ||
1808 | =item * | |
1809 | ||
1810 | Renaming packages through glob assignment | |
1811 | (C<*Foo:: = *Bar::; *Bar:: = *Baz::>) in combination with C<m?...?> and | |
1812 | C<reset> no longer makes threaded builds crash. | |
1813 | ||
1814 | =item * | |
1815 | ||
1816 | A number of bugs related to assigning a list to hash have been fixed. Many of | |
1817 | these involve lists with repeated keys like C<(1, 1, 1, 1)>. | |
1818 | ||
1819 | =over 4 | |
1820 | ||
1821 | =item * | |
1822 | ||
1823 | The expression C<scalar(%h = (1, 1, 1, 1))> now returns C<4>, not C<2>. | |
1824 | ||
1825 | =item * | |
1826 | ||
1827 | The return value of C<%h = (1, 1, 1)> in list context was wrong. Previously | |
1828 | this would return C<(1, undef, 1)>, now it returns C<(1, undef)>. | |
1829 | ||
1830 | =item * | |
1831 | ||
1832 | Perl now issues the same warning on C<($s, %h) = (1, {})> as it does for | |
1833 | C<(%h) = ({})>, "Reference found where even-sized list expected". | |
1834 | ||
1835 | =item * | |
1836 | ||
1837 | A number of additional edge cases in list assignment to hashes were | |
1838 | corrected. For more details see commit 23b7025ebc. | |
1839 | ||
1840 | =back | |
1841 | ||
1842 | =item * | |
1843 | ||
1844 | Attributes applied to lexical variables no longer leak memory. | |
1845 | [perl #114764] | |
1846 | ||
1847 | =item * | |
1848 | ||
1849 | C<dump>, C<goto>, C<last>, C<next>, C<redo> or C<require> followed by a | |
1850 | bareword (or version) and then an infix operator is no longer a syntax | |
1851 | error. It used to be for those infix operators (like C<+>) that have a | |
1852 | different meaning where a term is expected. [perl #105924] | |
1853 | ||
1854 | =item * | |
1855 | ||
1856 | C<require a::b . 1> and C<require a::b + 1> no longer produce erroneous | |
1857 | ambiguity warnings. [perl #107002] | |
1858 | ||
1859 | =item * | |
1860 | ||
1861 | Class method calls are now allowed on any string, and not just strings | |
1862 | beginning with an alphanumeric character. [perl #105922] | |
1863 | ||
1864 | =item * | |
1865 | ||
1866 | An empty pattern created with C<qr//> used in C<m///> no longer triggers | |
1867 | the "empty pattern reuses last pattern" behaviour. [perl #96230] | |
1868 | ||
1869 | =item * | |
1870 | ||
1871 | Tying a hash during iteration no longer results in a memory leak. | |
1872 | ||
1873 | =item * | |
1874 | ||
1875 | Freeing a tied hash during iteration no longer results in a memory leak. | |
1876 | ||
1877 | =item * | |
1878 | ||
1879 | List assignment to a tied array or hash that dies on STORE no longer | |
1880 | results in a memory leak. | |
1881 | ||
1882 | =item * | |
1883 | ||
1884 | If the hint hash (C<%^H>) is tied, compile-time scope entry (which copies | |
1885 | the hint hash) no longer leaks memory if FETCH dies. [perl #107000] | |
1886 | ||
1887 | =item * | |
1888 | ||
1889 | Constant folding no longer inappropriately triggers the special | |
1890 | C<split " "> behaviour. [perl #94490] | |
1891 | ||
1892 | =item * | |
1893 | ||
1894 | C<defined scalar(@array)>, C<defined do { &foo }>, and similar constructs | |
1895 | now treat the argument to C<defined> as a simple scalar. [perl #97466] | |
1896 | ||
1897 | =item * | |
1898 | ||
1899 | Running a custom debugging that defines no C<*DB::DB> glob or provides a | |
1900 | subroutine stub for C<&DB::DB> no longer results in a crash, but an error | |
1901 | instead. [perl #114990] | |
1902 | ||
1903 | =item * | |
1904 | ||
1905 | C<reset ""> now matches its documentation. C<reset> only resets C<m?...?> | |
1906 | patterns when called with no argument. An empty string for an argument now | |
1907 | does nothing. (It used to be treated as no argument.) [perl #97958] | |
1908 | ||
1909 | =item * | |
1910 | ||
1911 | C<printf> with an argument returning an empty list no longer reads past the | |
1912 | end of the stack, resulting in erratic behaviour. [perl #77094] | |
1913 | ||
1914 | =item * | |
1915 | ||
1916 | C<--subname> no longer produces erroneous ambiguity warnings. | |
1917 | [perl #77240] | |
1918 | ||
1919 | =item * | |
1920 | ||
1921 | C<v10> is now allowed as a label or package name. This was inadvertently | |
1922 | broken when v-strings were added in Perl 5.6. [perl #56880] | |
1923 | ||
1924 | =item * | |
1925 | ||
1926 | C<length>, C<pos>, C<substr> and C<sprintf> could be confused by ties, | |
1927 | overloading, references and typeglobs if the stringification of such | |
1928 | changed the internal representation to or from UTF8. [perl #114410] | |
1929 | ||
1930 | =item * | |
1931 | ||
1932 | utf8::encode now calls FETCH and STORE on tied variables. utf8::decode now | |
1933 | calls STORE (it was already calling FETCH). | |
1934 | ||
1935 | =item * | |
1936 | ||
1937 | C<$tied =~ s/$non_utf8/$utf8/> no longer loops infinitely if the tied | |
1938 | variable returns a Latin-1 string, shared hash key scalar, or reference or | |
2ae351f8 | 1939 | typeglob that stringifies as ASCII or Latin-1. This was a regression from |
5ed58cbd RS |
1940 | 5.12.x. |
1941 | ||
1942 | =item * | |
1943 | ||
1944 | C<s///> without /e is now better at detecting when it needs to forego | |
1945 | certain optimisations, fixing some buggy cases: | |
1946 | ||
1947 | =over | |
1948 | ||
1949 | =item * | |
1950 | ||
1951 | Match variables in certain constructs (C<&&>, C<||>, C<..> and others) in | |
1952 | the replacement part; e.g., C<s/(.)/$l{$a||$1}/g>. [perl #26986] | |
1953 | ||
1954 | =item * | |
1955 | ||
1956 | Aliases to match variables in the replacement. | |
1957 | ||
1958 | =item * | |
1959 | ||
1960 | C<$REGERROR> or C<$REGMARK> in the replacement. [perl #49190] | |
1961 | ||
1962 | =item * | |
1963 | ||
1964 | An empty pattern (C<s//$foo/>) that causes the last-successful pattern to | |
1965 | be used, when that pattern contains code blocks that modify the variables | |
1966 | in the replacement. | |
1967 | ||
1968 | =back | |
1969 | ||
1970 | =item * | |
1971 | ||
1972 | The taintedness of the replacement string no longer affects the taintedness | |
1973 | of the return value of C<s///e>. | |
1974 | ||
1975 | =item * | |
1976 | ||
1977 | The C<$|> autoflush variable is created on-the-fly when needed. If this | |
1978 | happened (e.g., if it was mentioned in a module or eval) when the | |
1979 | currently-selected filehandle was a typeglob with an empty IO slot, it used | |
1980 | to crash. [perl #115206] | |
1981 | ||
1982 | =item * | |
1983 | ||
1984 | Line numbers at the end of a string eval are no longer off by one. | |
1985 | [perl #114658] | |
1986 | ||
1987 | =item * | |
1988 | ||
1989 | @INC filters (subroutines returned by subroutines in @INC) that set $_ to a | |
1990 | copy-on-write scalar no longer cause the parser to modify that string | |
1991 | buffer in place. | |
1992 | ||
1993 | =item * | |
1994 | ||
1995 | C<length($object)> no longer returns the undefined value if the object has | |
1996 | string overloading that returns undef. [perl #115260] | |
1997 | ||
1998 | =item * | |
1999 | ||
2000 | The use of C<PL_stashcache>, the stash name lookup cache for method calls, has | |
2001 | been restored, | |
2002 | ||
2003 | Commit da6b625f78f5f133 in August 2011 inadvertently broke the code that looks | |
2004 | up values in C<PL_stashcache>. As it's a only cache, quite correctly everything | |
2005 | carried on working without it. | |
2006 | ||
2007 | =item * | |
2008 | ||
2009 | The error "Can't localize through a reference" had disappeared in 5.16.0 | |
2010 | when C<local %$ref> appeared on the last line of an lvalue subroutine. | |
2011 | This error disappeared for C<\local %$ref> in perl 5.8.1. It has now | |
2012 | been restored. | |
2013 | ||
2014 | =item * | |
2015 | ||
2016 | The parsing of here-docs has been improved significantly, fixing several | |
2017 | parsing bugs and crashes and one memory leak, and correcting wrong | |
2018 | subsequent line numbers under certain conditions. | |
2019 | ||
2020 | =item * | |
2021 | ||
2022 | Inside an eval, the error message for an unterminated here-doc no longer | |
2023 | has a newline in the middle of it [perl #70836]. | |
2024 | ||
2025 | =item * | |
2026 | ||
2027 | A substitution inside a substitution pattern (C<s/${s|||}//>) no longer | |
2028 | confuses the parser. | |
2029 | ||
2030 | =item * | |
2031 | ||
2032 | It may be an odd place to allow comments, but C<s//"" # hello/e> has | |
2033 | always worked, I<unless> there happens to be a null character before the | |
2034 | first #. Now it works even in the presence of nulls. | |
2035 | ||
2036 | =item * | |
2037 | ||
2038 | An invalid range in C<tr///> or C<y///> no longer results in a memory leak. | |
2039 | ||
2040 | =item * | |
2041 | ||
2042 | String eval no longer treats a semicolon-delimited quote-like operator at | |
2043 | the very end (C<eval 'q;;'>) as a syntax error. | |
2044 | ||
2045 | =item * | |
2046 | ||
2047 | C<< warn {$_ => 1} + 1 >> is no longer a syntax error. The parser used to | |
2048 | get confused with certain list operators followed by an anonymous hash and | |
2049 | then an infix operator that shares its form with a unary operator. | |
2050 | ||
2051 | =item * | |
2052 | ||
2053 | C<(caller $n)[6]> (which gives the text of the eval) used to return the | |
2054 | actual parser buffer. Modifying it could result in crashes. Now it always | |
2055 | returns a copy. The string returned no longer has "\n;" tacked on to the | |
2056 | end. The returned text also includes here-doc bodies, which used to be | |
2057 | omitted. | |
2058 | ||
2059 | =item * | |
2060 | ||
2061 | Reset the utf8 position cache when accessing magical variables to avoid the | |
2062 | string buffer and the utf8 position cache getting out of sync | |
2063 | [perl #114410]. | |
2064 | ||
2065 | =item * | |
2066 | ||
2067 | Various cases of get magic being called twice for magical utf8 strings have been | |
2068 | fixed. | |
2069 | ||
2070 | =item * | |
2071 | ||
2072 | This code (when not in the presence of C<$&> etc) | |
2073 | ||
2074 | $_ = 'x' x 1_000_000; | |
2075 | 1 while /(.)/; | |
2076 | ||
2077 | used to skip the buffer copy for performance reasons, but suffered from C<$1> | |
2078 | etc changing if the original string changed. That's now been fixed. | |
2079 | ||
2080 | =item * | |
2081 | ||
2082 | Perl doesn't use PerlIO anymore to report out of memory messages, as PerlIO | |
2083 | might attempt to allocate more memory. | |
2084 | ||
2085 | =item * | |
2086 | ||
2087 | In a regular expression, if something is quantified with C<{n,m}> where | |
2088 | C<S<n E<gt> m>>, it can't possibly match. Previously this was a fatal | |
2089 | error, but now is merely a warning (and that something won't match). | |
2090 | [perl #82954]. | |
2091 | ||
2092 | =item * | |
2093 | ||
2094 | It used to be possible for formats defined in subroutines that have | |
2095 | subsequently been undefined and redefined to close over variables in the | |
2096 | wrong pad (the newly-defined enclosing sub), resulting in crashes or | |
2097 | "Bizarre copy" errors. | |
2098 | ||
2099 | =item * | |
2100 | ||
2101 | Redefinition of XSUBs at run time could produce warnings with the wrong | |
2102 | line number. | |
2103 | ||
2104 | =item * | |
2105 | ||
2106 | The %vd sprintf format does not support version objects for alpha versions. | |
2107 | It used to output the format itself (%vd) when passed an alpha version, and | |
2108 | also emit an "Invalid conversion in printf" warning. It no longer does, | |
2109 | but produces the empty string in the output. It also no longer leaks | |
2110 | memory in this case. | |
2111 | ||
2112 | =item * | |
2113 | ||
2114 | C<< $obj->SUPER::method >> calls in the main package could fail if the | |
2115 | SUPER package had already been accessed by other means. | |
2116 | ||
2117 | =item * | |
2118 | ||
f105b7be | 2119 | Stash aliasing (C<< *foo:: = *bar:: >>) no longer causes SUPER calls to ignore |
5ed58cbd RS |
2120 | changes to methods or @ISA or use the wrong package. |
2121 | ||
2122 | =item * | |
2123 | ||
2124 | Method calls on packages whose names end in ::SUPER are no longer treated | |
2125 | as SUPER method calls, resulting in failure to find the method. | |
2126 | Furthermore, defining subroutines in such packages no longer causes them to | |
2127 | be found by SUPER method calls on the containing package [perl #114924]. | |
2128 | ||
2129 | =item * | |
2130 | ||
2131 | C<\w> now matches the code points U+200C (ZERO WIDTH NON-JOINER) and U+200D | |
2132 | (ZERO WIDTH JOINER). C<\W> no longer matches these. This change is because | |
2133 | Unicode corrected their definition of what C<\w> should match. | |
2134 | ||
2135 | =item * | |
2136 | ||
2137 | C<dump LABEL> no longer leaks its label. | |
2138 | ||
2139 | =item * | |
2140 | ||
2141 | Constant folding no longer changes the behaviour of functions like C<stat()> | |
2142 | and C<truncate()> that can take either filenames or handles. | |
2143 | C<stat 1 ? foo : bar> nows treats its argument as a file name (since it is an | |
2144 | arbitrary expression), rather than the handle "foo". | |
2145 | ||
2146 | =item * | |
2147 | ||
2148 | C<truncate FOO, $len> no longer falls back to treating "FOO" as a file name if | |
2149 | the filehandle has been deleted. This was broken in Perl 5.16.0. | |
2150 | ||
2151 | =item * | |
2152 | ||
2153 | Subroutine redefinitions after sub-to-glob and glob-to-glob assignments no | |
2154 | longer cause double frees or panic messages. | |
2155 | ||
2156 | =item * | |
2157 | ||
2158 | C<s///> now turns vstrings into plain strings when performing a substitution, | |
2159 | even if the resulting string is the same (C<s/a/a/>). | |
2160 | ||
2161 | =item * | |
2162 | ||
2163 | Prototype mismatch warnings no longer erroneously treat constant subs as having | |
2164 | no prototype when they actually have "". | |
2165 | ||
2166 | =item * | |
2167 | ||
2168 | Constant subroutines and forward declarations no longer prevent prototype | |
2169 | mismatch warnings from omitting the sub name. | |
2170 | ||
2171 | =item * | |
2172 | ||
2173 | C<undef> on a subroutine now clears call checkers. | |
2174 | ||
2175 | =item * | |
2176 | ||
2177 | The C<ref> operator started leaking memory on blessed objects in Perl 5.16.0. | |
2178 | This has been fixed [perl #114340]. | |
2179 | ||
2180 | =item * | |
2181 | ||
2182 | C<use> no longer tries to parse its arguments as a statement, making | |
2183 | C<use constant { () };> a syntax error [perl #114222]. | |
2184 | ||
2185 | =item * | |
2186 | ||
2187 | On debugging builds, "uninitialized" warnings inside formats no longer cause | |
2188 | assertion failures. | |
2189 | ||
2190 | =item * | |
2191 | ||
2192 | On debugging builds, subroutines nested inside formats no longer cause | |
2193 | assertion failures [perl #78550]. | |
2194 | ||
2195 | =item * | |
2196 | ||
2197 | Formats and C<use> statements are now permitted inside formats. | |
2198 | ||
2199 | =item * | |
2200 | ||
2201 | C<print $x> and C<sub { print $x }-E<gt>()> now always produce the same output. | |
2202 | It was possible for the latter to refuse to close over $x if the variable was | |
2203 | not active; e.g., if it was defined outside a currently-running named | |
2204 | subroutine. | |
2205 | ||
2206 | =item * | |
2207 | ||
2208 | Similarly, C<print $x> and C<print eval '$x'> now produce the same output. | |
2209 | This also allows "my $x if 0" variables to be seen in the debugger [perl | |
2210 | #114018]. | |
2211 | ||
2212 | =item * | |
2213 | ||
2214 | Formats called recursively no longer stomp on their own lexical variables, but | |
2215 | each recursive call has its own set of lexicals. | |
2216 | ||
2217 | =item * | |
2218 | ||
2219 | Attempting to free an active format or the handle associated with it no longer | |
2220 | results in a crash. | |
2221 | ||
2222 | =item * | |
2223 | ||
2224 | Format parsing no longer gets confused by braces, semicolons and low-precedence | |
2225 | operators. It used to be possible to use braces as format delimiters (instead | |
2226 | of C<=> and C<.>), but only sometimes. Semicolons and low-precedence operators | |
2227 | in format argument lines no longer confuse the parser into ignoring the line's | |
2228 | return value. In format argument lines, braces can now be used for anonymous | |
2229 | hashes, instead of being treated always as C<do> blocks. | |
2230 | ||
2231 | =item * | |
2232 | ||
2233 | Formats can now be nested inside code blocks in regular expressions and other | |
2234 | quoted constructs (C</(?{...})/> and C<qq/${...}/>) [perl #114040]. | |
2235 | ||
2236 | =item * | |
2237 | ||
2238 | Formats are no longer created after compilation errors. | |
2239 | ||
2240 | =item * | |
2241 | ||
2242 | Under debugging builds, the B<-DA> command line option started crashing in Perl | |
2243 | 5.16.0. It has been fixed [perl #114368]. | |
2244 | ||
2245 | =item * | |
2246 | ||
2247 | A potential deadlock scenario involving the premature termination of a pseudo- | |
2248 | forked child in a Windows build with ithreads enabled has been fixed. This | |
2249 | resolves the common problem of the F<t/op/fork.t> test hanging on Windows [perl | |
2250 | #88840]. | |
2251 | ||
2252 | =item * | |
2253 | ||
5ed58cbd RS |
2254 | The code which generates errors from C<require()> could potentially read one or |
2255 | two bytes before the start of the filename for filenames less than three bytes | |
2256 | long and ending C</\.p?\z/>. This has now been fixed. Note that it could | |
2257 | never have happened with module names given to C<use()> or C<require()> anyway. | |
2258 | ||
2259 | =item * | |
2260 | ||
2261 | The handling of pathnames of modules given to C<require()> has been made | |
2262 | thread-safe on VMS. | |
2263 | ||
2264 | =item * | |
2265 | ||
d85cd26b RS |
2266 | Non-blocking sockets have been fixed on VMS. |
2267 | ||
2268 | =item * | |
2269 | ||
5ed58cbd RS |
2270 | A bug in the compilation of a C</(?{})/> expression which affected the TryCatch |
2271 | test suite has been fixed [perl #114242]. | |
2272 | ||
2273 | =item * | |
2274 | ||
2275 | Pod can now be nested in code inside a quoted construct outside of a string | |
2276 | eval. This used to work only within string evals [perl #114040]. | |
2277 | ||
2278 | =item * | |
2279 | ||
2280 | C<goto ''> now looks for an empty label, producing the "goto must have | |
2281 | label" error message, instead of exiting the program [perl #111794]. | |
2282 | ||
2283 | =item * | |
2284 | ||
2285 | C<goto "\0"> now dies with "Can't find label" instead of "goto must have | |
2286 | label". | |
2287 | ||
2288 | =item * | |
2289 | ||
2290 | The C function C<hv_store> used to result in crashes when used on C<%^H> | |
2291 | [perl #111000]. | |
2292 | ||
2293 | =item * | |
2294 | ||
2295 | A call checker attached to a closure prototype via C<cv_set_call_checker> | |
2296 | is now copied to closures cloned from it. So C<cv_set_call_checker> now | |
2297 | works inside an attribute handler for a closure. | |
2298 | ||
2299 | =item * | |
2300 | ||
2301 | Writing to C<$^N> used to have no effect. Now it croaks with "Modification | |
2302 | of a read-only value" by default, but that can be overridden by a custom | |
2303 | regular expression engine, as with C<$1> [perl #112184]. | |
2304 | ||
2305 | =item * | |
2306 | ||
2307 | C<undef> on a control character glob (C<undef *^H>) no longer emits an | |
2308 | erroneous warning about ambiguity [perl #112456]. | |
2309 | ||
2310 | =item * | |
2311 | ||
2312 | For efficiency's sake, many operators and built-in functions return the | |
2313 | same scalar each time. Lvalue subroutines and subroutines in the CORE:: | |
2314 | namespace were allowing this implementation detail to leak through. | |
2315 | C<print &CORE::uc("a"), &CORE::uc("b")> used to print "BB". The same thing | |
2316 | would happen with an lvalue subroutine returning the return value of C<uc>. | |
2317 | Now the value is copied in such cases. | |
2318 | ||
2319 | =item * | |
2320 | ||
2321 | C<method {}> syntax with an empty block or a block returning an empty list | |
2322 | used to crash or use some random value left on the stack as its invocant. | |
2323 | Now it produces an error. | |
2324 | ||
2325 | =item * | |
2326 | ||
2327 | C<vec> now works with extremely large offsets (E<gt>2 GB) [perl #111730]. | |
2328 | ||
2329 | =item * | |
2330 | ||
2331 | Changes to overload settings now take effect immediately, as do changes to | |
2332 | inheritance that affect overloading. They used to take effect only after | |
2333 | C<bless>. | |
2334 | ||
2335 | Objects that were created before a class had any overloading used to remain | |
2336 | non-overloaded even if the class gained overloading through C<use overload> | |
2337 | or @ISA changes, and even after C<bless>. This has been fixed | |
2338 | [perl #112708]. | |
2339 | ||
2340 | =item * | |
2341 | ||
2342 | Classes with overloading can now inherit fallback values. | |
2343 | ||
2344 | =item * | |
2345 | ||
2346 | Overloading was not respecting a fallback value of 0 if there were | |
2347 | overloaded objects on both sides of an assignment operator like C<+=> | |
2348 | [perl #111856]. | |
2349 | ||
2350 | =item * | |
2351 | ||
2352 | C<pos> now croaks with hash and array arguments, instead of producing | |
2353 | erroneous warnings. | |
2354 | ||
2355 | =item * | |
2356 | ||
2357 | C<while(each %h)> now implies C<while(defined($_ = each %h))>, like | |
2358 | C<readline> and C<readdir>. | |
2359 | ||
2360 | =item * | |
2361 | ||
2362 | Subs in the CORE:: namespace no longer crash after C<undef *_> when called | |
2363 | with no argument list (C<&CORE::time> with no parentheses). | |
2364 | ||
2365 | =item * | |
2366 | ||
2367 | C<unpack> no longer produces the "'/' must follow a numeric type in unpack" | |
2368 | error when it is the data that are at fault [perl #60204]. | |
2369 | ||
2370 | =item * | |
2371 | ||
2372 | C<join> and C<"@array"> now call FETCH only once on a tied C<$"> | |
2373 | [perl #8931]. | |
2374 | ||
2375 | =item * | |
2376 | ||
2377 | Some subroutine calls generated by compiling core ops affected by a | |
2378 | C<CORE::GLOBAL> override had op checking performed twice. The checking | |
2379 | is always idempotent for pure Perl code, but the double checking can | |
2380 | matter when custom call checkers are involved. | |
2381 | ||
2382 | =item * | |
2383 | ||
2384 | A race condition used to exist around fork that could cause a signal sent to | |
2385 | the parent to be handled by both parent and child. Signals are now blocked | |
2386 | briefly around fork to prevent this from happening [perl #82580]. | |
2387 | ||
2388 | =item * | |
2389 | ||
2390 | The implementation of code blocks in regular expressions, such as C<(?{})> | |
2391 | and C<(??{})>, has been heavily reworked to eliminate a whole slew of bugs. | |
2392 | The main user-visible changes are: | |
2393 | ||
2394 | =over 4 | |
2395 | ||
2396 | =item * | |
2397 | ||
2398 | Code blocks within patterns are now parsed in the same pass as the | |
2399 | surrounding code; in particular it is no longer necessary to have balanced | |
2400 | braces: this now works: | |
2401 | ||
2402 | /(?{ $x='{' })/ | |
2403 | ||
2404 | This means that this error message is no longer generated: | |
2405 | ||
2406 | Sequence (?{...}) not terminated or not {}-balanced in regex | |
2407 | ||
2408 | but a new error may be seen: | |
2409 | ||
2410 | Sequence (?{...}) not terminated with ')' | |
2411 | ||
2412 | In addition, literal code blocks within run-time patterns are only | |
2413 | compiled once, at perl compile-time: | |
2414 | ||
2415 | for my $p (...) { | |
2416 | # this 'FOO' block of code is compiled once, | |
2417 | # at the same time as the surrounding 'for' loop | |
2418 | /$p{(?{FOO;})/; | |
2419 | } | |
2420 | ||
2421 | =item * | |
2422 | ||
2423 | Lexical variables are now sane as regards scope, recursion and closure | |
2424 | behavior. In particular, C</A(?{B})C/> behaves (from a closure viewpoint) | |
2425 | exactly like C</A/ && do { B } && /C/>, while C<qr/A(?{B})C/> is like | |
2426 | C<sub {/A/ && do { B } && /C/}>. So this code now works how you might | |
2427 | expect, creating three regexes that match 0, 1, and 2: | |
2428 | ||
2429 | for my $i (0..2) { | |
2430 | push @r, qr/^(??{$i})$/; | |
2431 | } | |
2432 | "1" =~ $r[1]; # matches | |
2433 | ||
2434 | =item * | |
2435 | ||
2436 | The C<use re 'eval'> pragma is now only required for code blocks defined | |
2437 | at runtime; in particular in the following, the text of the C<$r> pattern is | |
2438 | still interpolated into the new pattern and recompiled, but the individual | |
2439 | compiled code-blocks within C<$r> are reused rather than being recompiled, | |
2440 | and C<use re 'eval'> isn't needed any more: | |
2441 | ||
2442 | my $r = qr/abc(?{....})def/; | |
2443 | /xyz$r/; | |
2444 | ||
2445 | =item * | |
2446 | ||
2447 | Flow control operators no longer crash. Each code block runs in a new | |
2448 | dynamic scope, so C<next> etc. will not see | |
2449 | any enclosing loops. C<return> returns a value | |
2450 | from the code block, not from any enclosing subroutine. | |
2451 | ||
2452 | =item * | |
2453 | ||
2454 | Perl normally caches the compilation of run-time patterns, and doesn't | |
2455 | recompile if the pattern hasn't changed, but this is now disabled if | |
2456 | required for the correct behavior of closures. For example: | |
2457 | ||
2458 | my $code = '(??{$x})'; | |
2459 | for my $x (1..3) { | |
2460 | # recompile to see fresh value of $x each time | |
2461 | $x =~ /$code/; | |
2462 | } | |
2463 | ||
2464 | =item * | |
2465 | ||
2466 | The C</msix> and C<(?msix)> etc. flags are now propagated into the return | |
2467 | value from C<(??{})>; this now works: | |
2468 | ||
2469 | "AB" =~ /a(??{'b'})/i; | |
2470 | ||
2471 | =item * | |
2472 | ||
2473 | Warnings and errors will appear to come from the surrounding code (or for | |
2474 | run-time code blocks, from an eval) rather than from an C<re_eval>: | |
2475 | ||
2476 | use re 'eval'; $c = '(?{ warn "foo" })'; /$c/; | |
2477 | /(?{ warn "foo" })/; | |
2478 | ||
2479 | formerly gave: | |
2480 | ||
2481 | foo at (re_eval 1) line 1. | |
2482 | foo at (re_eval 2) line 1. | |
2483 | ||
2484 | and now gives: | |
2485 | ||
2486 | foo at (eval 1) line 1. | |
2487 | foo at /some/prog line 2. | |
2488 | ||
2489 | =back | |
2490 | ||
2491 | =item * | |
2492 | ||
2e7bc647 KW |
2493 | Perl now can be recompiled to use any Unicode version. In v5.16, it |
2494 | worked on Unicodes 6.0 and 6.1, but there were various bugs if earlier | |
2495 | releases were used; the older the release the more problems. | |
5ed58cbd RS |
2496 | |
2497 | =item * | |
2498 | ||
2499 | C<vec> no longer produces "uninitialized" warnings in lvalue context | |
2500 | [perl #9423]. | |
2501 | ||
2502 | =item * | |
2503 | ||
2504 | An optimization involving fixed strings in regular expressions could cause | |
2505 | a severe performance penalty in edge cases. This has been fixed | |
2506 | [perl #76546]. | |
2507 | ||
2508 | =item * | |
2509 | ||
2510 | In certain cases, including empty subpatterns within a regular expression (such | |
2511 | as C<(?:)> or C<(?:|)>) could disable some optimizations. This has been fixed. | |
2512 | ||
2513 | =item * | |
2514 | ||
2515 | The "Can't find an opnumber" message that C<prototype> produces when passed | |
2516 | a string like "CORE::nonexistent_keyword" now passes UTF-8 and embedded | |
2517 | NULs through unchanged [perl #97478]. | |
2518 | ||
2519 | =item * | |
2520 | ||
2521 | C<prototype> now treats magical variables like C<$1> the same way as | |
2522 | non-magical variables when checking for the CORE:: prefix, instead of | |
2523 | treating them as subroutine names. | |
2524 | ||
2525 | =item * | |
2526 | ||
2527 | Under threaded perls, a runtime code block in a regular expression could | |
2528 | corrupt the package name stored in the op tree, resulting in bad reads | |
2529 | in C<caller>, and possibly crashes [perl #113060]. | |
2530 | ||
2531 | =item * | |
2532 | ||
2533 | Referencing a closure prototype (C<\&{$_[1]}> in an attribute handler for a | |
2534 | closure) no longer results in a copy of the subroutine (or assertion | |
2535 | failures on debugging builds). | |
2536 | ||
2537 | =item * | |
2538 | ||
2539 | C<eval '__PACKAGE__'> now returns the right answer on threaded builds if | |
2540 | the current package has been assigned over (as in | |
2541 | C<*ThisPackage:: = *ThatPackage::>) [perl #78742]. | |
2542 | ||
2543 | =item * | |
2544 | ||
2545 | If a package is deleted by code that it calls, it is possible for C<caller> | |
2546 | to see a stack frame belonging to that deleted package. C<caller> could | |
2547 | crash if the stash's memory address was reused for a scalar and a | |
2548 | substitution was performed on the same scalar [perl #113486]. | |
2549 | ||
2550 | =item * | |
2551 | ||
2552 | C<UNIVERSAL::can> no longer treats its first argument differently | |
2553 | depending on whether it is a string or number internally. | |
2554 | ||
2555 | =item * | |
2556 | ||
2557 | C<open> with C<< <& >> for the mode checks to see whether the third argument is | |
2558 | a number, in determining whether to treat it as a file descriptor or a handle | |
2559 | name. Magical variables like C<$1> were always failing the numeric check and | |
2560 | being treated as handle names. | |
2561 | ||
2562 | =item * | |
2563 | ||
2564 | C<warn>'s handling of magical variables (C<$1>, ties) has undergone several | |
2565 | fixes. C<FETCH> is only called once now on a tied argument or a tied C<$@> | |
2566 | [perl #97480]. Tied variables returning objects that stringify as "" are | |
2567 | no longer ignored. A tied C<$@> that happened to return a reference the | |
2568 | I<previous> time it was used is no longer ignored. | |
2569 | ||
2570 | =item * | |
2571 | ||
2572 | C<warn ""> now treats C<$@> with a number in it the same way, regardless of | |
2573 | whether it happened via C<$@=3> or C<$@="3">. It used to ignore the | |
2574 | former. Now it appends "\t...caught", as it has always done with | |
2575 | C<$@="3">. | |
2576 | ||
2577 | =item * | |
2578 | ||
2579 | Numeric operators on magical variables (e.g., S<C<$1 + 1>>) used to use | |
2580 | floating point operations even where integer operations were more appropriate, | |
2581 | resulting in loss of accuracy on 64-bit platforms [perl #109542]. | |
2582 | ||
2583 | =item * | |
2584 | ||
2585 | Unary negation no longer treats a string as a number if the string happened | |
2586 | to be used as a number at some point. So, if C<$x> contains the string "dogs", | |
2587 | C<-$x> returns "-dogs" even if C<$y=0+$x> has happened at some point. | |
2588 | ||
2589 | =item * | |
2590 | ||
2591 | In Perl 5.14, C<-'-10'> was fixed to return "10", not "+10". But magical | |
2592 | variables (C<$1>, ties) were not fixed till now [perl #57706]. | |
2593 | ||
2594 | =item * | |
2595 | ||
2596 | Unary negation now treats strings consistently, regardless of the internal | |
2597 | C<UTF8> flag. | |
2598 | ||
2599 | =item * | |
2600 | ||
2601 | A regression introduced in Perl v5.16.0 involving | |
2602 | C<tr/I<SEARCHLIST>/I<REPLACEMENTLIST>/> has been fixed. Only the first | |
2603 | instance is supposed to be meaningful if a character appears more than | |
2604 | once in C<I<SEARCHLIST>>. Under some circumstances, the final instance | |
2605 | was overriding all earlier ones. [perl #113584] | |
2606 | ||
2607 | =item * | |
2608 | ||
2609 | Regular expressions like C<qr/\87/> previously silently inserted a NUL | |
2610 | character, thus matching as if it had been written C<qr/\00087/>. Now it | |
2611 | matches as if it had been written as C<qr/87/>, with a message that the | |
2612 | sequence C<"\8"> is unrecognized. | |
2613 | ||
2614 | =item * | |
2615 | ||
2616 | C<__SUB__> now works in special blocks (C<BEGIN>, C<END>, etc.). | |
2617 | ||
2618 | =item * | |
2619 | ||
2620 | Thread creation on Windows could theoretically result in a crash if done | |
2621 | inside a C<BEGIN> block. It still does not work properly, but it no longer | |
2622 | crashes [perl #111610]. | |
2623 | ||
2624 | =item * | |
2625 | ||
2626 | C<\&{''}> (with the empty string) now autovivifies a stub like any other | |
2627 | sub name, and no longer produces the "Unable to create sub" error | |
2628 | [perl #94476]. | |
2629 | ||
2630 | =item * | |
2631 | ||
2632 | A regression introduced in v5.14.0 has been fixed, in which some calls | |
2633 | to the C<re> module would clobber C<$_> [perl #113750]. | |
2634 | ||
2635 | =item * | |
2636 | ||
2637 | C<do FILE> now always either sets or clears C<$@>, even when the file can't be | |
2638 | read. This ensures that testing C<$@> first (as recommended by the | |
2639 | documentation) always returns the correct result. | |
2640 | ||
2641 | =item * | |
2642 | ||
2643 | The array iterator used for the C<each @array> construct is now correctly | |
2644 | reset when C<@array> is cleared (RT #75596). This happens for example when the | |
2645 | array is globally assigned to, as in C<@array = (...)>, but not when its | |
2646 | B<values> are assigned to. In terms of the XS API, it means that C<av_clear()> | |
2647 | will now reset the iterator. | |
2648 | ||
2649 | This mirrors the behaviour of the hash iterator when the hash is cleared. | |
2650 | ||
2651 | =item * | |
2652 | ||
2653 | C<< $class->can >>, C<< $class->isa >>, and C<< $class->DOES >> now return | |
2654 | correct results, regardless of whether that package referred to by C<$class> | |
2655 | exists [perl #47113]. | |
2656 | ||
2657 | =item * | |
2658 | ||
2659 | Arriving signals no longer clear C<$@> [perl #45173]. | |
2660 | ||
2661 | =item * | |
2662 | ||
2663 | Allow C<my ()> declarations with an empty variable list [perl #113554]. | |
2664 | ||
2665 | =item * | |
2666 | ||
2667 | During parsing, subs declared after errors no longer leave stubs | |
2668 | [perl #113712]. | |
2669 | ||
2670 | =item * | |
2671 | ||
2672 | Closures containing no string evals no longer hang on to their containing | |
2673 | subroutines, allowing variables closed over by outer subroutines to be | |
2674 | freed when the outer sub is freed, even if the inner sub still exists | |
2675 | [perl #89544]. | |
2676 | ||
2677 | =item * | |
2678 | ||
2679 | Duplication of in-memory filehandles by opening with a "<&=" or ">&=" mode | |
2680 | stopped working properly in 5.16.0. It was causing the new handle to | |
2681 | reference a different scalar variable. This has been fixed [perl #113764]. | |
2682 | ||
2683 | =item * | |
2684 | ||
2685 | C<qr//> expressions no longer crash with custom regular expression engines | |
2686 | that do not set C<offs> at regular expression compilation time | |
2687 | [perl #112962]. | |
2688 | ||
2689 | =item * | |
2690 | ||
2691 | C<delete local> no longer crashes with certain magical arrays and hashes | |
2692 | [perl #112966]. | |
2693 | ||
2694 | =item * | |
2695 | ||
2696 | C<local> on elements of certain magical arrays and hashes used not to | |
2697 | arrange to have the element deleted on scope exit, even if the element did | |
2698 | not exist before C<local>. | |
2699 | ||
2700 | =item * | |
2701 | ||
2702 | C<scalar(write)> no longer returns multiple items [perl #73690]. | |
2703 | ||
2704 | =item * | |
2705 | ||
2706 | String to floating point conversions no longer misparse certain strings under | |
2707 | C<use locale> [perl #109318]. | |
2708 | ||
2709 | =item * | |
2710 | ||
2711 | C<@INC> filters that die no longer leak memory [perl #92252]. | |
2712 | ||
2713 | =item * | |
2714 | ||
2715 | The implementations of overloaded operations are now called in the correct | |
2716 | context. This allows, among other things, being able to properly override | |
2717 | C<< <> >> [perl #47119]. | |
2718 | ||
2719 | =item * | |
2720 | ||
2721 | Specifying only the C<fallback> key when calling C<use overload> now behaves | |
2722 | properly [perl #113010]. | |
2723 | ||
2724 | =item * | |
2725 | ||
2726 | C<< sub foo { my $a = 0; while ($a) { ... } } >> and | |
2727 | C<< sub foo { while (0) { ... } } >> now return the same thing [perl #73618]. | |
2728 | ||
2729 | =item * | |
2730 | ||
2731 | String negation now behaves the same under C<use integer;> as it does | |
2732 | without [perl #113012]. | |
2733 | ||
2734 | =item * | |
2735 | ||
2736 | C<chr> now returns the Unicode replacement character (U+FFFD) for -1, | |
2737 | regardless of the internal representation. -1 used to wrap if the argument | |
2738 | was tied or a string internally. | |
2739 | ||
2740 | =item * | |
2741 | ||
2742 | Using a C<format> after its enclosing sub was freed could crash as of | |
2743 | perl 5.12.0, if the format referenced lexical variables from the outer sub. | |
2744 | ||
2745 | =item * | |
2746 | ||
2747 | Using a C<format> after its enclosing sub was undefined could crash as of | |
2748 | perl 5.10.0, if the format referenced lexical variables from the outer sub. | |
2749 | ||
2750 | =item * | |
2751 | ||
2752 | Using a C<format> defined inside a closure, which format references | |
2753 | lexical variables from outside, never really worked unless the C<write> | |
2754 | call was directly inside the closure. In 5.10.0 it even started crashing. | |
2755 | Now the copy of that closure nearest the top of the call stack is used to | |
2756 | find those variables. | |
2757 | ||
2758 | =item * | |
2759 | ||
2760 | Formats that close over variables in special blocks no longer crash if a | |
2761 | stub exists with the same name as the special block before the special | |
2762 | block is compiled. | |
2763 | ||
2764 | =item * | |
2765 | ||
2766 | The parser no longer gets confused, treating C<eval foo ()> as a syntax | |
2767 | error if preceded by C<print;> [perl #16249]. | |
2768 | ||
2769 | =item * | |
2770 | ||
2771 | The return value of C<syscall> is no longer truncated on 64-bit platforms | |
2772 | [perl #113980]. | |
2773 | ||
2774 | =item * | |
2775 | ||
2776 | Constant folding no longer causes C<print 1 ? FOO : BAR> to print to the | |
2777 | FOO handle [perl #78064]. | |
2778 | ||
2779 | =item * | |
2780 | ||
2781 | C<do subname> now calls the named subroutine and uses the file name it | |
2782 | returns, instead of opening a file named "subname". | |
2783 | ||
2784 | =item * | |
2785 | ||
2786 | Subroutines looked up by rv2cv check hooks (registered by XS modules) are | |
2787 | now taken into consideration when determining whether C<foo bar> should be | |
2788 | the sub call C<foo(bar)> or the method call C<< "bar"->foo >>. | |
2789 | ||
2790 | =item * | |
2791 | ||
2792 | C<CORE::foo::bar> is no longer treated specially, allowing global overrides | |
2793 | to be called directly via C<CORE::GLOBAL::uc(...)> [perl #113016]. | |
2794 | ||
2795 | =item * | |
2796 | ||
2797 | Calling an undefined sub whose typeglob has been undefined now produces the | |
2798 | customary "Undefined subroutine called" error, instead of "Not a CODE | |
2799 | reference". | |
2800 | ||
2801 | =item * | |
2802 | ||
2803 | Two bugs involving @ISA have been fixed. C<*ISA = *glob_without_array> and | |
2804 | C<undef *ISA; @{*ISA}> would prevent future modifications to @ISA from | |
2805 | updating the internal caches used to look up methods. The | |
2806 | *glob_without_array case was a regression from Perl 5.12. | |
2807 | ||
2808 | =item * | |
2809 | ||
2810 | Regular expression optimisations sometimes caused C<$> with C</m> to | |
2811 | produce failed or incorrect matches [perl #114068]. | |
2812 | ||
2813 | =item * | |
2814 | ||
2815 | C<__SUB__> now works in a C<sort> block when the enclosing subroutine is | |
2816 | predeclared with C<sub foo;> syntax [perl #113710]. | |
2817 | ||
2818 | =item * | |
2819 | ||
2820 | Unicode properties only apply to Unicode code points, which leads to | |
2821 | some subtleties when regular expressions are matched against | |
2822 | above-Unicode code points. There is a warning generated to draw your | |
2823 | attention to this. However, this warning was being generated | |
2824 | inappropriately in some cases, such as when a program was being parsed. | |
2825 | Non-Unicode matches such as C<\w> and C<[:word;]> should not generate the | |
2826 | warning, as their definitions don't limit them to apply to only Unicode | |
2827 | code points. Now the message is only generated when matching against | |
2828 | C<\p{}> and C<\P{}>. There remains a bug, [perl #114148], for the very | |
2829 | few properties in Unicode that match just a single code point. The | |
2830 | warning is not generated if they are matched against an above-Unicode | |
2831 | code point. | |
2832 | ||
2833 | =item * | |
2834 | ||
2835 | Uninitialized warnings mentioning hash elements would only mention the | |
2836 | element name if it was not in the first bucket of the hash, due to an | |
2837 | off-by-one error. | |
2838 | ||
2839 | =item * | |
2840 | ||
2841 | A regular expression optimizer bug could cause multiline "^" to behave | |
2842 | incorrectly in the presence of line breaks, such that | |
2843 | C<"/\n\n" =~ m#\A(?:^/$)#im> would not match [perl #115242]. | |
2844 | ||
2845 | =item * | |
2846 | ||
2847 | Failed C<fork> in list context no longer corrupts the stack. | |
2848 | C<@a = (1, 2, fork, 3)> used to gobble up the 2 and assign C<(1, undef, 3)> | |
2849 | if the C<fork> call failed. | |
2850 | ||
2851 | =item * | |
2852 | ||
2853 | Numerous memory leaks have been fixed, mostly involving tied variables that | |
2854 | die, regular expression character classes and code blocks, and syntax | |
2855 | errors. | |
2856 | ||
2857 | =item * | |
2858 | ||
2859 | Assigning a regular expression (C<${qr//}>) to a variable that happens to | |
2860 | hold a floating point number no longer causes assertion failures on | |
2861 | debugging builds. | |
2862 | ||
2863 | =item * | |
2864 | ||
2865 | Assigning a regular expression to a scalar containing a number no longer | |
f105b7be | 2866 | causes subsequent numification to produce random numbers. |
5ed58cbd RS |
2867 | |
2868 | =item * | |
2869 | ||
2870 | Assigning a regular expression to a magic variable no longer wipes away the | |
2ae351f8 | 2871 | magic. This was a regression from 5.10. |
5ed58cbd RS |
2872 | |
2873 | =item * | |
2874 | ||
2875 | Assigning a regular expression to a blessed scalar no longer results in | |
2ae351f8 | 2876 | crashes. This was also a regression from 5.10. |
5ed58cbd RS |
2877 | |
2878 | =item * | |
2879 | ||
2880 | Regular expression can now be assigned to tied hash and array elements with | |
2881 | flattening into strings. | |
2882 | ||
2883 | =item * | |
2884 | ||
f105b7be | 2885 | Numifying a regular expression no longer results in an uninitialized |
5ed58cbd RS |
2886 | warning. |
2887 | ||
2888 | =item * | |
2889 | ||
2890 | Negative array indices no longer cause EXISTS methods of tied variables to | |
2ae351f8 | 2891 | be ignored. This was a regression from 5.12. |
5ed58cbd RS |
2892 | |
2893 | =item * | |
2894 | ||
2895 | Negative array indices no longer result in crashes on arrays tied to | |
2896 | non-objects. | |
2897 | ||
2898 | =item * | |
2899 | ||
2900 | C<$byte_overload .= $utf8> no longer results in doubly-encoded UTF8 if the | |
2901 | left-hand scalar happened to have produced a UTF8 string the last time | |
2902 | overloading was invoked. | |
2903 | ||
2904 | =item * | |
2905 | ||
2906 | C<goto &sub> now uses the current value of @_, instead of using the array | |
2907 | the subroutine was originally called with. This means | |
2908 | C<local @_ = (...); goto &sub> now works [perl #43077]. | |
2909 | ||
2910 | =item * | |
2911 | ||
2912 | If a debugger is invoked recursively, it no longer stomps on its own | |
2913 | lexical variables. Formerly under recursion all calls would share the same | |
2914 | set of lexical variables [perl #115742]. | |
2915 | ||
2916 | =item * | |
2917 | ||
2918 | C<*_{ARRAY}> returned from a subroutine no longer spontaneously | |
2919 | becomes empty. | |
2920 | ||
2921 | =back | |
2922 | ||
2923 | =head1 Known Problems | |
2924 | ||
2925 | =over 4 | |
2926 | ||
2927 | =item * | |
2928 | ||
2929 | XXX: the imperfect behavior of the ** deprecation | |
2930 | ||
2931 | =back | |
2932 | ||
2933 | =head1 Acknowledgements | |
a75569c0 | 2934 | |
5ed58cbd | 2935 | XXX Generate this with: |
a75569c0 | 2936 | |
5ed58cbd | 2937 | perl Porting/acknowledgements.pl v5.18.0..HEAD |
f5b73711 | 2938 | |
44691e6f AB |
2939 | =head1 Reporting Bugs |
2940 | ||
e08634c5 SH |
2941 | If you find what you think is a bug, you might check the articles recently |
2942 | posted to the comp.lang.perl.misc newsgroup and the perl bug database at | |
2943 | http://rt.perl.org/perlbug/ . There may also be information at | |
2944 | http://www.perl.org/ , the Perl Home Page. | |
44691e6f | 2945 | |
e08634c5 SH |
2946 | If you believe you have an unreported bug, please run the L<perlbug> program |
2947 | included with your release. Be sure to trim your bug down to a tiny but | |
2948 | sufficient test case. Your bug report, along with the output of C<perl -V>, | |
2949 | will be sent off to perlbug@perl.org to be analysed by the Perl porting team. | |
44691e6f AB |
2950 | |
2951 | If the bug you are reporting has security implications, which make it | |
e08634c5 SH |
2952 | inappropriate to send to a publicly archived mailing list, then please send it |
2953 | to perl5-security-report@perl.org. This points to a closed subscription | |
2954 | unarchived mailing list, which includes all the core committers, who will be | |
2955 | able to help assess the impact of issues, figure out a resolution, and help | |
f9001595 | 2956 | co-ordinate the release of patches to mitigate or fix the problem across all |
e08634c5 SH |
2957 | platforms on which Perl is supported. Please only use this address for |
2958 | security issues in the Perl core, not for modules independently distributed on | |
2959 | CPAN. | |
44691e6f AB |
2960 | |
2961 | =head1 SEE ALSO | |
2962 | ||
e08634c5 SH |
2963 | The F<Changes> file for an explanation of how to view exhaustive details on |
2964 | what changed. | |
44691e6f AB |
2965 | |
2966 | The F<INSTALL> file for how to build Perl. | |
2967 | ||
2968 | The F<README> file for general stuff. | |
2969 | ||
2970 | The F<Artistic> and F<Copying> files for copyright information. | |
2971 | ||
2972 | =cut |