Commit | Line | Data |
---|---|---|
44691e6f AB |
1 | =encoding utf8 |
2 | ||
3 | =head1 NAME | |
4 | ||
5ed58cbd | 5 | perldelta - what is new for perl v5.18.0 |
e128ab2c | 6 | |
4eabcf70 | 7 | =head1 DESCRIPTION |
6db9054f | 8 | |
e612b5a0 | 9 | This document describes differences between the v5.16.0 release and the v5.18.0 |
e08634c5 | 10 | release. |
6db9054f | 11 | |
e612b5a0 RS |
12 | If you are upgrading from an earlier release such as v5.14.0, first read |
13 | L<perl5160delta>, which describes differences between v5.14.0 and v5.16.0. | |
3f01b192 | 14 | |
5ed58cbd | 15 | =head1 Core Enhancements |
3f01b192 | 16 | |
5ed58cbd | 17 | =head2 New mechanism for experimental features |
82d98f72 | 18 | |
5ed58cbd | 19 | Newly-added experimental features will now require this incantation: |
82d98f72 | 20 | |
5ed58cbd RS |
21 | no warnings "experimental::feature_name"; |
22 | use feature "feature_name"; # would warn without the prev line | |
23 | ||
24 | There is a new warnings category, called "experimental", containing | |
25 | warnings that the L<feature> pragma emits when enabling experimental | |
26 | features. | |
27 | ||
28 | Newly-added experimental features will also be given special warning IDs, | |
29 | which consist of "experimental::" followed by the name of the feature. (The | |
30 | plan is to extend this mechanism eventually to all warnings, to allow them | |
31 | to be enabled or disabled individually, and not just by category.) | |
32 | ||
33 | By saying | |
34 | ||
35 | no warnings "experimental::feature_name"; | |
36 | ||
37 | you are taking responsibility for any breakage that future changes to, or | |
38 | removal of, the feature may cause. | |
39 | ||
40 | Existing experimental features may begin emitting these warnings, too. Please | |
41 | consult L<perlexperiment> for information on which features are considered | |
42 | experimental. | |
43 | ||
44 | =head2 Hash overhaul | |
45 | ||
e612b5a0 | 46 | Changes to the implementation of hashes in perl v5.18.0 will be one of the most |
484d8ba1 DG |
47 | visible changes to the behavior of existing code. |
48 | ||
8e74b2ed | 49 | By default, two distinct hash variables with identical keys and values may now |
484d8ba1 DG |
50 | provide their contents in a different order where it was previously identical. |
51 | ||
52 | When encountering these changes, the key to cleaning up from them is to accept | |
53 | that B<hashes are unordered collections> and to act accordingly. | |
5ed58cbd RS |
54 | |
55 | =head3 Hash randomization | |
56 | ||
57 | The seed used by Perl's hash function is now random. This means that the | |
58 | order which keys/values will be returned from functions like C<keys()>, | |
59 | C<values()>, and C<each()> will differ from run to run. | |
60 | ||
61 | This change was introduced to make Perl's hashes more robust to algorithmic | |
62 | complexity attacks, and also because we discovered that it exposes hash | |
63 | ordering dependency bugs and makes them easier to track down. | |
64 | ||
65 | Toolchain maintainers might want to invest in additional infrastructure to | |
66 | test for things like this. Running tests several times in a row and then | |
67 | comparing results will make it easier to spot hash order dependencies in | |
68 | code. Authors are strongly encouraged not to expose the key order of | |
69 | Perl's hashes to insecure audiences. | |
70 | ||
71 | Further, every hash has its own iteration order, which should make it much | |
72 | more difficult to determine what the current hash seed is. | |
73 | ||
0f2be12f RS |
74 | =head3 New hash functions |
75 | ||
e612b5a0 | 76 | Perl v5.18 includes support for multiple hash functions, and changed |
0f2be12f RS |
77 | the default (to ONE_AT_A_TIME_HARD), you can choose a different |
78 | algorithm by defining a symbol at compile time. For a current list, | |
e612b5a0 | 79 | consult the F<INSTALL> document. Note that as of Perl v5.18 we can |
0f2be12f RS |
80 | only recommend use of the default or SIPHASH. All the others are |
81 | known to have security issues and are for research purposes only. | |
5ed58cbd | 82 | |
f105b7be | 83 | =head3 PERL_HASH_SEED environment variable now takes a hex value |
5ed58cbd | 84 | |
484d8ba1 DG |
85 | C<PERL_HASH_SEED> no longer accepts an integer as a parameter; |
86 | instead the value is expected to be a binary value encoded in a hex | |
87 | string, such as "0xf5867c55039dc724". This is to make the | |
88 | infrastructure support hash seeds of arbitrary lengths, which might | |
5ed58cbd RS |
89 | exceed that of an integer. (SipHash uses a 16 byte seed). |
90 | ||
c40c48bb RS |
91 | =head3 PERL_PERTURB_KEYS environment variable added |
92 | ||
f105b7be | 93 | The C<PERL_PERTURB_KEYS> environment variable allows one to control the level of |
c40c48bb RS |
94 | randomization applied to C<keys> and friends. |
95 | ||
f105b7be | 96 | When C<PERL_PERTURB_KEYS> is 0, perl will not randomize the key order at all. The |
c40c48bb RS |
97 | chance that C<keys> changes due to an insert will be the same as in previous |
98 | perls, basically only when the bucket size is changed. | |
99 | ||
f105b7be | 100 | When C<PERL_PERTURB_KEYS> is 1, perl will randomize keys in a non-repeatable |
c40c48bb RS |
101 | way. The chance that C<keys> changes due to an insert will be very high. This |
102 | is the most secure and default mode. | |
103 | ||
f105b7be KE |
104 | When C<PERL_PERTURB_KEYS> is 2, perl will randomize keys in a repeatable way. |
105 | Repeated runs of the same program should produce the same output every time. | |
c40c48bb | 106 | |
f105b7be KE |
107 | C<PERL_HASH_SEED> implies a non-default C<PERL_PERTURB_KEYS> setting. Setting |
108 | C<PERL_HASH_SEED=0> (exactly one 0) implies C<PERL_PERTURB_KEYS=0> (hash key | |
109 | randomization disabled); settng C<PERL_HASH_SEED> to any other value implies | |
110 | C<PERL_PERTURB_KEYS=2> (deterministic and repeatable hash key randomization). | |
111 | Specifying C<PERL_PERTURB_KEYS> explicitly to a different level overrides this | |
c40c48bb RS |
112 | behavior. |
113 | ||
5ed58cbd RS |
114 | =head3 Hash::Util::hash_seed() now returns a string |
115 | ||
116 | Hash::Util::hash_seed() now returns a string instead of an integer. This | |
117 | is to make the infrastructure support hash seeds of arbitrary lengths | |
118 | which might exceed that of an integer. (SipHash uses a 16 byte seed). | |
119 | ||
120 | =head3 Output of PERL_HASH_SEED_DEBUG has been changed | |
121 | ||
122 | The environment variable PERL_HASH_SEED_DEBUG now makes perl show both the | |
f105b7be | 123 | hash function perl was built with, I<and> the seed, in hex, in use for that |
5ed58cbd RS |
124 | process. Code parsing this output, should it exist, must change to accommodate |
125 | the new format. Example of the new format: | |
126 | ||
127 | $ PERL_HASH_SEED_DEBUG=1 ./perl -e1 | |
128 | HASH_FUNCTION = MURMUR3 HASH_SEED = 0x1476bb9f | |
129 | ||
130 | =head2 Upgrade to Unicode 6.2 | |
131 | ||
2e7bc647 | 132 | Perl now supports Unicode 6.2. A list of changes from Unicode |
5ed58cbd RS |
133 | 6.1 is at L<http://www.unicode.org/versions/Unicode6.2.0>. |
134 | ||
135 | =head2 Character name aliases may now include non-Latin1-range characters | |
136 | ||
137 | It is possible to define your own names for characters for use in | |
138 | C<\N{...}>, C<charnames::vianame()>, etc. These names can now be | |
139 | comprised of characters from the whole Unicode range. This allows for | |
140 | names to be in your native language, and not just English. Certain | |
141 | restrictions apply to the characters that may be used (you can't define | |
142 | a name that has punctuation in it, for example). See L<charnames/CUSTOM | |
143 | ALIASES>. | |
144 | ||
145 | =head2 New DTrace probes | |
146 | ||
147 | The following new DTrace probes have been added: | |
14731ad1 | 148 | |
337fb649 | 149 | =over 4 |
14731ad1 | 150 | |
82d98f72 | 151 | =item * |
14731ad1 | 152 | |
5ed58cbd RS |
153 | C<op-entry> |
154 | ||
155 | =item * | |
156 | ||
157 | C<loading-file> | |
158 | ||
159 | =item * | |
160 | ||
161 | C<loaded-file> | |
162 | ||
163 | =back | |
164 | ||
165 | =head2 C<${^LAST_FH}> | |
166 | ||
167 | This new variable provides access to the filehandle that was last read. | |
168 | This is the handle used by C<$.> and by C<tell> and C<eof> without | |
169 | arguments. | |
170 | ||
171 | =head2 Regular Expression Set Operations | |
172 | ||
173 | This is an B<experimental> feature to allow matching against the union, | |
174 | intersection, etc., of sets of code points, similar to | |
175 | L<Unicode::Regex::Set>. It can also be used to extend C</x> processing | |
176 | to [bracketed] character classes, and as a replacement of user-defined | |
177 | properties, allowing more complex expressions than they do. See | |
178 | L<perlrecharclass/Extended Bracketed Character Classes>. | |
179 | ||
180 | =head2 Lexical subroutines | |
181 | ||
182 | This new feature is still considered B<experimental>. To enable it: | |
183 | ||
184 | use 5.018; | |
185 | no warnings "experimental::lexical_subs"; | |
186 | use feature "lexical_subs"; | |
187 | ||
188 | You can now declare subroutines with C<state sub foo>, C<my sub foo>, and | |
189 | C<our sub foo>. (C<state sub> requires that the "state" feature be | |
190 | enabled, unless you write it as C<CORE::state sub foo>.) | |
191 | ||
192 | C<state sub> creates a subroutine visible within the lexical scope in which | |
193 | it is declared. The subroutine is shared between calls to the outer sub. | |
194 | ||
195 | C<my sub> declares a lexical subroutine that is created each time the | |
196 | enclosing block is entered. C<state sub> is generally slightly faster than | |
197 | C<my sub>. | |
198 | ||
199 | C<our sub> declares a lexical alias to the package subroutine of the same | |
200 | name. | |
201 | ||
202 | For more information, see L<perlsub/Lexical Subroutines>. | |
203 | ||
204 | =head2 Computed Labels | |
205 | ||
206 | The loop controls C<next>, C<last> and C<redo>, and the special C<dump> | |
207 | operator, now allow arbitrary expressions to be used to compute labels at run | |
208 | time. Previously, any argument that was not a constant was treated as the | |
209 | empty string. | |
210 | ||
211 | =head2 More CORE:: subs | |
212 | ||
213 | Several more built-in functions have been added as subroutines to the | |
214 | CORE:: namespace - namely, those non-overridable keywords that can be | |
215 | implemented without custom parsers: C<defined>, C<delete>, C<exists>, | |
216 | C<glob>, C<pos>, C<protoytpe>, C<scalar>, C<split>, C<study>, and C<undef>. | |
217 | ||
218 | As some of these have prototypes, C<prototype('CORE::...')> has been | |
219 | changed to not make a distinction between overridable and non-overridable | |
220 | keywords. This is to make C<prototype('CORE::pos')> consistent with | |
221 | C<prototype(&CORE::pos)>. | |
222 | ||
223 | =head2 C<kill> with negative signal names | |
224 | ||
225 | C<kill> has always allowed a negative signal number, which kills the | |
226 | process group instead of a single process. It has also allowed signal | |
227 | names. But it did not behave consistently, because negative signal names | |
228 | were treated as 0. Now negative signals names like C<-INT> are supported | |
229 | and treated the same way as -2 [perl #112990]. | |
230 | ||
231 | =head1 Security | |
232 | ||
233 | =head2 C<Storable> security warning in documentation | |
234 | ||
235 | The documentation for C<Storable> now includes a section which warns readers | |
236 | of the danger of accepting Storable documents from untrusted sources. The | |
237 | short version is that deserializing certain types of data can lead to loading | |
238 | modules and other code execution. This is documented behavior and wanted | |
239 | behavior, but this opens an attack vector for malicious entities. | |
240 | ||
241 | =head2 C<Locale::Maketext> allowed code injection via a malicious template | |
242 | ||
243 | If users could provide a translation string to Locale::Maketext, this could be | |
244 | used to invoke arbitrary Perl subroutines available in the current process. | |
245 | ||
246 | This has been fixed, but it is still possible to invoke any method provided by | |
247 | C<Locale::Maketext> itself or a subclass that you are using. One of these | |
248 | methods in turn will invoke the Perl core's C<sprintf> subroutine. | |
249 | ||
250 | In summary, allowing users to provide translation strings without auditing | |
251 | them is a bad idea. | |
252 | ||
253 | This vulnerability is documented in CVE-2012-6329. | |
254 | ||
255 | =head2 Avoid calling memset with a negative count | |
256 | ||
257 | Poorly written perl code that allows an attacker to specify the count to perl's | |
258 | C<x> string repeat operator can already cause a memory exhaustion | |
e612b5a0 | 259 | denial-of-service attack. A flaw in versions of perl before v5.15.5 can escalate |
5ed58cbd RS |
260 | that into a heap buffer overrun; coupled with versions of glibc before 2.16, it |
261 | possibly allows the execution of arbitrary code. | |
262 | ||
263 | The flaw addressed to this commit has been assigned identifier CVE-2012-5195 | |
264 | and was researched by Tim Brown. | |
265 | ||
266 | =head1 Incompatible Changes | |
267 | ||
268 | =head2 See also: hash overhaul | |
269 | ||
270 | Some of the changes in the L<hash overhaul|/"Hash overhaul"> are not fully | |
271 | compatible with previous versions of perl. Please read that section. | |
272 | ||
273 | =head2 An unknown character name in C<\N{...}> is now a syntax error | |
274 | ||
275 | Previously, it warned, and the Unicode REPLACEMENT CHARACTER was | |
276 | substituted. Unicode now recommends that this situation be a syntax | |
277 | error. Also, the previous behavior led to some confusing warnings and | |
278 | behaviors, and since the REPLACEMENT CHARACTER has no use other than as | |
279 | a stand-in for some unknown character, any code that has this problem is | |
280 | buggy. | |
281 | ||
282 | =head2 Formerly deprecated characters in C<\N{}> character name aliases are now errors. | |
283 | ||
284 | Since v5.12.0, it has been deprecated to use certain characters in | |
285 | user-defined C<\N{...}> character names. These now cause a syntax | |
286 | error. For example, it is now an error to begin a name with a digit, | |
287 | such as in | |
288 | ||
289 | my $undraftable = "\N{4F}"; # Syntax error! | |
290 | ||
291 | or to have commas anywhere in the name. See L<charnames/CUSTOM ALIASES> | |
292 | ||
293 | =head2 C<\N{BELL}> now refers to U+1F514 instead of U+0007 | |
294 | ||
295 | Unicode 6.0 reused the name "BELL" for a different code point than it | |
296 | traditionally had meant. Since Perl v5.14, use of this name still | |
297 | referred to U+0007, but would raise a deprecation warning. Now, "BELL" | |
298 | refers to U+1F514, and the name for U+0007 is "ALERT". All the | |
299 | functions in L<charnames> have been correspondingly updated. | |
300 | ||
301 | =head2 New Restrictions in Multi-Character Case-Insensitive Matching in Regular Expression Bracketed Character Classes | |
302 | ||
303 | Unicode has now withdrawn their previous recommendation for regular | |
304 | expressions to automatically handle cases where a single character can | |
305 | match multiple characters case-insensitively, for example the letter | |
306 | LATIN SMALL LETTER SHARP S and the sequence C<ss>. This is because | |
307 | it turns out to be impracticable to do this correctly in all | |
308 | circumstances. Because Perl has tried to do this as best it can, it | |
309 | will continue to do so. (We are considering an option to turn it off.) | |
310 | However, a new restriction is being added on such matches when they | |
311 | occur in [bracketed] character classes. People were specifying | |
312 | things such as C</[\0-\xff]/i>, and being surprised that it matches the | |
313 | two character sequence C<ss> (since LATIN SMALL LETTER SHARP S occurs in | |
314 | this range). This behavior is also inconsistent with using a | |
315 | property instead of a range: C<\p{Block=Latin1}> also includes LATIN | |
316 | SMALL LETTER SHARP S, but C</[\p{Block=Latin1}]/i> does not match C<ss>. | |
317 | The new rule is that for there to be a multi-character case-insensitive | |
318 | match within a bracketed character class, the character must be | |
319 | explicitly listed, and not as an end point of a range. This more | |
320 | closely obeys the Principle of Least Astonishment. See | |
321 | L<perlrecharclass/Bracketed Character Classes>. Note that a bug [perl | |
322 | #89774], now fixed as part of this change, prevented the previous | |
323 | behavior from working fully. | |
324 | ||
325 | =head2 Explicit rules for variable names and identifiers | |
326 | ||
c6c6975e RS |
327 | Due to an oversight, single character variable names in v5.16 were |
328 | completely unrestricted. This opened the door to several kinds of | |
329 | insanity. As of v5.18, these now follow the rules of other identifiers, | |
330 | in addition to accepting characters that match the C<\p{POSIX_Punct}> | |
331 | property. | |
5ed58cbd RS |
332 | |
333 | There are no longer any differences in the parsing of identifiers | |
334 | specified as C<$...> or C<${...}>; previously, they were dealt with in | |
335 | different parts of the core, and so had slightly different behavior. For | |
336 | instance, C<${foo:bar}> was a legal variable name. Since they are now | |
337 | both parsed by the same code, that is no longer the case. | |
338 | ||
339 | =head2 C<\s> in regular expressions now matches a Vertical Tab | |
340 | ||
341 | No one could recall why C<\s> didn't match C<\cK>, the vertical tab. | |
342 | Now it does. Given the extreme rarity of that character, very little | |
343 | breakage is expected. | |
344 | ||
345 | =head2 C</(?{})/> and C</(??{})/> have been heavily reworked | |
346 | ||
347 | The implementation of this feature has been almost completely rewritten. | |
348 | Although its main intent is to fix bugs, some behaviors, especially | |
349 | related to the scope of lexical variables, will have changed. This is | |
350 | described more fully in the L</Selected Bug Fixes> section. | |
351 | ||
352 | =head2 Stricter parsing of substitution replacement | |
353 | ||
354 | It is no longer possible to abuse the way the parser parses C<s///e> like | |
355 | this: | |
356 | ||
357 | %_=(_,"Just another "); | |
358 | $_="Perl hacker,\n"; | |
359 | s//_}->{_/e;print | |
360 | ||
361 | =head2 C<given> now aliases the global C<$_> | |
362 | ||
363 | Instead of assigning to an implicit lexical C<$_>, C<given> now makes the | |
364 | global C<$_> an alias for its argument, just like C<foreach>. However, it | |
365 | still uses lexical C<$_> if there is lexical C<$_> in scope (again, just like | |
366 | C<foreach>) [perl #114020]. | |
367 | ||
368 | =head2 Lexical C<$_> is now experimental | |
369 | ||
e612b5a0 | 370 | Since it was introduced in Perl v5.10, it has caused much confusion with no |
5ed58cbd RS |
371 | obvious solution: |
372 | ||
373 | =over | |
829397b9 TC |
374 | |
375 | =item * | |
376 | ||
5ed58cbd RS |
377 | Various modules (e.g., List::Util) expect callback routines to use the |
378 | global C<$_>. C<use List::Util 'first'; my $_; first { $_ == 1 } @list> | |
379 | does not work as one would expect. | |
71e6aba6 RS |
380 | |
381 | =item * | |
382 | ||
5ed58cbd RS |
383 | A C<my $_> declaration earlier in the same file can cause confusing closure |
384 | warnings. | |
71e6aba6 RS |
385 | |
386 | =item * | |
387 | ||
5ed58cbd RS |
388 | The "_" subroutine prototype character allows called subroutines to access |
389 | your lexical C<$_>, so it is not really private after all. | |
2426c394 | 390 | |
71e6aba6 RS |
391 | =item * |
392 | ||
5ed58cbd RS |
393 | Nevertheless, subroutines with a "(@)" prototype and methods cannot access |
394 | the caller's lexical C<$_>, unless they are written in XS. | |
71e6aba6 RS |
395 | |
396 | =item * | |
397 | ||
5ed58cbd RS |
398 | But even XS routines cannot access a lexical C<$_> declared, not in the |
399 | calling subroutine, but in an outer scope, iff that subroutine happened not | |
400 | to mention C<$_> or use any operators that default to C<$_>. | |
401 | ||
402 | =back | |
403 | ||
404 | It is our hope that lexical C<$_> can be rehabilitated, but this may | |
405 | cause changes in its behavior. Please use it with caution until it | |
406 | becomes stable. | |
407 | ||
408 | =head2 readline() with C<$/ = \N> now reads N characters, not N bytes | |
409 | ||
410 | Previously, when reading from a stream with I/O layers such as | |
411 | C<encoding>, the readline() function, otherwise known as the C<< <> >> | |
412 | operator, would read I<N> bytes from the top-most layer. [perl #79960] | |
413 | ||
414 | Now, I<N> characters are read instead. | |
415 | ||
416 | There is no change in behaviour when reading from streams with no | |
417 | extra layers, since bytes map exactly to characters. | |
418 | ||
419 | =head2 Overridden C<glob> is now passed one argument | |
420 | ||
421 | C<glob> overrides used to be passed a magical undocumented second argument | |
422 | that identified the caller. Nothing on CPAN was using this, and it got in | |
423 | the way of a bug fix, so it was removed. If you really need to identify | |
424 | the caller, see L<Devel::Callsite> on CPAN. | |
425 | ||
426 | =head2 Here-doc parsing | |
427 | ||
428 | The body of a here-document inside a quote-like operator now always begins | |
429 | on the line after the "<<foo" marker. Previously, it was documented to | |
430 | begin on the line following the containing quote-like operator, but that | |
431 | was only sometimes the case [perl #114040]. | |
432 | ||
433 | =head2 Alphanumeric operators must now be separated from the closing | |
434 | delimiter of regular expressions | |
435 | ||
436 | You may no longer write something like: | |
437 | ||
438 | m/a/and 1 | |
439 | ||
440 | Instead you must write | |
441 | ||
442 | m/a/ and 1 | |
443 | ||
444 | with whitespace separating the operator from the closing delimiter of | |
445 | the regular expression. Not having whitespace has resulted in a | |
446 | deprecation warning since Perl v5.14.0. | |
447 | ||
448 | =head2 qw(...) can no longer be used as parentheses | |
449 | ||
450 | C<qw> lists used to fool the parser into thinking they were always | |
451 | surrounded by parentheses. This permitted some surprising constructions | |
452 | such as C<foreach $x qw(a b c) {...}>, which should really be written | |
453 | C<foreach $x (qw(a b c)) {...}>. These would sometimes get the lexer into | |
454 | the wrong state, so they didn't fully work, and the similar C<foreach qw(a | |
455 | b c) {...}> that one might expect to be permitted never worked at all. | |
456 | ||
457 | This side effect of C<qw> has now been abolished. It has been deprecated | |
e612b5a0 | 458 | since Perl v5.13.11. It is now necessary to use real parentheses |
5ed58cbd RS |
459 | everywhere that the grammar calls for them. |
460 | ||
461 | =head2 Interaction of lexical and default warnings | |
462 | ||
463 | Turning on any lexical warnings used first to disable all default warnings | |
464 | if lexical warnings were not already enabled: | |
465 | ||
466 | $*; # deprecation warning | |
467 | use warnings "void"; | |
468 | $#; # void warning; no deprecation warning | |
469 | ||
f105b7be | 470 | Now, the C<debugging>, C<deprecated>, C<glob>, C<inplace> and C<malloc> warnings |
5ed58cbd RS |
471 | categories are left on when turning on lexical warnings (unless they are |
472 | turned off by C<no warnings>, of course). | |
473 | ||
474 | This may cause deprecation warnings to occur in code that used to be free | |
475 | of warnings. | |
476 | ||
477 | Those are the only categories consisting only of default warnings. Default | |
f105b7be KE |
478 | warnings in other categories are still disabled by C<< use warnings "category" >>, |
479 | as we do not yet have the infrastructure for controlling | |
5ed58cbd RS |
480 | individual warnings. |
481 | ||
482 | =head2 C<state sub> and C<our sub> | |
483 | ||
484 | Due to an accident of history, C<state sub> and C<our sub> were equivalent | |
485 | to a plain C<sub>, so one could even create an anonymous sub with | |
486 | C<our sub { ... }>. These are now disallowed outside of the "lexical_subs" | |
487 | feature. Under the "lexical_subs" feature they have new meanings described | |
488 | in L<perlsub/Lexical Subroutines>. | |
489 | ||
490 | =head2 Defined values stored in environment are forced to byte strings | |
491 | ||
492 | A value stored in an environment variable has always been stringified. In this | |
493 | release, it is converted to be only a byte string. First, it is forced to be a | |
494 | only a string. Then if the string is utf8 and the equivalent of | |
495 | C<utf8::downgrade()> works, that result is used; otherwise, the equivalent of | |
496 | C<utf8::encode()> is used, and a warning is issued about wide characters | |
497 | (L</Diagnostics>). | |
498 | ||
499 | =head2 C<require> dies for unreadable files | |
500 | ||
501 | When C<require> encounters an unreadable file, it now dies. It used to | |
502 | ignore the file and continue searching the directories in C<@INC> | |
503 | [perl #113422]. | |
504 | ||
505 | =head2 C<gv_fetchmeth_*> and SUPER | |
506 | ||
507 | The various C<gv_fetchmeth_*> XS functions used to treat a package whose | |
f105b7be KE |
508 | named ended with C<::SUPER> specially. A method lookup on the C<Foo::SUPER> |
509 | package would be treated as a C<SUPER> method lookup on the C<Foo> package. This | |
510 | is no longer the case. To do a C<SUPER> lookup, pass the C<Foo> stash and the | |
511 | C<GV_SUPER> flag. | |
5ed58cbd RS |
512 | |
513 | =head2 C<split>'s first argument is more consistently interpreted | |
514 | ||
e612b5a0 | 515 | After some changes earlier in v5.17, C<split>'s behavior has been |
5ed58cbd RS |
516 | simplified: if the PATTERN argument evaluates to a literal string |
517 | containing one space, it is treated the way that a I<literal> string | |
518 | containing one space once was. | |
519 | ||
520 | =head1 Deprecations | |
521 | ||
522 | =head2 Deprecated modules | |
523 | ||
524 | The following modules will be removed from the core distribution in a | |
525 | future release, and should be installed from CPAN instead. Distributions | |
526 | on CPAN which require these should add them to their prerequisites. | |
f105b7be | 527 | The core versions of these modules will issue C<"deprecated">-category |
5ed58cbd RS |
528 | warnings. |
529 | ||
530 | You can silence these deprecation warnings by installing the modules | |
531 | in question from CPAN. | |
532 | ||
533 | =over | |
534 | ||
535 | =item L<Archive::Extract> | |
536 | ||
537 | =item L<B::Lint> | |
538 | ||
539 | =item L<B::Lint::Debug> | |
540 | ||
541 | =item L<CPANPLUS> and all included C<CPANPLUS::*> modules | |
542 | ||
543 | =item L<Devel::InnerPackage> | |
544 | ||
545 | =item L<encoding> | |
546 | ||
547 | =item L<Log::Message> | |
548 | ||
549 | =item L<Log::Message::Config> | |
550 | ||
551 | =item L<Log::Message::Handlers> | |
552 | ||
553 | =item L<Log::Message::Item> | |
554 | ||
555 | =item L<Log::Message::Simple> | |
556 | ||
557 | =item L<Module::Pluggable> | |
558 | ||
559 | =item L<Module::Pluggable::Object> | |
560 | ||
561 | =item L<Object::Accessor> | |
562 | ||
563 | =item L<Pod::LaTeX> | |
564 | ||
565 | =item L<Term::UI> | |
566 | ||
567 | =item L<Term::UI::History> | |
568 | ||
569 | =back | |
570 | ||
571 | =head2 Deprecated Utilities | |
572 | ||
573 | The following utilities will be removed from the core distribution in a | |
574 | future release as their associated modules have been deprecated. They | |
575 | will remain available with the applicable CPAN distribution. | |
576 | ||
577 | =over | |
578 | ||
579 | =item L<cpanp> | |
580 | ||
581 | =item C<cpanp-run-perl> | |
582 | ||
583 | =item L<cpan2dist> | |
584 | ||
585 | These items are part of the C<CPANPLUS> distribution. | |
586 | ||
587 | =item L<pod2latex> | |
588 | ||
589 | This item is part of the C<Pod::LaTeX> distribution. | |
590 | ||
591 | =back | |
592 | ||
593 | =head2 PL_sv_objcount | |
594 | ||
595 | This interpreter-global variable used to track the total number of | |
596 | Perl objects in the interpreter. It is no longer maintained and will | |
e612b5a0 | 597 | be removed altogether in Perl v5.20. |
5ed58cbd RS |
598 | |
599 | =head2 Five additional characters should be escaped in patterns with C</x> | |
600 | ||
601 | When a regular expression pattern is compiled with C</x>, Perl treats 6 | |
602 | characters as white space to ignore, such as SPACE and TAB. However, | |
603 | Unicode recommends 11 characters be treated thusly. We will conform | |
604 | with this in a future Perl version. In the meantime, use of any of the | |
605 | missing characters will raise a deprecation warning, unless turned off. | |
606 | The five characters are: | |
f105b7be KE |
607 | |
608 | U+0085 NEXT LINE, | |
609 | U+200E LEFT-TO-RIGHT MARK, | |
610 | U+200F RIGHT-TO-LEFT MARK, | |
611 | U+2028 LINE SEPARATOR, | |
612 | ||
5ed58cbd | 613 | and |
f105b7be KE |
614 | |
615 | U+2029 PARAGRAPH SEPARATOR. | |
5ed58cbd RS |
616 | |
617 | =head2 User-defined charnames with surprising whitespace | |
618 | ||
619 | A user-defined character name with trailing or multiple spaces in a row is | |
620 | likely a typo. This now generates a warning when defined, on the assumption | |
621 | that uses of it will be unlikely to include the excess whitespace. | |
622 | ||
623 | =head2 Various XS-callable functions are now deprecated | |
624 | ||
625 | All the functions used to classify characters will be removed from a | |
626 | future version of Perl, and should not be used. With participating C | |
627 | compilers (e.g., gcc), compiling any file that uses any of these will | |
628 | generate a warning. These were not intended for public use; there are | |
629 | equivalent, faster, macros for most of them. | |
e612b5a0 | 630 | |
2e7bc647 | 631 | See L<perlapi/Character classes>. The complete list is: |
e612b5a0 | 632 | |
5ed58cbd RS |
633 | C<is_uni_alnum>, C<is_uni_alnumc>, C<is_uni_alnumc_lc>, |
634 | C<is_uni_alnum_lc>, C<is_uni_alpha>, C<is_uni_alpha_lc>, | |
635 | C<is_uni_ascii>, C<is_uni_ascii_lc>, C<is_uni_blank>, | |
636 | C<is_uni_blank_lc>, C<is_uni_cntrl>, C<is_uni_cntrl_lc>, | |
637 | C<is_uni_digit>, C<is_uni_digit_lc>, C<is_uni_graph>, | |
638 | C<is_uni_graph_lc>, C<is_uni_idfirst>, C<is_uni_idfirst_lc>, | |
639 | C<is_uni_lower>, C<is_uni_lower_lc>, C<is_uni_print>, | |
640 | C<is_uni_print_lc>, C<is_uni_punct>, C<is_uni_punct_lc>, | |
641 | C<is_uni_space>, C<is_uni_space_lc>, C<is_uni_upper>, | |
642 | C<is_uni_upper_lc>, C<is_uni_xdigit>, C<is_uni_xdigit_lc>, | |
643 | C<is_utf8_alnum>, C<is_utf8_alnumc>, C<is_utf8_alpha>, | |
644 | C<is_utf8_ascii>, C<is_utf8_blank>, C<is_utf8_char>, | |
645 | C<is_utf8_cntrl>, C<is_utf8_digit>, C<is_utf8_graph>, | |
646 | C<is_utf8_idcont>, C<is_utf8_idfirst>, C<is_utf8_lower>, | |
647 | C<is_utf8_mark>, C<is_utf8_perl_space>, C<is_utf8_perl_word>, | |
648 | C<is_utf8_posix_digit>, C<is_utf8_print>, C<is_utf8_punct>, | |
649 | C<is_utf8_space>, C<is_utf8_upper>, C<is_utf8_xdigit>, | |
650 | C<is_utf8_xidcont>, C<is_utf8_xidfirst>. | |
651 | ||
652 | In addition these three functions that have never worked properly are | |
653 | deprecated: | |
654 | C<to_uni_lower_lc>, C<to_uni_title_lc>, and C<to_uni_upper_lc>. | |
655 | ||
f105b7be | 656 | =head2 Certain rare uses of backslashes within regexes are now deprecated |
5ed58cbd RS |
657 | |
658 | There are three pairs of characters that Perl recognizes as | |
659 | metacharacters in regular expression patterns: C<{}>, C<[]>, and C<()>. | |
660 | These can be used as well to delimit patterns, as in: | |
661 | ||
662 | m{foo} | |
663 | s(foo)(bar) | |
664 | ||
665 | Since they are metacharacters, they have special meaning to regular | |
666 | expression patterns, and it turns out that you can't turn off that | |
667 | special meaning by the normal means of preceding them with a backslash, | |
f105b7be | 668 | if you use them, paired, within a pattern delimited by them. For |
5ed58cbd RS |
669 | example, in |
670 | ||
671 | m{foo\{1,3\}} | |
672 | ||
673 | the backslashes do not change the behavior, and this matches | |
674 | S<C<"f o">> followed by one to three more occurrences of C<"o">. | |
675 | ||
676 | Usages like this, where they are interpreted as metacharacters, are | |
677 | exceedingly rare; we think there are none, for example, in all of CPAN. | |
678 | Hence, this deprecation should affect very little code. It does give | |
679 | notice, however, that any such code needs to change, which will in turn | |
680 | allow us to change the behavior in future Perl versions so that the | |
681 | backslashes do have an effect, and without fear that we are silently | |
682 | breaking any existing code. | |
683 | ||
d5f315e8 KW |
684 | =head2 Splitting the tokens C<(?> and C<(*> in regular expressions |
685 | ||
686 | A deprecation warning is now raised if the C<(> and C<?> are separated | |
687 | by white space or comments in C<(?...)> regular expression constructs. | |
688 | Similarly, if the C<(> and C<*> are separated in C<(*VERB...)> | |
689 | constructs. | |
690 | ||
e0a1dec5 LT |
691 | =head2 Pre-PerlIO IO implementations |
692 | ||
693 | Perl supports being built without PerlIO proper, using a stdio or sfio | |
694 | wrapper instead. A perl build like this will not support IO layers and | |
695 | thus Unicode IO, making it rather handicapped. | |
696 | ||
697 | PerlIO supports a C<stdio> layer if stdio use is desired, and similarly a | |
698 | sfio layer could be produced. | |
699 | ||
5ed58cbd RS |
700 | =head1 Future Deprecations |
701 | ||
702 | =over | |
71e6aba6 RS |
703 | |
704 | =item * | |
705 | ||
4263dd11 | 706 | Platforms without support infrastructure |
5ed58cbd RS |
707 | |
708 | Both Windows CE and z/OS have been historically under-maintained, and are | |
709 | currently neither successfully building nor regularly being smoke tested. | |
710 | Efforts are underway to change this situation, but it should not be taken for | |
711 | granted that the platforms are safe and supported. If they do not become | |
712 | buildable and regularly smoked, support for them may be actively removed in | |
713 | future releases. If you have an interest in these platforms and you can lend | |
714 | your time, expertise, or hardware to help support these platforms, please let | |
715 | the perl development effort know by emailing C<perl5-porters@perl.org>. | |
716 | ||
717 | Some platforms that appear otherwise entirely dead are also on the short list | |
e612b5a0 | 718 | for removal between now and v5.20.0: |
5ed58cbd RS |
719 | |
720 | =over | |
721 | ||
722 | =item DG/UX | |
723 | ||
724 | =item NeXT | |
725 | ||
726 | =back | |
1993add8 | 727 | |
ec985017 RS |
728 | We also think it likely that current versions of Perl will no longer |
729 | build AmigaOS, DJGPP, NetWare (natively), OS/2 and Plan 9. If you | |
730 | are using Perl on such a platform and have an interest in ensuring | |
731 | Perl's future on them, please contact us. | |
732 | ||
733 | We believe that Perl has long been unable to build on mixed endian | |
734 | architectures (such as PDP-11s), and intend to remove any remain | |
735 | support code. Similarly, code supporting the long umaintained GNU | |
736 | dld will be removed soon if no-one makes themselves known as an | |
737 | active user. | |
738 | ||
1993add8 RS |
739 | =item * |
740 | ||
5ed58cbd RS |
741 | Swapping of $< and $> |
742 | ||
743 | For more information about this future deprecation, see L<the relevant RT | |
744 | ticket|https://rt.perl.org/rt3/Ticket/Display.html?id=96212>. | |
71e6aba6 RS |
745 | |
746 | =item * | |
747 | ||
5ed58cbd | 748 | C<microperl>, long broken and of unclear present purpose, will be removed. |
71e6aba6 RS |
749 | |
750 | =item * | |
751 | ||
5ed58cbd RS |
752 | Revamping C<< "\Q" >> semantics in double-quotish strings when combined with |
753 | other escapes. | |
754 | ||
755 | There are several bugs and inconsistencies involving combinations | |
756 | of C<\Q> and escapes like C<\x>, C<\L>, etc., within a C<\Q...\E> pair. | |
757 | These need to be fixed, and doing so will necessarily change current | |
758 | behavior. The changes have not yet been settled. | |
71e6aba6 | 759 | |
d5f315e8 KW |
760 | =item * |
761 | ||
762 | Use of C<$^>, where C<^> stands for any actual (non-printing) C0 control | |
763 | character will be disallowed in a future Perl version. Use C<${^}> | |
764 | instead (where again C<^> stands for a control character), | |
765 | or better, C<$^A> , where C<^> this time is a caret (CIRCUMFLEX ACCENT), | |
766 | and C<A> stands for any of the characters listed at the end of | |
767 | L<perlebcdic/OPERATOR DIFFERENCES>. | |
768 | ||
337fb649 | 769 | =back |
2426c394 | 770 | |
5ed58cbd RS |
771 | =head1 Performance Enhancements |
772 | ||
773 | =over 4 | |
2426c394 | 774 | |
5ed58cbd | 775 | =item * |
2426c394 | 776 | |
5ed58cbd RS |
777 | Lists of lexical variable declarations (C<my($x, $y)>) are now optimised |
778 | down to a single op and are hence faster than before. | |
2426c394 | 779 | |
5ed58cbd | 780 | =item * |
2426c394 | 781 | |
5ed58cbd RS |
782 | A new C preprocessor define C<NO_TAINT_SUPPORT> was added that, if set, |
783 | disables Perl's taint support altogether. Using the -T or -t command | |
784 | line flags will cause a fatal error. Beware that both core tests as | |
785 | well as many a CPAN distribution's tests will fail with this change. On | |
786 | the upside, it provides a small performance benefit due to reduced | |
787 | branching. | |
2426c394 | 788 | |
5ed58cbd RS |
789 | B<Do not enable this unless you know exactly what you are getting yourself |
790 | into.> | |
791 | ||
792 | =item * | |
793 | ||
794 | C<pack> with constant arguments is now constant folded in most cases | |
795 | [perl #113470]. | |
796 | ||
797 | =item * | |
798 | ||
799 | Speed up in regular expression matching against Unicode properties. The | |
800 | largest gain is for C<\X>, the Unicode "extended grapheme cluster." The | |
801 | gain for it is about 35% - 40%. Bracketed character classes, e.g., | |
802 | C<[0-9\x{100}]> containing code points above 255 are also now faster. | |
803 | ||
804 | =item * | |
805 | ||
806 | On platforms supporting it, several former macros are now implemented as static | |
807 | inline functions. This should speed things up slightly on non-GCC platforms. | |
808 | ||
809 | =item * | |
810 | ||
66f62cf6 RS |
811 | The optimisation of hashes in boolean context has been extended to |
812 | affect C<scalar(%hash)>, C<%hash ? ... : ...>, and C<sub { %hash || ... }>. | |
5ed58cbd RS |
813 | |
814 | =item * | |
815 | ||
f105b7be | 816 | Filetest operators manage the stack in a fractionally more efficient manner. |
5ed58cbd RS |
817 | |
818 | =item * | |
819 | ||
820 | Globs used in a numeric context are now numified directly in most cases, | |
f105b7be | 821 | rather than being numified via stringification. |
5ed58cbd RS |
822 | |
823 | =item * | |
824 | ||
825 | The C<x> repetition operator is now folded to a single constant at compile | |
826 | time if called in scalar context with constant operands and no parentheses | |
827 | around the left operand. | |
828 | ||
829 | =back | |
830 | ||
831 | =head1 Modules and Pragmata | |
832 | ||
833 | =head2 New Modules and Pragmata | |
2426c394 | 834 | |
337fb649 | 835 | =over 4 |
982110e0 | 836 | |
82d98f72 | 837 | =item * |
2426c394 | 838 | |
5ed58cbd RS |
839 | L<Config::Perl::V> version 0.16 has been added as a dual-lifed module. |
840 | It provides structured data retrieval of C<perl -V> output including | |
841 | information only known to the C<perl> binary and not available via L<Config>. | |
842 | ||
843 | =back | |
844 | ||
845 | =head2 Updated Modules and Pragmata | |
846 | ||
847 | This is only an overview of selected module updates. For a complete | |
848 | list of updates, run: | |
849 | ||
f105b7be | 850 | $ corelist --diff 5.16.0 5.18.0 |
5ed58cbd | 851 | |
e612b5a0 | 852 | You can substitute your favorite version in place of C<5.16.0>, too. |
5ed58cbd RS |
853 | |
854 | =over 4 | |
33392251 BF |
855 | |
856 | =item * | |
857 | ||
5ed58cbd RS |
858 | L<XXX> has been upgraded from version A.xx to B.yy. |
859 | ||
860 | =back | |
861 | ||
862 | =head2 Removed Modules and Pragmata | |
863 | ||
864 | =over | |
33392251 BF |
865 | |
866 | =item * | |
867 | ||
5ed58cbd RS |
868 | L<Version::Requirements> has been removed from the core distribution. It is |
869 | available under a different name: L<CPAN::Meta::Requirements>. | |
2426c394 | 870 | |
337fb649 | 871 | =back |
2426c394 | 872 | |
5ed58cbd | 873 | =head1 Documentation |
19718730 | 874 | |
5ed58cbd RS |
875 | =head2 Changes to Existing Documentation |
876 | ||
877 | =head3 L<perlcheat> | |
82d98f72 | 878 | |
5a6a30f4 | 879 | =over 4 |
b7c7d786 | 880 | |
5ed58cbd RS |
881 | =item * |
882 | ||
883 | L<perlcheat> has been reorganized, and a few new sections were added. | |
884 | ||
885 | =back | |
886 | ||
887 | =head3 L<perldata> | |
888 | ||
889 | =over 4 | |
82d98f72 | 890 | |
5ed58cbd | 891 | =item * |
d2d1e842 | 892 | |
5ed58cbd RS |
893 | Now explicitly documents the behaviour of hash initializer lists that |
894 | contain duplicate keys. | |
f355e93d | 895 | |
5a6a30f4 | 896 | =back |
f355e93d | 897 | |
5ed58cbd | 898 | =head3 L<perldiag> |
19718730 | 899 | |
19718730 | 900 | =over 4 |
e14ac59b | 901 | |
5ed58cbd RS |
902 | =item * |
903 | ||
904 | The explanation of symbolic references being prevented by "strict refs" | |
905 | now doesn't assume that the reader knows what symbolic references are. | |
906 | ||
907 | =back | |
9f351b45 | 908 | |
5ed58cbd | 909 | =head3 L<perlfaq> |
9f351b45 | 910 | |
5ed58cbd | 911 | =over 4 |
9f351b45 | 912 | |
5ed58cbd | 913 | =item * |
7cf3104f | 914 | |
5ed58cbd | 915 | L<perlfaq> has been synchronized with version 5.0150040 from CPAN. |
12719193 | 916 | |
6253ee75 | 917 | =back |
216cf7fc | 918 | |
5ed58cbd | 919 | =head3 L<perlfunc> |
f5b73711 | 920 | |
5ed58cbd RS |
921 | =over 4 |
922 | ||
923 | =item * | |
a75569c0 | 924 | |
5ed58cbd | 925 | The return value of C<pipe> is now documented. |
a75569c0 | 926 | |
5ed58cbd | 927 | =item * |
a75569c0 | 928 | |
5ed58cbd RS |
929 | Clarified documentation of C<our>. |
930 | ||
931 | =back | |
932 | ||
933 | =head3 L<perlop> | |
934 | ||
935 | =over 4 | |
936 | ||
937 | =item * | |
938 | ||
939 | Loop control verbs (C<dump>, C<goto>, C<next>, C<last> and C<redo>) have always | |
940 | had the same precedence as assignment operators, but this was not documented | |
941 | until now. | |
942 | ||
943 | =back | |
944 | ||
945 | =head3 Diagnostics | |
946 | ||
947 | The following additions or changes have been made to diagnostic output, | |
948 | including warnings and fatal error messages. For the complete list of | |
949 | diagnostic messages, see L<perldiag>. | |
950 | ||
951 | XXX New or changed warnings emitted by the core's C<C> code go here. Also | |
952 | include any changes in L<perldiag> that reconcile it to the C<C> code. | |
953 | ||
954 | =head2 New Diagnostics | |
955 | ||
956 | XXX Newly added diagnostic messages go under here, separated into New Errors | |
957 | and New Warnings | |
958 | ||
959 | =head3 New Errors | |
960 | ||
961 | =over 4 | |
962 | ||
963 | =item * | |
964 | ||
965 | L<Unterminated delimiter for here document|perldiag/"Unterminated delimiter for here document"> | |
966 | ||
967 | This message now occurs when a here document label has an initial quotation | |
968 | mark but the final quotation mark is missing. | |
969 | ||
970 | This replaces a bogus and misleading error message about not finding the label | |
971 | itself [perl #114104]. | |
972 | ||
973 | =item * | |
974 | ||
975 | L<panic: child pseudo-process was never scheduled|perldiag/"panic: child pseudo-process was never scheduled"> | |
976 | ||
977 | This error is thrown when a child pseudo-process in the ithreads implementation | |
978 | on Windows was not scheduled within the time period allowed and therefore was | |
979 | not able to initialize properly [perl #88840]. | |
980 | ||
981 | =item * | |
982 | ||
983 | L<Group name must start with a non-digit word character in regex; marked by <-- HERE in mE<sol>%sE<sol>|perldiag/"Group name must start with a non-digit word character in regex; marked by <-- HERE in m/%s/"> | |
984 | ||
985 | This error has been added for C<(?&0)>, which is invalid. It used to | |
986 | produce an incomprehensible error message [perl #101666]. | |
987 | ||
988 | =item * | |
989 | ||
990 | L<Can't use an undefined value as a subroutine reference|perldiag/"Can't use an undefined value as %s reference"> | |
991 | ||
992 | Calling an undefined value as a subroutine now produces this error message. | |
993 | It used to, but was accidentally disabled, first in Perl 5.004 for | |
e612b5a0 | 994 | non-magical variables, and then in Perl v5.14 for magical (e.g., tied) |
5ed58cbd RS |
995 | variables. It has now been restored. In the mean time, undef was treated |
996 | as an empty string [perl #113576]. | |
997 | ||
998 | =item * | |
999 | ||
1000 | L<Experimental "%s" subs not enabled|perldiag/"Experimental "%s" subs not enabled"> | |
1001 | ||
1002 | To use lexical subs, you must first enable them: | |
1003 | ||
1004 | no warnings 'experimental::lexical_subs'; | |
1005 | use feature 'lexical_subs'; | |
1006 | my sub foo { ... } | |
1007 | ||
1008 | =back | |
1009 | ||
1010 | =head3 New Warnings | |
1011 | ||
1012 | =over 4 | |
1013 | ||
1014 | =item * | |
1015 | ||
1016 | XXX: This needs more detail. | |
1017 | ||
1018 | Strings with code points over 0xFF may not be mapped into in-memory file | |
1019 | handles | |
1020 | ||
1021 | =item * | |
1022 | ||
1023 | L<'%s' resolved to '\o{%s}%d'|perldiag/"'%s' resolved to '\o{%s}%d'"> | |
1024 | ||
1025 | =item * | |
1026 | ||
1027 | L<'Trailing white-space in a charnames alias definition is deprecated'|perldiag/"Trailing white-space in a charnames alias definition is deprecated"> | |
1028 | ||
1029 | =item * | |
1030 | ||
1031 | L<'A sequence of multiple spaces in a charnames alias definition is deprecated'|perldiag/"A sequence of multiple spaces in a charnames alias definition is deprecated"> | |
1032 | ||
1033 | =item * | |
1034 | ||
1035 | L<'Passing malformed UTF-8 to "%s" is deprecated'|perldiag/"Passing malformed UTF-8 to "%s" is deprecated"> | |
1036 | ||
1037 | =item * | |
1038 | ||
1039 | L<Subroutine "&%s" is not available|perldiag/"Subroutine "&%s" is not available"> | |
1040 | ||
1041 | (W closure) During compilation, an inner named subroutine or eval is | |
1042 | attempting to capture an outer lexical subroutine that is not currently | |
1043 | available. This can happen for one of two reasons. First, the lexical | |
1044 | subroutine may be declared in an outer anonymous subroutine that has not | |
1045 | yet been created. (Remember that named subs are created at compile time, | |
1046 | while anonymous subs are created at run-time.) For example, | |
1047 | ||
1048 | sub { my sub a {...} sub f { \&a } } | |
1049 | ||
1050 | At the time that f is created, it can't capture the current the "a" sub, | |
1051 | since the anonymous subroutine hasn't been created yet. Conversely, the | |
1052 | following won't give a warning since the anonymous subroutine has by now | |
1053 | been created and is live: | |
1054 | ||
1055 | sub { my sub a {...} eval 'sub f { \&a }' }->(); | |
1056 | ||
1057 | The second situation is caused by an eval accessing a variable that has | |
1058 | gone out of scope, for example, | |
1059 | ||
1060 | sub f { | |
1061 | my sub a {...} | |
1062 | sub { eval '\&a' } | |
1063 | } | |
1064 | f()->(); | |
1065 | ||
1066 | Here, when the '\&a' in the eval is being compiled, f() is not currently | |
1067 | being executed, so its &a is not available for capture. | |
1068 | ||
1069 | =item * | |
1070 | ||
1071 | L<"%s" subroutine &%s masks earlier declaration in same %s|perldiag/"%s" subroutine &%s masks earlier declaration in same %s> | |
1072 | ||
1073 | (W misc) A "my" or "state" subroutine has been redeclared in the | |
1074 | current scope or statement, effectively eliminating all access to | |
1075 | the previous instance. This is almost always a typographical error. | |
1076 | Note that the earlier subroutine will still exist until the end of | |
1077 | the scope or until all closure references to it are destroyed. | |
1078 | ||
1079 | =item * | |
1080 | ||
1081 | L<The %s feature is experimental|perldiag/"The %s feature is experimental"> | |
1082 | ||
1083 | (S experimental) This warning is emitted if you enable an experimental | |
1084 | feature via C<use feature>. Simply suppress the warning if you want | |
1085 | to use the feature, but know that in doing so you are taking the risk | |
1086 | of using an experimental feature which may change or be removed in a | |
1087 | future Perl version: | |
1088 | ||
1089 | no warnings "experimental::lexical_subs"; | |
1090 | use feature "lexical_subs"; | |
1091 | ||
1092 | =item * | |
1093 | ||
1094 | L<sleep(%u) too large|perldiag/"sleep(%u) too large"> | |
1095 | ||
1096 | (W overflow) You called C<sleep> with a number that was larger than it can | |
1097 | reliably handle and C<sleep> probably slept for less time than requested. | |
1098 | ||
1099 | =item * | |
1100 | ||
1101 | L<Wide character in setenv|perldiag/"Wide character in %s"> | |
1102 | ||
1103 | Attempts to put wide characters into environment variables via C<%ENV> now | |
1104 | provoke this warning. | |
1105 | ||
1106 | =item * | |
1107 | ||
1108 | "L<Invalid negative number (%s) in chr|perldiag/"Invalid negative number (%s) in chr">" | |
1109 | ||
1110 | C<chr()> now warns when passed a negative value [perl #83048]. | |
1111 | ||
1112 | =item * | |
1113 | ||
1114 | "L<Integer overflow in srand|perldiag/"Integer overflow in srand">" | |
1115 | ||
1116 | C<srand()> now warns when passed a value that doesn't fit in a C<UV> (since the | |
1117 | value will be truncated rather than overflowing) [perl #40605]. | |
1118 | ||
1119 | =item * | |
1120 | ||
1121 | "L<-i used with no filenames on the command line, reading from STDIN|perldiag/"-i used with no filenames on the command line, reading from STDIN">" | |
1122 | ||
1123 | Running perl with the C<-i> flag now warns if no input files are provided on | |
1124 | the command line [perl #113410]. | |
1125 | ||
1126 | =back | |
1127 | ||
1128 | =head2 Changes to Existing Diagnostics | |
1129 | ||
1130 | =over 4 | |
1131 | ||
1132 | =item * | |
1133 | ||
1134 | L<$* is no longer supported|perldiag/"$* is no longer supported"> | |
1135 | ||
1136 | The warning that use of C<$*> and C<$#> is no longer supported is now | |
1137 | generated for every location that references them. Previously it would fail | |
1138 | to be generated if another variable using the same typeglob was seen first | |
1139 | (e.g. C<@*> before C<$*>), and would not be generated for the second and | |
1140 | subsequent uses. (It's hard to fix the failure to generate warnings at all | |
1141 | without also generating them every time, and warning every time is | |
1142 | consistent with the warnings that C<$[> used to generate.) | |
1143 | ||
1144 | =item * | |
1145 | ||
1146 | The warnings for C<\b{> and C<\B{> were added. They are a deprecation | |
1147 | warning which should be turned off by that category. One should not | |
1148 | have to turn off regular regexp warnings as well to get rid of these. | |
1149 | ||
1150 | =item * | |
1151 | ||
1152 | L<Constant(%s): Call to &{$^H{%s}} did not return a defined value|perldiag/Constant(%s): Call to &{$^H{%s}} did not return a defined value> | |
1153 | ||
1154 | Constant overloading that returns C<undef> results in this error message. | |
1155 | For numeric constants, it used to say "Constant(undef)". "undef" has been | |
1156 | replaced with the number itself. | |
1157 | ||
1158 | =item * | |
1159 | ||
1160 | The error produced when a module cannot be loaded now includes a hint that | |
1161 | the module may need to be installed: "Can't locate hopping.pm in @INC (you | |
1162 | may need to install the hopping module) (@INC contains: ...)" | |
1163 | ||
1164 | =item * | |
1165 | ||
1166 | L<vector argument not supported with alpha versions|perldiag/vector argument not supported with alpha versions> | |
1167 | ||
1168 | This warning was not suppressable, even with C<no warnings>. Now it is | |
1169 | suppressible, and has been moved from the "internal" category to the | |
1170 | "printf" category. | |
1171 | ||
1172 | =item * | |
1173 | ||
1174 | C<< Can't do {n,m} with n > m in regex; marked by <-- HERE in m/%s/ >> | |
1175 | ||
1176 | This fatal error has been turned into a warning that reads: | |
1177 | ||
1178 | L<< Quantifier {n,m} with n > m can't match in regex | perldiag/Quantifier {n,m} with n > m can't match in regex >> | |
1179 | ||
1180 | (W regexp) Minima should be less than or equal to maxima. If you really want | |
1181 | your regexp to match something 0 times, just put {0}. | |
1182 | ||
1183 | =item * | |
1184 | ||
1185 | The "Runaway prototype" warning that occurs in bizarre cases has been | |
1186 | removed as being unhelpful and inconsistent. | |
1187 | ||
1188 | =item * | |
1189 | ||
1190 | The "Not a format reference" error has been removed, as the only case in | |
1191 | which it could be triggered was a bug. | |
1192 | ||
1193 | =item * | |
1194 | ||
1195 | The "Unable to create sub named %s" error has been removed for the same | |
1196 | reason. | |
1197 | ||
1198 | =item * | |
1199 | ||
1200 | The 'Can't use "my %s" in sort comparison' error has been downgraded to a | |
1201 | warning, '"my %s" used in sort comparison' (with 'state' instead of 'my' | |
1202 | for state variables). In addition, the heuristics for guessing whether | |
1203 | lexical $a or $b has been misused have been improved to generate fewer | |
1204 | false positives. Lexical $a and $b are no longer disallowed if they are | |
1205 | outside the sort block. Also, a named unary or list operator inside the | |
1206 | sort block no longer causes the $a or $b to be ignored [perl #86136]. | |
1207 | ||
1208 | =back | |
1209 | ||
1210 | =head1 Utility Changes | |
1211 | ||
1212 | =head3 L<h2xs> | |
1213 | ||
1214 | =over 4 | |
1215 | ||
1216 | =item * | |
1217 | ||
1218 | F<h2xs> no longer produces invalid code for empty defines. [perl #20636] | |
1219 | ||
1220 | =back | |
1221 | ||
1222 | =head1 Configuration and Compilation | |
1223 | ||
1224 | =over 4 | |
1225 | ||
1226 | =item * | |
1227 | ||
1228 | Added C<useversionedarchname> option to Configure | |
1229 | ||
1230 | When set, it includes 'api_versionstring' in 'archname'. E.g. | |
1231 | x86_64-linux-5.13.6-thread-multi. It is unset by default. | |
1232 | ||
1233 | This feature was requested by Tim Bunce, who observed that | |
f105b7be | 1234 | C<INSTALL_BASE> creates a library structure that does not |
5ed58cbd RS |
1235 | differentiate by perl version. Instead, it places architecture |
1236 | specific files in "$install_base/lib/perl5/$archname". This makes | |
f105b7be | 1237 | it difficult to use a common C<INSTALL_BASE> library path with |
5ed58cbd RS |
1238 | multiple versions of perl. |
1239 | ||
f105b7be | 1240 | By setting C<-Duseversionedarchname>, the $archname will be |
c2959982 | 1241 | distinct for architecture I<and> API version, allowing mixed use of |
f105b7be | 1242 | C<INSTALL_BASE>. |
5ed58cbd RS |
1243 | |
1244 | =item * | |
1245 | ||
ff772877 RS |
1246 | Add a C<PERL_NO_INLINE_FUNCTIONS> option |
1247 | ||
f105b7be | 1248 | If C<PERL_NO_INLINE_FUNCTIONS> is defined, don't include "inline.h" |
ff772877 RS |
1249 | |
1250 | This permits test code to include the perl headers for definitions without | |
1251 | creating a link dependency on the perl library (which may not exist yet). | |
1252 | ||
1253 | =item * | |
1254 | ||
5ed58cbd RS |
1255 | Configure will honour the external C<MAILDOMAIN> environment variable, if set. |
1256 | ||
1257 | =item * | |
1258 | ||
1259 | C<installman> no longer ignores the silent option | |
1260 | ||
1261 | =item * | |
1262 | ||
1263 | Both C<META.yml> and C<META.json> files are now included in the distribution. | |
1264 | ||
1265 | =item * | |
1266 | ||
1267 | F<Configure> will now correctly detect C<isblank()> when compiling with a C++ | |
1268 | compiler. | |
1269 | ||
1270 | =item * | |
1271 | ||
1272 | The pager detection in F<Configure> has been improved to allow responses which | |
1273 | specify options after the program name, e.g. B</usr/bin/less -R>, if the user | |
1274 | accepts the default value. This helps B<perldoc> when handling ANSI escapes | |
1275 | [perl #72156]. | |
1276 | ||
1277 | =back | |
1278 | ||
1279 | =head1 Testing | |
1280 | ||
1281 | =over 4 | |
1282 | ||
1283 | =item * | |
1284 | ||
1285 | The test suite now has a section for tests that require very large amounts | |
1286 | of memory. These tests won't run by default; they can be enabled by | |
1287 | setting the C<PERL_TEST_MEMORY> environment variable to the number of | |
1288 | gibibytes of memory that may be safely used. | |
1289 | ||
1290 | =back | |
1291 | ||
1292 | =head1 Platform Support | |
1293 | ||
1294 | =head2 Discontinued Platforms | |
1295 | ||
1296 | =over 4 | |
1297 | ||
1298 | =item BeOS | |
1299 | ||
1300 | BeOS was an operating system for personal computers developed by Be Inc, | |
1301 | initially for their BeBox hardware. The OS Haiku was written as an open | |
1302 | source replacement for/continuation of BeOS, and its perl port is current and | |
1303 | actively maintained. | |
1304 | ||
1305 | =item UTS Global | |
1306 | ||
1307 | Support code relating to UTS global has been removed. UTS was a mainframe | |
1308 | version of System V created by Amdahl, subsequently sold to UTS Global. The | |
e612b5a0 | 1309 | port has not been touched since before Perl v5.8.0, and UTS Global is now |
5ed58cbd RS |
1310 | defunct. |
1311 | ||
1312 | =item VM/ESA | |
1313 | ||
1314 | Support for VM/ESA has been removed. The port was tested on 2.3.0, which | |
1315 | IBM ended service on in March 2002. 2.4.0 ended service in June 2003, and | |
1316 | was superseded by Z/VM. The current version of Z/VM is V6.2.0, and scheduled | |
1317 | for end of service on 2015/04/30. | |
1318 | ||
1319 | =item MPE/IX | |
1320 | ||
1321 | Support for MPE/IX has been removed. | |
1322 | ||
1323 | =item EPOC | |
1324 | ||
1325 | Support code relating to EPOC has been removed. EPOC was a family of | |
1326 | operating systems developed by Psion for mobile devices. It was the | |
1327 | predecessor of Symbian. The port was last updated in April 2002. | |
1328 | ||
1329 | =item Rhapsody | |
1330 | ||
1331 | Support for Rhapsody has been removed. | |
1332 | ||
1333 | =back | |
1334 | ||
1335 | =head2 Platform-Specific Notes | |
1336 | ||
1337 | =head3 AIX | |
1338 | ||
1339 | Configure now always adds C<-qlanglvl=extc99> to the CC flags on AIX when | |
1340 | using xlC. This will make it easier to compile a number of XS-based modules | |
1341 | that assume C99 [perl #113778]. | |
1342 | ||
1343 | =head3 clang++ | |
1344 | ||
1345 | There is now a workaround for a compiler bug that prevented compiling | |
e612b5a0 | 1346 | with clang++ since Perl v5.15.7 [perl #112786]. |
5ed58cbd RS |
1347 | |
1348 | =head3 C++ | |
1349 | ||
1350 | When compiling the Perl core as C++ (which is only semi-supported), the | |
1351 | mathom functions are now compiled as C<extern "C">, to ensure proper | |
1352 | binary compatibility. (However, binary compatibility isn't generally | |
1353 | guaranteed anyway in the situations where this would matter.) | |
1354 | ||
1355 | =head3 Darwin | |
1356 | ||
1357 | Stop hardcoding an alignment on 8 byte boundaries to fix builds using | |
1358 | -Dusemorebits. | |
1359 | ||
1360 | =head3 Haiku | |
1361 | ||
1362 | Perl should now work out of the box on Haiku R1 Alpha 4. | |
1363 | ||
1364 | =head3 MidnightBSD | |
1365 | ||
1366 | C<libc_r> was removed from recent versions of MidnightBSD and older versions | |
1367 | work better with C<pthread>. Threading is now enabled using C<pthread> which | |
1368 | corrects build errors with threading enabled on 0.4-CURRENT. | |
1369 | ||
1370 | =head3 Solaris | |
1371 | ||
1372 | In Configure, avoid running sed commands with flags not supported on Solaris. | |
1373 | ||
1374 | =head3 VMS | |
1375 | ||
1376 | =over | |
1377 | ||
1378 | =item * | |
1379 | ||
1380 | Where possible, the case of filenames and command-line arguments is now | |
1381 | preserved by enabling the CRTL features C<DECC$EFS_CASE_PRESERVE> and | |
1382 | C<DECC$ARGV_PARSE_STYLE> at start-up time. The latter only takes effect | |
1383 | when extended parse is enabled in the process from which Perl is run. | |
1384 | ||
1385 | =item * | |
1386 | ||
1387 | The character set for Extended Filename Syntax (EFS) is now enabled by default | |
1388 | on VMS. Among other things, this provides better handling of dots in directory | |
05f5908f | 1389 | names, multiple dots in filenames, and spaces in filenames. To obtain the old |
5ed58cbd RS |
1390 | behavior, set the logical name C<DECC$EFS_CHARSET> to C<DISABLE>. |
1391 | ||
1392 | =item * | |
1393 | ||
05f5908f | 1394 | Fixed linking on builds configured with C<-Dusemymalloc=y>. |
5ed58cbd RS |
1395 | |
1396 | =item * | |
1397 | ||
05f5908f CB |
1398 | Experimental support for building Perl with the HP C++ compiler is available |
1399 | by configuring with C<-Dusecxx>. | |
5ed58cbd RS |
1400 | |
1401 | =item * | |
1402 | ||
1403 | All C header files from the top-level directory of the distribution are now | |
1404 | installed on VMS, providing consistency with a long-standing practice on other | |
1405 | platforms. Previously only a subset were installed, which broke non-core | |
1406 | extension builds for extensions that depended on the missing include files. | |
1407 | ||
1408 | =item * | |
1409 | ||
1410 | Quotes are now removed from the command verb (but not the parameters) for | |
1411 | commands spawned via C<system>, backticks, or a piped C<open>. Previously, | |
1412 | quotes on the verb were passed through to DCL, which would fail to recognize | |
1413 | the command. Also, if the verb is actually a path to an image or command | |
1414 | procedure on an ODS-5 volume, quoting it now allows the path to contain spaces. | |
1415 | ||
1416 | =item * | |
1417 | ||
1418 | The B<a2p> build has been fixed for the HP C++ compiler on OpenVMS. | |
1419 | ||
1420 | =back | |
1421 | ||
1422 | =head3 Win32 | |
1423 | ||
1424 | =over | |
1425 | ||
1426 | =item * | |
1427 | ||
1428 | Perl can now be built using Microsoft's Visual C++ 2012 compiler by specifying | |
1429 | CCTYPE=MSVC110 (or MSVC110FREE if you are using the free Express edition for | |
1430 | Windows Desktop) in F<win32/Makefile>. | |
1431 | ||
1432 | =item * | |
1433 | ||
f105b7be | 1434 | The option to build without C<USE_SOCKETS_AS_HANDLES> has been removed. |
5ed58cbd RS |
1435 | |
1436 | =item * | |
1437 | ||
1438 | Fixed a problem where perl could crash while cleaning up threads (including the | |
1439 | main thread) in threaded debugging builds on Win32 and possibly other platforms | |
1440 | [perl #114496]. | |
1441 | ||
1442 | =item * | |
1443 | ||
1444 | A rare race condition that would lead to L<sleep|perlfunc/sleep> taking more | |
1445 | time than requested, and possibly even hanging, has been fixed [perl #33096]. | |
1446 | ||
1447 | =item * | |
1448 | ||
1449 | C<link> on Win32 now attempts to set C<$!> to more appropriate values | |
1450 | based on the Win32 API error code. [perl #112272] | |
1451 | ||
1452 | Perl no longer mangles the environment block, e.g. when launching a new | |
1453 | sub-process, when the environment contains non-ASCII characters. Known | |
1454 | problems still remain, however, when the environment contains characters | |
1455 | outside of the current ANSI codepage (e.g. see the item about Unicode in | |
1456 | C<%ENV> in L<http://perl5.git.perl.org/perl.git/blob/HEAD:/Porting/todo.pod>). | |
1457 | [perl #113536] | |
1458 | ||
1459 | =item * | |
1460 | ||
1461 | Building perl with some Windows compilers used to fail due to a problem | |
1462 | with miniperl's C<glob> operator (which uses the C<perlglob> program) | |
1463 | deleting the PATH environment variable [perl #113798]. | |
1464 | ||
1465 | =item * | |
1466 | ||
f105b7be | 1467 | A new makefile option, C<USE_64_BIT_INT>, has been added to the Windows |
5ed58cbd RS |
1468 | makefiles. Set this to "define" when building a 32-bit perl if you want |
1469 | it to use 64-bit integers. | |
1470 | ||
1471 | Machine code size reductions, already made to the DLLs of XS modules in | |
e612b5a0 | 1472 | Perl v5.17.2, have now been extended to the perl DLL itself. |
5ed58cbd | 1473 | |
e612b5a0 | 1474 | Building with VC++ 6.0 was inadvertently broken in Perl v5.17.2 but has |
5ed58cbd RS |
1475 | now been fixed again. |
1476 | ||
1477 | =back | |
1478 | ||
1479 | =head3 WinCE | |
1480 | ||
1481 | Building on WinCE is now possible once again, although more work is required | |
1482 | to fully restore a clean build. | |
1483 | ||
1484 | =head1 Internal Changes | |
1485 | ||
1486 | =over | |
1487 | ||
1488 | =item * | |
1489 | ||
4263dd11 | 1490 | Synonyms for the misleadingly named C<av_len()> have been created: |
5ed58cbd RS |
1491 | C<av_top_index()> and C<av_tindex>. All three of these return the |
1492 | number of the highest index in the array, not the number of elements it | |
1493 | contains. | |
1494 | ||
1495 | =item * | |
1496 | ||
1497 | SvUPGRADE() is no longer an expression. Originally this macro (and its | |
1498 | underlying function, sv_upgrade()) were documented as boolean, although | |
1499 | in reality they always croaked on error and never returned false. In 2005 | |
1500 | the documentation was updated to specify a void return value, but | |
1501 | SvUPGRADE() was left always returning 1 for backwards compatibility. This | |
1502 | has now been removed, and SvUPGRADE() is now a statement with no return | |
1503 | value. | |
1504 | ||
1505 | So this is now a syntax error: | |
1506 | ||
1507 | if (!SvUPGRADE(sv)) { croak(...); } | |
1508 | ||
1509 | If you have code like that, simply replace it with | |
1510 | ||
1511 | SvUPGRADE(sv); | |
1512 | ||
1513 | or to to avoid compiler warnings with older perls, possibly | |
1514 | ||
1515 | (void)SvUPGRADE(sv); | |
1516 | ||
1517 | =item * | |
1518 | ||
1519 | Perl has a new copy-on-write mechanism that allows any SvPOK scalar to be | |
1520 | upgraded to a copy-on-write scalar. A reference count on the string buffer | |
d16360cf RS |
1521 | is stored in the string buffer itself. This feature is B<not enabled by |
1522 | default>. | |
5ed58cbd | 1523 | |
d16360cf RS |
1524 | It can be enabled in a perl build by running F<Configure> with |
1525 | B<-Accflags=-DPERL_NEW_COPY_ON_WRITE>, and we would encourage XS authors | |
1526 | to try their code with such an enabled perl, and provide feedback. | |
1527 | Unfortunately, there is not yet a good guide to updating XS code to cope | |
1528 | with COW. Until such a document is available, consult the perl5-porters | |
1529 | mailing list. | |
5ed58cbd | 1530 | |
d16360cf RS |
1531 | It breaks a few XS modules by allowing copy-on-write scalars to go |
1532 | through code paths that never encountered them before. | |
5ed58cbd RS |
1533 | |
1534 | =item * | |
1535 | ||
1536 | Copy-on-write no longer uses the SvFAKE and SvREADONLY flags. Hence, | |
1537 | SvREADONLY indicates a true read-only SV. | |
1538 | ||
1539 | Use the SvIsCOW macro (as before) to identify a copy-on-write scalar. | |
1540 | ||
1541 | =item * | |
1542 | ||
f105b7be | 1543 | C<PL_glob_index> is gone. |
5ed58cbd RS |
1544 | |
1545 | =item * | |
1546 | ||
1547 | The private Perl_croak_no_modify has had its context parameter removed. It is | |
1548 | now has a void prototype. Users of the public API croak_no_modify remain | |
1549 | unaffected. | |
1550 | ||
1551 | =item * | |
1552 | ||
1553 | Copy-on-write (shared hash key) scalars are no longer marked read-only. | |
1554 | C<SvREADONLY> returns false on such an SV, but C<SvIsCOW> still returns | |
1555 | true. | |
1556 | ||
1557 | =item * | |
1558 | ||
1559 | A new op type, C<OP_PADRANGE> has been introduced. The perl peephole | |
1560 | optimiser will, where possible, substitute a single padrange op for a | |
1561 | pushmark followed by one or more pad ops, and possibly also skipping list | |
1562 | and nextstate ops. In addition, the op can carry out the tasks associated | |
f105b7be | 1563 | with the RHS of a C<< my(...) = @_ >> assignment, so those ops may be optimised |
5ed58cbd RS |
1564 | away too. |
1565 | ||
1566 | =item * | |
1567 | ||
1568 | Case-insensitive matching inside a [bracketed] character class with a | |
1569 | multi-character fold no longer excludes one of the possibilities in the | |
1570 | circumstances that it used to. [perl #89774]. | |
1571 | ||
1572 | =item * | |
1573 | ||
1574 | C<PL_formfeed> has been removed. | |
1575 | ||
1576 | =item * | |
1577 | ||
1578 | The regular expression engine no longer reads one byte past the end of the | |
1579 | target string. While for all internally well-formed scalars this should | |
1580 | never have been a problem, this change facilitates clever tricks with | |
1581 | string buffers in CPAN modules. [perl #73542] | |
1582 | ||
1583 | =item * | |
1584 | ||
1585 | Inside a BEGIN block, C<PL_compcv> now points to the currently-compiling | |
1586 | subroutine, rather than the BEGIN block itself. | |
1587 | ||
1588 | =item * | |
1589 | ||
1590 | C<mg_length> has been deprecated. | |
1591 | ||
1592 | =item * | |
1593 | ||
1594 | C<sv_len> now always returns a byte count and C<sv_len_utf8> a character | |
1595 | count. Previously, C<sv_len> and C<sv_len_utf8> were both buggy and would | |
1596 | sometimes returns bytes and sometimes characters. C<sv_len_utf8> no longer | |
be12dd22 | 1597 | assumes that its argument is in UTF-8. Neither of these creates UTF-8 caches |
5ed58cbd RS |
1598 | for tied or overloaded values or for non-PVs any more. |
1599 | ||
1600 | =item * | |
1601 | ||
1602 | C<sv_mortalcopy> now copies string buffers of shared hash key scalars when | |
1603 | called from XS modules [perl #79824]. | |
1604 | ||
1605 | =item * | |
1606 | ||
1607 | C<RXf_SPLIT> and C<RXf_SKIPWHITE> are no longer used. They are now | |
1608 | #defined as 0. | |
1609 | ||
1610 | =item * | |
1611 | ||
1612 | The new C<RXf_MODIFIES_VARS> flag can be set by custom regular expression | |
1613 | engines to indicate that the execution of the regular expression may cause | |
1614 | variables to be modified. This lets C<s///> know to skip certain | |
1615 | optimisations. Perl's own regular expression engine sets this flag for the | |
1616 | special backtracking verbs that set $REGMARK and $REGERROR. | |
1617 | ||
1618 | =item * | |
1619 | ||
1620 | The APIs for accessing lexical pads have changed considerably. | |
1621 | ||
1622 | C<PADLIST>s are now longer C<AV>s, but their own type instead. | |
1623 | C<PADLIST>s now contain a C<PAD> and a C<PADNAMELIST> of C<PADNAME>s, | |
1624 | rather than C<AV>s for the pad and the list of pad names. C<PAD>s, | |
1625 | C<PADNAMELIST>s, and C<PADNAME>s are to be accessed as such through the | |
1626 | newly added pad API instead of the plain C<AV> and C<SV> APIs. See | |
1627 | L<perlapi> for details. | |
1628 | ||
1629 | =item * | |
1630 | ||
1631 | In the regex API, the numbered capture callbacks are passed an index | |
1632 | indicating what match variable is being accessed. There are special | |
1633 | index values for the C<$`, $&, $&> variables. Previously the same three | |
1634 | values were used to retrieve C<${^PREMATCH}, ${^MATCH}, ${^POSTMATCH}> | |
1635 | too, but these have now been assigned three separate values. See | |
1636 | L<perlreapi/Numbered capture callbacks>. | |
1637 | ||
1638 | =item * | |
1639 | ||
1640 | C<PL_sawampersand> was previously a boolean indicating that any of | |
1641 | C<$`, $&, $&> had been seen; it now contains three one-bit flags | |
1642 | indicating the presence of each of the variables individually. | |
1643 | ||
1644 | =item * | |
1645 | ||
1646 | The C<CV *> typemap entry now supports C<&{}> overloading and typeglobs, | |
1647 | just like C<&{...}> [perl #96872]. | |
1648 | ||
1649 | =item * | |
1650 | ||
1651 | The C<SVf_AMAGIC> flag to indicate overloading is now on the stash, not the | |
1652 | object. It is now set automatically whenever a method or @ISA changes, so | |
1653 | its meaning has changed, too. It now means "potentially overloaded". When | |
1654 | the overload table is calculated, the flag is automatically turned off if | |
1655 | there is no overloading, so there should be no noticeable slowdown. | |
1656 | ||
1657 | The staleness of the overload tables is now checked when overload methods | |
1658 | are invoked, rather than during C<bless>. | |
1659 | ||
1660 | "A" magic is gone. The changes to the handling of the C<SVf_AMAGIC> flag | |
1661 | eliminate the need for it. | |
1662 | ||
1663 | C<PL_amagic_generation> has been removed as no longer necessary. For XS | |
1664 | modules, it is now a macro alias to C<PL_na>. | |
1665 | ||
1666 | The fallback overload setting is now stored in a stash entry separate from | |
1667 | overloadedness itself. | |
1668 | ||
1669 | =item * | |
1670 | ||
1671 | The character-processing code has been cleaned up in places. The changes | |
1672 | should be operationally invisible. | |
1673 | ||
1674 | =item * | |
1675 | ||
e612b5a0 | 1676 | The C<study> function was made a no-op in v5.16. It was simply disabled via |
5ed58cbd RS |
1677 | a C<return> statement; the code was left in place. Now the code supporting |
1678 | what C<study> used to do has been removed. | |
1679 | ||
1680 | =item * | |
1681 | ||
1682 | Under threaded perls, there is no longer a separate PV allocated for every | |
1683 | COP to store its package name (C<< cop->stashpv >>). Instead, there is an | |
1684 | offset (C<< cop->stashoff >>) into the new C<PL_stashpad> array, which | |
1685 | holds stash pointers. | |
1686 | ||
1687 | =item * | |
1688 | ||
1689 | In the pluggable regex API, the C<regexp_engine> struct has acquired a new | |
1690 | field C<op_comp>, which is currently just for perl's internal use, and | |
f105b7be | 1691 | should be initialized to NULL by other regex plugin modules. |
5ed58cbd RS |
1692 | |
1693 | =item * | |
1694 | ||
7779650e | 1695 | A new function C<alloccopstash> has been added to the API, but is considered |
5ed58cbd RS |
1696 | experimental. See L<perlapi>. |
1697 | ||
1698 | =item * | |
1699 | ||
1700 | Perl used to implement get magic in a way that would sometimes hide bugs in | |
4263dd11 | 1701 | code that could call mg_get() too many times on magical values. This hiding of |
5ed58cbd RS |
1702 | errors no longer occurs, so long-standing bugs may become visible now. If |
1703 | you see magic-related errors in XS code, check to make sure it, together | |
1704 | with the Perl API functions it uses, calls mg_get() only once on SvGMAGICAL() | |
1705 | values. | |
1706 | ||
1707 | =item * | |
1708 | ||
1709 | OP allocation for CVs now uses a slab allocator. This simplifies | |
1710 | memory management for OPs allocated to a CV, so cleaning up after a | |
1711 | compilation error is simpler and safer [perl #111462][perl #112312]. | |
1712 | ||
1713 | =item * | |
1714 | ||
f105b7be | 1715 | C<PERL_DEBUG_READONLY_OPS> has been rewritten to work with the new slab |
5ed58cbd RS |
1716 | allocator, allowing it to catch more violations than before. |
1717 | ||
1718 | =item * | |
1719 | ||
f105b7be KE |
1720 | The old slab allocator for ops, which was only enabled for C<PERL_IMPLICIT_SYS> |
1721 | and C<PERL_DEBUG_READONLY_OPS>, has been retired. | |
5ed58cbd RS |
1722 | |
1723 | =back | |
1724 | ||
1725 | =head1 Selected Bug Fixes | |
1726 | ||
1727 | =over 4 | |
1728 | ||
1729 | =item * | |
1730 | ||
1731 | Here-doc terminators no longer require a terminating newline character when | |
1732 | they occur at the end of a file. This was already the case at the end of a | |
1733 | string eval [perl #65838]. | |
1734 | ||
1735 | =item * | |
1736 | ||
f105b7be | 1737 | C<-DPERL_GLOBAL_STRUCT> builds now free the global struct B<after> |
5ed58cbd RS |
1738 | they've finished using it. |
1739 | ||
1740 | =item * | |
1741 | ||
1742 | A trailing '/' on a path in @INC will no longer have an additional '/' | |
1743 | appended. | |
1744 | ||
1745 | =item * | |
1746 | ||
1747 | The C<:crlf> layer now works when unread data doesn't fit into its own | |
1748 | buffer. [perl #112244]. | |
1749 | ||
1750 | =item * | |
1751 | ||
1752 | C<ungetc()> now handles UTF-8 encoded data. [perl #116322]. | |
1753 | ||
1754 | =item * | |
1755 | ||
1756 | A bug in the core typemap caused any C types that map to the T_BOOL core | |
1757 | typemap entry to not be set, updated, or modified when the T_BOOL variable was | |
1758 | used in an OUTPUT: section with an exception for RETVAL. T_BOOL in an INPUT: | |
1759 | section was not affected. Using a T_BOOL return type for an XSUB (RETVAL) | |
1760 | was not affected. A side effect of fixing this bug is, if a T_BOOL is specified | |
1761 | in the OUTPUT: section (which previous did nothing to the SV), and a read only | |
1762 | SV (literal) is passed to the XSUB, croaks like "Modification of a read-only | |
1763 | value attempted" will happen. [perl #115796] | |
1764 | ||
1765 | =item * | |
1766 | ||
1767 | On many platforms, providing a directory name as the script name caused perl | |
1768 | to do nothing and report success. It should now universally report an error | |
1769 | and exit nonzero. [perl #61362] | |
1770 | ||
1771 | =item * | |
1772 | ||
1773 | C<sort {undef} ...> under fatal warnings no longer crashes. It had | |
e612b5a0 | 1774 | begun crashing in Perl v5.16. |
5ed58cbd RS |
1775 | |
1776 | =item * | |
1777 | ||
1778 | Stashes blessed into each other | |
1779 | (C<bless \%Foo::, 'Bar'; bless \%Bar::, 'Foo'>) no longer result in double | |
e612b5a0 | 1780 | frees. This bug started happening in Perl v5.16. |
5ed58cbd RS |
1781 | |
1782 | =item * | |
1783 | ||
1784 | Numerous memory leaks have been fixed, mostly involving fatal warnings and | |
1785 | syntax errors. | |
1786 | ||
1787 | =item * | |
1788 | ||
1789 | Some failed regular expression matches such as C<'f' =~ /../g> were not | |
1790 | resetting C<pos>. Also, "match-once" patterns (C<m?...?g>) failed to reset | |
1791 | it, too, when invoked a second time [perl #23180]. | |
1792 | ||
1793 | =item * | |
1794 | ||
1795 | Accessing C<$&> after a pattern match now works if it had not been seen | |
1796 | before the match. I.e., this applies to C<${'&'}> (under C<no strict>) and | |
1797 | C<eval '$&'>. The same applies to C<$'> and C<$`> [perl #4289]. | |
1798 | ||
1799 | =item * | |
1800 | ||
1801 | Several bugs involving C<local *ISA> and C<local *Foo::> causing stale | |
1802 | MRO caches have been fixed. | |
1803 | ||
1804 | =item * | |
1805 | ||
1806 | Defining a subroutine when its typeglob has been aliased no longer results | |
e612b5a0 | 1807 | in stale method caches. This bug was introduced in Perl v5.10. |
5ed58cbd RS |
1808 | |
1809 | =item * | |
1810 | ||
1811 | Localising a typeglob containing a subroutine when the typeglob's package | |
1812 | has been deleted from its parent stash no longer produces an error. This | |
e612b5a0 | 1813 | bug was introduced in Perl v5.14. |
5ed58cbd RS |
1814 | |
1815 | =item * | |
1816 | ||
1817 | Under some circumstances, C<local *method=...> would fail to reset method | |
1818 | caches upon scope exit. | |
1819 | ||
1820 | =item * | |
1821 | ||
1822 | C</[.foo.]/> is no longer an error, but produces a warning (as before) and | |
1823 | is treated as C</[.fo]/> [perl #115818]. | |
1824 | ||
1825 | =item * | |
1826 | ||
1827 | C<goto $tied_var> now calls FETCH before deciding what type of goto | |
1828 | (subroutine or label) this is. | |
1829 | ||
1830 | =item * | |
1831 | ||
1832 | Renaming packages through glob assignment | |
1833 | (C<*Foo:: = *Bar::; *Bar:: = *Baz::>) in combination with C<m?...?> and | |
1834 | C<reset> no longer makes threaded builds crash. | |
1835 | ||
1836 | =item * | |
1837 | ||
1838 | A number of bugs related to assigning a list to hash have been fixed. Many of | |
1839 | these involve lists with repeated keys like C<(1, 1, 1, 1)>. | |
1840 | ||
1841 | =over 4 | |
1842 | ||
1843 | =item * | |
1844 | ||
1845 | The expression C<scalar(%h = (1, 1, 1, 1))> now returns C<4>, not C<2>. | |
1846 | ||
1847 | =item * | |
1848 | ||
1849 | The return value of C<%h = (1, 1, 1)> in list context was wrong. Previously | |
1850 | this would return C<(1, undef, 1)>, now it returns C<(1, undef)>. | |
1851 | ||
1852 | =item * | |
1853 | ||
1854 | Perl now issues the same warning on C<($s, %h) = (1, {})> as it does for | |
1855 | C<(%h) = ({})>, "Reference found where even-sized list expected". | |
1856 | ||
1857 | =item * | |
1858 | ||
1859 | A number of additional edge cases in list assignment to hashes were | |
1860 | corrected. For more details see commit 23b7025ebc. | |
1861 | ||
1862 | =back | |
1863 | ||
1864 | =item * | |
1865 | ||
1866 | Attributes applied to lexical variables no longer leak memory. | |
1867 | [perl #114764] | |
1868 | ||
1869 | =item * | |
1870 | ||
1871 | C<dump>, C<goto>, C<last>, C<next>, C<redo> or C<require> followed by a | |
1872 | bareword (or version) and then an infix operator is no longer a syntax | |
1873 | error. It used to be for those infix operators (like C<+>) that have a | |
1874 | different meaning where a term is expected. [perl #105924] | |
1875 | ||
1876 | =item * | |
1877 | ||
1878 | C<require a::b . 1> and C<require a::b + 1> no longer produce erroneous | |
1879 | ambiguity warnings. [perl #107002] | |
1880 | ||
1881 | =item * | |
1882 | ||
1883 | Class method calls are now allowed on any string, and not just strings | |
1884 | beginning with an alphanumeric character. [perl #105922] | |
1885 | ||
1886 | =item * | |
1887 | ||
1888 | An empty pattern created with C<qr//> used in C<m///> no longer triggers | |
1889 | the "empty pattern reuses last pattern" behaviour. [perl #96230] | |
1890 | ||
1891 | =item * | |
1892 | ||
1893 | Tying a hash during iteration no longer results in a memory leak. | |
1894 | ||
1895 | =item * | |
1896 | ||
1897 | Freeing a tied hash during iteration no longer results in a memory leak. | |
1898 | ||
1899 | =item * | |
1900 | ||
1901 | List assignment to a tied array or hash that dies on STORE no longer | |
1902 | results in a memory leak. | |
1903 | ||
1904 | =item * | |
1905 | ||
1906 | If the hint hash (C<%^H>) is tied, compile-time scope entry (which copies | |
1907 | the hint hash) no longer leaks memory if FETCH dies. [perl #107000] | |
1908 | ||
1909 | =item * | |
1910 | ||
1911 | Constant folding no longer inappropriately triggers the special | |
1912 | C<split " "> behaviour. [perl #94490] | |
1913 | ||
1914 | =item * | |
1915 | ||
1916 | C<defined scalar(@array)>, C<defined do { &foo }>, and similar constructs | |
1917 | now treat the argument to C<defined> as a simple scalar. [perl #97466] | |
1918 | ||
1919 | =item * | |
1920 | ||
1921 | Running a custom debugging that defines no C<*DB::DB> glob or provides a | |
1922 | subroutine stub for C<&DB::DB> no longer results in a crash, but an error | |
1923 | instead. [perl #114990] | |
1924 | ||
1925 | =item * | |
1926 | ||
1927 | C<reset ""> now matches its documentation. C<reset> only resets C<m?...?> | |
1928 | patterns when called with no argument. An empty string for an argument now | |
1929 | does nothing. (It used to be treated as no argument.) [perl #97958] | |
1930 | ||
1931 | =item * | |
1932 | ||
1933 | C<printf> with an argument returning an empty list no longer reads past the | |
1934 | end of the stack, resulting in erratic behaviour. [perl #77094] | |
1935 | ||
1936 | =item * | |
1937 | ||
1938 | C<--subname> no longer produces erroneous ambiguity warnings. | |
1939 | [perl #77240] | |
1940 | ||
1941 | =item * | |
1942 | ||
1943 | C<v10> is now allowed as a label or package name. This was inadvertently | |
e612b5a0 | 1944 | broken when v-strings were added in Perl v5.6. [perl #56880] |
5ed58cbd RS |
1945 | |
1946 | =item * | |
1947 | ||
1948 | C<length>, C<pos>, C<substr> and C<sprintf> could be confused by ties, | |
1949 | overloading, references and typeglobs if the stringification of such | |
be12dd22 | 1950 | changed the internal representation to or from UTF-8. [perl #114410] |
5ed58cbd RS |
1951 | |
1952 | =item * | |
1953 | ||
1954 | utf8::encode now calls FETCH and STORE on tied variables. utf8::decode now | |
1955 | calls STORE (it was already calling FETCH). | |
1956 | ||
1957 | =item * | |
1958 | ||
1959 | C<$tied =~ s/$non_utf8/$utf8/> no longer loops infinitely if the tied | |
1960 | variable returns a Latin-1 string, shared hash key scalar, or reference or | |
2ae351f8 | 1961 | typeglob that stringifies as ASCII or Latin-1. This was a regression from |
e612b5a0 | 1962 | v5.12. |
5ed58cbd RS |
1963 | |
1964 | =item * | |
1965 | ||
1966 | C<s///> without /e is now better at detecting when it needs to forego | |
1967 | certain optimisations, fixing some buggy cases: | |
1968 | ||
1969 | =over | |
1970 | ||
1971 | =item * | |
1972 | ||
1973 | Match variables in certain constructs (C<&&>, C<||>, C<..> and others) in | |
1974 | the replacement part; e.g., C<s/(.)/$l{$a||$1}/g>. [perl #26986] | |
1975 | ||
1976 | =item * | |
1977 | ||
1978 | Aliases to match variables in the replacement. | |
1979 | ||
1980 | =item * | |
1981 | ||
1982 | C<$REGERROR> or C<$REGMARK> in the replacement. [perl #49190] | |
1983 | ||
1984 | =item * | |
1985 | ||
1986 | An empty pattern (C<s//$foo/>) that causes the last-successful pattern to | |
1987 | be used, when that pattern contains code blocks that modify the variables | |
1988 | in the replacement. | |
1989 | ||
1990 | =back | |
1991 | ||
1992 | =item * | |
1993 | ||
1994 | The taintedness of the replacement string no longer affects the taintedness | |
1995 | of the return value of C<s///e>. | |
1996 | ||
1997 | =item * | |
1998 | ||
1999 | The C<$|> autoflush variable is created on-the-fly when needed. If this | |
2000 | happened (e.g., if it was mentioned in a module or eval) when the | |
2001 | currently-selected filehandle was a typeglob with an empty IO slot, it used | |
2002 | to crash. [perl #115206] | |
2003 | ||
2004 | =item * | |
2005 | ||
2006 | Line numbers at the end of a string eval are no longer off by one. | |
2007 | [perl #114658] | |
2008 | ||
2009 | =item * | |
2010 | ||
2011 | @INC filters (subroutines returned by subroutines in @INC) that set $_ to a | |
2012 | copy-on-write scalar no longer cause the parser to modify that string | |
2013 | buffer in place. | |
2014 | ||
2015 | =item * | |
2016 | ||
2017 | C<length($object)> no longer returns the undefined value if the object has | |
2018 | string overloading that returns undef. [perl #115260] | |
2019 | ||
2020 | =item * | |
2021 | ||
2022 | The use of C<PL_stashcache>, the stash name lookup cache for method calls, has | |
2023 | been restored, | |
2024 | ||
2025 | Commit da6b625f78f5f133 in August 2011 inadvertently broke the code that looks | |
2026 | up values in C<PL_stashcache>. As it's a only cache, quite correctly everything | |
2027 | carried on working without it. | |
2028 | ||
2029 | =item * | |
2030 | ||
e612b5a0 | 2031 | The error "Can't localize through a reference" had disappeared in v5.16.0 |
5ed58cbd | 2032 | when C<local %$ref> appeared on the last line of an lvalue subroutine. |
e612b5a0 | 2033 | This error disappeared for C<\local %$ref> in perl v5.8.1. It has now |
5ed58cbd RS |
2034 | been restored. |
2035 | ||
2036 | =item * | |
2037 | ||
2038 | The parsing of here-docs has been improved significantly, fixing several | |
2039 | parsing bugs and crashes and one memory leak, and correcting wrong | |
2040 | subsequent line numbers under certain conditions. | |
2041 | ||
2042 | =item * | |
2043 | ||
2044 | Inside an eval, the error message for an unterminated here-doc no longer | |
2045 | has a newline in the middle of it [perl #70836]. | |
2046 | ||
2047 | =item * | |
2048 | ||
2049 | A substitution inside a substitution pattern (C<s/${s|||}//>) no longer | |
2050 | confuses the parser. | |
2051 | ||
2052 | =item * | |
2053 | ||
2054 | It may be an odd place to allow comments, but C<s//"" # hello/e> has | |
2055 | always worked, I<unless> there happens to be a null character before the | |
2056 | first #. Now it works even in the presence of nulls. | |
2057 | ||
2058 | =item * | |
2059 | ||
2060 | An invalid range in C<tr///> or C<y///> no longer results in a memory leak. | |
2061 | ||
2062 | =item * | |
2063 | ||
2064 | String eval no longer treats a semicolon-delimited quote-like operator at | |
2065 | the very end (C<eval 'q;;'>) as a syntax error. | |
2066 | ||
2067 | =item * | |
2068 | ||
2069 | C<< warn {$_ => 1} + 1 >> is no longer a syntax error. The parser used to | |
2070 | get confused with certain list operators followed by an anonymous hash and | |
2071 | then an infix operator that shares its form with a unary operator. | |
2072 | ||
2073 | =item * | |
2074 | ||
2075 | C<(caller $n)[6]> (which gives the text of the eval) used to return the | |
2076 | actual parser buffer. Modifying it could result in crashes. Now it always | |
2077 | returns a copy. The string returned no longer has "\n;" tacked on to the | |
2078 | end. The returned text also includes here-doc bodies, which used to be | |
2079 | omitted. | |
2080 | ||
2081 | =item * | |
2082 | ||
be12dd22 RS |
2083 | Reset the UTF-8 position cache when accessing magical variables to avoid the |
2084 | string buffer and the UTF-8 position cache getting out of sync | |
5ed58cbd RS |
2085 | [perl #114410]. |
2086 | ||
2087 | =item * | |
2088 | ||
be12dd22 RS |
2089 | Various cases of get magic being called twice for magical UTF-8 |
2090 | strings have been fixed. | |
5ed58cbd RS |
2091 | |
2092 | =item * | |
2093 | ||
2094 | This code (when not in the presence of C<$&> etc) | |
2095 | ||
2096 | $_ = 'x' x 1_000_000; | |
2097 | 1 while /(.)/; | |
2098 | ||
2099 | used to skip the buffer copy for performance reasons, but suffered from C<$1> | |
2100 | etc changing if the original string changed. That's now been fixed. | |
2101 | ||
2102 | =item * | |
2103 | ||
2104 | Perl doesn't use PerlIO anymore to report out of memory messages, as PerlIO | |
2105 | might attempt to allocate more memory. | |
2106 | ||
2107 | =item * | |
2108 | ||
2109 | In a regular expression, if something is quantified with C<{n,m}> where | |
2110 | C<S<n E<gt> m>>, it can't possibly match. Previously this was a fatal | |
2111 | error, but now is merely a warning (and that something won't match). | |
2112 | [perl #82954]. | |
2113 | ||
2114 | =item * | |
2115 | ||
2116 | It used to be possible for formats defined in subroutines that have | |
2117 | subsequently been undefined and redefined to close over variables in the | |
2118 | wrong pad (the newly-defined enclosing sub), resulting in crashes or | |
2119 | "Bizarre copy" errors. | |
2120 | ||
2121 | =item * | |
2122 | ||
2123 | Redefinition of XSUBs at run time could produce warnings with the wrong | |
2124 | line number. | |
2125 | ||
2126 | =item * | |
2127 | ||
2128 | The %vd sprintf format does not support version objects for alpha versions. | |
2129 | It used to output the format itself (%vd) when passed an alpha version, and | |
2130 | also emit an "Invalid conversion in printf" warning. It no longer does, | |
2131 | but produces the empty string in the output. It also no longer leaks | |
2132 | memory in this case. | |
2133 | ||
2134 | =item * | |
2135 | ||
2136 | C<< $obj->SUPER::method >> calls in the main package could fail if the | |
2137 | SUPER package had already been accessed by other means. | |
2138 | ||
2139 | =item * | |
2140 | ||
f105b7be | 2141 | Stash aliasing (C<< *foo:: = *bar:: >>) no longer causes SUPER calls to ignore |
5ed58cbd RS |
2142 | changes to methods or @ISA or use the wrong package. |
2143 | ||
2144 | =item * | |
2145 | ||
2146 | Method calls on packages whose names end in ::SUPER are no longer treated | |
2147 | as SUPER method calls, resulting in failure to find the method. | |
2148 | Furthermore, defining subroutines in such packages no longer causes them to | |
2149 | be found by SUPER method calls on the containing package [perl #114924]. | |
2150 | ||
2151 | =item * | |
2152 | ||
2153 | C<\w> now matches the code points U+200C (ZERO WIDTH NON-JOINER) and U+200D | |
2154 | (ZERO WIDTH JOINER). C<\W> no longer matches these. This change is because | |
2155 | Unicode corrected their definition of what C<\w> should match. | |
2156 | ||
2157 | =item * | |
2158 | ||
2159 | C<dump LABEL> no longer leaks its label. | |
2160 | ||
2161 | =item * | |
2162 | ||
2163 | Constant folding no longer changes the behaviour of functions like C<stat()> | |
2164 | and C<truncate()> that can take either filenames or handles. | |
2165 | C<stat 1 ? foo : bar> nows treats its argument as a file name (since it is an | |
2166 | arbitrary expression), rather than the handle "foo". | |
2167 | ||
2168 | =item * | |
2169 | ||
2170 | C<truncate FOO, $len> no longer falls back to treating "FOO" as a file name if | |
e612b5a0 | 2171 | the filehandle has been deleted. This was broken in Perl v5.16.0. |
5ed58cbd RS |
2172 | |
2173 | =item * | |
2174 | ||
2175 | Subroutine redefinitions after sub-to-glob and glob-to-glob assignments no | |
2176 | longer cause double frees or panic messages. | |
2177 | ||
2178 | =item * | |
2179 | ||
2180 | C<s///> now turns vstrings into plain strings when performing a substitution, | |
2181 | even if the resulting string is the same (C<s/a/a/>). | |
2182 | ||
2183 | =item * | |
2184 | ||
2185 | Prototype mismatch warnings no longer erroneously treat constant subs as having | |
2186 | no prototype when they actually have "". | |
2187 | ||
2188 | =item * | |
2189 | ||
2190 | Constant subroutines and forward declarations no longer prevent prototype | |
2191 | mismatch warnings from omitting the sub name. | |
2192 | ||
2193 | =item * | |
2194 | ||
2195 | C<undef> on a subroutine now clears call checkers. | |
2196 | ||
2197 | =item * | |
2198 | ||
e612b5a0 | 2199 | The C<ref> operator started leaking memory on blessed objects in Perl v5.16.0. |
5ed58cbd RS |
2200 | This has been fixed [perl #114340]. |
2201 | ||
2202 | =item * | |
2203 | ||
2204 | C<use> no longer tries to parse its arguments as a statement, making | |
2205 | C<use constant { () };> a syntax error [perl #114222]. | |
2206 | ||
2207 | =item * | |
2208 | ||
2209 | On debugging builds, "uninitialized" warnings inside formats no longer cause | |
2210 | assertion failures. | |
2211 | ||
2212 | =item * | |
2213 | ||
2214 | On debugging builds, subroutines nested inside formats no longer cause | |
2215 | assertion failures [perl #78550]. | |
2216 | ||
2217 | =item * | |
2218 | ||
2219 | Formats and C<use> statements are now permitted inside formats. | |
2220 | ||
2221 | =item * | |
2222 | ||
2223 | C<print $x> and C<sub { print $x }-E<gt>()> now always produce the same output. | |
2224 | It was possible for the latter to refuse to close over $x if the variable was | |
2225 | not active; e.g., if it was defined outside a currently-running named | |
2226 | subroutine. | |
2227 | ||
2228 | =item * | |
2229 | ||
2230 | Similarly, C<print $x> and C<print eval '$x'> now produce the same output. | |
2231 | This also allows "my $x if 0" variables to be seen in the debugger [perl | |
2232 | #114018]. | |
2233 | ||
2234 | =item * | |
2235 | ||
2236 | Formats called recursively no longer stomp on their own lexical variables, but | |
2237 | each recursive call has its own set of lexicals. | |
2238 | ||
2239 | =item * | |
2240 | ||
2241 | Attempting to free an active format or the handle associated with it no longer | |
2242 | results in a crash. | |
2243 | ||
2244 | =item * | |
2245 | ||
2246 | Format parsing no longer gets confused by braces, semicolons and low-precedence | |
2247 | operators. It used to be possible to use braces as format delimiters (instead | |
2248 | of C<=> and C<.>), but only sometimes. Semicolons and low-precedence operators | |
2249 | in format argument lines no longer confuse the parser into ignoring the line's | |
2250 | return value. In format argument lines, braces can now be used for anonymous | |
2251 | hashes, instead of being treated always as C<do> blocks. | |
2252 | ||
2253 | =item * | |
2254 | ||
2255 | Formats can now be nested inside code blocks in regular expressions and other | |
2256 | quoted constructs (C</(?{...})/> and C<qq/${...}/>) [perl #114040]. | |
2257 | ||
2258 | =item * | |
2259 | ||
2260 | Formats are no longer created after compilation errors. | |
2261 | ||
2262 | =item * | |
2263 | ||
2264 | Under debugging builds, the B<-DA> command line option started crashing in Perl | |
e612b5a0 | 2265 | v5.16.0. It has been fixed [perl #114368]. |
5ed58cbd RS |
2266 | |
2267 | =item * | |
2268 | ||
2269 | A potential deadlock scenario involving the premature termination of a pseudo- | |
2270 | forked child in a Windows build with ithreads enabled has been fixed. This | |
2271 | resolves the common problem of the F<t/op/fork.t> test hanging on Windows [perl | |
2272 | #88840]. | |
2273 | ||
2274 | =item * | |
2275 | ||
5ed58cbd RS |
2276 | The code which generates errors from C<require()> could potentially read one or |
2277 | two bytes before the start of the filename for filenames less than three bytes | |
2278 | long and ending C</\.p?\z/>. This has now been fixed. Note that it could | |
2279 | never have happened with module names given to C<use()> or C<require()> anyway. | |
2280 | ||
2281 | =item * | |
2282 | ||
2283 | The handling of pathnames of modules given to C<require()> has been made | |
2284 | thread-safe on VMS. | |
2285 | ||
2286 | =item * | |
2287 | ||
d85cd26b RS |
2288 | Non-blocking sockets have been fixed on VMS. |
2289 | ||
2290 | =item * | |
2291 | ||
5ed58cbd RS |
2292 | A bug in the compilation of a C</(?{})/> expression which affected the TryCatch |
2293 | test suite has been fixed [perl #114242]. | |
2294 | ||
2295 | =item * | |
2296 | ||
2297 | Pod can now be nested in code inside a quoted construct outside of a string | |
2298 | eval. This used to work only within string evals [perl #114040]. | |
2299 | ||
2300 | =item * | |
2301 | ||
2302 | C<goto ''> now looks for an empty label, producing the "goto must have | |
2303 | label" error message, instead of exiting the program [perl #111794]. | |
2304 | ||
2305 | =item * | |
2306 | ||
2307 | C<goto "\0"> now dies with "Can't find label" instead of "goto must have | |
2308 | label". | |
2309 | ||
2310 | =item * | |
2311 | ||
2312 | The C function C<hv_store> used to result in crashes when used on C<%^H> | |
2313 | [perl #111000]. | |
2314 | ||
2315 | =item * | |
2316 | ||
2317 | A call checker attached to a closure prototype via C<cv_set_call_checker> | |
2318 | is now copied to closures cloned from it. So C<cv_set_call_checker> now | |
2319 | works inside an attribute handler for a closure. | |
2320 | ||
2321 | =item * | |
2322 | ||
2323 | Writing to C<$^N> used to have no effect. Now it croaks with "Modification | |
2324 | of a read-only value" by default, but that can be overridden by a custom | |
2325 | regular expression engine, as with C<$1> [perl #112184]. | |
2326 | ||
2327 | =item * | |
2328 | ||
2329 | C<undef> on a control character glob (C<undef *^H>) no longer emits an | |
2330 | erroneous warning about ambiguity [perl #112456]. | |
2331 | ||
2332 | =item * | |
2333 | ||
2334 | For efficiency's sake, many operators and built-in functions return the | |
2335 | same scalar each time. Lvalue subroutines and subroutines in the CORE:: | |
2336 | namespace were allowing this implementation detail to leak through. | |
2337 | C<print &CORE::uc("a"), &CORE::uc("b")> used to print "BB". The same thing | |
2338 | would happen with an lvalue subroutine returning the return value of C<uc>. | |
2339 | Now the value is copied in such cases. | |
2340 | ||
2341 | =item * | |
2342 | ||
2343 | C<method {}> syntax with an empty block or a block returning an empty list | |
2344 | used to crash or use some random value left on the stack as its invocant. | |
2345 | Now it produces an error. | |
2346 | ||
2347 | =item * | |
2348 | ||
2349 | C<vec> now works with extremely large offsets (E<gt>2 GB) [perl #111730]. | |
2350 | ||
2351 | =item * | |
2352 | ||
2353 | Changes to overload settings now take effect immediately, as do changes to | |
2354 | inheritance that affect overloading. They used to take effect only after | |
2355 | C<bless>. | |
2356 | ||
2357 | Objects that were created before a class had any overloading used to remain | |
2358 | non-overloaded even if the class gained overloading through C<use overload> | |
2359 | or @ISA changes, and even after C<bless>. This has been fixed | |
2360 | [perl #112708]. | |
2361 | ||
2362 | =item * | |
2363 | ||
2364 | Classes with overloading can now inherit fallback values. | |
2365 | ||
2366 | =item * | |
2367 | ||
2368 | Overloading was not respecting a fallback value of 0 if there were | |
2369 | overloaded objects on both sides of an assignment operator like C<+=> | |
2370 | [perl #111856]. | |
2371 | ||
2372 | =item * | |
2373 | ||
2374 | C<pos> now croaks with hash and array arguments, instead of producing | |
2375 | erroneous warnings. | |
2376 | ||
2377 | =item * | |
2378 | ||
2379 | C<while(each %h)> now implies C<while(defined($_ = each %h))>, like | |
2380 | C<readline> and C<readdir>. | |
2381 | ||
2382 | =item * | |
2383 | ||
2384 | Subs in the CORE:: namespace no longer crash after C<undef *_> when called | |
2385 | with no argument list (C<&CORE::time> with no parentheses). | |
2386 | ||
2387 | =item * | |
2388 | ||
2389 | C<unpack> no longer produces the "'/' must follow a numeric type in unpack" | |
2390 | error when it is the data that are at fault [perl #60204]. | |
2391 | ||
2392 | =item * | |
2393 | ||
2394 | C<join> and C<"@array"> now call FETCH only once on a tied C<$"> | |
2395 | [perl #8931]. | |
2396 | ||
2397 | =item * | |
2398 | ||
2399 | Some subroutine calls generated by compiling core ops affected by a | |
2400 | C<CORE::GLOBAL> override had op checking performed twice. The checking | |
2401 | is always idempotent for pure Perl code, but the double checking can | |
2402 | matter when custom call checkers are involved. | |
2403 | ||
2404 | =item * | |
2405 | ||
2406 | A race condition used to exist around fork that could cause a signal sent to | |
2407 | the parent to be handled by both parent and child. Signals are now blocked | |
2408 | briefly around fork to prevent this from happening [perl #82580]. | |
2409 | ||
2410 | =item * | |
2411 | ||
2412 | The implementation of code blocks in regular expressions, such as C<(?{})> | |
2413 | and C<(??{})>, has been heavily reworked to eliminate a whole slew of bugs. | |
2414 | The main user-visible changes are: | |
2415 | ||
2416 | =over 4 | |
2417 | ||
2418 | =item * | |
2419 | ||
2420 | Code blocks within patterns are now parsed in the same pass as the | |
2421 | surrounding code; in particular it is no longer necessary to have balanced | |
2422 | braces: this now works: | |
2423 | ||
2424 | /(?{ $x='{' })/ | |
2425 | ||
2426 | This means that this error message is no longer generated: | |
2427 | ||
2428 | Sequence (?{...}) not terminated or not {}-balanced in regex | |
2429 | ||
2430 | but a new error may be seen: | |
2431 | ||
2432 | Sequence (?{...}) not terminated with ')' | |
2433 | ||
2434 | In addition, literal code blocks within run-time patterns are only | |
2435 | compiled once, at perl compile-time: | |
2436 | ||
2437 | for my $p (...) { | |
2438 | # this 'FOO' block of code is compiled once, | |
2439 | # at the same time as the surrounding 'for' loop | |
2440 | /$p{(?{FOO;})/; | |
2441 | } | |
2442 | ||
2443 | =item * | |
2444 | ||
2445 | Lexical variables are now sane as regards scope, recursion and closure | |
2446 | behavior. In particular, C</A(?{B})C/> behaves (from a closure viewpoint) | |
2447 | exactly like C</A/ && do { B } && /C/>, while C<qr/A(?{B})C/> is like | |
2448 | C<sub {/A/ && do { B } && /C/}>. So this code now works how you might | |
2449 | expect, creating three regexes that match 0, 1, and 2: | |
2450 | ||
2451 | for my $i (0..2) { | |
2452 | push @r, qr/^(??{$i})$/; | |
2453 | } | |
2454 | "1" =~ $r[1]; # matches | |
2455 | ||
2456 | =item * | |
2457 | ||
2458 | The C<use re 'eval'> pragma is now only required for code blocks defined | |
2459 | at runtime; in particular in the following, the text of the C<$r> pattern is | |
2460 | still interpolated into the new pattern and recompiled, but the individual | |
2461 | compiled code-blocks within C<$r> are reused rather than being recompiled, | |
2462 | and C<use re 'eval'> isn't needed any more: | |
2463 | ||
2464 | my $r = qr/abc(?{....})def/; | |
2465 | /xyz$r/; | |
2466 | ||
2467 | =item * | |
2468 | ||
2469 | Flow control operators no longer crash. Each code block runs in a new | |
2470 | dynamic scope, so C<next> etc. will not see | |
2471 | any enclosing loops. C<return> returns a value | |
2472 | from the code block, not from any enclosing subroutine. | |
2473 | ||
2474 | =item * | |
2475 | ||
2476 | Perl normally caches the compilation of run-time patterns, and doesn't | |
2477 | recompile if the pattern hasn't changed, but this is now disabled if | |
2478 | required for the correct behavior of closures. For example: | |
2479 | ||
2480 | my $code = '(??{$x})'; | |
2481 | for my $x (1..3) { | |
2482 | # recompile to see fresh value of $x each time | |
2483 | $x =~ /$code/; | |
2484 | } | |
2485 | ||
2486 | =item * | |
2487 | ||
2488 | The C</msix> and C<(?msix)> etc. flags are now propagated into the return | |
2489 | value from C<(??{})>; this now works: | |
2490 | ||
2491 | "AB" =~ /a(??{'b'})/i; | |
2492 | ||
2493 | =item * | |
2494 | ||
2495 | Warnings and errors will appear to come from the surrounding code (or for | |
2496 | run-time code blocks, from an eval) rather than from an C<re_eval>: | |
2497 | ||
2498 | use re 'eval'; $c = '(?{ warn "foo" })'; /$c/; | |
2499 | /(?{ warn "foo" })/; | |
2500 | ||
2501 | formerly gave: | |
2502 | ||
2503 | foo at (re_eval 1) line 1. | |
2504 | foo at (re_eval 2) line 1. | |
2505 | ||
2506 | and now gives: | |
2507 | ||
2508 | foo at (eval 1) line 1. | |
2509 | foo at /some/prog line 2. | |
2510 | ||
2511 | =back | |
2512 | ||
2513 | =item * | |
2514 | ||
2e7bc647 KW |
2515 | Perl now can be recompiled to use any Unicode version. In v5.16, it |
2516 | worked on Unicodes 6.0 and 6.1, but there were various bugs if earlier | |
2517 | releases were used; the older the release the more problems. | |
5ed58cbd RS |
2518 | |
2519 | =item * | |
2520 | ||
2521 | C<vec> no longer produces "uninitialized" warnings in lvalue context | |
2522 | [perl #9423]. | |
2523 | ||
2524 | =item * | |
2525 | ||
2526 | An optimization involving fixed strings in regular expressions could cause | |
2527 | a severe performance penalty in edge cases. This has been fixed | |
2528 | [perl #76546]. | |
2529 | ||
2530 | =item * | |
2531 | ||
2532 | In certain cases, including empty subpatterns within a regular expression (such | |
2533 | as C<(?:)> or C<(?:|)>) could disable some optimizations. This has been fixed. | |
2534 | ||
2535 | =item * | |
2536 | ||
2537 | The "Can't find an opnumber" message that C<prototype> produces when passed | |
2538 | a string like "CORE::nonexistent_keyword" now passes UTF-8 and embedded | |
2539 | NULs through unchanged [perl #97478]. | |
2540 | ||
2541 | =item * | |
2542 | ||
2543 | C<prototype> now treats magical variables like C<$1> the same way as | |
2544 | non-magical variables when checking for the CORE:: prefix, instead of | |
2545 | treating them as subroutine names. | |
2546 | ||
2547 | =item * | |
2548 | ||
2549 | Under threaded perls, a runtime code block in a regular expression could | |
2550 | corrupt the package name stored in the op tree, resulting in bad reads | |
2551 | in C<caller>, and possibly crashes [perl #113060]. | |
2552 | ||
2553 | =item * | |
2554 | ||
2555 | Referencing a closure prototype (C<\&{$_[1]}> in an attribute handler for a | |
2556 | closure) no longer results in a copy of the subroutine (or assertion | |
2557 | failures on debugging builds). | |
2558 | ||
2559 | =item * | |
2560 | ||
2561 | C<eval '__PACKAGE__'> now returns the right answer on threaded builds if | |
2562 | the current package has been assigned over (as in | |
2563 | C<*ThisPackage:: = *ThatPackage::>) [perl #78742]. | |
2564 | ||
2565 | =item * | |
2566 | ||
2567 | If a package is deleted by code that it calls, it is possible for C<caller> | |
2568 | to see a stack frame belonging to that deleted package. C<caller> could | |
2569 | crash if the stash's memory address was reused for a scalar and a | |
2570 | substitution was performed on the same scalar [perl #113486]. | |
2571 | ||
2572 | =item * | |
2573 | ||
2574 | C<UNIVERSAL::can> no longer treats its first argument differently | |
2575 | depending on whether it is a string or number internally. | |
2576 | ||
2577 | =item * | |
2578 | ||
2579 | C<open> with C<< <& >> for the mode checks to see whether the third argument is | |
2580 | a number, in determining whether to treat it as a file descriptor or a handle | |
2581 | name. Magical variables like C<$1> were always failing the numeric check and | |
2582 | being treated as handle names. | |
2583 | ||
2584 | =item * | |
2585 | ||
2586 | C<warn>'s handling of magical variables (C<$1>, ties) has undergone several | |
2587 | fixes. C<FETCH> is only called once now on a tied argument or a tied C<$@> | |
2588 | [perl #97480]. Tied variables returning objects that stringify as "" are | |
2589 | no longer ignored. A tied C<$@> that happened to return a reference the | |
2590 | I<previous> time it was used is no longer ignored. | |
2591 | ||
2592 | =item * | |
2593 | ||
2594 | C<warn ""> now treats C<$@> with a number in it the same way, regardless of | |
2595 | whether it happened via C<$@=3> or C<$@="3">. It used to ignore the | |
2596 | former. Now it appends "\t...caught", as it has always done with | |
2597 | C<$@="3">. | |
2598 | ||
2599 | =item * | |
2600 | ||
2601 | Numeric operators on magical variables (e.g., S<C<$1 + 1>>) used to use | |
2602 | floating point operations even where integer operations were more appropriate, | |
2603 | resulting in loss of accuracy on 64-bit platforms [perl #109542]. | |
2604 | ||
2605 | =item * | |
2606 | ||
2607 | Unary negation no longer treats a string as a number if the string happened | |
2608 | to be used as a number at some point. So, if C<$x> contains the string "dogs", | |
2609 | C<-$x> returns "-dogs" even if C<$y=0+$x> has happened at some point. | |
2610 | ||
2611 | =item * | |
2612 | ||
e612b5a0 | 2613 | In Perl v5.14, C<-'-10'> was fixed to return "10", not "+10". But magical |
5ed58cbd RS |
2614 | variables (C<$1>, ties) were not fixed till now [perl #57706]. |
2615 | ||
2616 | =item * | |
2617 | ||
2618 | Unary negation now treats strings consistently, regardless of the internal | |
2619 | C<UTF8> flag. | |
2620 | ||
2621 | =item * | |
2622 | ||
2623 | A regression introduced in Perl v5.16.0 involving | |
2624 | C<tr/I<SEARCHLIST>/I<REPLACEMENTLIST>/> has been fixed. Only the first | |
2625 | instance is supposed to be meaningful if a character appears more than | |
2626 | once in C<I<SEARCHLIST>>. Under some circumstances, the final instance | |
2627 | was overriding all earlier ones. [perl #113584] | |
2628 | ||
2629 | =item * | |
2630 | ||
2631 | Regular expressions like C<qr/\87/> previously silently inserted a NUL | |
2632 | character, thus matching as if it had been written C<qr/\00087/>. Now it | |
2633 | matches as if it had been written as C<qr/87/>, with a message that the | |
2634 | sequence C<"\8"> is unrecognized. | |
2635 | ||
2636 | =item * | |
2637 | ||
2638 | C<__SUB__> now works in special blocks (C<BEGIN>, C<END>, etc.). | |
2639 | ||
2640 | =item * | |
2641 | ||
2642 | Thread creation on Windows could theoretically result in a crash if done | |
2643 | inside a C<BEGIN> block. It still does not work properly, but it no longer | |
2644 | crashes [perl #111610]. | |
2645 | ||
2646 | =item * | |
2647 | ||
2648 | C<\&{''}> (with the empty string) now autovivifies a stub like any other | |
2649 | sub name, and no longer produces the "Unable to create sub" error | |
2650 | [perl #94476]. | |
2651 | ||
2652 | =item * | |
2653 | ||
2654 | A regression introduced in v5.14.0 has been fixed, in which some calls | |
2655 | to the C<re> module would clobber C<$_> [perl #113750]. | |
2656 | ||
2657 | =item * | |
2658 | ||
2659 | C<do FILE> now always either sets or clears C<$@>, even when the file can't be | |
2660 | read. This ensures that testing C<$@> first (as recommended by the | |
2661 | documentation) always returns the correct result. | |
2662 | ||
2663 | =item * | |
2664 | ||
2665 | The array iterator used for the C<each @array> construct is now correctly | |
2666 | reset when C<@array> is cleared (RT #75596). This happens for example when the | |
2667 | array is globally assigned to, as in C<@array = (...)>, but not when its | |
2668 | B<values> are assigned to. In terms of the XS API, it means that C<av_clear()> | |
2669 | will now reset the iterator. | |
2670 | ||
2671 | This mirrors the behaviour of the hash iterator when the hash is cleared. | |
2672 | ||
2673 | =item * | |
2674 | ||
2675 | C<< $class->can >>, C<< $class->isa >>, and C<< $class->DOES >> now return | |
2676 | correct results, regardless of whether that package referred to by C<$class> | |
2677 | exists [perl #47113]. | |
2678 | ||
2679 | =item * | |
2680 | ||
2681 | Arriving signals no longer clear C<$@> [perl #45173]. | |
2682 | ||
2683 | =item * | |
2684 | ||
2685 | Allow C<my ()> declarations with an empty variable list [perl #113554]. | |
2686 | ||
2687 | =item * | |
2688 | ||
2689 | During parsing, subs declared after errors no longer leave stubs | |
2690 | [perl #113712]. | |
2691 | ||
2692 | =item * | |
2693 | ||
2694 | Closures containing no string evals no longer hang on to their containing | |
2695 | subroutines, allowing variables closed over by outer subroutines to be | |
2696 | freed when the outer sub is freed, even if the inner sub still exists | |
2697 | [perl #89544]. | |
2698 | ||
2699 | =item * | |
2700 | ||
2701 | Duplication of in-memory filehandles by opening with a "<&=" or ">&=" mode | |
e612b5a0 | 2702 | stopped working properly in v5.16.0. It was causing the new handle to |
5ed58cbd RS |
2703 | reference a different scalar variable. This has been fixed [perl #113764]. |
2704 | ||
2705 | =item * | |
2706 | ||
2707 | C<qr//> expressions no longer crash with custom regular expression engines | |
2708 | that do not set C<offs> at regular expression compilation time | |
2709 | [perl #112962]. | |
2710 | ||
2711 | =item * | |
2712 | ||
2713 | C<delete local> no longer crashes with certain magical arrays and hashes | |
2714 | [perl #112966]. | |
2715 | ||
2716 | =item * | |
2717 | ||
2718 | C<local> on elements of certain magical arrays and hashes used not to | |
2719 | arrange to have the element deleted on scope exit, even if the element did | |
2720 | not exist before C<local>. | |
2721 | ||
2722 | =item * | |
2723 | ||
2724 | C<scalar(write)> no longer returns multiple items [perl #73690]. | |
2725 | ||
2726 | =item * | |
2727 | ||
2728 | String to floating point conversions no longer misparse certain strings under | |
2729 | C<use locale> [perl #109318]. | |
2730 | ||
2731 | =item * | |
2732 | ||
2733 | C<@INC> filters that die no longer leak memory [perl #92252]. | |
2734 | ||
2735 | =item * | |
2736 | ||
2737 | The implementations of overloaded operations are now called in the correct | |
2738 | context. This allows, among other things, being able to properly override | |
2739 | C<< <> >> [perl #47119]. | |
2740 | ||
2741 | =item * | |
2742 | ||
2743 | Specifying only the C<fallback> key when calling C<use overload> now behaves | |
2744 | properly [perl #113010]. | |
2745 | ||
2746 | =item * | |
2747 | ||
2748 | C<< sub foo { my $a = 0; while ($a) { ... } } >> and | |
2749 | C<< sub foo { while (0) { ... } } >> now return the same thing [perl #73618]. | |
2750 | ||
2751 | =item * | |
2752 | ||
2753 | String negation now behaves the same under C<use integer;> as it does | |
2754 | without [perl #113012]. | |
2755 | ||
2756 | =item * | |
2757 | ||
2758 | C<chr> now returns the Unicode replacement character (U+FFFD) for -1, | |
2759 | regardless of the internal representation. -1 used to wrap if the argument | |
2760 | was tied or a string internally. | |
2761 | ||
2762 | =item * | |
2763 | ||
2764 | Using a C<format> after its enclosing sub was freed could crash as of | |
e612b5a0 | 2765 | perl v5.12.0, if the format referenced lexical variables from the outer sub. |
5ed58cbd RS |
2766 | |
2767 | =item * | |
2768 | ||
2769 | Using a C<format> after its enclosing sub was undefined could crash as of | |
e612b5a0 | 2770 | perl v5.10.0, if the format referenced lexical variables from the outer sub. |
5ed58cbd RS |
2771 | |
2772 | =item * | |
2773 | ||
2774 | Using a C<format> defined inside a closure, which format references | |
2775 | lexical variables from outside, never really worked unless the C<write> | |
e612b5a0 | 2776 | call was directly inside the closure. In v5.10.0 it even started crashing. |
5ed58cbd RS |
2777 | Now the copy of that closure nearest the top of the call stack is used to |
2778 | find those variables. | |
2779 | ||
2780 | =item * | |
2781 | ||
2782 | Formats that close over variables in special blocks no longer crash if a | |
2783 | stub exists with the same name as the special block before the special | |
2784 | block is compiled. | |
2785 | ||
2786 | =item * | |
2787 | ||
2788 | The parser no longer gets confused, treating C<eval foo ()> as a syntax | |
2789 | error if preceded by C<print;> [perl #16249]. | |
2790 | ||
2791 | =item * | |
2792 | ||
2793 | The return value of C<syscall> is no longer truncated on 64-bit platforms | |
2794 | [perl #113980]. | |
2795 | ||
2796 | =item * | |
2797 | ||
2798 | Constant folding no longer causes C<print 1 ? FOO : BAR> to print to the | |
2799 | FOO handle [perl #78064]. | |
2800 | ||
2801 | =item * | |
2802 | ||
2803 | C<do subname> now calls the named subroutine and uses the file name it | |
2804 | returns, instead of opening a file named "subname". | |
2805 | ||
2806 | =item * | |
2807 | ||
2808 | Subroutines looked up by rv2cv check hooks (registered by XS modules) are | |
2809 | now taken into consideration when determining whether C<foo bar> should be | |
2810 | the sub call C<foo(bar)> or the method call C<< "bar"->foo >>. | |
2811 | ||
2812 | =item * | |
2813 | ||
2814 | C<CORE::foo::bar> is no longer treated specially, allowing global overrides | |
2815 | to be called directly via C<CORE::GLOBAL::uc(...)> [perl #113016]. | |
2816 | ||
2817 | =item * | |
2818 | ||
2819 | Calling an undefined sub whose typeglob has been undefined now produces the | |
2820 | customary "Undefined subroutine called" error, instead of "Not a CODE | |
2821 | reference". | |
2822 | ||
2823 | =item * | |
2824 | ||
2825 | Two bugs involving @ISA have been fixed. C<*ISA = *glob_without_array> and | |
2826 | C<undef *ISA; @{*ISA}> would prevent future modifications to @ISA from | |
2827 | updating the internal caches used to look up methods. The | |
e612b5a0 | 2828 | *glob_without_array case was a regression from Perl v5.12. |
5ed58cbd RS |
2829 | |
2830 | =item * | |
2831 | ||
2832 | Regular expression optimisations sometimes caused C<$> with C</m> to | |
2833 | produce failed or incorrect matches [perl #114068]. | |
2834 | ||
2835 | =item * | |
2836 | ||
2837 | C<__SUB__> now works in a C<sort> block when the enclosing subroutine is | |
2838 | predeclared with C<sub foo;> syntax [perl #113710]. | |
2839 | ||
2840 | =item * | |
2841 | ||
2842 | Unicode properties only apply to Unicode code points, which leads to | |
2843 | some subtleties when regular expressions are matched against | |
2844 | above-Unicode code points. There is a warning generated to draw your | |
2845 | attention to this. However, this warning was being generated | |
2846 | inappropriately in some cases, such as when a program was being parsed. | |
2847 | Non-Unicode matches such as C<\w> and C<[:word;]> should not generate the | |
2848 | warning, as their definitions don't limit them to apply to only Unicode | |
2849 | code points. Now the message is only generated when matching against | |
2850 | C<\p{}> and C<\P{}>. There remains a bug, [perl #114148], for the very | |
2851 | few properties in Unicode that match just a single code point. The | |
2852 | warning is not generated if they are matched against an above-Unicode | |
2853 | code point. | |
2854 | ||
2855 | =item * | |
2856 | ||
2857 | Uninitialized warnings mentioning hash elements would only mention the | |
2858 | element name if it was not in the first bucket of the hash, due to an | |
2859 | off-by-one error. | |
2860 | ||
2861 | =item * | |
2862 | ||
2863 | A regular expression optimizer bug could cause multiline "^" to behave | |
2864 | incorrectly in the presence of line breaks, such that | |
2865 | C<"/\n\n" =~ m#\A(?:^/$)#im> would not match [perl #115242]. | |
2866 | ||
2867 | =item * | |
2868 | ||
2869 | Failed C<fork> in list context no longer corrupts the stack. | |
2870 | C<@a = (1, 2, fork, 3)> used to gobble up the 2 and assign C<(1, undef, 3)> | |
2871 | if the C<fork> call failed. | |
2872 | ||
2873 | =item * | |
2874 | ||
2875 | Numerous memory leaks have been fixed, mostly involving tied variables that | |
2876 | die, regular expression character classes and code blocks, and syntax | |
2877 | errors. | |
2878 | ||
2879 | =item * | |
2880 | ||
2881 | Assigning a regular expression (C<${qr//}>) to a variable that happens to | |
2882 | hold a floating point number no longer causes assertion failures on | |
2883 | debugging builds. | |
2884 | ||
2885 | =item * | |
2886 | ||
2887 | Assigning a regular expression to a scalar containing a number no longer | |
f105b7be | 2888 | causes subsequent numification to produce random numbers. |
5ed58cbd RS |
2889 | |
2890 | =item * | |
2891 | ||
2892 | Assigning a regular expression to a magic variable no longer wipes away the | |
e612b5a0 | 2893 | magic. This was a regression from v5.10. |
5ed58cbd RS |
2894 | |
2895 | =item * | |
2896 | ||
2897 | Assigning a regular expression to a blessed scalar no longer results in | |
e612b5a0 | 2898 | crashes. This was also a regression from v5.10. |
5ed58cbd RS |
2899 | |
2900 | =item * | |
2901 | ||
2902 | Regular expression can now be assigned to tied hash and array elements with | |
2903 | flattening into strings. | |
2904 | ||
2905 | =item * | |
2906 | ||
f105b7be | 2907 | Numifying a regular expression no longer results in an uninitialized |
5ed58cbd RS |
2908 | warning. |
2909 | ||
2910 | =item * | |
2911 | ||
2912 | Negative array indices no longer cause EXISTS methods of tied variables to | |
e612b5a0 | 2913 | be ignored. This was a regression from v5.12. |
5ed58cbd RS |
2914 | |
2915 | =item * | |
2916 | ||
2917 | Negative array indices no longer result in crashes on arrays tied to | |
2918 | non-objects. | |
2919 | ||
2920 | =item * | |
2921 | ||
be12dd22 RS |
2922 | C<$byte_overload .= $utf8> no longer results in doubly-encoded UTF-8 if the |
2923 | left-hand scalar happened to have produced a UTF-8 string the last time | |
5ed58cbd RS |
2924 | overloading was invoked. |
2925 | ||
2926 | =item * | |
2927 | ||
2928 | C<goto &sub> now uses the current value of @_, instead of using the array | |
2929 | the subroutine was originally called with. This means | |
2930 | C<local @_ = (...); goto &sub> now works [perl #43077]. | |
2931 | ||
2932 | =item * | |
2933 | ||
2934 | If a debugger is invoked recursively, it no longer stomps on its own | |
2935 | lexical variables. Formerly under recursion all calls would share the same | |
2936 | set of lexical variables [perl #115742]. | |
2937 | ||
2938 | =item * | |
2939 | ||
2940 | C<*_{ARRAY}> returned from a subroutine no longer spontaneously | |
2941 | becomes empty. | |
2942 | ||
2943 | =back | |
2944 | ||
2945 | =head1 Known Problems | |
2946 | ||
2947 | =over 4 | |
2948 | ||
2949 | =item * | |
2950 | ||
7f50b25b | 2951 | There are no known regressions. Please report any bugs you find! |
5ed58cbd RS |
2952 | |
2953 | =back | |
2954 | ||
2955 | =head1 Acknowledgements | |
a75569c0 | 2956 | |
5ed58cbd | 2957 | XXX Generate this with: |
a75569c0 | 2958 | |
5ed58cbd | 2959 | perl Porting/acknowledgements.pl v5.18.0..HEAD |
f5b73711 | 2960 | |
44691e6f AB |
2961 | =head1 Reporting Bugs |
2962 | ||
e08634c5 SH |
2963 | If you find what you think is a bug, you might check the articles recently |
2964 | posted to the comp.lang.perl.misc newsgroup and the perl bug database at | |
2965 | http://rt.perl.org/perlbug/ . There may also be information at | |
2966 | http://www.perl.org/ , the Perl Home Page. | |
44691e6f | 2967 | |
e08634c5 SH |
2968 | If you believe you have an unreported bug, please run the L<perlbug> program |
2969 | included with your release. Be sure to trim your bug down to a tiny but | |
2970 | sufficient test case. Your bug report, along with the output of C<perl -V>, | |
2971 | will be sent off to perlbug@perl.org to be analysed by the Perl porting team. | |
44691e6f AB |
2972 | |
2973 | If the bug you are reporting has security implications, which make it | |
e08634c5 SH |
2974 | inappropriate to send to a publicly archived mailing list, then please send it |
2975 | to perl5-security-report@perl.org. This points to a closed subscription | |
2976 | unarchived mailing list, which includes all the core committers, who will be | |
2977 | able to help assess the impact of issues, figure out a resolution, and help | |
f9001595 | 2978 | co-ordinate the release of patches to mitigate or fix the problem across all |
e08634c5 SH |
2979 | platforms on which Perl is supported. Please only use this address for |
2980 | security issues in the Perl core, not for modules independently distributed on | |
2981 | CPAN. | |
44691e6f AB |
2982 | |
2983 | =head1 SEE ALSO | |
2984 | ||
e08634c5 SH |
2985 | The F<Changes> file for an explanation of how to view exhaustive details on |
2986 | what changed. | |
44691e6f AB |
2987 | |
2988 | The F<INSTALL> file for how to build Perl. | |
2989 | ||
2990 | The F<README> file for general stuff. | |
2991 | ||
2992 | The F<Artistic> and F<Copying> files for copyright information. | |
2993 | ||
2994 | =cut |