=head1 NAME

perltodo - Perl TO-DO List

=head1 DESCRIPTION

This is a list of wishes for Perl. Send updates to
I<perl5-porters@perl.org>. If you want to work on any of these
projects, be sure to check the perl5-porters archives for past ideas,
flames, and propaganda. This will save you time and also prevent you
from implementing something that Larry has already vetoed. One set
of archives may be found at:

    http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/

=head1 To do during 5.6.x

=head2 Support for I/O disciplines

C<perlio> provides this, but the interface could be a lot more
straightforward.

=head2 Autoload bytes.pm

When the lexer sees, for instance, C<bytes::length>, it should
automatically load the C<bytes> pragma.
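
Until such autoloading exists, the module has to be loaded explicitly
before C<bytes::length> is called. The following is a minimal sketch of
that explicit form; the variable names are illustrative only.

```perl
use strict;
use warnings;

# Load the module without switching this scope to byte semantics.
use bytes ();

# One character above U+00FF; Perl stores it internally as two UTF-8 bytes.
my $string = "\x{100}";

my $chars  = length($string);          # counts characters
my $octets = bytes::length($string);   # counts bytes of the internal encoding

print "characters: $chars, bytes: $octets\n";
```

With autoloading, the C<use bytes ();> line would become unnecessary.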

=head2 Make "\u{XXXX}" et al work

Danger, Will Robinson! Discussing the semantics of C<"\x{F00}">,
C<"\xF00"> and C<"\U{F00}"> on P5P I<will> lead to a long and boring
flamewar.

=head2 Create a char *sv_pvprintify(sv, STRLEN *lenp, UV flags)

For displaying PVs with control characters, embedded nulls, and Unicode.
This would be useful for printing warnings, or data and regex dumping,
not_a_number(), and so on.

Requirements: should handle both byte and UTF8 strings. isPRINT()
characters printed as-is, characters less than 256 as \xHH, Unicode
characters as \x{HHH}. Don't assume ASCII-like, either; get somebody
on EBCDIC to test the output.

Possible options, controlled by the flags:
- whitespace (other than ' ' of isPRINT()) printed as-is
- use isPRINT_LC() instead of isPRINT()
- print control characters like this: "\cA"
- print control characters like this: "^A"
- non-PRINTables printed as '.' instead of \xHH
- use \OOO instead of \xHH
- use the C/Perl-metacharacters like \n, \t
- have a maximum length for the produced string (read it from *lenp)
- append a "..." to the produced string if the maximum length is exceeded
- really fancy: print unicode characters as \N{...}
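
The basic requirements can be prototyped at the Perl level before any C
is written. The sketch below is only an illustration: C<printify> and
its C<$max_len> argument are hypothetical names, it hard-codes the ASCII
printable range (which the requirements above explicitly warn against
for EBCDIC), and it implements none of the optional flags except the
length limit.

```perl
use strict;
use warnings;

# Hypothetical Perl-level prototype of the proposed function.
# Assumes an ASCII platform: 0x20..0x7E are treated as printable.
sub printify {
    my ($string, $max_len) = @_;
    my $out = '';
    for my $char (split //, $string) {
        my $ord = ord $char;
        if ($ord >= 0x20 && $ord <= 0x7E) {
            $out .= $char;                       # printable, as-is
        }
        elsif ($ord < 256) {
            $out .= sprintf '\\x%02X', $ord;     # byte-sized, as \xHH
        }
        else {
            $out .= sprintf '\\x{%X}', $ord;     # wide character, as \x{HHH}
        }
    }
    # Optional maximum length; note this may cut an escape in half.
    if (defined $max_len && length($out) > $max_len) {
        $out = substr($out, 0, $max_len) . '...';
    }
    return $out;
}

print printify("a\0b\x{263A}\n"), "\n";
```

A real implementation would also have to decide how to display a literal
backslash so that the output stays unambiguous.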

=head2 Overloadable regex assertions

This may or may not be possible with the current regular expression
engine. The idea is that, for instance, C<\b> needs to be
algorithmically computed if you're dealing with Thai text. Hence, the
B<\b> assertion wants to be overloaded by a function.

=head2 Unicode

=over 4

=item *

Allow for long form of the General Category Properties, e.g.
C<\p{IsOpenPunctuation}>, not just the abbreviated form, e.g.
C<\p{IsPs}>.

=item *

Allow for the metaproperties: C<XID Start>, C<XID Continue>,
C<NF*_NO>, C<NF*_MAYBE> (require the DerivedCoreProperties and
DerivedNormalizationProperties files).

There are also multiple value properties still unimplemented:
C<Numeric Type>, C<East Asian Width>.

=item *

    Case Mappings? http://www.unicode.org/unicode/reports/tr21/

lc(), uc(), lcfirst(), and ucfirst() work only for some of the
simplest cases, where the mapping goes from a single Unicode character
to another single Unicode character. See lib/unicore/SpecCase.txt
(and CaseFold.txt).

=item *

The Unicode regular expression guidelines describe some tricks Perl
doesn't yet implement, such as character class subtraction.

    http://www.unicode.org/unicode/reports/tr18/

=back

See L<perlunicode/UNICODE REGULAR EXPRESSION SUPPORT LEVEL> for what's
there and what's missing. Almost all of Levels 2 and 3 is missing,
and as of 5.8.0 not even all of Level 1 is there.
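
Character class subtraction can be approximated with a negative
lookahead in front of the class. This is a sketch of that workaround
using plain ASCII classes, not a statement about how native support
would be spelled or implemented.

```perl
use strict;
use warnings;

# "Lowercase ASCII letters minus the vowels": the lookahead rejects a
# vowel before the class [a-z] gets to consume the character.
my $consonant = qr/(?![aeiou])[a-z]/;

my @consonants = grep { /\A$consonant\z/ } 'a' .. 'z';

print scalar(@consonants), " consonants: @consonants\n";
```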

=head2 use Thread for iThreads

Artur Bergman's C<iThreads> module is a start on this, but needs to
be more mature.

=head2 make perl_clone optionally clone ops

So that pseudoforking, mod_perl, iThreads and nvi will work properly
(but not as efficiently) until the regex engine is fixed to be threadsafe.

=head2 Work out exit/die semantics for threads

=head2 Typed lexicals for compiler

=head2 Compiler workarounds for Win32

=head2 AUTOLOADing in the compiler

=head2 Fixing comppadlist when compiling

=head2 Cleaning up exported namespace

=head2 Complete signal handling

Add C<PERL_ASYNC_CHECK> to opcodes which loop; replace C<sigsetjmp> with
C<sigjmp>; check C<wait> for signal safety.

=head2 Out-of-source builds

This was done for 5.6.0, but needs reworking for 5.7.x.

=head2 POSIX realtime support

POSIX 1003.1 1996 Edition support--realtime stuff: POSIX semaphores,
message queues, shared memory, realtime clocks, timers, signals (the
metaconfig units mostly already exist for these).

=head2 UNIX98 support

Reader-writer locks, realtime/asynchronous IO.

=head2 IPv6 Support

There are non-core modules, such as C<Net::IPv6>, but these will need
integrating when IPv6 actually starts to really happen. See RFC 2292
and RFC 2553.

=head2 Long double conversion

Floating point formatting is still causing some weird test failures.

=head2 Locales

Locales and Unicode interact with each other in unpleasant ways.
One possible solution would be to adopt/support ICU:

    http://oss.software.ibm.com/developerworks/opensource/icu/project/

=head2 Thread-safe regexes

The regular expression engine is currently non-threadsafe.

=head2 Arithmetic on non-Arabic numerals

C<[1234567890]> aren't the only numerals any more.

=head2 POSIX Unicode character classes

([=a=] for equivalence classes, [.ch.] for collation.)
These are dependent on Unicode normalization and collation.

=head2 Factoring out common suffixes/prefixes in regexps (trie optimization)

Currently, the user has to optimize C<foo|far> and C<foo|goo> into
C<f(?:oo|ar)> and C<[fg]oo> by hand; this could be done automatically.
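
The hand-factored forms accept exactly the same strings as the plain
alternations, which is what makes an automatic rewrite safe. The sketch
below checks that on a small, arbitrarily chosen word list; it says
nothing about the relative speed of the two forms.

```perl
use strict;
use warnings;

# Each pair holds the plain alternation and its hand-factored equivalent.
my @pairs = (
    [ qr/\A(?:foo|far)\z/, qr/\Af(?:oo|ar)\z/ ],
    [ qr/\A(?:foo|goo)\z/, qr/\A[fg]oo\z/     ],
);

my @words      = qw(foo far goo fao gar oo f);
my $mismatches = 0;

for my $pair (@pairs) {
    my ($plain, $factored) = @$pair;
    for my $word (@words) {
        my $plain_hit    = $word =~ $plain    ? 1 : 0;
        my $factored_hit = $word =~ $factored ? 1 : 0;
        $mismatches++ if $plain_hit != $factored_hit;
    }
}

print "mismatches: $mismatches\n";
```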

=head2 Security audit shipped utilities

All the code we ship with Perl needs to be sensible about temporary file
handling, locking, input validation, and so on.

=head2 Custom opcodes

Have a way to introduce user-defined opcodes without the subroutine call
overhead of an XSUB; the user should be able to create PP code. Simon
Cozens has some ideas on this.

=head2 DLL Versioning

Windows needs a way to know what version of an XS or C<libperl> DLL it's
loading.

=head2 Introduce @( and @)

C<$(> may return "foo bar baz". Unfortunately, since groups can
theoretically have spaces in their names, this could be one, two or
three groups.

=head2 Floating point handling

C<NaN> and C<inf> support is particularly troublesome.
(fp_classify(), fp_class(), fp_class_d(), class(), isinf(),
isfinite(), finite(), isnormal(), unordered(), <ieeefp.h>,
<fp_class.h> (there are metaconfig units for all these) (I think),
fp_setmask(), fp_getmask(), fp_setround(), fp_getround()
(no metaconfig units yet for these). Don't forget finitel(), fp_classl(),
fp_class_l(), (yes, both do, unfortunately, exist), and unorderedl().)

As of Perl 5.6.1, there is a Perl macro, Perl_isnan().

=head2 IV/UV preservation

Nicholas Clark has done a lot of work on this, but work is continuing.
C<+>, C<-> and C<*> work, but guards need to be in place for C<%>, C</>,
C<&>, C<oct>, C<hex> and C<pack>.

=head2 Replace pod2html with something using Pod::Parser

The CPAN module C<Malik::Pod::Html> may be a more suitable basis for a
C<pod2html> converter; the current one duplicates the functionality
abstracted in C<Pod::Parser>, which makes updating the POD language
difficult.

=head2 Automate module testing on CPAN

When a new Perl is being beta tested, porters have to manually grab
their favourite CPAN modules and test them - this should be done
automatically.

=head2 sendmsg and recvmsg

We have all the other BSD socket functions but these. There are
metaconfig units for these functions which can be added. To avoid these
being new opcodes, a solution similar to the way C<sockatmark> was added
would be preferable. (Autoload the C<IO::whatever> module.)

=head2 Rewrite perlre documentation

The new-style patterns need full documentation, and the whole document
needs to be a lot clearer.

=head2 Convert example code to IO::Handle filehandles

=head2 Document Win32 choices

=head2 Check new modules

=head2 Make roffitall find pods and libs itself

Simon Cozens has done some work on this but it needs a rethink.

=head1 To do at some point

These are ideas that have been regularly tossed around, that most
people believe should be done maybe during 5.8.x.

=head2 Remove regular expression recursion

Because the regular expression engine is recursive, badly designed
expressions can lead to lots of recursion filling up the stack. Ilya
claims that it is easy to convert the engine to being iterative, but
this has still not yet been done. There may be a regular expression
engine hit squad meeting at TPC5.

=head2 Memory leaks after failed eval

Perl will leak memory if you C<eval "hlagh hlagh hlagh hlagh">. This is
partially because it attempts to build up an op tree for that code and
doesn't properly free it. The same goes for non-syntactically-correct
regular expressions. Hugo looked into this, but decided it needed a
mark-and-sweep GC implementation.

Alan notes that: The basic idea was to extend the parser token stack
(C<YYSTYPE>) to include a type field so we knew what sort of thing each
element of the stack was. The F<perly.c> code would then have to be
postprocessed to record the type of each entry on the stack as it was
created, and the parser patched so that it could unroll the stack
properly on error.

This is possible to do, but would be pretty messy to implement, as it
would rely on even more sed hackery in F<perly.fixer>.

=head2 pack "(stuff)*"

That's to say, C<pack "(sI)40"> would be the same as C<pack "sI"x40>.
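
The intended equivalence can be checked directly on a perl whose
C<pack> accepts parenthesised groups (later releases document them in
L<perlfunc/pack>). The sketch below wraps the grouped call in C<eval>
so that it still runs, and reports the fact, on a perl that rejects the
template; the sample data is arbitrary.

```perl
use strict;
use warnings;

my @values = (1 .. 80);    # 40 (short, unsigned int) pairs

# The spelling available everywhere: repeat the template string.
my $repeated = pack("sI" x 40, @values);

# The grouped spelling; pack dies on a template it cannot parse.
my $grouped = eval { pack("(sI)40", @values) };

if (defined $grouped && $grouped eq $repeated) {
    print "grouped and repeated templates produce identical output\n";
}
else {
    print "this perl does not support grouped templates\n";
}
```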

=head2 bitfields in pack

=head2 Cross compilation

Make Perl buildable with a cross-compiler. This will play havoc with
Configure, which needs to know how the target system will respond to
its tests; maybe C<microperl> will be a good starting point here.
(Indeed, Bart Schuller reports that he compiled up C<microperl> for
the Agenda PDA and it works fine.) A really big spanner in the works
is the bootstrapping build process of Perl: if the filesystem the
target system sees is not the same as what the build host sees, various
input, output, and (Perl) library files need to be copied back and forth.

As of 5.8.0 Configure mostly works for cross-compilation
(used successfully for iPAQ Linux), miniperl gets built,
but then building DynaLoader (and other extensions) fails
since MakeMaker knows nothing of cross-compilation.
(See INSTALL/Cross-compilation for the state of things.)

=head2 Perl preprocessor / macros

Source filters help with this, but do not get us all the way. For
instance, it should be possible to implement the C<??> operator somehow;
source filters don't (quite) cut it.

=head2 Perl lexer in Perl

Damian Conway is planning to work on this, but it hasn't happened yet.

=head2 Using POSIX calls internally

When faced with a BSD vs. SysV-style interface to some library or
system function, perl's roots show in that it typically prefers the BSD
interface (but falls back to the SysV one). One example is getpgrp().
Other examples include C<memcpy> vs. C<bcopy>. There are others, mostly in
F<pp_sys.c>.

Mostly, this item is a suggestion for which way to start a journey into
an C<#ifdef> forest. It is not primarily a suggestion to eliminate any of
the C<#ifdef> forests.

POSIX calls are perhaps more likely to be portable to unexpected
architectures. They are also perhaps more likely to be actively
maintained by a current vendor. They are also perhaps more likely to be
available in thread-safe versions, if appropriate.

=head2 -i rename file when changed

It's only necessary to rename a file when inplace editing when the file
has changed. Detecting a change is perhaps the difficult bit.

=head2 All ARGV input should act like E<lt>E<gt>

e.g. C<read(ARGV, ...)> doesn't currently read across multiple files.

=head2 Support for rerunning debugger

There should be a way of restarting the debugger on demand.

=head2 Test Suite for the Debugger

The debugger is a complex piece of software and fixing something
here may inadvertently break something else over there. To tame
this chaotic behaviour, a test suite is necessary.

=head2 my sub foo { }

The basic principle is sound, but there are problems with the semantics
of self-referential and mutually referential lexical subs: how to
declare the subs?

=head2 One-pass global destruction

Sweeping away all the allocated memory in one go is a laudable goal, but
it's difficult and in most cases, it's easier to let the memory get
freed by exiting.

=head2 Rewrite regexp parser

There has been talk recently of rewriting the regular expression parser
to produce an optree instead of a chain of opcodes; it's unclear whether
or not this would be a win.

=head2 Cache recently used regexps

This is to speed up

    for my $re (@regexps) {
        $matched++ if /$re/
    }

C<qr//> already gives us a way of saving compiled regexps, but it should
be done automatically.
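
The manual C<qr//> approach mentioned above looks roughly like this.
It is a sketch of the workaround, with made-up patterns and input, and
it makes no claim about how an automatic cache would behave.

```perl
use strict;
use warnings;

my @patterns = ('^foo', 'bar$', '\d+');

# Compile each pattern once, up front, instead of interpolating the
# pattern string into a match on every pass through the loop.
my @compiled = map { qr/$_/ } @patterns;

my $matched = 0;
for my $line ('foo 1', 'no match here', 'crowbar') {
    for my $re (@compiled) {
        $matched++ if $line =~ $re;
    }
}

print "$matched matches\n";
```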

=head2 Re-entrant functions

Add configure probes for C<_r> forms of system calls and fit them to the
core. Unfortunately, calling conventions for these functions are not
standardised.

=head2 Cross-compilation support

Bart Schuller reports that using C<microperl> and a cross-compiler, he
got Perl working on the Agenda PDA. However, one cannot build a full
Perl because Configure needs to get the results for the target platform,
not for the host.

=head2 Bit-shifting bitvectors

Given:

    vec($v, 1000, 1) = 1;

One should be able to do

    $v <<= 1;

and have the 999th bit set.

Currently, if you try to shift bitvectors, you shift the NV/UV instead
of the bits in the PV. Not very logical.
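
The desired behaviour can be emulated at the Perl level by going
through a bit string. In the sketch below C<shift_bits_down> is a
hypothetical helper, and the shift direction simply follows the example
above (bit 1000 ends up at position 999).

```perl
use strict;
use warnings;

# Hypothetical helper: move every bit of a vec() string one position
# towards index 0, dropping bit 0 and clearing the top bit.
sub shift_bits_down {
    my ($vector) = @_;
    my $bits = unpack 'b*', $vector;   # character i is vec($vector, i, 1)
    return $vector if $bits eq '';
    $bits = substr($bits, 1) . '0';
    return pack 'b*', $bits;
}

my $v = '';
vec($v, 1000, 1) = 1;
$v = shift_bits_down($v);

print "bit 999 is ", vec($v, 999, 1),
      ", bit 1000 is ", vec($v, 1000, 1), "\n";
```

Expanding to one character per bit is wasteful for large vectors; a
built-in operator would work on the PV directly.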

=head2 debugger pragma

The debugger is implemented in Perl in F<perl5db.pl>; turning it into a
pragma should be easy, but making it work lexically might be more
difficult. Fiddling with C<$^P> would be necessary.

=head2 use less pragma

Identify areas where speed/memory tradeoffs can be made and have a hint
to switch between them.

=head2 switch structures

Although we have C<Switch.pm> in core, Larry points to the dormant
C<nswitch> and C<cswitch> ops in F<pp.c>; using these opcodes would be
much faster.

=head2 Cache eval tree

=head2 rcatmaybe

=head2 Shrink opcode tables

=head2 Optimize away @_

Look at the "reification" code in C<av.c>.

=head2 Prototypes versus indirect objects

Currently, indirect object syntax bypasses prototype checks.

=head2 Install HTML

HTML versions of the documentation need to be installed by default; a
call to C<installhtml> from C<installperl> may be all that's necessary.

=head2 Prototype method calls

=head2 Return context prototype declarations

=head2 magic_setisa

=head2 Garbage collection

There have been persistent mumblings about putting a mark-and-sweep
garbage detector into Perl; Alan Burlison has some ideas about this.

=head2 IO tutorial

Mark-Jason Dominus has the beginnings of one of these.

=head2 pack/unpack tutorial

Simon Cozens has the beginnings of one of these.

=head2 Rewrite perldoc

There are a few suggestions for what to do with C<perldoc>: maybe a
full-text search, an index function, locating pages on a particular
high-level subject, and so on.

=head2 Install .3p manpages

This is a bone of contention; we can create C<.3p> manpages for each
built-in function, but should we install them by default? Tcl does this,
and it clutters up C<apropos>.

=head2 Unicode tutorial

Simon Cozens promises to do this before he gets old.

=head2 Update POSIX.pm for 1003.1-2

=head2 Retargetable installation

Allow C<@INC> to be changed after Perl is built.

=head2 POSIX emulation on non-POSIX systems

Make C<POSIX.pm> behave as POSIXly as possible everywhere, meaning we
have to implement POSIX equivalents for some functions if necessary.

=head2 Rename Win32 headers

=head2 Finish off lvalue functions

They don't work in the debugger, and they don't work for list or hash
slices.

=head2 Update sprintf documentation

Hugo van der Sanden plans to look at this.

=head2 Use fchown/fchmod internally

This has been done in places, but needs a thorough code review.
Also fchdir is available on some platforms.

=head1 Vague ideas

Ideas which have been discussed, and which may or may not happen.

=head2 ref() in list context

It's unclear what this should do or how to do it without breaking old
code.

=head2 Make tr/// return histogram of characters in list context

There is a patch for this, but it may require Unicodification.

=head2 Compile to real threaded code

=head2 Structured types

=head2 Modifiable $1 et al.

    ($x = "elephant") =~ /e(ph)/;
    $1 = "g";  # $x = "elegant"

What happens if there are multiple (nested?) brackets? What if the
string changes between the match and the assignment?
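
The effect can be had today with C<substr> and the match offset arrays
C<@-> and C<@+>. This sketch only covers the simple single-group case
and sidesteps the open questions above by assigning immediately after
the match.

```perl
use strict;
use warnings;

my $x = "elephant";

if ($x =~ /e(ph)/) {
    # $-[1] and $+[1] are the start and end offsets of the first
    # capture group within the matched string.
    substr($x, $-[1], $+[1] - $-[1]) = "g";
}

print "$x\n";
```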

=head2 Procedural interfaces for IO::*, etc.

Some core modules have been accused of being overly-OO. Adding
procedural interfaces could demystify them.

=head2 RPC modules

=head2 Attach/detach debugger from running program

With C<gdb>, you can attach the debugger to a running program if you
pass the process ID. It would be good to do this with the Perl debugger
on a running Perl program, although I'm not sure how it would be done.

=head2 Alternative RE syntax module

    use Regex::Newbie;
    $re = Regex::Newbie->new
                       ->start
                       ->match("foo")
                       ->repeat(Regex::Newbie->class("char"),3)
                       ->end;
    /$re/;

=head2 GUI::Native

A non-core module that would use "native" GUI to create graphical
applications.

=head2 foreach(reverse ...)

Currently

    foreach (reverse @_) { ... }

puts C<@_> on the stack, reverses it putting the reversed version on the
stack, then iterates forwards. Instead, it could be special-cased to put
C<@_> on the stack then iterate backwards.
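
At the Perl level, the same effect is available by walking the indices
downwards, which avoids building the reversed list. The sketch below
illustrates that idea only; C<visit_backwards> is a made-up name and
the proposed optimisation itself would live in the core.

```perl
use strict;
use warnings;

# Visit the arguments last-to-first without calling reverse.
sub visit_backwards {
    my @seen;
    for (my $i = $#_; $i >= 0; $i--) {
        push @seen, $_[$i];
    }
    return @seen;
}

my @order = visit_backwards(qw(a b c));
print "@order\n";
```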

=head2 Constant function cache

=head2 Approximate regular expression matching

=head1 Ongoing

These items B<always> need doing:

=head2 Update guts documentation

Simon Cozens tries to do this when possible, and contributions to the
C<perlapi> documentation are welcome.

=head2 Add more tests

Michael Schwern will donate $500 to Yet Another Society when all core
modules have tests.

=head2 Update auxiliary tools

The code we ship with Perl should look like good Perl 5.

=head1 Recently done things

These are things which have been on the todo lists in previous releases
but have recently been completed.

=head2 Safe signal handling

A new signal model went into 5.7.1 without much fanfare. Operations and
C<malloc>s are no longer interrupted by signals, which are handled
between opcodes. This means that C<PERL_ASYNC_CHECK> now actually does
something. However, there are still a few things that need to be done.

=head2 Tie Modules

Modules which implement arrays in terms of strings, substrings or files
can be found on the CPAN.

=head2 gettimeofday

C<Time::HiRes> has been integrated into the core.

=head2 setitimer and getitimer

Adding C<Time::HiRes> got us this too.

=head2 Testing __DIE__ hook

Tests have been added.

=head2 CPP equivalent in Perl

A C Yardley will probably have done this by the time you can read this.
This allows for a generalization of the C constant detection used in
building C<Errno.pm>.

=head2 Explicit switch statements

C<Switch.pm> has been integrated into the core to give you all manner of
C<switch...case> semantics.

=head2 autocroak

This is C<Fatal.pm>.

=head2 UTF/EBCDIC

Nick Ing-Simmons has made UTF-EBCDIC (UTR16) work with Perl.

    EBCDIC? http://www.unicode.org/unicode/reports/tr16/

=head2 UTF Regexes

Although there are probably some small bugs to be rooted out, Jarkko
Hietaniemi has made regular expressions polymorphic between bytes and
characters.

=head2 perlcc to produce executable

C<perlcc> was recently rewritten, and can now produce standalone
executables.

=head2 END blocks saved in compiled output

=head2 Secure temporary file module

Tim Jenness' C<File::Temp> is now in core.

=head2 Integrate Time::HiRes

This module is now part of core.

=head2 Turn Cwd into XS

Benjamin Sugars has done this.

=head2 Mmap for input

Nick Ing-Simmons' C<perlio> supports an C<mmap> IO method.

=head2 Byte to/from UTF8 and UTF8 to/from local conversion

C<Encode> provides this.

=head2 Add sockatmark support

Added in 5.7.1.

=head2 Mailing list archives

http://lists.perl.org/, http://archive.develooper.com/

=head2 Bug tracking

Richard Foley has written the bug tracking system at http://bugs.perl.org/

=head2 Integrate MacPerl

Chris Nandor and Matthias Neeracher have integrated the MacPerl changes
into 5.6.0.

=head2 Web "nerve center" for Perl

http://use.perl.org/ is what you're looking for.

=head2 Regular expression tutorial

C<perlretut>, provided by Mark Kvale.

=head2 Debugging Tutorial

C<perldebtut>, written by Richard Foley.

=head2 Integrate new modules

Jarkko has been integrating madly into 5.7.x.

=head2 Integrate profiler

C<Devel::DProf> is now a core module.

=head2 Y2K error detection

There's a configure option to detect unsafe concatenation with "19", and
a CPAN module. (C<D'oh::Year>)

=head2 Regular expression debugger

While not part of core, Mark-Jason Dominus has written C<Rx> and has
also come up with a generalised strategy for regular expression
debugging.

=head2 POD checker

That's, uh, F<podchecker>.

=head2 "Dynamic" lexicals

=head2 Cache precompiled modules

=head1 Deprecated Wishes

These are items which used to be in the todo file, but have been
deprecated for some reason.

=head2 Loop control on do{}

This would break old code; use C<do{{ }}> instead.
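
The doubled braces work because the inner pair is a bare block, and a
bare block counts as a loop that executes once, so C<next> leaves it
and control falls through to the C<until> test. A small sketch with
arbitrary data:

```perl
use strict;
use warnings;

my $i = 0;
my @seen;

do {{
    next if $i % 2;     # leaves the inner bare block, not the do-until
    push @seen, $i;
}} until ++$i >= 5;

print "@seen\n";
```

Note that C<last> inside the inner block also only leaves that block;
stopping the surrounding loop early needs a label on an enclosing block.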

=head2 Lexically scoped typeglobs

Not needed now we have lexical IO handles.

=head2 format BOTTOM

=head2 report HANDLE

Damian Conway's text formatting modules seem to be the Way To Go.

=head2 Generalised want()/caller()

=head2 Named prototypes

These both seem to be delayed until Perl 6.

=head2 Built-in globbing

The C<File::Glob> module has been used to replace the C<glob> function.

=head2 Regression tests for suidperl

C<suidperl> is deprecated in favour of common sense.

=head2 Cached hash values

We have shared hash keys, which perform the same job.

=head2 Add compression modules

The compression modules are a little heavy; meanwhile, Nick Clark is
working on experimental pragmata to do transparent decompression on
input.

=head2 Reorganise documentation into tutorials/references

Could not get consensus on P5P about this.

=head2 Remove distinction between functions and operators

Caution: highly flammable.

=head2 Make XS easier to use

Use C<Inline> instead, or SWIG.

=head2 Make embedding easier to use

Use C<Inline::CPR>.

=head2 man for perl

See the Perl Power Tools. (http://language.perl.com/ppt/)

=head2 my $Package::variable

Use C<our> instead.

=head2 "or" tests defined, not truth

Suggesting this on P5P B<will> cause a boring and interminable flamewar.

=head2 "class"-based lexicals

Use flyweight objects, secure hashes or, dare I say it, pseudo-hashes instead.
(Or whatever will replace pseudohashes in 5.10.)

=head2 byteperl

C<ByteLoader> covers this.

=head2 Lazy evaluation / tail recursion removal

C<List::Util> gives first() (a short-circuiting grep); tail recursion
removal is done manually, with C<goto &whoami;>. (However, MJD has
found that C<goto &whoami> introduces a performance penalty, so maybe
there should be a way to do this after all: C<sub foo {START: ... goto
START;> is better.)
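
The three techniques named above can be put side by side. This is a
sketch with made-up subroutine names; it shows that the two manual
tail-call styles compute the same result and makes no measurement of
the performance difference reported for C<goto &sub>.

```perl
use strict;
use warnings;
use List::Util qw(first);

# first() stops scanning as soon as the block returns true.
my $hit = first { $_ > 10 } (3, 8, 15, 42);

# Manual tail call: goto &sub replaces the current call with a new one,
# passing along whatever is in @_ at that point.
sub sum_to_goto_sub {
    my ($n, $acc) = @_;
    return $acc if $n == 0;
    @_ = ($n - 1, $acc + $n);
    goto &sum_to_goto_sub;
}

# The same computation with a label: update the lexicals and jump back.
sub sum_to_label {
    my ($n, $acc) = @_;
  START:
    return $acc if $n == 0;
    ($n, $acc) = ($n - 1, $acc + $n);
    goto START;
}

print "$hit ", sum_to_goto_sub(1000, 0), " ", sum_to_label(1000, 0), "\n";
```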

=head2 Make "use utf8" the default

Because of backward compatibility this is difficult: scripts could not
contain B<any legacy eight-bit data> (like Latin-1) anymore, even in
string literals or pod. It would also introduce a measurable slowdown of
at least a few percent, since all regular expression operations would
be done in full UTF-8. But if you want to try this, add
-DUSE_UTF8_SCRIPTS to your compilation flags.

=head2 Unicode collation and normalization

The Unicode::Collate and Unicode::Normalize modules
by SADAHIRO Tomoyuki have been included since 5.8.0.

    Collation?     http://www.unicode.org/unicode/reports/tr10/
    Normalization? http://www.unicode.org/unicode/reports/tr15/

=head2 Create debugging macros

Debugging macros (like printsv, dump) can make debugging perl inside a
C debugger much easier. A good set for gdb comes with mod_perl.
Something similar should be distributed with perl.

The proper way to do this is to use and extend Devel::DebugInit.
Devel::DebugInit also needs to be extended to support threads.

See p5p archives for late May/early June 2001 for a recent discussion
on this topic.

=cut