This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Tailor \b{wb} for Perl
author Karl Williamson <khw@cpan.org>
Tue, 5 Jan 2016 23:12:55 +0000 (16:12 -0700)
committer Karl Williamson <khw@cpan.org>
Fri, 8 Jan 2016 21:17:11 +0000 (14:17 -0700)
commit f1f6961f5a6fd77a3e3c36f242f1b72ce5dfe205
tree 52365bdb2759341217eb979be04a61f5b351eb2f
parent cbdbe9d466e0d26852ca1ace0825220c8ca7d215

As defined by Unicode, \b{wb} matches at the boundary between adjacent
space characters in a span of them.  This is the opposite of what \b
does, and is counterintuitive to Perl expectations.  This commit tailors
\b{wb} so that it does not split up spans of white space.
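
A minimal sketch of the behavior described above, assuming a perl that
includes this tailoring (5.24 is assumed here as the first stable release
to carry it).  The sample string and variable names are hypothetical and
chosen only for illustration.

```perl
use v5.24;      # assumption: first stable release expected to include the tailoring
use warnings;

# Hypothetical sample: two words separated by a span of two spaces.
my $text = "one  two";

# Split the string at every position where \b{wb} matches.
my @pieces = split /\b{wb}/, $text;

# Bracket each piece so that the white space is visible.
say join ' ', map { "[$_]" } @pieces;
```

With the tailoring in place, the two spaces should come back as a single
piece.  Under the untailored Unicode rules each space would be a piece of
its own.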

I have submitted a request to Unicode to re-examine their algorithm, and
this has been assigned to a subcommittee to look at, but the result
won't be available until after 5.24 is done.  In any event, Unicode
encourages tailoring for local conditions.
charclass_invlists.h
lib/unicore/mktables
pod/perldelta.pod
pod/perlrebackslash.pod
regcharclass.h
regen/mk_invlists.pl
regexec.c