This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
regen/mk_invlists.pl: Simplify handling of early Unicode releases
authorKarl Williamson <khw@cpan.org>
Sat, 23 Dec 2017 19:39:16 +0000 (12:39 -0700)
committerKarl Williamson <khw@cpan.org>
Mon, 25 Dec 2017 00:12:03 +0000 (17:12 -0700)
commit226b74db842dc6466710b278cae98a80b0f77458
treee10b233b09d56bbebc11e8fe39ced47499f9e253
parent39c4defe660921fe7b772ebc0f18b2473a6c5a4c
regen/mk_invlists.pl: Simplify handling of early Unicode releases

Several Unicode properties are used internally in regex matching, for
example \b{wb} and \X.  The code is supposed to compile and work as best
it can on all Unicode releases.  But the rules for these have been
expanded by Unicode over the years, including new types of boundary
conditions.  Each such type is given an enum value in an enum type for
the given boundary.  For early Unicodes, this enum isn't present.

This commit simplifies how this generating code copes with this,
removing the hard-coded lists in favor of using a different mechanism
already in place for handling slightly different circumstances.
charclass_invlists.h
regen/mk_invlists.pl