perl5.git.perl.org Git - perl5.git/commit

author	Jarkko Hietaniemi <jhi@iki.fi>
	Fri, 11 May 2001 14:08:20 +0000 (14:08 +0000)
committer	Jarkko Hietaniemi <jhi@iki.fi>
	Fri, 11 May 2001 14:08:20 +0000 (14:08 +0000)
commit	3a3c44472a318717ec0cdc0a7f768125ae0f001f
tree	5e068b0d32d9904e2ad08658e1039a9e19088d72	tree \| snapshot
parent	c80f55d1ea34dba13189e54d4f4e9bce7de39357	commit \| diff

Remove the 'asciir' re subpragma.  Should instead implement
the 'physical vs logical' range scheme:

\xAA-\xCC is a native physical range, you want that range of
codepoints in your native encoding.  In EBCDIC the codepoints
in the gaps (between i-j and r-s) should be included.

\x{AA}-\x{CC} is a physical Unicode range, you want that range of
codepoints in Unicode.

a-z is a logical range, you want that range of 'logical' codepoints
in your native encoding.  In EBCDIC the codepoints in the gaps
(between i-j and r-s) should not be included.

Mixed cases (a-\xAA, etc) should either be errors, or maybe
the 'logical' endpoints should be converted to native/Unicode
codepoints, and the range handled as a physical range.

'Logical endpoints' are to be recognized only in the A-Z, a-z,
and 0-9 ranges.  Probably a warning should be given for mixed
cases like A-z or a-9 (since such expressions are encoding
dependent), with a recommendation to use physical ranges.

p4raw-id: //depot/perl@10085

ext/re/re.pm		diff \| blob \| blame \| history
perl.h		diff \| blob \| blame \| history
regcomp.c		diff \| blob \| blame \| history
t/op/pat.t		diff \| blob \| blame \| history
t/op/regexp.t		diff \| blob \| blame \| history