This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
don't mess with the umask()
[perl5.git] / regcomp.sym
CommitLineData
d09b2d29
IZ
1# Format:
2# NAME \t TYPE, arg-description [num-args] [longjump-len] \t DESCRIPTION
3
4# Empty rows and #-comment rows are ignored.
5
6# Exit points
7END END, no End of program.
8SUCCEED END, no Return from a subroutine, basically.
9
10# Anchors:
11BOL BOL, no Match "" at beginning of line.
12MBOL BOL, no Same, assuming multiline.
13SBOL BOL, no Same, assuming singleline.
b85d18e9 14EOS EOL, no Match "" at end of string.
d09b2d29
IZ
15EOL EOL, no Match "" at end of line.
16MEOL EOL, no Same, assuming multiline.
17SEOL EOL, no Same, assuming singleline.
18BOUND BOUND, no Match "" at any word boundary
a0ed51b3 19BOUNDUTF8 BOUND, no Match "" at any word boundary
d09b2d29 20BOUNDL BOUND, no Match "" at any word boundary
a0ed51b3 21BOUNDLUTF8 BOUND, no Match "" at any word boundary
d09b2d29 22NBOUND NBOUND, no Match "" at any word non-boundary
a0ed51b3 23NBOUNDUTF8 NBOUND, no Match "" at any word non-boundary
d09b2d29 24NBOUNDL NBOUND, no Match "" at any word non-boundary
a0ed51b3 25NBOUNDLUTF8 NBOUND, no Match "" at any word non-boundary
d09b2d29
IZ
26GPOS GPOS, no Matches where last m//g left off.
27
28# [Special] alternatives
22c35a8c
GS
29REG_ANY REG_ANY, no Match any one character (except newline).
30ANYUTF8 REG_ANY, no Match any one Unicode character (except newline).
31SANY REG_ANY, no Match any one character.
32SANYUTF8 REG_ANY, no Match any one Unicode character.
d09b2d29 33ANYOF ANYOF, sv Match character in (or not in) this class.
a0ed51b3 34ANYOFUTF8 ANYOF, sv 1 Match character in (or not in) this class.
d09b2d29 35ALNUM ALNUM, no Match any alphanumeric character
d6a28714 36ALNUMUTF8 ALNUM, no Match any alphanumeric character in utf8
d09b2d29 37ALNUML ALNUM, no Match any alphanumeric char in locale
d6a28714 38ALNUMLUTF8 ALNUM, no Match any alphanumeric char in locale+utf8
d09b2d29 39NALNUM NALNUM, no Match any non-alphanumeric character
d6a28714 40NALNUMUTF8 NALNUM, no Match any non-alphanumeric character in utf8
d09b2d29 41NALNUML NALNUM, no Match any non-alphanumeric char in locale
d6a28714 42NALNUMLUTF8 NALNUM, no Match any non-alphanumeric char in locale+utf8
d09b2d29 43SPACE SPACE, no Match any whitespace character
d6a28714 44SPACEUTF8 SPACE, no Match any whitespace character in utf8
d09b2d29 45SPACEL SPACE, no Match any whitespace char in locale
d6a28714 46SPACELUTF8 SPACE, no Match any whitespace char in locale+utf8
d09b2d29 47NSPACE NSPACE, no Match any non-whitespace character
d6a28714 48NSPACEUTF8 NSPACE, no Match any non-whitespace character in utf8
d09b2d29 49NSPACEL NSPACE, no Match any non-whitespace char in locale
d6a28714 50NSPACELUTF8 NSPACE, no Match any non-whitespace char in locale+utf8
d09b2d29 51DIGIT DIGIT, no Match any numeric character
d6a28714 52DIGITUTF8 DIGIT, no Match any numeric character in utf8
b8c5462f 53DIGITL DIGIT, no Match any numeric character in locale
d6a28714 54DIGITLUTF8 DIGIT, no Match any numeric character in locale+utf8
d09b2d29 55NDIGIT NDIGIT, no Match any non-numeric character
d6a28714 56NDIGITUTF8 NDIGIT, no Match any non-numeric character in utf8
b8c5462f 57NDIGITL NDIGIT, no Match any non-numeric character in locale
d6a28714 58NDIGITLUTF8 NDIGIT, no Match any non-numeric character in locale+utf8
a0ed51b3 59CLUMP CLUMP, no Match any combining character sequence
d09b2d29
IZ
60
61# BRANCH The set of branches constituting a single choice are hooked
62# together with their "next" pointers, since precedence prevents
63# anything being concatenated to any individual branch. The
64# "next" pointer of the last BRANCH in a choice points to the
65# thing following the whole choice. This is also where the
66# final "next" pointer of each individual branch points; each
67# branch starts with the operand node of a BRANCH node.
68#
69BRANCH BRANCH, node Match this alternative, or the next...
70
71# BACK Normal "next" pointers all implicitly point forward; BACK
72# exists to make loop structures possible.
73# not used
74BACK BACK, no Match "", "next" ptr points backward.
75
76# Literals
77EXACT EXACT, sv Match this string (preceded by length).
78EXACTF EXACT, sv Match this string, folded (prec. by length).
79EXACTFL EXACT, sv Match this string, folded in locale (w/len).
80
81# Do nothing
82NOTHING NOTHING,no Match empty string.
83# A variant of above which delimits a group, thus stops optimizations
84TAIL NOTHING,no Match empty string. Can jump here from outside.
85
86# STAR,PLUS '?', and complex '*' and '+', are implemented as circular
87# BRANCH structures using BACK. Simple cases (one character
88# per match) are implemented with STAR and PLUS for speed
89# and to minimize recursive plunges.
90#
91STAR STAR, node Match this (simple) thing 0 or more times.
92PLUS PLUS, node Match this (simple) thing 1 or more times.
93
94CURLY CURLY, sv 2 Match this simple thing {n,m} times.
95CURLYN CURLY, no 2 Match next-after-this simple thing
96# {n,m} times, set parenths.
97CURLYM CURLY, no 2 Match this medium-complex thing {n,m} times.
98CURLYX CURLY, sv 2 Match this complex thing {n,m} times.
99
100# This terminator creates a loop structure for CURLYX
101WHILEM WHILEM, no Do curly processing and see if rest matches.
102
103# OPEN,CLOSE,GROUPP ...are numbered at compile time.
104OPEN OPEN, num 1 Mark this point in input as start of #n.
105CLOSE CLOSE, num 1 Analogous to OPEN.
106
107REF REF, num 1 Match some already matched string
108REFF REF, num 1 Match already matched string, folded
109REFFL REF, num 1 Match already matched string, folded in loc.
110
111# grouping assertions
112IFMATCH BRANCHJ,off 1 2 Succeeds if the following matches.
113UNLESSM BRANCHJ,off 1 2 Fails if the following matches.
114SUSPEND BRANCHJ,off 1 1 "Independent" sub-RE.
115IFTHEN BRANCHJ,off 1 1 Switch, should be preceeded by switcher .
116GROUPP GROUPP, num 1 Whether the group matched.
117
118# Support for long RE
119LONGJMP LONGJMP,off 1 1 Jump far away.
120BRANCHJ BRANCHJ,off 1 1 BRANCH with long offset.
121
122# The heavy worker
123EVAL EVAL, evl 1 Execute some Perl code.
124
125# Modifiers
126MINMOD MINMOD, no Next operator is not greedy.
127LOGICAL LOGICAL,no Next opcode should set the flag only.
128
129# This is not used yet
130RENUM BRANCHJ,off 1 1 Group with independently numbered parens.
131
132# This is not really a node, but an optimized away piece of a "long" node.
133# To simplify debugging output, we mark it as if it were a node
134OPTIMIZED NOTHING,off Placeholder for dump.