This is a live mirror of the Perl 5 development currently hosted at https://github.com/perl/perl5
Set PERL_DL_NONLAZY in t/TEST to check existence of all symbols
[perl5.git] / pod / perlfunc.pod
CommitLineData
a0d0e21e 1=head1 NAME
d74e8afc 2X<function>
a0d0e21e
LW
3
4perlfunc - Perl builtin functions
5
6=head1 DESCRIPTION
7
8The functions in this section can serve as terms in an expression.
9They fall into two major categories: list operators and named unary
10operators. These differ in their precedence relationship with a
11following comma. (See the precedence table in L<perlop>.) List
12operators take more than one argument, while unary operators can never
13take more than one argument. Thus, a comma terminates the argument of
14a unary operator, but merely separates the arguments of a list
8f1da26d 15operator. A unary operator generally provides scalar context to its
2b5ab1e7 16argument, while a list operator may provide either scalar or list
3b10bc60 17contexts for its arguments. If it does both, scalar arguments
18come first and list argument follow, and there can only ever
19be one such list argument. For instance, splice() has three scalar
2b5ab1e7
TC
20arguments followed by a list, whereas gethostbyname() has four scalar
21arguments.
a0d0e21e
LW
22
23In the syntax descriptions that follow, list operators that expect a
3b10bc60 24list (and provide list context for elements of the list) are shown
a0d0e21e
LW
25with LIST as an argument. Such a list may consist of any combination
26of scalar arguments or list values; the list values will be included
27in the list as if each individual element were interpolated at that
28point in the list, forming a longer single-dimensional list value.
8bdbc703 29Commas should separate literal elements of the LIST.
a0d0e21e
LW
30
31Any function in the list below may be used either with or without
32parentheses around its arguments. (The syntax descriptions omit the
3b10bc60 33parentheses.) If you use parentheses, the simple but occasionally
34surprising rule is this: It I<looks> like a function, therefore it I<is> a
a0d0e21e 35function, and precedence doesn't matter. Otherwise it's a list
3b10bc60 36operator or unary operator, and precedence does matter. Whitespace
37between the function and left parenthesis doesn't count, so sometimes
38you need to be careful:
a0d0e21e 39
5ed4f2ec 40 print 1+2+4; # Prints 7.
41 print(1+2) + 4; # Prints 3.
42 print (1+2)+4; # Also prints 3!
43 print +(1+2)+4; # Prints 7.
44 print ((1+2)+4); # Prints 7.
a0d0e21e
LW
45
46If you run Perl with the B<-w> switch it can warn you about this. For
47example, the third line above produces:
48
49 print (...) interpreted as function at - line 1.
50 Useless use of integer addition in void context at - line 1.
51
2b5ab1e7
TC
52A few functions take no arguments at all, and therefore work as neither
53unary nor list operators. These include such functions as C<time>
54and C<endpwent>. For example, C<time+86_400> always means
55C<time() + 86_400>.
56
a0d0e21e 57For functions that can be used in either a scalar or list context,
8f1da26d
TC
58nonabortive failure is generally indicated in scalar context by
59returning the undefined value, and in list context by returning the
3b10bc60 60empty list.
a0d0e21e 61
5a964f20
TC
62Remember the following important rule: There is B<no rule> that relates
63the behavior of an expression in list context to its behavior in scalar
64context, or vice versa. It might do two totally different things.
80d38338 65Each operator and function decides which sort of value would be most
2b5ab1e7 66appropriate to return in scalar context. Some operators return the
5a964f20 67length of the list that would have been returned in list context. Some
a0d0e21e
LW
68operators return the first value in the list. Some operators return the
69last value in the list. Some operators return a count of successful
70operations. In general, they do what you want, unless you want
71consistency.
d74e8afc 72X<context>
a0d0e21e 73
d1be9408 74A named array in scalar context is quite different from what would at
5a964f20
TC
75first glance appear to be a list in scalar context. You can't get a list
76like C<(1,2,3)> into being in scalar context, because the compiler knows
77the context at compile time. It would generate the scalar comma operator
78there, not the list construction version of the comma. That means it
79was never a list to start with.
80
3b10bc60 81In general, functions in Perl that serve as wrappers for system calls ("syscalls")
5dac7880 82of the same name (like chown(2), fork(2), closedir(2), etc.) return
5a964f20
TC
83true when they succeed and C<undef> otherwise, as is usually mentioned
84in the descriptions below. This is different from the C interfaces,
5dac7880 85which return C<-1> on failure. Exceptions to this rule include C<wait>,
19799a22 86C<waitpid>, and C<syscall>. System calls also set the special C<$!>
5a964f20
TC
87variable on failure. Other functions do not, except accidentally.
88
88e1f1a2
JV
89Extension modules can also hook into the Perl parser to define new
90kinds of keyword-headed expression. These may look like functions, but
91may also look completely different. The syntax following the keyword
92is defined entirely by the extension. If you are an implementor, see
93L<perlapi/PL_keyword_plugin> for the mechanism. If you are using such
94a module, see the module's documentation for details of the syntax that
95it defines.
96
cb1a09d0 97=head2 Perl Functions by Category
d74e8afc 98X<function>
cb1a09d0
AD
99
100Here are Perl's functions (including things that look like
5a964f20 101functions, like some keywords and named operators)
cb1a09d0
AD
102arranged by category. Some functions appear in more
103than one place.
104
13a2d996 105=over 4
cb1a09d0
AD
106
107=item Functions for SCALARs or strings
d74e8afc 108X<scalar> X<string> X<character>
cb1a09d0 109
22fae026 110C<chomp>, C<chop>, C<chr>, C<crypt>, C<hex>, C<index>, C<lc>, C<lcfirst>,
1dc8ecb8 111C<length>, C<oct>, C<ord>, C<pack>, C<q//>, C<qq//>, C<reverse>,
945c54fd 112C<rindex>, C<sprintf>, C<substr>, C<tr///>, C<uc>, C<ucfirst>, C<y///>
cb1a09d0
AD
113
114=item Regular expressions and pattern matching
d74e8afc 115X<regular expression> X<regex> X<regexp>
cb1a09d0 116
ab4f32c2 117C<m//>, C<pos>, C<quotemeta>, C<s///>, C<split>, C<study>, C<qr//>
cb1a09d0
AD
118
119=item Numeric functions
d74e8afc 120X<numeric> X<number> X<trigonometric> X<trigonometry>
cb1a09d0 121
22fae026
TM
122C<abs>, C<atan2>, C<cos>, C<exp>, C<hex>, C<int>, C<log>, C<oct>, C<rand>,
123C<sin>, C<sqrt>, C<srand>
cb1a09d0
AD
124
125=item Functions for real @ARRAYs
d74e8afc 126X<array>
cb1a09d0 127
a5ce339c 128C<each>, C<keys>, C<pop>, C<push>, C<shift>, C<splice>, C<unshift>, C<values>
cb1a09d0
AD
129
130=item Functions for list data
d74e8afc 131X<list>
cb1a09d0 132
1dc8ecb8 133C<grep>, C<join>, C<map>, C<qw//>, C<reverse>, C<sort>, C<unpack>
cb1a09d0
AD
134
135=item Functions for real %HASHes
d74e8afc 136X<hash>
cb1a09d0 137
22fae026 138C<delete>, C<each>, C<exists>, C<keys>, C<values>
cb1a09d0
AD
139
140=item Input and output functions
d74e8afc 141X<I/O> X<input> X<output> X<dbm>
cb1a09d0 142
22fae026
TM
143C<binmode>, C<close>, C<closedir>, C<dbmclose>, C<dbmopen>, C<die>, C<eof>,
144C<fileno>, C<flock>, C<format>, C<getc>, C<print>, C<printf>, C<read>,
0d863452 145C<readdir>, C<rewinddir>, C<say>, C<seek>, C<seekdir>, C<select>, C<syscall>,
22fae026
TM
146C<sysread>, C<sysseek>, C<syswrite>, C<tell>, C<telldir>, C<truncate>,
147C<warn>, C<write>
cb1a09d0 148
5dac7880 149=item Functions for fixed-length data or records
cb1a09d0 150
22fae026 151C<pack>, C<read>, C<syscall>, C<sysread>, C<syswrite>, C<unpack>, C<vec>
cb1a09d0
AD
152
153=item Functions for filehandles, files, or directories
d74e8afc 154X<file> X<filehandle> X<directory> X<pipe> X<link> X<symlink>
cb1a09d0 155
22fae026 156C<-I<X>>, C<chdir>, C<chmod>, C<chown>, C<chroot>, C<fcntl>, C<glob>,
5ff3f7a4 157C<ioctl>, C<link>, C<lstat>, C<mkdir>, C<open>, C<opendir>,
1e278fd9
JH
158C<readlink>, C<rename>, C<rmdir>, C<stat>, C<symlink>, C<sysopen>,
159C<umask>, C<unlink>, C<utime>
cb1a09d0 160
cf264981 161=item Keywords related to the control flow of your Perl program
d74e8afc 162X<control flow>
cb1a09d0 163
98293880 164C<caller>, C<continue>, C<die>, C<do>, C<dump>, C<eval>, C<exit>,
cfa52385
FC
165C<__FILE__>, C<goto>, C<last>, C<__LINE__>, C<next>, C<__PACKAGE__>,
166C<redo>, C<return>, C<sub>, C<wantarray>,
cb1a09d0 167
8f1da26d 168=item Keywords related to the switch feature
0d863452 169
4a904372 170C<break>, C<continue>, C<default>, C<given>, C<when>
0d863452 171
4a904372
FC
172Except for C<continue>, these are available only if you enable the
173C<"switch"> feature or use the C<CORE::> prefix.
8f1da26d 174See L<feature> and L<perlsyn/"Switch statements">.
4a904372
FC
175Alternately, include a C<use v5.10> or later to the current scope. In Perl
1765.14 and earlier, C<continue> required the C<"switch"> feature, like the
177other keywords.
0d863452 178
54310121 179=item Keywords related to scoping
cb1a09d0 180
8f1da26d 181C<caller>, C<import>, C<local>, C<my>, C<our>, C<package>, C<state>, C<use>
36fb85f3 182
4a904372
FC
183C<state> is available only if the C<"state"> feature
184is enabled or if it is prefixed with C<CORE::>. See
8f1da26d 185L<feature>. Alternately, include a C<use v5.10> or later to the current scope.
cb1a09d0
AD
186
187=item Miscellaneous functions
188
36fb85f3 189C<defined>, C<dump>, C<eval>, C<formline>, C<local>, C<my>, C<our>,
834df1c5 190C<reset>, C<scalar>, C<state>, C<undef>, C<wantarray>
cb1a09d0
AD
191
192=item Functions for processes and process groups
d74e8afc 193X<process> X<pid> X<process id>
cb1a09d0 194
22fae026 195C<alarm>, C<exec>, C<fork>, C<getpgrp>, C<getppid>, C<getpriority>, C<kill>,
1dc8ecb8 196C<pipe>, C<qx//>, C<setpgrp>, C<setpriority>, C<sleep>, C<system>,
22fae026 197C<times>, C<wait>, C<waitpid>
cb1a09d0 198
3b10bc60 199=item Keywords related to Perl modules
d74e8afc 200X<module>
cb1a09d0 201
22fae026 202C<do>, C<import>, C<no>, C<package>, C<require>, C<use>
cb1a09d0 203
353c6505 204=item Keywords related to classes and object-orientation
d74e8afc 205X<object> X<class> X<package>
cb1a09d0 206
22fae026
TM
207C<bless>, C<dbmclose>, C<dbmopen>, C<package>, C<ref>, C<tie>, C<tied>,
208C<untie>, C<use>
cb1a09d0
AD
209
210=item Low-level socket functions
d74e8afc 211X<socket> X<sock>
cb1a09d0 212
22fae026
TM
213C<accept>, C<bind>, C<connect>, C<getpeername>, C<getsockname>,
214C<getsockopt>, C<listen>, C<recv>, C<send>, C<setsockopt>, C<shutdown>,
737dd4b4 215C<socket>, C<socketpair>
cb1a09d0
AD
216
217=item System V interprocess communication functions
d74e8afc 218X<IPC> X<System V> X<semaphore> X<shared memory> X<memory> X<message>
cb1a09d0 219
22fae026
TM
220C<msgctl>, C<msgget>, C<msgrcv>, C<msgsnd>, C<semctl>, C<semget>, C<semop>,
221C<shmctl>, C<shmget>, C<shmread>, C<shmwrite>
cb1a09d0
AD
222
223=item Fetching user and group info
d74e8afc 224X<user> X<group> X<password> X<uid> X<gid> X<passwd> X</etc/passwd>
cb1a09d0 225
22fae026
TM
226C<endgrent>, C<endhostent>, C<endnetent>, C<endpwent>, C<getgrent>,
227C<getgrgid>, C<getgrnam>, C<getlogin>, C<getpwent>, C<getpwnam>,
228C<getpwuid>, C<setgrent>, C<setpwent>
cb1a09d0
AD
229
230=item Fetching network info
d74e8afc 231X<network> X<protocol> X<host> X<hostname> X<IP> X<address> X<service>
cb1a09d0 232
22fae026
TM
233C<endprotoent>, C<endservent>, C<gethostbyaddr>, C<gethostbyname>,
234C<gethostent>, C<getnetbyaddr>, C<getnetbyname>, C<getnetent>,
235C<getprotobyname>, C<getprotobynumber>, C<getprotoent>,
236C<getservbyname>, C<getservbyport>, C<getservent>, C<sethostent>,
237C<setnetent>, C<setprotoent>, C<setservent>
cb1a09d0
AD
238
239=item Time-related functions
d74e8afc 240X<time> X<date>
cb1a09d0 241
22fae026 242C<gmtime>, C<localtime>, C<time>, C<times>
cb1a09d0 243
37798a01 244=item Functions new in perl5
d74e8afc 245X<perl5>
37798a01 246
834df1c5
SP
247C<abs>, C<bless>, C<break>, C<chomp>, C<chr>, C<continue>, C<default>,
248C<exists>, C<formline>, C<given>, C<glob>, C<import>, C<lc>, C<lcfirst>,
1dc8ecb8 249C<lock>, C<map>, C<my>, C<no>, C<our>, C<prototype>, C<qr//>, C<qw//>, C<qx//>,
834df1c5
SP
250C<readline>, C<readpipe>, C<ref>, C<sub>*, C<sysopen>, C<tie>, C<tied>, C<uc>,
251C<ucfirst>, C<untie>, C<use>, C<when>
37798a01 252
3b10bc60 253* C<sub> was a keyword in Perl 4, but in Perl 5 it is an
5a964f20 254operator, which can be used in expressions.
37798a01 255
256=item Functions obsoleted in perl5
257
22fae026 258C<dbmclose>, C<dbmopen>
37798a01 259
cb1a09d0
AD
260=back
261
60f9f73c 262=head2 Portability
d74e8afc 263X<portability> X<Unix> X<portable>
60f9f73c 264
2b5ab1e7
TC
265Perl was born in Unix and can therefore access all common Unix
266system calls. In non-Unix environments, the functionality of some
8f1da26d 267Unix system calls may not be available or details of the available
2b5ab1e7 268functionality may differ slightly. The Perl functions affected
60f9f73c
JH
269by this are:
270
271C<-X>, C<binmode>, C<chmod>, C<chown>, C<chroot>, C<crypt>,
272C<dbmclose>, C<dbmopen>, C<dump>, C<endgrent>, C<endhostent>,
273C<endnetent>, C<endprotoent>, C<endpwent>, C<endservent>, C<exec>,
ef5a6dd7
JH
274C<fcntl>, C<flock>, C<fork>, C<getgrent>, C<getgrgid>, C<gethostbyname>,
275C<gethostent>, C<getlogin>, C<getnetbyaddr>, C<getnetbyname>, C<getnetent>,
54d7b083 276C<getppid>, C<getpgrp>, C<getpriority>, C<getprotobynumber>,
60f9f73c
JH
277C<getprotoent>, C<getpwent>, C<getpwnam>, C<getpwuid>,
278C<getservbyport>, C<getservent>, C<getsockopt>, C<glob>, C<ioctl>,
279C<kill>, C<link>, C<lstat>, C<msgctl>, C<msgget>, C<msgrcv>,
2b5ab1e7 280C<msgsnd>, C<open>, C<pipe>, C<readlink>, C<rename>, C<select>, C<semctl>,
60f9f73c
JH
281C<semget>, C<semop>, C<setgrent>, C<sethostent>, C<setnetent>,
282C<setpgrp>, C<setpriority>, C<setprotoent>, C<setpwent>,
283C<setservent>, C<setsockopt>, C<shmctl>, C<shmget>, C<shmread>,
737dd4b4 284C<shmwrite>, C<socket>, C<socketpair>,
80cbd5ad
JH
285C<stat>, C<symlink>, C<syscall>, C<sysopen>, C<system>,
286C<times>, C<truncate>, C<umask>, C<unlink>,
2b5ab1e7 287C<utime>, C<wait>, C<waitpid>
60f9f73c
JH
288
289For more information about the portability of these functions, see
290L<perlport> and other available platform-specific documentation.
291
cb1a09d0
AD
292=head2 Alphabetical Listing of Perl Functions
293
3b10bc60 294=over
a0d0e21e 295
5b3c99c0 296=item -X FILEHANDLE
d74e8afc
ITB
297X<-r>X<-w>X<-x>X<-o>X<-R>X<-W>X<-X>X<-O>X<-e>X<-z>X<-s>X<-f>X<-d>X<-l>X<-p>
298X<-S>X<-b>X<-c>X<-t>X<-u>X<-g>X<-k>X<-T>X<-B>X<-M>X<-A>X<-C>
a0d0e21e 299
5b3c99c0 300=item -X EXPR
a0d0e21e 301
5228a96c
SP
302=item -X DIRHANDLE
303
5b3c99c0 304=item -X
a0d0e21e
LW
305
306A file test, where X is one of the letters listed below. This unary
5228a96c
SP
307operator takes one argument, either a filename, a filehandle, or a dirhandle,
308and tests the associated file to see if something is true about it. If the
7660c0ab 309argument is omitted, tests C<$_>, except for C<-t>, which tests STDIN.
19799a22 310Unless otherwise documented, it returns C<1> for true and C<''> for false, or
a0d0e21e 311the undefined value if the file doesn't exist. Despite the funny
d0821a6a 312names, precedence is the same as any other named unary operator. The
a0d0e21e
LW
313operator may be any of:
314
5ed4f2ec 315 -r File is readable by effective uid/gid.
316 -w File is writable by effective uid/gid.
317 -x File is executable by effective uid/gid.
318 -o File is owned by effective uid.
a0d0e21e 319
5ed4f2ec 320 -R File is readable by real uid/gid.
321 -W File is writable by real uid/gid.
322 -X File is executable by real uid/gid.
323 -O File is owned by real uid.
a0d0e21e 324
5ed4f2ec 325 -e File exists.
326 -z File has zero size (is empty).
327 -s File has nonzero size (returns size in bytes).
a0d0e21e 328
5ed4f2ec 329 -f File is a plain file.
330 -d File is a directory.
331 -l File is a symbolic link.
332 -p File is a named pipe (FIFO), or Filehandle is a pipe.
333 -S File is a socket.
334 -b File is a block special file.
335 -c File is a character special file.
336 -t Filehandle is opened to a tty.
a0d0e21e 337
5ed4f2ec 338 -u File has setuid bit set.
339 -g File has setgid bit set.
340 -k File has sticky bit set.
a0d0e21e 341
5ed4f2ec 342 -T File is an ASCII text file (heuristic guess).
343 -B File is a "binary" file (opposite of -T).
a0d0e21e 344
5ed4f2ec 345 -M Script start time minus file modification time, in days.
346 -A Same for access time.
347 -C Same for inode change time (Unix, may differ for other platforms)
a0d0e21e 348
a0d0e21e
LW
349Example:
350
351 while (<>) {
a9a5a0dc
VP
352 chomp;
353 next unless -f $_; # ignore specials
354 #...
a0d0e21e
LW
355 }
356
4fb67938
FC
357Note that C<-s/a/b/> does not do a negated substitution. Saying
358C<-exp($foo)> still works as expected, however: only single letters
359following a minus are interpreted as file tests.
360
361These operators are exempt from the "looks like a function rule" described
4d0444a3
FC
362above. That is, an opening parenthesis after the operator does not affect
363how much of the following code constitutes the argument. Put the opening
4fb67938
FC
364parentheses before the operator to separate it from code that follows (this
365applies only to operators with higher precedence than unary operators, of
366course):
367
368 -s($file) + 1024 # probably wrong; same as -s($file + 1024)
369 (-s $file) + 1024 # correct
370
5ff3f7a4
GS
371The interpretation of the file permission operators C<-r>, C<-R>,
372C<-w>, C<-W>, C<-x>, and C<-X> is by default based solely on the mode
373of the file and the uids and gids of the user. There may be other
ecae030f
MO
374reasons you can't actually read, write, or execute the file: for
375example network filesystem access controls, ACLs (access control lists),
376read-only filesystems, and unrecognized executable formats. Note
377that the use of these six specific operators to verify if some operation
378is possible is usually a mistake, because it may be open to race
379conditions.
5ff3f7a4 380
2b5ab1e7
TC
381Also note that, for the superuser on the local filesystems, the C<-r>,
382C<-R>, C<-w>, and C<-W> tests always return 1, and C<-x> and C<-X> return 1
5ff3f7a4
GS
383if any execute bit is set in the mode. Scripts run by the superuser
384may thus need to do a stat() to determine the actual mode of the file,
2b5ab1e7 385or temporarily set their effective uid to something else.
5ff3f7a4
GS
386
387If you are using ACLs, there is a pragma called C<filetest> that may
388produce more accurate results than the bare stat() mode bits.
5dac7880
FC
389When under C<use filetest 'access'> the above-mentioned filetests
390test whether the permission can(not) be granted using the
3b10bc60 391access(2) family of system calls. Also note that the C<-x> and C<-X> may
5ff3f7a4
GS
392under this pragma return true even if there are no execute permission
393bits set (nor any extra execute permission ACLs). This strangeness is
ecae030f
MO
394due to the underlying system calls' definitions. Note also that, due to
395the implementation of C<use filetest 'access'>, the C<_> special
396filehandle won't cache the results of the file tests when this pragma is
397in effect. Read the documentation for the C<filetest> pragma for more
398information.
5ff3f7a4 399
a0d0e21e
LW
400The C<-T> and C<-B> switches work as follows. The first block or so of the
401file is examined for odd characters such as strange control codes or
61eff3bc 402characters with the high bit set. If too many strange characters (>30%)
cf264981 403are found, it's a C<-B> file; otherwise it's a C<-T> file. Also, any file
3b10bc60 404containing a zero byte in the first block is considered a binary file. If C<-T>
9124316e 405or C<-B> is used on a filehandle, the current IO buffer is examined
3b10bc60 406rather than the first block. Both C<-T> and C<-B> return true on an empty
54310121 407file, or a file at EOF when testing a filehandle. Because you have to
4633a7c4
LW
408read a file to do the C<-T> test, on most occasions you want to use a C<-f>
409against the file first, as in C<next unless -f $file && -T $file>.
a0d0e21e 410
5dac7880 411If any of the file tests (or either the C<stat> or C<lstat> operator) is given
28757baa 412the special filehandle consisting of a solitary underline, then the stat
a0d0e21e
LW
413structure of the previous file test (or stat operator) is used, saving
414a system call. (This doesn't work with C<-t>, and you need to remember
3b10bc60 415that lstat() and C<-l> leave values in the stat structure for the
5c9aa243 416symbolic link, not the real file.) (Also, if the stat buffer was filled by
cf264981 417an C<lstat> call, C<-T> and C<-B> will reset it with the results of C<stat _>).
5c9aa243 418Example:
a0d0e21e
LW
419
420 print "Can do.\n" if -r $a || -w _ || -x _;
421
422 stat($filename);
423 print "Readable\n" if -r _;
424 print "Writable\n" if -w _;
425 print "Executable\n" if -x _;
426 print "Setuid\n" if -u _;
427 print "Setgid\n" if -g _;
428 print "Sticky\n" if -k _;
429 print "Text\n" if -T _;
430 print "Binary\n" if -B _;
431
fbb0b3b3
RGS
432As of Perl 5.9.1, as a form of purely syntactic sugar, you can stack file
433test operators, in a way that C<-f -w -x $file> is equivalent to
3b10bc60 434C<-x $file && -w _ && -f _>. (This is only fancy fancy: if you use
fbb0b3b3
RGS
435the return value of C<-f $file> as an argument to another filetest
436operator, no special magic will happen.)
437
bee96257 438Portability issues: L<perlport/-X>.
ea9eb35a 439
a0d0e21e 440=item abs VALUE
d74e8afc 441X<abs> X<absolute>
a0d0e21e 442
54310121 443=item abs
bbce6d69 444
a0d0e21e 445Returns the absolute value of its argument.
7660c0ab 446If VALUE is omitted, uses C<$_>.
a0d0e21e
LW
447
448=item accept NEWSOCKET,GENERICSOCKET
d74e8afc 449X<accept>
a0d0e21e 450
3b10bc60 451Accepts an incoming socket connect, just as accept(2)
19799a22 452does. Returns the packed address if it succeeded, false otherwise.
2b5ab1e7 453See the example in L<perlipc/"Sockets: Client/Server Communication">.
a0d0e21e 454
8d2a6795
GS
455On systems that support a close-on-exec flag on files, the flag will
456be set for the newly opened file descriptor, as determined by the
457value of $^F. See L<perlvar/$^F>.
458
a0d0e21e 459=item alarm SECONDS
d74e8afc
ITB
460X<alarm>
461X<SIGALRM>
462X<timer>
a0d0e21e 463
54310121 464=item alarm
bbce6d69 465
a0d0e21e 466Arranges to have a SIGALRM delivered to this process after the
cf264981 467specified number of wallclock seconds has elapsed. If SECONDS is not
d400eac8
JH
468specified, the value stored in C<$_> is used. (On some machines,
469unfortunately, the elapsed time may be up to one second less or more
470than you specified because of how seconds are counted, and process
471scheduling may delay the delivery of the signal even further.)
472
473Only one timer may be counting at once. Each call disables the
474previous timer, and an argument of C<0> may be supplied to cancel the
475previous timer without starting a new one. The returned value is the
476amount of time remaining on the previous timer.
a0d0e21e 477
2bc69794
BS
478For delays of finer granularity than one second, the Time::HiRes module
479(from CPAN, and starting from Perl 5.8 part of the standard
480distribution) provides ualarm(). You may also use Perl's four-argument
481version of select() leaving the first three arguments undefined, or you
482might be able to use the C<syscall> interface to access setitimer(2) if
483your system supports it. See L<perlfaq8> for details.
2b5ab1e7 484
80d38338
TC
485It is usually a mistake to intermix C<alarm> and C<sleep> calls, because
486C<sleep> may be internally implemented on your system with C<alarm>.
a0d0e21e 487
19799a22
GS
488If you want to use C<alarm> to time out a system call you need to use an
489C<eval>/C<die> pair. You can't rely on the alarm causing the system call to
f86cebdf 490fail with C<$!> set to C<EINTR> because Perl sets up signal handlers to
19799a22 491restart system calls on some systems. Using C<eval>/C<die> always works,
5a964f20 492modulo the caveats given in L<perlipc/"Signals">.
ff68c719 493
494 eval {
a9a5a0dc
VP
495 local $SIG{ALRM} = sub { die "alarm\n" }; # NB: \n required
496 alarm $timeout;
497 $nread = sysread SOCKET, $buffer, $size;
498 alarm 0;
ff68c719 499 };
ff68c719 500 if ($@) {
a9a5a0dc 501 die unless $@ eq "alarm\n"; # propagate unexpected errors
5ed4f2ec 502 # timed out
ff68c719 503 }
504 else {
5ed4f2ec 505 # didn't
ff68c719 506 }
507
91d81acc
JH
508For more information see L<perlipc>.
509
ea9eb35a
BJ
510Portability issues: L<perlport/alarm>.
511
a0d0e21e 512=item atan2 Y,X
d74e8afc 513X<atan2> X<arctangent> X<tan> X<tangent>
a0d0e21e
LW
514
515Returns the arctangent of Y/X in the range -PI to PI.
516
ca6e1c26 517For the tangent operation, you may use the C<Math::Trig::tan>
28757baa 518function, or use the familiar relation:
519
520 sub tan { sin($_[0]) / cos($_[0]) }
521
a1021d57
RGS
522The return value for C<atan2(0,0)> is implementation-defined; consult
523your atan2(3) manpage for more information.
bf5f1b4c 524
ea9eb35a
BJ
525Portability issues: L<perlport/atan2>.
526
a0d0e21e 527=item bind SOCKET,NAME
d74e8afc 528X<bind>
a0d0e21e 529
3b10bc60 530Binds a network address to a socket, just as bind(2)
19799a22 531does. Returns true if it succeeded, false otherwise. NAME should be a
4633a7c4
LW
532packed address of the appropriate type for the socket. See the examples in
533L<perlipc/"Sockets: Client/Server Communication">.
a0d0e21e 534
fae2c0fb 535=item binmode FILEHANDLE, LAYER
d74e8afc 536X<binmode> X<binary> X<text> X<DOS> X<Windows>
1c1fc3ea 537
a0d0e21e
LW
538=item binmode FILEHANDLE
539
1cbfc93d
NIS
540Arranges for FILEHANDLE to be read or written in "binary" or "text"
541mode on systems where the run-time libraries distinguish between
542binary and text files. If FILEHANDLE is an expression, the value is
543taken as the name of the filehandle. Returns true on success,
b5fe5ca2 544otherwise it returns C<undef> and sets C<$!> (errno).
1cbfc93d 545
8f1da26d 546On some systems (in general, DOS- and Windows-based systems) binmode()
d807c6f4 547is necessary when you're not working with a text file. For the sake
d7a0d798
FC
548of portability it is a good idea always to use it when appropriate,
549and never to use it when it isn't appropriate. Also, people can
8f1da26d 550set their I/O to be by default UTF8-encoded Unicode, not bytes.
d807c6f4
JH
551
552In other words: regardless of platform, use binmode() on binary data,
d7a0d798 553like images, for example.
d807c6f4
JH
554
555If LAYER is present it is a single string, but may contain multiple
3b10bc60 556directives. The directives alter the behaviour of the filehandle.
d7a0d798 557When LAYER is present, using binmode on a text file makes sense.
d807c6f4 558
fae2c0fb 559If LAYER is omitted or specified as C<:raw> the filehandle is made
0226bbdb
NIS
560suitable for passing binary data. This includes turning off possible CRLF
561translation and marking it as bytes (as opposed to Unicode characters).
749683d2 562Note that, despite what may be implied in I<"Programming Perl"> (the
3b10bc60 563Camel, 3rd edition) or elsewhere, C<:raw> is I<not> simply the inverse of C<:crlf>.
564Other layers that would affect the binary nature of the stream are
565I<also> disabled. See L<PerlIO>, L<perlrun>, and the discussion about the
0226bbdb 566PERLIO environment variable.
01e6739c 567
3b10bc60 568The C<:bytes>, C<:crlf>, C<:utf8>, and any other directives of the
d807c6f4
JH
569form C<:...>, are called I/O I<layers>. The C<open> pragma can be used to
570establish default I/O layers. See L<open>.
571
fae2c0fb
RGS
572I<The LAYER parameter of the binmode() function is described as "DISCIPLINE"
573in "Programming Perl, 3rd Edition". However, since the publishing of this
574book, by many known as "Camel III", the consensus of the naming of this
575functionality has moved from "discipline" to "layer". All documentation
576of this version of Perl therefore refers to "layers" rather than to
577"disciplines". Now back to the regularly scheduled documentation...>
578
8f1da26d 579To mark FILEHANDLE as UTF-8, use C<:utf8> or C<:encoding(UTF-8)>.
6902c96a 580C<:utf8> just marks the data as UTF-8 without further checking,
8f1da26d 581while C<:encoding(UTF-8)> checks the data for actually being valid
6902c96a 582UTF-8. More details can be found in L<PerlIO::encoding>.
1cbfc93d 583
ed53a2bb 584In general, binmode() should be called after open() but before any I/O
3b10bc60 585is done on the filehandle. Calling binmode() normally flushes any
01e6739c 586pending buffered output data (and perhaps pending input data) on the
fae2c0fb 587handle. An exception to this is the C<:encoding> layer that
d7a0d798 588changes the default character encoding of the handle; see L</open>.
fae2c0fb 589The C<:encoding> layer sometimes needs to be called in
3874323d
JH
590mid-stream, and it doesn't flush the stream. The C<:encoding>
591also implicitly pushes on top of itself the C<:utf8> layer because
3b10bc60 592internally Perl operates on UTF8-encoded Unicode characters.
16fe6d59 593
19799a22 594The operating system, device drivers, C libraries, and Perl run-time
8f1da26d
TC
595system all conspire to let the programmer treat a single
596character (C<\n>) as the line terminator, irrespective of external
30168b04
GS
597representation. On many operating systems, the native text file
598representation matches the internal representation, but on some
599platforms the external representation of C<\n> is made up of more than
600one character.
601
8f1da26d
TC
602All variants of Unix, Mac OS (old and new), and Stream_LF files on VMS use
603a single character to end each line in the external representation of text
604(even though that single character is CARRIAGE RETURN on old, pre-Darwin
605flavors of Mac OS, and is LINE FEED on Unix and most VMS files). In other
606systems like OS/2, DOS, and the various flavors of MS-Windows, your program
607sees a C<\n> as a simple C<\cJ>, but what's stored in text files are the
608two characters C<\cM\cJ>. That means that if you don't use binmode() on
609these systems, C<\cM\cJ> sequences on disk will be converted to C<\n> on
610input, and any C<\n> in your program will be converted back to C<\cM\cJ> on
611output. This is what you want for text files, but it can be disastrous for
612binary files.
30168b04
GS
613
614Another consequence of using binmode() (on some systems) is that
615special end-of-file markers will be seen as part of the data stream.
d7a0d798
FC
616For systems from the Microsoft family this means that, if your binary
617data contain C<\cZ>, the I/O subsystem will regard it as the end of
30168b04
GS
618the file, unless you use binmode().
619
3b10bc60 620binmode() is important not only for readline() and print() operations,
30168b04
GS
621but also when using read(), seek(), sysread(), syswrite() and tell()
622(see L<perlport> for more details). See the C<$/> and C<$\> variables
623in L<perlvar> for how to manually set your input and output
624line-termination sequences.
a0d0e21e 625
ea9eb35a
BJ
626Portability issues: L<perlport/binmode>.
627
4633a7c4 628=item bless REF,CLASSNAME
d74e8afc 629X<bless>
a0d0e21e
LW
630
631=item bless REF
632
2b5ab1e7
TC
633This function tells the thingy referenced by REF that it is now an object
634in the CLASSNAME package. If CLASSNAME is omitted, the current package
19799a22 635is used. Because a C<bless> is often the last thing in a constructor,
2b5ab1e7 636it returns the reference for convenience. Always use the two-argument
cf264981 637version if a derived class might inherit the function doing the blessing.
82e1c0d9 638SeeL<perlobj> for more about the blessing (and blessings) of objects.
a0d0e21e 639
57668c4d 640Consider always blessing objects in CLASSNAMEs that are mixed case.
2b5ab1e7 641Namespaces with all lowercase names are considered reserved for
cf264981 642Perl pragmata. Builtin types have all uppercase names. To prevent
2b5ab1e7
TC
643confusion, you may wish to avoid such package names as well. Make sure
644that CLASSNAME is a true value.
60ad88b8
GS
645
646See L<perlmod/"Perl Modules">.
647
0d863452
RH
648=item break
649
650Break out of a C<given()> block.
651
8f1da26d 652This keyword is enabled by the C<"switch"> feature: see
4a904372
FC
653L<feature> for more information. You can also access it by
654prefixing it with C<CORE::>. Alternately, include a C<use
8f1da26d 655v5.10> or later to the current scope.
0d863452 656
a0d0e21e 657=item caller EXPR
d74e8afc 658X<caller> X<call stack> X<stack> X<stack trace>
a0d0e21e
LW
659
660=item caller
661
5a964f20 662Returns the context of the current subroutine call. In scalar context,
80d38338
TC
663returns the caller's package name if there I<is> a caller (that is, if
664we're in a subroutine or C<eval> or C<require>) and the undefined value
5a964f20 665otherwise. In list context, returns
a0d0e21e 666
ee6b43cc 667 # 0 1 2
748a9306 668 ($package, $filename, $line) = caller;
a0d0e21e
LW
669
670With EXPR, it returns some extra information that the debugger uses to
671print a stack trace. The value of EXPR indicates how many call frames
672to go back before the current one.
673
ee6b43cc 674 # 0 1 2 3 4
f3aa04c2 675 ($package, $filename, $line, $subroutine, $hasargs,
ee6b43cc 676
677 # 5 6 7 8 9 10
b3ca2e83 678 $wantarray, $evaltext, $is_require, $hints, $bitmask, $hinthash)
ee6b43cc 679 = caller($i);
e7ea3e70 680
951ba7fe 681Here $subroutine may be C<(eval)> if the frame is not a subroutine
19799a22 682call, but an C<eval>. In such a case additional elements $evaltext and
7660c0ab 683C<$is_require> are set: C<$is_require> is true if the frame is created by a
19799a22 684C<require> or C<use> statement, $evaltext contains the text of the
277ddfaf 685C<eval EXPR> statement. In particular, for an C<eval BLOCK> statement,
cc1c2e42 686$subroutine is C<(eval)>, but $evaltext is undefined. (Note also that
0fc9dec4
RGS
687each C<use> statement creates a C<require> frame inside an C<eval EXPR>
688frame.) $subroutine may also be C<(unknown)> if this particular
689subroutine happens to have been deleted from the symbol table.
690C<$hasargs> is true if a new instance of C<@_> was set up for the frame.
691C<$hints> and C<$bitmask> contain pragmatic hints that the caller was
692compiled with. The C<$hints> and C<$bitmask> values are subject to change
693between versions of Perl, and are not meant for external use.
748a9306 694
b3ca2e83
NC
695C<$hinthash> is a reference to a hash containing the value of C<%^H> when the
696caller was compiled, or C<undef> if C<%^H> was empty. Do not modify the values
697of this hash, as they are the actual values stored in the optree.
698
748a9306 699Furthermore, when called from within the DB package, caller returns more
7660c0ab 700detailed information: it sets the list variable C<@DB::args> to be the
54310121 701arguments with which the subroutine was invoked.
748a9306 702
7660c0ab 703Be aware that the optimizer might have optimized call frames away before
19799a22 704C<caller> had a chance to get the information. That means that C<caller(N)>
80d38338 705might not return information about the call frame you expect it to, for
b76cc8ba 706C<< N > 1 >>. In particular, C<@DB::args> might have information from the
19799a22 707previous time C<caller> was called.
7660c0ab 708
8f1da26d 709Be aware that setting C<@DB::args> is I<best effort>, intended for
ca9f0cb5
NC
710debugging or generating backtraces, and should not be relied upon. In
711particular, as C<@_> contains aliases to the caller's arguments, Perl does
712not take a copy of C<@_>, so C<@DB::args> will contain modifications the
713subroutine makes to C<@_> or its contents, not the original values at call
714time. C<@DB::args>, like C<@_>, does not hold explicit references to its
715elements, so under certain cases its elements may have become freed and
716reallocated for other variables or temporary values. Finally, a side effect
d7a0d798 717of the current implementation is that the effects of C<shift @_> can
8f1da26d
TC
718I<normally> be undone (but not C<pop @_> or other splicing, I<and> not if a
719reference to C<@_> has been taken, I<and> subject to the caveat about reallocated
ca9f0cb5
NC
720elements), so C<@DB::args> is actually a hybrid of the current state and
721initial state of C<@_>. Buyer beware.
722
a0d0e21e 723=item chdir EXPR
d74e8afc
ITB
724X<chdir>
725X<cd>
f723aae1 726X<directory, change>
a0d0e21e 727
c4aca7d0
GA
728=item chdir FILEHANDLE
729
730=item chdir DIRHANDLE
731
ce2984c3
PF
732=item chdir
733
ffce7b87 734Changes the working directory to EXPR, if possible. If EXPR is omitted,
0bfc1ec4 735changes to the directory specified by C<$ENV{HOME}>, if set; if not,
ffce7b87 736changes to the directory specified by C<$ENV{LOGDIR}>. (Under VMS, the
b4ad75f0 737variable C<$ENV{SYS$LOGIN}> is also checked, and used if it is set.) If
80d38338 738neither is set, C<chdir> does nothing. It returns true on success,
b4ad75f0 739false otherwise. See the example under C<die>.
a0d0e21e 740
3b10bc60 741On systems that support fchdir(2), you may pass a filehandle or
34169887 742directory handle as the argument. On systems that don't support fchdir(2),
3b10bc60 743passing handles raises an exception.
c4aca7d0 744
a0d0e21e 745=item chmod LIST
d74e8afc 746X<chmod> X<permission> X<mode>
a0d0e21e
LW
747
748Changes the permissions of a list of files. The first element of the
8f1da26d 749list must be the numeric mode, which should probably be an octal
4ad40acf 750number, and which definitely should I<not> be a string of octal digits:
3b10bc60 751C<0644> is okay, but C<"0644"> is not. Returns the number of files
8f1da26d 752successfully changed. See also L</oct> if all you have is a string.
a0d0e21e 753
3b10bc60 754 $cnt = chmod 0755, "foo", "bar";
a0d0e21e 755 chmod 0755, @executables;
3b10bc60 756 $mode = "0644"; chmod $mode, "foo"; # !!! sets mode to
f86cebdf 757 # --w----r-T
3b10bc60 758 $mode = "0644"; chmod oct($mode), "foo"; # this is better
759 $mode = 0644; chmod $mode, "foo"; # this is best
a0d0e21e 760
3b10bc60 761On systems that support fchmod(2), you may pass filehandles among the
762files. On systems that don't support fchmod(2), passing filehandles raises
763an exception. Filehandles must be passed as globs or glob references to be
764recognized; barewords are considered filenames.
c4aca7d0
GA
765
766 open(my $fh, "<", "foo");
767 my $perm = (stat $fh)[2] & 07777;
768 chmod($perm | 0600, $fh);
769
3b10bc60 770You can also import the symbolic C<S_I*> constants from the C<Fcntl>
ca6e1c26
JH
771module:
772
3b10bc60 773 use Fcntl qw( :mode );
ca6e1c26 774 chmod S_IRWXU|S_IRGRP|S_IXGRP|S_IROTH|S_IXOTH, @executables;
3b10bc60 775 # Identical to the chmod 0755 of the example above.
ca6e1c26 776
ea9eb35a
BJ
777Portability issues: L<perlport/chmod>.
778
a0d0e21e 779=item chomp VARIABLE
d74e8afc 780X<chomp> X<INPUT_RECORD_SEPARATOR> X<$/> X<newline> X<eol>
a0d0e21e 781
313c9f5c 782=item chomp( LIST )
a0d0e21e
LW
783
784=item chomp
785
2b5ab1e7
TC
786This safer version of L</chop> removes any trailing string
787that corresponds to the current value of C<$/> (also known as
28757baa 788$INPUT_RECORD_SEPARATOR in the C<English> module). It returns the total
789number of characters removed from all its arguments. It's often used to
790remove the newline from the end of an input record when you're worried
2b5ab1e7
TC
791that the final record may be missing its newline. When in paragraph
792mode (C<$/ = "">), it removes all trailing newlines from the string.
4c5a6083 793When in slurp mode (C<$/ = undef>) or fixed-length record mode (C<$/> is
34169887 794a reference to an integer or the like; see L<perlvar>) chomp() won't
b76cc8ba 795remove anything.
19799a22 796If VARIABLE is omitted, it chomps C<$_>. Example:
a0d0e21e
LW
797
798 while (<>) {
a9a5a0dc
VP
799 chomp; # avoid \n on last field
800 @array = split(/:/);
801 # ...
a0d0e21e
LW
802 }
803
4bf21a6d
RD
804If VARIABLE is a hash, it chomps the hash's values, but not its keys.
805
a0d0e21e
LW
806You can actually chomp anything that's an lvalue, including an assignment:
807
808 chomp($cwd = `pwd`);
809 chomp($answer = <STDIN>);
810
811If you chomp a list, each element is chomped, and the total number of
812characters removed is returned.
813
15e44fd8
RGS
814Note that parentheses are necessary when you're chomping anything
815that is not a simple variable. This is because C<chomp $cwd = `pwd`;>
816is interpreted as C<(chomp $cwd) = `pwd`;>, rather than as
817C<chomp( $cwd = `pwd` )> which you might expect. Similarly,
818C<chomp $a, $b> is interpreted as C<chomp($a), $b> rather than
819as C<chomp($a, $b)>.
820
a0d0e21e 821=item chop VARIABLE
d74e8afc 822X<chop>
a0d0e21e 823
313c9f5c 824=item chop( LIST )
a0d0e21e
LW
825
826=item chop
827
828Chops off the last character of a string and returns the character
5b3eff12 829chopped. It is much more efficient than C<s/.$//s> because it neither
7660c0ab 830scans nor copies the string. If VARIABLE is omitted, chops C<$_>.
4bf21a6d
RD
831If VARIABLE is a hash, it chops the hash's values, but not its keys.
832
5b3eff12 833You can actually chop anything that's an lvalue, including an assignment.
a0d0e21e
LW
834
835If you chop a list, each element is chopped. Only the value of the
19799a22 836last C<chop> is returned.
a0d0e21e 837
19799a22 838Note that C<chop> returns the last character. To return all but the last
748a9306
LW
839character, use C<substr($string, 0, -1)>.
840
15e44fd8
RGS
841See also L</chomp>.
842
a0d0e21e 843=item chown LIST
d74e8afc 844X<chown> X<owner> X<user> X<group>
a0d0e21e
LW
845
846Changes the owner (and group) of a list of files. The first two
19799a22
GS
847elements of the list must be the I<numeric> uid and gid, in that
848order. A value of -1 in either position is interpreted by most
849systems to leave that value unchanged. Returns the number of files
850successfully changed.
a0d0e21e
LW
851
852 $cnt = chown $uid, $gid, 'foo', 'bar';
853 chown $uid, $gid, @filenames;
854
3b10bc60 855On systems that support fchown(2), you may pass filehandles among the
856files. On systems that don't support fchown(2), passing filehandles raises
857an exception. Filehandles must be passed as globs or glob references to be
858recognized; barewords are considered filenames.
c4aca7d0 859
54310121 860Here's an example that looks up nonnumeric uids in the passwd file:
a0d0e21e
LW
861
862 print "User: ";
19799a22 863 chomp($user = <STDIN>);
5a964f20 864 print "Files: ";
19799a22 865 chomp($pattern = <STDIN>);
a0d0e21e
LW
866
867 ($login,$pass,$uid,$gid) = getpwnam($user)
a9a5a0dc 868 or die "$user not in passwd file";
a0d0e21e 869
5ed4f2ec 870 @ary = glob($pattern); # expand filenames
a0d0e21e
LW
871 chown $uid, $gid, @ary;
872
54310121 873On most systems, you are not allowed to change the ownership of the
4633a7c4
LW
874file unless you're the superuser, although you should be able to change
875the group to any of your secondary groups. On insecure systems, these
876restrictions may be relaxed, but this is not a portable assumption.
19799a22
GS
877On POSIX systems, you can detect this condition this way:
878
879 use POSIX qw(sysconf _PC_CHOWN_RESTRICTED);
880 $can_chown_giveaway = not sysconf(_PC_CHOWN_RESTRICTED);
4633a7c4 881
ea9eb35a
BJ
882Portability issues: L<perlport/chmod>.
883
a0d0e21e 884=item chr NUMBER
d74e8afc 885X<chr> X<character> X<ASCII> X<Unicode>
a0d0e21e 886
54310121 887=item chr
bbce6d69 888
a0d0e21e 889Returns the character represented by that NUMBER in the character set.
a0ed51b3 890For example, C<chr(65)> is C<"A"> in either ASCII or Unicode, and
2575c402 891chr(0x263a) is a Unicode smiley face.
aaa68c4a 892
8a064bd6 893Negative values give the Unicode replacement character (chr(0xfffd)),
80d38338 894except under the L<bytes> pragma, where the low eight bits of the value
8a064bd6
JH
895(truncated to an integer) are used.
896
974da8e5
JH
897If NUMBER is omitted, uses C<$_>.
898
b76cc8ba 899For the reverse, use L</ord>.
a0d0e21e 900
2575c402
JW
901Note that characters from 128 to 255 (inclusive) are by default
902internally not encoded as UTF-8 for backward compatibility reasons.
974da8e5 903
2575c402 904See L<perlunicode> for more about Unicode.
bbce6d69 905
a0d0e21e 906=item chroot FILENAME
d74e8afc 907X<chroot> X<root>
a0d0e21e 908
54310121 909=item chroot
bbce6d69 910
5a964f20 911This function works like the system call by the same name: it makes the
4633a7c4 912named directory the new root directory for all further pathnames that
951ba7fe 913begin with a C</> by your process and all its children. (It doesn't
28757baa 914change your current working directory, which is unaffected.) For security
4633a7c4 915reasons, this call is restricted to the superuser. If FILENAME is
19799a22 916omitted, does a C<chroot> to C<$_>.
a0d0e21e 917
ea9eb35a
BJ
918Portability issues: L<perlport/chroot>.
919
a0d0e21e 920=item close FILEHANDLE
d74e8afc 921X<close>
a0d0e21e 922
6a518fbc
TP
923=item close
924
3b10bc60 925Closes the file or pipe associated with the filehandle, flushes the IO
e0f13c26 926buffers, and closes the system file descriptor. Returns true if those
8f1da26d 927operations succeed and if no error was reported by any PerlIO
e0f13c26
RGS
928layer. Closes the currently selected filehandle if the argument is
929omitted.
fb73857a 930
931You don't have to close FILEHANDLE if you are immediately going to do
3b10bc60 932another C<open> on it, because C<open> closes it for you. (See
01aa884e 933L<open|/open FILEHANDLE>.) However, an explicit C<close> on an input file resets the line
19799a22 934counter (C<$.>), while the implicit close done by C<open> does not.
fb73857a 935
3b10bc60 936If the filehandle came from a piped open, C<close> returns false if one of
937the other syscalls involved fails or if its program exits with non-zero
938status. If the only problem was that the program exited non-zero, C<$!>
939will be set to C<0>. Closing a pipe also waits for the process executing
940on the pipe to exit--in case you wish to look at the output of the pipe
941afterwards--and implicitly puts the exit status value of that command into
942C<$?> and C<${^CHILD_ERROR_NATIVE}>.
5a964f20 943
2e0cfa16
FC
944If there are multiple threads running, C<close> on a filehandle from a
945piped open returns true without waiting for the child process to terminate,
946if the filehandle is still open in another thread.
947
80d38338
TC
948Closing the read end of a pipe before the process writing to it at the
949other end is done writing results in the writer receiving a SIGPIPE. If
950the other end can't handle that, be sure to read all the data before
951closing the pipe.
73689b13 952
fb73857a 953Example:
a0d0e21e 954
fb73857a 955 open(OUTPUT, '|sort >foo') # pipe to sort
956 or die "Can't start sort: $!";
5ed4f2ec 957 #... # print stuff to output
958 close OUTPUT # wait for sort to finish
fb73857a 959 or warn $! ? "Error closing sort pipe: $!"
960 : "Exit status $? from sort";
5ed4f2ec 961 open(INPUT, 'foo') # get sort's results
fb73857a 962 or die "Can't open 'foo' for input: $!";
a0d0e21e 963
5a964f20 964FILEHANDLE may be an expression whose value can be used as an indirect
8f1da26d 965filehandle, usually the real filehandle name or an autovivified handle.
a0d0e21e
LW
966
967=item closedir DIRHANDLE
d74e8afc 968X<closedir>
a0d0e21e 969
19799a22 970Closes a directory opened by C<opendir> and returns the success of that
5a964f20
TC
971system call.
972
a0d0e21e 973=item connect SOCKET,NAME
d74e8afc 974X<connect>
a0d0e21e 975
80d38338
TC
976Attempts to connect to a remote socket, just like connect(2).
977Returns true if it succeeded, false otherwise. NAME should be a
4633a7c4
LW
978packed address of the appropriate type for the socket. See the examples in
979L<perlipc/"Sockets: Client/Server Communication">.
a0d0e21e 980
cb1a09d0 981=item continue BLOCK
d74e8afc 982X<continue>
cb1a09d0 983
0d863452
RH
984=item continue
985
4a904372
FC
986When followed by a BLOCK, C<continue> is actually a
987flow control statement rather than a function. If
cf264981 988there is a C<continue> BLOCK attached to a BLOCK (typically in a C<while> or
98293880
JH
989C<foreach>), it is always executed just before the conditional is about to
990be evaluated again, just like the third part of a C<for> loop in C. Thus
cb1a09d0
AD
991it can be used to increment a loop variable, even when the loop has been
992continued via the C<next> statement (which is similar to the C C<continue>
993statement).
994
98293880 995C<last>, C<next>, or C<redo> may appear within a C<continue>
3b10bc60 996block; C<last> and C<redo> behave as if they had been executed within
19799a22 997the main block. So will C<next>, but since it will execute a C<continue>
1d2dff63
GS
998block, it may be more entertaining.
999
1000 while (EXPR) {
a9a5a0dc
VP
1001 ### redo always comes here
1002 do_something;
1d2dff63 1003 } continue {
a9a5a0dc
VP
1004 ### next always comes here
1005 do_something_else;
1006 # then back the top to re-check EXPR
1d2dff63
GS
1007 }
1008 ### last always comes here
1009
3b10bc60 1010Omitting the C<continue> section is equivalent to using an
1011empty one, logically enough, so C<next> goes directly back
1d2dff63
GS
1012to check the condition at the top of the loop.
1013
4a904372 1014When there is no BLOCK, C<continue> is a function that
8f1da26d
TC
1015falls through the current C<when> or C<default> block instead of iterating
1016a dynamically enclosing C<foreach> or exiting a lexically enclosing C<given>.
4a904372
FC
1017In Perl 5.14 and earlier, this form of C<continue> was
1018only available when the C<"switch"> feature was enabled.
8f1da26d
TC
1019See L<feature> and L<perlsyn/"Switch statements"> for more
1020information.
0d863452 1021
a0d0e21e 1022=item cos EXPR
d74e8afc 1023X<cos> X<cosine> X<acos> X<arccosine>
a0d0e21e 1024
d6217f1e
GS
1025=item cos
1026
5a964f20 1027Returns the cosine of EXPR (expressed in radians). If EXPR is omitted,
34169887 1028takes the cosine of C<$_>.
a0d0e21e 1029
ca6e1c26 1030For the inverse cosine operation, you may use the C<Math::Trig::acos()>
28757baa 1031function, or use this relation:
1032
1033 sub acos { atan2( sqrt(1 - $_[0] * $_[0]), $_[0] ) }
1034
a0d0e21e 1035=item crypt PLAINTEXT,SALT
d74e8afc 1036X<crypt> X<digest> X<hash> X<salt> X<plaintext> X<password>
f723aae1 1037X<decrypt> X<cryptography> X<passwd> X<encrypt>
a0d0e21e 1038
ef2e6798
MS
1039Creates a digest string exactly like the crypt(3) function in the C
1040library (assuming that you actually have a version there that has not
bb23f8d1 1041been extirpated as a potential munition).
ef2e6798 1042
34169887 1043crypt() is a one-way hash function. The PLAINTEXT and SALT are turned
ef2e6798
MS
1044into a short string, called a digest, which is returned. The same
1045PLAINTEXT and SALT will always return the same string, but there is no
1046(known) way to get the original PLAINTEXT from the hash. Small
1047changes in the PLAINTEXT or SALT will result in large changes in the
1048digest.
1049
1050There is no decrypt function. This function isn't all that useful for
1051cryptography (for that, look for F<Crypt> modules on your nearby CPAN
1052mirror) and the name "crypt" is a bit of a misnomer. Instead it is
1053primarily used to check if two pieces of text are the same without
1054having to transmit or store the text itself. An example is checking
1055if a correct password is given. The digest of the password is stored,
cf264981 1056not the password itself. The user types in a password that is
ef2e6798 1057crypt()'d with the same salt as the stored digest. If the two digests
34169887 1058match, the password is correct.
ef2e6798
MS
1059
1060When verifying an existing digest string you should use the digest as
1061the salt (like C<crypt($plain, $digest) eq $digest>). The SALT used
cf264981 1062to create the digest is visible as part of the digest. This ensures
ef2e6798
MS
1063crypt() will hash the new string with the same salt as the digest.
1064This allows your code to work with the standard L<crypt|/crypt> and
8f1da26d
TC
1065with more exotic implementations. In other words, assume
1066nothing about the returned string itself nor about how many bytes
1067of SALT may matter.
85c16d83
JH
1068
1069Traditionally the result is a string of 13 bytes: two first bytes of
1070the salt, followed by 11 bytes from the set C<[./0-9A-Za-z]>, and only
bb23f8d1 1071the first eight bytes of PLAINTEXT mattered. But alternative
ef2e6798 1072hashing schemes (like MD5), higher level security schemes (like C2),
e1020413 1073and implementations on non-Unix platforms may produce different
ef2e6798 1074strings.
85c16d83
JH
1075
1076When choosing a new salt create a random two character string whose
1077characters come from the set C<[./0-9A-Za-z]> (like C<join '', ('.',
d3989d75
CW
1078'/', 0..9, 'A'..'Z', 'a'..'z')[rand 64, rand 64]>). This set of
1079characters is just a recommendation; the characters allowed in
1080the salt depend solely on your system's crypt library, and Perl can't
1081restrict what salts C<crypt()> accepts.
e71965be 1082
a0d0e21e 1083Here's an example that makes sure that whoever runs this program knows
cf264981 1084their password:
a0d0e21e
LW
1085
1086 $pwd = (getpwuid($<))[1];
a0d0e21e
LW
1087
1088 system "stty -echo";
1089 print "Password: ";
e71965be 1090 chomp($word = <STDIN>);
a0d0e21e
LW
1091 print "\n";
1092 system "stty echo";
1093
e71965be 1094 if (crypt($word, $pwd) ne $pwd) {
a9a5a0dc 1095 die "Sorry...\n";
a0d0e21e 1096 } else {
a9a5a0dc 1097 print "ok\n";
54310121 1098 }
a0d0e21e 1099
9f8f0c9d 1100Of course, typing in your own password to whoever asks you
748a9306 1101for it is unwise.
a0d0e21e 1102
ef2e6798 1103The L<crypt|/crypt> function is unsuitable for hashing large quantities
19799a22 1104of data, not least of all because you can't get the information
ef2e6798 1105back. Look at the L<Digest> module for more robust algorithms.
19799a22 1106
f2791508
JH
1107If using crypt() on a Unicode string (which I<potentially> has
1108characters with codepoints above 255), Perl tries to make sense
34169887 1109of the situation by trying to downgrade (a copy of)
f2791508
JH
1110the string back to an eight-bit byte string before calling crypt()
1111(on that copy). If that works, good. If not, crypt() dies with
1112C<Wide character in crypt>.
85c16d83 1113
ea9eb35a
BJ
1114Portability issues: L<perlport/crypt>.
1115
aa689395 1116=item dbmclose HASH
d74e8afc 1117X<dbmclose>
a0d0e21e 1118
19799a22 1119[This function has been largely superseded by the C<untie> function.]
a0d0e21e 1120
aa689395 1121Breaks the binding between a DBM file and a hash.
a0d0e21e 1122
ea9eb35a
BJ
1123Portability issues: L<perlport/dbmclose>.
1124
19799a22 1125=item dbmopen HASH,DBNAME,MASK
d74e8afc 1126X<dbmopen> X<dbm> X<ndbm> X<sdbm> X<gdbm>
a0d0e21e 1127
01aa884e
KW
1128[This function has been largely superseded by the
1129L<tie|/tie VARIABLE,CLASSNAME,LIST> function.]
a0d0e21e 1130
7b8d334a 1131This binds a dbm(3), ndbm(3), sdbm(3), gdbm(3), or Berkeley DB file to a
19799a22
GS
1132hash. HASH is the name of the hash. (Unlike normal C<open>, the first
1133argument is I<not> a filehandle, even though it looks like one). DBNAME
aa689395 1134is the name of the database (without the F<.dir> or F<.pag> extension if
1135any). If the database does not exist, it is created with protection
19799a22 1136specified by MASK (as modified by the C<umask>). If your system supports
80d38338 1137only the older DBM functions, you may make only one C<dbmopen> call in your
aa689395 1138program. In older versions of Perl, if your system had neither DBM nor
19799a22 1139ndbm, calling C<dbmopen> produced a fatal error; it now falls back to
aa689395 1140sdbm(3).
1141
1142If you don't have write access to the DBM file, you can only read hash
1143variables, not set them. If you want to test whether you can write,
3b10bc60 1144either use file tests or try setting a dummy hash entry inside an C<eval>
1145to trap the error.
a0d0e21e 1146
19799a22
GS
1147Note that functions such as C<keys> and C<values> may return huge lists
1148when used on large DBM files. You may prefer to use the C<each>
a0d0e21e
LW
1149function to iterate over large DBM files. Example:
1150
1151 # print out history file offsets
1152 dbmopen(%HIST,'/usr/lib/news/history',0666);
1153 while (($key,$val) = each %HIST) {
a9a5a0dc 1154 print $key, ' = ', unpack('L',$val), "\n";
a0d0e21e
LW
1155 }
1156 dbmclose(%HIST);
1157
cb1a09d0 1158See also L<AnyDBM_File> for a more general description of the pros and
184e9718 1159cons of the various dbm approaches, as well as L<DB_File> for a particularly
cb1a09d0 1160rich implementation.
4633a7c4 1161
2b5ab1e7
TC
1162You can control which DBM library you use by loading that library
1163before you call dbmopen():
1164
1165 use DB_File;
1166 dbmopen(%NS_Hist, "$ENV{HOME}/.netscape/history.db")
a9a5a0dc 1167 or die "Can't open netscape history file: $!";
2b5ab1e7 1168
ea9eb35a
BJ
1169Portability issues: L<perlport/dbmopen>.
1170
8f1da26d
TC
1171=item default BLOCK
1172
1173Within a C<foreach> or a C<given>, a C<default> BLOCK acts like a C<when>
1174that's always true. Only available after Perl 5.10, and only if the
4a904372
FC
1175C<switch> feature has been requested or if the keyword is prefixed with
1176C<CORE::>. See L</when>.
8f1da26d 1177
a0d0e21e 1178=item defined EXPR
d74e8afc 1179X<defined> X<undef> X<undefined>
a0d0e21e 1180
54310121 1181=item defined
bbce6d69 1182
2f9daede 1183Returns a Boolean value telling whether EXPR has a value other than
3b10bc60 1184the undefined value C<undef>. If EXPR is not present, C<$_> is
2f9daede
TP
1185checked.
1186
1187Many operations return C<undef> to indicate failure, end of file,
1188system error, uninitialized variable, and other exceptional
1189conditions. This function allows you to distinguish C<undef> from
1190other values. (A simple Boolean test will not distinguish among
7660c0ab 1191C<undef>, zero, the empty string, and C<"0">, which are all equally
2f9daede 1192false.) Note that since C<undef> is a valid scalar, its presence
19799a22 1193doesn't I<necessarily> indicate an exceptional condition: C<pop>
2f9daede
TP
1194returns C<undef> when its argument is an empty array, I<or> when the
1195element to return happens to be C<undef>.
1196
f10b0346
GS
1197You may also use C<defined(&func)> to check whether subroutine C<&func>
1198has ever been defined. The return value is unaffected by any forward
80d38338 1199declarations of C<&func>. A subroutine that is not defined
847c7ebe 1200may still be callable: its package may have an C<AUTOLOAD> method that
3b10bc60 1201makes it spring into existence the first time that it is called; see
847c7ebe 1202L<perlsub>.
f10b0346
GS
1203
1204Use of C<defined> on aggregates (hashes and arrays) is deprecated. It
34169887 1205used to report whether memory for that aggregate had ever been
f10b0346
GS
1206allocated. This behavior may disappear in future versions of Perl.
1207You should instead use a simple test for size:
1208
1209 if (@an_array) { print "has array elements\n" }
1210 if (%a_hash) { print "has hash members\n" }
2f9daede
TP
1211
1212When used on a hash element, it tells you whether the value is defined,
dc848c6f 1213not whether the key exists in the hash. Use L</exists> for the latter
2f9daede 1214purpose.
a0d0e21e
LW
1215
1216Examples:
1217
8f1da26d 1218 print if defined $switch{D};
a0d0e21e
LW
1219 print "$val\n" while defined($val = pop(@ary));
1220 die "Can't readlink $sym: $!"
a9a5a0dc 1221 unless defined($value = readlink $sym);
a0d0e21e 1222 sub foo { defined &$bar ? &$bar(@_) : die "No bar"; }
2f9daede 1223 $debugging = 0 unless defined $debugging;
a0d0e21e 1224
8f1da26d 1225Note: Many folks tend to overuse C<defined> and are then surprised to
7660c0ab 1226discover that the number C<0> and C<""> (the zero-length string) are, in fact,
2f9daede 1227defined values. For example, if you say
a5f75d66
AD
1228
1229 "ab" =~ /a(.*)b/;
1230
80d38338 1231The pattern match succeeds and C<$1> is defined, although it
cf264981 1232matched "nothing". It didn't really fail to match anything. Rather, it
2b5ab1e7 1233matched something that happened to be zero characters long. This is all
a5f75d66 1234very above-board and honest. When a function returns an undefined value,
2f9daede 1235it's an admission that it couldn't give you an honest answer. So you
3b10bc60 1236should use C<defined> only when questioning the integrity of what
7660c0ab 1237you're trying to do. At other times, a simple comparison to C<0> or C<""> is
2f9daede
TP
1238what you want.
1239
dc848c6f 1240See also L</undef>, L</exists>, L</ref>.
2f9daede 1241
a0d0e21e 1242=item delete EXPR
d74e8afc 1243X<delete>
a0d0e21e 1244
d0a76353
RS
1245Given an expression that specifies an element or slice of a hash, C<delete>
1246deletes the specified elements from that hash so that exists() on that element
1247no longer returns true. Setting a hash element to the undefined value does
1248not remove its key, but deleting it does; see L</exists>.
80d38338 1249
8f1da26d 1250In list context, returns the value or values deleted, or the last such
80d38338 1251element in scalar context. The return list's length always matches that of
d0a76353
RS
1252the argument list: deleting non-existent elements returns the undefined value
1253in their corresponding positions.
80d38338 1254
d0a76353
RS
1255delete() may also be used on arrays and array slices, but its behavior is less
1256straightforward. Although exists() will return false for deleted entries,
1257deleting array elements never changes indices of existing values; use shift()
1258or splice() for that. However, if all deleted elements fall at the end of an
1259array, the array's size shrinks to the position of the highest element that
1260still tests true for exists(), or to 0 if none do.
1261
8f1da26d 1262B<WARNING:> Calling delete on array values is deprecated and likely to
d0a76353 1263be removed in a future version of Perl.
80d38338
TC
1264
1265Deleting from C<%ENV> modifies the environment. Deleting from a hash tied to
1266a DBM file deletes the entry from the DBM file. Deleting from a C<tied> hash
1267or array may not necessarily return anything; it depends on the implementation
1268of the C<tied> package's DELETE method, which may do whatever it pleases.
a0d0e21e 1269
80d38338
TC
1270The C<delete local EXPR> construct localizes the deletion to the current
1271block at run time. Until the block exits, elements locally deleted
1272temporarily no longer exist. See L<perlsub/"Localized deletion of elements
1273of composite types">.
eba0920a
EM
1274
1275 %hash = (foo => 11, bar => 22, baz => 33);
1276 $scalar = delete $hash{foo}; # $scalar is 11
1277 $scalar = delete @hash{qw(foo bar)}; # $scalar is 22
1278 @array = delete @hash{qw(foo bar baz)}; # @array is (undef,undef,33)
1279
01020589 1280The following (inefficiently) deletes all the values of %HASH and @ARRAY:
a0d0e21e 1281
5f05dabc 1282 foreach $key (keys %HASH) {
a9a5a0dc 1283 delete $HASH{$key};
a0d0e21e
LW
1284 }
1285
01020589 1286 foreach $index (0 .. $#ARRAY) {
a9a5a0dc 1287 delete $ARRAY[$index];
01020589
GS
1288 }
1289
1290And so do these:
5f05dabc 1291
01020589
GS
1292 delete @HASH{keys %HASH};
1293
9740c838 1294 delete @ARRAY[0 .. $#ARRAY];
5f05dabc 1295
80d38338
TC
1296But both are slower than assigning the empty list
1297or undefining %HASH or @ARRAY, which is the customary
1298way to empty out an aggregate:
01020589 1299
5ed4f2ec 1300 %HASH = (); # completely empty %HASH
1301 undef %HASH; # forget %HASH ever existed
2b5ab1e7 1302
5ed4f2ec 1303 @ARRAY = (); # completely empty @ARRAY
1304 undef @ARRAY; # forget @ARRAY ever existed
2b5ab1e7 1305
80d38338
TC
1306The EXPR can be arbitrarily complicated provided its
1307final operation is an element or slice of an aggregate:
a0d0e21e
LW
1308
1309 delete $ref->[$x][$y]{$key};
5f05dabc 1310 delete @{$ref->[$x][$y]}{$key1, $key2, @morekeys};
a0d0e21e 1311
01020589
GS
1312 delete $ref->[$x][$y][$index];
1313 delete @{$ref->[$x][$y]}[$index1, $index2, @moreindices];
1314
a0d0e21e 1315=item die LIST
d74e8afc 1316X<die> X<throw> X<exception> X<raise> X<$@> X<abort>
a0d0e21e 1317
4c050ad5
NC
1318C<die> raises an exception. Inside an C<eval> the error message is stuffed
1319into C<$@> and the C<eval> is terminated with the undefined value.
1320If the exception is outside of all enclosing C<eval>s, then the uncaught
1321exception prints LIST to C<STDERR> and exits with a non-zero value. If you
96090e4f 1322need to exit the process with a specific exit code, see L</exit>.
a0d0e21e
LW
1323
1324Equivalent examples:
1325
1326 die "Can't cd to spool: $!\n" unless chdir '/usr/spool/news';
54310121 1327 chdir '/usr/spool/news' or die "Can't cd to spool: $!\n"
a0d0e21e 1328
ccac6780 1329If the last element of LIST does not end in a newline, the current
df37ec69
WW
1330script line number and input line number (if any) are also printed,
1331and a newline is supplied. Note that the "input line number" (also
1332known as "chunk") is subject to whatever notion of "line" happens to
1333be currently in effect, and is also available as the special variable
1334C<$.>. See L<perlvar/"$/"> and L<perlvar/"$.">.
1335
1336Hint: sometimes appending C<", stopped"> to your message will cause it
1337to make better sense when the string C<"at foo line 123"> is appended.
1338Suppose you are running script "canasta".
a0d0e21e
LW
1339
1340 die "/etc/games is no good";
1341 die "/etc/games is no good, stopped";
1342
1343produce, respectively
1344
1345 /etc/games is no good at canasta line 123.
1346 /etc/games is no good, stopped at canasta line 123.
1347
a96d0188 1348If the output is empty and C<$@> already contains a value (typically from a
7660c0ab 1349previous eval) that value is reused after appending C<"\t...propagated">.
fb73857a 1350This is useful for propagating exceptions:
1351
1352 eval { ... };
1353 die unless $@ =~ /Expected exception/;
1354
a96d0188 1355If the output is empty and C<$@> contains an object reference that has a
ad216e65
JH
1356C<PROPAGATE> method, that method will be called with additional file
1357and line number parameters. The return value replaces the value in
34169887 1358C<$@>; i.e., as if C<< $@ = eval { $@->PROPAGATE(__FILE__, __LINE__) }; >>
ad216e65
JH
1359were called.
1360
7660c0ab 1361If C<$@> is empty then the string C<"Died"> is used.
fb73857a 1362
4c050ad5
NC
1363If an uncaught exception results in interpreter exit, the exit code is
1364determined from the values of C<$!> and C<$?> with this pseudocode:
1365
1366 exit $! if $!; # errno
1367 exit $? >> 8 if $? >> 8; # child exit status
1368 exit 255; # last resort
1369
1370The intent is to squeeze as much possible information about the likely cause
1371into the limited space of the system exit code. However, as C<$!> is the value
1372of C's C<errno>, which can be set by any system call, this means that the value
1373of the exit code used by C<die> can be non-predictable, so should not be relied
1374upon, other than to be non-zero.
1375
80d38338
TC
1376You can also call C<die> with a reference argument, and if this is trapped
1377within an C<eval>, C<$@> contains that reference. This permits more
1378elaborate exception handling using objects that maintain arbitrary state
1379about the exception. Such a scheme is sometimes preferable to matching
1380particular string values of C<$@> with regular expressions. Because C<$@>
1381is a global variable and C<eval> may be used within object implementations,
1382be careful that analyzing the error object doesn't replace the reference in
1383the global variable. It's easiest to make a local copy of the reference
1384before any manipulations. Here's an example:
52531d10 1385
80d38338 1386 use Scalar::Util "blessed";
da279afe 1387
52531d10 1388 eval { ... ; die Some::Module::Exception->new( FOO => "bar" ) };
746d7dd7
GL
1389 if (my $ev_err = $@) {
1390 if (blessed($ev_err) && $ev_err->isa("Some::Module::Exception")) {
52531d10
GS
1391 # handle Some::Module::Exception
1392 }
1393 else {
1394 # handle all other possible exceptions
1395 }
1396 }
1397
3b10bc60 1398Because Perl stringifies uncaught exception messages before display,
80d38338 1399you'll probably want to overload stringification operations on
52531d10
GS
1400exception objects. See L<overload> for details about that.
1401
19799a22
GS
1402You can arrange for a callback to be run just before the C<die>
1403does its deed, by setting the C<$SIG{__DIE__}> hook. The associated
3b10bc60 1404handler is called with the error text and can change the error
19799a22 1405message, if it sees fit, by calling C<die> again. See
96090e4f 1406L<perlvar/%SIG> for details on setting C<%SIG> entries, and
cf264981 1407L<"eval BLOCK"> for some examples. Although this feature was
19799a22 1408to be run only right before your program was to exit, this is not
3b10bc60 1409currently so: the C<$SIG{__DIE__}> hook is currently called
19799a22
GS
1410even inside eval()ed blocks/strings! If one wants the hook to do
1411nothing in such situations, put
fb73857a 1412
5ed4f2ec 1413 die @_ if $^S;
fb73857a 1414
19799a22
GS
1415as the first line of the handler (see L<perlvar/$^S>). Because
1416this promotes strange action at a distance, this counterintuitive
b76cc8ba 1417behavior may be fixed in a future release.
774d564b 1418
4c050ad5
NC
1419See also exit(), warn(), and the Carp module.
1420
a0d0e21e 1421=item do BLOCK
d74e8afc 1422X<do> X<block>
a0d0e21e
LW
1423
1424Not really a function. Returns the value of the last command in the
6b275a1f
RGS
1425sequence of commands indicated by BLOCK. When modified by the C<while> or
1426C<until> loop modifier, executes the BLOCK once before testing the loop
1427condition. (On other statements the loop modifiers test the conditional
1428first.)
a0d0e21e 1429
4968c1e4 1430C<do BLOCK> does I<not> count as a loop, so the loop control statements
2b5ab1e7
TC
1431C<next>, C<last>, or C<redo> cannot be used to leave or restart the block.
1432See L<perlsyn> for alternative strategies.
4968c1e4 1433
a0d0e21e 1434=item do SUBROUTINE(LIST)
d74e8afc 1435X<do>
a0d0e21e 1436
51124b83
FC
1437This form of subroutine call is deprecated. SUBROUTINE can be a bareword,
1438a scalar variable or a subroutine beginning with C<&>.
a0d0e21e
LW
1439
1440=item do EXPR
d74e8afc 1441X<do>
a0d0e21e
LW
1442
1443Uses the value of EXPR as a filename and executes the contents of the
ea63ef19 1444file as a Perl script.
a0d0e21e
LW
1445
1446 do 'stat.pl';
1447
1448is just like
1449
986b19de 1450 eval `cat stat.pl`;
a0d0e21e 1451
2b5ab1e7 1452except that it's more efficient and concise, keeps track of the current
96090e4f
LB
1453filename for error messages, searches the C<@INC> directories, and updates
1454C<%INC> if the file is found. See L<perlvar/@INC> and L<perlvar/%INC> for
1455these variables. It also differs in that code evaluated with C<do FILENAME>
2b5ab1e7
TC
1456cannot see lexicals in the enclosing scope; C<eval STRING> does. It's the
1457same, however, in that it does reparse the file every time you call it,
1458so you probably don't want to do this inside a loop.
a0d0e21e 1459
8f1da26d 1460If C<do> can read the file but cannot compile it, it returns C<undef> and sets
9dc513c5
DG
1461an error message in C<$@>. If C<do> cannot read the file, it returns undef
1462and sets C<$!> to the error. Always check C<$@> first, as compilation
1463could fail in a way that also sets C<$!>. If the file is successfully
1464compiled, C<do> returns the value of the last expression evaluated.
8e30cc93 1465
80d38338 1466Inclusion of library modules is better done with the
19799a22 1467C<use> and C<require> operators, which also do automatic error checking
4633a7c4 1468and raise an exception if there's a problem.
a0d0e21e 1469
5a964f20
TC
1470You might like to use C<do> to read in a program configuration
1471file. Manual error checking can be done this way:
1472
b76cc8ba 1473 # read in config files: system first, then user
f86cebdf 1474 for $file ("/share/prog/defaults.rc",
b76cc8ba 1475 "$ENV{HOME}/.someprogrc")
a9a5a0dc
VP
1476 {
1477 unless ($return = do $file) {
1478 warn "couldn't parse $file: $@" if $@;
1479 warn "couldn't do $file: $!" unless defined $return;
1480 warn "couldn't run $file" unless $return;
1481 }
5a964f20
TC
1482 }
1483
a0d0e21e 1484=item dump LABEL
d74e8afc 1485X<dump> X<core> X<undump>
a0d0e21e 1486
1614b0e3
JD
1487=item dump
1488
19799a22
GS
1489This function causes an immediate core dump. See also the B<-u>
1490command-line switch in L<perlrun>, which does the same thing.
1491Primarily this is so that you can use the B<undump> program (not
1492supplied) to turn your core dump into an executable binary after
1493having initialized all your variables at the beginning of the
1494program. When the new binary is executed it will begin by executing
1495a C<goto LABEL> (with all the restrictions that C<goto> suffers).
1496Think of it as a goto with an intervening core dump and reincarnation.
1497If C<LABEL> is omitted, restarts the program from the top.
1498
1499B<WARNING>: Any files opened at the time of the dump will I<not>
1500be open any more when the program is reincarnated, with possible
80d38338 1501resulting confusion by Perl.
19799a22 1502
59f521f4
RGS
1503This function is now largely obsolete, mostly because it's very hard to
1504convert a core file into an executable. That's why you should now invoke
1505it as C<CORE::dump()>, if you don't want to be warned against a possible
ac206dc8 1506typo.
19799a22 1507
ea9eb35a
BJ
1508Portability issues: L<perlport/dump>.
1509
532eee96 1510=item each HASH
d74e8afc 1511X<each> X<hash, iterator>
aa689395 1512
532eee96 1513=item each ARRAY
aeedbbed
NC
1514X<array, iterator>
1515
f5a93a43
TC
1516=item each EXPR
1517
80d38338
TC
1518When called in list context, returns a 2-element list consisting of the key
1519and value for the next element of a hash, or the index and value for the
1520next element of an array, so that you can iterate over it. When called in
1521scalar context, returns only the key (not the value) in a hash, or the index
1522in an array.
2f9daede 1523
aeedbbed 1524Hash entries are returned in an apparently random order. The actual random
3b10bc60 1525order is subject to change in future versions of Perl, but it is
504f80c1 1526guaranteed to be in the same order as either the C<keys> or C<values>
4546b9e6 1527function would produce on the same (unmodified) hash. Since Perl
22883ac5 15285.8.2 the ordering can be different even between different runs of Perl
4546b9e6 1529for security reasons (see L<perlsec/"Algorithmic Complexity Attacks">).
ab192400 1530
80d38338
TC
1531After C<each> has returned all entries from the hash or array, the next
1532call to C<each> returns the empty list in list context and C<undef> in
1533scalar context. The next call following that one restarts iteration. Each
1534hash or array has its own internal iterator, accessed by C<each>, C<keys>,
1535and C<values>. The iterator is implicitly reset when C<each> has reached
1536the end as just described; it can be explicitly reset by calling C<keys> or
1537C<values> on the hash or array. If you add or delete a hash's elements
1538while iterating over it, entries may be skipped or duplicated--so don't do
1539that. Exception: It is always safe to delete the item most recently
3b10bc60 1540returned by C<each()>, so the following code works properly:
74fc8b5f
MJD
1541
1542 while (($key, $value) = each %hash) {
1543 print $key, "\n";
1544 delete $hash{$key}; # This is safe
1545 }
aa689395 1546
80d38338 1547This prints out your environment like the printenv(1) program,
3b10bc60 1548but in a different order:
a0d0e21e
LW
1549
1550 while (($key,$value) = each %ENV) {
a9a5a0dc 1551 print "$key=$value\n";
a0d0e21e
LW
1552 }
1553
f5a93a43
TC
1554Starting with Perl 5.14, C<each> can take a scalar EXPR, which must hold
1555reference to an unblessed hash or array. The argument will be dereferenced
1556automatically. This aspect of C<each> is considered highly experimental.
1557The exact behaviour may change in a future version of Perl.
cba5a3b0
DG
1558
1559 while (($key,$value) = each $hashref) { ... }
1560
8f1da26d 1561See also C<keys>, C<values>, and C<sort>.
a0d0e21e
LW
1562
1563=item eof FILEHANDLE
d74e8afc
ITB
1564X<eof>
1565X<end of file>
1566X<end-of-file>
a0d0e21e 1567
4633a7c4
LW
1568=item eof ()
1569
a0d0e21e
LW
1570=item eof
1571
8f1da26d 1572Returns 1 if the next read on FILEHANDLE will return end of file I<or> if
a0d0e21e 1573FILEHANDLE is not open. FILEHANDLE may be an expression whose value
5a964f20 1574gives the real filehandle. (Note that this function actually
80d38338 1575reads a character and then C<ungetc>s it, so isn't useful in an
748a9306 1576interactive context.) Do not read from a terminal file (or call
19799a22 1577C<eof(FILEHANDLE)> on it) after end-of-file is reached. File types such
748a9306
LW
1578as terminals may lose the end-of-file condition if you do.
1579
820475bd 1580An C<eof> without an argument uses the last file read. Using C<eof()>
80d38338 1581with empty parentheses is different. It refers to the pseudo file
820475bd 1582formed from the files listed on the command line and accessed via the
61eff3bc
JH
1583C<< <> >> operator. Since C<< <> >> isn't explicitly opened,
1584as a normal filehandle is, an C<eof()> before C<< <> >> has been
820475bd 1585used will cause C<@ARGV> to be examined to determine if input is
67408cae 1586available. Similarly, an C<eof()> after C<< <> >> has returned
efdd0218
RB
1587end-of-file will assume you are processing another C<@ARGV> list,
1588and if you haven't set C<@ARGV>, will read input from C<STDIN>;
1589see L<perlop/"I/O Operators">.
820475bd 1590
61eff3bc 1591In a C<< while (<>) >> loop, C<eof> or C<eof(ARGV)> can be used to
8f1da26d
TC
1592detect the end of each file, whereas C<eof()> will detect the end
1593of the very last file only. Examples:
a0d0e21e 1594
748a9306
LW
1595 # reset line numbering on each input file
1596 while (<>) {
a9a5a0dc
VP
1597 next if /^\s*#/; # skip comments
1598 print "$.\t$_";
5a964f20 1599 } continue {
a9a5a0dc 1600 close ARGV if eof; # Not eof()!
748a9306
LW
1601 }
1602
a0d0e21e
LW
1603 # insert dashes just before last line of last file
1604 while (<>) {
a9a5a0dc
VP
1605 if (eof()) { # check for end of last file
1606 print "--------------\n";
1607 }
1608 print;
4d0444a3 1609 last if eof(); # needed if we're reading from a terminal
a0d0e21e
LW
1610 }
1611
a0d0e21e 1612Practical hint: you almost never need to use C<eof> in Perl, because the
8f1da26d
TC
1613input operators typically return C<undef> when they run out of data or
1614encounter an error.
a0d0e21e
LW
1615
1616=item eval EXPR
d74e8afc 1617X<eval> X<try> X<catch> X<evaluate> X<parse> X<execute>
f723aae1 1618X<error, handling> X<exception, handling>
a0d0e21e
LW
1619
1620=item eval BLOCK
1621
ce2984c3
PF
1622=item eval
1623
c7cc6f1c
GS
1624In the first form, the return value of EXPR is parsed and executed as if it
1625were a little Perl program. The value of the expression (which is itself
8f1da26d 1626determined within scalar context) is first parsed, and if there were no
2341804c 1627errors, executed as a block within the lexical context of the current Perl
df4833a8 1628program. This means, that in particular, any outer lexical variables are
2341804c
DM
1629visible to it, and any package variable settings or subroutine and format
1630definitions remain afterwards.
1631
1632Note that the value is parsed every time the C<eval> executes.
be3174d2
GS
1633If EXPR is omitted, evaluates C<$_>. This form is typically used to
1634delay parsing and subsequent execution of the text of EXPR until run time.
c7cc6f1c
GS
1635
1636In the second form, the code within the BLOCK is parsed only once--at the
cf264981 1637same time the code surrounding the C<eval> itself was parsed--and executed
c7cc6f1c
GS
1638within the context of the current Perl program. This form is typically
1639used to trap exceptions more efficiently than the first (see below), while
1640also providing the benefit of checking the code within BLOCK at compile
1641time.
1642
1643The final semicolon, if any, may be omitted from the value of EXPR or within
1644the BLOCK.
1645
1646In both forms, the value returned is the value of the last expression
5a964f20 1647evaluated inside the mini-program; a return statement may be also used, just
c7cc6f1c 1648as with subroutines. The expression providing the return value is evaluated
cf264981
SP
1649in void, scalar, or list context, depending on the context of the C<eval>
1650itself. See L</wantarray> for more on how the evaluation context can be
1651determined.
a0d0e21e 1652
19799a22 1653If there is a syntax error or runtime error, or a C<die> statement is
8f1da26d 1654executed, C<eval> returns C<undef> in scalar context
1f0d8f98
FC
1655or an empty list--or, for syntax errors, a list containing a single
1656undefined value--in list context, and C<$@> is set to the error
1657message. The discrepancy in the return values in list context is
1658considered a bug by some, and will probably be fixed in a future
9cc672d4
FC
1659release. If there was no error, C<$@> is set to the empty string. A
1660control flow operator like C<last> or C<goto> can bypass the setting of
1661C<$@>. Beware that using C<eval> neither silences Perl from printing
c7cc6f1c 1662warnings to STDERR, nor does it stuff the text of warning messages into C<$@>.
d9984052
A
1663To do either of those, you have to use the C<$SIG{__WARN__}> facility, or
1664turn off warnings inside the BLOCK or EXPR using S<C<no warnings 'all'>>.
1665See L</warn>, L<perlvar>, L<warnings> and L<perllexwarn>.
a0d0e21e 1666
19799a22
GS
1667Note that, because C<eval> traps otherwise-fatal errors, it is useful for
1668determining whether a particular feature (such as C<socket> or C<symlink>)
82bcec1b 1669is implemented. It is also Perl's exception-trapping mechanism, where
a0d0e21e
LW
1670the die operator is used to raise exceptions.
1671
5f1da31c
NT
1672If you want to trap errors when loading an XS module, some problems with
1673the binary interface (such as Perl version skew) may be fatal even with
df4833a8 1674C<eval> unless C<$ENV{PERL_DL_NONLAZY}> is set. See L<perlrun>.
5f1da31c 1675
a0d0e21e
LW
1676If the code to be executed doesn't vary, you may use the eval-BLOCK
1677form to trap run-time errors without incurring the penalty of
1678recompiling each time. The error, if any, is still returned in C<$@>.
1679Examples:
1680
54310121 1681 # make divide-by-zero nonfatal
a0d0e21e
LW
1682 eval { $answer = $a / $b; }; warn $@ if $@;
1683
1684 # same thing, but less efficient
1685 eval '$answer = $a / $b'; warn $@ if $@;
1686
1687 # a compile-time error
5ed4f2ec 1688 eval { $answer = }; # WRONG
a0d0e21e
LW
1689
1690 # a run-time error
5ed4f2ec 1691 eval '$answer ='; # sets $@
a0d0e21e 1692
cf264981
SP
1693Using the C<eval{}> form as an exception trap in libraries does have some
1694issues. Due to the current arguably broken state of C<__DIE__> hooks, you
1695may wish not to trigger any C<__DIE__> hooks that user code may have installed.
2b5ab1e7 1696You can use the C<local $SIG{__DIE__}> construct for this purpose,
80d38338 1697as this example shows:
774d564b 1698
80d38338 1699 # a private exception trap for divide-by-zero
f86cebdf
GS
1700 eval { local $SIG{'__DIE__'}; $answer = $a / $b; };
1701 warn $@ if $@;
774d564b 1702
1703This is especially significant, given that C<__DIE__> hooks can call
19799a22 1704C<die> again, which has the effect of changing their error messages:
774d564b 1705
1706 # __DIE__ hooks may modify error messages
1707 {
f86cebdf
GS
1708 local $SIG{'__DIE__'} =
1709 sub { (my $x = $_[0]) =~ s/foo/bar/g; die $x };
c7cc6f1c
GS
1710 eval { die "foo lives here" };
1711 print $@ if $@; # prints "bar lives here"
774d564b 1712 }
1713
19799a22 1714Because this promotes action at a distance, this counterintuitive behavior
2b5ab1e7
TC
1715may be fixed in a future release.
1716
19799a22 1717With an C<eval>, you should be especially careful to remember what's
a0d0e21e
LW
1718being looked at when:
1719
5ed4f2ec 1720 eval $x; # CASE 1
1721 eval "$x"; # CASE 2
a0d0e21e 1722
5ed4f2ec 1723 eval '$x'; # CASE 3
1724 eval { $x }; # CASE 4
a0d0e21e 1725
5ed4f2ec 1726 eval "\$$x++"; # CASE 5
1727 $$x++; # CASE 6
a0d0e21e 1728
2f9daede 1729Cases 1 and 2 above behave identically: they run the code contained in
19799a22 1730the variable $x. (Although case 2 has misleading double quotes making
2f9daede 1731the reader wonder what else might be happening (nothing is).) Cases 3
7660c0ab 1732and 4 likewise behave in the same way: they run the code C<'$x'>, which
19799a22 1733does nothing but return the value of $x. (Case 4 is preferred for
2f9daede
TP
1734purely visual reasons, but it also has the advantage of compiling at
1735compile-time instead of at run-time.) Case 5 is a place where
19799a22 1736normally you I<would> like to use double quotes, except that in this
2f9daede
TP
1737particular situation, you can just use symbolic references instead, as
1738in case 6.
a0d0e21e 1739
b6538e4f
TC
1740Before Perl 5.14, the assignment to C<$@> occurred before restoration
1741of localised variables, which means that for your code to run on older
b208c909 1742versions, a temporary is required if you want to mask some but not all
8a5a710d
DN
1743errors:
1744
1745 # alter $@ on nefarious repugnancy only
1746 {
1747 my $e;
1748 {
1749 local $@; # protect existing $@
1750 eval { test_repugnancy() };
b208c909 1751 # $@ =~ /nefarious/ and die $@; # Perl 5.14 and higher only
8a5a710d
DN
1752 $@ =~ /nefarious/ and $e = $@;
1753 }
1754 die $e if defined $e
1755 }
1756
4968c1e4 1757C<eval BLOCK> does I<not> count as a loop, so the loop control statements
2b5ab1e7 1758C<next>, C<last>, or C<redo> cannot be used to leave or restart the block.
4968c1e4 1759
3b10bc60 1760An C<eval ''> executed within the C<DB> package doesn't see the usual
1761surrounding lexical scope, but rather the scope of the first non-DB piece
df4833a8 1762of code that called it. You don't normally need to worry about this unless
3b10bc60 1763you are writing a Perl debugger.
d819b83a 1764
a0d0e21e 1765=item exec LIST
d74e8afc 1766X<exec> X<execute>
a0d0e21e 1767
8bf3b016
GS
1768=item exec PROGRAM LIST
1769
3b10bc60 1770The C<exec> function executes a system command I<and never returns>;
19799a22
GS
1771use C<system> instead of C<exec> if you want it to return. It fails and
1772returns false only if the command does not exist I<and> it is executed
fb73857a 1773directly instead of via your system's command shell (see below).
a0d0e21e 1774
19799a22 1775Since it's a common mistake to use C<exec> instead of C<system>, Perl
80d38338 1776warns you if there is a following statement that isn't C<die>, C<warn>,
3b10bc60 1777or C<exit> (if C<-w> is set--but you always do that, right?). If you
19799a22 1778I<really> want to follow an C<exec> with some other statement, you
55d729e4
GS
1779can use one of these styles to avoid the warning:
1780
5a964f20
TC
1781 exec ('foo') or print STDERR "couldn't exec foo: $!";
1782 { exec ('foo') }; print STDERR "couldn't exec foo: $!";
55d729e4 1783
5a964f20 1784If there is more than one argument in LIST, or if LIST is an array
f86cebdf 1785with more than one value, calls execvp(3) with the arguments in LIST.
5a964f20
TC
1786If there is only one scalar argument or an array with one element in it,
1787the argument is checked for shell metacharacters, and if there are any,
1788the entire argument is passed to the system's command shell for parsing
1789(this is C</bin/sh -c> on Unix platforms, but varies on other platforms).
1790If there are no shell metacharacters in the argument, it is split into
b76cc8ba 1791words and passed directly to C<execvp>, which is more efficient.
19799a22 1792Examples:
a0d0e21e 1793
19799a22
GS
1794 exec '/bin/echo', 'Your arguments are: ', @ARGV;
1795 exec "sort $outfile | uniq";
a0d0e21e
LW
1796
1797If you don't really want to execute the first argument, but want to lie
1798to the program you are executing about its own name, you can specify
1799the program you actually want to run as an "indirect object" (without a
1800comma) in front of the LIST. (This always forces interpretation of the
54310121 1801LIST as a multivalued list, even if there is only a single scalar in
a0d0e21e
LW
1802the list.) Example:
1803
1804 $shell = '/bin/csh';
5ed4f2ec 1805 exec $shell '-sh'; # pretend it's a login shell
a0d0e21e
LW
1806
1807or, more directly,
1808
5ed4f2ec 1809 exec {'/bin/csh'} '-sh'; # pretend it's a login shell
a0d0e21e 1810
3b10bc60 1811When the arguments get executed via the system shell, results are
1812subject to its quirks and capabilities. See L<perlop/"`STRING`">
bb32b41a
GS
1813for details.
1814
19799a22
GS
1815Using an indirect object with C<exec> or C<system> is also more
1816secure. This usage (which also works fine with system()) forces
1817interpretation of the arguments as a multivalued list, even if the
1818list had just one argument. That way you're safe from the shell
1819expanding wildcards or splitting up words with whitespace in them.
5a964f20
TC
1820
1821 @args = ( "echo surprise" );
1822
2b5ab1e7 1823 exec @args; # subject to shell escapes
f86cebdf 1824 # if @args == 1
2b5ab1e7 1825 exec { $args[0] } @args; # safe even with one-arg list
5a964f20
TC
1826
1827The first version, the one without the indirect object, ran the I<echo>
80d38338
TC
1828program, passing it C<"surprise"> an argument. The second version didn't;
1829it tried to run a program named I<"echo surprise">, didn't find it, and set
1830C<$?> to a non-zero value indicating failure.
5a964f20 1831
80d38338 1832Beginning with v5.6.0, Perl attempts to flush all files opened for
0f897271
GS
1833output before the exec, but this may not be supported on some platforms
1834(see L<perlport>). To be safe, you may need to set C<$|> ($AUTOFLUSH
1835in English) or call the C<autoflush()> method of C<IO::Handle> on any
80d38338 1836open handles to avoid lost output.
0f897271 1837
80d38338
TC
1838Note that C<exec> will not call your C<END> blocks, nor will it invoke
1839C<DESTROY> methods on your objects.
7660c0ab 1840
ea9eb35a
BJ
1841Portability issues: L<perlport/exec>.
1842
a0d0e21e 1843=item exists EXPR
d74e8afc 1844X<exists> X<autovivification>
a0d0e21e 1845
d0a76353
RS
1846Given an expression that specifies an element of a hash, returns true if the
1847specified element in the hash has ever been initialized, even if the
1848corresponding value is undefined.
a0d0e21e 1849
5ed4f2ec 1850 print "Exists\n" if exists $hash{$key};
1851 print "Defined\n" if defined $hash{$key};
01020589
GS
1852 print "True\n" if $hash{$key};
1853
d0a76353 1854exists may also be called on array elements, but its behavior is much less
8f1da26d 1855obvious and is strongly tied to the use of L</delete> on arrays. B<Be aware>
d0a76353
RS
1856that calling exists on array values is deprecated and likely to be removed in
1857a future version of Perl.
1858
5ed4f2ec 1859 print "Exists\n" if exists $array[$index];
1860 print "Defined\n" if defined $array[$index];
01020589 1861 print "True\n" if $array[$index];
a0d0e21e 1862
8f1da26d 1863A hash or array element can be true only if it's defined and defined only if
a0d0e21e
LW
1864it exists, but the reverse doesn't necessarily hold true.
1865
afebc493
GS
1866Given an expression that specifies the name of a subroutine,
1867returns true if the specified subroutine has ever been declared, even
1868if it is undefined. Mentioning a subroutine name for exists or defined
80d38338 1869does not count as declaring it. Note that a subroutine that does not
847c7ebe
DD
1870exist may still be callable: its package may have an C<AUTOLOAD>
1871method that makes it spring into existence the first time that it is
3b10bc60 1872called; see L<perlsub>.
afebc493 1873
5ed4f2ec 1874 print "Exists\n" if exists &subroutine;
1875 print "Defined\n" if defined &subroutine;
afebc493 1876
a0d0e21e 1877Note that the EXPR can be arbitrarily complicated as long as the final
afebc493 1878operation is a hash or array key lookup or subroutine name:
a0d0e21e 1879
5ed4f2ec 1880 if (exists $ref->{A}->{B}->{$key}) { }
1881 if (exists $hash{A}{B}{$key}) { }
2b5ab1e7 1882
5ed4f2ec 1883 if (exists $ref->{A}->{B}->[$ix]) { }
1884 if (exists $hash{A}{B}[$ix]) { }
01020589 1885
afebc493
GS
1886 if (exists &{$ref->{A}{B}{$key}}) { }
1887
3b10bc60 1888Although the mostly deeply nested array or hash will not spring into
1889existence just because its existence was tested, any intervening ones will.
61eff3bc 1890Thus C<< $ref->{"A"} >> and C<< $ref->{"A"}->{"B"} >> will spring
01020589 1891into existence due to the existence test for the $key element above.
3b10bc60 1892This happens anywhere the arrow operator is used, including even here:
5a964f20 1893
2b5ab1e7 1894 undef $ref;
5ed4f2ec 1895 if (exists $ref->{"Some key"}) { }
1896 print $ref; # prints HASH(0x80d3d5c)
2b5ab1e7
TC
1897
1898This surprising autovivification in what does not at first--or even
1899second--glance appear to be an lvalue context may be fixed in a future
5a964f20 1900release.
a0d0e21e 1901
afebc493
GS
1902Use of a subroutine call, rather than a subroutine name, as an argument
1903to exists() is an error.
1904
5ed4f2ec 1905 exists &sub; # OK
1906 exists &sub(); # Error
afebc493 1907
a0d0e21e 1908=item exit EXPR
d74e8afc 1909X<exit> X<terminate> X<abort>
a0d0e21e 1910
ce2984c3
PF
1911=item exit
1912
2b5ab1e7 1913Evaluates EXPR and exits immediately with that value. Example:
a0d0e21e
LW
1914
1915 $ans = <STDIN>;
1916 exit 0 if $ans =~ /^[Xx]/;
1917
19799a22 1918See also C<die>. If EXPR is omitted, exits with C<0> status. The only
2b5ab1e7
TC
1919universally recognized values for EXPR are C<0> for success and C<1>
1920for error; other values are subject to interpretation depending on the
1921environment in which the Perl program is running. For example, exiting
192269 (EX_UNAVAILABLE) from a I<sendmail> incoming-mail filter will cause
1923the mailer to return the item undelivered, but that's not true everywhere.
a0d0e21e 1924
19799a22
GS
1925Don't use C<exit> to abort a subroutine if there's any chance that
1926someone might want to trap whatever error happened. Use C<die> instead,
1927which can be trapped by an C<eval>.
28757baa 1928
19799a22 1929The exit() function does not always exit immediately. It calls any
2b5ab1e7 1930defined C<END> routines first, but these C<END> routines may not
19799a22 1931themselves abort the exit. Likewise any object destructors that need to
60275626
FC
1932be called are called before the real exit. C<END> routines and destructors
1933can change the exit status by modifying C<$?>. If this is a problem, you
2b5ab1e7 1934can call C<POSIX:_exit($status)> to avoid END and destructor processing.
87275199 1935See L<perlmod> for details.
5a964f20 1936
ea9eb35a
BJ
1937Portability issues: L<perlport/exit>.
1938
a0d0e21e 1939=item exp EXPR
d74e8afc 1940X<exp> X<exponential> X<antilog> X<antilogarithm> X<e>
a0d0e21e 1941
54310121 1942=item exp
bbce6d69 1943
b76cc8ba 1944Returns I<e> (the natural logarithm base) to the power of EXPR.
a0d0e21e
LW
1945If EXPR is omitted, gives C<exp($_)>.
1946
1947=item fcntl FILEHANDLE,FUNCTION,SCALAR
d74e8afc 1948X<fcntl>
a0d0e21e 1949
f86cebdf 1950Implements the fcntl(2) function. You'll probably have to say
a0d0e21e
LW
1951
1952 use Fcntl;
1953
0ade1984 1954first to get the correct constant definitions. Argument processing and
3b10bc60 1955value returned work just like C<ioctl> below.
a0d0e21e
LW
1956For example:
1957
1958 use Fcntl;
5a964f20 1959 fcntl($filehandle, F_GETFL, $packed_return_buffer)
a9a5a0dc 1960 or die "can't fcntl F_GETFL: $!";
5a964f20 1961
554ad1fc 1962You don't have to check for C<defined> on the return from C<fcntl>.
951ba7fe
GS
1963Like C<ioctl>, it maps a C<0> return from the system call into
1964C<"0 but true"> in Perl. This string is true in boolean context and C<0>
2b5ab1e7
TC
1965in numeric context. It is also exempt from the normal B<-w> warnings
1966on improper numeric conversions.
5a964f20 1967
3b10bc60 1968Note that C<fcntl> raises an exception if used on a machine that
2b5ab1e7
TC
1969doesn't implement fcntl(2). See the Fcntl module or your fcntl(2)
1970manpage to learn what functions are available on your system.
a0d0e21e 1971
be2f7487
TH
1972Here's an example of setting a filehandle named C<REMOTE> to be
1973non-blocking at the system level. You'll have to negotiate C<$|>
1974on your own, though.
1975
1976 use Fcntl qw(F_GETFL F_SETFL O_NONBLOCK);
1977
1978 $flags = fcntl(REMOTE, F_GETFL, 0)
1979 or die "Can't get flags for the socket: $!\n";
1980
1981 $flags = fcntl(REMOTE, F_SETFL, $flags | O_NONBLOCK)
1982 or die "Can't set flags for the socket: $!\n";
1983
ea9eb35a
BJ
1984Portability issues: L<perlport/fcntl>.
1985
cfa52385
FC
1986=item __FILE__
1987X<__FILE__>
1988
1989A special token that returns the name of the file in which it occurs.
1990
a0d0e21e 1991=item fileno FILEHANDLE
d74e8afc 1992X<fileno>
a0d0e21e 1993
2b5ab1e7 1994Returns the file descriptor for a filehandle, or undefined if the
a7c1632d
FC
1995filehandle is not open. If there is no real file descriptor at the OS
1996level, as can happen with filehandles connected to memory objects via
1997C<open> with a reference for the third argument, -1 is returned.
1998
1999This is mainly useful for constructing
19799a22 2000bitmaps for C<select> and low-level POSIX tty-handling operations.
2b5ab1e7
TC
2001If FILEHANDLE is an expression, the value is taken as an indirect
2002filehandle, generally its name.
5a964f20 2003
b76cc8ba 2004You can use this to find out whether two handles refer to the
5a964f20
TC
2005same underlying descriptor:
2006
2007 if (fileno(THIS) == fileno(THAT)) {
a9a5a0dc 2008 print "THIS and THAT are dups\n";
b76cc8ba
NIS
2009 }
2010
a0d0e21e 2011=item flock FILEHANDLE,OPERATION
d74e8afc 2012X<flock> X<lock> X<locking>
a0d0e21e 2013
19799a22
GS
2014Calls flock(2), or an emulation of it, on FILEHANDLE. Returns true
2015for success, false on failure. Produces a fatal error if used on a
2b5ab1e7 2016machine that doesn't implement flock(2), fcntl(2) locking, or lockf(3).
dbfe1e81 2017C<flock> is Perl's portable file-locking interface, although it locks
3b10bc60 2018entire files only, not records.
2b5ab1e7
TC
2019
2020Two potentially non-obvious but traditional C<flock> semantics are
2021that it waits indefinitely until the lock is granted, and that its locks
dbfe1e81
FC
2022are B<merely advisory>. Such discretionary locks are more flexible, but
2023offer fewer guarantees. This means that programs that do not also use
2024C<flock> may modify files locked with C<flock>. See L<perlport>,
8f1da26d 2025your port's specific documentation, and your system-specific local manpages
2b5ab1e7
TC
2026for details. It's best to assume traditional behavior if you're writing
2027portable programs. (But if you're not, you should as always feel perfectly
2028free to write for your own system's idiosyncrasies (sometimes called
2029"features"). Slavish adherence to portability concerns shouldn't get
2030in the way of your getting your job done.)
a3cb178b 2031
8ebc5c01 2032OPERATION is one of LOCK_SH, LOCK_EX, or LOCK_UN, possibly combined with
2033LOCK_NB. These constants are traditionally valued 1, 2, 8 and 4, but
8f1da26d
TC
2034you can use the symbolic names if you import them from the L<Fcntl> module,
2035either individually, or as a group using the C<:flock> tag. LOCK_SH
68dc0745 2036requests a shared lock, LOCK_EX requests an exclusive lock, and LOCK_UN
ea3105be 2037releases a previously requested lock. If LOCK_NB is bitwise-or'ed with
8f1da26d 2038LOCK_SH or LOCK_EX, then C<flock> returns immediately rather than blocking
3b10bc60 2039waiting for the lock; check the return status to see if you got it.
68dc0745 2040
2b5ab1e7
TC
2041To avoid the possibility of miscoordination, Perl now flushes FILEHANDLE
2042before locking or unlocking it.
8ebc5c01 2043
f86cebdf 2044Note that the emulation built with lockf(3) doesn't provide shared
8ebc5c01 2045locks, and it requires that FILEHANDLE be open with write intent. These
2b5ab1e7 2046are the semantics that lockf(3) implements. Most if not all systems
f86cebdf 2047implement lockf(3) in terms of fcntl(2) locking, though, so the
8ebc5c01 2048differing semantics shouldn't bite too many people.
2049
becacb53
TM
2050Note that the fcntl(2) emulation of flock(3) requires that FILEHANDLE
2051be open with read intent to use LOCK_SH and requires that it be open
2052with write intent to use LOCK_EX.
2053
19799a22
GS
2054Note also that some versions of C<flock> cannot lock things over the
2055network; you would need to use the more system-specific C<fcntl> for
f86cebdf
GS
2056that. If you like you can force Perl to ignore your system's flock(2)
2057function, and so provide its own fcntl(2)-based emulation, by passing
8ebc5c01 2058the switch C<-Ud_flock> to the F<Configure> program when you configure
8f1da26d 2059and build a new Perl.
4633a7c4
LW
2060
2061Here's a mailbox appender for BSD systems.
a0d0e21e 2062
7ed5353d 2063 use Fcntl qw(:flock SEEK_END); # import LOCK_* and SEEK_END constants
a0d0e21e
LW
2064
2065 sub lock {
a9a5a0dc
VP
2066 my ($fh) = @_;
2067 flock($fh, LOCK_EX) or die "Cannot lock mailbox - $!\n";
7ed5353d 2068
a9a5a0dc
VP
2069 # and, in case someone appended while we were waiting...
2070 seek($fh, 0, SEEK_END) or die "Cannot seek - $!\n";
a0d0e21e
LW
2071 }
2072
2073 sub unlock {
a9a5a0dc
VP
2074 my ($fh) = @_;
2075 flock($fh, LOCK_UN) or die "Cannot unlock mailbox - $!\n";
a0d0e21e
LW
2076 }
2077
b0169937 2078 open(my $mbox, ">>", "/usr/spool/mail/$ENV{'USER'}")
5ed4f2ec 2079 or die "Can't open mailbox: $!";
a0d0e21e 2080
7ed5353d 2081 lock($mbox);
b0169937 2082 print $mbox $msg,"\n\n";
7ed5353d 2083 unlock($mbox);
a0d0e21e 2084
3b10bc60 2085On systems that support a real flock(2), locks are inherited across fork()
2086calls, whereas those that must resort to the more capricious fcntl(2)
2087function lose their locks, making it seriously harder to write servers.
2b5ab1e7 2088
cb1a09d0 2089See also L<DB_File> for other flock() examples.
a0d0e21e 2090
ea9eb35a
BJ
2091Portability issues: L<perlport/flock>.
2092
a0d0e21e 2093=item fork
d74e8afc 2094X<fork> X<child> X<parent>
a0d0e21e 2095
2b5ab1e7
TC
2096Does a fork(2) system call to create a new process running the
2097same program at the same point. It returns the child pid to the
2098parent process, C<0> to the child process, or C<undef> if the fork is
2099unsuccessful. File descriptors (and sometimes locks on those descriptors)
2100are shared, while everything else is copied. On most systems supporting
2101fork(), great care has gone into making it extremely efficient (for
2102example, using copy-on-write technology on data pages), making it the
2103dominant paradigm for multitasking over the last few decades.
5a964f20 2104
80d38338 2105Beginning with v5.6.0, Perl attempts to flush all files opened for
0f897271
GS
2106output before forking the child process, but this may not be supported
2107on some platforms (see L<perlport>). To be safe, you may need to set
2108C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method of
80d38338 2109C<IO::Handle> on any open handles to avoid duplicate output.
a0d0e21e 2110
19799a22 2111If you C<fork> without ever waiting on your children, you will
2b5ab1e7
TC
2112accumulate zombies. On some systems, you can avoid this by setting
2113C<$SIG{CHLD}> to C<"IGNORE">. See also L<perlipc> for more examples of
2114forking and reaping moribund children.
cb1a09d0 2115
28757baa 2116Note that if your forked child inherits system file descriptors like
2117STDIN and STDOUT that are actually connected by a pipe or socket, even
2b5ab1e7 2118if you exit, then the remote server (such as, say, a CGI script or a
19799a22 2119backgrounded job launched from a remote shell) won't think you're done.
2b5ab1e7 2120You should reopen those to F</dev/null> if it's any issue.
28757baa 2121
ea9eb35a
BJ
2122On some platforms such as Windows, where the fork() system call is not available,
2123Perl can be built to emulate fork() in the Perl interpreter. The emulation is designed to,
2124at the level of the Perl program, be as compatible as possible with the "Unix" fork().
6d17f725 2125However it has limitations that have to be considered in code intended to be portable.
ea9eb35a
BJ
2126See L<perlfork> for more details.
2127
2128Portability issues: L<perlport/fork>.
2129
cb1a09d0 2130=item format
d74e8afc 2131X<format>
cb1a09d0 2132
19799a22 2133Declare a picture format for use by the C<write> function. For
cb1a09d0
AD
2134example:
2135
54310121 2136 format Something =
a9a5a0dc
VP
2137 Test: @<<<<<<<< @||||| @>>>>>
2138 $str, $%, '$' . int($num)
cb1a09d0
AD
2139 .
2140
2141 $str = "widget";
184e9718 2142 $num = $cost/$quantity;
cb1a09d0
AD
2143 $~ = 'Something';
2144 write;
2145
2146See L<perlform> for many details and examples.
2147
8903cb82 2148=item formline PICTURE,LIST
d74e8afc 2149X<formline>
a0d0e21e 2150
5a964f20 2151This is an internal function used by C<format>s, though you may call it,
a0d0e21e
LW
2152too. It formats (see L<perlform>) a list of values according to the
2153contents of PICTURE, placing the output into the format output
7660c0ab 2154accumulator, C<$^A> (or C<$ACCUMULATOR> in English).
19799a22 2155Eventually, when a C<write> is done, the contents of
cf264981
SP
2156C<$^A> are written to some filehandle. You could also read C<$^A>
2157and then set C<$^A> back to C<"">. Note that a format typically
19799a22 2158does one C<formline> per line of form, but the C<formline> function itself
748a9306 2159doesn't care how many newlines are embedded in the PICTURE. This means
3b10bc60 2160that the C<~> and C<~~> tokens treat the entire PICTURE as a single line.
748a9306 2161You may therefore need to use multiple formlines to implement a single
3b10bc60 2162record format, just like the C<format> compiler.
748a9306 2163
19799a22 2164Be careful if you put double quotes around the picture, because an C<@>
748a9306 2165character may be taken to mean the beginning of an array name.
19799a22 2166C<formline> always returns true. See L<perlform> for other examples.
a0d0e21e 2167
445b09e5
FC
2168If you are trying to use this instead of C<write> to capture the output,
2169you may find it easier to open a filehandle to a scalar
2170(C<< open $fh, ">", \$output >>) and write to that instead.
2171
a0d0e21e 2172=item getc FILEHANDLE
f723aae1 2173X<getc> X<getchar> X<character> X<file, read>
a0d0e21e
LW
2174
2175=item getc
2176
2177Returns the next character from the input file attached to FILEHANDLE,
3b10bc60 2178or the undefined value at end of file or if there was an error (in
b5fe5ca2
SR
2179the latter case C<$!> is set). If FILEHANDLE is omitted, reads from
2180STDIN. This is not particularly efficient. However, it cannot be
2181used by itself to fetch single characters without waiting for the user
2182to hit enter. For that, try something more like:
4633a7c4
LW
2183
2184 if ($BSD_STYLE) {
a9a5a0dc 2185 system "stty cbreak </dev/tty >/dev/tty 2>&1";
4633a7c4
LW
2186 }
2187 else {
a9a5a0dc 2188 system "stty", '-icanon', 'eol', "\001";
4633a7c4
LW
2189 }
2190
2191 $key = getc(STDIN);
2192
2193 if ($BSD_STYLE) {
a9a5a0dc 2194 system "stty -cbreak </dev/tty >/dev/tty 2>&1";
4633a7c4
LW
2195 }
2196 else {
3b10bc60 2197 system 'stty', 'icanon', 'eol', '^@'; # ASCII NUL
4633a7c4
LW
2198 }
2199 print "\n";
2200
54310121 2201Determination of whether $BSD_STYLE should be set
2202is left as an exercise to the reader.
cb1a09d0 2203
19799a22 2204The C<POSIX::getattr> function can do this more portably on
2b5ab1e7 2205systems purporting POSIX compliance. See also the C<Term::ReadKey>
a3390c9f 2206module from your nearest CPAN site; details on CPAN can be found under
2b5ab1e7 2207L<perlmodlib/CPAN>.
a0d0e21e
LW
2208
2209=item getlogin
d74e8afc 2210X<getlogin> X<login>
a0d0e21e 2211
cf264981 2212This implements the C library function of the same name, which on most
3b10bc60 2213systems returns the current login from F</etc/utmp>, if any. If it
2214returns the empty string, use C<getpwuid>.
a0d0e21e 2215
f86702cc 2216 $login = getlogin || getpwuid($<) || "Kilroy";
a0d0e21e 2217
19799a22
GS
2218Do not consider C<getlogin> for authentication: it is not as
2219secure as C<getpwuid>.
4633a7c4 2220
ea9eb35a
BJ
2221Portability issues: L<perlport/getlogin>.
2222
a0d0e21e 2223=item getpeername SOCKET
d74e8afc 2224X<getpeername> X<peer>
a0d0e21e 2225
a3390c9f
FC
2226Returns the packed sockaddr address of the other end of the SOCKET
2227connection.
a0d0e21e 2228
4633a7c4
LW
2229 use Socket;
2230 $hersockaddr = getpeername(SOCK);
19799a22 2231 ($port, $iaddr) = sockaddr_in($hersockaddr);
4633a7c4
LW
2232 $herhostname = gethostbyaddr($iaddr, AF_INET);
2233 $herstraddr = inet_ntoa($iaddr);
a0d0e21e
LW
2234
2235=item getpgrp PID
d74e8afc 2236X<getpgrp> X<group>
a0d0e21e 2237
47e29363 2238Returns the current process group for the specified PID. Use
7660c0ab 2239a PID of C<0> to get the current process group for the
4633a7c4 2240current process. Will raise an exception if used on a machine that
a3390c9f
FC
2241doesn't implement getpgrp(2). If PID is omitted, returns the process
2242group of the current process. Note that the POSIX version of C<getpgrp>
7660c0ab 2243does not accept a PID argument, so only C<PID==0> is truly portable.
a0d0e21e 2244
ea9eb35a
BJ
2245Portability issues: L<perlport/getpgrp>.
2246
a0d0e21e 2247=item getppid
d74e8afc 2248X<getppid> X<parent> X<pid>
a0d0e21e
LW
2249
2250Returns the process id of the parent process.
2251
4d76a344
RGS
2252Note for Linux users: on Linux, the C functions C<getpid()> and
2253C<getppid()> return different values from different threads. In order to
3b10bc60 2254be portable, this behavior is not reflected by the Perl-level function
4d76a344 2255C<getppid()>, that returns a consistent value across threads. If you want
e3256f86
RGS
2256to call the underlying C<getppid()>, you may use the CPAN module
2257C<Linux::Pid>.
4d76a344 2258
ea9eb35a
BJ
2259Portability issues: L<perlport/getppid>.
2260
a0d0e21e 2261=item getpriority WHICH,WHO
d74e8afc 2262X<getpriority> X<priority> X<nice>
a0d0e21e 2263
4633a7c4 2264Returns the current priority for a process, a process group, or a user.
01aa884e 2265(See L<getpriority(2)>.) Will raise a fatal exception if used on a
f86cebdf 2266machine that doesn't implement getpriority(2).
a0d0e21e 2267
ea9eb35a
BJ
2268Portability issues: L<perlport/getpriority>.
2269
a0d0e21e 2270=item getpwnam NAME
d74e8afc
ITB
2271X<getpwnam> X<getgrnam> X<gethostbyname> X<getnetbyname> X<getprotobyname>
2272X<getpwuid> X<getgrgid> X<getservbyname> X<gethostbyaddr> X<getnetbyaddr>
2273X<getprotobynumber> X<getservbyport> X<getpwent> X<getgrent> X<gethostent>
2274X<getnetent> X<getprotoent> X<getservent> X<setpwent> X<setgrent> X<sethostent>
2275X<setnetent> X<setprotoent> X<setservent> X<endpwent> X<endgrent> X<endhostent>
2276X<endnetent> X<endprotoent> X<endservent>
a0d0e21e
LW
2277
2278=item getgrnam NAME
2279
2280=item gethostbyname NAME
2281
2282=item getnetbyname NAME
2283
2284=item getprotobyname NAME
2285
2286=item getpwuid UID
2287
2288=item getgrgid GID
2289
2290=item getservbyname NAME,PROTO
2291
2292=item gethostbyaddr ADDR,ADDRTYPE
2293
2294=item getnetbyaddr ADDR,ADDRTYPE
2295
2296=item getprotobynumber NUMBER
2297
2298=item getservbyport PORT,PROTO
2299
2300=item getpwent
2301
2302=item getgrent
2303
2304=item gethostent
2305
2306=item getnetent
2307
2308=item getprotoent
2309
2310=item getservent
2311
2312=item setpwent
2313
2314=item setgrent
2315
2316=item sethostent STAYOPEN
2317
2318=item setnetent STAYOPEN
2319
2320=item setprotoent STAYOPEN
2321
2322=item setservent STAYOPEN
2323
2324=item endpwent
2325
2326=item endgrent
2327
2328=item endhostent
2329
2330=item endnetent
2331
2332=item endprotoent
2333
2334=item endservent
2335
80d38338
TC
2336These routines are the same as their counterparts in the
2337system C library. In list context, the return values from the
a0d0e21e
LW
2338various get routines are as follows:
2339
2340 ($name,$passwd,$uid,$gid,
6ee623d5 2341 $quota,$comment,$gcos,$dir,$shell,$expire) = getpw*
a0d0e21e
LW
2342 ($name,$passwd,$gid,$members) = getgr*
2343 ($name,$aliases,$addrtype,$length,@addrs) = gethost*
2344 ($name,$aliases,$addrtype,$net) = getnet*
2345 ($name,$aliases,$proto) = getproto*
2346 ($name,$aliases,$port,$proto) = getserv*
2347
3b10bc60 2348(If the entry doesn't exist you get an empty list.)
a0d0e21e 2349
4602f195
JH
2350The exact meaning of the $gcos field varies but it usually contains
2351the real name of the user (as opposed to the login name) and other
2352information pertaining to the user. Beware, however, that in many
2353system users are able to change this information and therefore it
106325ad 2354cannot be trusted and therefore the $gcos is tainted (see
2959b6e3 2355L<perlsec>). The $passwd and $shell, user's encrypted password and
a3390c9f 2356login shell, are also tainted, for the same reason.
4602f195 2357
5a964f20 2358In scalar context, you get the name, unless the function was a
a0d0e21e
LW
2359lookup by name, in which case you get the other thing, whatever it is.
2360(If the entry doesn't exist you get the undefined value.) For example:
2361
5a964f20
TC
2362 $uid = getpwnam($name);
2363 $name = getpwuid($num);
2364 $name = getpwent();
2365 $gid = getgrnam($name);
08a33e13 2366 $name = getgrgid($num);
5a964f20
TC
2367 $name = getgrent();
2368 #etc.
a0d0e21e 2369
4602f195 2370In I<getpw*()> the fields $quota, $comment, and $expire are special
80d38338 2371in that they are unsupported on many systems. If the
4602f195
JH
2372$quota is unsupported, it is an empty scalar. If it is supported, it
2373usually encodes the disk quota. If the $comment field is unsupported,
2374it is an empty scalar. If it is supported it usually encodes some
2375administrative comment about the user. In some systems the $quota
2376field may be $change or $age, fields that have to do with password
2377aging. In some systems the $comment field may be $class. The $expire
2378field, if present, encodes the expiration period of the account or the
2379password. For the availability and the exact meaning of these fields
8f1da26d 2380in your system, please consult getpwnam(3) and your system's
4602f195
JH
2381F<pwd.h> file. You can also find out from within Perl what your
2382$quota and $comment fields mean and whether you have the $expire field
2383by using the C<Config> module and the values C<d_pwquota>, C<d_pwage>,
2384C<d_pwchange>, C<d_pwcomment>, and C<d_pwexpire>. Shadow password
3b10bc60 2385files are supported only if your vendor has implemented them in the
4602f195 2386intuitive fashion that calling the regular C library routines gets the
5d3a0a3b 2387shadow versions if you're running under privilege or if there exists
cf264981 2388the shadow(3) functions as found in System V (this includes Solaris
a3390c9f 2389and Linux). Those systems that implement a proprietary shadow password
5d3a0a3b 2390facility are unlikely to be supported.
6ee623d5 2391
a3390c9f 2392The $members value returned by I<getgr*()> is a space-separated list of
a0d0e21e
LW
2393the login names of the members of the group.
2394
2395For the I<gethost*()> functions, if the C<h_errno> variable is supported in
2396C, it will be returned to you via C<$?> if the function call fails. The
3b10bc60 2397C<@addrs> value returned by a successful call is a list of raw
2398addresses returned by the corresponding library call. In the
2399Internet domain, each address is four bytes long; you can unpack it
a0d0e21e
LW
2400by saying something like:
2401
f337b084 2402 ($a,$b,$c,$d) = unpack('W4',$addr[0]);
a0d0e21e 2403
2b5ab1e7
TC
2404The Socket library makes this slightly easier:
2405
2406 use Socket;
2407 $iaddr = inet_aton("127.1"); # or whatever address
2408 $name = gethostbyaddr($iaddr, AF_INET);
2409
2410 # or going the other way
19799a22 2411 $straddr = inet_ntoa($iaddr);
2b5ab1e7 2412
d760c846
GS
2413In the opposite way, to resolve a hostname to the IP address
2414you can write this:
2415
2416 use Socket;
2417 $packed_ip = gethostbyname("www.perl.org");
2418 if (defined $packed_ip) {
2419 $ip_address = inet_ntoa($packed_ip);
2420 }
2421
b018eaf1 2422Make sure C<gethostbyname()> is called in SCALAR context and that
d760c846
GS
2423its return value is checked for definedness.
2424
0d043efa
FC
2425The C<getprotobynumber> function, even though it only takes one argument,
2426has the precedence of a list operator, so beware:
2427
2428 getprotobynumber $number eq 'icmp' # WRONG
2429 getprotobynumber($number eq 'icmp') # actually means this
2430 getprotobynumber($number) eq 'icmp' # better this way
2431
19799a22
GS
2432If you get tired of remembering which element of the return list
2433contains which return value, by-name interfaces are provided
2434in standard modules: C<File::stat>, C<Net::hostent>, C<Net::netent>,
2435C<Net::protoent>, C<Net::servent>, C<Time::gmtime>, C<Time::localtime>,
2436and C<User::grent>. These override the normal built-ins, supplying
2437versions that return objects with the appropriate names
2438for each field. For example:
5a964f20
TC
2439
2440 use File::stat;
2441 use User::pwent;
2442 $is_his = (stat($filename)->uid == pwent($whoever)->uid);
2443
a3390c9f 2444Even though it looks as though they're the same method calls (uid),
b76cc8ba 2445they aren't, because a C<File::stat> object is different from
19799a22 2446a C<User::pwent> object.
5a964f20 2447
ea9eb35a
BJ
2448Portability issues: L<perlport/getpwnam> to L<perlport/endservent>.
2449
a0d0e21e 2450=item getsockname SOCKET
d74e8afc 2451X<getsockname>
a0d0e21e 2452
19799a22
GS
2453Returns the packed sockaddr address of this end of the SOCKET connection,
2454in case you don't know the address because you have several different
2455IPs that the connection might have come in on.
a0d0e21e 2456
4633a7c4
LW
2457 use Socket;
2458 $mysockaddr = getsockname(SOCK);
19799a22 2459 ($port, $myaddr) = sockaddr_in($mysockaddr);
b76cc8ba 2460 printf "Connect to %s [%s]\n",
19799a22
GS
2461 scalar gethostbyaddr($myaddr, AF_INET),
2462 inet_ntoa($myaddr);
a0d0e21e
LW
2463
2464=item getsockopt SOCKET,LEVEL,OPTNAME
d74e8afc 2465X<getsockopt>
a0d0e21e 2466
636e6b1f
TH
2467Queries the option named OPTNAME associated with SOCKET at a given LEVEL.
2468Options may exist at multiple protocol levels depending on the socket
2469type, but at least the uppermost socket level SOL_SOCKET (defined in the
2470C<Socket> module) will exist. To query options at another level the
2471protocol number of the appropriate protocol controlling the option
2472should be supplied. For example, to indicate that an option is to be
2473interpreted by the TCP protocol, LEVEL should be set to the protocol
80d38338 2474number of TCP, which you can get using C<getprotobyname>.
636e6b1f 2475
80d38338 2476The function returns a packed string representing the requested socket
3b10bc60 2477option, or C<undef> on error, with the reason for the error placed in
a3390c9f 2478C<$!>. Just what is in the packed string depends on LEVEL and OPTNAME;
80d38338
TC
2479consult getsockopt(2) for details. A common case is that the option is an
2480integer, in which case the result is a packed integer, which you can decode
2481using C<unpack> with the C<i> (or C<I>) format.
636e6b1f 2482
8f1da26d 2483Here's an example to test whether Nagle's algorithm is enabled on a socket:
636e6b1f 2484
4852725b 2485 use Socket qw(:all);
636e6b1f
TH
2486
2487 defined(my $tcp = getprotobyname("tcp"))
a9a5a0dc 2488 or die "Could not determine the protocol number for tcp";
4852725b
DD
2489 # my $tcp = IPPROTO_TCP; # Alternative
2490 my $packed = getsockopt($socket, $tcp, TCP_NODELAY)
80d38338 2491 or die "getsockopt TCP_NODELAY: $!";
636e6b1f
TH
2492 my $nodelay = unpack("I", $packed);
2493 print "Nagle's algorithm is turned ", $nodelay ? "off\n" : "on\n";
2494
ea9eb35a 2495Portability issues: L<perlport/getsockopt>.
a0d0e21e 2496
15a348aa
NC
2497=item given EXPR BLOCK
2498X<given>
2499
2500=item given BLOCK
2501
2502C<given> is analogous to the C<switch> keyword in other languages. C<given>
2503and C<when> are used in Perl to implement C<switch>/C<case> like statements.
8f1da26d 2504Only available after Perl 5.10. For example:
15a348aa 2505
8f1da26d 2506 use v5.10;
15a348aa
NC
2507 given ($fruit) {
2508 when (/apples?/) {
2509 print "I like apples."
2510 }
2511 when (/oranges?/) {
2512 print "I don't like oranges."
2513 }
2514 default {
2515 print "I don't like anything"
2516 }
2517 }
2518
2519See L<perlsyn/"Switch statements"> for detailed information.
2520
a0d0e21e 2521=item glob EXPR
d74e8afc 2522X<glob> X<wildcard> X<filename, expansion> X<expand>
a0d0e21e 2523
0a753a76 2524=item glob
2525
d9a9d457
JL
2526In list context, returns a (possibly empty) list of filename expansions on
2527the value of EXPR such as the standard Unix shell F</bin/csh> would do. In
2528scalar context, glob iterates through such filename expansions, returning
2529undef when the list is exhausted. This is the internal function
2530implementing the C<< <*.c> >> operator, but you can use it directly. If
2531EXPR is omitted, C<$_> is used. The C<< <*.c> >> operator is discussed in
2532more detail in L<perlop/"I/O Operators">.
a0d0e21e 2533
80d38338
TC
2534Note that C<glob> splits its arguments on whitespace and treats
2535each segment as separate pattern. As such, C<glob("*.c *.h")>
2536matches all files with a F<.c> or F<.h> extension. The expression
b474a1b1 2537C<glob(".* *")> matches all files in the current working directory.
80d38338
TC
2538
2539If non-empty braces are the only wildcard characters used in the
2540C<glob>, no filenames are matched, but potentially many strings
2541are returned. For example, this produces nine strings, one for
2542each pairing of fruits and colors:
2543
2544 @many = glob "{apple,tomato,cherry}={green,yellow,red}";
5c0c9249 2545
3a4b19e4 2546Beginning with v5.6.0, this operator is implemented using the standard
5c0c9249
PF
2547C<File::Glob> extension. See L<File::Glob> for details, including
2548C<bsd_glob> which does not treat whitespace as a pattern separator.
3a4b19e4 2549
ea9eb35a
BJ
2550Portability issues: L<perlport/glob>.
2551
a0d0e21e 2552=item gmtime EXPR
d74e8afc 2553X<gmtime> X<UTC> X<Greenwich>
a0d0e21e 2554
ce2984c3
PF
2555=item gmtime
2556
4509d391 2557Works just like L</localtime> but the returned values are
435fbc73 2558localized for the standard Greenwich time zone.
a0d0e21e 2559
a3390c9f
FC
2560Note: When called in list context, $isdst, the last value
2561returned by gmtime, is always C<0>. There is no
435fbc73 2562Daylight Saving Time in GMT.
0a753a76 2563
ea9eb35a 2564Portability issues: L<perlport/gmtime>.
62aa5637 2565
a0d0e21e 2566=item goto LABEL
d74e8afc 2567X<goto> X<jump> X<jmp>
a0d0e21e 2568
748a9306
LW
2569=item goto EXPR
2570
a0d0e21e
LW
2571=item goto &NAME
2572
b500e03b
GG
2573The C<goto-LABEL> form finds the statement labeled with LABEL and
2574resumes execution there. It can't be used to get out of a block or
2575subroutine given to C<sort>. It can be used to go almost anywhere
2576else within the dynamic scope, including out of subroutines, but it's
2577usually better to use some other construct such as C<last> or C<die>.
2578The author of Perl has never felt the need to use this form of C<goto>
3b10bc60 2579(in Perl, that is; C is another matter). (The difference is that C
b500e03b
GG
2580does not offer named loops combined with loop control. Perl does, and
2581this replaces most structured uses of C<goto> in other languages.)
a0d0e21e 2582
7660c0ab
A
2583The C<goto-EXPR> form expects a label name, whose scope will be resolved
2584dynamically. This allows for computed C<goto>s per FORTRAN, but isn't
748a9306
LW
2585necessarily recommended if you're optimizing for maintainability:
2586
2587 goto ("FOO", "BAR", "GLARCH")[$i];
2588
887d89fd
FC
2589As shown in this example, C<goto-EXPR> is exempt from the "looks like a
2590function" rule. A pair of parentheses following it does not (necessarily)
2591delimit its argument. C<goto("NE")."XT"> is equivalent to C<goto NEXT>.
2592
b500e03b 2593Use of C<goto-LABEL> or C<goto-EXPR> to jump into a construct is
0b98bec9 2594deprecated and will issue a warning. Even then, it may not be used to
b500e03b
GG
2595go into any construct that requires initialization, such as a
2596subroutine or a C<foreach> loop. It also can't be used to go into a
0b98bec9 2597construct that is optimized away.
b500e03b 2598
1b6921cb
BT
2599The C<goto-&NAME> form is quite different from the other forms of
2600C<goto>. In fact, it isn't a goto in the normal sense at all, and
2601doesn't have the stigma associated with other gotos. Instead, it
2602exits the current subroutine (losing any changes set by local()) and
2603immediately calls in its place the named subroutine using the current
2604value of @_. This is used by C<AUTOLOAD> subroutines that wish to
2605load another subroutine and then pretend that the other subroutine had
2606been called in the first place (except that any modifications to C<@_>
6cb9131c
GS
2607in the current subroutine are propagated to the other subroutine.)
2608After the C<goto>, not even C<caller> will be able to tell that this
2609routine was called first.
2610
2611NAME needn't be the name of a subroutine; it can be a scalar variable
8f1da26d 2612containing a code reference or a block that evaluates to a code
6cb9131c 2613reference.
a0d0e21e
LW
2614
2615=item grep BLOCK LIST
d74e8afc 2616X<grep>
a0d0e21e
LW
2617
2618=item grep EXPR,LIST
2619
2b5ab1e7
TC
2620This is similar in spirit to, but not the same as, grep(1) and its
2621relatives. In particular, it is not limited to using regular expressions.
2f9daede 2622
a0d0e21e 2623Evaluates the BLOCK or EXPR for each element of LIST (locally setting
7660c0ab 2624C<$_> to each element) and returns the list value consisting of those
19799a22
GS
2625elements for which the expression evaluated to true. In scalar
2626context, returns the number of times the expression was true.
a0d0e21e
LW
2627
2628 @foo = grep(!/^#/, @bar); # weed out comments
2629
2630or equivalently,
2631
2632 @foo = grep {!/^#/} @bar; # weed out comments
2633
be3174d2
GS
2634Note that C<$_> is an alias to the list value, so it can be used to
2635modify the elements of the LIST. While this is useful and supported,
2636it can cause bizarre results if the elements of LIST are not variables.
2b5ab1e7
TC
2637Similarly, grep returns aliases into the original list, much as a for
2638loop's index variable aliases the list elements. That is, modifying an
19799a22
GS
2639element of a list returned by grep (for example, in a C<foreach>, C<map>
2640or another C<grep>) actually modifies the element in the original list.
2b5ab1e7 2641This is usually something to be avoided when writing clear code.
a0d0e21e 2642
a4fb8298 2643If C<$_> is lexical in the scope where the C<grep> appears (because it has
cf264981 2644been declared with C<my $_>) then, in addition to being locally aliased to
80d38338 2645the list elements, C<$_> keeps being lexical inside the block; i.e., it
a4fb8298
RGS
2646can't be seen from the outside, avoiding any potential side-effects.
2647
19799a22 2648See also L</map> for a list composed of the results of the BLOCK or EXPR.
38325410 2649
a0d0e21e 2650=item hex EXPR
d74e8afc 2651X<hex> X<hexadecimal>
a0d0e21e 2652
54310121 2653=item hex
bbce6d69 2654
2b5ab1e7 2655Interprets EXPR as a hex string and returns the corresponding value.
38366c11 2656(To convert strings that might start with either C<0>, C<0x>, or C<0b>, see
2b5ab1e7 2657L</oct>.) If EXPR is omitted, uses C<$_>.
2f9daede
TP
2658
2659 print hex '0xAf'; # prints '175'
2660 print hex 'aF'; # same
a0d0e21e 2661
19799a22 2662Hex strings may only represent integers. Strings that would cause
53305cf1 2663integer overflow trigger a warning. Leading whitespace is not stripped,
38366c11 2664unlike oct(). To present something as hex, look into L</printf>,
8f1da26d 2665L</sprintf>, and L</unpack>.
19799a22 2666
ce2984c3 2667=item import LIST
d74e8afc 2668X<import>
a0d0e21e 2669
19799a22 2670There is no builtin C<import> function. It is just an ordinary
4633a7c4 2671method (subroutine) defined (or inherited) by modules that wish to export
19799a22 2672names to another module. The C<use> function calls the C<import> method
cea6626f 2673for the package used. See also L</use>, L<perlmod>, and L<Exporter>.
a0d0e21e
LW
2674
2675=item index STR,SUBSTR,POSITION
d74e8afc 2676X<index> X<indexOf> X<InStr>
a0d0e21e
LW
2677
2678=item index STR,SUBSTR
2679
2b5ab1e7
TC
2680The index function searches for one string within another, but without
2681the wildcard-like behavior of a full regular-expression pattern match.
2682It returns the position of the first occurrence of SUBSTR in STR at
2683or after POSITION. If POSITION is omitted, starts searching from the
26f149de
YST
2684beginning of the string. POSITION before the beginning of the string
2685or after its end is treated as if it were the beginning or the end,
e1dccc0d
Z
2686respectively. POSITION and the return value are based at zero.
2687If the substring is not found, C<index> returns -1.
a0d0e21e
LW
2688
2689=item int EXPR
f723aae1 2690X<int> X<integer> X<truncate> X<trunc> X<floor>
a0d0e21e 2691
54310121 2692=item int
bbce6d69 2693
7660c0ab 2694Returns the integer portion of EXPR. If EXPR is omitted, uses C<$_>.
2b5ab1e7 2695You should not use this function for rounding: one because it truncates
3b10bc60 2696towards C<0>, and two because machine representations of floating-point
2b5ab1e7
TC
2697numbers can sometimes produce counterintuitive results. For example,
2698C<int(-6.725/0.025)> produces -268 rather than the correct -269; that's
2699because it's really more like -268.99999999999994315658 instead. Usually,
19799a22 2700the C<sprintf>, C<printf>, or the C<POSIX::floor> and C<POSIX::ceil>
2b5ab1e7 2701functions will serve you better than will int().
a0d0e21e
LW
2702
2703=item ioctl FILEHANDLE,FUNCTION,SCALAR
d74e8afc 2704X<ioctl>
a0d0e21e 2705
2b5ab1e7 2706Implements the ioctl(2) function. You'll probably first have to say
a0d0e21e 2707
5ed4f2ec 2708 require "sys/ioctl.ph"; # probably in $Config{archlib}/sys/ioctl.ph
a0d0e21e 2709
a11c483f 2710to get the correct function definitions. If F<sys/ioctl.ph> doesn't
a0d0e21e 2711exist or doesn't have the correct definitions you'll have to roll your
61eff3bc 2712own, based on your C header files such as F<< <sys/ioctl.h> >>.
5a964f20 2713(There is a Perl script called B<h2ph> that comes with the Perl kit that
54310121 2714may help you in this, but it's nontrivial.) SCALAR will be read and/or
3b10bc60 2715written depending on the FUNCTION; a C pointer to the string value of SCALAR
19799a22 2716will be passed as the third argument of the actual C<ioctl> call. (If SCALAR
4633a7c4
LW
2717has no string value but does have a numeric value, that value will be
2718passed rather than a pointer to the string value. To guarantee this to be
19799a22
GS
2719true, add a C<0> to the scalar before using it.) The C<pack> and C<unpack>
2720functions may be needed to manipulate the values of structures used by
b76cc8ba 2721C<ioctl>.
a0d0e21e 2722
19799a22 2723The return value of C<ioctl> (and C<fcntl>) is as follows:
a0d0e21e 2724
5ed4f2ec 2725 if OS returns: then Perl returns:
2726 -1 undefined value
2727 0 string "0 but true"
2728 anything else that number
a0d0e21e 2729
19799a22 2730Thus Perl returns true on success and false on failure, yet you can
a0d0e21e
LW
2731still easily determine the actual value returned by the operating
2732system:
2733
2b5ab1e7 2734 $retval = ioctl(...) || -1;
a0d0e21e
LW
2735 printf "System returned %d\n", $retval;
2736
be2f7487 2737The special string C<"0 but true"> is exempt from B<-w> complaints
5a964f20
TC
2738about improper numeric conversions.
2739
ea9eb35a
BJ
2740Portability issues: L<perlport/ioctl>.
2741
a0d0e21e 2742=item join EXPR,LIST
d74e8afc 2743X<join>
a0d0e21e 2744
2b5ab1e7
TC
2745Joins the separate strings of LIST into a single string with fields
2746separated by the value of EXPR, and returns that new string. Example:
a0d0e21e 2747
2b5ab1e7 2748 $rec = join(':', $login,$passwd,$uid,$gid,$gcos,$home,$shell);
a0d0e21e 2749
eb6e2d6f
GS
2750Beware that unlike C<split>, C<join> doesn't take a pattern as its
2751first argument. Compare L</split>.
a0d0e21e 2752
532eee96 2753=item keys HASH
d74e8afc 2754X<keys> X<key>
aa689395 2755
532eee96 2756=item keys ARRAY
aeedbbed 2757
f5a93a43
TC
2758=item keys EXPR
2759
aeedbbed
NC
2760Returns a list consisting of all the keys of the named hash, or the indices
2761of an array. (In scalar context, returns the number of keys or indices.)
504f80c1 2762
aeedbbed 2763The keys of a hash are returned in an apparently random order. The actual
3b10bc60 2764random order is subject to change in future versions of Perl, but it
504f80c1 2765is guaranteed to be the same order as either the C<values> or C<each>
4546b9e6 2766function produces (given that the hash has not been modified). Since
c5f61d2f 2767Perl 5.8.1 the ordering can be different even between different runs of
4546b9e6 2768Perl for security reasons (see L<perlsec/"Algorithmic Complexity
d6df3700 2769Attacks">).
504f80c1 2770
8f1da26d 2771As a side effect, calling keys() resets the internal interator of the HASH or ARRAY
cf264981
SP
2772(see L</each>). In particular, calling keys() in void context resets
2773the iterator with no other overhead.
a0d0e21e 2774
aa689395 2775Here is yet another way to print your environment:
a0d0e21e
LW
2776
2777 @keys = keys %ENV;
2778 @values = values %ENV;
b76cc8ba 2779 while (@keys) {
a9a5a0dc 2780 print pop(@keys), '=', pop(@values), "\n";
a0d0e21e
LW
2781 }
2782
2783or how about sorted by key:
2784
2785 foreach $key (sort(keys %ENV)) {
a9a5a0dc 2786 print $key, '=', $ENV{$key}, "\n";
a0d0e21e
LW
2787 }
2788
8ea1e5d4
GS
2789The returned values are copies of the original keys in the hash, so
2790modifying them will not affect the original hash. Compare L</values>.
2791
19799a22 2792To sort a hash by value, you'll need to use a C<sort> function.
aa689395 2793Here's a descending numeric sort of a hash by its values:
4633a7c4 2794
5a964f20 2795 foreach $key (sort { $hash{$b} <=> $hash{$a} } keys %hash) {
a9a5a0dc 2796 printf "%4d %s\n", $hash{$key}, $key;
4633a7c4
LW
2797 }
2798
3b10bc60 2799Used as an lvalue, C<keys> allows you to increase the number of hash buckets
aa689395 2800allocated for the given hash. This can gain you a measure of efficiency if
2801you know the hash is going to get big. (This is similar to pre-extending
2802an array by assigning a larger number to $#array.) If you say
55497cff 2803
2804 keys %hash = 200;
2805
ab192400
GS
2806then C<%hash> will have at least 200 buckets allocated for it--256 of them,
2807in fact, since it rounds up to the next power of two. These
55497cff 2808buckets will be retained even if you do C<%hash = ()>, use C<undef
2809%hash> if you want to free the storage while C<%hash> is still in scope.
2810You can't shrink the number of buckets allocated for the hash using
19799a22 2811C<keys> in this way (but you needn't worry about doing this by accident,
aeedbbed
NC
2812as trying has no effect). C<keys @array> in an lvalue context is a syntax
2813error.
55497cff 2814
f5a93a43
TC
2815Starting with Perl 5.14, C<keys> can take a scalar EXPR, which must contain
2816a reference to an unblessed hash or array. The argument will be
2817dereferenced automatically. This aspect of C<keys> is considered highly
2818experimental. The exact behaviour may change in a future version of Perl.
cba5a3b0
DG
2819
2820 for (keys $hashref) { ... }
2821 for (keys $obj->get_arrayref) { ... }
2822
8f1da26d 2823See also C<each>, C<values>, and C<sort>.
ab192400 2824
b350dd2f 2825=item kill SIGNAL, LIST
d74e8afc 2826X<kill> X<signal>
a0d0e21e 2827
b350dd2f 2828Sends a signal to a list of processes. Returns the number of
517db077
GS
2829processes successfully signaled (which is not necessarily the
2830same as the number actually killed).
a0d0e21e
LW
2831
2832 $cnt = kill 1, $child1, $child2;
2833 kill 9, @goners;
2834
3b10bc60 2835If SIGNAL is zero, no signal is sent to the process, but C<kill>
2836checks whether it's I<possible> to send a signal to it (that
70fb64f6 2837means, to be brief, that the process is owned by the same user, or we are
3b10bc60 2838the super-user). This is useful to check that a child process is still
81fd35db
DN
2839alive (even if only as a zombie) and hasn't changed its UID. See
2840L<perlport> for notes on the portability of this construct.
b350dd2f 2841
e2c0f81f
DG
2842Unlike in the shell, if SIGNAL is negative, it kills process groups instead
2843of processes. That means you usually want to use positive not negative signals.
2844You may also use a signal name in quotes.
2845
2846The behavior of kill when a I<PROCESS> number is zero or negative depends on
2847the operating system. For example, on POSIX-conforming systems, zero will
2848signal the current process group and -1 will signal all processes.
1e9c1022
JL
2849
2850See L<perlipc/"Signals"> for more details.
a0d0e21e 2851
ea9eb35a
BJ
2852On some platforms such as Windows where the fork() system call is not available.
2853Perl can be built to emulate fork() at the interpreter level.
6d17f725 2854This emulation has limitations related to kill that have to be considered,
ea9eb35a
BJ
2855for code running on Windows and in code intended to be portable.
2856
2857See L<perlfork> for more details.
2858
2859Portability issues: L<perlport/kill>.
2860
a0d0e21e 2861=item last LABEL
d74e8afc 2862X<last> X<break>
a0d0e21e
LW
2863
2864=item last
2865
2866The C<last> command is like the C<break> statement in C (as used in
2867loops); it immediately exits the loop in question. If the LABEL is
2868omitted, the command refers to the innermost enclosing loop. The
2869C<continue> block, if any, is not executed:
2870
4633a7c4 2871 LINE: while (<STDIN>) {
a9a5a0dc
VP
2872 last LINE if /^$/; # exit when done with header
2873 #...
a0d0e21e
LW
2874 }
2875
80d38338 2876C<last> cannot be used to exit a block that returns a value such as
8f1da26d 2877C<eval {}>, C<sub {}>, or C<do {}>, and should not be used to exit
2b5ab1e7 2878a grep() or map() operation.
4968c1e4 2879
6c1372ed
GS
2880Note that a block by itself is semantically identical to a loop
2881that executes once. Thus C<last> can be used to effect an early
2882exit out of such a block.
2883
98293880
JH
2884See also L</continue> for an illustration of how C<last>, C<next>, and
2885C<redo> work.
1d2dff63 2886
a0d0e21e 2887=item lc EXPR
d74e8afc 2888X<lc> X<lowercase>
a0d0e21e 2889
54310121 2890=item lc
bbce6d69 2891
d1be9408 2892Returns a lowercased version of EXPR. This is the internal function
3980dc9c 2893implementing the C<\L> escape in double-quoted strings.
a0d0e21e 2894
7660c0ab 2895If EXPR is omitted, uses C<$_>.
bbce6d69 2896
3980dc9c
KW
2897What gets returned depends on several factors:
2898
2899=over
2900
2901=item If C<use bytes> is in effect:
2902
2903=over
2904
2905=item On EBCDIC platforms
2906
2907The results are what the C language system call C<tolower()> returns.
2908
2909=item On ASCII platforms
2910
2911The results follow ASCII semantics. Only characters C<A-Z> change, to C<a-z>
2912respectively.
2913
2914=back
2915
2916=item Otherwise, If EXPR has the UTF8 flag set
2917
5d1892be 2918Unicode semantics are used for the case change.
3980dc9c
KW
2919
2920=item Otherwise, if C<use locale> is in effect
2921
2922Respects current LC_CTYPE locale. See L<perllocale>.
2923
2924=item Otherwise, if C<use feature 'unicode_strings'> is in effect:
2925
5d1892be 2926Unicode semantics are used for the case change.
3980dc9c
KW
2927
2928=item Otherwise:
2929
2930=over
2931
2932=item On EBCDIC platforms
2933
2934The results are what the C language system call C<tolower()> returns.
2935
2936=item On ASCII platforms
2937
2938ASCII semantics are used for the case change. The lowercase of any character
2939outside the ASCII range is the character itself.
2940
2941=back
2942
2943=back
2944
a0d0e21e 2945=item lcfirst EXPR
d74e8afc 2946X<lcfirst> X<lowercase>
a0d0e21e 2947
54310121 2948=item lcfirst
bbce6d69 2949
ad0029c4
JH
2950Returns the value of EXPR with the first character lowercased. This
2951is the internal function implementing the C<\l> escape in
3980dc9c 2952double-quoted strings.
a0d0e21e 2953
7660c0ab 2954If EXPR is omitted, uses C<$_>.
bbce6d69 2955
15dbbbab 2956This function behaves the same way under various pragmata, such as in a locale,
3980dc9c
KW
2957as L</lc> does.
2958
a0d0e21e 2959=item length EXPR
d74e8afc 2960X<length> X<size>
a0d0e21e 2961
54310121 2962=item length
bbce6d69 2963
974da8e5 2964Returns the length in I<characters> of the value of EXPR. If EXPR is
15dbbbab
FC
2965omitted, returns the length of C<$_>. If EXPR is undefined, returns
2966C<undef>.
3b10bc60 2967
2968This function cannot be used on an entire array or hash to find out how
2969many elements these have. For that, use C<scalar @array> and C<scalar keys
2970%hash>, respectively.
2971
2972Like all Perl character operations, length() normally deals in logical
2973characters, not physical bytes. For how many bytes a string encoded as
2974UTF-8 would take up, use C<length(Encode::encode_utf8(EXPR))> (you'll have
2975to C<use Encode> first). See L<Encode> and L<perlunicode>.
974da8e5 2976
cfa52385
FC
2977=item __LINE__
2978X<__LINE__>
2979
2980A special token that compiles to the current line number.
2981
a0d0e21e 2982=item link OLDFILE,NEWFILE
d74e8afc 2983X<link>
a0d0e21e 2984
19799a22 2985Creates a new filename linked to the old filename. Returns true for
b76cc8ba 2986success, false otherwise.
a0d0e21e 2987
ea9eb35a
BJ
2988Portability issues: L<perlport/link>.
2989
a0d0e21e 2990=item listen SOCKET,QUEUESIZE
d74e8afc 2991X<listen>
a0d0e21e 2992
3b10bc60 2993Does the same thing that the listen(2) system call does. Returns true if
b76cc8ba 2994it succeeded, false otherwise. See the example in
cea6626f 2995L<perlipc/"Sockets: Client/Server Communication">.
a0d0e21e
LW
2996
2997=item local EXPR
d74e8afc 2998X<local>
a0d0e21e 2999
19799a22 3000You really probably want to be using C<my> instead, because C<local> isn't
b76cc8ba 3001what most people think of as "local". See
13a2d996 3002L<perlsub/"Private Variables via my()"> for details.
2b5ab1e7 3003
5a964f20
TC
3004A local modifies the listed variables to be local to the enclosing
3005block, file, or eval. If more than one value is listed, the list must
3006be placed in parentheses. See L<perlsub/"Temporary Values via local()">
3007for details, including issues with tied arrays and hashes.
a0d0e21e 3008
d361fafa
VP
3009The C<delete local EXPR> construct can also be used to localize the deletion
3010of array/hash elements to the current block.
3011See L<perlsub/"Localized deletion of elements of composite types">.
3012
a0d0e21e 3013=item localtime EXPR
435fbc73 3014X<localtime> X<ctime>
a0d0e21e 3015
ba053783
AL
3016=item localtime
3017
19799a22 3018Converts a time as returned by the time function to a 9-element list
5f05dabc 3019with the time analyzed for the local time zone. Typically used as
a0d0e21e
LW
3020follows:
3021
54310121 3022 # 0 1 2 3 4 5 6 7 8
a0d0e21e 3023 ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) =
ba053783 3024 localtime(time);
a0d0e21e 3025
8f1da26d 3026All list elements are numeric and come straight out of the C `struct
ba053783
AL
3027tm'. C<$sec>, C<$min>, and C<$hour> are the seconds, minutes, and hours
3028of the specified time.
48a26b3a 3029
8f1da26d
TC
3030C<$mday> is the day of the month and C<$mon> the month in
3031the range C<0..11>, with 0 indicating January and 11 indicating December.
ba053783 3032This makes it easy to get a month name from a list:
54310121 3033
ba053783
AL
3034 my @abbr = qw( Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec );
3035 print "$abbr[$mon] $mday";
3036 # $mon=9, $mday=18 gives "Oct 18"
abd75f24 3037
8f1da26d 3038C<$year> is the number of years since 1900, B<not> just the last two digits
ba053783 3039of the year. That is, C<$year> is C<123> in year 2023. The proper way
80d38338 3040to get a 4-digit year is simply:
abd75f24 3041
ba053783 3042 $year += 1900;
abd75f24 3043
435fbc73
GS
3044Otherwise you create non-Y2K-compliant programs--and you wouldn't want
3045to do that, would you?
3046
8f1da26d 3047To get the last two digits of the year (e.g., "01" in 2001) do:
ba053783
AL
3048
3049 $year = sprintf("%02d", $year % 100);
3050
3051C<$wday> is the day of the week, with 0 indicating Sunday and 3 indicating
3052Wednesday. C<$yday> is the day of the year, in the range C<0..364>
3053(or C<0..365> in leap years.)
3054
3055C<$isdst> is true if the specified time occurs during Daylight Saving
3056Time, false otherwise.
abd75f24 3057
e1998452 3058If EXPR is omitted, C<localtime()> uses the current time (as returned
e3176d09 3059by time(3)).
a0d0e21e 3060
48a26b3a 3061In scalar context, C<localtime()> returns the ctime(3) value:
a0d0e21e 3062
5f05dabc 3063 $now_string = localtime; # e.g., "Thu Oct 13 04:54:34 1994"
a0d0e21e 3064
15dbbbab 3065This scalar value is B<not> locale-dependent but is a Perl builtin. For GMT
fe86afc2 3066instead of local time use the L</gmtime> builtin. See also the
8f1da26d 3067C<Time::Local> module (for converting seconds, minutes, hours, and such back to
fe86afc2
NC
3068the integer value returned by time()), and the L<POSIX> module's strftime(3)
3069and mktime(3) functions.
3070
15dbbbab 3071To get somewhat similar but locale-dependent date strings, set up your
fe86afc2
NC
3072locale environment variables appropriately (please see L<perllocale>) and
3073try for example:
a3cb178b 3074
5a964f20 3075 use POSIX qw(strftime);
2b5ab1e7 3076 $now_string = strftime "%a %b %e %H:%M:%S %Y", localtime;
fe86afc2
NC
3077 # or for GMT formatted appropriately for your locale:
3078 $now_string = strftime "%a %b %e %H:%M:%S %Y", gmtime;
a3cb178b
GS
3079
3080Note that the C<%a> and C<%b>, the short forms of the day of the week
3081and the month of the year, may not necessarily be three characters wide.
a0d0e21e 3082
15dbbbab 3083The L<Time::gmtime> and L<Time::localtime> modules provide a convenient,
435fbc73
GS
3084by-name access mechanism to the gmtime() and localtime() functions,
3085respectively.
3086
3087For a comprehensive date and time representation look at the
3088L<DateTime> module on CPAN.
3089
ea9eb35a
BJ
3090Portability issues: L<perlport/localtime>.
3091
07698885 3092=item lock THING
d74e8afc 3093X<lock>
19799a22 3094
15dbbbab 3095This function places an advisory lock on a shared variable or referenced
03730085 3096object contained in I<THING> until the lock goes out of scope.
a6d5524e 3097
904028df 3098The value returned is the scalar itself, if the argument is a scalar, or a
f79aa60b 3099reference, if the argument is a hash, array or subroutine.
904028df 3100
f3a23afb 3101lock() is a "weak keyword" : this means that if you've defined a function
67408cae 3102by this name (before any calls to it), that function will be called
7b043ca5
RGS
3103instead. If you are not under C<use threads::shared> this does nothing.
3104See L<threads::shared>.
19799a22 3105
a0d0e21e 3106=item log EXPR
d74e8afc 3107X<log> X<logarithm> X<e> X<ln> X<base>
a0d0e21e 3108
54310121 3109=item log
bbce6d69 3110
2b5ab1e7 3111Returns the natural logarithm (base I<e>) of EXPR. If EXPR is omitted,
15dbbbab
FC
3112returns the log of C<$_>. To get the
3113log of another base, use basic algebra:
19799a22 3114The base-N log of a number is equal to the natural log of that number
2b5ab1e7
TC
3115divided by the natural log of N. For example:
3116
3117 sub log10 {
a9a5a0dc
VP
3118 my $n = shift;
3119 return log($n)/log(10);
b76cc8ba 3120 }
2b5ab1e7
TC
3121
3122See also L</exp> for the inverse operation.
a0d0e21e 3123
7ded94be 3124=item lstat FILEHANDLE
d74e8afc 3125X<lstat>
a0d0e21e 3126
7ded94be
FC
3127=item lstat EXPR
3128
3129=item lstat DIRHANDLE
3130
54310121 3131=item lstat
bbce6d69 3132
19799a22 3133Does the same thing as the C<stat> function (including setting the
5a964f20
TC
3134special C<_> filehandle) but stats a symbolic link instead of the file
3135the symbolic link points to. If symbolic links are unimplemented on
c837d5b4
DP
3136your system, a normal C<stat> is done. For much more detailed
3137information, please see the documentation for C<stat>.
a0d0e21e 3138
7660c0ab 3139If EXPR is omitted, stats C<$_>.
bbce6d69 3140
ea9eb35a
BJ
3141Portability issues: L<perlport/lstat>.
3142
a0d0e21e
LW
3143=item m//
3144
9f4b9cd0 3145The match operator. See L<perlop/"Regexp Quote-Like Operators">.
a0d0e21e
LW
3146
3147=item map BLOCK LIST
d74e8afc 3148X<map>
a0d0e21e
LW
3149
3150=item map EXPR,LIST
3151
19799a22
GS
3152Evaluates the BLOCK or EXPR for each element of LIST (locally setting
3153C<$_> to each element) and returns the list value composed of the
3154results of each such evaluation. In scalar context, returns the
3155total number of elements so generated. Evaluates BLOCK or EXPR in
3156list context, so each element of LIST may produce zero, one, or
3157more elements in the returned value.
dd99ebda 3158
f9476272 3159 @chars = map(chr, @numbers);
a0d0e21e 3160
f9476272
AH
3161translates a list of numbers to the corresponding characters.
3162
3163 my @squares = map { $_ * $_ } @numbers;
3164
3165translates a list of numbers to their squared values.
3166
3167 my @squares = map { $_ > 5 ? ($_ * $_) : () } @numbers;
3168
3169shows that number of returned elements can differ from the number of
3170input elements. To omit an element, return an empty list ().
3171This could also be achieved by writing
3172
3173 my @squares = map { $_ * $_ } grep { $_ > 5 } @numbers;
3174
3175which makes the intention more clear.
3176
15dbbbab
FC
3177Map always returns a list, which can be
3178assigned to a hash such that the elements
f9476272 3179become key/value pairs. See L<perldata> for more details.
a0d0e21e 3180
d8216f19 3181 %hash = map { get_a_key_for($_) => $_ } @array;
a0d0e21e
LW
3182
3183is just a funny way to write
3184
3185 %hash = ();
d8216f19 3186 foreach (@array) {
a9a5a0dc 3187 $hash{get_a_key_for($_)} = $_;
a0d0e21e
LW
3188 }
3189
be3174d2
GS
3190Note that C<$_> is an alias to the list value, so it can be used to
3191modify the elements of the LIST. While this is useful and supported,
3192it can cause bizarre results if the elements of LIST are not variables.
2b5ab1e7
TC
3193Using a regular C<foreach> loop for this purpose would be clearer in
3194most cases. See also L</grep> for an array composed of those items of
3195the original list for which the BLOCK or EXPR evaluates to true.
fb73857a 3196
a4fb8298 3197If C<$_> is lexical in the scope where the C<map> appears (because it has
d8216f19
RGS
3198been declared with C<my $_>), then, in addition to being locally aliased to
3199the list elements, C<$_> keeps being lexical inside the block; that is, it
a4fb8298
RGS
3200can't be seen from the outside, avoiding any potential side-effects.
3201
205fdb4d 3202C<{> starts both hash references and blocks, so C<map { ...> could be either
3b10bc60 3203the start of map BLOCK LIST or map EXPR, LIST. Because Perl doesn't look
80d38338
TC
3204ahead for the closing C<}> it has to take a guess at which it's dealing with
3205based on what it finds just after the C<{>. Usually it gets it right, but if it
205fdb4d
NC
3206doesn't it won't realize something is wrong until it gets to the C<}> and
3207encounters the missing (or unexpected) comma. The syntax error will be
80d38338 3208reported close to the C<}>, but you'll need to change something near the C<{>
3b10bc60 3209such as using a unary C<+> to give Perl some help:
205fdb4d 3210
3b10bc60 3211 %hash = map { "\L$_" => 1 } @array # perl guesses EXPR. wrong
3212 %hash = map { +"\L$_" => 1 } @array # perl guesses BLOCK. right
3213 %hash = map { ("\L$_" => 1) } @array # this also works
3214 %hash = map { lc($_) => 1 } @array # as does this.
3215 %hash = map +( lc($_) => 1 ), @array # this is EXPR and works!
cea6626f 3216
3b10bc60 3217 %hash = map ( lc($_), 1 ), @array # evaluates to (1, @array)
205fdb4d 3218
d8216f19 3219or to force an anon hash constructor use C<+{>:
205fdb4d 3220
3b10bc60 3221 @hashes = map +{ lc($_) => 1 }, @array # EXPR, so needs comma at end
205fdb4d 3222
3b10bc60 3223to get a list of anonymous hashes each with only one entry apiece.
205fdb4d 3224
19799a22 3225=item mkdir FILENAME,MASK
d74e8afc 3226X<mkdir> X<md> X<directory, create>
a0d0e21e 3227
5a211162
GS
3228=item mkdir FILENAME
3229
491873e5
RGS
3230=item mkdir
3231
0591cd52 3232Creates the directory specified by FILENAME, with permissions
19799a22 3233specified by MASK (as modified by C<umask>). If it succeeds it
8f1da26d
TC
3234returns true; otherwise it returns false and sets C<$!> (errno).
3235MASK defaults to 0777 if omitted, and FILENAME defaults
3236to C<$_> if omitted.
0591cd52 3237
8f1da26d
TC
3238In general, it is better to create directories with a permissive MASK
3239and let the user modify that with their C<umask> than it is to supply
19799a22 3240a restrictive MASK and give the user no way to be more permissive.
0591cd52
NT
3241The exceptions to this rule are when the file or directory should be
3242kept private (mail files, for instance). The perlfunc(1) entry on
19799a22 3243C<umask> discusses the choice of MASK in more detail.
a0d0e21e 3244
cc1852e8
JH
3245Note that according to the POSIX 1003.1-1996 the FILENAME may have any
3246number of trailing slashes. Some operating and filesystems do not get
3247this right, so Perl automatically removes all trailing slashes to keep
3248everyone happy.
3249
80d38338 3250To recursively create a directory structure, look at
dd184578
RGS
3251the C<mkpath> function of the L<File::Path> module.
3252
a0d0e21e 3253=item msgctl ID,CMD,ARG
d74e8afc 3254X<msgctl>
a0d0e21e 3255
f86cebdf 3256Calls the System V IPC function msgctl(2). You'll probably have to say
0ade1984
JH
3257
3258 use IPC::SysV;
3259
7660c0ab 3260first to get the correct constant definitions. If CMD is C<IPC_STAT>,
cf264981 3261then ARG must be a variable that will hold the returned C<msqid_ds>
951ba7fe
GS
3262structure. Returns like C<ioctl>: the undefined value for error,
3263C<"0 but true"> for zero, or the actual return value otherwise. See also
15dbbbab
FC
3264L<perlipc/"SysV IPC"> and the documentation for C<IPC::SysV> and
3265C<IPC::Semaphore>.
a0d0e21e 3266
ea9eb35a
BJ
3267Portability issues: L<perlport/msgctl>.
3268
a0d0e21e 3269=item msgget KEY,FLAGS
d74e8afc 3270X<msgget>
a0d0e21e 3271
f86cebdf 3272Calls the System V IPC function msgget(2). Returns the message queue
8f1da26d 3273id, or C<undef> on error. See also
15dbbbab
FC
3274L<perlipc/"SysV IPC"> and the documentation for C<IPC::SysV> and
3275C<IPC::Msg>.
a0d0e21e 3276
ea9eb35a
BJ
3277Portability issues: L<perlport/msgget>.
3278
a0d0e21e 3279=item msgrcv ID,VAR,SIZE,TYPE,FLAGS
d74e8afc 3280X<msgrcv>
a0d0e21e
LW
3281
3282Calls the System V IPC function msgrcv to receive a message from
3283message queue ID into variable VAR with a maximum message size of
41d6edb2
JH
3284SIZE. Note that when a message is received, the message type as a
3285native long integer will be the first thing in VAR, followed by the
3286actual message. This packing may be opened with C<unpack("l! a*")>.
8f1da26d
TC
3287Taints the variable. Returns true if successful, false
3288on error. See also L<perlipc/"SysV IPC"> and the documentation for
15dbbbab 3289C<IPC::SysV> and C<IPC::SysV::Msg>.
41d6edb2 3290
ea9eb35a
BJ
3291Portability issues: L<perlport/msgrcv>.
3292
41d6edb2 3293=item msgsnd ID,MSG,FLAGS
d74e8afc 3294X<msgsnd>
41d6edb2
JH
3295
3296Calls the System V IPC function msgsnd to send the message MSG to the
3297message queue ID. MSG must begin with the native long integer message
8f1da26d 3298type, be followed by the length of the actual message, and then finally
41d6edb2
JH
3299the message itself. This kind of packing can be achieved with
3300C<pack("l! a*", $type, $message)>. Returns true if successful,
8f1da26d 3301false on error. See also the C<IPC::SysV>
41d6edb2 3302and C<IPC::SysV::Msg> documentation.
a0d0e21e 3303
ea9eb35a
BJ
3304Portability issues: L<perlport/msgsnd>.
3305
a0d0e21e 3306=item my EXPR
d74e8afc 3307X<my>
a0d0e21e 3308
307ea6df
JH
3309=item my TYPE EXPR
3310
1d2de774 3311=item my EXPR : ATTRS
09bef843 3312
1d2de774 3313=item my TYPE EXPR : ATTRS
307ea6df 3314
19799a22 3315A C<my> declares the listed variables to be local (lexically) to the
1d2de774
JH
3316enclosing block, file, or C<eval>. If more than one value is listed,
3317the list must be placed in parentheses.
307ea6df 3318
1d2de774 3319The exact semantics and interface of TYPE and ATTRS are still
15dbbbab 3320evolving. TYPE is currently bound to the use of the C<fields> pragma,
307ea6df
JH
3321and attributes are handled using the C<attributes> pragma, or starting
3322from Perl 5.8.0 also via the C<Attribute::Handlers> module. See
3323L<perlsub/"Private Variables via my()"> for details, and L<fields>,
3324L<attributes>, and L<Attribute::Handlers>.
4633a7c4 3325
a0d0e21e 3326=item next LABEL
d74e8afc 3327X<next> X<continue>
a0d0e21e
LW
3328
3329=item next
3330
3331The C<next> command is like the C<continue> statement in C; it starts
3332the next iteration of the loop:
3333
4633a7c4 3334 LINE: while (<STDIN>) {
a9a5a0dc
VP
3335 next LINE if /^#/; # discard comments
3336 #...
a0d0e21e
LW
3337 }
3338
3339Note that if there were a C<continue> block on the above, it would get
3b10bc60 3340executed even on discarded lines. If LABEL is omitted, the command
a0d0e21e
LW
3341refers to the innermost enclosing loop.
3342
4968c1e4 3343C<next> cannot be used to exit a block which returns a value such as
8f1da26d 3344C<eval {}>, C<sub {}>, or C<do {}>, and should not be used to exit
2b5ab1e7 3345a grep() or map() operation.
4968c1e4 3346
6c1372ed
GS
3347Note that a block by itself is semantically identical to a loop
3348that executes once. Thus C<next> will exit such a block early.
3349
98293880
JH
3350See also L</continue> for an illustration of how C<last>, C<next>, and
3351C<redo> work.
1d2dff63 3352
3b10bc60 3353=item no MODULE VERSION LIST
3354X<no declarations>
3355X<unimporting>
4a66ea5a 3356
3b10bc60 3357=item no MODULE VERSION
4a66ea5a 3358
3b10bc60 3359=item no MODULE LIST
a0d0e21e 3360
3b10bc60 3361=item no MODULE
4a66ea5a 3362
c986422f
RGS
3363=item no VERSION
3364
593b9c14 3365See the C<use> function, of which C<no> is the opposite.
a0d0e21e
LW
3366
3367=item oct EXPR
d74e8afc 3368X<oct> X<octal> X<hex> X<hexadecimal> X<binary> X<bin>
a0d0e21e 3369
54310121 3370=item oct
bbce6d69 3371
4633a7c4 3372Interprets EXPR as an octal string and returns the corresponding
4f19785b
WSI
3373value. (If EXPR happens to start off with C<0x>, interprets it as a
3374hex string. If EXPR starts off with C<0b>, it is interpreted as a
53305cf1 3375binary string. Leading whitespace is ignored in all three cases.)
3b10bc60 3376The following will handle decimal, binary, octal, and hex in standard
3377Perl notation:
a0d0e21e
LW
3378
3379 $val = oct($val) if $val =~ /^0/;
3380
19799a22
GS
3381If EXPR is omitted, uses C<$_>. To go the other way (produce a number
3382in octal), use sprintf() or printf():
3383
3b10bc60 3384 $dec_perms = (stat("filename"))[2] & 07777;
3385 $oct_perm_str = sprintf "%o", $perms;
19799a22
GS
3386
3387The oct() function is commonly used when a string such as C<644> needs
3b10bc60 3388to be converted into a file mode, for example. Although Perl
3389automatically converts strings into numbers as needed, this automatic
3390conversion assumes base 10.
3391
3392Leading white space is ignored without warning, as too are any trailing
3393non-digits, such as a decimal point (C<oct> only handles non-negative
3394integers, not negative integers or floating point).
a0d0e21e
LW
3395
3396=item open FILEHANDLE,EXPR
d74e8afc 3397X<open> X<pipe> X<file, open> X<fopen>
a0d0e21e 3398
68bd7414
NIS
3399=item open FILEHANDLE,MODE,EXPR
3400
3401=item open FILEHANDLE,MODE,EXPR,LIST
3402
ba964c95
T
3403=item open FILEHANDLE,MODE,REFERENCE
3404
a0d0e21e
LW
3405=item open FILEHANDLE
3406
3407Opens the file whose filename is given by EXPR, and associates it with
ed53a2bb
JH
3408FILEHANDLE.
3409
460b70c2
GS
3410Simple examples to open a file for reading:
3411
8f1da26d
TC
3412 open(my $fh, "<", "input.txt")
3413 or die "cannot open < input.txt: $!";
460b70c2
GS
3414
3415and for writing:
3416
8f1da26d
TC
3417 open(my $fh, ">", "output.txt")
3418 or die "cannot open > output.txt: $!";
460b70c2 3419
ed53a2bb
JH
3420(The following is a comprehensive reference to open(): for a gentler
3421introduction you may consider L<perlopentut>.)
3422
8f1da26d
TC
3423If FILEHANDLE is an undefined scalar variable (or array or hash element), a
3424new filehandle is autovivified, meaning that the variable is assigned a
3425reference to a newly allocated anonymous filehandle. Otherwise if
3426FILEHANDLE is an expression, its value is the real filehandle. (This is
3427considered a symbolic reference, so C<use strict "refs"> should I<not> be
3428in effect.)
3429
3430If EXPR is omitted, the global (package) scalar variable of the same
3431name as the FILEHANDLE contains the filename. (Note that lexical
3432variables--those declared with C<my> or C<state>--will not work for this
3433purpose; so if you're using C<my> or C<state>, specify EXPR in your
3434call to open.)
3435
3436If three (or more) arguments are specified, the open mode (including
3437optional encoding) in the second argument are distinct from the filename in
3438the third. If MODE is C<< < >> or nothing, the file is opened for input.
3439If MODE is C<< > >>, the file is opened for output, with existing files
3440first being truncated ("clobbered") and nonexisting files newly created.
3441If MODE is C<<< >> >>>, the file is opened for appending, again being
3442created if necessary.
3443
3444You can put a C<+> in front of the C<< > >> or C<< < >> to
ed53a2bb 3445indicate that you want both read and write access to the file; thus
8f1da26d
TC
3446C<< +< >> is almost always preferred for read/write updates--the
3447C<< +> >> mode would clobber the file first. You cant usually use
ed53a2bb 3448either read-write mode for updating textfiles, since they have
bea6df1c 3449variable-length records. See the B<-i> switch in L<perlrun> for a
ed53a2bb 3450better approach. The file is created with permissions of C<0666>
e1020413 3451modified by the process's C<umask> value.
ed53a2bb 3452
8f1da26d
TC
3453These various prefixes correspond to the fopen(3) modes of C<r>,
3454C<r+>, C<w>, C<w+>, C<a>, and C<a+>.
5f05dabc 3455
8f1da26d
TC
3456In the one- and two-argument forms of the call, the mode and filename
3457should be concatenated (in that order), preferably separated by white
3458space. You can--but shouldn't--omit the mode in these forms when that mode
3459is C<< < >>. It is always safe to use the two-argument form of C<open> if
3460the filename argument is a known literal.
6170680b 3461
8f1da26d 3462For three or more arguments if MODE is C<|->, the filename is
ed53a2bb 3463interpreted as a command to which output is to be piped, and if MODE
8f1da26d 3464is C<-|>, the filename is interpreted as a command that pipes
3b10bc60 3465output to us. In the two-argument (and one-argument) form, one should
8f1da26d 3466replace dash (C<->) with the command.
ed53a2bb
JH
3467See L<perlipc/"Using open() for IPC"> for more examples of this.
3468(You are not allowed to C<open> to a command that pipes both in I<and>
3469out, but see L<IPC::Open2>, L<IPC::Open3>, and
96090e4f
LB
3470L<perlipc/"Bidirectional Communication with Another Process"> for
3471alternatives.)
ed53a2bb 3472
3b10bc60 3473In the form of pipe opens taking three or more arguments, if LIST is specified
ed53a2bb
JH
3474(extra arguments after the command name) then LIST becomes arguments
3475to the command invoked if the platform supports it. The meaning of
3476C<open> with more than three arguments for non-pipe modes is not yet
3b10bc60 3477defined, but experimental "layers" may give extra LIST arguments
ed53a2bb 3478meaning.
6170680b 3479
8f1da26d
TC
3480In the two-argument (and one-argument) form, opening C<< <- >>
3481or C<-> opens STDIN and opening C<< >- >> opens STDOUT.
6170680b 3482
8f1da26d
TC
3483You may (and usually should) use the three-argument form of open to specify
3484I/O layers (sometimes referred to as "disciplines") to apply to the handle
fae2c0fb 3485that affect how the input and output are processed (see L<open> and
3b10bc60 3486L<PerlIO> for more details). For example:
7207e29d 3487
3b10bc60 3488 open(my $fh, "<:encoding(UTF-8)", "filename")
3489 || die "can't open UTF-8 encoded filename: $!";
9124316e 3490
8f1da26d 3491opens the UTF8-encoded file containing Unicode characters;
6d5e88a0 3492see L<perluniintro>. Note that if layers are specified in the
3b10bc60 3493three-argument form, then default layers stored in ${^OPEN} (see L<perlvar>;
6d5e88a0 3494usually set by the B<open> pragma or the switch B<-CioD>) are ignored.
c0fd9d21
FC
3495Those layers will also be ignored if you specifying a colon with no name
3496following it. In that case the default layer for the operating system
3497(:raw on Unix, :crlf on Windows) is used.
ed53a2bb 3498
80d38338 3499Open returns nonzero on success, the undefined value otherwise. If
ed53a2bb
JH
3500the C<open> involved a pipe, the return value happens to be the pid of
3501the subprocess.
cb1a09d0 3502
ed53a2bb
JH
3503If you're running Perl on a system that distinguishes between text
3504files and binary files, then you should check out L</binmode> for tips
3505for dealing with this. The key distinction between systems that need
3506C<binmode> and those that don't is their text file formats. Systems
80d38338
TC
3507like Unix, Mac OS, and Plan 9, that end lines with a single
3508character and encode that character in C as C<"\n"> do not
ed53a2bb 3509need C<binmode>. The rest need it.
cb1a09d0 3510
80d38338
TC
3511When opening a file, it's seldom a good idea to continue
3512if the request failed, so C<open> is frequently used with
19799a22 3513C<die>. Even if C<die> won't do what you want (say, in a CGI script,
80d38338
TC
3514where you want to format a suitable error message (but there are
3515modules that can help with that problem)) always check
3516the return value from opening a file.
fb73857a 3517
8f1da26d 3518As a special case the three-argument form with a read/write mode and the third
ed53a2bb 3519argument being C<undef>:
b76cc8ba 3520
460b70c2 3521 open(my $tmp, "+>", undef) or die ...
b76cc8ba 3522
8f1da26d 3523opens a filehandle to an anonymous temporary file. Also using C<< +< >>
f253e835
JH
3524works for symmetry, but you really should consider writing something
3525to the temporary file first. You will need to seek() to do the
3526reading.
b76cc8ba 3527
3b10bc60 3528Since v5.8.0, Perl has built using PerlIO by default. Unless you've
8f1da26d
TC
3529changed this (such as building Perl with C<Configure -Uuseperlio>), you can
3530open filehandles directly to Perl scalars via:
ba964c95 3531
8f1da26d 3532 open($fh, ">", \$variable) || ..
b996200f 3533
3b10bc60 3534To (re)open C<STDOUT> or C<STDERR> as an in-memory file, close it first:
b996200f
SB
3535
3536 close STDOUT;
8f1da26d
TC
3537 open(STDOUT, ">", \$variable)
3538 or die "Can't open STDOUT: $!";
ba964c95 3539
3b10bc60 3540General examples:
a0d0e21e
LW
3541
3542 $ARTICLE = 100;
8f1da26d 3543 open(ARTICLE) or die "Can't find article $ARTICLE: $!\n";
a0d0e21e
LW
3544 while (<ARTICLE>) {...
3545
8f1da26d 3546 open(LOG, ">>/usr/spool/news/twitlog"); # (log is reserved)
fb73857a 3547 # if the open fails, output is discarded
a0d0e21e 3548
8f1da26d 3549 open(my $dbase, "+<", "dbase.mine") # open for update
a9a5a0dc 3550 or die "Can't open 'dbase.mine' for update: $!";
cb1a09d0 3551
8f1da26d 3552 open(my $dbase, "+<dbase.mine") # ditto
a9a5a0dc 3553 or die "Can't open 'dbase.mine' for update: $!";
6170680b 3554
8f1da26d 3555 open(ARTICLE, "-|", "caesar <$article") # decrypt article
a9a5a0dc 3556 or die "Can't start caesar: $!";
a0d0e21e 3557
5ed4f2ec 3558 open(ARTICLE, "caesar <$article |") # ditto
a9a5a0dc 3559 or die "Can't start caesar: $!";
6170680b 3560
5ed4f2ec 3561 open(EXTRACT, "|sort >Tmp$$") # $$ is our process id
a9a5a0dc 3562 or die "Can't start sort: $!";
a0d0e21e 3563
3b10bc60 3564 # in-memory files
8f1da26d 3565 open(MEMORY, ">", \$var)
a9a5a0dc 3566 or die "Can't open memory file: $!";
80d38338 3567 print MEMORY "foo!\n"; # output will appear in $var
ba964c95 3568
a0d0e21e
LW
3569 # process argument list of files along with any includes
3570
3571 foreach $file (@ARGV) {
8f1da26d 3572 process($file, "fh00");
a0d0e21e
LW
3573 }
3574
3575 sub process {
a9a5a0dc
VP
3576 my($filename, $input) = @_;
3577 $input++; # this is a string increment
8f1da26d 3578 unless (open($input, "<", $filename)) {
a9a5a0dc
VP
3579 print STDERR "Can't open $filename: $!\n";
3580 return;
3581 }
5ed4f2ec 3582
a9a5a0dc
VP
3583 local $_;
3584 while (<$input>) { # note use of indirection
3585 if (/^#include "(.*)"/) {
3586 process($1, $input);
3587 next;
3588 }
3589 #... # whatever
5ed4f2ec 3590 }
a0d0e21e
LW
3591 }
3592
ae4c5402 3593See L<perliol> for detailed info on PerlIO.
2ce64696 3594
a0d0e21e 3595You may also, in the Bourne shell tradition, specify an EXPR beginning
8f1da26d 3596with C<< >& >>, in which case the rest of the string is interpreted
00cafafa 3597as the name of a filehandle (or file descriptor, if numeric) to be
f4084e39 3598duped (as C<dup(2)>) and opened. You may use C<&> after C<< > >>,
00cafafa
JH
3599C<<< >> >>>, C<< < >>, C<< +> >>, C<<< +>> >>>, and C<< +< >>.
3600The mode you specify should match the mode of the original filehandle.
3601(Duping a filehandle does not take into account any existing contents
8f1da26d
TC
3602of IO buffers.) If you use the three-argument form, then you can pass either a
3603number, the name of a filehandle, or the normal "reference to a glob".
6170680b 3604
eae1b76b
SB
3605Here is a script that saves, redirects, and restores C<STDOUT> and
3606C<STDERR> using various methods:
a0d0e21e
LW
3607
3608 #!/usr/bin/perl
8f1da26d
TC
3609 open(my $oldout, ">&STDOUT") or die "Can't dup STDOUT: $!";
3610 open(OLDERR, ">&", \*STDERR) or die "Can't dup STDERR: $!";
818c4caa 3611
8f1da26d
TC
3612 open(STDOUT, '>', "foo.out") or die "Can't redirect STDOUT: $!";
3613 open(STDERR, ">&STDOUT") or die "Can't dup STDOUT: $!";
a0d0e21e 3614
5ed4f2ec 3615 select STDERR; $| = 1; # make unbuffered
3616 select STDOUT; $| = 1; # make unbuffered
a0d0e21e 3617
5ed4f2ec 3618 print STDOUT "stdout 1\n"; # this works for
3619 print STDERR "stderr 1\n"; # subprocesses too
a0d0e21e 3620
8f1da26d
TC
3621 open(STDOUT, ">&", $oldout) or die "Can't dup \$oldout: $!";
3622 open(STDERR, ">&OLDERR") or die "Can't dup OLDERR: $!";
a0d0e21e
LW
3623
3624 print STDOUT "stdout 2\n";
3625 print STDERR "stderr 2\n";
3626
ef8b303f
JH
3627If you specify C<< '<&=X' >>, where C<X> is a file descriptor number
3628or a filehandle, then Perl will do an equivalent of C's C<fdopen> of
f4084e39 3629that file descriptor (and not call C<dup(2)>); this is more
ef8b303f 3630parsimonious of file descriptors. For example:
a0d0e21e 3631
00cafafa 3632 # open for input, reusing the fileno of $fd
a0d0e21e 3633 open(FILEHANDLE, "<&=$fd")
df632fdf 3634
b76cc8ba 3635or
df632fdf 3636
b76cc8ba 3637 open(FILEHANDLE, "<&=", $fd)
a0d0e21e 3638
00cafafa
JH
3639or
3640
3641 # open for append, using the fileno of OLDFH
3642 open(FH, ">>&=", OLDFH)
3643
3644or
3645
3646 open(FH, ">>&=OLDFH")
3647
ef8b303f
JH
3648Being parsimonious on filehandles is also useful (besides being
3649parsimonious) for example when something is dependent on file
3650descriptors, like for example locking using flock(). If you do just
8f1da26d
TC
3651C<< open(A, ">>&B") >>, the filehandle A will not have the same file
3652descriptor as B, and therefore flock(A) will not flock(B) nor vice
3653versa. But with C<< open(A, ">>&=B") >>, the filehandles will share
3654the same underlying system file descriptor.
3655
3656Note that under Perls older than 5.8.0, Perl uses the standard C library's'
3657fdopen() to implement the C<=> functionality. On many Unix systems,
3658fdopen() fails when file descriptors exceed a certain value, typically 255.
3659For Perls 5.8.0 and later, PerlIO is (most often) the default.
3660
3661You can see whether your Perl was built with PerlIO by running C<perl -V>
3662and looking for the C<useperlio=> line. If C<useperlio> is C<define>, you
3663have PerlIO; otherwise you don't.
3664
3665If you open a pipe on the command C<-> (that is, specify either C<|-> or C<-|>
3666with the one- or two-argument forms of C<open>),
3667an implicit C<fork> is done, so C<open> returns twice: in the parent
3668process it returns the pid
3669of the child process, and in the child process it returns (a defined) C<0>.
3670Use C<defined($pid)> or C<//> to determine whether the open was successful.
3671
3672For example, use either
3673
5f64ea7a 3674 $child_pid = open(FROM_KID, "-|") // die "can't fork: $!";
8f1da26d
TC
3675
3676or
3677 $child_pid = open(TO_KID, "|-") // die "can't fork: $!";
3678
3679followed by
3680
3681 if ($child_pid) {
3682 # am the parent:
3683 # either write TO_KID or else read FROM_KID
3684 ...
3685 wait $child_pid;
3686 } else {
3687 # am the child; use STDIN/STDOUT normally
3688 ...
3689 exit;
3690 }
3691
3b10bc60 3692The filehandle behaves normally for the parent, but I/O to that
a0d0e21e 3693filehandle is piped from/to the STDOUT/STDIN of the child process.
3b10bc60 3694In the child process, the filehandle isn't opened--I/O happens from/to
3695the new STDOUT/STDIN. Typically this is used like the normal
a0d0e21e 3696piped open when you want to exercise more control over just how the
3b10bc60 3697pipe command gets executed, such as when running setuid and
3698you don't want to have to scan shell commands for metacharacters.
3699
5b867647 3700The following blocks are more or less equivalent:
a0d0e21e
LW
3701
3702 open(FOO, "|tr '[a-z]' '[A-Z]'");
8f1da26d
TC
3703 open(FOO, "|-", "tr '[a-z]' '[A-Z]'");
3704 open(FOO, "|-") || exec 'tr', '[a-z]', '[A-Z]';
3705 open(FOO, "|-", "tr", '[a-z]', '[A-Z]');
a0d0e21e
LW
3706
3707 open(FOO, "cat -n '$file'|");
8f1da26d
TC
3708 open(FOO, "-|", "cat -n '$file'");
3709 open(FOO, "-|") || exec "cat", "-n", $file;
3710 open(FOO, "-|", "cat", "-n", $file);
b76cc8ba 3711
8f1da26d 3712The last two examples in each block show the pipe as "list form", which is
64da03b2 3713not yet supported on all platforms. A good rule of thumb is that if
8f1da26d
TC
3714your platform has a real C<fork()> (in other words, if your platform is
3715Unix, including Linux and MacOS X), you can use the list form. You would
3716want to use the list form of the pipe so you can pass literal arguments
3717to the command without risk of the shell interpreting any shell metacharacters
3718in them. However, this also bars you from opening pipes to commands
3719that intentionally contain shell metacharacters, such as:
3720
3721 open(FOO, "|cat -n | expand -4 | lpr")
3722 // die "Can't open pipeline to lpr: $!";
a0d0e21e 3723
4633a7c4
LW
3724See L<perlipc/"Safe Pipe Opens"> for more examples of this.
3725
0f897271
GS
3726Beginning with v5.6.0, Perl will attempt to flush all files opened for
3727output before any operation that may do a fork, but this may not be
3728supported on some platforms (see L<perlport>). To be safe, you may need
3729to set C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method
3730of C<IO::Handle> on any open handles.
3731
ed53a2bb
JH
3732On systems that support a close-on-exec flag on files, the flag will
3733be set for the newly opened file descriptor as determined by the value
8f1da26d 3734of C<$^F>. See L<perlvar/$^F>.
a0d0e21e 3735
0dccf244 3736Closing any piped filehandle causes the parent process to wait for the
8f1da26d 3737child to finish, then returns the status value in C<$?> and
e5218da5 3738C<${^CHILD_ERROR_NATIVE}>.
0dccf244 3739
8f1da26d
TC
3740The filename passed to the one- and two-argument forms of open() will
3741have leading and trailing whitespace deleted and normal
ed53a2bb 3742redirection characters honored. This property, known as "magic open",
5a964f20 3743can often be used to good effect. A user could specify a filename of
7660c0ab 3744F<"rsh cat file |">, or you could change certain filenames as needed:
5a964f20
TC
3745
3746 $filename =~ s/(.*\.gz)\s*$/gzip -dc < $1|/;
3747 open(FH, $filename) or die "Can't open $filename: $!";
3748
8f1da26d 3749Use the three-argument form to open a file with arbitrary weird characters in it,
6170680b 3750
8f1da26d
TC
3751 open(FOO, "<", $file)
3752 || die "can't open < $file: $!";
6170680b
IZ
3753
3754otherwise it's necessary to protect any leading and trailing whitespace:
5a964f20
TC
3755
3756 $file =~ s#^(\s)#./$1#;
8f1da26d
TC
3757 open(FOO, "< $file\0")
3758 || die "open failed: $!";
5a964f20 3759
a31a806a 3760(this may not work on some bizarre filesystems). One should
8f1da26d 3761conscientiously choose between the I<magic> and I<three-argument> form
6170680b
IZ
3762of open():
3763
8f1da26d 3764 open(IN, $ARGV[0]) || die "can't open $ARGV[0]: $!";
6170680b
IZ
3765
3766will allow the user to specify an argument of the form C<"rsh cat file |">,
80d38338 3767but will not work on a filename that happens to have a trailing space, while
6170680b 3768
8f1da26d
TC
3769 open(IN, "<", $ARGV[0])
3770 || die "can't open < $ARGV[0]: $!";
6170680b
IZ
3771
3772will have exactly the opposite restrictions.
3773
01aa884e 3774If you want a "real" C C<open> (see L<open(2)> on your system), then you
8f1da26d
TC
3775should use the C<sysopen> function, which involves no such magic (but may
3776use subtly different filemodes than Perl open(), which is mapped to C
3777fopen()). This is another way to protect your filenames from
3778interpretation. For example:
5a964f20
TC
3779
3780 use IO::Handle;
3781 sysopen(HANDLE, $path, O_RDWR|O_CREAT|O_EXCL)
a9a5a0dc 3782 or die "sysopen $path: $!";
5a964f20 3783 $oldfh = select(HANDLE); $| = 1; select($oldfh);
38762f02 3784 print HANDLE "stuff $$\n";
5a964f20
TC
3785 seek(HANDLE, 0, 0);
3786 print "File contains: ", <HANDLE>;
3787
7660c0ab
A
3788Using the constructor from the C<IO::Handle> package (or one of its
3789subclasses, such as C<IO::File> or C<IO::Socket>), you can generate anonymous
8f1da26d
TC
3790filehandles that have the scope of the variables used to hold them, then
3791automatically (but silently) close once their reference counts become
3792zero, typically at scope exit:
c07a80fd 3793
5f05dabc 3794 use IO::File;
5a964f20 3795 #...
c07a80fd 3796 sub read_myfile_munged {
a9a5a0dc 3797 my $ALL = shift;
8f1da26d 3798 # or just leave it undef to autoviv
a9a5a0dc 3799 my $handle = IO::File->new;
8f1da26d 3800 open($handle, "<", "myfile") or die "myfile: $!";
a9a5a0dc
VP
3801 $first = <$handle>
3802 or return (); # Automatically closed here.
8f1da26d
TC
3803 mung($first) or die "mung failed"; # Or here.
3804 return (first, <$handle>) if $ALL; # Or here.
3805 return $first; # Or here.
c07a80fd 3806 }
3807
8f1da26d
TC
3808B<WARNING:> The previous example has a bug because the automatic
3809close that happens when the refcount on C<handle> does not
3810properly detect and report failures. I<Always> close the handle
3811yourself and inspect the return value.
3812
3813 close($handle)
3814 || warn "close failed: $!";
3815
b687b08b 3816See L</seek> for some details about mixing reading and writing.
a0d0e21e 3817
ea9eb35a
BJ
3818Portability issues: L<perlport/open>.
3819
a0d0e21e 3820=item opendir DIRHANDLE,EXPR
d74e8afc 3821X<opendir>
a0d0e21e 3822
19799a22
GS
3823Opens a directory named EXPR for processing by C<readdir>, C<telldir>,
3824C<seekdir>, C<rewinddir>, and C<closedir>. Returns true if successful.
a28cd5c9
NT
3825DIRHANDLE may be an expression whose value can be used as an indirect
3826dirhandle, usually the real dirhandle name. If DIRHANDLE is an undefined
3827scalar variable (or array or hash element), the variable is assigned a
8f1da26d 3828reference to a new anonymous dirhandle; that is, it's autovivified.
a0d0e21e
LW
3829DIRHANDLEs have their own namespace separate from FILEHANDLEs.
3830
bea6df1c 3831See the example at C<readdir>.
b0169937 3832
a0d0e21e 3833=item ord EXPR
d74e8afc 3834X<ord> X<encoding>
a0d0e21e 3835
54310121 3836=item ord
bbce6d69 3837
121910a4 3838Returns the numeric (the native 8-bit encoding, like ASCII or EBCDIC,
8f1da26d
TC
3839or Unicode) value of the first character of EXPR.
3840If EXPR is an empty string, returns 0. If EXPR is omitted, uses C<$_>.
3841(Note I<character>, not byte.)
121910a4
JH
3842
3843For the reverse, see L</chr>.
2575c402 3844See L<perlunicode> for more about Unicode.
a0d0e21e 3845
77ca0c92 3846=item our EXPR
d74e8afc 3847X<our> X<global>
77ca0c92 3848
36fb85f3 3849=item our TYPE EXPR
307ea6df 3850
1d2de774 3851=item our EXPR : ATTRS
9969eac4 3852
1d2de774 3853=item our TYPE EXPR : ATTRS
307ea6df 3854
85d8b7d5 3855C<our> associates a simple name with a package variable in the current
65c680eb
MS
3856package for use within the current scope. When C<use strict 'vars'> is in
3857effect, C<our> lets you use declared global variables without qualifying
3858them with package names, within the lexical scope of the C<our> declaration.
bea6df1c 3859In this way C<our> differs from C<use vars>, which is package-scoped.
65c680eb 3860
8f1da26d
TC
3861Unlike C<my> or C<state>, which allocates storage for a variable and
3862associates a simple name with that storage for use within the current
3863scope, C<our> associates a simple name with a package (read: global)
3864variable in the current package, for use within the current lexical scope.
3865In other words, C<our> has the same scoping rules as C<my> or C<state>, but
3866does not necessarily create a variable.
65c680eb
MS
3867
3868If more than one value is listed, the list must be placed
3869in parentheses.
85d8b7d5
MS
3870
3871 our $foo;
3872 our($bar, $baz);
77ca0c92 3873
f472eb5c
GS
3874An C<our> declaration declares a global variable that will be visible
3875across its entire lexical scope, even across package boundaries. The
3876package in which the variable is entered is determined at the point
3877of the declaration, not at the point of use. This means the following
3878behavior holds:
3879
3880 package Foo;
5ed4f2ec 3881 our $bar; # declares $Foo::bar for rest of lexical scope
f472eb5c
GS
3882 $bar = 20;
3883
3884 package Bar;
5ed4f2ec 3885 print $bar; # prints 20, as it refers to $Foo::bar
f472eb5c 3886
65c680eb
MS
3887Multiple C<our> declarations with the same name in the same lexical
3888scope are allowed if they are in different packages. If they happen
3889to be in the same package, Perl will emit warnings if you have asked
3890for them, just like multiple C<my> declarations. Unlike a second
3891C<my> declaration, which will bind the name to a fresh variable, a
3892second C<our> declaration in the same package, in the same scope, is
3893merely redundant.
f472eb5c
GS
3894
3895 use warnings;
3896 package Foo;
5ed4f2ec 3897 our $bar; # declares $Foo::bar for rest of lexical scope
f472eb5c
GS
3898 $bar = 20;
3899
3900 package Bar;
5ed4f2ec 3901 our $bar = 30; # declares $Bar::bar for rest of lexical scope
3902 print $bar; # prints 30
f472eb5c 3903
5ed4f2ec 3904 our $bar; # emits warning but has no other effect
3905 print $bar; # still prints 30
f472eb5c 3906
9969eac4 3907An C<our> declaration may also have a list of attributes associated
307ea6df
JH
3908with it.
3909
1d2de774
JH
3910The exact semantics and interface of TYPE and ATTRS are still
3911evolving. TYPE is currently bound to the use of C<fields> pragma,
307ea6df
JH
3912and attributes are handled using the C<attributes> pragma, or starting
3913from Perl 5.8.0 also via the C<Attribute::Handlers> module. See
3914L<perlsub/"Private Variables via my()"> for details, and L<fields>,
3915L<attributes>, and L<Attribute::Handlers>.
3916
a0d0e21e 3917=item pack TEMPLATE,LIST
d74e8afc 3918X<pack>
a0d0e21e 3919
2b6c5635
GS
3920Takes a LIST of values and converts it into a string using the rules
3921given by the TEMPLATE. The resulting string is the concatenation of
3922the converted values. Typically, each converted value looks
3923like its machine-level representation. For example, on 32-bit machines
3980dc9c
KW
3924an integer may be represented by a sequence of 4 bytes, which will in
3925Perl be presented as a string that's 4 characters long.
3926
3927See L<perlpacktut> for an introduction to this function.
e1b711da 3928
18529408
IZ
3929The TEMPLATE is a sequence of characters that give the order and type
3930of values, as follows:
a0d0e21e 3931
5ed4f2ec 3932 a A string with arbitrary binary data, will be null padded.
3933 A A text (ASCII) string, will be space padded.
3b10bc60 3934 Z A null-terminated (ASCIZ) string, will be null padded.
5a929a98 3935
4d0444a3
FC
3936 b A bit string (ascending bit order inside each byte,
3937 like vec()).
5ed4f2ec 3938 B A bit string (descending bit order inside each byte).
3939 h A hex string (low nybble first).
3940 H A hex string (high nybble first).
a0d0e21e 3941
5ed4f2ec 3942 c A signed char (8-bit) value.
3943 C An unsigned char (octet) value.
3b10bc60 3944 W An unsigned char value (can be greater than 255).
96e4d5b1 3945
5ed4f2ec 3946 s A signed short (16-bit) value.
3947 S An unsigned short value.
96e4d5b1 3948
5ed4f2ec 3949 l A signed long (32-bit) value.
3950 L An unsigned long value.
a0d0e21e 3951
5ed4f2ec 3952 q A signed quad (64-bit) value.
3953 Q An unsigned quad value.
4d0444a3
FC
3954 (Quads are available only if your system supports 64-bit
3955 integer values _and_ if Perl has been compiled to support
3956 those. Raises an exception otherwise.)
dae0da7a 3957
5ed4f2ec 3958 i A signed integer value.
3959 I A unsigned integer value.
4d0444a3
FC
3960 (This 'integer' is _at_least_ 32 bits wide. Its exact
3961 size depends on what a local C compiler calls 'int'.)
2b191d53 3962
5ed4f2ec 3963 n An unsigned short (16-bit) in "network" (big-endian) order.
3964 N An unsigned long (32-bit) in "network" (big-endian) order.
3965 v An unsigned short (16-bit) in "VAX" (little-endian) order.
3966 V An unsigned long (32-bit) in "VAX" (little-endian) order.
1109a392 3967
4d0444a3
FC
3968 j A Perl internal signed integer value (IV).
3969 J A Perl internal unsigned integer value (UV).
92d41999 3970
3b10bc60 3971 f A single-precision float in native format.
3972 d A double-precision float in native format.
a0d0e21e 3973
3b10bc60 3974 F A Perl internal floating-point value (NV) in native format
3975 D A float of long-double precision in native format.
4d0444a3
FC
3976 (Long doubles are available only if your system supports
3977 long double values _and_ if Perl has been compiled to
3978 support those. Raises an exception otherwise.)
92d41999 3979
5ed4f2ec 3980 p A pointer to a null-terminated string.
3981 P A pointer to a structure (fixed-length string).
a0d0e21e 3982
5ed4f2ec 3983 u A uuencoded string.
4d0444a3
FC
3984 U A Unicode character number. Encodes to a character in char-
3985 acter mode and UTF-8 (or UTF-EBCDIC in EBCDIC platforms) in
3986 byte mode.
a0d0e21e 3987
4d0444a3
FC
3988 w A BER compressed integer (not an ASN.1 BER, see perlpacktut
3989 for details). Its bytes represent an unsigned integer in
3990 base 128, most significant digit first, with as few digits
3991 as possible. Bit eight (the high bit) is set on each byte
3992 except the last.
def98dd4 3993
3b10bc60 3994 x A null byte (a.k.a ASCII NUL, "\000", chr(0))
5ed4f2ec 3995 X Back up a byte.
3b10bc60 3996 @ Null-fill or truncate to absolute position, counted from the
3997 start of the innermost ()-group.
4d0444a3
FC
3998 . Null-fill or truncate to absolute position specified by
3999 the value.
5ed4f2ec 4000 ( Start of a ()-group.
a0d0e21e 4001
3b10bc60 4002One or more modifiers below may optionally follow certain letters in the
4003TEMPLATE (the second column lists letters for which the modifier is valid):
1109a392
MHM
4004
4005 ! sSlLiI Forces native (short, long, int) sizes instead
4006 of fixed (16-/32-bit) sizes.
4007
4008 xX Make x and X act as alignment commands.
4009
4010 nNvV Treat integers as signed instead of unsigned.
4011
28be1210
TH
4012 @. Specify position as byte offset in the internal
4013 representation of the packed string. Efficient but
4014 dangerous.
4015
1109a392
MHM
4016 > sSiIlLqQ Force big-endian byte-order on the type.
4017 jJfFdDpP (The "big end" touches the construct.)
4018
4019 < sSiIlLqQ Force little-endian byte-order on the type.
4020 jJfFdDpP (The "little end" touches the construct.)
4021
3b10bc60 4022The C<< > >> and C<< < >> modifiers can also be used on C<()> groups
4023to force a particular byte-order on all components in that group,
4024including all its subgroups.
66c611c5 4025
5a929a98
VU
4026The following rules apply:
4027
3b10bc60 4028=over
5a929a98
VU
4029
4030=item *
4031
3b10bc60 4032Each letter may optionally be followed by a number indicating the repeat
4033count. A numeric repeat count may optionally be enclosed in brackets, as
4034in C<pack("C[80]", @arr)>. The repeat count gobbles that many values from
4035the LIST when used with all format types other than C<a>, C<A>, C<Z>, C<b>,
4036C<B>, C<h>, C<H>, C<@>, C<.>, C<x>, C<X>, and C<P>, where it means
7698aede 4037something else, described below. Supplying a C<*> for the repeat count
3b10bc60 4038instead of a number means to use however many items are left, except for:
4039
4040=over
4041
4042=item *
4043
4044C<@>, C<x>, and C<X>, where it is equivalent to C<0>.
4045
4046=item *
4047
4048<.>, where it means relative to the start of the string.
4049
4050=item *
4051
4052C<u>, where it is equivalent to 1 (or 45, which here is equivalent).
4053
4054=back
4055
4056One can replace a numeric repeat count with a template letter enclosed in
4057brackets to use the packed byte length of the bracketed template for the
4058repeat count.
4059
4060For example, the template C<x[L]> skips as many bytes as in a packed long,
4061and the template C<"$t X[$t] $t"> unpacks twice whatever $t (when
4062variable-expanded) unpacks. If the template in brackets contains alignment
4063commands (such as C<x![d]>), its packed length is calculated as if the
4064start of the template had the maximal possible alignment.
4065
4066When used with C<Z>, a C<*> as the repeat count is guaranteed to add a
4067trailing null byte, so the resulting string is always one byte longer than
4068the byte length of the item itself.
2b6c5635 4069
28be1210 4070When used with C<@>, the repeat count represents an offset from the start
3b10bc60 4071of the innermost C<()> group.
4072
4073When used with C<.>, the repeat count determines the starting position to
4074calculate the value offset as follows:
4075
4076=over
4077
4078=item *
4079
4080If the repeat count is C<0>, it's relative to the current position.
28be1210 4081
3b10bc60 4082=item *
4083
4084If the repeat count is C<*>, the offset is relative to the start of the
4085packed string.
4086
4087=item *
4088
4089And if it's an integer I<n>, the offset is relative to the start of the
8f1da26d 4090I<n>th innermost C<( )> group, or to the start of the string if I<n> is
3b10bc60 4091bigger then the group level.
4092
4093=back
28be1210 4094
951ba7fe 4095The repeat count for C<u> is interpreted as the maximal number of bytes
f337b084
TH
4096to encode per line of output, with 0, 1 and 2 replaced by 45. The repeat
4097count should not be more than 65.
5a929a98
VU
4098
4099=item *
4100
951ba7fe 4101The C<a>, C<A>, and C<Z> types gobble just one value, but pack it as a
3b10bc60 4102string of length count, padding with nulls or spaces as needed. When
18bdf90a 4103unpacking, C<A> strips trailing whitespace and nulls, C<Z> strips everything
8f1da26d 4104after the first null, and C<a> returns data with no stripping at all.
2b6c5635 4105
3b10bc60 4106If the value to pack is too long, the result is truncated. If it's too
4107long and an explicit count is provided, C<Z> packs only C<$count-1> bytes,
4108followed by a null byte. Thus C<Z> always packs a trailing null, except
8f1da26d 4109when the count is 0.
5a929a98
VU
4110
4111=item *
4112
3b10bc60 4113Likewise, the C<b> and C<B> formats pack a string that's that many bits long.
8f1da26d
TC
4114Each such format generates 1 bit of the result. These are typically followed
4115by a repeat count like C<B8> or C<B64>.
3b10bc60 4116
c73032f5 4117Each result bit is based on the least-significant bit of the corresponding
f337b084 4118input character, i.e., on C<ord($char)%2>. In particular, characters C<"0">
3b10bc60 4119and C<"1"> generate bits 0 and 1, as do characters C<"\000"> and C<"\001">.
c73032f5 4120
3b10bc60 4121Starting from the beginning of the input string, each 8-tuple
4122of characters is converted to 1 character of output. With format C<b>,
f337b084 4123the first character of the 8-tuple determines the least-significant bit of a
3b10bc60 4124character; with format C<B>, it determines the most-significant bit of
f337b084 4125a character.
c73032f5 4126
3b10bc60 4127If the length of the input string is not evenly divisible by 8, the
f337b084 4128remainder is packed as if the input string were padded by null characters
3b10bc60 4129at the end. Similarly during unpacking, "extra" bits are ignored.
c73032f5 4130
3b10bc60 4131If the input string is longer than needed, remaining characters are ignored.
4132
4133A C<*> for the repeat count uses all characters of the input field.
8f1da26d 4134On unpacking, bits are converted to a string of C<0>s and C<1>s.
5a929a98
VU
4135
4136=item *
4137
3b10bc60 4138The C<h> and C<H> formats pack a string that many nybbles (4-bit groups,
4139representable as hexadecimal digits, C<"0".."9"> C<"a".."f">) long.
5a929a98 4140
8f1da26d 4141For each such format, pack() generates 4 bits of result.
3b10bc60 4142With non-alphabetical characters, the result is based on the 4 least-significant
f337b084
TH
4143bits of the input character, i.e., on C<ord($char)%16>. In particular,
4144characters C<"0"> and C<"1"> generate nybbles 0 and 1, as do bytes
ce7b6f06 4145C<"\000"> and C<"\001">. For characters C<"a".."f"> and C<"A".."F">, the result
c73032f5 4146is compatible with the usual hexadecimal digits, so that C<"a"> and
8f1da26d
TC
4147C<"A"> both generate the nybble C<0xA==10>. Use only these specific hex
4148characters with this format.
c73032f5 4149
3b10bc60 4150Starting from the beginning of the template to pack(), each pair
4151of characters is converted to 1 character of output. With format C<h>, the
f337b084 4152first character of the pair determines the least-significant nybble of the
3b10bc60 4153output character; with format C<H>, it determines the most-significant
c73032f5
IZ
4154nybble.
4155
3b10bc60 4156If the length of the input string is not even, it behaves as if padded by
4157a null character at the end. Similarly, "extra" nybbles are ignored during
4158unpacking.
4159
4160If the input string is longer than needed, extra characters are ignored.
c73032f5 4161
3b10bc60 4162A C<*> for the repeat count uses all characters of the input field. For
4163unpack(), nybbles are converted to a string of hexadecimal digits.
c73032f5 4164
5a929a98
VU
4165=item *
4166
3b10bc60 4167The C<p> format packs a pointer to a null-terminated string. You are
4168responsible for ensuring that the string is not a temporary value, as that
4169could potentially get deallocated before you got around to using the packed
4170result. The C<P> format packs a pointer to a structure of the size indicated
4171by the length. A null pointer is created if the corresponding value for
4172C<p> or C<P> is C<undef>; similarly with unpack(), where a null pointer
4173unpacks into C<undef>.
5a929a98 4174
3b10bc60 4175If your system has a strange pointer size--meaning a pointer is neither as
4176big as an int nor as big as a long--it may not be possible to pack or
1109a392 4177unpack pointers in big- or little-endian byte order. Attempting to do
3b10bc60 4178so raises an exception.
1109a392 4179
5a929a98
VU
4180=item *
4181
246f24af 4182The C</> template character allows packing and unpacking of a sequence of
3b10bc60 4183items where the packed structure contains a packed item count followed by
4184the packed items themselves. This is useful when the structure you're
4185unpacking has encoded the sizes or repeat counts for some of its fields
4186within the structure itself as separate fields.
4187
4188For C<pack>, you write I<length-item>C</>I<sequence-item>, and the
4189I<length-item> describes how the length value is packed. Formats likely
4190to be of most use are integer-packing ones like C<n> for Java strings,
4191C<w> for ASN.1 or SNMP, and C<N> for Sun XDR.
4192
4193For C<pack>, I<sequence-item> may have a repeat count, in which case
4194the minimum of that and the number of available items is used as the argument
4195for I<length-item>. If it has no repeat count or uses a '*', the number
54f961c9
PD
4196of available items is used.
4197
3b10bc60 4198For C<unpack>, an internal stack of integer arguments unpacked so far is
54f961c9
PD
4199used. You write C</>I<sequence-item> and the repeat count is obtained by
4200popping off the last element from the stack. The I<sequence-item> must not
4201have a repeat count.
246f24af 4202
3b10bc60 4203If I<sequence-item> refers to a string type (C<"A">, C<"a">, or C<"Z">),
4204the I<length-item> is the string length, not the number of strings. With
4205an explicit repeat count for pack, the packed string is adjusted to that
4206length. For example:
246f24af 4207
4d0444a3
FC
4208 This code: gives this result:
4209
4210 unpack("W/a", "\004Gurusamy") ("Guru")
4211 unpack("a3/A A*", "007 Bond J ") (" Bond", "J")
4212 unpack("a3 x2 /A A*", "007: Bond, J.") ("Bond, J", ".")
3b10bc60 4213
4d0444a3
FC
4214 pack("n/a* w/a","hello,","world") "\000\006hello,\005world"
4215 pack("a/W2", ord("a") .. ord("z")) "2ab"
43192e07
IP
4216
4217The I<length-item> is not returned explicitly from C<unpack>.
4218
3b10bc60 4219Supplying a count to the I<length-item> format letter is only useful with
4220C<A>, C<a>, or C<Z>. Packing with a I<length-item> of C<a> or C<Z> may
4221introduce C<"\000"> characters, which Perl does not regard as legal in
4222numeric strings.
43192e07
IP
4223
4224=item *
4225
951ba7fe 4226The integer types C<s>, C<S>, C<l>, and C<L> may be
3b10bc60 4227followed by a C<!> modifier to specify native shorts or
4228longs. As shown in the example above, a bare C<l> means
4229exactly 32 bits, although the native C<long> as seen by the local C compiler
4230may be larger. This is mainly an issue on 64-bit platforms. You can
4231see whether using C<!> makes any difference this way:
4232
4233 printf "format s is %d, s! is %d\n",
4234 length pack("s"), length pack("s!");
726ea183 4235
3b10bc60 4236 printf "format l is %d, l! is %d\n",
4237 length pack("l"), length pack("l!");
ef54e1a4 4238
3b10bc60 4239
4240C<i!> and C<I!> are also allowed, but only for completeness' sake:
951ba7fe 4241they are identical to C<i> and C<I>.
ef54e1a4 4242
19799a22 4243The actual sizes (in bytes) of native shorts, ints, longs, and long
3b10bc60 4244longs on the platform where Perl was built are also available from
4245the command line:
4246
4247 $ perl -V:{short,int,long{,long}}size
4248 shortsize='2';
4249 intsize='4';
4250 longsize='4';
4251 longlongsize='8';
4252
4253or programmatically via the C<Config> module:
19799a22
GS
4254
4255 use Config;
4256 print $Config{shortsize}, "\n";
4257 print $Config{intsize}, "\n";
4258 print $Config{longsize}, "\n";
4259 print $Config{longlongsize}, "\n";
ef54e1a4 4260
3b10bc60 4261C<$Config{longlongsize}> is undefined on systems without
4262long long support.
851646ae 4263
ef54e1a4
JH
4264=item *
4265
3b10bc60 4266The integer formats C<s>, C<S>, C<i>, C<I>, C<l>, C<L>, C<j>, and C<J> are
4267inherently non-portable between processors and operating systems because
4268they obey native byteorder and endianness. For example, a 4-byte integer
42690x12345678 (305419896 decimal) would be ordered natively (arranged in and
4270handled by the CPU registers) into bytes as
61eff3bc 4271
5ed4f2ec 4272 0x12 0x34 0x56 0x78 # big-endian
4273 0x78 0x56 0x34 0x12 # little-endian
61eff3bc 4274
3b10bc60 4275Basically, Intel and VAX CPUs are little-endian, while everybody else,
4276including Motorola m68k/88k, PPC, Sparc, HP PA, Power, and Cray, are
8f1da26d
TC
4277big-endian. Alpha and MIPS can be either: Digital/Compaq uses (well, used)
4278them in little-endian mode, but SGI/Cray uses them in big-endian mode.
719a3cf5 4279
3b10bc60 4280The names I<big-endian> and I<little-endian> are comic references to the
4281egg-eating habits of the little-endian Lilliputians and the big-endian
4282Blefuscudians from the classic Jonathan Swift satire, I<Gulliver's Travels>.
4283This entered computer lingo via the paper "On Holy Wars and a Plea for
4284Peace" by Danny Cohen, USC/ISI IEN 137, April 1, 1980.
61eff3bc 4285
140cb37e 4286Some systems may have even weirder byte orders such as
61eff3bc 4287
5ed4f2ec 4288 0x56 0x78 0x12 0x34
4289 0x34 0x12 0x78 0x56
61eff3bc 4290
3b10bc60 4291You can determine your system endianness with this incantation:
ef54e1a4 4292
3b10bc60 4293 printf("%#02x ", $_) for unpack("W*", pack L=>0x12345678);
ef54e1a4 4294
d99ad34e 4295The byteorder on the platform where Perl was built is also available
726ea183 4296via L<Config>:
ef54e1a4 4297
5ed4f2ec 4298 use Config;
3b10bc60 4299 print "$Config{byteorder}\n";
4300
4301or from the command line:
ef54e1a4 4302
3b10bc60 4303 $ perl -V:byteorder
719a3cf5 4304
3b10bc60 4305Byteorders C<"1234"> and C<"12345678"> are little-endian; C<"4321">
4306and C<"87654321"> are big-endian.
4307
4308For portably packed integers, either use the formats C<n>, C<N>, C<v>,
4309and C<V> or else use the C<< > >> and C<< < >> modifiers described
4310immediately below. See also L<perlport>.
ef54e1a4
JH
4311
4312=item *
4313
3b10bc60 4314Starting with Perl 5.9.2, integer and floating-point formats, along with
4315the C<p> and C<P> formats and C<()> groups, may all be followed by the
4316C<< > >> or C<< < >> endianness modifiers to respectively enforce big-
4317or little-endian byte-order. These modifiers are especially useful
8f1da26d 4318given how C<n>, C<N>, C<v>, and C<V> don't cover signed integers,
3b10bc60 431964-bit integers, or floating-point values.
4320
bea6df1c 4321Here are some concerns to keep in mind when using an endianness modifier:
3b10bc60 4322
4323=over
4324
4325=item *
4326
4327Exchanging signed integers between different platforms works only
4328when all platforms store them in the same format. Most platforms store
4329signed integers in two's-complement notation, so usually this is not an issue.
1109a392 4330
3b10bc60 4331=item *
1109a392 4332
3b10bc60 4333The C<< > >> or C<< < >> modifiers can only be used on floating-point
1109a392 4334formats on big- or little-endian machines. Otherwise, attempting to
3b10bc60 4335use them raises an exception.
1109a392 4336
3b10bc60 4337=item *
4338
4339Forcing big- or little-endian byte-order on floating-point values for
4340data exchange can work only if all platforms use the same
4341binary representation such as IEEE floating-point. Even if all
4342platforms are using IEEE, there may still be subtle differences. Being able
4343to use C<< > >> or C<< < >> on floating-point values can be useful,
80d38338 4344but also dangerous if you don't know exactly what you're doing.
3b10bc60 4345It is not a general way to portably store floating-point values.
4346
4347=item *
1109a392 4348
3b10bc60 4349When using C<< > >> or C<< < >> on a C<()> group, this affects
4350all types inside the group that accept byte-order modifiers,
4351including all subgroups. It is silently ignored for all other
66c611c5
MHM
4352types. You are not allowed to override the byte-order within a group
4353that already has a byte-order modifier suffix.
4354
3b10bc60 4355=back
4356
1109a392
MHM
4357=item *
4358
3b10bc60 4359Real numbers (floats and doubles) are in native machine format only.
4360Due to the multiplicity of floating-point formats and the lack of a
4361standard "network" representation for them, no facility for interchange has been
4362made. This means that packed floating-point data written on one machine
4363may not be readable on another, even if both use IEEE floating-point
4364arithmetic (because the endianness of the memory representation is not part
851646ae 4365of the IEEE spec). See also L<perlport>.
5a929a98 4366
3b10bc60 4367If you know I<exactly> what you're doing, you can use the C<< > >> or C<< < >>
4368modifiers to force big- or little-endian byte-order on floating-point values.
1109a392 4369
3b10bc60 4370Because Perl uses doubles (or long doubles, if configured) internally for
4371all numeric calculation, converting from double into float and thence
4372to double again loses precision, so C<unpack("f", pack("f", $foo)>)
4373will not in general equal $foo.
5a929a98 4374
851646ae
JH
4375=item *
4376
3b10bc60 4377Pack and unpack can operate in two modes: character mode (C<C0> mode) where
4378the packed string is processed per character, and UTF-8 mode (C<U0> mode)
f337b084 4379where the packed string is processed in its UTF-8-encoded Unicode form on
3b10bc60 4380a byte-by-byte basis. Character mode is the default unless the format string
4381starts with C<U>. You can always switch mode mid-format with an explicit
4382C<C0> or C<U0> in the format. This mode remains in effect until the next
4383mode change, or until the end of the C<()> group it (directly) applies to.
036b4402 4384
8f1da26d
TC
4385Using C<C0> to get Unicode characters while using C<U0> to get I<non>-Unicode
4386bytes is not necessarily obvious. Probably only the first of these
4387is what you want:
4388
4389 $ perl -CS -E 'say "\x{3B1}\x{3C9}"' |
4390 perl -CS -ne 'printf "%v04X\n", $_ for unpack("C0A*", $_)'
4391 03B1.03C9
4392 $ perl -CS -E 'say "\x{3B1}\x{3C9}"' |
4393 perl -CS -ne 'printf "%v02X\n", $_ for unpack("U0A*", $_)'
4394 CE.B1.CF.89
4395 $ perl -CS -E 'say "\x{3B1}\x{3C9}"' |
4396 perl -C0 -ne 'printf "%v02X\n", $_ for unpack("C0A*", $_)'
4397 CE.B1.CF.89
4398 $ perl -CS -E 'say "\x{3B1}\x{3C9}"' |
4399 perl -C0 -ne 'printf "%v02X\n", $_ for unpack("U0A*", $_)'
4400 C3.8E.C2.B1.C3.8F.C2.89
4401
4402Those examples also illustrate that you should not try to use
4403C<pack>/C<unpack> as a substitute for the L<Encode> module.
4404
036b4402
GS
4405=item *
4406
3b10bc60 4407You must yourself do any alignment or padding by inserting, for example,
4408enough C<"x">es while packing. There is no way for pack() and unpack()
4409to know where characters are going to or coming from, so they
4410handle their output and input as flat sequences of characters.
851646ae 4411
17f4a12d
IZ
4412=item *
4413
3b10bc60 4414A C<()> group is a sub-TEMPLATE enclosed in parentheses. A group may
4415take a repeat count either as postfix, or for unpack(), also via the C</>
4416template character. Within each repetition of a group, positioning with
4417C<@> starts over at 0. Therefore, the result of
49704364 4418
3b10bc60 4419 pack("@1A((@2A)@3A)", qw[X Y Z])
49704364 4420
3b10bc60 4421is the string C<"\0X\0\0YZ">.
49704364 4422
18529408
IZ
4423=item *
4424
3b10bc60 4425C<x> and C<X> accept the C<!> modifier to act as alignment commands: they
4426jump forward or back to the closest position aligned at a multiple of C<count>
4427characters. For example, to pack() or unpack() a C structure like
666f95b9 4428
3b10bc60 4429 struct {
4430 char c; /* one signed, 8-bit character */
4431 double d;
4432 char cc[2];
4433 }
4434
4435one may need to use the template C<c x![d] d c[2]>. This assumes that
4436doubles must be aligned to the size of double.
4437
4438For alignment commands, a C<count> of 0 is equivalent to a C<count> of 1;
4439both are no-ops.
666f95b9 4440
62f95557
IZ
4441=item *
4442
3b10bc60 4443C<n>, C<N>, C<v> and C<V> accept the C<!> modifier to
4444represent signed 16-/32-bit integers in big-/little-endian order.
4445This is portable only when all platforms sharing packed data use the
4446same binary representation for signed integers; for example, when all
4447platforms use two's-complement representation.
068bd2e7
MHM
4448
4449=item *
4450
3b10bc60 4451Comments can be embedded in a TEMPLATE using C<#> through the end of line.
4452White space can separate pack codes from each other, but modifiers and
4453repeat counts must follow immediately. Breaking complex templates into
4454individual line-by-line components, suitably annotated, can do as much to
4455improve legibility and maintainability of pack/unpack formats as C</x> can
4456for complicated pattern matches.
17f4a12d 4457
2b6c5635
GS
4458=item *
4459
bea6df1c 4460If TEMPLATE requires more arguments than pack() is given, pack()
cf264981 4461assumes additional C<""> arguments. If TEMPLATE requires fewer arguments
3b10bc60 4462than given, extra arguments are ignored.
2b6c5635 4463
5a929a98 4464=back
a0d0e21e
LW
4465
4466Examples:
4467
f337b084 4468 $foo = pack("WWWW",65,66,67,68);
a0d0e21e 4469 # foo eq "ABCD"
f337b084 4470 $foo = pack("W4",65,66,67,68);
a0d0e21e 4471 # same thing
f337b084
TH
4472 $foo = pack("W4",0x24b6,0x24b7,0x24b8,0x24b9);
4473 # same thing with Unicode circled letters.
a0ed51b3 4474 $foo = pack("U4",0x24b6,0x24b7,0x24b8,0x24b9);
4d0444a3
FC
4475 # same thing with Unicode circled letters. You don't get the
4476 # UTF-8 bytes because the U at the start of the format caused
4477 # a switch to U0-mode, so the UTF-8 bytes get joined into
4478 # characters
f337b084
TH
4479 $foo = pack("C0U4",0x24b6,0x24b7,0x24b8,0x24b9);
4480 # foo eq "\xe2\x92\xb6\xe2\x92\xb7\xe2\x92\xb8\xe2\x92\xb9"
4d0444a3
FC
4481 # This is the UTF-8 encoding of the string in the
4482 # previous example
a0d0e21e
LW
4483
4484 $foo = pack("ccxxcc",65,66,67,68);
4485 # foo eq "AB\0\0CD"
4486
3b10bc60 4487 # NOTE: The examples above featuring "W" and "c" are true
9ccd05c0 4488 # only on ASCII and ASCII-derived systems such as ISO Latin 1
3b10bc60 4489 # and UTF-8. On EBCDIC systems, the first example would be
4490 # $foo = pack("WWWW",193,194,195,196);
9ccd05c0 4491
a0d0e21e 4492 $foo = pack("s2",1,2);
ce7b6f06
KW
4493 # "\001\000\002\000" on little-endian
4494 # "\000\001\000\002" on big-endian
a0d0e21e
LW
4495
4496 $foo = pack("a4","abcd","x","y","z");
4497 # "abcd"
4498
4499 $foo = pack("aaaa","abcd","x","y","z");
4500 # "axyz"
4501
4502 $foo = pack("a14","abcdefg");
4503 # "abcdefg\0\0\0\0\0\0\0"
4504
4505 $foo = pack("i9pl", gmtime);
4506 # a real struct tm (on my system anyway)
4507
5a929a98
VU
4508 $utmp_template = "Z8 Z8 Z16 L";
4509 $utmp = pack($utmp_template, @utmp1);
4510 # a struct utmp (BSDish)
4511
4512 @utmp2 = unpack($utmp_template, $utmp);
4513 # "@utmp1" eq "@utmp2"
4514
a0d0e21e 4515 sub bintodec {
a9a5a0dc 4516 unpack("N", pack("B32", substr("0" x 32 . shift, -32)));
a0d0e21e
LW
4517 }
4518
851646ae
JH
4519 $foo = pack('sx2l', 12, 34);
4520 # short 12, two zero bytes padding, long 34
4521 $bar = pack('s@4l', 12, 34);
4522 # short 12, zero fill to position 4, long 34
4523 # $foo eq $bar
28be1210
TH
4524 $baz = pack('s.l', 12, 4, 34);
4525 # short 12, zero fill to position 4, long 34
851646ae 4526
1109a392
MHM
4527 $foo = pack('nN', 42, 4711);
4528 # pack big-endian 16- and 32-bit unsigned integers
4529 $foo = pack('S>L>', 42, 4711);
4530 # exactly the same
4531 $foo = pack('s<l<', -42, 4711);
4532 # pack little-endian 16- and 32-bit signed integers
66c611c5
MHM
4533 $foo = pack('(sl)<', -42, 4711);
4534 # exactly the same
1109a392 4535
5a929a98 4536The same template may generally also be used in unpack().
a0d0e21e 4537
8f1da26d
TC
4538=item package NAMESPACE
4539
6fa4d285
DG
4540=item package NAMESPACE VERSION
4541X<package> X<module> X<namespace> X<version>
4542
8f1da26d 4543=item package NAMESPACE BLOCK
cb1a09d0 4544
4e4da3ac
Z
4545=item package NAMESPACE VERSION BLOCK
4546X<package> X<module> X<namespace> X<version>
4547
8f1da26d
TC
4548Declares the BLOCK or the rest of the compilation unit as being in the
4549given namespace. The scope of the package declaration is either the
4e4da3ac 4550supplied code BLOCK or, in the absence of a BLOCK, from the declaration
8f1da26d
TC
4551itself through the end of current scope (the enclosing block, file, or
4552C<eval>). That is, the forms without a BLOCK are operative through the end
4553of the current scope, just like the C<my>, C<state>, and C<our> operators.
4554All unqualified dynamic identifiers in this scope will be in the given
4555namespace, except where overridden by another C<package> declaration or
4556when they're one of the special identifiers that qualify into C<main::>,
4557like C<STDOUT>, C<ARGV>, C<ENV>, and the punctuation variables.
4e4da3ac 4558
3b10bc60 4559A package statement affects dynamic variables only, including those
4560you've used C<local> on, but I<not> lexical variables, which are created
8f1da26d 4561with C<my>, C<state>, or C<our>. Typically it would be the first
3b10bc60 4562declaration in a file included by C<require> or C<use>. You can switch into a
4563package in more than one place, since this only determines which default
4564symbol table the compiler uses for the rest of that block. You can refer to
4565identifiers in other packages than the current one by prefixing the identifier
4566with the package name and a double colon, as in C<$SomePack::var>
4567or C<ThatPack::INPUT_HANDLE>. If package name is omitted, the C<main>
4568package as assumed. That is, C<$::sail> is equivalent to
4569C<$main::sail> (as well as to C<$main'sail>, still seen in ancient
4570code, mostly from Perl 4).
4571
bd12309b 4572If VERSION is provided, C<package> sets the C<$VERSION> variable in the given
a2bff36e
DG
4573namespace to a L<version> object with the VERSION provided. VERSION must be a
4574"strict" style version number as defined by the L<version> module: a positive
4575decimal number (integer or decimal-fraction) without exponentiation or else a
4576dotted-decimal v-string with a leading 'v' character and at least three
4577components. You should set C<$VERSION> only once per package.
6fa4d285 4578
cb1a09d0
AD
4579See L<perlmod/"Packages"> for more information about packages, modules,
4580and classes. See L<perlsub> for other scoping issues.
4581
a0d0e21e 4582=item pipe READHANDLE,WRITEHANDLE
d74e8afc 4583X<pipe>
a0d0e21e
LW
4584
4585Opens a pair of connected pipes like the corresponding system call.
4586Note that if you set up a loop of piped processes, deadlock can occur
4587unless you are very careful. In addition, note that Perl's pipes use
9124316e 4588IO buffering, so you may need to set C<$|> to flush your WRITEHANDLE
a0d0e21e
LW
4589after each command, depending on the application.
4590
96090e4f
LB
4591See L<IPC::Open2>, L<IPC::Open3>, and
4592L<perlipc/"Bidirectional Communication with Another Process">
4633a7c4
LW
4593for examples of such things.
4594
3b10bc60 4595On systems that support a close-on-exec flag on files, that flag is set
4596on all newly opened file descriptors whose C<fileno>s are I<higher> than
4597the current value of $^F (by default 2 for C<STDERR>). See L<perlvar/$^F>.
4771b018 4598
cfa52385
FC
4599=item __PACKAGE__
4600X<__PACKAGE__>
4601
4602A special token that returns the name of the package in which it occurs.
4603
532eee96 4604=item pop ARRAY
d74e8afc 4605X<pop> X<stack>
a0d0e21e 4606
f5a93a43
TC
4607=item pop EXPR
4608
54310121 4609=item pop
28757baa 4610
a0d0e21e 4611Pops and returns the last value of the array, shortening the array by
cd7f9af7 4612one element.
a0d0e21e 4613
3b10bc60 4614Returns the undefined value if the array is empty, although this may also
4615happen at other times. If ARRAY is omitted, pops the C<@ARGV> array in the
4616main program, but the C<@_> array in subroutines, just like C<shift>.
a0d0e21e 4617
f5a93a43
TC
4618Starting with Perl 5.14, C<pop> can take a scalar EXPR, which must hold a
4619reference to an unblessed array. The argument will be dereferenced
4620automatically. This aspect of C<pop> is considered highly experimental.
4621The exact behaviour may change in a future version of Perl.
cba5a3b0 4622
a0d0e21e 4623=item pos SCALAR
d74e8afc 4624X<pos> X<match, position>
a0d0e21e 4625
54310121 4626=item pos
bbce6d69 4627
7664c618 4628Returns the offset of where the last C<m//g> search left off for the
4629variable in question (C<$_> is used when the variable is not
4630specified). Note that 0 is a valid match offset. C<undef> indicates
4631that the search position is reset (usually due to match failure, but
4632can also be because no match has yet been run on the scalar).
4633
4634C<pos> directly accesses the location used by the regexp engine to
4635store the offset, so assigning to C<pos> will change that offset, and
4636so will also influence the C<\G> zero-width assertion in regular
4637expressions. Both of these effects take place for the next match, so
4638you can't affect the position with C<pos> during the current match,
4639such as in C<(?{pos() = 5})> or C<s//pos() = 5/e>.
4640
f9179917
FC
4641Setting C<pos> also resets the I<matched with zero-length> flag, described
4642under L<perlre/"Repeated Patterns Matching a Zero-length Substring">.
4643
7664c618 4644Because a failed C<m//gc> match doesn't reset the offset, the return
4645from C<pos> won't change either in this case. See L<perlre> and
44a8e56a 4646L<perlop>.
a0d0e21e
LW
4647
4648=item print FILEHANDLE LIST
d74e8afc 4649X<print>
a0d0e21e 4650
dee33c94
TC
4651=item print FILEHANDLE
4652
a0d0e21e
LW
4653=item print LIST
4654
4655=item print
4656
19799a22 4657Prints a string or a list of strings. Returns true if successful.
dee33c94
TC
4658FILEHANDLE may be a scalar variable containing the name of or a reference
4659to the filehandle, thus introducing one level of indirection. (NOTE: If
4660FILEHANDLE is a variable and the next token is a term, it may be
4661misinterpreted as an operator unless you interpose a C<+> or put
8f1da26d
TC
4662parentheses around the arguments.) If FILEHANDLE is omitted, prints to the
4663last selected (see L</select>) output handle. If LIST is omitted, prints
4664C<$_> to the currently selected output handle. To use FILEHANDLE alone to
4665print the content of C<$_> to it, you must use a real filehandle like
4666C<FH>, not an indirect one like C<$fh>. To set the default output handle
4667to something other than STDOUT, use the select operation.
4668
4669The current value of C<$,> (if any) is printed between each LIST item. The
4670current value of C<$\> (if any) is printed after the entire LIST has been
4671printed. Because print takes a LIST, anything in the LIST is evaluated in
4672list context, including any subroutines whose return lists you pass to
4673C<print>. Be careful not to follow the print keyword with a left
4674parenthesis unless you want the corresponding right parenthesis to
4675terminate the arguments to the print; put parentheses around all arguments
4676(or interpose a C<+>, but that doesn't look as good).
4677
4678If you're storing handles in an array or hash, or in general whenever
4679you're using any expression more complex than a bareword handle or a plain,
4680unsubscripted scalar variable to retrieve it, you will have to use a block
4681returning the filehandle value instead, in which case the LIST may not be
4682omitted:
4633a7c4
LW
4683
4684 print { $files[$i] } "stuff\n";
4685 print { $OK ? STDOUT : STDERR } "stuff\n";
4686
785fd561
DG
4687Printing to a closed pipe or socket will generate a SIGPIPE signal. See
4688L<perlipc> for more on signal handling.
4689
5f05dabc 4690=item printf FILEHANDLE FORMAT, LIST
d74e8afc 4691X<printf>
a0d0e21e 4692
dee33c94
TC
4693=item printf FILEHANDLE
4694
5f05dabc 4695=item printf FORMAT, LIST
a0d0e21e 4696
dee33c94
TC
4697=item printf
4698
7660c0ab 4699Equivalent to C<print FILEHANDLE sprintf(FORMAT, LIST)>, except that C<$\>
dee33c94 4700(the output record separator) is not appended. The first argument of the
01aa884e
KW
4701list will be interpreted as the C<printf> format. See
4702L<sprintf|/sprintf FORMAT, LIST> for an
dee33c94
TC
4703explanation of the format argument. If you omit the LIST, C<$_> is used;
4704to use FILEHANDLE without a LIST, you must use a real filehandle like
4705C<FH>, not an indirect one like C<$fh>. If C<use locale> is in effect and
4706POSIX::setlocale() has been called, the character used for the decimal
3b10bc60 4707separator in formatted floating-point numbers is affected by the LC_NUMERIC
dee33c94 4708locale setting. See L<perllocale> and L<POSIX>.
a0d0e21e 4709
19799a22
GS
4710Don't fall into the trap of using a C<printf> when a simple
4711C<print> would do. The C<print> is more efficient and less
28757baa 4712error prone.
4713
da0045b7 4714=item prototype FUNCTION
d74e8afc 4715X<prototype>
da0045b7 4716
4717Returns the prototype of a function as a string (or C<undef> if the
5f05dabc 4718function has no prototype). FUNCTION is a reference to, or the name of,
4719the function whose prototype you want to retrieve.
da0045b7 4720
2b5ab1e7 4721If FUNCTION is a string starting with C<CORE::>, the rest is taken as a
e1020413 4722name for a Perl builtin. If the builtin is not I<overridable> (such as
0a2ca743
RGS
4723C<qw//>) or if its arguments cannot be adequately expressed by a prototype
4724(such as C<system>), prototype() returns C<undef>, because the builtin
4725does not really behave like a Perl function. Otherwise, the string
4726describing the equivalent prototype is returned.
b6c543e3 4727
532eee96 4728=item push ARRAY,LIST
1dc8ecb8 4729X<push> X<stack>
a0d0e21e 4730
f5a93a43
TC
4731=item push EXPR,LIST
4732
8f1da26d
TC
4733Treats ARRAY as a stack by appending the values of LIST to the end of
4734ARRAY. The length of ARRAY increases by the length of LIST. Has the same
4735effect as
a0d0e21e
LW
4736
4737 for $value (LIST) {
a9a5a0dc 4738 $ARRAY[++$#ARRAY] = $value;
a0d0e21e
LW
4739 }
4740
cde9c211
SP
4741but is more efficient. Returns the number of elements in the array following
4742the completed C<push>.
a0d0e21e 4743
f5a93a43
TC
4744Starting with Perl 5.14, C<push> can take a scalar EXPR, which must hold a
4745reference to an unblessed array. The argument will be dereferenced
4746automatically. This aspect of C<push> is considered highly experimental.
4747The exact behaviour may change in a future version of Perl.
cba5a3b0 4748
a0d0e21e
LW
4749=item q/STRING/
4750
4751=item qq/STRING/
4752
945c54fd 4753=item qx/STRING/
a0d0e21e
LW
4754
4755=item qw/STRING/
4756
1d888ee3
MK
4757Generalized quotes. See L<perlop/"Quote-Like Operators">.
4758
4759=item qr/STRING/
4760
4761Regexp-like quote. See L<perlop/"Regexp Quote-Like Operators">.
a0d0e21e
LW
4762
4763=item quotemeta EXPR
d74e8afc 4764X<quotemeta> X<metacharacter>
a0d0e21e 4765
54310121 4766=item quotemeta
bbce6d69 4767
36bbe248 4768Returns the value of EXPR with all non-"word"
a034a98d
DD
4769characters backslashed. (That is, all characters not matching
4770C</[A-Za-z_0-9]/> will be preceded by a backslash in the
4771returned string, regardless of any locale settings.)
4772This is the internal function implementing
7660c0ab 4773the C<\Q> escape in double-quoted strings.
a0d0e21e 4774
7660c0ab 4775If EXPR is omitted, uses C<$_>.
bbce6d69 4776
9702b155
RGS
4777quotemeta (and C<\Q> ... C<\E>) are useful when interpolating strings into
4778regular expressions, because by default an interpolated variable will be
4779considered a mini-regular expression. For example:
4780
4781 my $sentence = 'The quick brown fox jumped over the lazy dog';
4782 my $substring = 'quick.*?fox';
4783 $sentence =~ s{$substring}{big bad wolf};
4784
4785Will cause C<$sentence> to become C<'The big bad wolf jumped over...'>.
4786
4787On the other hand:
4788
4789 my $sentence = 'The quick brown fox jumped over the lazy dog';
4790 my $substring = 'quick.*?fox';
4791 $sentence =~ s{\Q$substring\E}{big bad wolf};
4792
4793Or:
4794
4795 my $sentence = 'The quick brown fox jumped over the lazy dog';
4796 my $substring = 'quick.*?fox';
4797 my $quoted_substring = quotemeta($substring);
4798 $sentence =~ s{$quoted_substring}{big bad wolf};
4799
8f1da26d
TC
4800Will both leave the sentence as is. Normally, when accepting literal string
4801input from the user, quotemeta() or C<\Q> must be used.
9702b155 4802
b29c72cb 4803In Perl 5.14, all characters whose code points are above 127 are not
8f1da26d 4804quoted in UTF8-encoded strings, but all are quoted in UTF-8 strings.
b29c72cb
KW
4805It is planned to change this behavior in 5.16, but the exact rules
4806haven't been determined yet.
4807
a0d0e21e 4808=item rand EXPR
d74e8afc 4809X<rand> X<random>
a0d0e21e
LW
4810
4811=item rand
4812
7660c0ab 4813Returns a random fractional number greater than or equal to C<0> and less
3e3baf6d 4814than the value of EXPR. (EXPR should be positive.) If EXPR is
351f3254 4815omitted, the value C<1> is used. Currently EXPR with the value C<0> is
3b10bc60 4816also special-cased as C<1> (this was undocumented before Perl 5.8.0
4817and is subject to change in future versions of Perl). Automatically calls
351f3254 4818C<srand> unless C<srand> has already been called. See also C<srand>.
a0d0e21e 4819
6063ba18
WM
4820Apply C<int()> to the value returned by C<rand()> if you want random
4821integers instead of random fractional numbers. For example,
4822
4823 int(rand(10))
4824
4825returns a random integer between C<0> and C<9>, inclusive.
4826
2f9daede 4827(Note: If your rand function consistently returns numbers that are too
a0d0e21e 4828large or too small, then your version of Perl was probably compiled
2f9daede 4829with the wrong number of RANDBITS.)
a0d0e21e 4830
9700c45b
JV
4831B<C<rand()> is not cryptographically secure. You should not rely
4832on it in security-sensitive situations.> As of this writing, a
4833number of third-party CPAN modules offer random number generators
4834intended by their authors to be cryptographically secure,
4835including: L<Math::Random::Secure>, L<Math::Random::MT::Perl>, and
4836L<Math::TrulyRandom>.
4837
a0d0e21e 4838=item read FILEHANDLE,SCALAR,LENGTH,OFFSET
f723aae1 4839X<read> X<file, read>
a0d0e21e
LW
4840
4841=item read FILEHANDLE,SCALAR,LENGTH
4842
9124316e
JH
4843Attempts to read LENGTH I<characters> of data into variable SCALAR
4844from the specified FILEHANDLE. Returns the number of characters
b5fe5ca2 4845actually read, C<0> at end of file, or undef if there was an error (in
b49f3be6
SG
4846the latter case C<$!> is also set). SCALAR will be grown or shrunk
4847so that the last character actually read is the last character of the
4848scalar after the read.
4849
4850An OFFSET may be specified to place the read data at some place in the
4851string other than the beginning. A negative OFFSET specifies
4852placement at that many characters counting backwards from the end of
4853the string. A positive OFFSET greater than the length of SCALAR
4854results in the string being padded to the required size with C<"\0">
4855bytes before the result of the read is appended.
4856
80d38338 4857The call is implemented in terms of either Perl's or your system's native
01aa884e
KW
4858fread(3) library function. To get a true read(2) system call, see
4859L<sysread|/sysread FILEHANDLE,SCALAR,LENGTH,OFFSET>.
9124316e
JH
4860
4861Note the I<characters>: depending on the status of the filehandle,
8f1da26d 4862either (8-bit) bytes or characters are read. By default, all
9124316e 4863filehandles operate on bytes, but for example if the filehandle has
fae2c0fb 4864been opened with the C<:utf8> I/O layer (see L</open>, and the C<open>
8f1da26d 4865pragma, L<open>), the I/O will operate on UTF8-encoded Unicode
1d714267
JH
4866characters, not bytes. Similarly for the C<:encoding> pragma:
4867in that case pretty much any characters can be read.
a0d0e21e
LW
4868
4869=item readdir DIRHANDLE
d74e8afc 4870X<readdir>
a0d0e21e 4871
19799a22 4872Returns the next directory entry for a directory opened by C<opendir>.
5a964f20 4873If used in list context, returns all the rest of the entries in the
3b10bc60 4874directory. If there are no more entries, returns the undefined value in
4875scalar context and the empty list in list context.
a0d0e21e 4876
19799a22 4877If you're planning to filetest the return values out of a C<readdir>, you'd
5f05dabc 4878better prepend the directory in question. Otherwise, because we didn't
19799a22 4879C<chdir> there, it would have been testing the wrong file.
cb1a09d0 4880
b0169937
GS
4881 opendir(my $dh, $some_dir) || die "can't opendir $some_dir: $!";
4882 @dots = grep { /^\./ && -f "$some_dir/$_" } readdir($dh);
4883 closedir $dh;
cb1a09d0 4884
114c60ec
BG
4885As of Perl 5.11.2 you can use a bare C<readdir> in a C<while> loop,
4886which will set C<$_> on every iteration.
4887
4888 opendir(my $dh, $some_dir) || die;
4889 while(readdir $dh) {
4890 print "$some_dir/$_\n";
4891 }
4892 closedir $dh;
4893
84902520 4894=item readline EXPR
e4b7ebf3
RGS
4895
4896=item readline
d74e8afc 4897X<readline> X<gets> X<fgets>
84902520 4898
e4b7ebf3 4899Reads from the filehandle whose typeglob is contained in EXPR (or from
8f1da26d 4900C<*ARGV> if EXPR is not provided). In scalar context, each call reads and
80d38338 4901returns the next line until end-of-file is reached, whereupon the
0f03d336 4902subsequent call returns C<undef>. In list context, reads until end-of-file
e4b7ebf3 4903is reached and returns a list of lines. Note that the notion of "line"
80d38338 4904used here is whatever you may have defined with C<$/> or
e4b7ebf3 4905C<$INPUT_RECORD_SEPARATOR>). See L<perlvar/"$/">.
fbad3eb5 4906
0f03d336 4907When C<$/> is set to C<undef>, when C<readline> is in scalar
80d38338 4908context (i.e., file slurp mode), and when an empty file is read, it
449bc448 4909returns C<''> the first time, followed by C<undef> subsequently.
fbad3eb5 4910
61eff3bc
JH
4911This is the internal function implementing the C<< <EXPR> >>
4912operator, but you can use it directly. The C<< <EXPR> >>
84902520
TB
4913operator is discussed in more detail in L<perlop/"I/O Operators">.
4914
5a964f20 4915 $line = <STDIN>;
5ed4f2ec 4916 $line = readline(*STDIN); # same thing
5a964f20 4917
0f03d336 4918If C<readline> encounters an operating system error, C<$!> will be set
4919with the corresponding error message. It can be helpful to check
4920C<$!> when you are reading from filehandles you don't trust, such as a
4921tty or a socket. The following example uses the operator form of
4922C<readline> and dies if the result is not defined.
4923
5ed4f2ec 4924 while ( ! eof($fh) ) {
4925 defined( $_ = <$fh> ) or die "readline failed: $!";
4926 ...
4927 }
0f03d336 4928
4929Note that you have can't handle C<readline> errors that way with the
4930C<ARGV> filehandle. In that case, you have to open each element of
4931C<@ARGV> yourself since C<eof> handles C<ARGV> differently.
4932
4933 foreach my $arg (@ARGV) {
4934 open(my $fh, $arg) or warn "Can't open $arg: $!";
4935
4936 while ( ! eof($fh) ) {
4937 defined( $_ = <$fh> )
4938 or die "readline failed for $arg: $!";
4939 ...
00cb5da1 4940 }
00cb5da1 4941 }
e00e4ce9 4942
a0d0e21e 4943=item readlink EXPR
d74e8afc 4944X<readlink>
a0d0e21e 4945
54310121 4946=item readlink
bbce6d69 4947
a0d0e21e 4948Returns the value of a symbolic link, if symbolic links are
3b10bc60 4949implemented. If not, raises an exception. If there is a system
184e9718 4950error, returns the undefined value and sets C<$!> (errno). If EXPR is
7660c0ab 4951omitted, uses C<$_>.
a0d0e21e 4952
ea9eb35a
BJ
4953Portability issues: L<perlport/readlink>.
4954
84902520 4955=item readpipe EXPR
8d7403e6
RGS
4956
4957=item readpipe
d74e8afc 4958X<readpipe>
84902520 4959
5a964f20 4960EXPR is executed as a system command.
84902520
TB
4961The collected standard output of the command is returned.
4962In scalar context, it comes back as a single (potentially
4963multi-line) string. In list context, returns a list of lines
7660c0ab 4964(however you've defined lines with C<$/> or C<$INPUT_RECORD_SEPARATOR>).
84902520
TB
4965This is the internal function implementing the C<qx/EXPR/>
4966operator, but you can use it directly. The C<qx/EXPR/>
4967operator is discussed in more detail in L<perlop/"I/O Operators">.
8d7403e6 4968If EXPR is omitted, uses C<$_>.
84902520 4969
399388f4 4970=item recv SOCKET,SCALAR,LENGTH,FLAGS
d74e8afc 4971X<recv>
a0d0e21e 4972
9124316e
JH
4973Receives a message on a socket. Attempts to receive LENGTH characters
4974of data into variable SCALAR from the specified SOCKET filehandle.
4975SCALAR will be grown or shrunk to the length actually read. Takes the
4976same flags as the system call of the same name. Returns the address
4977of the sender if SOCKET's protocol supports this; returns an empty
4978string otherwise. If there's an error, returns the undefined value.
4979This call is actually implemented in terms of recvfrom(2) system call.
4980See L<perlipc/"UDP: Message Passing"> for examples.
4981
4982Note the I<characters>: depending on the status of the socket, either
4983(8-bit) bytes or characters are received. By default all sockets
4984operate on bytes, but for example if the socket has been changed using
740d4bb2 4985binmode() to operate with the C<:encoding(utf8)> I/O layer (see the
8f1da26d 4986C<open> pragma, L<open>), the I/O will operate on UTF8-encoded Unicode
740d4bb2
JW
4987characters, not bytes. Similarly for the C<:encoding> pragma: in that
4988case pretty much any characters can be read.
a0d0e21e
LW
4989
4990=item redo LABEL
d74e8afc 4991X<redo>
a0d0e21e
LW
4992
4993=item redo
4994
4995The C<redo> command restarts the loop block without evaluating the
98293880 4996conditional again. The C<continue> block, if any, is not executed. If
a0d0e21e 4997the LABEL is omitted, the command refers to the innermost enclosing
cf264981
SP
4998loop. Programs that want to lie to themselves about what was just input
4999normally use this command:
a0d0e21e
LW
5000
5001 # a simpleminded Pascal comment stripper
5002 # (warning: assumes no { or } in strings)
4633a7c4 5003 LINE: while (<STDIN>) {
a9a5a0dc
VP
5004 while (s|({.*}.*){.*}|$1 |) {}
5005 s|{.*}| |;
5006 if (s|{.*| |) {
5007 $front = $_;
5008 while (<STDIN>) {
5009 if (/}/) { # end of comment?
5010 s|^|$front\{|;
5011 redo LINE;
5012 }
5013 }
5ed4f2ec 5014 }
a9a5a0dc 5015 print;
a0d0e21e
LW
5016 }
5017
80d38338 5018C<redo> cannot be used to retry a block that returns a value such as
8f1da26d 5019C<eval {}>, C<sub {}>, or C<do {}>, and should not be used to exit
2b5ab1e7 5020a grep() or map() operation.
4968c1e4 5021
6c1372ed
GS
5022Note that a block by itself is semantically identical to a loop
5023that executes once. Thus C<redo> inside such a block will effectively
5024turn it into a looping construct.
5025
98293880 5026See also L</continue> for an illustration of how C<last>, C<next>, and
1d2dff63
GS
5027C<redo> work.
5028
a0d0e21e 5029=item ref EXPR
d74e8afc 5030X<ref> X<reference>
a0d0e21e 5031
54310121 5032=item ref
bbce6d69 5033
8a2e0804
A
5034Returns a non-empty string if EXPR is a reference, the empty
5035string otherwise. If EXPR
7660c0ab 5036is not specified, C<$_> will be used. The value returned depends on the
bbce6d69 5037type of thing the reference is a reference to.
a0d0e21e
LW
5038Builtin types include:
5039
a0d0e21e
LW
5040 SCALAR
5041 ARRAY
5042 HASH
5043 CODE
19799a22 5044 REF
a0d0e21e 5045 GLOB
19799a22 5046 LVALUE
cc10766d
RGS
5047 FORMAT
5048 IO
5049 VSTRING
5050 Regexp
a0d0e21e 5051
54310121 5052If the referenced object has been blessed into a package, then that package
19799a22 5053name is returned instead. You can think of C<ref> as a C<typeof> operator.
a0d0e21e
LW
5054
5055 if (ref($r) eq "HASH") {
a9a5a0dc 5056 print "r is a reference to a hash.\n";
54310121 5057 }
2b5ab1e7 5058 unless (ref($r)) {
a9a5a0dc 5059 print "r is not a reference at all.\n";
54310121 5060 }
a0d0e21e 5061
85dd5c8b
WL
5062The return value C<LVALUE> indicates a reference to an lvalue that is not
5063a variable. You get this from taking the reference of function calls like
5064C<pos()> or C<substr()>. C<VSTRING> is returned if the reference points
603c58be 5065to a L<version string|perldata/"Version Strings">.
85dd5c8b
WL
5066
5067The result C<Regexp> indicates that the argument is a regular expression
5068resulting from C<qr//>.
5069
a0d0e21e
LW
5070See also L<perlref>.
5071
5072=item rename OLDNAME,NEWNAME
d74e8afc 5073X<rename> X<move> X<mv> X<ren>
a0d0e21e 5074
19799a22
GS
5075Changes the name of a file; an existing file NEWNAME will be
5076clobbered. Returns true for success, false otherwise.
5077
2b5ab1e7
TC
5078Behavior of this function varies wildly depending on your system
5079implementation. For example, it will usually not work across file system
5080boundaries, even though the system I<mv> command sometimes compensates
5081for this. Other restrictions include whether it works on directories,
5082open files, or pre-existing files. Check L<perlport> and either the
5083rename(2) manpage or equivalent system documentation for details.
a0d0e21e 5084
dd184578
RGS
5085For a platform independent C<move> function look at the L<File::Copy>
5086module.
5087
ea9eb35a
BJ
5088Portability issues: L<perlport/rename>.
5089
16070b82 5090=item require VERSION
d74e8afc 5091X<require>
16070b82 5092
a0d0e21e
LW
5093=item require EXPR
5094
5095=item require
5096
3b825e41
RK
5097Demands a version of Perl specified by VERSION, or demands some semantics
5098specified by EXPR or by C<$_> if EXPR is not supplied.
44dcb63b 5099
3b825e41
RK
5100VERSION may be either a numeric argument such as 5.006, which will be
5101compared to C<$]>, or a literal of the form v5.6.1, which will be compared
3b10bc60 5102to C<$^V> (aka $PERL_VERSION). An exception is raised if
3b825e41
RK
5103VERSION is greater than the version of the current Perl interpreter.
5104Compare with L</use>, which can do a similar check at compile time.
5105
5106Specifying VERSION as a literal of the form v5.6.1 should generally be
5107avoided, because it leads to misleading error messages under earlier
cf264981 5108versions of Perl that do not support this syntax. The equivalent numeric
3b825e41 5109version should be used instead.
44dcb63b 5110
5ed4f2ec 5111 require v5.6.1; # run time version check
5112 require 5.6.1; # ditto
5113 require 5.006_001; # ditto; preferred for backwards compatibility
a0d0e21e 5114
362eead3
RGS
5115Otherwise, C<require> demands that a library file be included if it
5116hasn't already been included. The file is included via the do-FILE
73c71df6
CW
5117mechanism, which is essentially just a variety of C<eval> with the
5118caveat that lexical variables in the invoking script will be invisible
5119to the included code. Has semantics similar to the following subroutine:
a0d0e21e
LW
5120
5121 sub require {
20907158
AMS
5122 my ($filename) = @_;
5123 if (exists $INC{$filename}) {
5124 return 1 if $INC{$filename};
5125 die "Compilation failed in require";
5126 }
5127 my ($realfilename,$result);
5128 ITER: {
5129 foreach $prefix (@INC) {
5130 $realfilename = "$prefix/$filename";
5131 if (-f $realfilename) {
5132 $INC{$filename} = $realfilename;
5133 $result = do $realfilename;
5134 last ITER;
5135 }
5136 }
5137 die "Can't find $filename in \@INC";
5138 }
5139 if ($@) {
5140 $INC{$filename} = undef;
5141 die $@;
5142 } elsif (!$result) {
5143 delete $INC{$filename};
5144 die "$filename did not return true value";
5145 } else {
5146 return $result;
5147 }
a0d0e21e
LW
5148 }
5149
5150Note that the file will not be included twice under the same specified
a12755f0
SB
5151name.
5152
5153The file must return true as the last statement to indicate
a0d0e21e 5154successful execution of any initialization code, so it's customary to
19799a22
GS
5155end such a file with C<1;> unless you're sure it'll return true
5156otherwise. But it's better just to put the C<1;>, in case you add more
a0d0e21e
LW
5157statements.
5158
54310121 5159If EXPR is a bareword, the require assumes a "F<.pm>" extension and
da0045b7 5160replaces "F<::>" with "F</>" in the filename for you,
54310121 5161to make it easy to load standard modules. This form of loading of
a0d0e21e
LW
5162modules does not risk altering your namespace.
5163
ee580363
GS
5164In other words, if you try this:
5165
5ed4f2ec 5166 require Foo::Bar; # a splendid bareword
ee580363 5167
b76cc8ba 5168The require function will actually look for the "F<Foo/Bar.pm>" file in the
7660c0ab 5169directories specified in the C<@INC> array.
ee580363 5170
5a964f20 5171But if you try this:
ee580363
GS
5172
5173 $class = 'Foo::Bar';
5ed4f2ec 5174 require $class; # $class is not a bareword
5a964f20 5175 #or
5ed4f2ec 5176 require "Foo::Bar"; # not a bareword because of the ""
ee580363 5177
b76cc8ba 5178The require function will look for the "F<Foo::Bar>" file in the @INC array and
19799a22 5179will complain about not finding "F<Foo::Bar>" there. In this case you can do:
ee580363
GS
5180
5181 eval "require $class";
5182
3b10bc60 5183Now that you understand how C<require> looks for files with a
a91233bf
RGS
5184bareword argument, there is a little extra functionality going on behind
5185the scenes. Before C<require> looks for a "F<.pm>" extension, it will
5186first look for a similar filename with a "F<.pmc>" extension. If this file
5187is found, it will be loaded in place of any file ending in a "F<.pm>"
5188extension.
662cc546 5189
8f1da26d 5190You can also insert hooks into the import facility by putting Perl code
1c3d5054 5191directly into the @INC array. There are three forms of hooks: subroutine
8f1da26d 5192references, array references, and blessed objects.
d54b56d5
RGS
5193
5194Subroutine references are the simplest case. When the inclusion system
5195walks through @INC and encounters a subroutine, this subroutine gets
3b10bc60 5196called with two parameters, the first a reference to itself, and the
5197second the name of the file to be included (e.g., "F<Foo/Bar.pm>"). The
5198subroutine should return either nothing or else a list of up to three
5199values in the following order:
1f0bdf18
NC
5200
5201=over
5202
5203=item 1
5204
1f0bdf18
NC
5205A filehandle, from which the file will be read.
5206
cec0e1a7 5207=item 2
1f0bdf18 5208
60d352b3
RGS
5209A reference to a subroutine. If there is no filehandle (previous item),
5210then this subroutine is expected to generate one line of source code per
8f1da26d
TC
5211call, writing the line into C<$_> and returning 1, then finally at end of
5212file returning 0. If there is a filehandle, then the subroutine will be
b8921b3e 5213called to act as a simple source filter, with the line as read in C<$_>.
60d352b3
RGS
5214Again, return 1 for each valid line, and 0 after all lines have been
5215returned.
1f0bdf18 5216
cec0e1a7 5217=item 3
1f0bdf18
NC
5218
5219Optional state for the subroutine. The state is passed in as C<$_[1]>. A
5220reference to the subroutine itself is passed in as C<$_[0]>.
5221
5222=back
5223
5224If an empty list, C<undef>, or nothing that matches the first 3 values above
3b10bc60 5225is returned, then C<require> looks at the remaining elements of @INC.
5226Note that this filehandle must be a real filehandle (strictly a typeglob
8f1da26d
TC
5227or reference to a typeglob, whether blessed or unblessed); tied filehandles
5228will be ignored and processing will stop there.
d54b56d5
RGS
5229
5230If the hook is an array reference, its first element must be a subroutine
5231reference. This subroutine is called as above, but the first parameter is
3b10bc60 5232the array reference. This lets you indirectly pass arguments to
d54b56d5
RGS
5233the subroutine.
5234
5235In other words, you can write:
5236
5237 push @INC, \&my_sub;
5238 sub my_sub {
a9a5a0dc
VP
5239 my ($coderef, $filename) = @_; # $coderef is \&my_sub
5240 ...
d54b56d5
RGS
5241 }
5242
5243or:
5244
5245 push @INC, [ \&my_sub, $x, $y, ... ];
5246 sub my_sub {
a9a5a0dc
VP
5247 my ($arrayref, $filename) = @_;
5248 # Retrieve $x, $y, ...
5249 my @parameters = @$arrayref[1..$#$arrayref];
5250 ...
d54b56d5
RGS
5251 }
5252
cf264981 5253If the hook is an object, it must provide an INC method that will be
d54b56d5 5254called as above, the first parameter being the object itself. (Note that
92c6daad
NC
5255you must fully qualify the sub's name, as unqualified C<INC> is always forced
5256into package C<main>.) Here is a typical code layout:
d54b56d5
RGS
5257
5258 # In Foo.pm
5259 package Foo;
5260 sub new { ... }
5261 sub Foo::INC {
a9a5a0dc
VP
5262 my ($self, $filename) = @_;
5263 ...
d54b56d5
RGS
5264 }
5265
5266 # In the main program
797f796a 5267 push @INC, Foo->new(...);
d54b56d5 5268
3b10bc60 5269These hooks are also permitted to set the %INC entry
9ae8cd5b
RGS
5270corresponding to the files they have loaded. See L<perlvar/%INC>.
5271
ee580363 5272For a yet-more-powerful import facility, see L</use> and L<perlmod>.
a0d0e21e
LW
5273
5274=item reset EXPR
d74e8afc 5275X<reset>
a0d0e21e
LW
5276
5277=item reset
5278
5279Generally used in a C<continue> block at the end of a loop to clear
7660c0ab 5280variables and reset C<??> searches so that they work again. The
a0d0e21e
LW
5281expression is interpreted as a list of single characters (hyphens
5282allowed for ranges). All variables and arrays beginning with one of
5283those letters are reset to their pristine state. If the expression is
3b10bc60 5284omitted, one-match searches (C<?pattern?>) are reset to match again.
5285Only resets variables or searches in the current package. Always returns
52861. Examples:
a0d0e21e 5287
5ed4f2ec 5288 reset 'X'; # reset all X variables
5289 reset 'a-z'; # reset lower case variables
5290 reset; # just reset ?one-time? searches
a0d0e21e 5291
7660c0ab 5292Resetting C<"A-Z"> is not recommended because you'll wipe out your
2b5ab1e7 5293C<@ARGV> and C<@INC> arrays and your C<%ENV> hash. Resets only package
3b10bc60 5294variables; lexical variables are unaffected, but they clean themselves
2b5ab1e7
TC
5295up on scope exit anyway, so you'll probably want to use them instead.
5296See L</my>.
a0d0e21e 5297
54310121 5298=item return EXPR
d74e8afc 5299X<return>
54310121 5300
5301=item return
5302
b76cc8ba 5303Returns from a subroutine, C<eval>, or C<do FILE> with the value
5a964f20 5304given in EXPR. Evaluation of EXPR may be in list, scalar, or void
54310121 5305context, depending on how the return value will be used, and the context
01aa884e 5306may vary from one execution to the next (see L</wantarray>). If no EXPR
2b5ab1e7 5307is given, returns an empty list in list context, the undefined value in
3b10bc60 5308scalar context, and (of course) nothing at all in void context.
a0d0e21e 5309
3b10bc60 5310(In the absence of an explicit C<return>, a subroutine, eval,
5311or do FILE automatically returns the value of the last expression
2b5ab1e7 5312evaluated.)
a0d0e21e
LW
5313
5314=item reverse LIST
d74e8afc 5315X<reverse> X<rev> X<invert>
a0d0e21e 5316
5a964f20
TC
5317In list context, returns a list value consisting of the elements
5318of LIST in the opposite order. In scalar context, concatenates the
2b5ab1e7 5319elements of LIST and returns a string value with all characters
a0ed51b3 5320in the opposite order.
4633a7c4 5321
9649ed94 5322 print join(", ", reverse "world", "Hello"); # Hello, world
4633a7c4 5323
9649ed94 5324 print scalar reverse "dlrow ,", "olleH"; # Hello, world
2f9daede 5325
2d713cbd
RGS
5326Used without arguments in scalar context, reverse() reverses C<$_>.
5327
9649ed94
B
5328 $_ = "dlrow ,olleH";
5329 print reverse; # No output, list context
5330 print scalar reverse; # Hello, world
5331
437d4214 5332Note that reversing an array to itself (as in C<@a = reverse @a>) will
80d38338 5333preserve non-existent elements whenever possible, i.e., for non magical
437d4214
VP
5334arrays or tied arrays with C<EXISTS> and C<DELETE> methods.
5335
2f9daede
TP
5336This operator is also handy for inverting a hash, although there are some
5337caveats. If a value is duplicated in the original hash, only one of those
5338can be represented as a key in the inverted hash. Also, this has to
5339unwind one hash and build a whole new one, which may take some time
2b5ab1e7 5340on a large hash, such as from a DBM file.
2f9daede 5341
5ed4f2ec 5342 %by_name = reverse %by_address; # Invert the hash
a0d0e21e
LW
5343
5344=item rewinddir DIRHANDLE
d74e8afc 5345X<rewinddir>
a0d0e21e
LW
5346
5347Sets the current position to the beginning of the directory for the
19799a22 5348C<readdir> routine on DIRHANDLE.
a0d0e21e 5349
ea9eb35a
BJ
5350Portability issues: L<perlport/rewinddir>.
5351
a0d0e21e 5352=item rindex STR,SUBSTR,POSITION
d74e8afc 5353X<rindex>
a0d0e21e
LW
5354
5355=item rindex STR,SUBSTR
5356
ff551661 5357Works just like index() except that it returns the position of the I<last>
a0d0e21e 5358occurrence of SUBSTR in STR. If POSITION is specified, returns the
ff551661 5359last occurrence beginning at or before that position.
a0d0e21e
LW
5360
5361=item rmdir FILENAME
d74e8afc 5362X<rmdir> X<rd> X<directory, remove>
a0d0e21e 5363
54310121 5364=item rmdir
bbce6d69 5365
974da8e5 5366Deletes the directory specified by FILENAME if that directory is
8f1da26d 5367empty. If it succeeds it returns true; otherwise it returns false and
974da8e5 5368sets C<$!> (errno). If FILENAME is omitted, uses C<$_>.
a0d0e21e 5369
e1020413 5370To remove a directory tree recursively (C<rm -rf> on Unix) look at
dd184578
RGS
5371the C<rmtree> function of the L<File::Path> module.
5372
a0d0e21e
LW
5373=item s///
5374
9f4b9cd0 5375The substitution operator. See L<perlop/"Regexp Quote-Like Operators">.
a0d0e21e 5376
0d863452
RH
5377=item say FILEHANDLE LIST
5378X<say>
5379
dee33c94
TC
5380=item say FILEHANDLE
5381
0d863452
RH
5382=item say LIST
5383
5384=item say
5385
dee33c94
TC
5386Just like C<print>, but implicitly appends a newline. C<say LIST> is
5387simply an abbreviation for C<{ local $\ = "\n"; print LIST }>. To use
5388FILEHANDLE without a LIST to print the contents of C<$_> to it, you must
5389use a real filehandle like C<FH>, not an indirect one like C<$fh>.
f406c1e8 5390
4a904372
FC
5391This keyword is available only when the C<"say"> feature
5392is enabled, or when prefixed with C<CORE::>; see
8f1da26d
TC
5393L<feature>. Alternately, include a C<use v5.10> or later to the current
5394scope.
0d863452 5395
a0d0e21e 5396=item scalar EXPR
d74e8afc 5397X<scalar> X<context>
a0d0e21e 5398
5a964f20 5399Forces EXPR to be interpreted in scalar context and returns the value
54310121 5400of EXPR.
cb1a09d0
AD
5401
5402 @counts = ( scalar @a, scalar @b, scalar @c );
5403
54310121 5404There is no equivalent operator to force an expression to
2b5ab1e7 5405be interpolated in list context because in practice, this is never
cb1a09d0
AD
5406needed. If you really wanted to do so, however, you could use
5407the construction C<@{[ (some expression) ]}>, but usually a simple
5408C<(some expression)> suffices.
a0d0e21e 5409
8f1da26d
TC
5410Because C<scalar> is a unary operator, if you accidentally use a
5411parenthesized list for the EXPR, this behaves as a scalar comma expression,
5412evaluating all but the last element in void context and returning the final
5413element evaluated in scalar context. This is seldom what you want.
62c18ce2
GS
5414
5415The following single statement:
5416
5ed4f2ec 5417 print uc(scalar(&foo,$bar)),$baz;
62c18ce2
GS
5418
5419is the moral equivalent of these two:
5420
5ed4f2ec 5421 &foo;
5422 print(uc($bar),$baz);
62c18ce2
GS
5423
5424See L<perlop> for more details on unary operators and the comma operator.
5425
a0d0e21e 5426=item seek FILEHANDLE,POSITION,WHENCE
d74e8afc 5427X<seek> X<fseek> X<filehandle, position>
a0d0e21e 5428
19799a22 5429Sets FILEHANDLE's position, just like the C<fseek> call of C<stdio>.
8903cb82 5430FILEHANDLE may be an expression whose value gives the name of the
9124316e 5431filehandle. The values for WHENCE are C<0> to set the new position
8f1da26d
TC
5432I<in bytes> to POSITION; C<1> to set it to the current position plus
5433POSITION; and C<2> to set it to EOF plus POSITION, typically
5434negative. For WHENCE you may use the constants C<SEEK_SET>,
9124316e 5435C<SEEK_CUR>, and C<SEEK_END> (start of the file, current position, end
8f1da26d 5436of the file) from the L<Fcntl> module. Returns C<1> on success, false
9124316e
JH
5437otherwise.
5438
5439Note the I<in bytes>: even if the filehandle has been set to
740d4bb2 5440operate on characters (for example by using the C<:encoding(utf8)> open
fae2c0fb 5441layer), tell() will return byte offsets, not character offsets
9124316e 5442(because implementing that would render seek() and tell() rather slow).
8903cb82 5443
3b10bc60 5444If you want to position the file for C<sysread> or C<syswrite>, don't use
5445C<seek>, because buffering makes its effect on the file's read-write position
19799a22 5446unpredictable and non-portable. Use C<sysseek> instead.
a0d0e21e 5447
2b5ab1e7
TC
5448Due to the rules and rigors of ANSI C, on some systems you have to do a
5449seek whenever you switch between reading and writing. Amongst other
5450things, this may have the effect of calling stdio's clearerr(3).
5451A WHENCE of C<1> (C<SEEK_CUR>) is useful for not moving the file position:
cb1a09d0
AD
5452
5453 seek(TEST,0,1);
5454
5455This is also useful for applications emulating C<tail -f>. Once you hit
3b10bc60 5456EOF on your read and then sleep for a while, you (probably) have to stick in a
5457dummy seek() to reset things. The C<seek> doesn't change the position,
8903cb82 5458but it I<does> clear the end-of-file condition on the handle, so that the
3b10bc60 5459next C<< <FILE> >> makes Perl try again to read something. (We hope.)
cb1a09d0 5460
3b10bc60 5461If that doesn't work (some I/O implementations are particularly
5462cantankerous), you might need something like this:
cb1a09d0
AD
5463
5464 for (;;) {
a9a5a0dc 5465 for ($curpos = tell(FILE); $_ = <FILE>;
f86cebdf 5466 $curpos = tell(FILE)) {
a9a5a0dc
VP
5467 # search for some stuff and put it into files
5468 }
5469 sleep($for_a_while);
5470 seek(FILE, $curpos, 0);
cb1a09d0
AD
5471 }
5472
a0d0e21e 5473=item seekdir DIRHANDLE,POS
d74e8afc 5474X<seekdir>
a0d0e21e 5475
19799a22 5476Sets the current position for the C<readdir> routine on DIRHANDLE. POS
cf264981
SP
5477must be a value returned by C<telldir>. C<seekdir> also has the same caveats
5478about possible directory compaction as the corresponding system library
a0d0e21e
LW
5479routine.
5480
5481=item select FILEHANDLE
d74e8afc 5482X<select> X<filehandle, default>
a0d0e21e
LW
5483
5484=item select
5485
b5dffda6
RGS
5486Returns the currently selected filehandle. If FILEHANDLE is supplied,
5487sets the new current default filehandle for output. This has two
8f1da26d 5488effects: first, a C<write> or a C<print> without a filehandle
a0d0e21e 5489default to this FILEHANDLE. Second, references to variables related to
8f1da26d
TC
5490output will refer to this output channel.
5491
5492For example, to set the top-of-form format for more than one
5493output channel, you might do the following:
a0d0e21e
LW
5494
5495 select(REPORT1);
5496 $^ = 'report1_top';
5497 select(REPORT2);
5498 $^ = 'report2_top';
5499
5500FILEHANDLE may be an expression whose value gives the name of the
5501actual filehandle. Thus:
5502
5503 $oldfh = select(STDERR); $| = 1; select($oldfh);
5504
4633a7c4
LW
5505Some programmers may prefer to think of filehandles as objects with
5506methods, preferring to write the last example as:
a0d0e21e 5507
28757baa 5508 use IO::Handle;
a0d0e21e
LW
5509 STDERR->autoflush(1);
5510
ea9eb35a
BJ
5511Portability issues: L<perlport/select>.
5512
a0d0e21e 5513=item select RBITS,WBITS,EBITS,TIMEOUT
d74e8afc 5514X<select>
a0d0e21e 5515
3b10bc60 5516This calls the select(2) syscall with the bit masks specified, which
19799a22 5517can be constructed using C<fileno> and C<vec>, along these lines:
a0d0e21e
LW
5518
5519 $rin = $win = $ein = '';
f0815dd4
TC
5520 vec($rin, fileno(STDIN), 1) = 1;
5521 vec($win, fileno(STDOUT), 1) = 1;
a0d0e21e
LW
5522 $ein = $rin | $win;
5523
3b10bc60 5524If you want to select on many filehandles, you may wish to write a
5525subroutine like this:
a0d0e21e
LW
5526
5527 sub fhbits {
f0815dd4
TC
5528 my @fhlist = @_;
5529 my $bits = "";
5530 for my $fh (@fhlist) {
5531 vec($bits, fileno($fh), 1) = 1;
a9a5a0dc 5532 }
f0815dd4 5533 return $bits;
a0d0e21e 5534 }
f0815dd4 5535 $rin = fhbits(*STDIN, *TTY, *MYSOCK);
a0d0e21e
LW
5536
5537The usual idiom is:
5538
5539 ($nfound,$timeleft) =
5540 select($rout=$rin, $wout=$win, $eout=$ein, $timeout);
5541
54310121 5542or to block until something becomes ready just do this
a0d0e21e
LW
5543
5544 $nfound = select($rout=$rin, $wout=$win, $eout=$ein, undef);
5545
19799a22
GS
5546Most systems do not bother to return anything useful in $timeleft, so
5547calling select() in scalar context just returns $nfound.
c07a80fd 5548
5f05dabc 5549Any of the bit masks can also be undef. The timeout, if specified, is
a0d0e21e 5550in seconds, which may be fractional. Note: not all implementations are
be119125 5551capable of returning the $timeleft. If not, they always return
19799a22 5552$timeleft equal to the supplied $timeout.
a0d0e21e 5553
ff68c719 5554You can effect a sleep of 250 milliseconds this way:
a0d0e21e
LW
5555
5556 select(undef, undef, undef, 0.25);
5557
b09fc1d8 5558Note that whether C<select> gets restarted after signals (say, SIGALRM)
8b0ac1d7
MHM
5559is implementation-dependent. See also L<perlport> for notes on the
5560portability of C<select>.
40454f26 5561
f0815dd4 5562On error, C<select> behaves just like select(2): it returns
4189264e 5563-1 and sets C<$!>.
353e5636 5564
8f1da26d
TC
5565On some Unixes, select(2) may report a socket file descriptor as "ready for
5566reading" even when no data is available, and thus any subsequent C<read>
5567would block. This can be avoided if you always use O_NONBLOCK on the
5568socket. See select(2) and fcntl(2) for further details.
ec8ce15a 5569
f0815dd4
TC
5570The standard C<IO::Select> module provides a user-friendlier interface
5571to C<select>, mostly because it does all the bit-mask work for you.
5572
19799a22 5573B<WARNING>: One should not attempt to mix buffered I/O (like C<read>
61eff3bc 5574or <FH>) with C<select>, except as permitted by POSIX, and even
19799a22 5575then only on POSIX systems. You have to use C<sysread> instead.
a0d0e21e 5576
ea9eb35a
BJ
5577Portability issues: L<perlport/select>.
5578
a0d0e21e 5579=item semctl ID,SEMNUM,CMD,ARG
d74e8afc 5580X<semctl>
a0d0e21e 5581
3b10bc60 5582Calls the System V IPC function semctl(2). You'll probably have to say
0ade1984
JH
5583
5584 use IPC::SysV;
5585
5586first to get the correct constant definitions. If CMD is IPC_STAT or
cf264981 5587GETALL, then ARG must be a variable that will hold the returned
e4038a1f
MS
5588semid_ds structure or semaphore value array. Returns like C<ioctl>:
5589the undefined value for error, "C<0 but true>" for zero, or the actual
5590return value otherwise. The ARG must consist of a vector of native
106325ad 5591short integers, which may be created with C<pack("s!",(0)x$nsem)>.
4755096e
GS
5592See also L<perlipc/"SysV IPC">, C<IPC::SysV>, C<IPC::Semaphore>
5593documentation.
a0d0e21e 5594
ea9eb35a
BJ
5595Portability issues: L<perlport/semctl>.
5596
a0d0e21e 5597=item semget KEY,NSEMS,FLAGS
d74e8afc 5598X<semget>
a0d0e21e 5599
3b10bc60 5600Calls the System V IPC function semget(2). Returns the semaphore id, or
8f1da26d 5601the undefined value on error. See also
4755096e
GS
5602L<perlipc/"SysV IPC">, C<IPC::SysV>, C<IPC::SysV::Semaphore>
5603documentation.
a0d0e21e 5604
ea9eb35a
BJ
5605Portability issues: L<perlport/semget>.
5606
a0d0e21e 5607=item semop KEY,OPSTRING
d74e8afc 5608X<semop>
a0d0e21e 5609
80d38338 5610Calls the System V IPC function semop(2) for semaphore operations
5354997a 5611such as signalling and waiting. OPSTRING must be a packed array of
a0d0e21e 5612semop structures. Each semop structure can be generated with
cf264981
SP
5613C<pack("s!3", $semnum, $semop, $semflag)>. The length of OPSTRING
5614implies the number of semaphore operations. Returns true if
8f1da26d 5615successful, false on error. As an example, the
19799a22 5616following code waits on semaphore $semnum of semaphore id $semid:
a0d0e21e 5617
f878ba33 5618 $semop = pack("s!3", $semnum, -1, 0);
a0d0e21e
LW
5619 die "Semaphore trouble: $!\n" unless semop($semid, $semop);
5620
4755096e
GS
5621To signal the semaphore, replace C<-1> with C<1>. See also
5622L<perlipc/"SysV IPC">, C<IPC::SysV>, and C<IPC::SysV::Semaphore>
5623documentation.
a0d0e21e 5624
ea9eb35a
BJ
5625Portability issues: L<perlport/semop>.
5626
a0d0e21e 5627=item send SOCKET,MSG,FLAGS,TO
d74e8afc 5628X<send>
a0d0e21e
LW
5629
5630=item send SOCKET,MSG,FLAGS
5631
3b10bc60 5632Sends a message on a socket. Attempts to send the scalar MSG to the SOCKET
5633filehandle. Takes the same flags as the system call of the same name. On
5634unconnected sockets, you must specify a destination to I<send to>, in which
5635case it does a sendto(2) syscall. Returns the number of characters sent,
5636or the undefined value on error. The sendmsg(2) syscall is currently
5637unimplemented. See L<perlipc/"UDP: Message Passing"> for examples.
9124316e
JH
5638
5639Note the I<characters>: depending on the status of the socket, either
5640(8-bit) bytes or characters are sent. By default all sockets operate
5641on bytes, but for example if the socket has been changed using
740d4bb2
JW
5642binmode() to operate with the C<:encoding(utf8)> I/O layer (see
5643L</open>, or the C<open> pragma, L<open>), the I/O will operate on UTF-8
5644encoded Unicode characters, not bytes. Similarly for the C<:encoding>
5645pragma: in that case pretty much any characters can be sent.
a0d0e21e
LW
5646
5647=item setpgrp PID,PGRP
d74e8afc 5648X<setpgrp> X<group>
a0d0e21e 5649
7660c0ab 5650Sets the current process group for the specified PID, C<0> for the current
3b10bc60 5651process. Raises an exception when used on a machine that doesn't
81777298
GS
5652implement POSIX setpgid(2) or BSD setpgrp(2). If the arguments are omitted,
5653it defaults to C<0,0>. Note that the BSD 4.2 version of C<setpgrp> does not
5654accept any arguments, so only C<setpgrp(0,0)> is portable. See also
5655C<POSIX::setsid()>.
a0d0e21e 5656
ea9eb35a
BJ
5657Portability issues: L<perlport/setpgrp>.
5658
a0d0e21e 5659=item setpriority WHICH,WHO,PRIORITY
d74e8afc 5660X<setpriority> X<priority> X<nice> X<renice>
a0d0e21e
LW
5661
5662Sets the current priority for a process, a process group, or a user.
3b10bc60 5663(See setpriority(2).) Raises an exception when used on a machine
f86cebdf 5664that doesn't implement setpriority(2).
a0d0e21e 5665
ea9eb35a
BJ
5666Portability issues: L<perlport/setpriority>.
5667
a0d0e21e 5668=item setsockopt SOCKET,LEVEL,OPTNAME,OPTVAL
d74e8afc 5669X<setsockopt>
a0d0e21e 5670
8f1da26d
TC
5671Sets the socket option requested. Returns C<undef> on error.
5672Use integer constants provided by the C<Socket> module for
23d0437f
GA
5673LEVEL and OPNAME. Values for LEVEL can also be obtained from
5674getprotobyname. OPTVAL might either be a packed string or an integer.
5675An integer OPTVAL is shorthand for pack("i", OPTVAL).
5676
3b10bc60 5677An example disabling Nagle's algorithm on a socket:
23d0437f
GA
5678
5679 use Socket qw(IPPROTO_TCP TCP_NODELAY);
5680 setsockopt($socket, IPPROTO_TCP, TCP_NODELAY, 1);
a0d0e21e 5681
ea9eb35a
BJ
5682Portability issues: L<perlport/setsockopt>.
5683
532eee96 5684=item shift ARRAY
d74e8afc 5685X<shift>
a0d0e21e 5686
f5a93a43
TC
5687=item shift EXPR
5688
a0d0e21e
LW
5689=item shift
5690
5691Shifts the first value of the array off and returns it, shortening the
5692array by 1 and moving everything down. If there are no elements in the
5693array, returns the undefined value. If ARRAY is omitted, shifts the
7660c0ab 5694C<@_> array within the lexical scope of subroutines and formats, and the
80d38338 5695C<@ARGV> array outside a subroutine and also within the lexical scopes
3c10abe3 5696established by the C<eval STRING>, C<BEGIN {}>, C<INIT {}>, C<CHECK {}>,
8f1da26d 5697C<UNITCHECK {}>, and C<END {}> constructs.
4f25aa18 5698
f5a93a43
TC
5699Starting with Perl 5.14, C<shift> can take a scalar EXPR, which must hold a
5700reference to an unblessed array. The argument will be dereferenced
5701automatically. This aspect of C<shift> is considered highly experimental.
5702The exact behaviour may change in a future version of Perl.
cba5a3b0 5703
a1b2c429 5704See also C<unshift>, C<push>, and C<pop>. C<shift> and C<unshift> do the
19799a22 5705same thing to the left end of an array that C<pop> and C<push> do to the
977336f5 5706right end.
a0d0e21e
LW
5707
5708=item shmctl ID,CMD,ARG
d74e8afc 5709X<shmctl>
a0d0e21e 5710
0ade1984
JH
5711Calls the System V IPC function shmctl. You'll probably have to say
5712
5713 use IPC::SysV;
5714
7660c0ab 5715first to get the correct constant definitions. If CMD is C<IPC_STAT>,
cf264981 5716then ARG must be a variable that will hold the returned C<shmid_ds>
8f1da26d
TC
5717structure. Returns like ioctl: C<undef> for error; "C<0> but
5718true" for zero; and the actual return value otherwise.
4755096e 5719See also L<perlipc/"SysV IPC"> and C<IPC::SysV> documentation.
a0d0e21e 5720
ea9eb35a
BJ
5721Portability issues: L<perlport/shmctl>.
5722
a0d0e21e 5723=item shmget KEY,SIZE,FLAGS
d74e8afc 5724X<shmget>
a0d0e21e
LW
5725
5726Calls the System V IPC function shmget. Returns the shared memory
8f1da26d 5727segment id, or C<undef> on error.
4755096e 5728See also L<perlipc/"SysV IPC"> and C<IPC::SysV> documentation.
a0d0e21e 5729
ea9eb35a
BJ
5730Portability issues: L<perlport/shmget>.
5731
a0d0e21e 5732=item shmread ID,VAR,POS,SIZE
d74e8afc
ITB
5733X<shmread>
5734X<shmwrite>
a0d0e21e
LW
5735
5736=item shmwrite ID,STRING,POS,SIZE
5737
5738Reads or writes the System V shared memory segment ID starting at
5739position POS for size SIZE by attaching to it, copying in/out, and
5a964f20 5740detaching from it. When reading, VAR must be a variable that will
a0d0e21e
LW
5741hold the data read. When writing, if STRING is too long, only SIZE
5742bytes are used; if STRING is too short, nulls are written to fill out
8f1da26d 5743SIZE bytes. Return true if successful, false on error.
4755096e 5744shmread() taints the variable. See also L<perlipc/"SysV IPC">,
8f1da26d 5745C<IPC::SysV>, and the C<IPC::Shareable> module from CPAN.
a0d0e21e 5746
ea9eb35a
BJ
5747Portability issues: L<perlport/shmread> and L<perlport/shmwrite>.
5748
a0d0e21e 5749=item shutdown SOCKET,HOW
d74e8afc 5750X<shutdown>
a0d0e21e
LW
5751
5752Shuts down a socket connection in the manner indicated by HOW, which
3b10bc60 5753has the same interpretation as in the syscall of the same name.
a0d0e21e 5754
f86cebdf
GS
5755 shutdown(SOCKET, 0); # I/we have stopped reading data
5756 shutdown(SOCKET, 1); # I/we have stopped writing data
5757 shutdown(SOCKET, 2); # I/we have stopped using this socket
5a964f20
TC
5758
5759This is useful with sockets when you want to tell the other
5760side you're done writing but not done reading, or vice versa.
b76cc8ba 5761It's also a more insistent form of close because it also
19799a22 5762disables the file descriptor in any forked copies in other
5a964f20
TC
5763processes.
5764
3b10bc60 5765Returns C<1> for success; on error, returns C<undef> if
f126b98b
PF
5766the first argument is not a valid filehandle, or returns C<0> and sets
5767C<$!> for any other failure.
5768
a0d0e21e 5769=item sin EXPR
d74e8afc 5770X<sin> X<sine> X<asin> X<arcsine>
a0d0e21e 5771
54310121 5772=item sin
bbce6d69 5773
a0d0e21e 5774Returns the sine of EXPR (expressed in radians). If EXPR is omitted,
7660c0ab 5775returns sine of C<$_>.
a0d0e21e 5776
ca6e1c26 5777For the inverse sine operation, you may use the C<Math::Trig::asin>
28757baa 5778function, or use this relation:
5779
5780 sub asin { atan2($_[0], sqrt(1 - $_[0] * $_[0])) }
5781
a0d0e21e 5782=item sleep EXPR
d74e8afc 5783X<sleep> X<pause>
a0d0e21e
LW
5784
5785=item sleep
5786
80d38338
TC
5787Causes the script to sleep for (integer) EXPR seconds, or forever if no
5788argument is given. Returns the integer number of seconds actually slept.
b48653af 5789
7660c0ab 5790May be interrupted if the process receives a signal such as C<SIGALRM>.
b48653af
MS
5791
5792 eval {
5793 local $SIG{ALARM} = sub { die "Alarm!\n" };
5794 sleep;
5795 };
5796 die $@ unless $@ eq "Alarm!\n";
5797
5798You probably cannot mix C<alarm> and C<sleep> calls, because C<sleep>
5799is often implemented using C<alarm>.
a0d0e21e
LW
5800
5801On some older systems, it may sleep up to a full second less than what
5802you requested, depending on how it counts seconds. Most modern systems
5a964f20
TC
5803always sleep the full amount. They may appear to sleep longer than that,
5804however, because your process might not be scheduled right away in a
5805busy multitasking system.
a0d0e21e 5806
2bc69794
BS
5807For delays of finer granularity than one second, the Time::HiRes module
5808(from CPAN, and starting from Perl 5.8 part of the standard
5809distribution) provides usleep(). You may also use Perl's four-argument
5810version of select() leaving the first three arguments undefined, or you
5811might be able to use the C<syscall> interface to access setitimer(2) if
5812your system supports it. See L<perlfaq8> for details.
cb1a09d0 5813
b6e2112e 5814See also the POSIX module's C<pause> function.
5f05dabc 5815
a0d0e21e 5816=item socket SOCKET,DOMAIN,TYPE,PROTOCOL
d74e8afc 5817X<socket>
a0d0e21e
LW
5818
5819Opens a socket of the specified kind and attaches it to filehandle
19799a22 5820SOCKET. DOMAIN, TYPE, and PROTOCOL are specified the same as for
3b10bc60 5821the syscall of the same name. You should C<use Socket> first
19799a22
GS
5822to get the proper definitions imported. See the examples in
5823L<perlipc/"Sockets: Client/Server Communication">.
a0d0e21e 5824
8d2a6795
GS
5825On systems that support a close-on-exec flag on files, the flag will
5826be set for the newly opened file descriptor, as determined by the
5827value of $^F. See L<perlvar/$^F>.
5828
a0d0e21e 5829=item socketpair SOCKET1,SOCKET2,DOMAIN,TYPE,PROTOCOL
d74e8afc 5830X<socketpair>
a0d0e21e
LW
5831
5832Creates an unnamed pair of sockets in the specified domain, of the
5f05dabc 5833specified type. DOMAIN, TYPE, and PROTOCOL are specified the same as
3b10bc60 5834for the syscall of the same name. If unimplemented, raises an exception.
5835Returns true if successful.
a0d0e21e 5836
8d2a6795
GS
5837On systems that support a close-on-exec flag on files, the flag will
5838be set for the newly opened file descriptors, as determined by the value
5839of $^F. See L<perlvar/$^F>.
5840
19799a22 5841Some systems defined C<pipe> in terms of C<socketpair>, in which a call
5a964f20
TC
5842to C<pipe(Rdr, Wtr)> is essentially:
5843
5844 use Socket;
5845 socketpair(Rdr, Wtr, AF_UNIX, SOCK_STREAM, PF_UNSPEC);
5846 shutdown(Rdr, 1); # no more writing for reader
5847 shutdown(Wtr, 0); # no more reading for writer
5848
02fc2eee
NC
5849See L<perlipc> for an example of socketpair use. Perl 5.8 and later will
5850emulate socketpair using IP sockets to localhost if your system implements
5851sockets but not socketpair.
5a964f20 5852
ea9eb35a
BJ
5853Portability issues: L<perlport/socketpair>.
5854
a0d0e21e 5855=item sort SUBNAME LIST
d74e8afc 5856X<sort> X<qsort> X<quicksort> X<mergesort>
a0d0e21e
LW
5857
5858=item sort BLOCK LIST
5859
5860=item sort LIST
5861
41d39f30 5862In list context, this sorts the LIST and returns the sorted list value.
9fdc1d08 5863In scalar context, the behaviour of C<sort()> is undefined.
41d39f30
A
5864
5865If SUBNAME or BLOCK is omitted, C<sort>s in standard string comparison
5866order. If SUBNAME is specified, it gives the name of a subroutine
5867that returns an integer less than, equal to, or greater than C<0>,
3b10bc60 5868depending on how the elements of the list are to be ordered. (The
5869C<< <=> >> and C<cmp> operators are extremely useful in such routines.)
41d39f30
A
5870SUBNAME may be a scalar variable name (unsubscripted), in which case
5871the value provides the name of (or a reference to) the actual
5872subroutine to use. In place of a SUBNAME, you can provide a BLOCK as
5873an anonymous, in-line sort subroutine.
a0d0e21e 5874
8f1da26d
TC
5875If the subroutine's prototype is C<($$)>, the elements to be compared are
5876passed by reference in C<@_>, as for a normal subroutine. This is slower
5877than unprototyped subroutines, where the elements to be compared are passed
5878into the subroutine as the package global variables $a and $b (see example
5879below). Note that in the latter case, it is usually highly counter-productive
5880to declare $a and $b as lexicals.
43481408 5881
c106e8bb
RH
5882The values to be compared are always passed by reference and should not
5883be modified.
a0d0e21e 5884
0a753a76 5885You also cannot exit out of the sort block or subroutine using any of the
19799a22 5886loop control operators described in L<perlsyn> or with C<goto>.
0a753a76 5887
a034a98d
DD
5888When C<use locale> is in effect, C<sort LIST> sorts LIST according to the
5889current collation locale. See L<perllocale>.
5890
db5021a3
MS
5891sort() returns aliases into the original list, much as a for loop's index
5892variable aliases the list elements. That is, modifying an element of a
5893list returned by sort() (for example, in a C<foreach>, C<map> or C<grep>)
5894actually modifies the element in the original list. This is usually
5895something to be avoided when writing clear code.
5896
58c7fc7c 5897Perl 5.6 and earlier used a quicksort algorithm to implement sort.
8f1da26d 5898That algorithm was not stable, so I<could> go quadratic. (A I<stable> sort
58c7fc7c
JH
5899preserves the input order of elements that compare equal. Although
5900quicksort's run time is O(NlogN) when averaged over all arrays of
5901length N, the time can be O(N**2), I<quadratic> behavior, for some
5902inputs.) In 5.7, the quicksort implementation was replaced with
cf264981 5903a stable mergesort algorithm whose worst-case behavior is O(NlogN).
58c7fc7c
JH
5904But benchmarks indicated that for some inputs, on some platforms,
5905the original quicksort was faster. 5.8 has a sort pragma for
5906limited control of the sort. Its rather blunt control of the
cf264981 5907underlying algorithm may not persist into future Perls, but the
58c7fc7c 5908ability to characterize the input or output in implementation
c25fe68d 5909independent ways quite probably will. See L<the sort pragma|sort>.
c16425f1 5910
a0d0e21e
LW
5911Examples:
5912
5913 # sort lexically
5914 @articles = sort @files;
e1d16ab7 5915
a0d0e21e
LW
5916 # same thing, but with explicit sort routine
5917 @articles = sort {$a cmp $b} @files;
e1d16ab7 5918
cb1a09d0 5919 # now case-insensitively
54310121 5920 @articles = sort {uc($a) cmp uc($b)} @files;
e1d16ab7 5921
a0d0e21e
LW
5922 # same thing in reversed order
5923 @articles = sort {$b cmp $a} @files;
e1d16ab7 5924
a0d0e21e
LW
5925 # sort numerically ascending
5926 @articles = sort {$a <=> $b} @files;
e1d16ab7 5927
a0d0e21e
LW
5928 # sort numerically descending
5929 @articles = sort {$b <=> $a} @files;
e1d16ab7 5930
19799a22
GS
5931 # this sorts the %age hash by value instead of key
5932 # using an in-line function
5933 @eldest = sort { $age{$b} <=> $age{$a} } keys %age;
e1d16ab7 5934
a0d0e21e
LW
5935 # sort using explicit subroutine name
5936 sub byage {
4d0444a3 5937 $age{$a} <=> $age{$b}; # presuming numeric
a0d0e21e
LW
5938 }
5939 @sortedclass = sort byage @class;
e1d16ab7 5940
19799a22
GS
5941 sub backwards { $b cmp $a }
5942 @harry = qw(dog cat x Cain Abel);
5943 @george = qw(gone chased yz Punished Axed);
a0d0e21e 5944 print sort @harry;
e1d16ab7 5945 # prints AbelCaincatdogx
a0d0e21e 5946 print sort backwards @harry;
e1d16ab7 5947 # prints xdogcatCainAbel
a0d0e21e 5948 print sort @george, 'to', @harry;
e1d16ab7 5949 # prints AbelAxedCainPunishedcatchaseddoggonetoxyz
a0d0e21e 5950
54310121 5951 # inefficiently sort by descending numeric compare using
5952 # the first integer after the first = sign, or the
cb1a09d0
AD
5953 # whole record case-insensitively otherwise
5954
e1d16ab7 5955 my @new = sort {
5956 ($b =~ /=(\d+)/)[0] <=> ($a =~ /=(\d+)/)[0]
4d0444a3
FC
5957 ||
5958 uc($a) cmp uc($b)
cb1a09d0
AD
5959 } @old;
5960
5961 # same thing, but much more efficiently;
5962 # we'll build auxiliary indices instead
5963 # for speed
e1d16ab7 5964 my @nums = @caps = ();
54310121 5965 for (@old) {
e1d16ab7 5966 push @nums, ( /=(\d+)/ ? $1 : undef );
5967 push @caps, uc($_);
54310121 5968 }
cb1a09d0 5969
e1d16ab7 5970 my @new = @old[ sort {
4d0444a3
FC
5971 $nums[$b] <=> $nums[$a]
5972 ||
5973 $caps[$a] cmp $caps[$b]
5974 } 0..$#old
5975 ];
cb1a09d0 5976
19799a22 5977 # same thing, but without any temps
cb1a09d0 5978 @new = map { $_->[0] }
19799a22 5979 sort { $b->[1] <=> $a->[1]
4d0444a3
FC
5980 ||
5981 $a->[2] cmp $b->[2]
5982 } map { [$_, /=(\d+)/, uc($_)] } @old;
61eff3bc 5983
43481408
GS
5984 # using a prototype allows you to use any comparison subroutine
5985 # as a sort subroutine (including other package's subroutines)
5986 package other;
5ed4f2ec 5987 sub backwards ($$) { $_[1] cmp $_[0]; } # $a and $b are not set here
e1d16ab7 5988
43481408
GS
5989 package main;
5990 @new = sort other::backwards @old;
e1d16ab7 5991
58c7fc7c
JH
5992 # guarantee stability, regardless of algorithm
5993 use sort 'stable';
5994 @new = sort { substr($a, 3, 5) cmp substr($b, 3, 5) } @old;
e1d16ab7 5995
268e9d79
JL
5996 # force use of mergesort (not portable outside Perl 5.8)
5997 use sort '_mergesort'; # note discouraging _
58c7fc7c 5998 @new = sort { substr($a, 3, 5) cmp substr($b, 3, 5) } @old;
58c7fc7c 5999
1cb246e8
RGS
6000Warning: syntactical care is required when sorting the list returned from
6001a function. If you want to sort the list returned by the function call
6002C<find_records(@key)>, you can use:
a9320c62 6003
a9320c62
B
6004 @contact = sort { $a cmp $b } find_records @key;
6005 @contact = sort +find_records(@key);
6006 @contact = sort &find_records(@key);
6007 @contact = sort(find_records(@key));
6008
6009If instead you want to sort the array @key with the comparison routine
1cb246e8
RGS
6010C<find_records()> then you can use:
6011
a9320c62
B
6012 @contact = sort { find_records() } @key;
6013 @contact = sort find_records(@key);
6014 @contact = sort(find_records @key);
6015 @contact = sort(find_records (@key));
6016
19799a22
GS
6017If you're using strict, you I<must not> declare $a
6018and $b as lexicals. They are package globals. That means
1cb246e8 6019that if you're in the C<main> package and type
13a2d996 6020
47223a36 6021 @articles = sort {$b <=> $a} @files;
13a2d996 6022
47223a36
JH
6023then C<$a> and C<$b> are C<$main::a> and C<$main::b> (or C<$::a> and C<$::b>),
6024but if you're in the C<FooPack> package, it's the same as typing
cb1a09d0
AD
6025
6026 @articles = sort {$FooPack::b <=> $FooPack::a} @files;
6027
55497cff 6028The comparison function is required to behave. If it returns
7660c0ab
A
6029inconsistent results (sometimes saying C<$x[1]> is less than C<$x[2]> and
6030sometimes saying the opposite, for example) the results are not
6031well-defined.
55497cff 6032
03190201 6033Because C<< <=> >> returns C<undef> when either operand is C<NaN>
7698aede 6034(not-a-number), and also because C<sort> raises an exception unless the
8f1da26d
TC
6035result of a comparison is defined, be careful when sorting with a
6036comparison function like C<< $a <=> $b >> any lists that might contain a
6037C<NaN>. The following example takes advantage that C<NaN != NaN> to
3b10bc60 6038eliminate any C<NaN>s from the input list.
03190201
JL
6039
6040 @result = sort { $a <=> $b } grep { $_ == $_ } @input;
6041
f5a93a43 6042=item splice ARRAY or EXPR,OFFSET,LENGTH,LIST
d74e8afc 6043X<splice>
a0d0e21e 6044
f5a93a43 6045=item splice ARRAY or EXPR,OFFSET,LENGTH
a0d0e21e 6046
f5a93a43 6047=item splice ARRAY or EXPR,OFFSET
a0d0e21e 6048
f5a93a43 6049=item splice ARRAY or EXPR
453f9044 6050
a0d0e21e 6051Removes the elements designated by OFFSET and LENGTH from an array, and
5a964f20
TC
6052replaces them with the elements of LIST, if any. In list context,
6053returns the elements removed from the array. In scalar context,
43051805 6054returns the last element removed, or C<undef> if no elements are
48cdf507 6055removed. The array grows or shrinks as necessary.
19799a22 6056If OFFSET is negative then it starts that far from the end of the array.
48cdf507 6057If LENGTH is omitted, removes everything from OFFSET onward.
d0920e03
MJD
6058If LENGTH is negative, removes the elements from OFFSET onward
6059except for -LENGTH elements at the end of the array.
8cbc2e3b 6060If both OFFSET and LENGTH are omitted, removes everything. If OFFSET is
3b10bc60 6061past the end of the array, Perl issues a warning, and splices at the
8cbc2e3b 6062end of the array.
453f9044 6063
e1dccc0d 6064The following equivalences hold (assuming C<< $#a >= $i >> )
a0d0e21e 6065
5ed4f2ec 6066 push(@a,$x,$y) splice(@a,@a,0,$x,$y)
6067 pop(@a) splice(@a,-1)
6068 shift(@a) splice(@a,0,1)
6069 unshift(@a,$x,$y) splice(@a,0,0,$x,$y)
6070 $a[$i] = $y splice(@a,$i,1,$y)
a0d0e21e
LW
6071
6072Example, assuming array lengths are passed before arrays:
6073
5ed4f2ec 6074 sub aeq { # compare two list values
a9a5a0dc
VP
6075 my(@a) = splice(@_,0,shift);
6076 my(@b) = splice(@_,0,shift);
6077 return 0 unless @a == @b; # same len?
6078 while (@a) {
6079 return 0 if pop(@a) ne pop(@b);
6080 }
6081 return 1;
a0d0e21e
LW
6082 }
6083 if (&aeq($len,@foo[1..$len],0+@bar,@bar)) { ... }
6084
f5a93a43
TC
6085Starting with Perl 5.14, C<splice> can take scalar EXPR, which must hold a
6086reference to an unblessed array. The argument will be dereferenced
6087automatically. This aspect of C<splice> is considered highly experimental.
6088The exact behaviour may change in a future version of Perl.
532eee96 6089
a0d0e21e 6090=item split /PATTERN/,EXPR,LIMIT
d74e8afc 6091X<split>
a0d0e21e
LW
6092
6093=item split /PATTERN/,EXPR
6094
6095=item split /PATTERN/
6096
6097=item split
6098
b2e26e6e
DJ
6099Splits the string EXPR into a list of strings and returns that list. By
6100default, empty leading fields are preserved, and empty trailing ones are
ab7ee80f 6101deleted. (If all fields are empty, they are considered to be trailing.)
a0d0e21e 6102
a6d8037e 6103In scalar context, returns the number of fields found.
a0d0e21e 6104
7660c0ab 6105If EXPR is omitted, splits the C<$_> string. If PATTERN is also omitted,
4633a7c4
LW
6106splits on whitespace (after skipping any leading whitespace). Anything
6107matching PATTERN is taken to be a delimiter separating the fields. (Note
fb73857a 6108that the delimiter may be longer than one character.)
6109
836e0ee7 6110If LIMIT is specified and positive, it represents the maximum number
e833de1e
BS
6111of fields the EXPR will be split into, though the actual number of
6112fields returned depends on the number of times PATTERN matches within
6113EXPR. If LIMIT is unspecified or zero, trailing null fields are
6114stripped (which potential users of C<pop> would do well to remember).
6115If LIMIT is negative, it is treated as if an arbitrarily large LIMIT
6116had been specified. Note that splitting an EXPR that evaluates to the
6117empty string always returns the empty list, regardless of the LIMIT
6118specified.
a0d0e21e 6119
3b10bc60 6120A pattern matching the empty string (not to be confused with
6121an empty pattern C<//>, which is just one member of the set of patterns
7698aede 6122matching the empty string), splits EXPR into individual
3b10bc60 6123characters. For example:
a0d0e21e 6124
8241c1c0 6125 print join(':', split(/ */, 'hi there')), "\n";
a0d0e21e
LW
6126
6127produces the output 'h:i:t:h:e:r:e'.
6128
3b10bc60 6129As a special case for C<split>, the empty pattern C<//> specifically
6130matches the empty string; this is not be confused with the normal use
6131of an empty pattern to mean the last successful match. So to split
6132a string into individual characters, the following:
6de67870 6133
8241c1c0 6134 print join(':', split(//, 'hi there')), "\n";
52ea55c9 6135
de5763b0 6136produces the output 'h:i: :t:h:e:r:e'.
52ea55c9 6137
12977212
FC
6138Empty leading fields are produced when there are positive-width matches at
6139the beginning of the string; a zero-width match at the beginning of
6140the string does not produce an empty field. For example:
0156e0fd
RB
6141
6142 print join(':', split(/(?=\w)/, 'hi there!'));
6143
12977212
FC
6144produces the output 'h:i :t:h:e:r:e!'. Empty trailing fields, on the other
6145hand, are produced when there is a match at the end of the string (and
6146when LIMIT is given and is not 0), regardless of the length of the match.
6147For example:
6148
8241c1c0
B
6149 print join(':', split(//, 'hi there!', -1)), "\n";
6150 print join(':', split(/\W/, 'hi there!', -1)), "\n";
12977212
FC
6151
6152produce the output 'h:i: :t:h:e:r:e:!:' and 'hi:there:', respectively,
6153both with an empty trailing field.
0156e0fd 6154
5f05dabc 6155The LIMIT parameter can be used to split a line partially
a0d0e21e
LW
6156
6157 ($login, $passwd, $remainder) = split(/:/, $_, 3);
6158
b5da07fd
TB
6159When assigning to a list, if LIMIT is omitted, or zero, Perl supplies
6160a LIMIT one larger than the number of variables in the list, to avoid
a0d0e21e
LW
6161unnecessary work. For the list above LIMIT would have been 4 by
6162default. In time critical applications it behooves you not to split
6163into more fields than you really need.
6164
19799a22 6165If the PATTERN contains parentheses, additional list elements are
a0d0e21e
LW
6166created from each matching substring in the delimiter.
6167
da0045b7 6168 split(/([,-])/, "1-10,20", 3);
a0d0e21e
LW
6169
6170produces the list value
6171
6172 (1, '-', 10, ',', 20)
6173
19799a22 6174If you had the entire header of a normal Unix email message in $header,
4633a7c4
LW
6175you could split it up into fields and their values this way:
6176
9f4b9cd0 6177 $header =~ s/\n(?=\s)//g; # fix continuation lines
fb73857a 6178 %hdrs = (UNIX_FROM => split /^(\S*?):\s*/m, $header);
4633a7c4 6179
a0d0e21e
LW
6180The pattern C</PATTERN/> may be replaced with an expression to specify
6181patterns that vary at runtime. (To do runtime compilation only once,
748a9306
LW
6182use C</$variable/o>.)
6183
5da728e2
A
6184As a special case, specifying a PATTERN of space (S<C<' '>>) will split on
6185white space just as C<split> with no arguments does. Thus, S<C<split(' ')>> can
6186be used to emulate B<awk>'s default behavior, whereas S<C<split(/ /)>>
3b10bc60 6187will give you as many initial null fields (empty string) as there are leading spaces.
5da728e2 6188A C<split> on C</\s+/> is like a S<C<split(' ')>> except that any leading
19799a22 6189whitespace produces a null first field. A C<split> with no arguments
5da728e2 6190really does a S<C<split(' ', $_)>> internally.
a0d0e21e 6191
cc50a203 6192A PATTERN of C</^/> is treated as if it were C</^/m>, since it isn't
1ec94568
MG
6193much use otherwise.
6194
a0d0e21e
LW
6195Example:
6196
5a964f20
TC
6197 open(PASSWD, '/etc/passwd');
6198 while (<PASSWD>) {
5b3eff12
MS
6199 chomp;
6200 ($login, $passwd, $uid, $gid,
f86cebdf 6201 $gcos, $home, $shell) = split(/:/);
a9a5a0dc 6202 #...
a0d0e21e
LW
6203 }
6204
6de67870
JP
6205As with regular pattern matching, any capturing parentheses that are not
6206matched in a C<split()> will be set to C<undef> when returned:
6207
6208 @fields = split /(A)|B/, "1A2B3";
6209 # @fields is (1, 'A', 2, undef, 3)
a0d0e21e 6210
5f05dabc 6211=item sprintf FORMAT, LIST
d74e8afc 6212X<sprintf>
a0d0e21e 6213
6662521e
GS
6214Returns a string formatted by the usual C<printf> conventions of the C
6215library function C<sprintf>. See below for more details
01aa884e 6216and see L<sprintf(3)> or L<printf(3)> on your system for an explanation of
6662521e
GS
6217the general principles.
6218
6219For example:
6220
6221 # Format number with up to 8 leading zeroes
6222 $result = sprintf("%08d", $number);
6223
6224 # Round number to 3 digits after decimal point
6225 $rounded = sprintf("%.3f", $number);
74a77017 6226
3b10bc60 6227Perl does its own C<sprintf> formatting: it emulates the C
6228function sprintf(3), but doesn't use it except for floating-point
6229numbers, and even then only standard modifiers are allowed.
6230Non-standard extensions in your local sprintf(3) are
6231therefore unavailable from Perl.
74a77017 6232
194e7b38
DC
6233Unlike C<printf>, C<sprintf> does not do what you probably mean when you
6234pass it an array as your first argument. The array is given scalar context,
6235and instead of using the 0th element of the array as the format, Perl will
6236use the count of elements in the array as the format, which is almost never
6237useful.
6238
19799a22 6239Perl's C<sprintf> permits the following universally-known conversions:
74a77017 6240
5ed4f2ec 6241 %% a percent sign
6242 %c a character with the given number
6243 %s a string
6244 %d a signed integer, in decimal
6245 %u an unsigned integer, in decimal
6246 %o an unsigned integer, in octal
6247 %x an unsigned integer, in hexadecimal
6248 %e a floating-point number, in scientific notation
6249 %f a floating-point number, in fixed decimal notation
6250 %g a floating-point number, in %e or %f notation
74a77017 6251
1b3f7d21 6252In addition, Perl permits the following widely-supported conversions:
74a77017 6253
5ed4f2ec 6254 %X like %x, but using upper-case letters
6255 %E like %e, but using an upper-case "E"
6256 %G like %g, but with an upper-case "E" (if applicable)
6257 %b an unsigned integer, in binary
6258 %B like %b, but using an upper-case "B" with the # flag
6259 %p a pointer (outputs the Perl value's address in hexadecimal)
6260 %n special: *stores* the number of characters output so far
4d0444a3 6261 into the next variable in the parameter list
74a77017 6262
1b3f7d21
CS
6263Finally, for backward (and we do mean "backward") compatibility, Perl
6264permits these unnecessary but widely-supported conversions:
74a77017 6265
5ed4f2ec 6266 %i a synonym for %d
6267 %D a synonym for %ld
6268 %U a synonym for %lu
6269 %O a synonym for %lo
6270 %F a synonym for %f
74a77017 6271
7b8dd722
HS
6272Note that the number of exponent digits in the scientific notation produced
6273by C<%e>, C<%E>, C<%g> and C<%G> for numbers with the modulus of the
b73fd64e
JH
6274exponent less than 100 is system-dependent: it may be three or less
6275(zero-padded as necessary). In other words, 1.23 times ten to the
627699th may be either "1.23e99" or "1.23e099".
d764f01a 6277
80d38338 6278Between the C<%> and the format letter, you may specify several
7b8dd722
HS
6279additional attributes controlling the interpretation of the format.
6280In order, these are:
74a77017 6281
7b8dd722
HS
6282=over 4
6283
6284=item format parameter index
6285
6286An explicit format parameter index, such as C<2$>. By default sprintf
6287will format the next unused argument in the list, but this allows you
3b10bc60 6288to take the arguments out of order:
7b8dd722
HS
6289
6290 printf '%2$d %1$d', 12, 34; # prints "34 12"
6291 printf '%3$d %d %1$d', 1, 2, 3; # prints "3 1 1"
6292
6293=item flags
6294
6295one or more of:
e6bb52fd 6296
7a81c58e
A
6297 space prefix non-negative number with a space
6298 + prefix non-negative number with a plus sign
74a77017
CS
6299 - left-justify within the field
6300 0 use zeros, not spaces, to right-justify
e6bb52fd
TS
6301 # ensure the leading "0" for any octal,
6302 prefix non-zero hexadecimal with "0x" or "0X",
6303 prefix non-zero binary with "0b" or "0B"
7b8dd722
HS
6304
6305For example:
6306
e6bb52fd
TS
6307 printf '<% d>', 12; # prints "< 12>"
6308 printf '<%+d>', 12; # prints "<+12>"
6309 printf '<%6s>', 12; # prints "< 12>"
6310 printf '<%-6s>', 12; # prints "<12 >"
6311 printf '<%06s>', 12; # prints "<000012>"
6312 printf '<%#o>', 12; # prints "<014>"
6313 printf '<%#x>', 12; # prints "<0xc>"
6314 printf '<%#X>', 12; # prints "<0XC>"
6315 printf '<%#b>', 12; # prints "<0b1100>"
6316 printf '<%#B>', 12; # prints "<0B1100>"
7b8dd722 6317
9911cee9
TS
6318When a space and a plus sign are given as the flags at once,
6319a plus sign is used to prefix a positive number.
6320
6321 printf '<%+ d>', 12; # prints "<+12>"
6322 printf '<% +d>', 12; # prints "<+12>"
6323
e6bb52fd
TS
6324When the # flag and a precision are given in the %o conversion,
6325the precision is incremented if it's necessary for the leading "0".
6326
6327 printf '<%#.5o>', 012; # prints "<00012>"
6328 printf '<%#.5o>', 012345; # prints "<012345>"
6329 printf '<%#.0o>', 0; # prints "<0>"
6330
7b8dd722
HS
6331=item vector flag
6332
3b10bc60 6333This flag tells Perl to interpret the supplied string as a vector of
920f3fa9
DM
6334integers, one for each character in the string. Perl applies the format to
6335each integer in turn, then joins the resulting strings with a separator (a
6336dot C<.> by default). This can be useful for displaying ordinal values of
6337characters in arbitrary strings:
7b8dd722 6338
920f3fa9 6339 printf "%vd", "AB\x{100}"; # prints "65.66.256"
7b8dd722
HS
6340 printf "version is v%vd\n", $^V; # Perl's version
6341
6342Put an asterisk C<*> before the C<v> to override the string to
6343use to separate the numbers:
6344
6345 printf "address is %*vX\n", ":", $addr; # IPv6 address
6346 printf "bits are %0*v8b\n", " ", $bits; # random bitstring
6347
6348You can also explicitly specify the argument number to use for
3b10bc60 6349the join string using something like C<*2$v>; for example:
7b8dd722
HS
6350
6351 printf '%*4$vX %*4$vX %*4$vX', @addr[1..3], ":"; # 3 IPv6 addresses
6352
6353=item (minimum) width
6354
6355Arguments are usually formatted to be only as wide as required to
6356display the given value. You can override the width by putting
6357a number here, or get the width from the next argument (with C<*>)
3b10bc60 6358or from a specified argument (e.g., with C<*2$>):
7b8dd722 6359
8f1da26d
TC
6360 printf "<%s>", "a"; # prints "<a>"
6361 printf "<%6s>", "a"; # prints "< a>"
6362 printf "<%*s>", 6, "a"; # prints "< a>"
6363 printf "<%*2$s>", "a", 6; # prints "< a>"
6364 printf "<%2s>", "long"; # prints "<long>" (does not truncate)
7b8dd722 6365
19799a22
GS
6366If a field width obtained through C<*> is negative, it has the same
6367effect as the C<-> flag: left-justification.
74a77017 6368
7b8dd722 6369=item precision, or maximum width
d74e8afc 6370X<precision>
7b8dd722 6371
6c8c9a8e 6372You can specify a precision (for numeric conversions) or a maximum
7b8dd722 6373width (for string conversions) by specifying a C<.> followed by a number.
8f1da26d 6374For floating-point formats except C<g> and C<G>, this specifies
3b10bc60 6375how many places right of the decimal point to show (the default being 6).
6376For example:
7b8dd722
HS
6377
6378 # these examples are subject to system-specific variation
6379 printf '<%f>', 1; # prints "<1.000000>"
6380 printf '<%.1f>', 1; # prints "<1.0>"
6381 printf '<%.0f>', 1; # prints "<1>"
6382 printf '<%e>', 10; # prints "<1.000000e+01>"
6383 printf '<%.1e>', 10; # prints "<1.0e+01>"
6384
3b10bc60 6385For "g" and "G", this specifies the maximum number of digits to show,
7698aede 6386including those prior to the decimal point and those after it; for
3b10bc60 6387example:
1ff2d182 6388
3b10bc60 6389 # These examples are subject to system-specific variation.
1ff2d182
AS
6390 printf '<%g>', 1; # prints "<1>"
6391 printf '<%.10g>', 1; # prints "<1>"
6392 printf '<%g>', 100; # prints "<100>"
6393 printf '<%.1g>', 100; # prints "<1e+02>"
6394 printf '<%.2g>', 100.01; # prints "<1e+02>"
6395 printf '<%.5g>', 100.01; # prints "<100.01>"
6396 printf '<%.4g>', 100.01; # prints "<100>"
6397
7b8dd722 6398For integer conversions, specifying a precision implies that the
9911cee9
TS
6399output of the number itself should be zero-padded to this width,
6400where the 0 flag is ignored:
6401
6402 printf '<%.6d>', 1; # prints "<000001>"
6403 printf '<%+.6d>', 1; # prints "<+000001>"
6404 printf '<%-10.6d>', 1; # prints "<000001 >"
6405 printf '<%10.6d>', 1; # prints "< 000001>"
6406 printf '<%010.6d>', 1; # prints "< 000001>"
6407 printf '<%+10.6d>', 1; # prints "< +000001>"
7b8dd722
HS
6408
6409 printf '<%.6x>', 1; # prints "<000001>"
6410 printf '<%#.6x>', 1; # prints "<0x000001>"
6411 printf '<%-10.6x>', 1; # prints "<000001 >"
9911cee9
TS
6412 printf '<%10.6x>', 1; # prints "< 000001>"
6413 printf '<%010.6x>', 1; # prints "< 000001>"
6414 printf '<%#10.6x>', 1; # prints "< 0x000001>"
7b8dd722
HS
6415
6416For string conversions, specifying a precision truncates the string
3b10bc60 6417to fit the specified width:
7b8dd722
HS
6418
6419 printf '<%.5s>', "truncated"; # prints "<trunc>"
6420 printf '<%10.5s>', "truncated"; # prints "< trunc>"
6421
6422You can also get the precision from the next argument using C<.*>:
b22c7a20 6423
7b8dd722
HS
6424 printf '<%.6x>', 1; # prints "<000001>"
6425 printf '<%.*x>', 6, 1; # prints "<000001>"
6426
3b10bc60 6427If a precision obtained through C<*> is negative, it counts
6428as having no precision at all.
9911cee9
TS
6429
6430 printf '<%.*s>', 7, "string"; # prints "<string>"
6431 printf '<%.*s>', 3, "string"; # prints "<str>"
6432 printf '<%.*s>', 0, "string"; # prints "<>"
6433 printf '<%.*s>', -1, "string"; # prints "<string>"
6434
6435 printf '<%.*d>', 1, 0; # prints "<0>"
6436 printf '<%.*d>', 0, 0; # prints "<>"
6437 printf '<%.*d>', -1, 0; # prints "<0>"
6438
7b8dd722 6439You cannot currently get the precision from a specified number,
3b10bc60 6440but it is intended that this will be possible in the future, for
6441example using C<.*2$>:
7b8dd722 6442
3b10bc60 6443 printf "<%.*2$x>", 1, 6; # INVALID, but in future will print "<000001>"
7b8dd722
HS
6444
6445=item size
6446
6447For numeric conversions, you can specify the size to interpret the
1ff2d182
AS
6448number as using C<l>, C<h>, C<V>, C<q>, C<L>, or C<ll>. For integer
6449conversions (C<d u o x X b i D U O>), numbers are usually assumed to be
6450whatever the default integer size is on your platform (usually 32 or 64
6451bits), but you can override this to use instead one of the standard C types,
6452as supported by the compiler used to build Perl:
7b8dd722 6453
3d21943e
JV
6454 hh interpret integer as C type "char" or "unsigned char"
6455 on Perl 5.14 or later
7b8dd722 6456 h interpret integer as C type "short" or "unsigned short"
1c2e8cca 6457 j interpret integer as C type "intmax_t" on Perl 5.14
3d21943e
JV
6458 or later, and only with a C99 compiler (unportable)
6459 l interpret integer as C type "long" or "unsigned long"
6460 q, L, or ll interpret integer as C type "long long", "unsigned long long",
6461 or "quad" (typically 64-bit integers)
1c2e8cca
MG
6462 t interpret integer as C type "ptrdiff_t" on Perl 5.14 or later
6463 z interpret integer as C type "size_t" on Perl 5.14 or later
3d21943e
JV
6464
6465As of 5.14, none of these raises an exception if they are not supported on
6466your platform. However, if warnings are enabled, a warning of the
6467C<printf> warning class is issued on an unsupported conversion flag.
6468Should you instead prefer an exception, do this:
6469
6470 use warnings FATAL => "printf";
6471
6472If you would like to know about a version dependency before you
6473start running the program, put something like this at its top:
6474
6475 use 5.014; # for hh/j/t/z/ printf modifiers
7b8dd722 6476
3d21943e 6477You can find out whether your Perl supports quads via L<Config>:
7b8dd722 6478
5ed4f2ec 6479 use Config;
3b10bc60 6480 if ($Config{use64bitint} eq "define" || $Config{longsize} >= 8) {
6481 print "Nice quads!\n";
6482 }
1ff2d182 6483
3b10bc60 6484For floating-point conversions (C<e f g E F G>), numbers are usually assumed
6485to be the default floating-point size on your platform (double or long double),
6486but you can force "long double" with C<q>, C<L>, or C<ll> if your
1ff2d182
AS
6487platform supports them. You can find out whether your Perl supports long
6488doubles via L<Config>:
6489
5ed4f2ec 6490 use Config;
3b10bc60 6491 print "long doubles\n" if $Config{d_longdbl} eq "define";
1ff2d182 6492
3b10bc60 6493You can find out whether Perl considers "long double" to be the default
6494floating-point size to use on your platform via L<Config>:
1ff2d182 6495
3b10bc60 6496 use Config;
6497 if ($Config{uselongdouble} eq "define") {
6498 print "long doubles by default\n";
6499 }
1ff2d182 6500
3b10bc60 6501It can also be that long doubles and doubles are the same thing:
1ff2d182
AS
6502
6503 use Config;
6504 ($Config{doublesize} == $Config{longdblsize}) &&
6505 print "doubles are long doubles\n";
6506
3b10bc60 6507The size specifier C<V> has no effect for Perl code, but is supported for
6508compatibility with XS code. It means "use the standard size for a Perl
6509integer or floating-point number", which is the default.
7b8dd722 6510
a472f209
HS
6511=item order of arguments
6512
3b10bc60 6513Normally, sprintf() takes the next unused argument as the value to
a472f209
HS
6514format for each format specification. If the format specification
6515uses C<*> to require additional arguments, these are consumed from
3b10bc60 6516the argument list in the order they appear in the format
6517specification I<before> the value to format. Where an argument is
6518specified by an explicit index, this does not affect the normal
6519order for the arguments, even when the explicitly specified index
6520would have been the next argument.
a472f209
HS
6521
6522So:
6523
3b10bc60 6524 printf "<%*.*s>", $a, $b, $c;
a472f209 6525
3b10bc60 6526uses C<$a> for the width, C<$b> for the precision, and C<$c>
6527as the value to format; while:
a472f209 6528
3b10bc60 6529 printf "<%*1$.*s>", $a, $b;
a472f209 6530
3b10bc60 6531would use C<$a> for the width and precision, and C<$b> as the
a472f209
HS
6532value to format.
6533
3b10bc60 6534Here are some more examples; be aware that when using an explicit
6535index, the C<$> may need escaping:
a472f209 6536
5ed4f2ec 6537 printf "%2\$d %d\n", 12, 34; # will print "34 12\n"
6538 printf "%2\$d %d %d\n", 12, 34; # will print "34 12 34\n"
6539 printf "%3\$d %d %d\n", 12, 34, 56; # will print "56 12 34\n"
6540 printf "%2\$*3\$d %d\n", 12, 34, 3; # will print " 34 12\n"
a472f209 6541
7b8dd722 6542=back
b22c7a20 6543
3b10bc60 6544If C<use locale> is in effect and POSIX::setlocale() has been called,
6545the character used for the decimal separator in formatted floating-point
6546numbers is affected by the LC_NUMERIC locale. See L<perllocale>
7e4353e9 6547and L<POSIX>.
a0d0e21e
LW
6548
6549=item sqrt EXPR
d74e8afc 6550X<sqrt> X<root> X<square root>
a0d0e21e 6551
54310121 6552=item sqrt
bbce6d69 6553
3b10bc60 6554Return the positive square root of EXPR. If EXPR is omitted, uses
6555C<$_>. Works only for non-negative operands unless you've
6556loaded the C<Math::Complex> module.
2b5ab1e7
TC
6557
6558 use Math::Complex;
3b10bc60 6559 print sqrt(-4); # prints 2i
a0d0e21e
LW
6560
6561=item srand EXPR
d74e8afc 6562X<srand> X<seed> X<randseed>
a0d0e21e 6563
93dc8474
CS
6564=item srand
6565
83832992 6566Sets and returns the random number seed for the C<rand> operator.
0686c0b8 6567
0686c0b8
JH
6568The point of the function is to "seed" the C<rand> function so that
6569C<rand> can produce a different sequence each time you run your
83832992
KW
6570program. When called with a parameter, C<srand> uses that for the seed;
6571otherwise it (semi-)randomly chooses a seed. In either case, starting with
6572Perl 5.14, it returns the seed.
6573
6574If C<srand()> is not called explicitly, it is called implicitly without a
6575parameter at the first use of the C<rand> operator. However, this was not true
6576of versions of Perl before 5.004, so if your script will run under older
6577Perl versions, it should call C<srand>; otherwise most programs won't call
6578C<srand()> at all.
6579
6580But there are a few situations in recent Perls where programs are likely to
6581want to call C<srand>. One is for generating predictable results generally for
6582testing or debugging. There, you use C<srand($seed)>, with the same C<$seed>
6583each time. Another other case is where you need a cryptographically-strong
6584starting point rather than the generally acceptable default, which is based on
6585time of day, process ID, and memory allocation, or the F</dev/urandom> device
6586if available. And still another case is that you may want to call C<srand()>
6587after a C<fork()> to avoid child processes sharing the same seed value as the
6588parent (and consequently each other).
6589
6590Do B<not> call C<srand()> (i.e., without an argument) more than once per
d460397b 6591process. The internal state of the random number generator should
0686c0b8 6592contain more entropy than can be provided by any seed, so calling
83832992 6593C<srand()> again actually I<loses> randomness.
0686c0b8 6594
e0b236fe
JH
6595Most implementations of C<srand> take an integer and will silently
6596truncate decimal numbers. This means C<srand(42)> will usually
6597produce the same results as C<srand(42.1)>. To be safe, always pass
6598C<srand> an integer.
0686c0b8
JH
6599
6600In versions of Perl prior to 5.004 the default seed was just the
6601current C<time>. This isn't a particularly good seed, so many old
6602programs supply their own seed value (often C<time ^ $$> or C<time ^
6603($$ + ($$ << 15))>), but that isn't necessary any more.
93dc8474 6604
cf264981
SP
6605For cryptographic purposes, however, you need something much more random
6606than the default seed. Checksumming the compressed output of one or more
2f9daede
TP
6607rapidly changing operating system status programs is the usual method. For
6608example:
28757baa 6609
784d6566 6610 srand (time ^ $$ ^ unpack "%L*", `ps axww | gzip -f`);
28757baa 6611
83918a83
RGS
6612If you're particularly concerned with this, search the CPAN for
6613random number generator modules instead of rolling out your own.
0078ec44 6614
54310121 6615Frequently called programs (like CGI scripts) that simply use
28757baa 6616
6617 time ^ $$
6618
54310121 6619for a seed can fall prey to the mathematical property that
28757baa 6620
6621 a^b == (a+1)^(b+1)
6622
0078ec44 6623one-third of the time. So don't do that.
f86702cc 6624
83832992
KW
6625A typical use of the returned seed is for a test program which has too many
6626combinations to test comprehensively in the time available to it each run. It
6627can test a random subset each time, and should there be a failure, log the seed
8f1da26d 6628used for that run so that it can later be used to reproduce the same results.
83832992 6629
a0d0e21e 6630=item stat FILEHANDLE
435fbc73 6631X<stat> X<file, status> X<ctime>
a0d0e21e
LW
6632
6633=item stat EXPR
6634
5228a96c
SP
6635=item stat DIRHANDLE
6636
54310121 6637=item stat
bbce6d69 6638
1d2dff63 6639Returns a 13-element list giving the status info for a file, either
5228a96c 6640the file opened via FILEHANDLE or DIRHANDLE, or named by EXPR. If EXPR is
8f1da26d 6641omitted, it stats C<$_> (not C<_>!). Returns the empty list if C<stat> fails. Typically
5228a96c 6642used as follows:
a0d0e21e
LW
6643
6644 ($dev,$ino,$mode,$nlink,$uid,$gid,$rdev,$size,
6645 $atime,$mtime,$ctime,$blksize,$blocks)
6646 = stat($filename);
6647
54310121 6648Not all fields are supported on all filesystem types. Here are the
61967be2 6649meanings of the fields:
c07a80fd 6650
54310121 6651 0 dev device number of filesystem
6652 1 ino inode number
6653 2 mode file mode (type and permissions)
6654 3 nlink number of (hard) links to the file
6655 4 uid numeric user ID of file's owner
6656 5 gid numeric group ID of file's owner
6657 6 rdev the device identifier (special files only)
6658 7 size total size of file, in bytes
1c74f1bd
GS
6659 8 atime last access time in seconds since the epoch
6660 9 mtime last modify time in seconds since the epoch
df2a7e48 6661 10 ctime inode change time in seconds since the epoch (*)
54310121 6662 11 blksize preferred block size for file system I/O
6663 12 blocks actual number of blocks allocated
c07a80fd 6664
6665(The epoch was at 00:00 January 1, 1970 GMT.)
6666
3e2557b2
RGS
6667(*) Not all fields are supported on all filesystem types. Notably, the
6668ctime field is non-portable. In particular, you cannot expect it to be a
8f1da26d 6669"creation time"; see L<perlport/"Files and Filesystems"> for details.
df2a7e48 6670
61967be2 6671If C<stat> is passed the special filehandle consisting of an underline, no
a0d0e21e 6672stat is done, but the current contents of the stat structure from the
61967be2 6673last C<stat>, C<lstat>, or filetest are returned. Example:
a0d0e21e
LW
6674
6675 if (-x $file && (($d) = stat(_)) && $d < 0) {
a9a5a0dc 6676 print "$file is executable NFS file\n";
a0d0e21e
LW
6677 }
6678
ca6e1c26
JH
6679(This works on machines only for which the device number is negative
6680under NFS.)
a0d0e21e 6681
2b5ab1e7 6682Because the mode contains both the file type and its permissions, you
b76cc8ba 6683should mask off the file type portion and (s)printf using a C<"%o">
2b5ab1e7
TC
6684if you want to see the real permissions.
6685
6686 $mode = (stat($filename))[2];
6687 printf "Permissions are %04o\n", $mode & 07777;
6688
19799a22 6689In scalar context, C<stat> returns a boolean value indicating success
1d2dff63
GS
6690or failure, and, if successful, sets the information associated with
6691the special filehandle C<_>.
6692
dd184578 6693The L<File::stat> module provides a convenient, by-name access mechanism:
2b5ab1e7
TC
6694
6695 use File::stat;
6696 $sb = stat($filename);
b76cc8ba 6697 printf "File is %s, size is %s, perm %04o, mtime %s\n",
a9a5a0dc
VP
6698 $filename, $sb->size, $sb->mode & 07777,
6699 scalar localtime $sb->mtime;
2b5ab1e7 6700
ca6e1c26
JH
6701You can import symbolic mode constants (C<S_IF*>) and functions
6702(C<S_IS*>) from the Fcntl module:
6703
6704 use Fcntl ':mode';
6705
6706 $mode = (stat($filename))[2];
6707
6708 $user_rwx = ($mode & S_IRWXU) >> 6;
6709 $group_read = ($mode & S_IRGRP) >> 3;
6710 $other_execute = $mode & S_IXOTH;
6711
3155e0b0 6712 printf "Permissions are %04o\n", S_IMODE($mode), "\n";
ca6e1c26
JH
6713
6714 $is_setuid = $mode & S_ISUID;
ad605d16 6715 $is_directory = S_ISDIR($mode);
ca6e1c26
JH
6716
6717You could write the last two using the C<-u> and C<-d> operators.
3b10bc60 6718Commonly available C<S_IF*> constants are:
ca6e1c26
JH
6719
6720 # Permissions: read, write, execute, for user, group, others.
6721
6722 S_IRWXU S_IRUSR S_IWUSR S_IXUSR
6723 S_IRWXG S_IRGRP S_IWGRP S_IXGRP
6724 S_IRWXO S_IROTH S_IWOTH S_IXOTH
61eff3bc 6725
3cee8101 6726 # Setuid/Setgid/Stickiness/SaveText.
7df0fd0b 6727 # Note that the exact meaning of these is system-dependent.
ca6e1c26
JH
6728
6729 S_ISUID S_ISGID S_ISVTX S_ISTXT
6730
7df0fd0b
FC
6731 # File types. Not all are necessarily available on
6732 # your system.
ca6e1c26 6733
7df0fd0b
FC
6734 S_IFREG S_IFDIR S_IFLNK S_IFBLK S_IFCHR
6735 S_IFIFO S_IFSOCK S_IFWHT S_ENFMT
ca6e1c26 6736
7df0fd0b
FC
6737 # The following are compatibility aliases for S_IRUSR,
6738 # S_IWUSR, and S_IXUSR.
ca6e1c26
JH
6739
6740 S_IREAD S_IWRITE S_IEXEC
6741
61967be2 6742and the C<S_IF*> functions are
ca6e1c26 6743
7df0fd0b
FC
6744 S_IMODE($mode) the part of $mode containing the permission
6745 bits and the setuid/setgid/sticky bits
ca6e1c26 6746
7df0fd0b
FC
6747 S_IFMT($mode) the part of $mode containing the file type
6748 which can be bit-anded with (for example)
6749 S_IFREG or with the following functions
ca6e1c26 6750
61967be2 6751 # The operators -f, -d, -l, -b, -c, -p, and -S.
ca6e1c26
JH
6752
6753 S_ISREG($mode) S_ISDIR($mode) S_ISLNK($mode)
6754 S_ISBLK($mode) S_ISCHR($mode) S_ISFIFO($mode) S_ISSOCK($mode)
6755
6756 # No direct -X operator counterpart, but for the first one
6757 # the -g operator is often equivalent. The ENFMT stands for
6758 # record flocking enforcement, a platform-dependent feature.
6759
6760 S_ISENFMT($mode) S_ISWHT($mode)
6761
6762See your native chmod(2) and stat(2) documentation for more details
61967be2 6763about the C<S_*> constants. To get status info for a symbolic link
c837d5b4 6764instead of the target file behind the link, use the C<lstat> function.
ca6e1c26 6765
ea9eb35a
BJ
6766Portability issues: L<perlport/stat>.
6767
36fb85f3
RGS
6768=item state EXPR
6769X<state>
6770
6771=item state TYPE EXPR
6772
6773=item state EXPR : ATTRS
6774
6775=item state TYPE EXPR : ATTRS
6776
4a904372 6777C<state> declares a lexically scoped variable, just like C<my>.
b708784e 6778However, those variables will never be reinitialized, contrary to
36fb85f3
RGS
6779lexical variables that are reinitialized each time their enclosing block
6780is entered.
e476d66f 6781See L<perlsub/"Persistent Private Variables"> for details.
36fb85f3 6782
3b10bc60 6783C<state> variables are enabled only when the C<use feature "state"> pragma
4a904372 6784is in effect, unless the keyword is written as C<CORE::state>.
e476d66f 6785See also L<feature>.
36fb85f3 6786
a0d0e21e 6787=item study SCALAR
d74e8afc 6788X<study>
a0d0e21e
LW
6789
6790=item study
6791
184e9718 6792Takes extra time to study SCALAR (C<$_> if unspecified) in anticipation of
a0d0e21e
LW
6793doing many pattern matches on the string before it is next modified.
6794This may or may not save time, depending on the nature and number of
8f1da26d 6795patterns you are searching and the distribution of character
3b10bc60 6796frequencies in the string to be searched; you probably want to compare
8f1da26d 6797run times with and without it to see which is faster. Those loops
cf264981 6798that scan for many short constant strings (including the constant
4185c919
NC
6799parts of more complex patterns) will benefit most.
6800(The way C<study> works is this: a linked list of every
a0d0e21e 6801character in the string to be searched is made, so we know, for
7660c0ab 6802example, where all the C<'k'> characters are. From each search string,
a0d0e21e
LW
6803the rarest character is selected, based on some static frequency tables
6804constructed from some C programs and English text. Only those places
6805that contain this "rarest" character are examined.)
6806
5a964f20 6807For example, here is a loop that inserts index producing entries
a0d0e21e
LW
6808before any line containing a certain pattern:
6809
6810 while (<>) {
a9a5a0dc
VP
6811 study;
6812 print ".IX foo\n" if /\bfoo\b/;
6813 print ".IX bar\n" if /\bbar\b/;
6814 print ".IX blurfl\n" if /\bblurfl\b/;
6815 # ...
6816 print;
a0d0e21e
LW
6817 }
6818
3b10bc60 6819In searching for C</\bfoo\b/>, only locations in C<$_> that contain C<f>
951ba7fe 6820will be looked at, because C<f> is rarer than C<o>. In general, this is
a0d0e21e
LW
6821a big win except in pathological cases. The only question is whether
6822it saves you more time than it took to build the linked list in the
6823first place.
6824
6825Note that if you have to look for strings that you don't know till
19799a22 6826runtime, you can build an entire loop as a string and C<eval> that to
a0d0e21e 6827avoid recompiling all your patterns all the time. Together with
80d38338 6828undefining C<$/> to input entire files as one record, this can be quite
f86cebdf 6829fast, often faster than specialized programs like fgrep(1). The following
184e9718 6830scans a list of files (C<@files>) for a list of words (C<@words>), and prints
a0d0e21e
LW
6831out the names of those files that contain a match:
6832
6833 $search = 'while (<>) { study;';
6834 foreach $word (@words) {
a9a5a0dc 6835 $search .= "++\$seen{\$ARGV} if /\\b$word\\b/;\n";
a0d0e21e
LW
6836 }
6837 $search .= "}";
6838 @ARGV = @files;
6839 undef $/;
5ed4f2ec 6840 eval $search; # this screams
6841 $/ = "\n"; # put back to normal input delimiter
a0d0e21e 6842 foreach $file (sort keys(%seen)) {
a9a5a0dc 6843 print $file, "\n";
a0d0e21e
LW
6844 }
6845
1d2de774 6846=item sub NAME BLOCK
d74e8afc 6847X<sub>
cb1a09d0 6848
1d2de774 6849=item sub NAME (PROTO) BLOCK
cb1a09d0 6850
1d2de774
JH
6851=item sub NAME : ATTRS BLOCK
6852
6853=item sub NAME (PROTO) : ATTRS BLOCK
6854
8f1da26d
TC
6855This is subroutine definition, not a real function I<per se>. Without a
6856BLOCK it's just a forward declaration. Without a NAME, it's an anonymous
6857function declaration, so does return a value: the CODE ref of the closure
6858just created.
cb1a09d0 6859
1d2de774 6860See L<perlsub> and L<perlref> for details about subroutines and
8f1da26d 6861references; see L<attributes> and L<Attribute::Handlers> for more
1d2de774 6862information about attributes.
cb1a09d0 6863
87275199 6864=item substr EXPR,OFFSET,LENGTH,REPLACEMENT
d74e8afc 6865X<substr> X<substring> X<mid> X<left> X<right>
7b8d334a 6866
87275199 6867=item substr EXPR,OFFSET,LENGTH
a0d0e21e
LW
6868
6869=item substr EXPR,OFFSET
6870
6871Extracts a substring out of EXPR and returns it. First character is at
e1dccc0d 6872offset zero. If OFFSET is negative, starts
8f1da26d
TC
6873that far back from the end of the string. If LENGTH is omitted, returns
6874everything through the end of the string. If LENGTH is negative, leaves that
748a9306
LW
6875many characters off the end of the string.
6876
e1de3ec0 6877 my $s = "The black cat climbed the green tree";
5ed4f2ec 6878 my $color = substr $s, 4, 5; # black
6879 my $middle = substr $s, 4, -11; # black cat climbed the
6880 my $end = substr $s, 14; # climbed the green tree
6881 my $tail = substr $s, -4; # tree
6882 my $z = substr $s, -4, 2; # tr
e1de3ec0 6883
2b5ab1e7 6884You can use the substr() function as an lvalue, in which case EXPR
87275199
GS
6885must itself be an lvalue. If you assign something shorter than LENGTH,
6886the string will shrink, and if you assign something longer than LENGTH,
2b5ab1e7 6887the string will grow to accommodate it. To keep the string the same
3b10bc60 6888length, you may need to pad or chop your value using C<sprintf>.
a0d0e21e 6889
87275199
GS
6890If OFFSET and LENGTH specify a substring that is partly outside the
6891string, only the part within the string is returned. If the substring
6892is beyond either end of the string, substr() returns the undefined
6893value and produces a warning. When used as an lvalue, specifying a
3b10bc60 6894substring that is entirely outside the string raises an exception.
87275199
GS
6895Here's an example showing the behavior for boundary cases:
6896
6897 my $name = 'fred';
5ed4f2ec 6898 substr($name, 4) = 'dy'; # $name is now 'freddy'
3b10bc60 6899 my $null = substr $name, 6, 2; # returns "" (no warning)
5ed4f2ec 6900 my $oops = substr $name, 7; # returns undef, with warning
3b10bc60 6901 substr($name, 7) = 'gap'; # raises an exception
87275199 6902
2b5ab1e7 6903An alternative to using substr() as an lvalue is to specify the
7b8d334a 6904replacement string as the 4th argument. This allows you to replace
2b5ab1e7
TC
6905parts of the EXPR and return what was there before in one operation,
6906just as you can with splice().
7b8d334a 6907
e1de3ec0 6908 my $s = "The black cat climbed the green tree";
5ed4f2ec 6909 my $z = substr $s, 14, 7, "jumped from"; # climbed
e1de3ec0
GS
6910 # $s is now "The black cat jumped from the green tree"
6911
8f1da26d 6912Note that the lvalue returned by the three-argument version of substr() acts as
91f73676
DM
6913a 'magic bullet'; each time it is assigned to, it remembers which part
6914of the original string is being modified; for example:
6915
6916 $x = '1234';
6917 for (substr($x,1,2)) {
5ed4f2ec 6918 $_ = 'a'; print $x,"\n"; # prints 1a4
6919 $_ = 'xyz'; print $x,"\n"; # prints 1xyz4
91f73676 6920 $x = '56789';
5ed4f2ec 6921 $_ = 'pq'; print $x,"\n"; # prints 5pq9
91f73676
DM
6922 }
6923
91f73676
DM
6924Prior to Perl version 5.9.1, the result of using an lvalue multiple times was
6925unspecified.
c67bbae0 6926
a0d0e21e 6927=item symlink OLDFILE,NEWFILE
d74e8afc 6928X<symlink> X<link> X<symbolic link> X<link, symbolic>
a0d0e21e
LW
6929
6930Creates a new filename symbolically linked to the old filename.
7660c0ab 6931Returns C<1> for success, C<0> otherwise. On systems that don't support
3b10bc60 6932symbolic links, raises an exception. To check for that,
a0d0e21e
LW
6933use eval:
6934
2b5ab1e7 6935 $symlink_exists = eval { symlink("",""); 1 };
a0d0e21e 6936
ea9eb35a
BJ
6937Portability issues: L<perlport/symlink>.
6938
5702da47 6939=item syscall NUMBER, LIST
d74e8afc 6940X<syscall> X<system call>
a0d0e21e
LW
6941
6942Calls the system call specified as the first element of the list,
6943passing the remaining elements as arguments to the system call. If
3b10bc60 6944unimplemented, raises an exception. The arguments are interpreted
a0d0e21e
LW
6945as follows: if a given argument is numeric, the argument is passed as
6946an int. If not, the pointer to the string value is passed. You are
6947responsible to make sure a string is pre-extended long enough to
a3cb178b 6948receive any result that might be written into a string. You can't use a
19799a22 6949string literal (or other read-only string) as an argument to C<syscall>
a3cb178b
GS
6950because Perl has to assume that any string pointer might be written
6951through. If your
a0d0e21e 6952integer arguments are not literals and have never been interpreted in a
7660c0ab 6953numeric context, you may need to add C<0> to them to force them to look
19799a22 6954like numbers. This emulates the C<syswrite> function (or vice versa):
a0d0e21e 6955
5ed4f2ec 6956 require 'syscall.ph'; # may need to run h2ph
a3cb178b
GS
6957 $s = "hi there\n";
6958 syscall(&SYS_write, fileno(STDOUT), $s, length $s);
a0d0e21e 6959
3b10bc60 6960Note that Perl supports passing of up to only 14 arguments to your syscall,
6961which in practice should (usually) suffice.
a0d0e21e 6962
fb73857a 6963Syscall returns whatever value returned by the system call it calls.
19799a22 6964If the system call fails, C<syscall> returns C<-1> and sets C<$!> (errno).
8f1da26d
TC
6965Note that some system calls I<can> legitimately return C<-1>. The proper
6966way to handle such calls is to assign C<$!=0> before the call, then
6967check the value of C<$!> if C<syscall> returns C<-1>.
fb73857a 6968
6969There's a problem with C<syscall(&SYS_pipe)>: it returns the file
8f1da26d 6970number of the read end of the pipe it creates, but there is no way
b76cc8ba 6971to retrieve the file number of the other end. You can avoid this
19799a22 6972problem by using C<pipe> instead.
fb73857a 6973
ea9eb35a
BJ
6974Portability issues: L<perlport/syscall>.
6975
c07a80fd 6976=item sysopen FILEHANDLE,FILENAME,MODE
d74e8afc 6977X<sysopen>
c07a80fd 6978
6979=item sysopen FILEHANDLE,FILENAME,MODE,PERMS
6980
8f1da26d
TC
6981Opens the file whose filename is given by FILENAME, and associates it with
6982FILEHANDLE. If FILEHANDLE is an expression, its value is used as the real
6983filehandle wanted; an undefined scalar will be suitably autovivified. This
6984function calls the underlying operating system's I<open>(2) function with the
6985parameters FILENAME, MODE, and PERMS.
c07a80fd 6986
6987The possible values and flag bits of the MODE parameter are
8f1da26d
TC
6988system-dependent; they are available via the standard module C<Fcntl>. See
6989the documentation of your operating system's I<open>(2) syscall to see
6990which values and flag bits are available. You may combine several flags
ea2b5ef6
JH
6991using the C<|>-operator.
6992
6993Some of the most common values are C<O_RDONLY> for opening the file in
6994read-only mode, C<O_WRONLY> for opening the file in write-only mode,
c188b257 6995and C<O_RDWR> for opening the file in read-write mode.
d74e8afc 6996X<O_RDONLY> X<O_RDWR> X<O_WRONLY>
ea2b5ef6 6997
adf5897a 6998For historical reasons, some values work on almost every system
3b10bc60 6999supported by Perl: 0 means read-only, 1 means write-only, and 2
adf5897a 7000means read/write. We know that these values do I<not> work under
7c5ffed3 7001OS/390 & VM/ESA Unix and on the Macintosh; you probably don't want to
4af147f6 7002use them in new code.
c07a80fd 7003
19799a22 7004If the file named by FILENAME does not exist and the C<open> call creates
7660c0ab 7005it (typically because MODE includes the C<O_CREAT> flag), then the value of
5a964f20 7006PERMS specifies the permissions of the newly created file. If you omit
19799a22 7007the PERMS argument to C<sysopen>, Perl uses the octal value C<0666>.
5a964f20 7008These permission values need to be in octal, and are modified by your
0591cd52 7009process's current C<umask>.
d74e8afc 7010X<O_CREAT>
0591cd52 7011
ea2b5ef6
JH
7012In many systems the C<O_EXCL> flag is available for opening files in
7013exclusive mode. This is B<not> locking: exclusiveness means here that
c188b257
PF
7014if the file already exists, sysopen() fails. C<O_EXCL> may not work
7015on network filesystems, and has no effect unless the C<O_CREAT> flag
7016is set as well. Setting C<O_CREAT|O_EXCL> prevents the file from
7017being opened if it is a symbolic link. It does not protect against
7018symbolic links in the file's path.
d74e8afc 7019X<O_EXCL>
c188b257
PF
7020
7021Sometimes you may want to truncate an already-existing file. This
7022can be done using the C<O_TRUNC> flag. The behavior of
7023C<O_TRUNC> with C<O_RDONLY> is undefined.
d74e8afc 7024X<O_TRUNC>
ea2b5ef6 7025
19799a22 7026You should seldom if ever use C<0644> as argument to C<sysopen>, because
2b5ab1e7
TC
7027that takes away the user's option to have a more permissive umask.
7028Better to omit it. See the perlfunc(1) entry on C<umask> for more
7029on this.
c07a80fd 7030
4af147f6 7031Note that C<sysopen> depends on the fdopen() C library function.
e1020413 7032On many Unix systems, fdopen() is known to fail when file descriptors
4af147f6
CS
7033exceed a certain value, typically 255. If you need more file
7034descriptors than that, consider rebuilding Perl to use the C<sfio>
7035library, or perhaps using the POSIX::open() function.
7036
2b5ab1e7 7037See L<perlopentut> for a kinder, gentler explanation of opening files.
28757baa 7038
ea9eb35a
BJ
7039Portability issues: L<perlport/sysopen>.
7040
a0d0e21e 7041=item sysread FILEHANDLE,SCALAR,LENGTH,OFFSET
d74e8afc 7042X<sysread>
a0d0e21e
LW
7043
7044=item sysread FILEHANDLE,SCALAR,LENGTH
7045
3874323d 7046Attempts to read LENGTH bytes of data into variable SCALAR from the
3b10bc60 7047specified FILEHANDLE, using the read(2). It bypasses
3874323d
JH
7048buffered IO, so mixing this with other kinds of reads, C<print>,
7049C<write>, C<seek>, C<tell>, or C<eof> can cause confusion because the
7050perlio or stdio layers usually buffers data. Returns the number of
7051bytes actually read, C<0> at end of file, or undef if there was an
7052error (in the latter case C<$!> is also set). SCALAR will be grown or
7053shrunk so that the last byte actually read is the last byte of the
7054scalar after the read.
ff68c719 7055
7056An OFFSET may be specified to place the read data at some place in the
7057string other than the beginning. A negative OFFSET specifies
9124316e
JH
7058placement at that many characters counting backwards from the end of
7059the string. A positive OFFSET greater than the length of SCALAR
7060results in the string being padded to the required size with C<"\0">
7061bytes before the result of the read is appended.
a0d0e21e 7062
2b5ab1e7 7063There is no syseof() function, which is ok, since eof() doesn't work
80d38338 7064well on device files (like ttys) anyway. Use sysread() and check
19799a22 7065for a return value for 0 to decide whether you're done.
2b5ab1e7 7066
3874323d
JH
7067Note that if the filehandle has been marked as C<:utf8> Unicode
7068characters are read instead of bytes (the LENGTH, OFFSET, and the
5eadf7c5 7069return value of sysread() are in Unicode characters).
3874323d
JH
7070The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer.
7071See L</binmode>, L</open>, and the C<open> pragma, L<open>.
7072
137443ea 7073=item sysseek FILEHANDLE,POSITION,WHENCE
d74e8afc 7074X<sysseek> X<lseek>
137443ea 7075
8f1da26d
TC
7076Sets FILEHANDLE's system position in bytes using lseek(2). FILEHANDLE may
7077be an expression whose value gives the name of the filehandle. The values
7078for WHENCE are C<0> to set the new position to POSITION; C<1> to set the it
7079to the current position plus POSITION; and C<2> to set it to EOF plus
7080POSITION, typically negative.
9124316e
JH
7081
7082Note the I<in bytes>: even if the filehandle has been set to operate
740d4bb2
JW
7083on characters (for example by using the C<:encoding(utf8)> I/O layer),
7084tell() will return byte offsets, not character offsets (because
80d38338 7085implementing that would render sysseek() unacceptably slow).
9124316e 7086
8f1da26d
TC
7087sysseek() bypasses normal buffered IO, so mixing it with reads other
7088than C<sysread> (for example C<< <> >> or read()) C<print>, C<write>,
9124316e 7089C<seek>, C<tell>, or C<eof> may cause confusion.
86989e5d
JH
7090
7091For WHENCE, you may also use the constants C<SEEK_SET>, C<SEEK_CUR>,
7092and C<SEEK_END> (start of the file, current position, end of the file)
7093from the Fcntl module. Use of the constants is also more portable
7094than relying on 0, 1, and 2. For example to define a "systell" function:
7095
5ed4f2ec 7096 use Fcntl 'SEEK_CUR';
7097 sub systell { sysseek($_[0], 0, SEEK_CUR) }
8903cb82 7098
7099Returns the new position, or the undefined value on failure. A position
19799a22
GS
7100of zero is returned as the string C<"0 but true">; thus C<sysseek> returns
7101true on success and false on failure, yet you can still easily determine
8903cb82 7102the new position.
137443ea 7103
a0d0e21e 7104=item system LIST
d74e8afc 7105X<system> X<shell>
a0d0e21e 7106
8bf3b016
GS
7107=item system PROGRAM LIST
7108
19799a22 7109Does exactly the same thing as C<exec LIST>, except that a fork is
8f1da26d 7110done first and the parent process waits for the child process to
80d38338 7111exit. Note that argument processing varies depending on the
19799a22
GS
7112number of arguments. If there is more than one argument in LIST,
7113or if LIST is an array with more than one value, starts the program
7114given by the first element of the list with arguments given by the
7115rest of the list. If there is only one scalar argument, the argument
7116is checked for shell metacharacters, and if there are any, the
7117entire argument is passed to the system's command shell for parsing
7118(this is C</bin/sh -c> on Unix platforms, but varies on other
7119platforms). If there are no shell metacharacters in the argument,
7120it is split into words and passed directly to C<execvp>, which is
7121more efficient.
7122
0f897271
GS
7123Beginning with v5.6.0, Perl will attempt to flush all files opened for
7124output before any operation that may do a fork, but this may not be
7125supported on some platforms (see L<perlport>). To be safe, you may need
7126to set C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method
7127of C<IO::Handle> on any open handles.
a2008d6d 7128
9d6eb86e 7129The return value is the exit status of the program as returned by the
25379e53
RGS
7130C<wait> call. To get the actual exit value, shift right by eight (see
7131below). See also L</exec>. This is I<not> what you want to use to capture
8f1da26d 7132the output from a command; for that you should use merely backticks or
d5a9bfb0 7133C<qx//>, as described in L<perlop/"`STRING`">. Return value of -1
25379e53
RGS
7134indicates a failure to start the program or an error of the wait(2) system
7135call (inspect $! for the reason).
a0d0e21e 7136
1af1c0d6
JV
7137If you'd like to make C<system> (and many other bits of Perl) die on error,
7138have a look at the L<autodie> pragma.
7139
19799a22
GS
7140Like C<exec>, C<system> allows you to lie to a program about its name if
7141you use the C<system PROGRAM LIST> syntax. Again, see L</exec>.
8bf3b016 7142
4c2e8b59
BD
7143Since C<SIGINT> and C<SIGQUIT> are ignored during the execution of
7144C<system>, if you expect your program to terminate on receipt of these
7145signals you will need to arrange to do so yourself based on the return
7146value.
28757baa 7147
7148 @args = ("command", "arg1", "arg2");
54310121 7149 system(@args) == 0
a9a5a0dc 7150 or die "system @args failed: $?"
28757baa 7151
95da743b 7152If you'd like to manually inspect C<system>'s failure, you can check all
1af1c0d6 7153possible failure modes by inspecting C<$?> like this:
28757baa 7154
4ef107a6 7155 if ($? == -1) {
a9a5a0dc 7156 print "failed to execute: $!\n";
4ef107a6
DM
7157 }
7158 elsif ($? & 127) {
a9a5a0dc
VP
7159 printf "child died with signal %d, %s coredump\n",
7160 ($? & 127), ($? & 128) ? 'with' : 'without';
4ef107a6
DM
7161 }
7162 else {
a9a5a0dc 7163 printf "child exited with value %d\n", $? >> 8;
4ef107a6
DM
7164 }
7165
3b10bc60 7166Alternatively, you may inspect the value of C<${^CHILD_ERROR_NATIVE}>
7167with the C<W*()> calls from the POSIX module.
9d6eb86e 7168
3b10bc60 7169When C<system>'s arguments are executed indirectly by the shell,
7170results and return codes are subject to its quirks.
c8db1d39 7171See L<perlop/"`STRING`"> and L</exec> for details.
bb32b41a 7172
0a18a49b
MH
7173Since C<system> does a C<fork> and C<wait> it may affect a C<SIGCHLD>
7174handler. See L<perlipc> for details.
7175
ea9eb35a
BJ
7176Portability issues: L<perlport/system>.
7177
a0d0e21e 7178=item syswrite FILEHANDLE,SCALAR,LENGTH,OFFSET
d74e8afc 7179X<syswrite>
a0d0e21e
LW
7180
7181=item syswrite FILEHANDLE,SCALAR,LENGTH
7182
145d37e2
GA
7183=item syswrite FILEHANDLE,SCALAR
7184
3874323d 7185Attempts to write LENGTH bytes of data from variable SCALAR to the
3b10bc60 7186specified FILEHANDLE, using write(2). If LENGTH is
3874323d 7187not specified, writes whole SCALAR. It bypasses buffered IO, so
9124316e 7188mixing this with reads (other than C<sysread())>, C<print>, C<write>,
3874323d 7189C<seek>, C<tell>, or C<eof> may cause confusion because the perlio and
8f1da26d 7190stdio layers usually buffer data. Returns the number of bytes
3874323d
JH
7191actually written, or C<undef> if there was an error (in this case the
7192errno variable C<$!> is also set). If the LENGTH is greater than the
3b10bc60 7193data available in the SCALAR after the OFFSET, only as much data as is
3874323d 7194available will be written.
ff68c719 7195
7196An OFFSET may be specified to write the data from some part of the
7197string other than the beginning. A negative OFFSET specifies writing
9124316e 7198that many characters counting backwards from the end of the string.
3b10bc60 7199If SCALAR is of length zero, you can only use an OFFSET of 0.
9124316e 7200
8f1da26d 7201B<WARNING>: If the filehandle is marked C<:utf8>, Unicode characters
3b10bc60 7202encoded in UTF-8 are written instead of bytes, and the LENGTH, OFFSET, and
8f1da26d 7203return value of syswrite() are in (UTF8-encoded Unicode) characters.
3874323d 7204The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer.
8f1da26d
TC
7205Alternately, if the handle is not marked with an encoding but you
7206attempt to write characters with code points over 255, raises an exception.
3874323d 7207See L</binmode>, L</open>, and the C<open> pragma, L<open>.
a0d0e21e
LW
7208
7209=item tell FILEHANDLE
d74e8afc 7210X<tell>
a0d0e21e
LW
7211
7212=item tell
7213
9124316e
JH
7214Returns the current position I<in bytes> for FILEHANDLE, or -1 on
7215error. FILEHANDLE may be an expression whose value gives the name of
7216the actual filehandle. If FILEHANDLE is omitted, assumes the file
7217last read.
7218
7219Note the I<in bytes>: even if the filehandle has been set to
740d4bb2
JW
7220operate on characters (for example by using the C<:encoding(utf8)> open
7221layer), tell() will return byte offsets, not character offsets (because
7222that would render seek() and tell() rather slow).
2b5ab1e7 7223
cfd73201
JH
7224The return value of tell() for the standard streams like the STDIN
7225depends on the operating system: it may return -1 or something else.
7226tell() on pipes, fifos, and sockets usually returns -1.
7227
19799a22 7228There is no C<systell> function. Use C<sysseek(FH, 0, 1)> for that.
a0d0e21e 7229
3b10bc60 7230Do not use tell() (or other buffered I/O operations) on a filehandle
8f1da26d 7231that has been manipulated by sysread(), syswrite(), or sysseek().
59c9df15 7232Those functions ignore the buffering, while tell() does not.
9124316e 7233
a0d0e21e 7234=item telldir DIRHANDLE
d74e8afc 7235X<telldir>
a0d0e21e 7236
19799a22
GS
7237Returns the current position of the C<readdir> routines on DIRHANDLE.
7238Value may be given to C<seekdir> to access a particular location in a
cf264981
SP
7239directory. C<telldir> has the same caveats about possible directory
7240compaction as the corresponding system library routine.
a0d0e21e 7241
4633a7c4 7242=item tie VARIABLE,CLASSNAME,LIST
d74e8afc 7243X<tie>
a0d0e21e 7244
4633a7c4
LW
7245This function binds a variable to a package class that will provide the
7246implementation for the variable. VARIABLE is the name of the variable
7247to be enchanted. CLASSNAME is the name of a class implementing objects
19799a22 7248of correct type. Any additional arguments are passed to the C<new>
8a059744
GS
7249method of the class (meaning C<TIESCALAR>, C<TIEHANDLE>, C<TIEARRAY>,
7250or C<TIEHASH>). Typically these are arguments such as might be passed
19799a22
GS
7251to the C<dbm_open()> function of C. The object returned by the C<new>
7252method is also returned by the C<tie> function, which would be useful
8a059744 7253if you want to access other methods in CLASSNAME.
a0d0e21e 7254
19799a22 7255Note that functions such as C<keys> and C<values> may return huge lists
1d2dff63 7256when used on large objects, like DBM files. You may prefer to use the
19799a22 7257C<each> function to iterate over such. Example:
a0d0e21e
LW
7258
7259 # print out history file offsets
4633a7c4 7260 use NDBM_File;
da0045b7 7261 tie(%HIST, 'NDBM_File', '/usr/lib/news/history', 1, 0);
a0d0e21e 7262 while (($key,$val) = each %HIST) {
a9a5a0dc 7263 print $key, ' = ', unpack('L',$val), "\n";
a0d0e21e
LW
7264 }
7265 untie(%HIST);
7266
aa689395 7267A class implementing a hash should have the following methods:
a0d0e21e 7268
4633a7c4 7269 TIEHASH classname, LIST
a0d0e21e
LW
7270 FETCH this, key
7271 STORE this, key, value
7272 DELETE this, key
8a059744 7273 CLEAR this
a0d0e21e
LW
7274 EXISTS this, key
7275 FIRSTKEY this
7276 NEXTKEY this, lastkey
a3bcc51e 7277 SCALAR this
8a059744 7278 DESTROY this
d7da42b7 7279 UNTIE this
a0d0e21e 7280
4633a7c4 7281A class implementing an ordinary array should have the following methods:
a0d0e21e 7282
4633a7c4 7283 TIEARRAY classname, LIST
a0d0e21e
LW
7284 FETCH this, key
7285 STORE this, key, value
8a059744
GS
7286 FETCHSIZE this
7287 STORESIZE this, count
7288 CLEAR this
7289 PUSH this, LIST
7290 POP this
7291 SHIFT this
7292 UNSHIFT this, LIST
7293 SPLICE this, offset, length, LIST
7294 EXTEND this, count
7295 DESTROY this
d7da42b7 7296 UNTIE this
8a059744 7297
3b10bc60 7298A class implementing a filehandle should have the following methods:
8a059744
GS
7299
7300 TIEHANDLE classname, LIST
7301 READ this, scalar, length, offset
7302 READLINE this
7303 GETC this
7304 WRITE this, scalar, length, offset
7305 PRINT this, LIST
7306 PRINTF this, format, LIST
e08f2115
GA
7307 BINMODE this
7308 EOF this
7309 FILENO this
7310 SEEK this, position, whence
7311 TELL this
7312 OPEN this, mode, LIST
8a059744
GS
7313 CLOSE this
7314 DESTROY this
d7da42b7 7315 UNTIE this
a0d0e21e 7316
4633a7c4 7317A class implementing a scalar should have the following methods:
a0d0e21e 7318
4633a7c4 7319 TIESCALAR classname, LIST
54310121 7320 FETCH this,
a0d0e21e 7321 STORE this, value
8a059744 7322 DESTROY this
d7da42b7 7323 UNTIE this
8a059744
GS
7324
7325Not all methods indicated above need be implemented. See L<perltie>,
2b5ab1e7 7326L<Tie::Hash>, L<Tie::Array>, L<Tie::Scalar>, and L<Tie::Handle>.
a0d0e21e 7327
3b10bc60 7328Unlike C<dbmopen>, the C<tie> function will not C<use> or C<require> a module
7329for you; you need to do that explicitly yourself. See L<DB_File>
19799a22 7330or the F<Config> module for interesting C<tie> implementations.
4633a7c4 7331
b687b08b 7332For further details see L<perltie>, L<"tied VARIABLE">.
cc6b7395 7333
f3cbc334 7334=item tied VARIABLE
d74e8afc 7335X<tied>
f3cbc334
RS
7336
7337Returns a reference to the object underlying VARIABLE (the same value
19799a22 7338that was originally returned by the C<tie> call that bound the variable
f3cbc334
RS
7339to a package.) Returns the undefined value if VARIABLE isn't tied to a
7340package.
7341
a0d0e21e 7342=item time
d74e8afc 7343X<time> X<epoch>
a0d0e21e 7344
da0045b7 7345Returns the number of non-leap seconds since whatever time the system
ef4d88db
NC
7346considers to be the epoch, suitable for feeding to C<gmtime> and
7347C<localtime>. On most systems the epoch is 00:00:00 UTC, January 1, 1970;
7348a prominent exception being Mac OS Classic which uses 00:00:00, January 1,
73491904 in the current local time zone for its epoch.
a0d0e21e 7350
8f1da26d
TC
7351For measuring time in better granularity than one second, use the
7352L<Time::HiRes> module from Perl 5.8 onwards (or from CPAN before then), or,
7353if you have gettimeofday(2), you may be able to use the C<syscall>
7354interface of Perl. See L<perlfaq8> for details.
68f8bed4 7355
435fbc73
GS
7356For date and time processing look at the many related modules on CPAN.
7357For a comprehensive date and time representation look at the
7358L<DateTime> module.
7359
a0d0e21e 7360=item times
d74e8afc 7361X<times>
a0d0e21e 7362
8f1da26d
TC
7363Returns a four-element list giving the user and system times in
7364seconds for this process and any exited children of this process.
a0d0e21e
LW
7365
7366 ($user,$system,$cuser,$csystem) = times;
7367
dc19f4fb
MJD
7368In scalar context, C<times> returns C<$user>.
7369
3b10bc60 7370Children's times are only included for terminated children.
2a958fe2 7371
ea9eb35a
BJ
7372Portability issues: L<perlport/times>.
7373
a0d0e21e
LW
7374=item tr///
7375
9f4b9cd0
SP
7376The transliteration operator. Same as C<y///>. See
7377L<perlop/"Quote and Quote-like Operators">.
a0d0e21e
LW
7378
7379=item truncate FILEHANDLE,LENGTH
d74e8afc 7380X<truncate>
a0d0e21e
LW
7381
7382=item truncate EXPR,LENGTH
7383
7384Truncates the file opened on FILEHANDLE, or named by EXPR, to the
3b10bc60 7385specified length. Raises an exception if truncate isn't implemented
8f1da26d 7386on your system. Returns true if successful, C<undef> on error.
a0d0e21e 7387
90ddc76f
MS
7388The behavior is undefined if LENGTH is greater than the length of the
7389file.
7390
8577f58c 7391The position in the file of FILEHANDLE is left unchanged. You may want to
96090e4f 7392call L<seek|/"seek FILEHANDLE,POSITION,WHENCE"> before writing to the file.
8577f58c 7393
ea9eb35a
BJ
7394Portability issues: L<perlport/truncate>.
7395
a0d0e21e 7396=item uc EXPR
d74e8afc 7397X<uc> X<uppercase> X<toupper>
a0d0e21e 7398
54310121 7399=item uc
bbce6d69 7400
a0d0e21e 7401Returns an uppercased version of EXPR. This is the internal function
3980dc9c 7402implementing the C<\U> escape in double-quoted strings.
983ffd37 7403It does not attempt to do titlecase mapping on initial letters. See
3980dc9c 7404L</ucfirst> for that.
a0d0e21e 7405
7660c0ab 7406If EXPR is omitted, uses C<$_>.
bbce6d69 7407
3980dc9c
KW
7408This function behaves the same way under various pragma, such as in a locale,
7409as L</lc> does.
7410
a0d0e21e 7411=item ucfirst EXPR
d74e8afc 7412X<ucfirst> X<uppercase>
a0d0e21e 7413
54310121 7414=item ucfirst
bbce6d69 7415
ad0029c4
JH
7416Returns the value of EXPR with the first character in uppercase
7417(titlecase in Unicode). This is the internal function implementing
3980dc9c 7418the C<\u> escape in double-quoted strings.
a0d0e21e 7419
7660c0ab 7420If EXPR is omitted, uses C<$_>.
bbce6d69 7421
3980dc9c
KW
7422This function behaves the same way under various pragma, such as in a locale,
7423as L</lc> does.
7424
a0d0e21e 7425=item umask EXPR
d74e8afc 7426X<umask>
a0d0e21e
LW
7427
7428=item umask
7429
2f9daede 7430Sets the umask for the process to EXPR and returns the previous value.
eec2d3df
GS
7431If EXPR is omitted, merely returns the current umask.
7432
0591cd52
NT
7433The Unix permission C<rwxr-x---> is represented as three sets of three
7434bits, or three octal digits: C<0750> (the leading 0 indicates octal
b5a41e52 7435and isn't one of the digits). The C<umask> value is such a number
0591cd52
NT
7436representing disabled permissions bits. The permission (or "mode")
7437values you pass C<mkdir> or C<sysopen> are modified by your umask, so
7438even if you tell C<sysopen> to create a file with permissions C<0777>,
8f1da26d 7439if your umask is C<0022>, then the file will actually be created with
0591cd52
NT
7440permissions C<0755>. If your C<umask> were C<0027> (group can't
7441write; others can't read, write, or execute), then passing
8f1da26d
TC
7442C<sysopen> C<0666> would create a file with mode C<0640> (because
7443C<0666 &~ 027> is C<0640>).
0591cd52
NT
7444
7445Here's some advice: supply a creation mode of C<0666> for regular
19799a22
GS
7446files (in C<sysopen>) and one of C<0777> for directories (in
7447C<mkdir>) and executable files. This gives users the freedom of
0591cd52
NT
7448choice: if they want protected files, they might choose process umasks
7449of C<022>, C<027>, or even the particularly antisocial mask of C<077>.
7450Programs should rarely if ever make policy decisions better left to
7451the user. The exception to this is when writing files that should be
7452kept private: mail files, web browser cookies, I<.rhosts> files, and
7453so on.
7454
f86cebdf 7455If umask(2) is not implemented on your system and you are trying to
3b10bc60 7456restrict access for I<yourself> (i.e., C<< (EXPR & 0700) > 0 >>),
7457raises an exception. If umask(2) is not implemented and you are
eec2d3df
GS
7458not trying to restrict access for yourself, returns C<undef>.
7459
7460Remember that a umask is a number, usually given in octal; it is I<not> a
7461string of octal digits. See also L</oct>, if all you have is a string.
a0d0e21e 7462
ea9eb35a
BJ
7463Portability issues: L<perlport/umask>.
7464
a0d0e21e 7465=item undef EXPR
d74e8afc 7466X<undef> X<undefine>
a0d0e21e
LW
7467
7468=item undef
7469
54310121 7470Undefines the value of EXPR, which must be an lvalue. Use only on a
19799a22 7471scalar value, an array (using C<@>), a hash (using C<%>), a subroutine
3b10bc60 7472(using C<&>), or a typeglob (using C<*>). Saying C<undef $hash{$key}>
20408e3c 7473will probably not do what you expect on most predefined variables or
4509d391 7474DBM list values, so don't do that; see L</delete>. Always returns the
20408e3c
GS
7475undefined value. You can omit the EXPR, in which case nothing is
7476undefined, but you still get an undefined value that you could, for
3b10bc60 7477instance, return from a subroutine, assign to a variable, or pass as a
20408e3c 7478parameter. Examples:
a0d0e21e
LW
7479
7480 undef $foo;
f86cebdf 7481 undef $bar{'blurfl'}; # Compare to: delete $bar{'blurfl'};
a0d0e21e 7482 undef @ary;
aa689395 7483 undef %hash;
a0d0e21e 7484 undef &mysub;
20408e3c 7485 undef *xyz; # destroys $xyz, @xyz, %xyz, &xyz, etc.
54310121 7486 return (wantarray ? (undef, $errmsg) : undef) if $they_blew_it;
2f9daede
TP
7487 select undef, undef, undef, 0.25;
7488 ($a, $b, undef, $c) = &foo; # Ignore third value returned
a0d0e21e 7489
5a964f20
TC
7490Note that this is a unary operator, not a list operator.
7491
a0d0e21e 7492=item unlink LIST
dd184578 7493X<unlink> X<delete> X<remove> X<rm> X<del>
a0d0e21e 7494
54310121 7495=item unlink
bbce6d69 7496
40ea6f68 7497Deletes a list of files. On success, it returns the number of files
7498it successfully deleted. On failure, it returns false and sets C<$!>
7499(errno):
a0d0e21e 7500
40ea6f68 7501 my $unlinked = unlink 'a', 'b', 'c';
a0d0e21e 7502 unlink @goners;
40ea6f68 7503 unlink glob "*.bak";
a0d0e21e 7504
40ea6f68 7505On error, C<unlink> will not tell you which files it could not remove.
734c9e01 7506If you want to know which files you could not remove, try them one
40ea6f68 7507at a time:
a0d0e21e 7508
40ea6f68 7509 foreach my $file ( @goners ) {
7510 unlink $file or warn "Could not unlink $file: $!";
3b10bc60 7511 }
40ea6f68 7512
7513Note: C<unlink> will not attempt to delete directories unless you are
7514superuser and the B<-U> flag is supplied to Perl. Even if these
7515conditions are met, be warned that unlinking a directory can inflict
7516damage on your filesystem. Finally, using C<unlink> on directories is
7517not supported on many operating systems. Use C<rmdir> instead.
7518
7519If LIST is omitted, C<unlink> uses C<$_>.
bbce6d69 7520
a0d0e21e 7521=item unpack TEMPLATE,EXPR
d74e8afc 7522X<unpack>
a0d0e21e 7523
13dcffc6
CS
7524=item unpack TEMPLATE
7525
19799a22 7526C<unpack> does the reverse of C<pack>: it takes a string
2b6c5635 7527and expands it out into a list of values.
19799a22 7528(In scalar context, it returns merely the first value produced.)
2b6c5635 7529
eae68503 7530If EXPR is omitted, unpacks the C<$_> string.
3980dc9c 7531See L<perlpacktut> for an introduction to this function.
13dcffc6 7532
2b6c5635
GS
7533The string is broken into chunks described by the TEMPLATE. Each chunk
7534is converted separately to a value. Typically, either the string is a result
f337b084 7535of C<pack>, or the characters of the string represent a C structure of some
2b6c5635
GS
7536kind.
7537
19799a22 7538The TEMPLATE has the same format as in the C<pack> function.
a0d0e21e
LW
7539Here's a subroutine that does substring:
7540
7541 sub substr {
5ed4f2ec 7542 my($what,$where,$howmuch) = @_;
7543 unpack("x$where a$howmuch", $what);
a0d0e21e
LW
7544 }
7545
7546and then there's
7547
f337b084 7548 sub ordinal { unpack("W",$_[0]); } # same as ord()
a0d0e21e 7549
2b6c5635 7550In addition to fields allowed in pack(), you may prefix a field with
61eff3bc
JH
7551a %<number> to indicate that
7552you want a <number>-bit checksum of the items instead of the items
2b6c5635
GS
7553themselves. Default is a 16-bit checksum. Checksum is calculated by
7554summing numeric values of expanded values (for string fields the sum of
8f1da26d 7555C<ord($char)> is taken; for bit fields the sum of zeroes and ones).
2b6c5635
GS
7556
7557For example, the following
a0d0e21e
LW
7558computes the same number as the System V sum program:
7559
19799a22 7560 $checksum = do {
5ed4f2ec 7561 local $/; # slurp!
7562 unpack("%32W*",<>) % 65535;
19799a22 7563 };
a0d0e21e
LW
7564
7565The following efficiently counts the number of set bits in a bit vector:
7566
7567 $setbits = unpack("%32b*", $selectmask);
7568
951ba7fe 7569The C<p> and C<P> formats should be used with care. Since Perl
3160c391
GS
7570has no way of checking whether the value passed to C<unpack()>
7571corresponds to a valid memory location, passing a pointer value that's
7572not known to be valid is likely to have disastrous consequences.
7573
49704364
WL
7574If there are more pack codes or if the repeat count of a field or a group
7575is larger than what the remainder of the input string allows, the result
3b10bc60 7576is not well defined: the repeat count may be decreased, or
7577C<unpack()> may produce empty strings or zeros, or it may raise an exception.
7578If the input string is longer than one described by the TEMPLATE,
7579the remainder of that input string is ignored.
2b6c5635 7580
851646ae 7581See L</pack> for more examples and notes.
5a929a98 7582
98293880 7583=item untie VARIABLE
d74e8afc 7584X<untie>
98293880 7585
01aa884e
KW
7586Breaks the binding between a variable and a package.
7587(See L<tie|/tie VARIABLE,CLASSNAME,LIST>.)
1188453a 7588Has no effect if the variable is not tied.
98293880 7589
532eee96 7590=item unshift ARRAY,LIST
d74e8afc 7591X<unshift>
a0d0e21e 7592
f5a93a43
TC
7593=item unshift EXPR,LIST
7594
19799a22 7595Does the opposite of a C<shift>. Or the opposite of a C<push>,
a0d0e21e 7596depending on how you look at it. Prepends list to the front of the
8f1da26d 7597array and returns the new number of elements in the array.
a0d0e21e 7598
76e4c2bb 7599 unshift(@ARGV, '-e') unless $ARGV[0] =~ /^-/;
a0d0e21e
LW
7600
7601Note the LIST is prepended whole, not one element at a time, so the
19799a22 7602prepended elements stay in the same order. Use C<reverse> to do the
a0d0e21e
LW
7603reverse.
7604
f5a93a43
TC
7605Starting with Perl 5.14, C<unshift> can take a scalar EXPR, which must hold
7606a reference to an unblessed array. The argument will be dereferenced
7607automatically. This aspect of C<unshift> is considered highly
7608experimental. The exact behaviour may change in a future version of Perl.
cba5a3b0 7609
f6c8478c 7610=item use Module VERSION LIST
d74e8afc 7611X<use> X<module> X<import>
f6c8478c
GS
7612
7613=item use Module VERSION
7614
a0d0e21e
LW
7615=item use Module LIST
7616
7617=item use Module
7618
da0045b7 7619=item use VERSION
7620
a0d0e21e
LW
7621Imports some semantics into the current package from the named module,
7622generally by aliasing certain subroutine or variable names into your
7623package. It is exactly equivalent to
7624
6d9d0573 7625 BEGIN { require Module; Module->import( LIST ); }
a0d0e21e 7626
54310121 7627except that Module I<must> be a bareword.
8f1da26d 7628The importation can be made conditional; see L<if>.
da0045b7 7629
bd12309b
DG
7630In the peculiar C<use VERSION> form, VERSION may be either a positive
7631decimal fraction such as 5.006, which will be compared to C<$]>, or a v-string
7632of the form v5.6.1, which will be compared to C<$^V> (aka $PERL_VERSION). An
3b10bc60 7633exception is raised if VERSION is greater than the version of the
c986422f
RGS
7634current Perl interpreter; Perl will not attempt to parse the rest of the
7635file. Compare with L</require>, which can do a similar check at run time.
7636Symmetrically, C<no VERSION> allows you to specify that you want a version
3b10bc60 7637of Perl older than the specified one.
3b825e41
RK
7638
7639Specifying VERSION as a literal of the form v5.6.1 should generally be
7640avoided, because it leads to misleading error messages under earlier
2e8342de
RGS
7641versions of Perl (that is, prior to 5.6.0) that do not support this
7642syntax. The equivalent numeric version should be used instead.
fbc891ce 7643
5ed4f2ec 7644 use v5.6.1; # compile time version check
7645 use 5.6.1; # ditto
7646 use 5.006_001; # ditto; preferred for backwards compatibility
16070b82
GS
7647
7648This is often useful if you need to check the current Perl version before
2e8342de
RGS
7649C<use>ing library modules that won't work with older versions of Perl.
7650(We try not to do this more than we have to.)
da0045b7 7651
3b10bc60 7652Also, if the specified Perl version is greater than or equal to 5.9.5,
c986422f
RGS
7653C<use VERSION> will also load the C<feature> pragma and enable all
7654features available in the requested version. See L<feature>.
3b10bc60 7655Similarly, if the specified Perl version is greater than or equal to
5cc917d6
RGS
76565.11.0, strictures are enabled lexically as with C<use strict> (except
7657that the F<strict.pm> file is not actually loaded).
7dfde25d 7658
19799a22 7659The C<BEGIN> forces the C<require> and C<import> to happen at compile time. The
7660c0ab 7660C<require> makes sure the module is loaded into memory if it hasn't been
3b10bc60 7661yet. The C<import> is not a builtin; it's just an ordinary static method
19799a22 7662call into the C<Module> package to tell the module to import the list of
a0d0e21e 7663features back into the current package. The module can implement its
19799a22
GS
7664C<import> method any way it likes, though most modules just choose to
7665derive their C<import> method via inheritance from the C<Exporter> class that
7666is defined in the C<Exporter> module. See L<Exporter>. If no C<import>
593b9c14
YST
7667method can be found then the call is skipped, even if there is an AUTOLOAD
7668method.
cb1a09d0 7669
31686daf
JP
7670If you do not want to call the package's C<import> method (for instance,
7671to stop your namespace from being altered), explicitly supply the empty list:
cb1a09d0
AD
7672
7673 use Module ();
7674
7675That is exactly equivalent to
7676
5a964f20 7677 BEGIN { require Module }
a0d0e21e 7678
da0045b7 7679If the VERSION argument is present between Module and LIST, then the
71be2cbc 7680C<use> will call the VERSION method in class Module with the given
7681version as an argument. The default VERSION method, inherited from
44dcb63b 7682the UNIVERSAL class, croaks if the given version is larger than the
b76cc8ba 7683value of the variable C<$Module::VERSION>.
f6c8478c
GS
7684
7685Again, there is a distinction between omitting LIST (C<import> called
7686with no arguments) and an explicit empty LIST C<()> (C<import> not
7687called). Note that there is no comma after VERSION!
da0045b7 7688
a0d0e21e
LW
7689Because this is a wide-open interface, pragmas (compiler directives)
7690are also implemented this way. Currently implemented pragmas are:
7691
f3798619 7692 use constant;
4633a7c4 7693 use diagnostics;
f3798619 7694 use integer;
4438c4b7
JH
7695 use sigtrap qw(SEGV BUS);
7696 use strict qw(subs vars refs);
7697 use subs qw(afunc blurfl);
7698 use warnings qw(all);
58c7fc7c 7699 use sort qw(stable _quicksort _mergesort);
a0d0e21e 7700
19799a22 7701Some of these pseudo-modules import semantics into the current
5a964f20
TC
7702block scope (like C<strict> or C<integer>, unlike ordinary modules,
7703which import symbols into the current package (which are effective
7704through the end of the file).
a0d0e21e 7705
c362798e
Z
7706Because C<use> takes effect at compile time, it doesn't respect the
7707ordinary flow control of the code being compiled. In particular, putting
7708a C<use> inside the false branch of a conditional doesn't prevent it
3b10bc60 7709from being processed. If a module or pragma only needs to be loaded
c362798e
Z
7710conditionally, this can be done using the L<if> pragma:
7711
7712 use if $] < 5.008, "utf8";
7713 use if WANT_WARNINGS, warnings => qw(all);
7714
8f1da26d 7715There's a corresponding C<no> declaration that unimports meanings imported
19799a22 7716by C<use>, i.e., it calls C<unimport Module LIST> instead of C<import>.
80d38338
TC
7717It behaves just as C<import> does with VERSION, an omitted or empty LIST,
7718or no unimport method being found.
a0d0e21e
LW
7719
7720 no integer;
7721 no strict 'refs';
4438c4b7 7722 no warnings;
a0d0e21e 7723
e0de7c21 7724Care should be taken when using the C<no VERSION> form of C<no>. It is
8f1da26d 7725I<only> meant to be used to assert that the running Perl is of a earlier
e0de7c21
RS
7726version than its argument and I<not> to undo the feature-enabling side effects
7727of C<use VERSION>.
7728
ac634a9a 7729See L<perlmodlib> for a list of standard modules and pragmas. See L<perlrun>
3b10bc60 7730for the C<-M> and C<-m> command-line options to Perl that give C<use>
31686daf 7731functionality from the command-line.
a0d0e21e
LW
7732
7733=item utime LIST
d74e8afc 7734X<utime>
a0d0e21e
LW
7735
7736Changes the access and modification times on each file of a list of
8f1da26d 7737files. The first two elements of the list must be the NUMERIC access
a0d0e21e 7738and modification times, in that order. Returns the number of files
46cdf678 7739successfully changed. The inode change time of each file is set
4bc2a53d 7740to the current time. For example, this code has the same effect as the
a4142048
WL
7741Unix touch(1) command when the files I<already exist> and belong to
7742the user running the program:
a0d0e21e
LW
7743
7744 #!/usr/bin/perl
2c21a326
GA
7745 $atime = $mtime = time;
7746 utime $atime, $mtime, @ARGV;
4bc2a53d 7747
3b10bc60 7748Since Perl 5.7.2, if the first two elements of the list are C<undef>,
7749the utime(2) syscall from your C library is called with a null second
4bc2a53d 7750argument. On most systems, this will set the file's access and
80d38338 7751modification times to the current time (i.e., equivalent to the example
3b10bc60 7752above) and will work even on files you don't own provided you have write
a4142048 7753permission:
c6f7b413 7754
3b10bc60 7755 for $file (@ARGV) {
7756 utime(undef, undef, $file)
7757 || warn "couldn't touch $file: $!";
7758 }
c6f7b413 7759
2c21a326
GA
7760Under NFS this will use the time of the NFS server, not the time of
7761the local machine. If there is a time synchronization problem, the
7762NFS server and local machine will have different times. The Unix
7763touch(1) command will in fact normally use this form instead of the
7764one shown in the first example.
7765
3b10bc60 7766Passing only one of the first two elements as C<undef> is
7767equivalent to passing a 0 and will not have the effect
7768described when both are C<undef>. This also triggers an
2c21a326
GA
7769uninitialized warning.
7770
3b10bc60 7771On systems that support futimes(2), you may pass filehandles among the
7772files. On systems that don't support futimes(2), passing filehandles raises
7773an exception. Filehandles must be passed as globs or glob references to be
7774recognized; barewords are considered filenames.
e96b369d 7775
ea9eb35a
BJ
7776Portability issues: L<perlport/utime>.
7777
532eee96 7778=item values HASH
d74e8afc 7779X<values>
a0d0e21e 7780
532eee96 7781=item values ARRAY
aeedbbed 7782
f5a93a43
TC
7783=item values EXPR
7784
aeedbbed 7785Returns a list consisting of all the values of the named hash, or the values
8f1da26d 7786of an array. (In scalar context, returns the number of values.)
504f80c1
JH
7787
7788The values are returned in an apparently random order. The actual
3b10bc60 7789random order is subject to change in future versions of Perl, but it
504f80c1 7790is guaranteed to be the same order as either the C<keys> or C<each>
4546b9e6
JH
7791function would produce on the same (unmodified) hash. Since Perl
77925.8.1 the ordering is different even between different runs of Perl
7793for security reasons (see L<perlsec/"Algorithmic Complexity Attacks">).
504f80c1 7794
aeedbbed 7795As a side effect, calling values() resets the HASH or ARRAY's internal
8f1da26d 7796iterator;
2f65b2f0 7797see L</each>. (In particular, calling values() in void context resets
aeedbbed 7798the iterator with no other overhead. Apart from resetting the iterator,
80d38338 7799C<values @array> in list context is the same as plain C<@array>.
aeedbbed
NC
7800We recommend that you use void context C<keys @array> for this, but reasoned
7801that it taking C<values @array> out would require more documentation than
7802leaving it in.)
7803
8ea1e5d4
GS
7804Note that the values are not copied, which means modifying them will
7805modify the contents of the hash:
2b5ab1e7 7806
5ed4f2ec 7807 for (values %hash) { s/foo/bar/g } # modifies %hash values
8ea1e5d4 7808 for (@hash{keys %hash}) { s/foo/bar/g } # same
2b5ab1e7 7809
f5a93a43
TC
7810Starting with Perl 5.14, C<values> can take a scalar EXPR, which must hold
7811a reference to an unblessed hash or array. The argument will be
7812dereferenced automatically. This aspect of C<values> is considered highly
7813experimental. The exact behaviour may change in a future version of Perl.
cba5a3b0
DG
7814
7815 for (values $hashref) { ... }
7816 for (values $obj->get_arrayref) { ... }
7817
19799a22 7818See also C<keys>, C<each>, and C<sort>.
a0d0e21e
LW
7819
7820=item vec EXPR,OFFSET,BITS
d74e8afc 7821X<vec> X<bit> X<bit vector>
a0d0e21e 7822
e69129f1 7823Treats the string in EXPR as a bit vector made up of elements of
8f1da26d 7824width BITS and returns the value of the element specified by OFFSET
e69129f1
GS
7825as an unsigned integer. BITS therefore specifies the number of bits
7826that are reserved for each element in the bit vector. This must
7827be a power of two from 1 to 32 (or 64, if your platform supports
7828that).
c5a0f51a 7829
b76cc8ba 7830If BITS is 8, "elements" coincide with bytes of the input string.
c73032f5
IZ
7831
7832If BITS is 16 or more, bytes of the input string are grouped into chunks
7833of size BITS/8, and each group is converted to a number as with
b1866b2d 7834pack()/unpack() with big-endian formats C<n>/C<N> (and analogously
c73032f5
IZ
7835for BITS==64). See L<"pack"> for details.
7836
7837If bits is 4 or less, the string is broken into bytes, then the bits
7838of each byte are broken into 8/BITS groups. Bits of a byte are
7839numbered in a little-endian-ish way, as in C<0x01>, C<0x02>,
7840C<0x04>, C<0x08>, C<0x10>, C<0x20>, C<0x40>, C<0x80>. For example,
7841breaking the single input byte C<chr(0x36)> into two groups gives a list
7842C<(0x6, 0x3)>; breaking it into 4 groups gives C<(0x2, 0x1, 0x3, 0x0)>.
7843
81e118e0
JH
7844C<vec> may also be assigned to, in which case parentheses are needed
7845to give the expression the correct precedence as in
22dc801b 7846
7847 vec($image, $max_x * $x + $y, 8) = 3;
a0d0e21e 7848
fe58ced6
MG
7849If the selected element is outside the string, the value 0 is returned.
7850If an element off the end of the string is written to, Perl will first
7851extend the string with sufficiently many zero bytes. It is an error
80d38338 7852to try to write off the beginning of the string (i.e., negative OFFSET).
fac70343 7853
2575c402
JW
7854If the string happens to be encoded as UTF-8 internally (and thus has
7855the UTF8 flag set), this is ignored by C<vec>, and it operates on the
7856internal byte string, not the conceptual character string, even if you
7857only have characters with values less than 256.
246fae53 7858
fac70343
GS
7859Strings created with C<vec> can also be manipulated with the logical
7860operators C<|>, C<&>, C<^>, and C<~>. These operators will assume a bit
7861vector operation is desired when both operands are strings.
c5a0f51a 7862See L<perlop/"Bitwise String Operators">.
a0d0e21e 7863
7660c0ab 7864The following code will build up an ASCII string saying C<'PerlPerlPerl'>.
19799a22 7865The comments show the string after each step. Note that this code works
cca87523
GS
7866in the same way on big-endian or little-endian machines.
7867
7868 my $foo = '';
5ed4f2ec 7869 vec($foo, 0, 32) = 0x5065726C; # 'Perl'
e69129f1
GS
7870
7871 # $foo eq "Perl" eq "\x50\x65\x72\x6C", 32 bits
5ed4f2ec 7872 print vec($foo, 0, 8); # prints 80 == 0x50 == ord('P')
7873
7874 vec($foo, 2, 16) = 0x5065; # 'PerlPe'
7875 vec($foo, 3, 16) = 0x726C; # 'PerlPerl'
7876 vec($foo, 8, 8) = 0x50; # 'PerlPerlP'
7877 vec($foo, 9, 8) = 0x65; # 'PerlPerlPe'
7878 vec($foo, 20, 4) = 2; # 'PerlPerlPe' . "\x02"
7879 vec($foo, 21, 4) = 7; # 'PerlPerlPer'
7880 # 'r' is "\x72"
7881 vec($foo, 45, 2) = 3; # 'PerlPerlPer' . "\x0c"
7882 vec($foo, 93, 1) = 1; # 'PerlPerlPer' . "\x2c"
7883 vec($foo, 94, 1) = 1; # 'PerlPerlPerl'
7884 # 'l' is "\x6c"
cca87523 7885
19799a22 7886To transform a bit vector into a string or list of 0's and 1's, use these:
a0d0e21e
LW
7887
7888 $bits = unpack("b*", $vector);
7889 @bits = split(//, unpack("b*", $vector));
7890
7660c0ab 7891If you know the exact length in bits, it can be used in place of the C<*>.
a0d0e21e 7892
e69129f1
GS
7893Here is an example to illustrate how the bits actually fall in place:
7894
7895 #!/usr/bin/perl -wl
7896
7897 print <<'EOT';
b76cc8ba 7898 0 1 2 3
e69129f1
GS
7899 unpack("V",$_) 01234567890123456789012345678901
7900 ------------------------------------------------------------------
7901 EOT
7902
7903 for $w (0..3) {
7904 $width = 2**$w;
7905 for ($shift=0; $shift < $width; ++$shift) {
7906 for ($off=0; $off < 32/$width; ++$off) {
7907 $str = pack("B*", "0"x32);
7908 $bits = (1<<$shift);
7909 vec($str, $off, $width) = $bits;
7910 $res = unpack("b*",$str);
7911 $val = unpack("V", $str);
7912 write;
7913 }
7914 }
7915 }
7916
7917 format STDOUT =
7918 vec($_,@#,@#) = @<< == @######### @>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
7919 $off, $width, $bits, $val, $res
7920 .
7921 __END__
7922
80d38338
TC
7923Regardless of the machine architecture on which it runs, the
7924example above should print the following table:
e69129f1 7925
b76cc8ba 7926 0 1 2 3
e69129f1
GS
7927 unpack("V",$_) 01234567890123456789012345678901
7928 ------------------------------------------------------------------
7929 vec($_, 0, 1) = 1 == 1 10000000000000000000000000000000
7930 vec($_, 1, 1) = 1 == 2 01000000000000000000000000000000
7931 vec($_, 2, 1) = 1 == 4 00100000000000000000000000000000
7932 vec($_, 3, 1) = 1 == 8 00010000000000000000000000000000
7933 vec($_, 4, 1) = 1 == 16 00001000000000000000000000000000
7934 vec($_, 5, 1) = 1 == 32 00000100000000000000000000000000
7935 vec($_, 6, 1) = 1 == 64 00000010000000000000000000000000
7936 vec($_, 7, 1) = 1 == 128 00000001000000000000000000000000
7937 vec($_, 8, 1) = 1 == 256 00000000100000000000000000000000
7938 vec($_, 9, 1) = 1 == 512 00000000010000000000000000000000
7939 vec($_,10, 1) = 1 == 1024 00000000001000000000000000000000
7940 vec($_,11, 1) = 1 == 2048 00000000000100000000000000000000
7941 vec($_,12, 1) = 1 == 4096 00000000000010000000000000000000
7942 vec($_,13, 1) = 1 == 8192 00000000000001000000000000000000
7943 vec($_,14, 1) = 1 == 16384 00000000000000100000000000000000
7944 vec($_,15, 1) = 1 == 32768 00000000000000010000000000000000
7945 vec($_,16, 1) = 1 == 65536 00000000000000001000000000000000
7946 vec($_,17, 1) = 1 == 131072 00000000000000000100000000000000
7947 vec($_,18, 1) = 1 == 262144 00000000000000000010000000000000
7948 vec($_,19, 1) = 1 == 524288 00000000000000000001000000000000
7949 vec($_,20, 1) = 1 == 1048576 00000000000000000000100000000000
7950 vec($_,21, 1) = 1 == 2097152 00000000000000000000010000000000
7951 vec($_,22, 1) = 1 == 4194304 00000000000000000000001000000000
7952 vec($_,23, 1) = 1 == 8388608 00000000000000000000000100000000
7953 vec($_,24, 1) = 1 == 16777216 00000000000000000000000010000000
7954 vec($_,25, 1) = 1 == 33554432 00000000000000000000000001000000
7955 vec($_,26, 1) = 1 == 67108864 00000000000000000000000000100000
7956 vec($_,27, 1) = 1 == 134217728 00000000000000000000000000010000
7957 vec($_,28, 1) = 1 == 268435456 00000000000000000000000000001000
7958 vec($_,29, 1) = 1 == 536870912 00000000000000000000000000000100
7959 vec($_,30, 1) = 1 == 1073741824 00000000000000000000000000000010
7960 vec($_,31, 1) = 1 == 2147483648 00000000000000000000000000000001
7961 vec($_, 0, 2) = 1 == 1 10000000000000000000000000000000
7962 vec($_, 1, 2) = 1 == 4 00100000000000000000000000000000
7963 vec($_, 2, 2) = 1 == 16 00001000000000000000000000000000
7964 vec($_, 3, 2) = 1 == 64 00000010000000000000000000000000
7965 vec($_, 4, 2) = 1 == 256 00000000100000000000000000000000
7966 vec($_, 5, 2) = 1 == 1024 00000000001000000000000000000000
7967 vec($_, 6, 2) = 1 == 4096 00000000000010000000000000000000
7968 vec($_, 7, 2) = 1 == 16384 00000000000000100000000000000000
7969 vec($_, 8, 2) = 1 == 65536 00000000000000001000000000000000
7970 vec($_, 9, 2) = 1 == 262144 00000000000000000010000000000000
7971 vec($_,10, 2) = 1 == 1048576 00000000000000000000100000000000
7972 vec($_,11, 2) = 1 == 4194304 00000000000000000000001000000000
7973 vec($_,12, 2) = 1 == 16777216 00000000000000000000000010000000
7974 vec($_,13, 2) = 1 == 67108864 00000000000000000000000000100000
7975 vec($_,14, 2) = 1 == 268435456 00000000000000000000000000001000
7976 vec($_,15, 2) = 1 == 1073741824 00000000000000000000000000000010
7977 vec($_, 0, 2) = 2 == 2 01000000000000000000000000000000
7978 vec($_, 1, 2) = 2 == 8 00010000000000000000000000000000
7979 vec($_, 2, 2) = 2 == 32 00000100000000000000000000000000
7980 vec($_, 3, 2) = 2 == 128 00000001000000000000000000000000
7981 vec($_, 4, 2) = 2 == 512 00000000010000000000000000000000
7982 vec($_, 5, 2) = 2 == 2048 00000000000100000000000000000000
7983 vec($_, 6, 2) = 2 == 8192 00000000000001000000000000000000
7984 vec($_, 7, 2) = 2 == 32768 00000000000000010000000000000000
7985 vec($_, 8, 2) = 2 == 131072 00000000000000000100000000000000
7986 vec($_, 9, 2) = 2 == 524288 00000000000000000001000000000000
7987 vec($_,10, 2) = 2 == 2097152 00000000000000000000010000000000
7988 vec($_,11, 2) = 2 == 8388608 00000000000000000000000100000000
7989 vec($_,12, 2) = 2 == 33554432 00000000000000000000000001000000
7990 vec($_,13, 2) = 2 == 134217728 00000000000000000000000000010000
7991 vec($_,14, 2) = 2 == 536870912 00000000000000000000000000000100
7992 vec($_,15, 2) = 2 == 2147483648 00000000000000000000000000000001
7993 vec($_, 0, 4) = 1 == 1 10000000000000000000000000000000
7994 vec($_, 1, 4) = 1 == 16 00001000000000000000000000000000
7995 vec($_, 2, 4) = 1 == 256 00000000100000000000000000000000
7996 vec($_, 3, 4) = 1 == 4096 00000000000010000000000000000000
7997 vec($_, 4, 4) = 1 == 65536 00000000000000001000000000000000
7998 vec($_, 5, 4) = 1 == 1048576 00000000000000000000100000000000
7999 vec($_, 6, 4) = 1 == 16777216 00000000000000000000000010000000
8000 vec($_, 7, 4) = 1 == 268435456 00000000000000000000000000001000
8001 vec($_, 0, 4) = 2 == 2 01000000000000000000000000000000
8002 vec($_, 1, 4) = 2 == 32 00000100000000000000000000000000
8003 vec($_, 2, 4) = 2 == 512 00000000010000000000000000000000
8004 vec($_, 3, 4) = 2 == 8192 00000000000001000000000000000000
8005 vec($_, 4, 4) = 2 == 131072 00000000000000000100000000000000
8006 vec($_, 5, 4) = 2 == 2097152 00000000000000000000010000000000
8007 vec($_, 6, 4) = 2 == 33554432 00000000000000000000000001000000
8008 vec($_, 7, 4) = 2 == 536870912 00000000000000000000000000000100
8009 vec($_, 0, 4) = 4 == 4 00100000000000000000000000000000
8010 vec($_, 1, 4) = 4 == 64 00000010000000000000000000000000
8011 vec($_, 2, 4) = 4 == 1024 00000000001000000000000000000000
8012 vec($_, 3, 4) = 4 == 16384 00000000000000100000000000000000
8013 vec($_, 4, 4) = 4 == 262144 00000000000000000010000000000000
8014 vec($_, 5, 4) = 4 == 4194304 00000000000000000000001000000000
8015 vec($_, 6, 4) = 4 == 67108864 00000000000000000000000000100000
8016 vec($_, 7, 4) = 4 == 1073741824 00000000000000000000000000000010
8017 vec($_, 0, 4) = 8 == 8 00010000000000000000000000000000
8018 vec($_, 1, 4) = 8 == 128 00000001000000000000000000000000
8019 vec($_, 2, 4) = 8 == 2048 00000000000100000000000000000000
8020 vec($_, 3, 4) = 8 == 32768 00000000000000010000000000000000
8021 vec($_, 4, 4) = 8 == 524288 00000000000000000001000000000000
8022 vec($_, 5, 4) = 8 == 8388608 00000000000000000000000100000000
8023 vec($_, 6, 4) = 8 == 134217728 00000000000000000000000000010000
8024 vec($_, 7, 4) = 8 == 2147483648 00000000000000000000000000000001
8025 vec($_, 0, 8) = 1 == 1 10000000000000000000000000000000
8026 vec($_, 1, 8) = 1 == 256 00000000100000000000000000000000
8027 vec($_, 2, 8) = 1 == 65536 00000000000000001000000000000000
8028 vec($_, 3, 8) = 1 == 16777216 00000000000000000000000010000000
8029 vec($_, 0, 8) = 2 == 2 01000000000000000000000000000000
8030 vec($_, 1, 8) = 2 == 512 00000000010000000000000000000000
8031 vec($_, 2, 8) = 2 == 131072 00000000000000000100000000000000
8032 vec($_, 3, 8) = 2 == 33554432 00000000000000000000000001000000
8033 vec($_, 0, 8) = 4 == 4 00100000000000000000000000000000
8034 vec($_, 1, 8) = 4 == 1024 00000000001000000000000000000000
8035 vec($_, 2, 8) = 4 == 262144 00000000000000000010000000000000
8036 vec($_, 3, 8) = 4 == 67108864 00000000000000000000000000100000
8037 vec($_, 0, 8) = 8 == 8 00010000000000000000000000000000
8038 vec($_, 1, 8) = 8 == 2048 00000000000100000000000000000000
8039 vec($_, 2, 8) = 8 == 524288 00000000000000000001000000000000
8040 vec($_, 3, 8) = 8 == 134217728 00000000000000000000000000010000
8041 vec($_, 0, 8) = 16 == 16 00001000000000000000000000000000
8042 vec($_, 1, 8) = 16 == 4096 00000000000010000000000000000000
8043 vec($_, 2, 8) = 16 == 1048576 00000000000000000000100000000000
8044 vec($_, 3, 8) = 16 == 268435456 00000000000000000000000000001000
8045 vec($_, 0, 8) = 32 == 32 00000100000000000000000000000000
8046 vec($_, 1, 8) = 32 == 8192 00000000000001000000000000000000
8047 vec($_, 2, 8) = 32 == 2097152 00000000000000000000010000000000
8048 vec($_, 3, 8) = 32 == 536870912 00000000000000000000000000000100
8049 vec($_, 0, 8) = 64 == 64 00000010000000000000000000000000
8050 vec($_, 1, 8) = 64 == 16384 00000000000000100000000000000000
8051 vec($_, 2, 8) = 64 == 4194304 00000000000000000000001000000000
8052 vec($_, 3, 8) = 64 == 1073741824 00000000000000000000000000000010
8053 vec($_, 0, 8) = 128 == 128 00000001000000000000000000000000
8054 vec($_, 1, 8) = 128 == 32768 00000000000000010000000000000000
8055 vec($_, 2, 8) = 128 == 8388608 00000000000000000000000100000000
8056 vec($_, 3, 8) = 128 == 2147483648 00000000000000000000000000000001
8057
a0d0e21e 8058=item wait
d74e8afc 8059X<wait>
a0d0e21e 8060
3b10bc60 8061Behaves like wait(2) on your system: it waits for a child
2b5ab1e7 8062process to terminate and returns the pid of the deceased process, or
e5218da5 8063C<-1> if there are no child processes. The status is returned in C<$?>
ca8d723e 8064and C<${^CHILD_ERROR_NATIVE}>.
2b5ab1e7
TC
8065Note that a return value of C<-1> could mean that child processes are
8066being automatically reaped, as described in L<perlipc>.
a0d0e21e 8067
c69ca1d4 8068If you use wait in your handler for $SIG{CHLD} it may accidentally for the
0a18a49b
MH
8069child created by qx() or system(). See L<perlipc> for details.
8070
ea9eb35a
BJ
8071Portability issues: L<perlport/wait>.
8072
a0d0e21e 8073=item waitpid PID,FLAGS
d74e8afc 8074X<waitpid>
a0d0e21e 8075
2b5ab1e7
TC
8076Waits for a particular child process to terminate and returns the pid of
8077the deceased process, or C<-1> if there is no such child process. On some
8078systems, a value of 0 indicates that there are processes still running.
ca8d723e 8079The status is returned in C<$?> and C<${^CHILD_ERROR_NATIVE}>. If you say
a0d0e21e 8080
5f05dabc 8081 use POSIX ":sys_wait_h";
5a964f20 8082 #...
b76cc8ba 8083 do {
a9a5a0dc 8084 $kid = waitpid(-1, WNOHANG);
84b74420 8085 } while $kid > 0;
a0d0e21e 8086
2b5ab1e7
TC
8087then you can do a non-blocking wait for all pending zombie processes.
8088Non-blocking wait is available on machines supporting either the
3b10bc60 8089waitpid(2) or wait4(2) syscalls. However, waiting for a particular
2b5ab1e7
TC
8090pid with FLAGS of C<0> is implemented everywhere. (Perl emulates the
8091system call by remembering the status values of processes that have
8092exited but have not been harvested by the Perl script yet.)
a0d0e21e 8093
2b5ab1e7
TC
8094Note that on some systems, a return value of C<-1> could mean that child
8095processes are being automatically reaped. See L<perlipc> for details,
8096and for other examples.
5a964f20 8097
ea9eb35a
BJ
8098Portability issues: L<perlport/waitpid>.
8099
a0d0e21e 8100=item wantarray
d74e8afc 8101X<wantarray> X<context>
a0d0e21e 8102
cc37eb0b 8103Returns true if the context of the currently executing subroutine or
20f13e4a 8104C<eval> is looking for a list value. Returns false if the context is
cc37eb0b
RGS
8105looking for a scalar. Returns the undefined value if the context is
8106looking for no value (void context).
a0d0e21e 8107
5ed4f2ec 8108 return unless defined wantarray; # don't bother doing more
54310121 8109 my @a = complex_calculation();
8110 return wantarray ? @a : "@a";
a0d0e21e 8111
20f13e4a 8112C<wantarray()>'s result is unspecified in the top level of a file,
3c10abe3
AG
8113in a C<BEGIN>, C<UNITCHECK>, C<CHECK>, C<INIT> or C<END> block, or
8114in a C<DESTROY> method.
20f13e4a 8115
19799a22
GS
8116This function should have been named wantlist() instead.
8117
a0d0e21e 8118=item warn LIST
d74e8afc 8119X<warn> X<warning> X<STDERR>
a0d0e21e 8120
2d6d0015 8121Prints the value of LIST to STDERR. If the last element of LIST does
afd8c9c8
DM
8122not end in a newline, it appends the same file/line number text as C<die>
8123does.
774d564b 8124
a96d0188 8125If the output is empty and C<$@> already contains a value (typically from a
7660c0ab 8126previous eval) that value is used after appending C<"\t...caught">
19799a22
GS
8127to C<$@>. This is useful for staying almost, but not entirely similar to
8128C<die>.
43051805 8129
7660c0ab 8130If C<$@> is empty then the string C<"Warning: Something's wrong"> is used.
43051805 8131
774d564b 8132No message is printed if there is a C<$SIG{__WARN__}> handler
8133installed. It is the handler's responsibility to deal with the message
19799a22 8134as it sees fit (like, for instance, converting it into a C<die>). Most
80d38338 8135handlers must therefore arrange to actually display the
19799a22 8136warnings that they are not prepared to deal with, by calling C<warn>
774d564b 8137again in the handler. Note that this is quite safe and will not
8138produce an endless loop, since C<__WARN__> hooks are not called from
8139inside one.
8140
8141You will find this behavior is slightly different from that of
8142C<$SIG{__DIE__}> handlers (which don't suppress the error text, but can
19799a22 8143instead call C<die> again to change it).
774d564b 8144
8145Using a C<__WARN__> handler provides a powerful way to silence all
8146warnings (even the so-called mandatory ones). An example:
8147
8148 # wipe out *all* compile-time warnings
8149 BEGIN { $SIG{'__WARN__'} = sub { warn $_[0] if $DOWARN } }
8150 my $foo = 10;
8151 my $foo = 20; # no warning about duplicate my $foo,
8152 # but hey, you asked for it!
8153 # no compile-time or run-time warnings before here
8154 $DOWARN = 1;
8155
8156 # run-time warnings enabled after here
8157 warn "\$foo is alive and $foo!"; # does show up
8158
8f1da26d 8159See L<perlvar> for details on setting C<%SIG> entries and for more
2b5ab1e7
TC
8160examples. See the Carp module for other kinds of warnings using its
8161carp() and cluck() functions.
a0d0e21e 8162
15a348aa
NC
8163=item when EXPR BLOCK
8164X<when>
8165
8166=item when BLOCK
8167
8f1da26d
TC
8168C<when> is analogous to the C<case> keyword in other languages. Used with a
8169C<foreach> loop or the experimental C<given> block, C<when> can be used in
8170Perl to implement C<switch>/C<case> like statements. Available as a
8171statement after Perl 5.10 and as a statement modifier after 5.14.
8172Here are three examples:
8173
8174 use v5.10;
8175 foreach (@fruits) {
8176 when (/apples?/) {
8177 say "I like apples."
8178 }
8179 when (/oranges?/) {
8180 say "I don't like oranges."
8181 }
8182 default {
8183 say "I don't like anything"
8184 }
8185 }
15a348aa 8186
8f1da26d
TC
8187 # require 5.14 for when as statement modifier
8188 use v5.14;
8189 foreach (@fruits) {
8190 say "I like apples." when /apples?/;
8191 say "I don't like oranges." when /oranges?;
8192 default { say "I don't like anything" }
8193 }
8194
8195 use v5.10;
15a348aa
NC
8196 given ($fruit) {
8197 when (/apples?/) {
8f1da26d 8198 say "I like apples."
15a348aa
NC
8199 }
8200 when (/oranges?/) {
8f1da26d 8201 say "I don't like oranges."
15a348aa
NC
8202 }
8203 default {
8f1da26d 8204 say "I don't like anything"
15a348aa
NC
8205 }
8206 }
8207
8208See L<perlsyn/"Switch statements"> for detailed information.
8209
a0d0e21e 8210=item write FILEHANDLE
d74e8afc 8211X<write>
a0d0e21e
LW
8212
8213=item write EXPR
8214
8215=item write
8216
5a964f20 8217Writes a formatted record (possibly multi-line) to the specified FILEHANDLE,
a0d0e21e 8218using the format associated with that file. By default the format for
54310121 8219a file is the one having the same name as the filehandle, but the
19799a22 8220format for the current output channel (see the C<select> function) may be set
184e9718 8221explicitly by assigning the name of the format to the C<$~> variable.
a0d0e21e 8222
8f1da26d
TC
8223Top of form processing is handled automatically: if there is insufficient
8224room on the current page for the formatted record, the page is advanced by
8225writing a form feed, a special top-of-page format is used to format the new
8226page header before the record is written. By default, the top-of-page
8227format is the name of the filehandle with "_TOP" appended. This would be a
8228problem with autovivified filehandles, but it may be dynamically set to the
8229format of your choice by assigning the name to the C<$^> variable while
8230that filehandle is selected. The number of lines remaining on the current
8231page is in variable C<$->, which can be set to C<0> to force a new page.
a0d0e21e
LW
8232
8233If FILEHANDLE is unspecified, output goes to the current default output
8234channel, which starts out as STDOUT but may be changed by the
19799a22 8235C<select> operator. If the FILEHANDLE is an EXPR, then the expression
a0d0e21e
LW
8236is evaluated and the resulting string is used to look up the name of
8237the FILEHANDLE at run time. For more on formats, see L<perlform>.
8238
19799a22 8239Note that write is I<not> the opposite of C<read>. Unfortunately.
a0d0e21e
LW
8240
8241=item y///
8242
9f4b9cd0
SP
8243The transliteration operator. Same as C<tr///>. See
8244L<perlop/"Quote and Quote-like Operators">.
a0d0e21e
LW
8245
8246=back
8f1da26d
TC
8247
8248=cut