=head1 NAME

Proc::FastSpawn - fork+exec, or spawn, a subprocess as quickly as possible

=head1 SYNOPSIS

 use Proc::FastSpawn;

 # simple use
 my $pid = spawn "/bin/echo", ["echo", "hello, world"];
 ...
 waitpid $pid, 0;

 # with environment
 my $pid = spawn "/bin/echo", ["echo", "hello, world"], ["PATH=/bin", "HOME=/tmp"];

 # inheriting file descriptors
 pipe R, W or die;
 fd_inherit fileno W;
 my $pid = spawn "/bin/sh", ["sh", "-c", "echo a pipe >&" . fileno W];
 close W;
 print <R>;

=head1 DESCRIPTION

The purpose of this small (in scope and footprint) module is simple:
spawn a subprocess asynchronously as efficiently and/or fast as
possible. Basically the same as calling fork+exec (on POSIX), but
hopefully faster than those two syscalls.

Apart from fork overhead, this module also allows you to fork+exec
programs when otherwise you couldn't - for example, when you use POSIX
threads in your perl process, it generally isn't safe to call
fork from perl, but it is safe to use this module to execute external
processes.

If neither of these is a problem for you, you can safely ignore this
module.

So when is fork+exec not fast enough, how can you do it faster, and why
would it matter?

Forking a process requires making a complete copy of a process. Even
though almost every implementation only copies page tables and not the
memory itself, this is still not free. For example, on my 3.6GHz amd64
box, I can fork a 5GB process only twenty times a second. For a real-time
process that must meet stricter deadlines, this is too slow. For a busy
and big web server, starting CGI scripts might mean unacceptable overhead.

A workaround is to use C<vfork> - this function isn't very portable, but
it avoids the memory copy that C<fork> has to do. Some systems have an
optimised implementation of C<spawn>, and some systems have nothing.

This module tries to abstract these differences away.

As for what improvements to expect - on the 3.6GHz amd64 box that this
module was originally developed on, a 3MB perl process (basically just
perl + Proc::FastSpawn) takes 3.6s to run /bin/true 10000 times using
fork+exec, and only 2.6s when using vfork+exec. In a 22MB process, the
difference is already 5.0s vs 2.6s, and so on.

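A comparison like the one above can be approximated with the following
sketch. It is not the script that produced the numbers quoted here: the
helper names (C<run_fork_exec>, C<run_spawn>, C<time_it>), the run count
and the location of C</bin/true> are assumptions made for illustration,
and the results depend on the machine and on the size of the perl process.

```perl
    use strict;
    use warnings;
    use Time::HiRes ();
    use POSIX ();
    use Proc::FastSpawn;

    my $count = 1000;         # assumption: fewer runs than the 10000 quoted above
    my $prog  = "/bin/true";  # assumption: adjust if true lives elsewhere

    # baseline: plain fork + exec (hypothetical helper)
    sub run_fork_exec {
        my $pid = fork;
        die "fork: $!" unless defined $pid;
        unless ($pid) {
            # child: replace ourselves with the program, exit 127 if that fails
            exec { $prog } "true" or POSIX::_exit (127);
        }
        waitpid $pid, 0;
        return $pid;
    }

    # the same work done through this module (hypothetical helper)
    sub run_spawn {
        my $pid = spawn ($prog, ["true"]);
        die "spawn failed" unless defined $pid;
        waitpid $pid, 0;
        return $pid;
    }

    # run $code $count times and print the elapsed wall-clock time
    sub time_it {
        my ($label, $code) = @_;
        my $start = Time::HiRes::time ();
        $code->() for 1 .. $count;
        printf "%-10s %.3fs for %d runs\n",
               $label, Time::HiRes::time () - $start, $count;
    }

    time_it ("fork+exec", \&run_fork_exec);
    time_it ("spawn",     \&run_spawn);
```

To see the effect of process size, allocate a large string or array
before calling C<time_it> and compare the two timings again.
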
=head1 FUNCTIONS

All the following functions are currently exported by default.

=over 4

=cut

package Proc::FastSpawn;

# only used on WIN32 - maddeningly complex and doesn't even work
sub _quote {
   $_[0] = [@{ $_[0] }]; # make copy

   for (@{ $_[0] }) {
      if (/[\x01-\x20"]/) { # some sources say only space, "\t\n\v need to be escaped, microsoft says space and tab
         s/(\\*)"/$1$1\\"/g; # double + extra escape before "
         s/(\\+)$/$1$1/;     # just double at end
         $_ = '"' . $_ . '"';
      }
   }
}

BEGIN {
   $VERSION = '1.2';

   our @ISA = qw(Exporter);
   our @EXPORT = qw(spawn spawnp fd_inherit);
   require Exporter;

   require XSLoader;
   XSLoader::load (__PACKAGE__, $VERSION);
}

=item $pid = spawn $path, \@argv[, \@envp]

Creates a new process and tries to make it execute C<$path>, with the given
arguments and optionally the given environment variables, similar to
calling fork + execv, or execve.

Returns the PID of the new process if successful. On any error, C<undef>
is currently returned. A failure to execute C<$path> might be reported
either as C<undef> or via a subprocess exit status of C<127>.

=item $pid = spawnp $file, \@argv[, \@envp]

Like C<spawn>, but searches for C<$file> in C<$ENV{PATH}> like the shell
would do.

=item fd_inherit $fileno[, $on]

File descriptors can be inherited by the spawned processes or not. This is
decided on a per file descriptor basis. This module does nothing to any
preexisting handles, but with this call, you can change the state of a
single file descriptor to either be inherited (C<$on> is true or missing)
or not (C<$on> is false).

Free portability pro-tip: it seems native win32 perls ignore C<$^F> and set
all file handles to be inherited by default - but this function can switch
it off.

=back

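Because a failed execution can show up either as an C<undef> return value
or as an exit status of C<127>, callers may want to check both. The
following is a minimal sketch of that pattern using C<spawnp>; it assumes
a program called C<true> can be found in C<$ENV{PATH}>, and the messages
are purely illustrative.

```perl
    use strict;
    use warnings;
    use Proc::FastSpawn;

    # spawnp searches $ENV{PATH}; "true" is assumed to be installed there
    my $pid = spawnp ("true", ["true"]);

    # case 1: the error was reported directly by the call
    defined $pid
       or die "unable to spawn subprocess\n";

    # case 2: the error may only surface in the exit status of the child
    waitpid $pid, 0;
    my $exit = $? >> 8;

    if ($exit == 127) {
       warn "child exited with status 127, the program possibly could not be executed\n";
    } else {
       print "child $pid exited with status $exit\n";
    }
```

Note that a program can also exit with status C<127> for reasons of its
own, so this check is only a heuristic.
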
=head1 PORTABILITY NOTES

On POSIX systems, this module currently calls vfork+exec, spawn, or
fork+exec, depending on the platform. If your platform has a good vfork or
spawn but is misdetected and falls back to slow fork+exec, drop me a note.

On win32, the C<_spawn> family of functions is used, and the module tries
hard to patch the new process into perl's internal pid table, so the pid
returned should work with other Perl functions such as waitpid. Also,
win32 doesn't have a meaningful way to quote arguments containing
"special" characters, so this module tries its best to quote those
strings itself. Other typical platform limitations (such as being able to
only have 64 or so subprocesses) are not worked around.

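To illustrate the kind of quoting involved on win32, the sketch below
applies the same substitutions as this module's internal C<_quote>
function to a few sample arguments. The helper name C<quote_args> and the
sample strings are hypothetical, and unlike C<_quote> the helper returns
a quoted copy instead of replacing its argument. It is pure Perl and
can be run on any platform.

```perl
    use strict;
    use warnings;

    # hypothetical helper mirroring the substitutions used in _quote
    sub quote_args {
       my @args = @_; # work on a copy

       for (@args) {
          # only arguments containing whitespace, control chars or " are touched
          if (/[\x01-\x20"]/) {
             s/(\\*)"/$1$1\\"/g; # double backslashes before a ", then escape the "
             s/(\\+)$/$1$1/;     # double backslashes at the end of the argument
             $_ = '"' . $_ . '"';
          }
       }

       return @args;
    }

    print "$_\n" for quote_args ('plain', 'two words', 'say "hi"', 'C:\\my dir\\');

    # prints:
    # plain
    # "two words"
    # "say \"hi\""
    # "C:\my dir\\"
```
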
=head1 AUTHOR

 Marc Lehmann <schmorp@schmorp.de>
 http://home.schmorp.de/

=cut

1
