ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/App-Staticperl/staticperl.pod
Revision: 1.27
Committed: Tue Dec 21 19:32:34 2010 UTC (13 years, 4 months ago) by root
Branch: MAIN
Changes since 1.26: +20 -15 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.8 staticperl - perl, libc, 100 modules, all in one 500kb file
4 root 1.1
5     =head1 SYNOPSIS
6    
7     staticperl help # print the embedded documentation
8     staticperl fetch # fetch and unpack perl sources
9     staticperl configure # fetch and then configure perl
10     staticperl build # configure and then build perl
11     staticperl install # build and then install perl
12     staticperl clean # clean most intermediate files (restart at configure)
13     staticperl distclean # delete everything installed by this script
14     staticperl cpan # invoke CPAN shell
15     staticperl instmod path... # install unpacked modules
16     staticperl instcpan modulename... # install modules from CPAN
17     staticperl mkbundle <bundle-args...> # see documentation
18     staticperl mkperl <bundle-args...> # see documentation
19 root 1.14 staticperl mkapp appname <bundle-args...> # see documentation
20 root 1.1
21     Typical Examples:
22    
23     staticperl install # fetch, configure, build and install perl
24     staticperl cpan # run interactive cpan shell
25     staticperl mkperl -M '"Config_heavy.pl"' # build a perl that supports -V
26     staticperl mkperl -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI -MURI::http
27     # build a perl with the above modules linked in
28 root 1.14 staticperl mkapp myapp --boot mainprog mymodules
29     # build a binary "myapp" from mainprog and mymodules
30 root 1.1
31     =head1 DESCRIPTION
32    
33 root 1.16 This script helps you to create single-file perl interpreters
34     or applications, or embedding a perl interpreter in your
35     applications. Single-file means that it is fully self-contained - no
36     separate shared objects, no autoload fragments, no .pm or .pl files are
37     needed. And when linking statically, you can create (or embed) a single
38     file that contains perl interpreter, libc, all the modules you need, all
39     the libraries you need and of course your actual program.
40 root 1.1
41 root 1.8 With F<uClibc> and F<upx> on x86, you can create a single 500kb binary
42     that contains perl and 100 modules such as POSIX, AnyEvent, EV, IO::AIO,
43     Coro and so on. Or any other choice of modules.
44 root 1.1
45 root 1.20 To see how this turns out, you can try out smallperl and bigperl, two
46     pre-built static and compressed perl binaries with many and even more
47     modules: just follow the links at L<http://staticperl.schmorp.de/>.
48    
49 root 1.3 The created files do not need write access to the file system (like PAR
50 root 1.1 does). In fact, since this script is in many ways similar to PAR::Packer,
51     here are the differences:
52    
53     =over 4
54    
55     =item * The generated executables are much smaller than PAR created ones.
56    
57     Shared objects and the perl binary contain a lot of extra info, while
58     the static nature of F<staticperl> allows the linker to remove all
59     functionality and meta-info not required by the final executable. Even
60     extensions statically compiled into perl at build time will only be
61     present in the final executable when needed.
62    
63     In addition, F<staticperl> can strip perl sources much more effectively
64     than PAR.
65    
66     =item * The generated executables start much faster.
67    
68     There is no need to unpack files, or even to parse Zip archives (which is
69     slow and memory-consuming business).
70    
71     =item * The generated executables don't need a writable filesystem.
72    
73     F<staticperl> loads all required files directly from memory. There is no
74     need to unpack files into a temporary directory.
75    
76 root 1.17 =item * More control over included files, more burden.
77 root 1.1
78 root 1.3 PAR tries to be maintenance and hassle-free - it tries to include more
79 root 1.17 files than necessary to make sure everything works out of the box. It
80     mostly succeeds at this, but he extra files (such as the unicode database)
81     can take substantial amounts of memory and file size.
82 root 1.1
83     With F<staticperl>, the burden is mostly with the developer - only direct
84     compile-time dependencies and L<AutoLoader> are handled automatically.
85     This means the modules to include often need to be tweaked manually.
86    
87 root 1.17 All this does not preclude more permissive modes to be implemented in
88     the future, but right now, you have to resolve state hidden dependencies
89     manually.
90    
91 root 1.1 =item * PAR works out of the box, F<staticperl> does not.
92    
93     Maintaining your own custom perl build can be a pain in the ass, and while
94     F<staticperl> tries to make this easy, it still requires a custom perl
95     build and possibly fiddling with some modules. PAR is likely to produce
96     results faster.
97    
98 root 1.13 Ok, PAR never has worked for me out of the box, and for some people,
99     F<staticperl> does work out of the box, as they don't count "fiddling with
100     module use lists" against it, but nevertheless, F<staticperl> is certainly
101     a bit more difficult to use.
102    
103 root 1.1 =back
104    
105     =head1 HOW DOES IT WORK?
106    
107     Simple: F<staticperl> downloads, compile and installs a perl version of
108     your choice in F<~/.staticperl>. You can add extra modules either by
109     letting F<staticperl> install them for you automatically, or by using CPAN
110     and doing it interactively. This usually takes 5-10 minutes, depending on
111 root 1.3 the speed of your computer and your internet connection.
112 root 1.1
113     It is possible to do program development at this stage, too.
114    
115     Afterwards, you create a list of files and modules you want to include,
116 root 1.3 and then either build a new perl binary (that acts just like a normal perl
117 root 1.1 except everything is compiled in), or you create bundle files (basically C
118     sources you can use to embed all files into your project).
119    
120 root 1.18 This step is very fast (a few seconds if PPI is not used for stripping, or
121     the stripped files are in the cache), and can be tweaked and repeated as
122     often as necessary.
123 root 1.1
124     =head1 THE F<STATICPERL> SCRIPT
125    
126     This module installs a script called F<staticperl> into your perl
127 root 1.21 binary directory. The script is fully self-contained, and can be
128     used without perl (for example, in an uClibc chroot environment). In
129     fact, it can be extracted from the C<App::Staticperl> distribution
130     tarball as F<bin/staticperl>, without any installation. The
131     newest (possibly alpha) version can also be downloaded from
132     L<http://staticperl.schmorp.de/staticperl>.
133 root 1.1
134     F<staticperl> interprets the first argument as a command to execute,
135     optionally followed by any parameters.
136    
137     There are two command categories: the "phase 1" commands which deal with
138     installing perl and perl modules, and the "phase 2" commands, which deal
139     with creating binaries and bundle files.
140    
141     =head2 PHASE 1 COMMANDS: INSTALLING PERL
142    
143     The most important command is F<install>, which does basically
144     everything. The default is to download and install perl 5.12.2 and a few
145     modules required by F<staticperl> itself, but all this can (and should) be
146     changed - see L<CONFIGURATION>, below.
147    
148     The command
149    
150     staticperl install
151    
152 root 1.24 is normally all you need: It installs the perl interpreter in
153 root 1.1 F<~/.staticperl/perl>. It downloads, configures, builds and installs the
154     perl interpreter if required.
155    
156 root 1.24 Most of the following F<staticperl> subcommands simply run one or more
157     steps of this sequence.
158    
159     If it fails, then most commonly because the compiler options I selected
160     are not supported by your compiler - either edit the F<staticperl> script
161     yourself or create F<~/.staticperl> shell script where your set working
162     C<PERL_CCFLAGS> etc. variables.
163 root 1.1
164 root 1.3 To force recompilation or reinstallation, you need to run F<staticperl
165 root 1.1 distclean> first.
166    
167     =over 4
168    
169 root 1.20 =item F<staticperl version>
170    
171     Prints some info about the version of the F<staticperl> script you are using.
172    
173 root 1.1 =item F<staticperl fetch>
174    
175     Runs only the download and unpack phase, unless this has already happened.
176    
177     =item F<staticperl configure>
178    
179     Configures the unpacked perl sources, potentially after downloading them first.
180    
181     =item F<staticperl build>
182    
183     Builds the configured perl sources, potentially after automatically
184     configuring them.
185    
186     =item F<staticperl install>
187    
188 root 1.3 Wipes the perl installation directory (usually F<~/.staticperl/perl>) and
189     installs the perl distribution, potentially after building it first.
190 root 1.1
191     =item F<staticperl cpan> [args...]
192    
193 root 1.3 Starts an interactive CPAN shell that you can use to install further
194     modules. Installs the perl first if necessary, but apart from that,
195 root 1.1 no magic is involved: you could just as well run it manually via
196     F<~/.staticperl/perl/bin/cpan>.
197    
198     Any additional arguments are simply passed to the F<cpan> command.
199    
200     =item F<staticperl instcpan> module...
201    
202     Tries to install all the modules given and their dependencies, using CPAN.
203    
204     Example:
205    
206     staticperl instcpan EV AnyEvent::HTTPD Coro
207    
208     =item F<staticperl instsrc> directory...
209    
210     In the unlikely case that you have unpacked perl modules around and want
211 root 1.3 to install from these instead of from CPAN, you can do this using this
212 root 1.1 command by specifying all the directories with modules in them that you
213     want to have built.
214    
215     =item F<staticperl clean>
216    
217 root 1.12 Deletes the perl source directory (and potentially cleans up other
218     intermediate files). This can be used to clean up files only needed for
219 root 1.24 building perl, without removing the installed perl interpreter.
220 root 1.12
221     At the moment, it doesn't delete downloaded tarballs.
222 root 1.1
223 root 1.24 The exact semantics of this command will probably change.
224    
225 root 1.1 =item F<staticperl distclean>
226    
227     This wipes your complete F<~/.staticperl> directory. Be careful with this,
228     it nukes your perl download, perl sources, perl distribution and any
229     installed modules. It is useful if you wish to start over "from scratch"
230     or when you want to uninstall F<staticperl>.
231    
232     =back
233    
234     =head2 PHASE 2 COMMANDS: BUILDING PERL BUNDLES
235    
236     Building (linking) a new F<perl> binary is handled by a separate
237     script. To make it easy to use F<staticperl> from a F<chroot>, the script
238     is embedded into F<staticperl>, which will write it out and call for you
239     with any arguments you pass:
240    
241     staticperl mkbundle mkbundle-args...
242    
243     In the oh so unlikely case of something not working here, you
244 root 1.2 can run the script manually as well (by default it is written to
245 root 1.1 F<~/.staticperl/mkbundle>).
246    
247     F<mkbundle> is a more conventional command and expect the argument
248 root 1.3 syntax commonly used on UNIX clones. For example, this command builds
249 root 1.1 a new F<perl> binary and includes F<Config.pm> (for F<perl -V>),
250     F<AnyEvent::HTTPD>, F<URI> and a custom F<httpd> script (from F<eg/httpd>
251     in this distribution):
252    
253     # first make sure we have perl and the required modules
254     staticperl instcpan AnyEvent::HTTPD
255    
256     # now build the perl
257     staticperl mkperl -M'"Config_heavy.pl"' -MAnyEvent::Impl::Perl \
258     -MAnyEvent::HTTPD -MURI::http \
259     --add 'eg/httpd httpd.pm'
260    
261     # finally, invoke it
262     ./perl -Mhttpd
263    
264     As you can see, things are not quite as trivial: the L<Config> module has
265     a hidden dependency which is not even a perl module (F<Config_heavy.pl>),
266     L<AnyEvent> needs at least one event loop backend that we have to
267 root 1.3 specify manually (here L<AnyEvent::Impl::Perl>), and the F<URI> module
268 root 1.1 (required by L<AnyEvent::HTTPD>) implements various URI schemes as extra
269     modules - since L<AnyEvent::HTTPD> only needs C<http> URIs, we only need
270 root 1.3 to include that module. I found out about these dependencies by carefully
271     watching any error messages about missing modules...
272 root 1.1
273 root 1.14 Instead of building a new perl binary, you can also build a standalone
274     application:
275    
276     # build the app
277     staticperl mkapp app --boot eg/httpd \
278     -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI::http
279    
280     # run it
281     ./app
282    
283 root 1.25 Here are the three phase 2 commands:
284    
285     =over 4
286    
287     =item F<staticperl mkbundle> args...
288    
289     The "default" bundle command - it interprets the given bundle options and
290     writes out F<bundle.h>, F<bundle.c>, F<bundle.ccopts> and F<bundle.ldopts>
291     files, useful for embedding.
292    
293     =item F<staticperl mkperl> args...
294    
295     Creates a bundle just like F<staticperl mkbundle> (in fact, it's the same
296     as invoking F<staticperl mkbundle --perl> args...), but then compiles and
297     links a new perl interpreter that embeds the created bundle, then deletes
298     all intermediate files.
299    
300     =item F<staticperl mkapp> filename args...
301    
302     Does the same as F<staticperl mkbundle> (in fact, it's the same as
303     invoking F<staticperl mkbundle --app> filename args...), but then compiles
304     and links a new standalone application that simply initialises the perl
305     interpreter.
306    
307     The difference to F<staticperl mkperl> is that the standalone application
308     does not act like a perl interpreter would - in fact, by default it would
309     just do nothing and exit immediately, so you should specify some code to
310     be executed via the F<--boot> option.
311    
312     =back
313    
314 root 1.1 =head3 OPTION PROCESSING
315    
316 root 1.3 All options can be given as arguments on the command line (typically
317     using long (e.g. C<--verbose>) or short option (e.g. C<-v>) style). Since
318 root 1.27 specifying a lot of options can make the command line very long and
319     unwieldy, you can put all long options into a "bundle specification file"
320     (one option per line, with or without C<--> prefix) and specify this
321     bundle file instead.
322 root 1.1
323 root 1.27 For example, the command given earlier to link a new F<perl> could also
324     look like this:
325 root 1.1
326     staticperl mkperl httpd.bundle
327    
328 root 1.27 With all options stored in the F<httpd.bundle> file (one option per line,
329     everything after the option is an argument):
330    
331 root 1.1 use "Config_heavy.pl"
332     use AnyEvent::Impl::Perl
333     use AnyEvent::HTTPD
334     use URI::http
335     add eg/httpd httpd.pm
336    
337 root 1.2 All options that specify modules or files to be added are processed in the
338 root 1.25 order given on the command line.
339 root 1.2
340 root 1.27 =head3 BUNDLE CREATION WORKFLOW / STATICPELR MKBUNDLE OPTIONS
341 root 1.19
342 root 1.26 F<staticperl mkbundle> works by first assembling a list of candidate
343     files and modules to include, then filtering them by include/exclude
344 root 1.27 patterns. The remaining modules (together with their direct dependencies,
345     such as link libraries and L<AutoLoader> files) are then converted into
346     bundle files suitable for embedding. F<staticperl mkbundle> can then
347     optionally build a new perl interpreter or a standalone application.
348 root 1.19
349     =over 4
350    
351 root 1.26 =item Step 0: Generic argument processing.
352 root 1.19
353 root 1.26 The following options influence F<staticperl mkbundle> itself.
354 root 1.1
355     =over 4
356    
357 root 1.27 =item C<--verbose> | C<-v>
358 root 1.2
359     Increases the verbosity level by one (the default is C<1>).
360    
361 root 1.27 =item C<--quiet> | C<-q>
362 root 1.2
363     Decreases the verbosity level by one.
364    
365 root 1.26 =item any other argument
366 root 1.2
367 root 1.26 Any other argument is interpreted as a bundle specification file, which
368 root 1.27 supports all options (without extra quoting), one option per line, in the
369     format C<option> or C<option argument>. They will effectively be expanded
370     and processed as if they were directly written on the command line, in
371     place of the file name.
372 root 1.2
373 root 1.26 =back
374 root 1.2
375 root 1.26 =item Step 1: gather candidate files and modules
376 root 1.2
377 root 1.26 In this step, modules, perl libraries (F<.pl> files) and other files are
378     selected for inclusion in the bundle. The relevant options are executed
379     in order (this makes a difference mostly for C<--eval>, which can rely on
380     earlier C<--use> options to have been executed).
381 root 1.2
382 root 1.26 =over 4
383 root 1.2
384 root 1.26 =item C<--use> F<module> | C<-M>F<module>
385 root 1.14
386 root 1.26 Include the named module and trace direct dependencies. This is done by
387 root 1.2 C<require>'ing the module in a subprocess and tracing which other modules
388 root 1.26 and files it actually loads.
389 root 1.2
390     Example: include AnyEvent and AnyEvent::Impl::Perl.
391    
392     staticperl mkbundle --use AnyEvent --use AnyEvent::Impl::Perl
393    
394     Sometimes you want to load old-style "perl libraries" (F<.pl> files), or
395     maybe other weirdly named files. To do that, you need to quote the name in
396 root 1.3 single or double quotes. When given on the command line, you probably need
397 root 1.2 to quote once more to avoid your shell interpreting it. Common cases that
398     need this are F<Config_heavy.pl> and F<utf8_heavy.pl>.
399    
400     Example: include the required files for F<perl -V> to work in all its
401     glory (F<Config.pm> is included automatically by this).
402    
403     # bourne shell
404     staticperl mkbundle --use '"Config_heavy.pl"'
405    
406     # bundle specification file
407     use "Config_heavy.pl"
408    
409 root 1.26 The C<-M>module syntax is included as an alias that might be easier to
410     remember than C<--use>. Or maybe it confuses people. Time will tell. Or
411     maybe not. Sigh.
412 root 1.2
413 root 1.26 =item C<--eval> "perl code" | C<-e> "perl code"
414 root 1.2
415     Sometimes it is easier (or necessary) to specify dependencies using perl
416     code, or maybe one of the modules you use need a special use statement. In
417 root 1.26 that case, you can use C<--eval> to execute some perl snippet or set some
418     variables or whatever you need. All files C<require>'d or C<use>'d while
419     executing the snippet are included in the final bundle.
420 root 1.2
421     Keep in mind that F<mkbundle> will only C<require> the modules named
422     by the C<--use> option, so do not expect the symbols from modules you
423 root 1.3 C<--use>'d earlier on the command line to be available.
424 root 1.2
425     Example: force L<AnyEvent> to detect a backend and therefore include it
426     in the final bundle.
427    
428     staticperl mkbundle --eval 'use AnyEvent; AnyEvent::detect'
429    
430     # or like this
431 root 1.26 staticperl mkbundle -MAnyEvent --eval 'AnyEvent::detect'
432 root 1.2
433     Example: use a separate "bootstrap" script that C<use>'s lots of modules
434 root 1.26 and also include this in the final bundle, to be executed automatically
435     when the interpreter is initialised.
436 root 1.2
437     staticperl mkbundle --eval 'do "bootstrap"' --boot bootstrap
438    
439 root 1.26 =item C<--boot> F<filename>
440    
441     Include the given file in the bundle and arrange for it to be
442     executed (using C<require>) before the main program when the new perl
443     is initialised. This can be used to modify C<@INC> or do similar
444     modifications before the perl interpreter executes scripts given on the
445     command line (or via C<-e>). This works even in an embedded interpreter -
446     the file will be executed during interpreter initialisation in that case.
447    
448     =item C<--incglob> pattern
449    
450     This goes through all standard library directories and tries to match any
451     F<.pm> and F<.pl> files against the extended glob pattern (see below). If
452     a file matches, it is added. The pattern is matched against the full path
453     of the file (sans the library directory prefix), e.g. F<Sys/Syslog.pm>.
454    
455     This is very useful to include "everything":
456    
457     --incglob '*'
458    
459     It is also useful for including perl libraries, or trees of those, such as
460     the unicode database files needed by some perl builtins, the regex engine
461     and other modules.
462    
463     --incglob '/unicore/**.pl'
464    
465     =item C<--add> F<file> | C<--add> "F<file> alias"
466    
467     Adds the given (perl) file into the bundle (and optionally call it
468     "alias"). The F<file> is either an absolute path or a path relative to
469     the current directory. If an alias is specified, then this is the name it
470     will use for C<@INC> searches, otherfile the F<file> will be used as the
471     internal name.
472    
473     This switch is used to include extra files into the bundle.
474    
475     Example: embed the file F<httpd> in the current directory as F<httpd.pm>
476     when creating the bundle.
477    
478     staticperl mkperl --add "httpd httpd.pm"
479    
480     Example: add local files as extra modules in the bundle.
481    
482     # specification file
483     add file1 myfiles/file1.pm
484     add file2 myfiles/file2.pm
485     add file3 myfiles/file3.pl
486    
487     # then later, in perl, use
488     use myfiles::file1;
489     require myfiles::file2;
490     my $res = do "myfiles/file3.pl";
491    
492     =item C<--binadd> F<file> | C<--add> "F<file> alias"
493    
494     Just like C<--add>, except that it treats the file as binary and adds it
495     without any postprocessing (perl files might get stripped to reduce their
496     size).
497    
498     You should probably add a C</> prefix to avoid clashing with embedded perl
499     files (whose paths do not start with C</>), and/or use a special directory
500     prefix, such as C</res/name>.
501    
502     You can later get a copy of these files by calling C<staticperl::find
503     "alias">.
504    
505     An alternative way to embed binary files is to convert them to perl and
506     use C<do> to get the contents - this method is a bit cumbersome, but works
507     both inside and outside of a staticperl bundle:
508 root 1.2
509 root 1.26 # a "binary" file, call it "bindata.pl"
510     <<'SOME_MARKER'
511     binary data NOT containing SOME_MARKER
512     SOME_MARKER
513 root 1.2
514 root 1.26 # load the binary
515     chomp (my $data = do "bindata.pl");
516    
517     =back
518    
519     =item Step 2: filter all files using C<--include> and C<--exclude> options.
520    
521     After all candidate files and modules are added, they are I<filtered>
522     by a combination of C<--include> and C<--exclude> patterns (there is an
523     implicit C<--include **> at the end, so if no filters are specified, all
524     files are included).
525    
526     All that this step does is potentially reduce the number of files that are
527     to be included - no new files are added during this step.
528    
529     =over 4
530    
531     =item C<--include> pattern | C<-i> pattern | C<--exclude> pattern | C<-x> pattern
532    
533     These specify an include or exclude pattern to be applied to the candidate
534     file list. An include makes sure that the given files will be part of the
535     resulting file set, an exclude will exclude remaining files. The patterns
536     are "extended glob patterns" (see below).
537    
538     The patterns are applied "in order" - files included via earlier
539     C<--include> specifications cannot be removed by any following
540     C<--exclude>, and likewise, and file excluded by an earlier C<--exclude>
541     cannot be added by any following C<--include>.
542    
543     For example, to include everything except C<Devel> modules, but still
544     include F<Devel::PPPort>, you could use this:
545    
546     --incglob '*' -i '/Devel/PPPort.pm' -x '/Devel/**'
547    
548     =back
549    
550     =item Step 3: add any extra or "hidden" dependencies.
551    
552     F<staticperl> currently knows about three extra types of depdendencies
553     that are added automatically. Only one (F<.packlist> files) is currently
554     optional and can be influenced, the others are always included:
555    
556     =over 4
557    
558     =item C<--usepacklist>
559 root 1.20
560     Read F<.packlist> files for each distribution that happens to match a
561     module name you specified. Sounds weird, and it is, so expect semantics to
562     change somehow in the future.
563    
564     The idea is that most CPAN distributions have a F<.pm> file that matches
565     the name of the distribution (which is rather reasonable after all).
566    
567     If this switch is enabled, then if any of the F<.pm> files that have been
568     selected match an install distribution, then all F<.pm>, F<.pl>, F<.al>
569     and F<.ix> files installed by this distribution are also included.
570    
571     For example, using this switch, when the L<URI> module is specified, then
572     all L<URI> submodules that have been installed via the CPAN distribution
573     are included as well, so you don't have to manually specify them.
574    
575 root 1.26 =item L<AutoLoader> splitfiles
576 root 1.18
577 root 1.26 Some modules use L<AutoLoader> - less commonly (hopefully) used functions
578     are split into separate F<.al> files, and an index (F<.ix>) file contains
579     the prototypes.
580 root 1.18
581 root 1.26 Both F<.ix> and F<.al> files will be detected automatically and added to
582     the bundle.
583 root 1.18
584 root 1.26 =item link libraries (F<.a> files)
585    
586     Modules using XS (or any other non-perl language extension compiled at
587     installation time) will have a static archive (typically F<.a>). These
588     will automatically be added to the linker options in F<bundle.ldopts>.
589    
590     Should F<staticperl> find a dynamic link library (typically F<.so>) it
591     will warn about it - obviously this shouldn't happen unless you use
592     F<staticperl> on the wrong perl, or one (probably wrongly) configured to
593     use dynamic loading.
594    
595     =item extra libraries (F<extralibs.ld>)
596    
597     Some modules need linking against external libraries - these are found in
598     F<extralibs.ld> and added to F<bundle.ldopts>.
599    
600     =back
601    
602     =item Step 4: write bundle files and optionally link a program
603    
604     At this point, the select files will be read, processed (stripped) and
605     finally the bundle files get written to disk, and F<staticperl mkbundle>
606     is normally finished. Optionally, it can go a step further and either link
607     a new F<perl> binary with all selected modules and files inside, or build
608     a standalone application.
609    
610     Both the contents of the bundle files and any extra linking is controlled
611     by these options:
612    
613     =over 4
614 root 1.18
615 root 1.26 =item C<--strip> C<none>|C<pod>|C<ppi>
616 root 1.18
617 root 1.26 Specify the stripping method applied to reduce the file of the perl
618     sources included.
619 root 1.18
620 root 1.26 The default is C<pod>, which uses the L<Pod::Strip> module to remove all
621     pod documentation, which is very fast and reduces file size a lot.
622 root 1.2
623 root 1.26 The C<ppi> method uses L<PPI> to parse and condense the perl sources. This
624     saves a lot more than just L<Pod::Strip>, and is generally safer,
625     but is also a lot slower (some files take almost a minute to strip -
626     F<staticperl> maintains a cache of stripped files to speed up subsequent
627     runs for this reason). Note that this method doesn't optimise for raw file
628     size, but for best compression (that means that the uncompressed file size
629     is a bit larger, but the files compress better, e.g. with F<upx>).
630    
631     Last not least, if you need accurate line numbers in error messages,
632     or in the unlikely case where C<pod> is too slow, or some module gets
633     mistreated, you can specify C<none> to not mangle included perl sources in
634     any way.
635 root 1.2
636 root 1.26 =item --perl
637 root 1.2
638 root 1.26 After writing out the bundle files, try to link a new perl interpreter. It
639     will be called F<perl> and will be left in the current working
640     directory. The bundle files will be removed.
641 root 1.2
642 root 1.26 This switch is automatically used when F<staticperl> is invoked with the
643     C<mkperl> command instead of C<mkbundle>.
644 root 1.2
645 root 1.26 Example: build a new F<./perl> binary with only L<common::sense> inside -
646     it will be even smaller than the standard perl interpreter as none of the
647     modules of the base distribution (such as L<Fcntl>) will be included.
648 root 1.2
649 root 1.26 staticperl mkperl -Mcommon::sense
650 root 1.10
651 root 1.26 =item --app name
652 root 1.10
653 root 1.26 After writing out the bundle files, try to link a new standalone
654     program. It will be called C<name>, and the bundle files get removed after
655     linking it.
656 root 1.10
657 root 1.26 This switch is automatically used when F<staticperl> is invoked with the
658     C<mkapp> command instead of C<mkbundle>.
659 root 1.10
660 root 1.26 The difference to the (mutually exclusive) C<--perl> option is that the
661     binary created by this option will not try to act as a perl interpreter -
662     instead it will simply initialise the perl interpreter, clean it up and
663     exit.
664 root 1.18
665 root 1.26 This means that, by default, it will do nothing but burna few CPU cycles
666     - for it to do something useful you I<must> add some boot code, e.g. with
667     the C<--boot> option.
668 root 1.18
669 root 1.26 Example: create a standalone perl binary called F<./myexe> that will
670     execute F<appfile> when it is started.
671 root 1.18
672 root 1.26 staticperl mkbundle --app myexe --boot appfile
673 root 1.18
674 root 1.2 =item --static
675    
676 root 1.26 Add C<-static> to F<bundle.ldopts>, which means a fully static (if
677     supported by the OS) executable will be created. This is not immensely
678     useful when just creating the bundle files, but is most useful when
679     linking a binary with the C<--perl> or C<--app> options.
680    
681     The default is to link the new binary dynamically (that means all perl
682     modules are linked statically, but all external libraries are still
683 root 1.2 referenced dynamically).
684    
685     Keep in mind that Solaris doesn't support static linking at all, and
686 root 1.26 systems based on GNU libc don't really support it in a very usable
687     fashion either. Try uClibc if you want to create fully statically linked
688     executables, or try the C<--staticlib> option to link only some libraries
689 root 1.2 statically.
690    
691 root 1.18 =item --staticlib libname
692    
693     When not linking fully statically, this option allows you to link specific
694     libraries statically. What it does is simply replace all occurances of
695     C<-llibname> with the GCC-specific C<-Wl,-Bstatic -llibname -Wl,-Bdynamic>
696     option.
697    
698     This will have no effect unless the library is actually linked against,
699     specifically, C<--staticlib> will not link against the named library
700     unless it would be linked against anyway.
701    
702     Example: link libcrypt statically into the binary.
703    
704     staticperl mkperl -MIO::AIO --staticlib crypt
705    
706 root 1.26 # ldopts might now contain:
707 root 1.18 # -lm -Wl,-Bstatic -lcrypt -Wl,-Bdynamic -lpthread
708    
709 root 1.26 =back
710 root 1.1
711     =back
712    
713 root 1.18 =head3 EXTENDED GLOB PATTERNS
714    
715     Some options of F<staticperl mkbundle> expect an I<extended glob
716     pattern>. This is neither a normal shell glob nor a regex, but something
717     in between. The idea has been copied from rsync, and there are the current
718     matching rules:
719    
720     =over 4
721    
722     =item Patterns starting with F</> will be a anchored at the root of the library tree.
723    
724     That is, F</unicore> will match the F<unicore> directory in C<@INC>, but
725     nothing inside, and neither any other file or directory called F<unicore>
726     anywhere else in the hierarchy.
727    
728     =item Patterns not starting with F</> will be anchored at the end of the path.
729    
730     That is, F<idna.pl> will match any file called F<idna.pl> anywhere in the
731     hierarchy, but not any directories of the same name.
732    
733     =item A F<*> matches any single component.
734    
735     That is, F</unicore/*.pl> would match all F<.pl> files directly inside
736     C</unicore>, not any deeper level F<.pl> files. Or in other words, F<*>
737     will not match slashes.
738    
739     =item A F<**> matches anything.
740    
741     That is, F</unicore/**.pl> would match all F<.pl> files under F</unicore>,
742     no matter how deeply nested they are inside subdirectories.
743    
744     =item A F<?> matches a single character within a component.
745    
746     That is, F</Encode/??.pm> matches F</Encode/JP.pm>, but not the
747     hypothetical F</Encode/J/.pm>, as F<?> does not match F</>.
748    
749     =back
750    
751 root 1.15 =head2 F<STATICPERL> CONFIGURATION AND HOOKS
752 root 1.1
753 root 1.20 During (each) startup, F<staticperl> tries to source some shell files to
754     allow you to fine-tune/override configuration settings.
755    
756     In them you can override shell variables, or define shell functions
757     ("hooks") to be called at specific phases during installation. For
758     example, you could define a C<postinstall> hook to install additional
759     modules from CPAN each time you start from scratch.
760    
761     If the env variable C<$STATICPERLRC> is set, then F<staticperl> will try
762     to source the file named with it only. Otherwise, it tries the following
763     shell files in order:
764 root 1.2
765     /etc/staticperlrc
766     ~/.staticperlrc
767     $STATICPERL/rc
768    
769     Note that the last file is erased during F<staticperl distclean>, so
770     generally should not be used.
771    
772     =head3 CONFIGURATION VARIABLES
773    
774     =head4 Variables you I<should> override
775    
776     =over 4
777    
778     =item C<EMAIL>
779    
780     The e-mail address of the person who built this binary. Has no good
781     default, so should be specified by you.
782    
783     =item C<CPAN>
784    
785     The URL of the CPAN mirror to use (e.g. L<http://mirror.netcologne.de/cpan/>).
786    
787 root 1.6 =item C<EXTRA_MODULES>
788 root 1.2
789 root 1.6 Additional modules installed during F<staticperl install>. Here you can
790     set which modules you want have to installed from CPAN.
791 root 1.2
792 root 1.11 Example: I really really need EV, AnyEvent, Coro and AnyEvent::AIO.
793 root 1.2
794 root 1.11 EXTRA_MODULES="EV AnyEvent Coro AnyEvent::AIO"
795 root 1.2
796 root 1.6 Note that you can also use a C<postinstall> hook to achieve this, and
797     more.
798 root 1.2
799 root 1.11 =back
800    
801     =head4 Variables you might I<want> to override
802    
803     =over 4
804    
805     =item C<STATICPERL>
806    
807     The directory where staticperl stores all its files
808     (default: F<~/.staticperl>).
809    
810 root 1.6 =item C<PERL_MM_USE_DEFAULT>, C<EV_EXTRA_DEFS>, ...
811 root 1.2
812     Usually set to C<1> to make modules "less inquisitive" during their
813     installation, you can set any environment variable you want - some modules
814     (such as L<Coro> or L<EV>) use environment variables for further tweaking.
815    
816 root 1.11 =item C<PERL_VERSION>
817 root 1.6
818 root 1.11 The perl version to install - default is currently C<5.12.2>, but C<5.8.9>
819     is also a good choice (5.8.9 is much smaller than 5.12.2, while 5.10.1 is
820     about as big as 5.12.2).
821 root 1.2
822 root 1.11 =item C<PERL_PREFIX>
823 root 1.2
824 root 1.6 The prefix where perl gets installed (default: F<$STATICPERL/perl>),
825     i.e. where the F<bin> and F<lib> subdirectories will end up.
826 root 1.2
827 root 1.10 =item C<PERL_CONFIGURE>
828    
829     Additional Configure options - these are simply passed to the perl
830     Configure script. For example, if you wanted to enable dynamic loading,
831     you could pass C<-Dusedl>. To enable ithreads (Why would you want that
832     insanity? Don't! Use L<forks> instead!) you would pass C<-Duseithreads>
833     and so on.
834    
835     More commonly, you would either activate 64 bit integer support
836     (C<-Duse64bitint>), or disable large files support (-Uuselargefiles), to
837     reduce filesize further.
838    
839 root 1.24 =item C<PERL_CC>, C<PERL_CCFLAGS>, C<PERL_OPTIMIZE>, C<PERL_LDFLAGS>, C<PERL_LIBS>
840 root 1.2
841 root 1.6 These flags are passed to perl's F<Configure> script, and are generally
842     optimised for small size (at the cost of performance). Since they also
843     contain subtle workarounds around various build issues, changing these
844 root 1.24 usually requires understanding their default values - best look at
845     the top of the F<staticperl> script for more info on these, and use a
846     F<~/.staticperlrc> to override them.
847    
848     Most of the variables override (or modify) the corresponding F<Configure>
849     variable, except C<PERL_CCFLAGS>, which gets appended.
850 root 1.2
851     =back
852    
853 root 1.5 =head4 Variables you probably I<do not want> to override
854 root 1.2
855     =over 4
856    
857 root 1.23 =item C<MAKE>
858    
859     The make command to use - default is C<make>.
860    
861 root 1.2 =item C<MKBUNDLE>
862    
863     Where F<staticperl> writes the C<mkbundle> command to
864     (default: F<$STATICPERL/mkbundle>).
865    
866     =item C<STATICPERL_MODULES>
867    
868     Additional modules needed by C<mkbundle> - should therefore not be changed
869     unless you know what you are doing.
870    
871     =back
872    
873     =head3 OVERRIDABLE HOOKS
874    
875     In addition to environment variables, it is possible to provide some
876     shell functions that are called at specific times. To provide your own
877 root 1.3 commands, just define the corresponding function.
878 root 1.2
879     Example: install extra modules from CPAN and from some directories
880     at F<staticperl install> time.
881    
882     postinstall() {
883 root 1.4 rm -rf lib/threads* # weg mit Schaden
884 root 1.2 instcpan IO::AIO EV
885     instsrc ~/src/AnyEvent
886     instsrc ~/src/XML-Sablotron-1.0100001
887 root 1.4 instcpan Anyevent::AIO AnyEvent::HTTPD
888 root 1.2 }
889    
890     =over 4
891    
892 root 1.12 =item preconfigure
893    
894     Called just before running F<./Configur> in the perl source
895     directory. Current working directory is the perl source directory.
896    
897     This can be used to set any C<PERL_xxx> variables, which might be costly
898     to compute.
899    
900 root 1.2 =item postconfigure
901    
902     Called after configuring, but before building perl. Current working
903     directory is the perl source directory.
904    
905 root 1.12 Could be used to tailor/patch config.sh (followed by F<sh Configure -S>)
906     or do any other modifications.
907 root 1.2
908     =item postbuild
909    
910     Called after building, but before installing perl. Current working
911     directory is the perl source directory.
912    
913     I have no clue what this could be used for - tell me.
914    
915     =item postinstall
916    
917     Called after perl and any extra modules have been installed in C<$PREFIX>,
918     but before setting the "installation O.K." flag.
919    
920     The current working directory is C<$PREFIX>, but maybe you should not rely
921     on that.
922    
923     This hook is most useful to customise the installation, by deleting files,
924     or installing extra modules using the C<instcpan> or C<instsrc> functions.
925    
926     The script must return with a zero exit status, or the installation will
927     fail.
928    
929     =back
930 root 1.1
931 root 1.9 =head1 ANATOMY OF A BUNDLE
932    
933     When not building a new perl binary, C<mkbundle> will leave a number of
934     files in the current working directory, which can be used to embed a perl
935     interpreter in your program.
936    
937     Intimate knowledge of L<perlembed> and preferably some experience with
938     embedding perl is highly recommended.
939    
940     C<mkperl> (or the C<--perl> option) basically does this to link the new
941     interpreter (it also adds a main program to F<bundle.>):
942    
943     $Config{cc} $(cat bundle.ccopts) -o perl bundle.c $(cat bundle.ldopts)
944    
945     =over 4
946    
947     =item bundle.h
948    
949     A header file that contains the prototypes of the few symbols "exported"
950     by bundle.c, and also exposes the perl headers to the application.
951    
952     =over 4
953    
954     =item staticperl_init ()
955    
956     Initialises the perl interpreter. You can use the normal perl functions
957     after calling this function, for example, to define extra functions or
958     to load a .pm file that contains some initialisation code, or the main
959     program function:
960    
961     XS (xsfunction)
962     {
963     dXSARGS;
964    
965     // now we have items, ST(i) etc.
966     }
967    
968     static void
969     run_myapp(void)
970     {
971     staticperl_init ();
972     newXSproto ("myapp::xsfunction", xsfunction, __FILE__, "$$;$");
973     eval_pv ("require myapp::main", 1); // executes "myapp/main.pm"
974     }
975    
976     =item staticperl_xs_init (pTHX)
977    
978     Sometimes you need direct control over C<perl_parse> and C<perl_run>, in
979     which case you do not want to use C<staticperl_init> but call them on your
980     own.
981    
982     Then you need this function - either pass it directly as the C<xs_init>
983     function to C<perl_parse>, or call it from your own C<xs_init> function.
984    
985     =item staticperl_cleanup ()
986    
987     In the unlikely case that you want to destroy the perl interpreter, here
988     is the corresponding function.
989    
990     =item PerlInterpreter *staticperl
991    
992     The perl interpreter pointer used by staticperl. Not normally so useful,
993     but there it is.
994    
995     =back
996    
997     =item bundle.ccopts
998    
999     Contains the compiler options required to compile at least F<bundle.c> and
1000     any file that includes F<bundle.h> - you should probably use it in your
1001     C<CFLAGS>.
1002    
1003     =item bundle.ldopts
1004    
1005     The linker options needed to link the final program.
1006    
1007     =back
1008    
1009     =head1 RUNTIME FUNCTIONALITY
1010    
1011     Binaries created with C<mkbundle>/C<mkperl> contain extra functions, which
1012     are required to access the bundled perl sources, but might be useful for
1013     other purposes.
1014    
1015     In addition, for the embedded loading of perl files to work, F<staticperl>
1016     overrides the C<@INC> array.
1017    
1018     =over 4
1019    
1020     =item $file = staticperl::find $path
1021    
1022     Returns the data associated with the given C<$path>
1023     (e.g. C<Digest/MD5.pm>, C<auto/POSIX/autosplit.ix>), which is basically
1024     the UNIX path relative to the perl library directory.
1025    
1026     Returns C<undef> if the file isn't embedded.
1027    
1028     =item @paths = staticperl::list
1029    
1030     Returns the list of all paths embedded in this binary.
1031    
1032     =back
1033    
1034     =head1 FULLY STATIC BINARIES - BUILDROOT
1035    
1036 root 1.10 To make truly static (Linux-) libraries, you might want to have a look at
1037 root 1.9 buildroot (L<http://buildroot.uclibc.org/>).
1038    
1039     Buildroot is primarily meant to set up a cross-compile environment (which
1040     is not so useful as perl doesn't quite like cross compiles), but it can also compile
1041     a chroot environment where you can use F<staticperl>.
1042    
1043     To do so, download buildroot, and enable "Build options => development
1044     files in target filesystem" and optionally "Build options => gcc
1045     optimization level (optimize for size)". At the time of writing, I had
1046     good experiences with GCC 4.4.x but not GCC 4.5.
1047    
1048     To minimise code size, I used C<-pipe -ffunction-sections -fdata-sections
1049     -finline-limit=8 -fno-builtin-strlen -mtune=i386>. The C<-mtune=i386>
1050     doesn't decrease codesize much, but it makes the file much more
1051     compressible.
1052    
1053     If you don't need Coro or threads, you can go with "linuxthreads.old" (or
1054     no thread support). For Coro, it is highly recommended to switch to a
1055     uClibc newer than 0.9.31 (at the time of this writing, I used the 20101201
1056     snapshot) and enable NPTL, otherwise Coro needs to be configured with the
1057     ultra-slow pthreads backend to work around linuxthreads bugs (it also uses
1058     twice the address space needed for stacks).
1059    
1060 root 1.10 If you use C<linuxthreads.old>, then you should also be aware that
1061     uClibc shares C<errno> between all threads when statically linking. See
1062     L<http://lists.uclibc.org/pipermail/uclibc/2010-June/044157.html> for a
1063     workaround (And L<https://bugs.uclibc.org/2089> for discussion).
1064    
1065 root 1.11 C<ccache> support is also recommended, especially if you want
1066     to play around with buildroot options. Enabling the C<miniperl>
1067     package will probably enable all options required for a successful
1068     perl build. F<staticperl> itself additionally needs either C<wget>
1069     (recommended, for CPAN) or C<curl>.
1070 root 1.9
1071     As for shells, busybox should provide all that is needed, but the default
1072     busybox configuration doesn't include F<comm> which is needed by perl -
1073     either make a custom busybox config, or compile coreutils.
1074    
1075     For the latter route, you might find that bash has some bugs that keep
1076     it from working properly in a chroot - either use dash (and link it to
1077     F</bin/sh> inside the chroot) or link busybox to F</bin/sh>, using it's
1078     built-in ash shell.
1079    
1080     Finally, you need F</dev/null> inside the chroot for many scripts to work
1081     - F<cp /dev/null output/target/dev> or bind-mounting your F</dev> will
1082     both provide this.
1083    
1084     After you have compiled and set up your buildroot target, you can copy
1085     F<staticperl> from the C<App::Staticperl> distribution or from your
1086     perl f<bin> directory (if you installed it) into the F<output/target>
1087     filesystem, chroot inside and run it.
1088    
1089 root 1.17 =head1 RECIPES / SPECIFIC MODULES
1090    
1091     This section contains some common(?) recipes and information about
1092     problems with some common modules or perl constructs that require extra
1093     files to be included.
1094    
1095     =head2 MODULES
1096    
1097     =over 4
1098    
1099     =item utf8
1100    
1101     Some functionality in the utf8 module, such as swash handling (used
1102     for unicode character ranges in regexes) is implemented in the
1103 root 1.18 C<"utf8_heavy.pl"> library:
1104    
1105     -M'"utf8_heavy.pl"'
1106 root 1.17
1107     Many Unicode properties in turn are defined in separate modules,
1108     such as C<"unicore/Heavy.pl"> and more specific data tables such as
1109 root 1.18 C<"unicore/To/Digit.pl"> or C<"unicore/lib/Perl/Word.pl">. These tables
1110     are big (7MB uncompressed, although F<staticperl> contains special
1111     handling for those files), so including them on demand by your application
1112     only might pay off.
1113 root 1.17
1114 root 1.18 To simply include the whole unicode database, use:
1115 root 1.17
1116 root 1.18 --incglob '/unicore/*.pl'
1117 root 1.17
1118     =item AnyEvent
1119    
1120     AnyEvent needs a backend implementation that it will load in a delayed
1121     fashion. The L<AnyEvent::Impl::Perl> backend is the default choice
1122     for AnyEvent if it can't find anything else, and is usually a safe
1123     fallback. If you plan to use e.g. L<EV> (L<POE>...), then you need to
1124     include the L<AnyEvent::Impl::EV> (L<AnyEvent::Impl::POE>...) backend as
1125     well.
1126    
1127     If you want to handle IRIs or IDNs (L<AnyEvent::Util> punycode and idn
1128     functions), you also need to include C<"AnyEvent/Util/idna.pl"> and
1129     C<"AnyEvent/Util/uts46data.pl">.
1130    
1131 root 1.20 Or you can use C<--usepacklist> and specify C<-MAnyEvent> to include
1132     everything.
1133    
1134 root 1.18 =item Carp
1135    
1136     Carp had (in older versions of perl) a dependency on L<Carp::Heavy>. As of
1137     perl 5.12.2 (maybe earlier), this dependency no longer exists.
1138    
1139     =item Config
1140    
1141     The F<perl -V> switch (as well as many modules) needs L<Config>, which in
1142     turn might need L<"Config_heavy.pl">. Including the latter gives you
1143     both.
1144    
1145     =item Term::ReadLine::Perl
1146    
1147 root 1.20 Also needs L<Term::ReadLine::readline>, or C<--usepacklist>.
1148 root 1.18
1149 root 1.17 =item URI
1150    
1151     URI implements schemes as separate modules - the generic URL scheme is
1152     implemented in L<URI::_generic>, HTTP is implemented in L<URI::http>. If
1153 root 1.20 you need to use any of these schemes, you should include these manually,
1154     or use C<--usepacklist>.
1155 root 1.17
1156     =back
1157    
1158     =head2 RECIPES
1159    
1160     =over 4
1161    
1162 root 1.18 =item Linking everything in
1163    
1164     To link just about everything installed in the perl library into a new
1165     perl, try this:
1166    
1167     staticperl mkperl --strip ppi --incglob '*'
1168    
1169 root 1.17 =item Getting rid of netdb function
1170    
1171     The perl core has lots of netdb functions (C<getnetbyname>, C<getgrent>
1172     and so on) that few applications use. You can avoid compiling them in by
1173     putting the following fragment into a C<preconfigure> hook:
1174    
1175     preconfigure() {
1176     for sym in \
1177     d_getgrnam_r d_endgrent d_endgrent_r d_endhent \
1178     d_endhostent_r d_endnent d_endnetent_r d_endpent \
1179     d_endprotoent_r d_endpwent d_endpwent_r d_endsent \
1180     d_endservent_r d_getgrent d_getgrent_r d_getgrgid_r \
1181     d_getgrnam_r d_gethbyaddr d_gethent d_getsbyport \
1182     d_gethostbyaddr_r d_gethostbyname_r d_gethostent_r \
1183     d_getlogin_r d_getnbyaddr d_getnbyname d_getnent \
1184     d_getnetbyaddr_r d_getnetbyname_r d_getnetent_r \
1185     d_getpent d_getpbyname d_getpbynumber d_getprotobyname_r \
1186     d_getprotobynumber_r d_getprotoent_r d_getpwent \
1187     d_getpwent_r d_getpwnam_r d_getpwuid_r d_getsent \
1188     d_getservbyname_r d_getservbyport_r d_getservent_r \
1189     d_getspnam_r d_getsbyname
1190     # d_gethbyname
1191     do
1192     PERL_CONFIGURE="$PERL_CONFIGURE -U$sym"
1193     done
1194     }
1195    
1196     This mostly gains space when linking staticaly, as the functions will
1197 root 1.21 likely not be linked in. The gain for dynamically-linked binaries is
1198 root 1.17 smaller.
1199    
1200     Also, this leaves C<gethostbyname> in - not only is it actually used
1201     often, the L<Socket> module also exposes it, so leaving it out usually
1202     gains little. Why Socket exposes a C function that is in the core already
1203     is anybody's guess.
1204    
1205     =back
1206    
1207 root 1.1 =head1 AUTHOR
1208    
1209     Marc Lehmann <schmorp@schmorp.de>
1210     http://software.schmorp.de/pkg/staticperl.html