ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/cvsroot/App-Staticperl/staticperl.pod
Revision: 1.20
Committed: Fri Dec 10 20:29:17 2010 UTC (13 years, 7 months ago) by root
Branch: MAIN
CVS Tags: rel-0_91, rel-0_911
Changes since 1.19: +45 -9 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.8 staticperl - perl, libc, 100 modules, all in one 500kb file
4 root 1.1
5     =head1 SYNOPSIS
6    
7     staticperl help # print the embedded documentation
8     staticperl fetch # fetch and unpack perl sources
9     staticperl configure # fetch and then configure perl
10     staticperl build # configure and then build perl
11     staticperl install # build and then install perl
12     staticperl clean # clean most intermediate files (restart at configure)
13     staticperl distclean # delete everything installed by this script
14     staticperl cpan # invoke CPAN shell
15     staticperl instmod path... # install unpacked modules
16     staticperl instcpan modulename... # install modules from CPAN
17     staticperl mkbundle <bundle-args...> # see documentation
18     staticperl mkperl <bundle-args...> # see documentation
19 root 1.14 staticperl mkapp appname <bundle-args...> # see documentation
20 root 1.1
21     Typical Examples:
22    
23     staticperl install # fetch, configure, build and install perl
24     staticperl cpan # run interactive cpan shell
25     staticperl mkperl -M '"Config_heavy.pl"' # build a perl that supports -V
26     staticperl mkperl -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI -MURI::http
27     # build a perl with the above modules linked in
28 root 1.14 staticperl mkapp myapp --boot mainprog mymodules
29     # build a binary "myapp" from mainprog and mymodules
30 root 1.1
31     =head1 DESCRIPTION
32    
33 root 1.16 This script helps you to create single-file perl interpreters
34     or applications, or embedding a perl interpreter in your
35     applications. Single-file means that it is fully self-contained - no
36     separate shared objects, no autoload fragments, no .pm or .pl files are
37     needed. And when linking statically, you can create (or embed) a single
38     file that contains perl interpreter, libc, all the modules you need, all
39     the libraries you need and of course your actual program.
40 root 1.1
41 root 1.8 With F<uClibc> and F<upx> on x86, you can create a single 500kb binary
42     that contains perl and 100 modules such as POSIX, AnyEvent, EV, IO::AIO,
43     Coro and so on. Or any other choice of modules.
44 root 1.1
45 root 1.20 To see how this turns out, you can try out smallperl and bigperl, two
46     pre-built static and compressed perl binaries with many and even more
47     modules: just follow the links at L<http://staticperl.schmorp.de/>.
48    
49 root 1.3 The created files do not need write access to the file system (like PAR
50 root 1.1 does). In fact, since this script is in many ways similar to PAR::Packer,
51     here are the differences:
52    
53     =over 4
54    
55     =item * The generated executables are much smaller than PAR created ones.
56    
57     Shared objects and the perl binary contain a lot of extra info, while
58     the static nature of F<staticperl> allows the linker to remove all
59     functionality and meta-info not required by the final executable. Even
60     extensions statically compiled into perl at build time will only be
61     present in the final executable when needed.
62    
63     In addition, F<staticperl> can strip perl sources much more effectively
64     than PAR.
65    
66     =item * The generated executables start much faster.
67    
68     There is no need to unpack files, or even to parse Zip archives (which is
69     slow and memory-consuming business).
70    
71     =item * The generated executables don't need a writable filesystem.
72    
73     F<staticperl> loads all required files directly from memory. There is no
74     need to unpack files into a temporary directory.
75    
76 root 1.17 =item * More control over included files, more burden.
77 root 1.1
78 root 1.3 PAR tries to be maintenance and hassle-free - it tries to include more
79 root 1.17 files than necessary to make sure everything works out of the box. It
80     mostly succeeds at this, but he extra files (such as the unicode database)
81     can take substantial amounts of memory and file size.
82 root 1.1
83     With F<staticperl>, the burden is mostly with the developer - only direct
84     compile-time dependencies and L<AutoLoader> are handled automatically.
85     This means the modules to include often need to be tweaked manually.
86    
87 root 1.17 All this does not preclude more permissive modes to be implemented in
88     the future, but right now, you have to resolve state hidden dependencies
89     manually.
90    
91 root 1.1 =item * PAR works out of the box, F<staticperl> does not.
92    
93     Maintaining your own custom perl build can be a pain in the ass, and while
94     F<staticperl> tries to make this easy, it still requires a custom perl
95     build and possibly fiddling with some modules. PAR is likely to produce
96     results faster.
97    
98 root 1.13 Ok, PAR never has worked for me out of the box, and for some people,
99     F<staticperl> does work out of the box, as they don't count "fiddling with
100     module use lists" against it, but nevertheless, F<staticperl> is certainly
101     a bit more difficult to use.
102    
103 root 1.1 =back
104    
105     =head1 HOW DOES IT WORK?
106    
107     Simple: F<staticperl> downloads, compile and installs a perl version of
108     your choice in F<~/.staticperl>. You can add extra modules either by
109     letting F<staticperl> install them for you automatically, or by using CPAN
110     and doing it interactively. This usually takes 5-10 minutes, depending on
111 root 1.3 the speed of your computer and your internet connection.
112 root 1.1
113     It is possible to do program development at this stage, too.
114    
115     Afterwards, you create a list of files and modules you want to include,
116 root 1.3 and then either build a new perl binary (that acts just like a normal perl
117 root 1.1 except everything is compiled in), or you create bundle files (basically C
118     sources you can use to embed all files into your project).
119    
120 root 1.18 This step is very fast (a few seconds if PPI is not used for stripping, or
121     the stripped files are in the cache), and can be tweaked and repeated as
122     often as necessary.
123 root 1.1
124     =head1 THE F<STATICPERL> SCRIPT
125    
126     This module installs a script called F<staticperl> into your perl
127     binary directory. The script is fully self-contained, and can be used
128     without perl (for example, in an uClibc chroot environment). In fact,
129     it can be extracted from the C<App::Staticperl> distribution tarball as
130     F<bin/staticperl>, without any installation.
131    
132     F<staticperl> interprets the first argument as a command to execute,
133     optionally followed by any parameters.
134    
135     There are two command categories: the "phase 1" commands which deal with
136     installing perl and perl modules, and the "phase 2" commands, which deal
137     with creating binaries and bundle files.
138    
139     =head2 PHASE 1 COMMANDS: INSTALLING PERL
140    
141     The most important command is F<install>, which does basically
142     everything. The default is to download and install perl 5.12.2 and a few
143     modules required by F<staticperl> itself, but all this can (and should) be
144     changed - see L<CONFIGURATION>, below.
145    
146     The command
147    
148     staticperl install
149    
150     Is normally all you need: It installs the perl interpreter in
151     F<~/.staticperl/perl>. It downloads, configures, builds and installs the
152     perl interpreter if required.
153    
154     Most of the following commands simply run one or more steps of this
155     sequence.
156    
157 root 1.3 To force recompilation or reinstallation, you need to run F<staticperl
158 root 1.1 distclean> first.
159    
160     =over 4
161    
162 root 1.20 =item F<staticperl version>
163    
164     Prints some info about the version of the F<staticperl> script you are using.
165    
166 root 1.1 =item F<staticperl fetch>
167    
168     Runs only the download and unpack phase, unless this has already happened.
169    
170     =item F<staticperl configure>
171    
172     Configures the unpacked perl sources, potentially after downloading them first.
173    
174     =item F<staticperl build>
175    
176     Builds the configured perl sources, potentially after automatically
177     configuring them.
178    
179     =item F<staticperl install>
180    
181 root 1.3 Wipes the perl installation directory (usually F<~/.staticperl/perl>) and
182     installs the perl distribution, potentially after building it first.
183 root 1.1
184     =item F<staticperl cpan> [args...]
185    
186 root 1.3 Starts an interactive CPAN shell that you can use to install further
187     modules. Installs the perl first if necessary, but apart from that,
188 root 1.1 no magic is involved: you could just as well run it manually via
189     F<~/.staticperl/perl/bin/cpan>.
190    
191     Any additional arguments are simply passed to the F<cpan> command.
192    
193     =item F<staticperl instcpan> module...
194    
195     Tries to install all the modules given and their dependencies, using CPAN.
196    
197     Example:
198    
199     staticperl instcpan EV AnyEvent::HTTPD Coro
200    
201     =item F<staticperl instsrc> directory...
202    
203     In the unlikely case that you have unpacked perl modules around and want
204 root 1.3 to install from these instead of from CPAN, you can do this using this
205 root 1.1 command by specifying all the directories with modules in them that you
206     want to have built.
207    
208     =item F<staticperl clean>
209    
210 root 1.12 Deletes the perl source directory (and potentially cleans up other
211     intermediate files). This can be used to clean up files only needed for
212     building perl, without removing the installed perl interpreter, or to
213     force a re-build from scratch.
214    
215     At the moment, it doesn't delete downloaded tarballs.
216 root 1.1
217     =item F<staticperl distclean>
218    
219     This wipes your complete F<~/.staticperl> directory. Be careful with this,
220     it nukes your perl download, perl sources, perl distribution and any
221     installed modules. It is useful if you wish to start over "from scratch"
222     or when you want to uninstall F<staticperl>.
223    
224     =back
225    
226     =head2 PHASE 2 COMMANDS: BUILDING PERL BUNDLES
227    
228     Building (linking) a new F<perl> binary is handled by a separate
229     script. To make it easy to use F<staticperl> from a F<chroot>, the script
230     is embedded into F<staticperl>, which will write it out and call for you
231     with any arguments you pass:
232    
233     staticperl mkbundle mkbundle-args...
234    
235     In the oh so unlikely case of something not working here, you
236 root 1.2 can run the script manually as well (by default it is written to
237 root 1.1 F<~/.staticperl/mkbundle>).
238    
239     F<mkbundle> is a more conventional command and expect the argument
240 root 1.3 syntax commonly used on UNIX clones. For example, this command builds
241 root 1.1 a new F<perl> binary and includes F<Config.pm> (for F<perl -V>),
242     F<AnyEvent::HTTPD>, F<URI> and a custom F<httpd> script (from F<eg/httpd>
243     in this distribution):
244    
245     # first make sure we have perl and the required modules
246     staticperl instcpan AnyEvent::HTTPD
247    
248     # now build the perl
249     staticperl mkperl -M'"Config_heavy.pl"' -MAnyEvent::Impl::Perl \
250     -MAnyEvent::HTTPD -MURI::http \
251     --add 'eg/httpd httpd.pm'
252    
253     # finally, invoke it
254     ./perl -Mhttpd
255    
256     As you can see, things are not quite as trivial: the L<Config> module has
257     a hidden dependency which is not even a perl module (F<Config_heavy.pl>),
258     L<AnyEvent> needs at least one event loop backend that we have to
259 root 1.3 specify manually (here L<AnyEvent::Impl::Perl>), and the F<URI> module
260 root 1.1 (required by L<AnyEvent::HTTPD>) implements various URI schemes as extra
261     modules - since L<AnyEvent::HTTPD> only needs C<http> URIs, we only need
262 root 1.3 to include that module. I found out about these dependencies by carefully
263     watching any error messages about missing modules...
264 root 1.1
265 root 1.14 Instead of building a new perl binary, you can also build a standalone
266     application:
267    
268     # build the app
269     staticperl mkapp app --boot eg/httpd \
270     -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI::http
271    
272     # run it
273     ./app
274    
275 root 1.1 =head3 OPTION PROCESSING
276    
277 root 1.3 All options can be given as arguments on the command line (typically
278     using long (e.g. C<--verbose>) or short option (e.g. C<-v>) style). Since
279     specifying a lot of modules can make the command line very cumbersome,
280 root 1.1 you can put all long options into a "bundle specification file" (with or
281     without C<--> prefix) and specify this bundle file instead.
282    
283     For example, the command given earlier could also look like this:
284    
285     staticperl mkperl httpd.bundle
286    
287     And all options could be in F<httpd.bundle>:
288    
289     use "Config_heavy.pl"
290     use AnyEvent::Impl::Perl
291     use AnyEvent::HTTPD
292     use URI::http
293     add eg/httpd httpd.pm
294    
295 root 1.2 All options that specify modules or files to be added are processed in the
296 root 1.3 order given on the command line (that affects the C<--use> and C<--eval>
297 root 1.2 options at the moment).
298    
299 root 1.19 =head3 PACKAGE SELECTION WORKFLOW
300    
301     F<staticperl mkbundle> has a number of options to control package
302     selection. This section describes how they interact with each other. Also,
303     since I am still a newbie w.r.t. these issues, maybe future versions of
304     F<staticperl> will change this, so watch out :)
305    
306     The idiom "in order" means "in order that they are specified on the
307     commandline". If you use a bundle specification file, then the options
308     will be processed as if they were given in place of the bundle file name.
309    
310     =over 4
311    
312     =item 1. apply all C<--use>, C<--eval>, C<--add>, C<--addbin> and
313     C<--incglob> options, in order.
314    
315     In addition, C<--use> and C<--eval> dependencies will be added when the
316     options are processed.
317    
318     =item 2. apply all C<--include> and C<--exclude> options, in order.
319    
320     All this step does is potentially reduce the number of files already
321     selected or found in phase 1.
322    
323     =item 3. find all modules (== F<.pm> files), gather their static archives
324 root 1.20 (F<.a>) and AutoLoader splitfiles (F<.ix> and F<.al> files), find any
325     extra libraries they need for linking (F<extralibs.ld>) and optionally
326     evaluate any F<.packlist> files.
327 root 1.19
328     This step is required to link against XS extensions and also adds files
329     required for L<AutoLoader> to do it's job.
330    
331     =back
332    
333     After this, all the files selected for bundling will be read and processed
334     (stripped), the bundle files will be written, and optionally a new F<perl>
335     or application binary will be linked.
336    
337 root 1.1 =head3 MKBUNDLE OPTIONS
338    
339     =over 4
340    
341 root 1.2 =item --verbose | -v
342    
343     Increases the verbosity level by one (the default is C<1>).
344    
345     =item --quiet | -q
346    
347     Decreases the verbosity level by one.
348    
349     =item --strip none|pod|ppi
350    
351     Specify the stripping method applied to reduce the file of the perl
352     sources included.
353    
354     The default is C<pod>, which uses the L<Pod::Strip> module to remove all
355 root 1.3 pod documentation, which is very fast and reduces file size a lot.
356 root 1.2
357     The C<ppi> method uses L<PPI> to parse and condense the perl sources. This
358 root 1.18 saves a lot more than just L<Pod::Strip>, and is generally safer,
359     but is also a lot slower (some files take almost a minute to strip -
360     F<staticperl> maintains a cache of stripped files to speed up subsequent
361     runs for this reason). Note that this method doesn't optimise for raw file
362     size, but for best compression (that means that the uncompressed file size
363     is a bit larger, but the files compress better, e.g. with F<upx>).
364 root 1.2
365 root 1.9 Last not least, if you need accurate line numbers in error messages,
366     or in the unlikely case where C<pod> is too slow, or some module gets
367     mistreated, you can specify C<none> to not mangle included perl sources in
368     any way.
369 root 1.2
370     =item --perl
371    
372     After writing out the bundle files, try to link a new perl interpreter. It
373     will be called F<perl> and will be left in the current working
374     directory. The bundle files will be removed.
375    
376 root 1.3 This switch is automatically used when F<staticperl> is invoked with the
377 root 1.2 C<mkperl> command (instead of C<mkbundle>):
378    
379     # build a new ./perl with only common::sense in it - very small :)
380     staticperl mkperl -Mcommon::sense
381    
382 root 1.14 =item --app name
383    
384     After writing out the bundle files, try to link a new standalone
385     program. It will be called C<name>, and the bundle files get removed after
386     linking it.
387    
388     The difference to the (mutually exclusive) C<--perl> option is that the
389     binary created by this option will not try to act as a perl interpreter -
390     instead it will simply initialise the perl interpreter, clean it up and
391     exit.
392    
393     This switch is automatically used when F<staticperl> is invoked with the
394     C<mkapp> command (instead of C<mkbundle>):
395    
396     To let it do something useful you I<must> add some boot code, e.g. with
397     the C<--boot> option.
398    
399     Example: create a standalone perl binary that will execute F<appfile> when
400     it is started.
401    
402     staticperl mkbundle --app myexe --boot appfile
403    
404 root 1.2 =item --use module | -Mmodule
405    
406     Include the named module and all direct dependencies. This is done by
407     C<require>'ing the module in a subprocess and tracing which other modules
408     and files it actually loads. If the module uses L<AutoLoader>, then all
409     splitfiles will be included as well.
410    
411     Example: include AnyEvent and AnyEvent::Impl::Perl.
412    
413     staticperl mkbundle --use AnyEvent --use AnyEvent::Impl::Perl
414    
415     Sometimes you want to load old-style "perl libraries" (F<.pl> files), or
416     maybe other weirdly named files. To do that, you need to quote the name in
417 root 1.3 single or double quotes. When given on the command line, you probably need
418 root 1.2 to quote once more to avoid your shell interpreting it. Common cases that
419     need this are F<Config_heavy.pl> and F<utf8_heavy.pl>.
420    
421     Example: include the required files for F<perl -V> to work in all its
422     glory (F<Config.pm> is included automatically by this).
423    
424     # bourne shell
425     staticperl mkbundle --use '"Config_heavy.pl"'
426    
427     # bundle specification file
428     use "Config_heavy.pl"
429    
430     The C<-Mmodule> syntax is included as an alias that might be easier to
431     remember than C<use>. Or maybe it confuses people. Time will tell. Or
432     maybe not. Argh.
433    
434     =item --eval "perl code" | -e "perl code"
435    
436     Sometimes it is easier (or necessary) to specify dependencies using perl
437     code, or maybe one of the modules you use need a special use statement. In
438     that case, you can use C<eval> to execute some perl snippet or set some
439     variables or whatever you need. All files C<require>'d or C<use>'d in the
440     script are included in the final bundle.
441    
442     Keep in mind that F<mkbundle> will only C<require> the modules named
443     by the C<--use> option, so do not expect the symbols from modules you
444 root 1.3 C<--use>'d earlier on the command line to be available.
445 root 1.2
446     Example: force L<AnyEvent> to detect a backend and therefore include it
447     in the final bundle.
448    
449     staticperl mkbundle --eval 'use AnyEvent; AnyEvent::detect'
450    
451     # or like this
452     staticperl mkbundle -MAnyEvent --eval 'use AnyEvent; AnyEvent::detect'
453    
454     Example: use a separate "bootstrap" script that C<use>'s lots of modules
455     and include this in the final bundle, to be executed automatically.
456    
457     staticperl mkbundle --eval 'do "bootstrap"' --boot bootstrap
458    
459     =item --boot filename
460    
461     Include the given file in the bundle and arrange for it to be executed
462     (using a C<require>) before anything else when the new perl is
463     initialised. This can be used to modify C<@INC> or anything else before
464 root 1.3 the perl interpreter executes scripts given on the command line (or via
465 root 1.2 C<-e>). This works even in an embedded interpreter.
466    
467 root 1.20 =item --usepacklist
468    
469     Read F<.packlist> files for each distribution that happens to match a
470     module name you specified. Sounds weird, and it is, so expect semantics to
471     change somehow in the future.
472    
473     The idea is that most CPAN distributions have a F<.pm> file that matches
474     the name of the distribution (which is rather reasonable after all).
475    
476     If this switch is enabled, then if any of the F<.pm> files that have been
477     selected match an install distribution, then all F<.pm>, F<.pl>, F<.al>
478     and F<.ix> files installed by this distribution are also included.
479    
480     For example, using this switch, when the L<URI> module is specified, then
481     all L<URI> submodules that have been installed via the CPAN distribution
482     are included as well, so you don't have to manually specify them.
483    
484 root 1.18 =item --incglob pattern
485    
486     This goes through all library directories and tries to match any F<.pm>
487     and F<.pl> files against the extended glob pattern (see below). If a file
488     matches, it is added. This switch will automatically detect L<AutoLoader>
489     files and the required link libraries for XS modules, but it will I<not>
490     scan the file for dependencies (at the moment).
491    
492     This is mainly useful to include "everything":
493    
494     --incglob '*'
495    
496     Or to include perl libraries, or trees of those, such as the unicode
497     database files needed by many other modules:
498    
499     --incglob '/unicore/**.pl'
500    
501     =item --add file | --add "file alias"
502 root 1.2
503     Adds the given (perl) file into the bundle (and optionally call it
504     "alias"). This is useful to include any custom files into the bundle.
505    
506     Example: embed the file F<httpd> as F<httpd.pm> when creating the bundle.
507    
508     staticperl mkperl --add "httpd httpd.pm"
509    
510     It is also a great way to add any custom modules:
511    
512     # specification file
513     add file1 myfiles/file1
514     add file2 myfiles/file2
515     add file3 myfiles/file3
516    
517 root 1.18 =item --binadd file | --add "file alias"
518 root 1.10
519     Just like C<--add>, except that it treats the file as binary and adds it
520     without any processing.
521    
522     You should probably add a C</> prefix to avoid clashing with embedded
523     perl files (whose paths do not start with C</>), and/or use a special
524     directory, such as C</res/name>.
525    
526     You can later get a copy of these files by calling C<staticperl::find
527     "alias">.
528    
529 root 1.18 =item --include pattern | -i pattern | --exclude pattern | -x pattern
530    
531     These two options define an include/exclude filter that is used after all
532     files selected by the other options have been found. Each include/exclude
533     is applied to all files found so far - an include makes sure that the
534     given files will be part of the resulting file set, an exclude will
535     exclude files. The patterns are "extended glob patterns" (see below).
536    
537     For example, to include everything, except C<Devel> modules, but still
538     include F<Devel::PPPort>, you could use this:
539    
540     --incglob '*' -i '/Devel/PPPort.pm' -x '/Devel/**'
541    
542 root 1.2 =item --static
543    
544     When C<--perl> is also given, link statically instead of dynamically. The
545     default is to link the new perl interpreter fully dynamic (that means all
546     perl modules are linked statically, but all external libraries are still
547     referenced dynamically).
548    
549     Keep in mind that Solaris doesn't support static linking at all, and
550     systems based on GNU libc don't really support it in a usable fashion
551     either. Try uClibc if you want to create fully statically linked
552     executables, or try the C<--staticlibs> option to link only some libraries
553     statically.
554    
555 root 1.18 =item --staticlib libname
556    
557     When not linking fully statically, this option allows you to link specific
558     libraries statically. What it does is simply replace all occurances of
559     C<-llibname> with the GCC-specific C<-Wl,-Bstatic -llibname -Wl,-Bdynamic>
560     option.
561    
562     This will have no effect unless the library is actually linked against,
563     specifically, C<--staticlib> will not link against the named library
564     unless it would be linked against anyway.
565    
566     Example: link libcrypt statically into the binary.
567    
568     staticperl mkperl -MIO::AIO --staticlib crypt
569    
570     # ldopts might nwo contain:
571     # -lm -Wl,-Bstatic -lcrypt -Wl,-Bdynamic -lpthread
572    
573 root 1.2 =item any other argument
574    
575     Any other argument is interpreted as a bundle specification file, which
576     supports most long options (without extra quoting), one option per line.
577 root 1.1
578     =back
579    
580 root 1.18 =head3 EXTENDED GLOB PATTERNS
581    
582     Some options of F<staticperl mkbundle> expect an I<extended glob
583     pattern>. This is neither a normal shell glob nor a regex, but something
584     in between. The idea has been copied from rsync, and there are the current
585     matching rules:
586    
587     =over 4
588    
589     =item Patterns starting with F</> will be a anchored at the root of the library tree.
590    
591     That is, F</unicore> will match the F<unicore> directory in C<@INC>, but
592     nothing inside, and neither any other file or directory called F<unicore>
593     anywhere else in the hierarchy.
594    
595     =item Patterns not starting with F</> will be anchored at the end of the path.
596    
597     That is, F<idna.pl> will match any file called F<idna.pl> anywhere in the
598     hierarchy, but not any directories of the same name.
599    
600     =item A F<*> matches any single component.
601    
602     That is, F</unicore/*.pl> would match all F<.pl> files directly inside
603     C</unicore>, not any deeper level F<.pl> files. Or in other words, F<*>
604     will not match slashes.
605    
606     =item A F<**> matches anything.
607    
608     That is, F</unicore/**.pl> would match all F<.pl> files under F</unicore>,
609     no matter how deeply nested they are inside subdirectories.
610    
611     =item A F<?> matches a single character within a component.
612    
613     That is, F</Encode/??.pm> matches F</Encode/JP.pm>, but not the
614     hypothetical F</Encode/J/.pm>, as F<?> does not match F</>.
615    
616     =back
617    
618 root 1.15 =head2 F<STATICPERL> CONFIGURATION AND HOOKS
619 root 1.1
620 root 1.20 During (each) startup, F<staticperl> tries to source some shell files to
621     allow you to fine-tune/override configuration settings.
622    
623     In them you can override shell variables, or define shell functions
624     ("hooks") to be called at specific phases during installation. For
625     example, you could define a C<postinstall> hook to install additional
626     modules from CPAN each time you start from scratch.
627    
628     If the env variable C<$STATICPERLRC> is set, then F<staticperl> will try
629     to source the file named with it only. Otherwise, it tries the following
630     shell files in order:
631 root 1.2
632     /etc/staticperlrc
633     ~/.staticperlrc
634     $STATICPERL/rc
635    
636     Note that the last file is erased during F<staticperl distclean>, so
637     generally should not be used.
638    
639     =head3 CONFIGURATION VARIABLES
640    
641     =head4 Variables you I<should> override
642    
643     =over 4
644    
645     =item C<EMAIL>
646    
647     The e-mail address of the person who built this binary. Has no good
648     default, so should be specified by you.
649    
650     =item C<CPAN>
651    
652     The URL of the CPAN mirror to use (e.g. L<http://mirror.netcologne.de/cpan/>).
653    
654 root 1.6 =item C<EXTRA_MODULES>
655 root 1.2
656 root 1.6 Additional modules installed during F<staticperl install>. Here you can
657     set which modules you want have to installed from CPAN.
658 root 1.2
659 root 1.11 Example: I really really need EV, AnyEvent, Coro and AnyEvent::AIO.
660 root 1.2
661 root 1.11 EXTRA_MODULES="EV AnyEvent Coro AnyEvent::AIO"
662 root 1.2
663 root 1.6 Note that you can also use a C<postinstall> hook to achieve this, and
664     more.
665 root 1.2
666 root 1.11 =back
667    
668     =head4 Variables you might I<want> to override
669    
670     =over 4
671    
672     =item C<STATICPERL>
673    
674     The directory where staticperl stores all its files
675     (default: F<~/.staticperl>).
676    
677 root 1.6 =item C<PERL_MM_USE_DEFAULT>, C<EV_EXTRA_DEFS>, ...
678 root 1.2
679     Usually set to C<1> to make modules "less inquisitive" during their
680     installation, you can set any environment variable you want - some modules
681     (such as L<Coro> or L<EV>) use environment variables for further tweaking.
682    
683 root 1.11 =item C<PERL_VERSION>
684 root 1.6
685 root 1.11 The perl version to install - default is currently C<5.12.2>, but C<5.8.9>
686     is also a good choice (5.8.9 is much smaller than 5.12.2, while 5.10.1 is
687     about as big as 5.12.2).
688 root 1.2
689 root 1.11 =item C<PERL_PREFIX>
690 root 1.2
691 root 1.6 The prefix where perl gets installed (default: F<$STATICPERL/perl>),
692     i.e. where the F<bin> and F<lib> subdirectories will end up.
693 root 1.2
694 root 1.10 =item C<PERL_CONFIGURE>
695    
696     Additional Configure options - these are simply passed to the perl
697     Configure script. For example, if you wanted to enable dynamic loading,
698     you could pass C<-Dusedl>. To enable ithreads (Why would you want that
699     insanity? Don't! Use L<forks> instead!) you would pass C<-Duseithreads>
700     and so on.
701    
702     More commonly, you would either activate 64 bit integer support
703     (C<-Duse64bitint>), or disable large files support (-Uuselargefiles), to
704     reduce filesize further.
705    
706 root 1.6 =item C<PERL_CPPFLAGS>, C<PERL_OPTIMIZE>, C<PERL_LDFLAGS>, C<PERL_LIBS>
707 root 1.2
708 root 1.6 These flags are passed to perl's F<Configure> script, and are generally
709     optimised for small size (at the cost of performance). Since they also
710     contain subtle workarounds around various build issues, changing these
711     usually requires understanding their default values - best look at the top
712     of the F<staticperl> script for more info on these.
713 root 1.2
714     =back
715    
716 root 1.5 =head4 Variables you probably I<do not want> to override
717 root 1.2
718     =over 4
719    
720     =item C<MKBUNDLE>
721    
722     Where F<staticperl> writes the C<mkbundle> command to
723     (default: F<$STATICPERL/mkbundle>).
724    
725     =item C<STATICPERL_MODULES>
726    
727     Additional modules needed by C<mkbundle> - should therefore not be changed
728     unless you know what you are doing.
729    
730     =back
731    
732     =head3 OVERRIDABLE HOOKS
733    
734     In addition to environment variables, it is possible to provide some
735     shell functions that are called at specific times. To provide your own
736 root 1.3 commands, just define the corresponding function.
737 root 1.2
738     Example: install extra modules from CPAN and from some directories
739     at F<staticperl install> time.
740    
741     postinstall() {
742 root 1.4 rm -rf lib/threads* # weg mit Schaden
743 root 1.2 instcpan IO::AIO EV
744     instsrc ~/src/AnyEvent
745     instsrc ~/src/XML-Sablotron-1.0100001
746 root 1.4 instcpan Anyevent::AIO AnyEvent::HTTPD
747 root 1.2 }
748    
749     =over 4
750    
751 root 1.12 =item preconfigure
752    
753     Called just before running F<./Configur> in the perl source
754     directory. Current working directory is the perl source directory.
755    
756     This can be used to set any C<PERL_xxx> variables, which might be costly
757     to compute.
758    
759 root 1.2 =item postconfigure
760    
761     Called after configuring, but before building perl. Current working
762     directory is the perl source directory.
763    
764 root 1.12 Could be used to tailor/patch config.sh (followed by F<sh Configure -S>)
765     or do any other modifications.
766 root 1.2
767     =item postbuild
768    
769     Called after building, but before installing perl. Current working
770     directory is the perl source directory.
771    
772     I have no clue what this could be used for - tell me.
773    
774     =item postinstall
775    
776     Called after perl and any extra modules have been installed in C<$PREFIX>,
777     but before setting the "installation O.K." flag.
778    
779     The current working directory is C<$PREFIX>, but maybe you should not rely
780     on that.
781    
782     This hook is most useful to customise the installation, by deleting files,
783     or installing extra modules using the C<instcpan> or C<instsrc> functions.
784    
785     The script must return with a zero exit status, or the installation will
786     fail.
787    
788     =back
789 root 1.1
790 root 1.9 =head1 ANATOMY OF A BUNDLE
791    
792     When not building a new perl binary, C<mkbundle> will leave a number of
793     files in the current working directory, which can be used to embed a perl
794     interpreter in your program.
795    
796     Intimate knowledge of L<perlembed> and preferably some experience with
797     embedding perl is highly recommended.
798    
799     C<mkperl> (or the C<--perl> option) basically does this to link the new
800     interpreter (it also adds a main program to F<bundle.>):
801    
802     $Config{cc} $(cat bundle.ccopts) -o perl bundle.c $(cat bundle.ldopts)
803    
804     =over 4
805    
806     =item bundle.h
807    
808     A header file that contains the prototypes of the few symbols "exported"
809     by bundle.c, and also exposes the perl headers to the application.
810    
811     =over 4
812    
813     =item staticperl_init ()
814    
815     Initialises the perl interpreter. You can use the normal perl functions
816     after calling this function, for example, to define extra functions or
817     to load a .pm file that contains some initialisation code, or the main
818     program function:
819    
820     XS (xsfunction)
821     {
822     dXSARGS;
823    
824     // now we have items, ST(i) etc.
825     }
826    
827     static void
828     run_myapp(void)
829     {
830     staticperl_init ();
831     newXSproto ("myapp::xsfunction", xsfunction, __FILE__, "$$;$");
832     eval_pv ("require myapp::main", 1); // executes "myapp/main.pm"
833     }
834    
835     =item staticperl_xs_init (pTHX)
836    
837     Sometimes you need direct control over C<perl_parse> and C<perl_run>, in
838     which case you do not want to use C<staticperl_init> but call them on your
839     own.
840    
841     Then you need this function - either pass it directly as the C<xs_init>
842     function to C<perl_parse>, or call it from your own C<xs_init> function.
843    
844     =item staticperl_cleanup ()
845    
846     In the unlikely case that you want to destroy the perl interpreter, here
847     is the corresponding function.
848    
849     =item PerlInterpreter *staticperl
850    
851     The perl interpreter pointer used by staticperl. Not normally so useful,
852     but there it is.
853    
854     =back
855    
856     =item bundle.ccopts
857    
858     Contains the compiler options required to compile at least F<bundle.c> and
859     any file that includes F<bundle.h> - you should probably use it in your
860     C<CFLAGS>.
861    
862     =item bundle.ldopts
863    
864     The linker options needed to link the final program.
865    
866     =back
867    
868     =head1 RUNTIME FUNCTIONALITY
869    
870     Binaries created with C<mkbundle>/C<mkperl> contain extra functions, which
871     are required to access the bundled perl sources, but might be useful for
872     other purposes.
873    
874     In addition, for the embedded loading of perl files to work, F<staticperl>
875     overrides the C<@INC> array.
876    
877     =over 4
878    
879     =item $file = staticperl::find $path
880    
881     Returns the data associated with the given C<$path>
882     (e.g. C<Digest/MD5.pm>, C<auto/POSIX/autosplit.ix>), which is basically
883     the UNIX path relative to the perl library directory.
884    
885     Returns C<undef> if the file isn't embedded.
886    
887     =item @paths = staticperl::list
888    
889     Returns the list of all paths embedded in this binary.
890    
891     =back
892    
893     =head1 FULLY STATIC BINARIES - BUILDROOT
894    
895 root 1.10 To make truly static (Linux-) libraries, you might want to have a look at
896 root 1.9 buildroot (L<http://buildroot.uclibc.org/>).
897    
898     Buildroot is primarily meant to set up a cross-compile environment (which
899     is not so useful as perl doesn't quite like cross compiles), but it can also compile
900     a chroot environment where you can use F<staticperl>.
901    
902     To do so, download buildroot, and enable "Build options => development
903     files in target filesystem" and optionally "Build options => gcc
904     optimization level (optimize for size)". At the time of writing, I had
905     good experiences with GCC 4.4.x but not GCC 4.5.
906    
907     To minimise code size, I used C<-pipe -ffunction-sections -fdata-sections
908     -finline-limit=8 -fno-builtin-strlen -mtune=i386>. The C<-mtune=i386>
909     doesn't decrease codesize much, but it makes the file much more
910     compressible.
911    
912     If you don't need Coro or threads, you can go with "linuxthreads.old" (or
913     no thread support). For Coro, it is highly recommended to switch to a
914     uClibc newer than 0.9.31 (at the time of this writing, I used the 20101201
915     snapshot) and enable NPTL, otherwise Coro needs to be configured with the
916     ultra-slow pthreads backend to work around linuxthreads bugs (it also uses
917     twice the address space needed for stacks).
918    
919 root 1.10 If you use C<linuxthreads.old>, then you should also be aware that
920     uClibc shares C<errno> between all threads when statically linking. See
921     L<http://lists.uclibc.org/pipermail/uclibc/2010-June/044157.html> for a
922     workaround (And L<https://bugs.uclibc.org/2089> for discussion).
923    
924 root 1.11 C<ccache> support is also recommended, especially if you want
925     to play around with buildroot options. Enabling the C<miniperl>
926     package will probably enable all options required for a successful
927     perl build. F<staticperl> itself additionally needs either C<wget>
928     (recommended, for CPAN) or C<curl>.
929 root 1.9
930     As for shells, busybox should provide all that is needed, but the default
931     busybox configuration doesn't include F<comm> which is needed by perl -
932     either make a custom busybox config, or compile coreutils.
933    
934     For the latter route, you might find that bash has some bugs that keep
935     it from working properly in a chroot - either use dash (and link it to
936     F</bin/sh> inside the chroot) or link busybox to F</bin/sh>, using it's
937     built-in ash shell.
938    
939     Finally, you need F</dev/null> inside the chroot for many scripts to work
940     - F<cp /dev/null output/target/dev> or bind-mounting your F</dev> will
941     both provide this.
942    
943     After you have compiled and set up your buildroot target, you can copy
944     F<staticperl> from the C<App::Staticperl> distribution or from your
945     perl f<bin> directory (if you installed it) into the F<output/target>
946     filesystem, chroot inside and run it.
947    
948 root 1.17 =head1 RECIPES / SPECIFIC MODULES
949    
950     This section contains some common(?) recipes and information about
951     problems with some common modules or perl constructs that require extra
952     files to be included.
953    
954     =head2 MODULES
955    
956     =over 4
957    
958     =item utf8
959    
960     Some functionality in the utf8 module, such as swash handling (used
961     for unicode character ranges in regexes) is implemented in the
962 root 1.18 C<"utf8_heavy.pl"> library:
963    
964     -M'"utf8_heavy.pl"'
965 root 1.17
966     Many Unicode properties in turn are defined in separate modules,
967     such as C<"unicore/Heavy.pl"> and more specific data tables such as
968 root 1.18 C<"unicore/To/Digit.pl"> or C<"unicore/lib/Perl/Word.pl">. These tables
969     are big (7MB uncompressed, although F<staticperl> contains special
970     handling for those files), so including them on demand by your application
971     only might pay off.
972 root 1.17
973 root 1.18 To simply include the whole unicode database, use:
974 root 1.17
975 root 1.18 --incglob '/unicore/*.pl'
976 root 1.17
977     =item AnyEvent
978    
979     AnyEvent needs a backend implementation that it will load in a delayed
980     fashion. The L<AnyEvent::Impl::Perl> backend is the default choice
981     for AnyEvent if it can't find anything else, and is usually a safe
982     fallback. If you plan to use e.g. L<EV> (L<POE>...), then you need to
983     include the L<AnyEvent::Impl::EV> (L<AnyEvent::Impl::POE>...) backend as
984     well.
985    
986     If you want to handle IRIs or IDNs (L<AnyEvent::Util> punycode and idn
987     functions), you also need to include C<"AnyEvent/Util/idna.pl"> and
988     C<"AnyEvent/Util/uts46data.pl">.
989    
990 root 1.20 Or you can use C<--usepacklist> and specify C<-MAnyEvent> to include
991     everything.
992    
993 root 1.18 =item Carp
994    
995     Carp had (in older versions of perl) a dependency on L<Carp::Heavy>. As of
996     perl 5.12.2 (maybe earlier), this dependency no longer exists.
997    
998     =item Config
999    
1000     The F<perl -V> switch (as well as many modules) needs L<Config>, which in
1001     turn might need L<"Config_heavy.pl">. Including the latter gives you
1002     both.
1003    
1004     =item Term::ReadLine::Perl
1005    
1006 root 1.20 Also needs L<Term::ReadLine::readline>, or C<--usepacklist>.
1007 root 1.18
1008 root 1.17 =item URI
1009    
1010     URI implements schemes as separate modules - the generic URL scheme is
1011     implemented in L<URI::_generic>, HTTP is implemented in L<URI::http>. If
1012 root 1.20 you need to use any of these schemes, you should include these manually,
1013     or use C<--usepacklist>.
1014 root 1.17
1015     =back
1016    
1017     =head2 RECIPES
1018    
1019     =over 4
1020    
1021 root 1.18 =item Linking everything in
1022    
1023     To link just about everything installed in the perl library into a new
1024     perl, try this:
1025    
1026     staticperl mkperl --strip ppi --incglob '*'
1027    
1028 root 1.17 =item Getting rid of netdb function
1029    
1030     The perl core has lots of netdb functions (C<getnetbyname>, C<getgrent>
1031     and so on) that few applications use. You can avoid compiling them in by
1032     putting the following fragment into a C<preconfigure> hook:
1033    
1034     preconfigure() {
1035     for sym in \
1036     d_getgrnam_r d_endgrent d_endgrent_r d_endhent \
1037     d_endhostent_r d_endnent d_endnetent_r d_endpent \
1038     d_endprotoent_r d_endpwent d_endpwent_r d_endsent \
1039     d_endservent_r d_getgrent d_getgrent_r d_getgrgid_r \
1040     d_getgrnam_r d_gethbyaddr d_gethent d_getsbyport \
1041     d_gethostbyaddr_r d_gethostbyname_r d_gethostent_r \
1042     d_getlogin_r d_getnbyaddr d_getnbyname d_getnent \
1043     d_getnetbyaddr_r d_getnetbyname_r d_getnetent_r \
1044     d_getpent d_getpbyname d_getpbynumber d_getprotobyname_r \
1045     d_getprotobynumber_r d_getprotoent_r d_getpwent \
1046     d_getpwent_r d_getpwnam_r d_getpwuid_r d_getsent \
1047     d_getservbyname_r d_getservbyport_r d_getservent_r \
1048     d_getspnam_r d_getsbyname
1049     # d_gethbyname
1050     do
1051     PERL_CONFIGURE="$PERL_CONFIGURE -U$sym"
1052     done
1053     }
1054    
1055     This mostly gains space when linking staticaly, as the functions will
1056     liekly not be linked in. The gain for dynamically-linked binaries is
1057     smaller.
1058    
1059     Also, this leaves C<gethostbyname> in - not only is it actually used
1060     often, the L<Socket> module also exposes it, so leaving it out usually
1061     gains little. Why Socket exposes a C function that is in the core already
1062     is anybody's guess.
1063    
1064     =back
1065    
1066 root 1.1 =head1 AUTHOR
1067    
1068     Marc Lehmann <schmorp@schmorp.de>
1069     http://software.schmorp.de/pkg/staticperl.html