ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/App-Staticperl/staticperl.pod
Revision: 1.27
Committed: Tue Dec 21 19:32:34 2010 UTC (13 years, 6 months ago) by root
Branch: MAIN
Changes since 1.26: +20 -15 lines
Log Message:
*** empty log message ***

File Contents

# Content
1 =head1 NAME
2
3 staticperl - perl, libc, 100 modules, all in one 500kb file
4
5 =head1 SYNOPSIS
6
7 staticperl help # print the embedded documentation
8 staticperl fetch # fetch and unpack perl sources
9 staticperl configure # fetch and then configure perl
10 staticperl build # configure and then build perl
11 staticperl install # build and then install perl
12 staticperl clean # clean most intermediate files (restart at configure)
13 staticperl distclean # delete everything installed by this script
14 staticperl cpan # invoke CPAN shell
15 staticperl instmod path... # install unpacked modules
16 staticperl instcpan modulename... # install modules from CPAN
17 staticperl mkbundle <bundle-args...> # see documentation
18 staticperl mkperl <bundle-args...> # see documentation
19 staticperl mkapp appname <bundle-args...> # see documentation
20
21 Typical Examples:
22
23 staticperl install # fetch, configure, build and install perl
24 staticperl cpan # run interactive cpan shell
25 staticperl mkperl -M '"Config_heavy.pl"' # build a perl that supports -V
26 staticperl mkperl -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI -MURI::http
27 # build a perl with the above modules linked in
28 staticperl mkapp myapp --boot mainprog mymodules
29 # build a binary "myapp" from mainprog and mymodules
30
31 =head1 DESCRIPTION
32
33 This script helps you to create single-file perl interpreters
34 or applications, or embedding a perl interpreter in your
35 applications. Single-file means that it is fully self-contained - no
36 separate shared objects, no autoload fragments, no .pm or .pl files are
37 needed. And when linking statically, you can create (or embed) a single
38 file that contains perl interpreter, libc, all the modules you need, all
39 the libraries you need and of course your actual program.
40
41 With F<uClibc> and F<upx> on x86, you can create a single 500kb binary
42 that contains perl and 100 modules such as POSIX, AnyEvent, EV, IO::AIO,
43 Coro and so on. Or any other choice of modules.
44
45 To see how this turns out, you can try out smallperl and bigperl, two
46 pre-built static and compressed perl binaries with many and even more
47 modules: just follow the links at L<http://staticperl.schmorp.de/>.
48
49 The created files do not need write access to the file system (like PAR
50 does). In fact, since this script is in many ways similar to PAR::Packer,
51 here are the differences:
52
53 =over 4
54
55 =item * The generated executables are much smaller than PAR created ones.
56
57 Shared objects and the perl binary contain a lot of extra info, while
58 the static nature of F<staticperl> allows the linker to remove all
59 functionality and meta-info not required by the final executable. Even
60 extensions statically compiled into perl at build time will only be
61 present in the final executable when needed.
62
63 In addition, F<staticperl> can strip perl sources much more effectively
64 than PAR.
65
66 =item * The generated executables start much faster.
67
68 There is no need to unpack files, or even to parse Zip archives (which is
69 slow and memory-consuming business).
70
71 =item * The generated executables don't need a writable filesystem.
72
73 F<staticperl> loads all required files directly from memory. There is no
74 need to unpack files into a temporary directory.
75
76 =item * More control over included files, more burden.
77
78 PAR tries to be maintenance and hassle-free - it tries to include more
79 files than necessary to make sure everything works out of the box. It
80 mostly succeeds at this, but he extra files (such as the unicode database)
81 can take substantial amounts of memory and file size.
82
83 With F<staticperl>, the burden is mostly with the developer - only direct
84 compile-time dependencies and L<AutoLoader> are handled automatically.
85 This means the modules to include often need to be tweaked manually.
86
87 All this does not preclude more permissive modes to be implemented in
88 the future, but right now, you have to resolve state hidden dependencies
89 manually.
90
91 =item * PAR works out of the box, F<staticperl> does not.
92
93 Maintaining your own custom perl build can be a pain in the ass, and while
94 F<staticperl> tries to make this easy, it still requires a custom perl
95 build and possibly fiddling with some modules. PAR is likely to produce
96 results faster.
97
98 Ok, PAR never has worked for me out of the box, and for some people,
99 F<staticperl> does work out of the box, as they don't count "fiddling with
100 module use lists" against it, but nevertheless, F<staticperl> is certainly
101 a bit more difficult to use.
102
103 =back
104
105 =head1 HOW DOES IT WORK?
106
107 Simple: F<staticperl> downloads, compile and installs a perl version of
108 your choice in F<~/.staticperl>. You can add extra modules either by
109 letting F<staticperl> install them for you automatically, or by using CPAN
110 and doing it interactively. This usually takes 5-10 minutes, depending on
111 the speed of your computer and your internet connection.
112
113 It is possible to do program development at this stage, too.
114
115 Afterwards, you create a list of files and modules you want to include,
116 and then either build a new perl binary (that acts just like a normal perl
117 except everything is compiled in), or you create bundle files (basically C
118 sources you can use to embed all files into your project).
119
120 This step is very fast (a few seconds if PPI is not used for stripping, or
121 the stripped files are in the cache), and can be tweaked and repeated as
122 often as necessary.
123
124 =head1 THE F<STATICPERL> SCRIPT
125
126 This module installs a script called F<staticperl> into your perl
127 binary directory. The script is fully self-contained, and can be
128 used without perl (for example, in an uClibc chroot environment). In
129 fact, it can be extracted from the C<App::Staticperl> distribution
130 tarball as F<bin/staticperl>, without any installation. The
131 newest (possibly alpha) version can also be downloaded from
132 L<http://staticperl.schmorp.de/staticperl>.
133
134 F<staticperl> interprets the first argument as a command to execute,
135 optionally followed by any parameters.
136
137 There are two command categories: the "phase 1" commands which deal with
138 installing perl and perl modules, and the "phase 2" commands, which deal
139 with creating binaries and bundle files.
140
141 =head2 PHASE 1 COMMANDS: INSTALLING PERL
142
143 The most important command is F<install>, which does basically
144 everything. The default is to download and install perl 5.12.2 and a few
145 modules required by F<staticperl> itself, but all this can (and should) be
146 changed - see L<CONFIGURATION>, below.
147
148 The command
149
150 staticperl install
151
152 is normally all you need: It installs the perl interpreter in
153 F<~/.staticperl/perl>. It downloads, configures, builds and installs the
154 perl interpreter if required.
155
156 Most of the following F<staticperl> subcommands simply run one or more
157 steps of this sequence.
158
159 If it fails, then most commonly because the compiler options I selected
160 are not supported by your compiler - either edit the F<staticperl> script
161 yourself or create F<~/.staticperl> shell script where your set working
162 C<PERL_CCFLAGS> etc. variables.
163
164 To force recompilation or reinstallation, you need to run F<staticperl
165 distclean> first.
166
167 =over 4
168
169 =item F<staticperl version>
170
171 Prints some info about the version of the F<staticperl> script you are using.
172
173 =item F<staticperl fetch>
174
175 Runs only the download and unpack phase, unless this has already happened.
176
177 =item F<staticperl configure>
178
179 Configures the unpacked perl sources, potentially after downloading them first.
180
181 =item F<staticperl build>
182
183 Builds the configured perl sources, potentially after automatically
184 configuring them.
185
186 =item F<staticperl install>
187
188 Wipes the perl installation directory (usually F<~/.staticperl/perl>) and
189 installs the perl distribution, potentially after building it first.
190
191 =item F<staticperl cpan> [args...]
192
193 Starts an interactive CPAN shell that you can use to install further
194 modules. Installs the perl first if necessary, but apart from that,
195 no magic is involved: you could just as well run it manually via
196 F<~/.staticperl/perl/bin/cpan>.
197
198 Any additional arguments are simply passed to the F<cpan> command.
199
200 =item F<staticperl instcpan> module...
201
202 Tries to install all the modules given and their dependencies, using CPAN.
203
204 Example:
205
206 staticperl instcpan EV AnyEvent::HTTPD Coro
207
208 =item F<staticperl instsrc> directory...
209
210 In the unlikely case that you have unpacked perl modules around and want
211 to install from these instead of from CPAN, you can do this using this
212 command by specifying all the directories with modules in them that you
213 want to have built.
214
215 =item F<staticperl clean>
216
217 Deletes the perl source directory (and potentially cleans up other
218 intermediate files). This can be used to clean up files only needed for
219 building perl, without removing the installed perl interpreter.
220
221 At the moment, it doesn't delete downloaded tarballs.
222
223 The exact semantics of this command will probably change.
224
225 =item F<staticperl distclean>
226
227 This wipes your complete F<~/.staticperl> directory. Be careful with this,
228 it nukes your perl download, perl sources, perl distribution and any
229 installed modules. It is useful if you wish to start over "from scratch"
230 or when you want to uninstall F<staticperl>.
231
232 =back
233
234 =head2 PHASE 2 COMMANDS: BUILDING PERL BUNDLES
235
236 Building (linking) a new F<perl> binary is handled by a separate
237 script. To make it easy to use F<staticperl> from a F<chroot>, the script
238 is embedded into F<staticperl>, which will write it out and call for you
239 with any arguments you pass:
240
241 staticperl mkbundle mkbundle-args...
242
243 In the oh so unlikely case of something not working here, you
244 can run the script manually as well (by default it is written to
245 F<~/.staticperl/mkbundle>).
246
247 F<mkbundle> is a more conventional command and expect the argument
248 syntax commonly used on UNIX clones. For example, this command builds
249 a new F<perl> binary and includes F<Config.pm> (for F<perl -V>),
250 F<AnyEvent::HTTPD>, F<URI> and a custom F<httpd> script (from F<eg/httpd>
251 in this distribution):
252
253 # first make sure we have perl and the required modules
254 staticperl instcpan AnyEvent::HTTPD
255
256 # now build the perl
257 staticperl mkperl -M'"Config_heavy.pl"' -MAnyEvent::Impl::Perl \
258 -MAnyEvent::HTTPD -MURI::http \
259 --add 'eg/httpd httpd.pm'
260
261 # finally, invoke it
262 ./perl -Mhttpd
263
264 As you can see, things are not quite as trivial: the L<Config> module has
265 a hidden dependency which is not even a perl module (F<Config_heavy.pl>),
266 L<AnyEvent> needs at least one event loop backend that we have to
267 specify manually (here L<AnyEvent::Impl::Perl>), and the F<URI> module
268 (required by L<AnyEvent::HTTPD>) implements various URI schemes as extra
269 modules - since L<AnyEvent::HTTPD> only needs C<http> URIs, we only need
270 to include that module. I found out about these dependencies by carefully
271 watching any error messages about missing modules...
272
273 Instead of building a new perl binary, you can also build a standalone
274 application:
275
276 # build the app
277 staticperl mkapp app --boot eg/httpd \
278 -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI::http
279
280 # run it
281 ./app
282
283 Here are the three phase 2 commands:
284
285 =over 4
286
287 =item F<staticperl mkbundle> args...
288
289 The "default" bundle command - it interprets the given bundle options and
290 writes out F<bundle.h>, F<bundle.c>, F<bundle.ccopts> and F<bundle.ldopts>
291 files, useful for embedding.
292
293 =item F<staticperl mkperl> args...
294
295 Creates a bundle just like F<staticperl mkbundle> (in fact, it's the same
296 as invoking F<staticperl mkbundle --perl> args...), but then compiles and
297 links a new perl interpreter that embeds the created bundle, then deletes
298 all intermediate files.
299
300 =item F<staticperl mkapp> filename args...
301
302 Does the same as F<staticperl mkbundle> (in fact, it's the same as
303 invoking F<staticperl mkbundle --app> filename args...), but then compiles
304 and links a new standalone application that simply initialises the perl
305 interpreter.
306
307 The difference to F<staticperl mkperl> is that the standalone application
308 does not act like a perl interpreter would - in fact, by default it would
309 just do nothing and exit immediately, so you should specify some code to
310 be executed via the F<--boot> option.
311
312 =back
313
314 =head3 OPTION PROCESSING
315
316 All options can be given as arguments on the command line (typically
317 using long (e.g. C<--verbose>) or short option (e.g. C<-v>) style). Since
318 specifying a lot of options can make the command line very long and
319 unwieldy, you can put all long options into a "bundle specification file"
320 (one option per line, with or without C<--> prefix) and specify this
321 bundle file instead.
322
323 For example, the command given earlier to link a new F<perl> could also
324 look like this:
325
326 staticperl mkperl httpd.bundle
327
328 With all options stored in the F<httpd.bundle> file (one option per line,
329 everything after the option is an argument):
330
331 use "Config_heavy.pl"
332 use AnyEvent::Impl::Perl
333 use AnyEvent::HTTPD
334 use URI::http
335 add eg/httpd httpd.pm
336
337 All options that specify modules or files to be added are processed in the
338 order given on the command line.
339
340 =head3 BUNDLE CREATION WORKFLOW / STATICPELR MKBUNDLE OPTIONS
341
342 F<staticperl mkbundle> works by first assembling a list of candidate
343 files and modules to include, then filtering them by include/exclude
344 patterns. The remaining modules (together with their direct dependencies,
345 such as link libraries and L<AutoLoader> files) are then converted into
346 bundle files suitable for embedding. F<staticperl mkbundle> can then
347 optionally build a new perl interpreter or a standalone application.
348
349 =over 4
350
351 =item Step 0: Generic argument processing.
352
353 The following options influence F<staticperl mkbundle> itself.
354
355 =over 4
356
357 =item C<--verbose> | C<-v>
358
359 Increases the verbosity level by one (the default is C<1>).
360
361 =item C<--quiet> | C<-q>
362
363 Decreases the verbosity level by one.
364
365 =item any other argument
366
367 Any other argument is interpreted as a bundle specification file, which
368 supports all options (without extra quoting), one option per line, in the
369 format C<option> or C<option argument>. They will effectively be expanded
370 and processed as if they were directly written on the command line, in
371 place of the file name.
372
373 =back
374
375 =item Step 1: gather candidate files and modules
376
377 In this step, modules, perl libraries (F<.pl> files) and other files are
378 selected for inclusion in the bundle. The relevant options are executed
379 in order (this makes a difference mostly for C<--eval>, which can rely on
380 earlier C<--use> options to have been executed).
381
382 =over 4
383
384 =item C<--use> F<module> | C<-M>F<module>
385
386 Include the named module and trace direct dependencies. This is done by
387 C<require>'ing the module in a subprocess and tracing which other modules
388 and files it actually loads.
389
390 Example: include AnyEvent and AnyEvent::Impl::Perl.
391
392 staticperl mkbundle --use AnyEvent --use AnyEvent::Impl::Perl
393
394 Sometimes you want to load old-style "perl libraries" (F<.pl> files), or
395 maybe other weirdly named files. To do that, you need to quote the name in
396 single or double quotes. When given on the command line, you probably need
397 to quote once more to avoid your shell interpreting it. Common cases that
398 need this are F<Config_heavy.pl> and F<utf8_heavy.pl>.
399
400 Example: include the required files for F<perl -V> to work in all its
401 glory (F<Config.pm> is included automatically by this).
402
403 # bourne shell
404 staticperl mkbundle --use '"Config_heavy.pl"'
405
406 # bundle specification file
407 use "Config_heavy.pl"
408
409 The C<-M>module syntax is included as an alias that might be easier to
410 remember than C<--use>. Or maybe it confuses people. Time will tell. Or
411 maybe not. Sigh.
412
413 =item C<--eval> "perl code" | C<-e> "perl code"
414
415 Sometimes it is easier (or necessary) to specify dependencies using perl
416 code, or maybe one of the modules you use need a special use statement. In
417 that case, you can use C<--eval> to execute some perl snippet or set some
418 variables or whatever you need. All files C<require>'d or C<use>'d while
419 executing the snippet are included in the final bundle.
420
421 Keep in mind that F<mkbundle> will only C<require> the modules named
422 by the C<--use> option, so do not expect the symbols from modules you
423 C<--use>'d earlier on the command line to be available.
424
425 Example: force L<AnyEvent> to detect a backend and therefore include it
426 in the final bundle.
427
428 staticperl mkbundle --eval 'use AnyEvent; AnyEvent::detect'
429
430 # or like this
431 staticperl mkbundle -MAnyEvent --eval 'AnyEvent::detect'
432
433 Example: use a separate "bootstrap" script that C<use>'s lots of modules
434 and also include this in the final bundle, to be executed automatically
435 when the interpreter is initialised.
436
437 staticperl mkbundle --eval 'do "bootstrap"' --boot bootstrap
438
439 =item C<--boot> F<filename>
440
441 Include the given file in the bundle and arrange for it to be
442 executed (using C<require>) before the main program when the new perl
443 is initialised. This can be used to modify C<@INC> or do similar
444 modifications before the perl interpreter executes scripts given on the
445 command line (or via C<-e>). This works even in an embedded interpreter -
446 the file will be executed during interpreter initialisation in that case.
447
448 =item C<--incglob> pattern
449
450 This goes through all standard library directories and tries to match any
451 F<.pm> and F<.pl> files against the extended glob pattern (see below). If
452 a file matches, it is added. The pattern is matched against the full path
453 of the file (sans the library directory prefix), e.g. F<Sys/Syslog.pm>.
454
455 This is very useful to include "everything":
456
457 --incglob '*'
458
459 It is also useful for including perl libraries, or trees of those, such as
460 the unicode database files needed by some perl builtins, the regex engine
461 and other modules.
462
463 --incglob '/unicore/**.pl'
464
465 =item C<--add> F<file> | C<--add> "F<file> alias"
466
467 Adds the given (perl) file into the bundle (and optionally call it
468 "alias"). The F<file> is either an absolute path or a path relative to
469 the current directory. If an alias is specified, then this is the name it
470 will use for C<@INC> searches, otherfile the F<file> will be used as the
471 internal name.
472
473 This switch is used to include extra files into the bundle.
474
475 Example: embed the file F<httpd> in the current directory as F<httpd.pm>
476 when creating the bundle.
477
478 staticperl mkperl --add "httpd httpd.pm"
479
480 Example: add local files as extra modules in the bundle.
481
482 # specification file
483 add file1 myfiles/file1.pm
484 add file2 myfiles/file2.pm
485 add file3 myfiles/file3.pl
486
487 # then later, in perl, use
488 use myfiles::file1;
489 require myfiles::file2;
490 my $res = do "myfiles/file3.pl";
491
492 =item C<--binadd> F<file> | C<--add> "F<file> alias"
493
494 Just like C<--add>, except that it treats the file as binary and adds it
495 without any postprocessing (perl files might get stripped to reduce their
496 size).
497
498 You should probably add a C</> prefix to avoid clashing with embedded perl
499 files (whose paths do not start with C</>), and/or use a special directory
500 prefix, such as C</res/name>.
501
502 You can later get a copy of these files by calling C<staticperl::find
503 "alias">.
504
505 An alternative way to embed binary files is to convert them to perl and
506 use C<do> to get the contents - this method is a bit cumbersome, but works
507 both inside and outside of a staticperl bundle:
508
509 # a "binary" file, call it "bindata.pl"
510 <<'SOME_MARKER'
511 binary data NOT containing SOME_MARKER
512 SOME_MARKER
513
514 # load the binary
515 chomp (my $data = do "bindata.pl");
516
517 =back
518
519 =item Step 2: filter all files using C<--include> and C<--exclude> options.
520
521 After all candidate files and modules are added, they are I<filtered>
522 by a combination of C<--include> and C<--exclude> patterns (there is an
523 implicit C<--include **> at the end, so if no filters are specified, all
524 files are included).
525
526 All that this step does is potentially reduce the number of files that are
527 to be included - no new files are added during this step.
528
529 =over 4
530
531 =item C<--include> pattern | C<-i> pattern | C<--exclude> pattern | C<-x> pattern
532
533 These specify an include or exclude pattern to be applied to the candidate
534 file list. An include makes sure that the given files will be part of the
535 resulting file set, an exclude will exclude remaining files. The patterns
536 are "extended glob patterns" (see below).
537
538 The patterns are applied "in order" - files included via earlier
539 C<--include> specifications cannot be removed by any following
540 C<--exclude>, and likewise, and file excluded by an earlier C<--exclude>
541 cannot be added by any following C<--include>.
542
543 For example, to include everything except C<Devel> modules, but still
544 include F<Devel::PPPort>, you could use this:
545
546 --incglob '*' -i '/Devel/PPPort.pm' -x '/Devel/**'
547
548 =back
549
550 =item Step 3: add any extra or "hidden" dependencies.
551
552 F<staticperl> currently knows about three extra types of depdendencies
553 that are added automatically. Only one (F<.packlist> files) is currently
554 optional and can be influenced, the others are always included:
555
556 =over 4
557
558 =item C<--usepacklist>
559
560 Read F<.packlist> files for each distribution that happens to match a
561 module name you specified. Sounds weird, and it is, so expect semantics to
562 change somehow in the future.
563
564 The idea is that most CPAN distributions have a F<.pm> file that matches
565 the name of the distribution (which is rather reasonable after all).
566
567 If this switch is enabled, then if any of the F<.pm> files that have been
568 selected match an install distribution, then all F<.pm>, F<.pl>, F<.al>
569 and F<.ix> files installed by this distribution are also included.
570
571 For example, using this switch, when the L<URI> module is specified, then
572 all L<URI> submodules that have been installed via the CPAN distribution
573 are included as well, so you don't have to manually specify them.
574
575 =item L<AutoLoader> splitfiles
576
577 Some modules use L<AutoLoader> - less commonly (hopefully) used functions
578 are split into separate F<.al> files, and an index (F<.ix>) file contains
579 the prototypes.
580
581 Both F<.ix> and F<.al> files will be detected automatically and added to
582 the bundle.
583
584 =item link libraries (F<.a> files)
585
586 Modules using XS (or any other non-perl language extension compiled at
587 installation time) will have a static archive (typically F<.a>). These
588 will automatically be added to the linker options in F<bundle.ldopts>.
589
590 Should F<staticperl> find a dynamic link library (typically F<.so>) it
591 will warn about it - obviously this shouldn't happen unless you use
592 F<staticperl> on the wrong perl, or one (probably wrongly) configured to
593 use dynamic loading.
594
595 =item extra libraries (F<extralibs.ld>)
596
597 Some modules need linking against external libraries - these are found in
598 F<extralibs.ld> and added to F<bundle.ldopts>.
599
600 =back
601
602 =item Step 4: write bundle files and optionally link a program
603
604 At this point, the select files will be read, processed (stripped) and
605 finally the bundle files get written to disk, and F<staticperl mkbundle>
606 is normally finished. Optionally, it can go a step further and either link
607 a new F<perl> binary with all selected modules and files inside, or build
608 a standalone application.
609
610 Both the contents of the bundle files and any extra linking is controlled
611 by these options:
612
613 =over 4
614
615 =item C<--strip> C<none>|C<pod>|C<ppi>
616
617 Specify the stripping method applied to reduce the file of the perl
618 sources included.
619
620 The default is C<pod>, which uses the L<Pod::Strip> module to remove all
621 pod documentation, which is very fast and reduces file size a lot.
622
623 The C<ppi> method uses L<PPI> to parse and condense the perl sources. This
624 saves a lot more than just L<Pod::Strip>, and is generally safer,
625 but is also a lot slower (some files take almost a minute to strip -
626 F<staticperl> maintains a cache of stripped files to speed up subsequent
627 runs for this reason). Note that this method doesn't optimise for raw file
628 size, but for best compression (that means that the uncompressed file size
629 is a bit larger, but the files compress better, e.g. with F<upx>).
630
631 Last not least, if you need accurate line numbers in error messages,
632 or in the unlikely case where C<pod> is too slow, or some module gets
633 mistreated, you can specify C<none> to not mangle included perl sources in
634 any way.
635
636 =item --perl
637
638 After writing out the bundle files, try to link a new perl interpreter. It
639 will be called F<perl> and will be left in the current working
640 directory. The bundle files will be removed.
641
642 This switch is automatically used when F<staticperl> is invoked with the
643 C<mkperl> command instead of C<mkbundle>.
644
645 Example: build a new F<./perl> binary with only L<common::sense> inside -
646 it will be even smaller than the standard perl interpreter as none of the
647 modules of the base distribution (such as L<Fcntl>) will be included.
648
649 staticperl mkperl -Mcommon::sense
650
651 =item --app name
652
653 After writing out the bundle files, try to link a new standalone
654 program. It will be called C<name>, and the bundle files get removed after
655 linking it.
656
657 This switch is automatically used when F<staticperl> is invoked with the
658 C<mkapp> command instead of C<mkbundle>.
659
660 The difference to the (mutually exclusive) C<--perl> option is that the
661 binary created by this option will not try to act as a perl interpreter -
662 instead it will simply initialise the perl interpreter, clean it up and
663 exit.
664
665 This means that, by default, it will do nothing but burna few CPU cycles
666 - for it to do something useful you I<must> add some boot code, e.g. with
667 the C<--boot> option.
668
669 Example: create a standalone perl binary called F<./myexe> that will
670 execute F<appfile> when it is started.
671
672 staticperl mkbundle --app myexe --boot appfile
673
674 =item --static
675
676 Add C<-static> to F<bundle.ldopts>, which means a fully static (if
677 supported by the OS) executable will be created. This is not immensely
678 useful when just creating the bundle files, but is most useful when
679 linking a binary with the C<--perl> or C<--app> options.
680
681 The default is to link the new binary dynamically (that means all perl
682 modules are linked statically, but all external libraries are still
683 referenced dynamically).
684
685 Keep in mind that Solaris doesn't support static linking at all, and
686 systems based on GNU libc don't really support it in a very usable
687 fashion either. Try uClibc if you want to create fully statically linked
688 executables, or try the C<--staticlib> option to link only some libraries
689 statically.
690
691 =item --staticlib libname
692
693 When not linking fully statically, this option allows you to link specific
694 libraries statically. What it does is simply replace all occurances of
695 C<-llibname> with the GCC-specific C<-Wl,-Bstatic -llibname -Wl,-Bdynamic>
696 option.
697
698 This will have no effect unless the library is actually linked against,
699 specifically, C<--staticlib> will not link against the named library
700 unless it would be linked against anyway.
701
702 Example: link libcrypt statically into the binary.
703
704 staticperl mkperl -MIO::AIO --staticlib crypt
705
706 # ldopts might now contain:
707 # -lm -Wl,-Bstatic -lcrypt -Wl,-Bdynamic -lpthread
708
709 =back
710
711 =back
712
713 =head3 EXTENDED GLOB PATTERNS
714
715 Some options of F<staticperl mkbundle> expect an I<extended glob
716 pattern>. This is neither a normal shell glob nor a regex, but something
717 in between. The idea has been copied from rsync, and there are the current
718 matching rules:
719
720 =over 4
721
722 =item Patterns starting with F</> will be a anchored at the root of the library tree.
723
724 That is, F</unicore> will match the F<unicore> directory in C<@INC>, but
725 nothing inside, and neither any other file or directory called F<unicore>
726 anywhere else in the hierarchy.
727
728 =item Patterns not starting with F</> will be anchored at the end of the path.
729
730 That is, F<idna.pl> will match any file called F<idna.pl> anywhere in the
731 hierarchy, but not any directories of the same name.
732
733 =item A F<*> matches any single component.
734
735 That is, F</unicore/*.pl> would match all F<.pl> files directly inside
736 C</unicore>, not any deeper level F<.pl> files. Or in other words, F<*>
737 will not match slashes.
738
739 =item A F<**> matches anything.
740
741 That is, F</unicore/**.pl> would match all F<.pl> files under F</unicore>,
742 no matter how deeply nested they are inside subdirectories.
743
744 =item A F<?> matches a single character within a component.
745
746 That is, F</Encode/??.pm> matches F</Encode/JP.pm>, but not the
747 hypothetical F</Encode/J/.pm>, as F<?> does not match F</>.
748
749 =back
750
751 =head2 F<STATICPERL> CONFIGURATION AND HOOKS
752
753 During (each) startup, F<staticperl> tries to source some shell files to
754 allow you to fine-tune/override configuration settings.
755
756 In them you can override shell variables, or define shell functions
757 ("hooks") to be called at specific phases during installation. For
758 example, you could define a C<postinstall> hook to install additional
759 modules from CPAN each time you start from scratch.
760
761 If the env variable C<$STATICPERLRC> is set, then F<staticperl> will try
762 to source the file named with it only. Otherwise, it tries the following
763 shell files in order:
764
765 /etc/staticperlrc
766 ~/.staticperlrc
767 $STATICPERL/rc
768
769 Note that the last file is erased during F<staticperl distclean>, so
770 generally should not be used.
771
772 =head3 CONFIGURATION VARIABLES
773
774 =head4 Variables you I<should> override
775
776 =over 4
777
778 =item C<EMAIL>
779
780 The e-mail address of the person who built this binary. Has no good
781 default, so should be specified by you.
782
783 =item C<CPAN>
784
785 The URL of the CPAN mirror to use (e.g. L<http://mirror.netcologne.de/cpan/>).
786
787 =item C<EXTRA_MODULES>
788
789 Additional modules installed during F<staticperl install>. Here you can
790 set which modules you want have to installed from CPAN.
791
792 Example: I really really need EV, AnyEvent, Coro and AnyEvent::AIO.
793
794 EXTRA_MODULES="EV AnyEvent Coro AnyEvent::AIO"
795
796 Note that you can also use a C<postinstall> hook to achieve this, and
797 more.
798
799 =back
800
801 =head4 Variables you might I<want> to override
802
803 =over 4
804
805 =item C<STATICPERL>
806
807 The directory where staticperl stores all its files
808 (default: F<~/.staticperl>).
809
810 =item C<PERL_MM_USE_DEFAULT>, C<EV_EXTRA_DEFS>, ...
811
812 Usually set to C<1> to make modules "less inquisitive" during their
813 installation, you can set any environment variable you want - some modules
814 (such as L<Coro> or L<EV>) use environment variables for further tweaking.
815
816 =item C<PERL_VERSION>
817
818 The perl version to install - default is currently C<5.12.2>, but C<5.8.9>
819 is also a good choice (5.8.9 is much smaller than 5.12.2, while 5.10.1 is
820 about as big as 5.12.2).
821
822 =item C<PERL_PREFIX>
823
824 The prefix where perl gets installed (default: F<$STATICPERL/perl>),
825 i.e. where the F<bin> and F<lib> subdirectories will end up.
826
827 =item C<PERL_CONFIGURE>
828
829 Additional Configure options - these are simply passed to the perl
830 Configure script. For example, if you wanted to enable dynamic loading,
831 you could pass C<-Dusedl>. To enable ithreads (Why would you want that
832 insanity? Don't! Use L<forks> instead!) you would pass C<-Duseithreads>
833 and so on.
834
835 More commonly, you would either activate 64 bit integer support
836 (C<-Duse64bitint>), or disable large files support (-Uuselargefiles), to
837 reduce filesize further.
838
839 =item C<PERL_CC>, C<PERL_CCFLAGS>, C<PERL_OPTIMIZE>, C<PERL_LDFLAGS>, C<PERL_LIBS>
840
841 These flags are passed to perl's F<Configure> script, and are generally
842 optimised for small size (at the cost of performance). Since they also
843 contain subtle workarounds around various build issues, changing these
844 usually requires understanding their default values - best look at
845 the top of the F<staticperl> script for more info on these, and use a
846 F<~/.staticperlrc> to override them.
847
848 Most of the variables override (or modify) the corresponding F<Configure>
849 variable, except C<PERL_CCFLAGS>, which gets appended.
850
851 =back
852
853 =head4 Variables you probably I<do not want> to override
854
855 =over 4
856
857 =item C<MAKE>
858
859 The make command to use - default is C<make>.
860
861 =item C<MKBUNDLE>
862
863 Where F<staticperl> writes the C<mkbundle> command to
864 (default: F<$STATICPERL/mkbundle>).
865
866 =item C<STATICPERL_MODULES>
867
868 Additional modules needed by C<mkbundle> - should therefore not be changed
869 unless you know what you are doing.
870
871 =back
872
873 =head3 OVERRIDABLE HOOKS
874
875 In addition to environment variables, it is possible to provide some
876 shell functions that are called at specific times. To provide your own
877 commands, just define the corresponding function.
878
879 Example: install extra modules from CPAN and from some directories
880 at F<staticperl install> time.
881
882 postinstall() {
883 rm -rf lib/threads* # weg mit Schaden
884 instcpan IO::AIO EV
885 instsrc ~/src/AnyEvent
886 instsrc ~/src/XML-Sablotron-1.0100001
887 instcpan Anyevent::AIO AnyEvent::HTTPD
888 }
889
890 =over 4
891
892 =item preconfigure
893
894 Called just before running F<./Configur> in the perl source
895 directory. Current working directory is the perl source directory.
896
897 This can be used to set any C<PERL_xxx> variables, which might be costly
898 to compute.
899
900 =item postconfigure
901
902 Called after configuring, but before building perl. Current working
903 directory is the perl source directory.
904
905 Could be used to tailor/patch config.sh (followed by F<sh Configure -S>)
906 or do any other modifications.
907
908 =item postbuild
909
910 Called after building, but before installing perl. Current working
911 directory is the perl source directory.
912
913 I have no clue what this could be used for - tell me.
914
915 =item postinstall
916
917 Called after perl and any extra modules have been installed in C<$PREFIX>,
918 but before setting the "installation O.K." flag.
919
920 The current working directory is C<$PREFIX>, but maybe you should not rely
921 on that.
922
923 This hook is most useful to customise the installation, by deleting files,
924 or installing extra modules using the C<instcpan> or C<instsrc> functions.
925
926 The script must return with a zero exit status, or the installation will
927 fail.
928
929 =back
930
931 =head1 ANATOMY OF A BUNDLE
932
933 When not building a new perl binary, C<mkbundle> will leave a number of
934 files in the current working directory, which can be used to embed a perl
935 interpreter in your program.
936
937 Intimate knowledge of L<perlembed> and preferably some experience with
938 embedding perl is highly recommended.
939
940 C<mkperl> (or the C<--perl> option) basically does this to link the new
941 interpreter (it also adds a main program to F<bundle.>):
942
943 $Config{cc} $(cat bundle.ccopts) -o perl bundle.c $(cat bundle.ldopts)
944
945 =over 4
946
947 =item bundle.h
948
949 A header file that contains the prototypes of the few symbols "exported"
950 by bundle.c, and also exposes the perl headers to the application.
951
952 =over 4
953
954 =item staticperl_init ()
955
956 Initialises the perl interpreter. You can use the normal perl functions
957 after calling this function, for example, to define extra functions or
958 to load a .pm file that contains some initialisation code, or the main
959 program function:
960
961 XS (xsfunction)
962 {
963 dXSARGS;
964
965 // now we have items, ST(i) etc.
966 }
967
968 static void
969 run_myapp(void)
970 {
971 staticperl_init ();
972 newXSproto ("myapp::xsfunction", xsfunction, __FILE__, "$$;$");
973 eval_pv ("require myapp::main", 1); // executes "myapp/main.pm"
974 }
975
976 =item staticperl_xs_init (pTHX)
977
978 Sometimes you need direct control over C<perl_parse> and C<perl_run>, in
979 which case you do not want to use C<staticperl_init> but call them on your
980 own.
981
982 Then you need this function - either pass it directly as the C<xs_init>
983 function to C<perl_parse>, or call it from your own C<xs_init> function.
984
985 =item staticperl_cleanup ()
986
987 In the unlikely case that you want to destroy the perl interpreter, here
988 is the corresponding function.
989
990 =item PerlInterpreter *staticperl
991
992 The perl interpreter pointer used by staticperl. Not normally so useful,
993 but there it is.
994
995 =back
996
997 =item bundle.ccopts
998
999 Contains the compiler options required to compile at least F<bundle.c> and
1000 any file that includes F<bundle.h> - you should probably use it in your
1001 C<CFLAGS>.
1002
1003 =item bundle.ldopts
1004
1005 The linker options needed to link the final program.
1006
1007 =back
1008
1009 =head1 RUNTIME FUNCTIONALITY
1010
1011 Binaries created with C<mkbundle>/C<mkperl> contain extra functions, which
1012 are required to access the bundled perl sources, but might be useful for
1013 other purposes.
1014
1015 In addition, for the embedded loading of perl files to work, F<staticperl>
1016 overrides the C<@INC> array.
1017
1018 =over 4
1019
1020 =item $file = staticperl::find $path
1021
1022 Returns the data associated with the given C<$path>
1023 (e.g. C<Digest/MD5.pm>, C<auto/POSIX/autosplit.ix>), which is basically
1024 the UNIX path relative to the perl library directory.
1025
1026 Returns C<undef> if the file isn't embedded.
1027
1028 =item @paths = staticperl::list
1029
1030 Returns the list of all paths embedded in this binary.
1031
1032 =back
1033
1034 =head1 FULLY STATIC BINARIES - BUILDROOT
1035
1036 To make truly static (Linux-) libraries, you might want to have a look at
1037 buildroot (L<http://buildroot.uclibc.org/>).
1038
1039 Buildroot is primarily meant to set up a cross-compile environment (which
1040 is not so useful as perl doesn't quite like cross compiles), but it can also compile
1041 a chroot environment where you can use F<staticperl>.
1042
1043 To do so, download buildroot, and enable "Build options => development
1044 files in target filesystem" and optionally "Build options => gcc
1045 optimization level (optimize for size)". At the time of writing, I had
1046 good experiences with GCC 4.4.x but not GCC 4.5.
1047
1048 To minimise code size, I used C<-pipe -ffunction-sections -fdata-sections
1049 -finline-limit=8 -fno-builtin-strlen -mtune=i386>. The C<-mtune=i386>
1050 doesn't decrease codesize much, but it makes the file much more
1051 compressible.
1052
1053 If you don't need Coro or threads, you can go with "linuxthreads.old" (or
1054 no thread support). For Coro, it is highly recommended to switch to a
1055 uClibc newer than 0.9.31 (at the time of this writing, I used the 20101201
1056 snapshot) and enable NPTL, otherwise Coro needs to be configured with the
1057 ultra-slow pthreads backend to work around linuxthreads bugs (it also uses
1058 twice the address space needed for stacks).
1059
1060 If you use C<linuxthreads.old>, then you should also be aware that
1061 uClibc shares C<errno> between all threads when statically linking. See
1062 L<http://lists.uclibc.org/pipermail/uclibc/2010-June/044157.html> for a
1063 workaround (And L<https://bugs.uclibc.org/2089> for discussion).
1064
1065 C<ccache> support is also recommended, especially if you want
1066 to play around with buildroot options. Enabling the C<miniperl>
1067 package will probably enable all options required for a successful
1068 perl build. F<staticperl> itself additionally needs either C<wget>
1069 (recommended, for CPAN) or C<curl>.
1070
1071 As for shells, busybox should provide all that is needed, but the default
1072 busybox configuration doesn't include F<comm> which is needed by perl -
1073 either make a custom busybox config, or compile coreutils.
1074
1075 For the latter route, you might find that bash has some bugs that keep
1076 it from working properly in a chroot - either use dash (and link it to
1077 F</bin/sh> inside the chroot) or link busybox to F</bin/sh>, using it's
1078 built-in ash shell.
1079
1080 Finally, you need F</dev/null> inside the chroot for many scripts to work
1081 - F<cp /dev/null output/target/dev> or bind-mounting your F</dev> will
1082 both provide this.
1083
1084 After you have compiled and set up your buildroot target, you can copy
1085 F<staticperl> from the C<App::Staticperl> distribution or from your
1086 perl f<bin> directory (if you installed it) into the F<output/target>
1087 filesystem, chroot inside and run it.
1088
1089 =head1 RECIPES / SPECIFIC MODULES
1090
1091 This section contains some common(?) recipes and information about
1092 problems with some common modules or perl constructs that require extra
1093 files to be included.
1094
1095 =head2 MODULES
1096
1097 =over 4
1098
1099 =item utf8
1100
1101 Some functionality in the utf8 module, such as swash handling (used
1102 for unicode character ranges in regexes) is implemented in the
1103 C<"utf8_heavy.pl"> library:
1104
1105 -M'"utf8_heavy.pl"'
1106
1107 Many Unicode properties in turn are defined in separate modules,
1108 such as C<"unicore/Heavy.pl"> and more specific data tables such as
1109 C<"unicore/To/Digit.pl"> or C<"unicore/lib/Perl/Word.pl">. These tables
1110 are big (7MB uncompressed, although F<staticperl> contains special
1111 handling for those files), so including them on demand by your application
1112 only might pay off.
1113
1114 To simply include the whole unicode database, use:
1115
1116 --incglob '/unicore/*.pl'
1117
1118 =item AnyEvent
1119
1120 AnyEvent needs a backend implementation that it will load in a delayed
1121 fashion. The L<AnyEvent::Impl::Perl> backend is the default choice
1122 for AnyEvent if it can't find anything else, and is usually a safe
1123 fallback. If you plan to use e.g. L<EV> (L<POE>...), then you need to
1124 include the L<AnyEvent::Impl::EV> (L<AnyEvent::Impl::POE>...) backend as
1125 well.
1126
1127 If you want to handle IRIs or IDNs (L<AnyEvent::Util> punycode and idn
1128 functions), you also need to include C<"AnyEvent/Util/idna.pl"> and
1129 C<"AnyEvent/Util/uts46data.pl">.
1130
1131 Or you can use C<--usepacklist> and specify C<-MAnyEvent> to include
1132 everything.
1133
1134 =item Carp
1135
1136 Carp had (in older versions of perl) a dependency on L<Carp::Heavy>. As of
1137 perl 5.12.2 (maybe earlier), this dependency no longer exists.
1138
1139 =item Config
1140
1141 The F<perl -V> switch (as well as many modules) needs L<Config>, which in
1142 turn might need L<"Config_heavy.pl">. Including the latter gives you
1143 both.
1144
1145 =item Term::ReadLine::Perl
1146
1147 Also needs L<Term::ReadLine::readline>, or C<--usepacklist>.
1148
1149 =item URI
1150
1151 URI implements schemes as separate modules - the generic URL scheme is
1152 implemented in L<URI::_generic>, HTTP is implemented in L<URI::http>. If
1153 you need to use any of these schemes, you should include these manually,
1154 or use C<--usepacklist>.
1155
1156 =back
1157
1158 =head2 RECIPES
1159
1160 =over 4
1161
1162 =item Linking everything in
1163
1164 To link just about everything installed in the perl library into a new
1165 perl, try this:
1166
1167 staticperl mkperl --strip ppi --incglob '*'
1168
1169 =item Getting rid of netdb function
1170
1171 The perl core has lots of netdb functions (C<getnetbyname>, C<getgrent>
1172 and so on) that few applications use. You can avoid compiling them in by
1173 putting the following fragment into a C<preconfigure> hook:
1174
1175 preconfigure() {
1176 for sym in \
1177 d_getgrnam_r d_endgrent d_endgrent_r d_endhent \
1178 d_endhostent_r d_endnent d_endnetent_r d_endpent \
1179 d_endprotoent_r d_endpwent d_endpwent_r d_endsent \
1180 d_endservent_r d_getgrent d_getgrent_r d_getgrgid_r \
1181 d_getgrnam_r d_gethbyaddr d_gethent d_getsbyport \
1182 d_gethostbyaddr_r d_gethostbyname_r d_gethostent_r \
1183 d_getlogin_r d_getnbyaddr d_getnbyname d_getnent \
1184 d_getnetbyaddr_r d_getnetbyname_r d_getnetent_r \
1185 d_getpent d_getpbyname d_getpbynumber d_getprotobyname_r \
1186 d_getprotobynumber_r d_getprotoent_r d_getpwent \
1187 d_getpwent_r d_getpwnam_r d_getpwuid_r d_getsent \
1188 d_getservbyname_r d_getservbyport_r d_getservent_r \
1189 d_getspnam_r d_getsbyname
1190 # d_gethbyname
1191 do
1192 PERL_CONFIGURE="$PERL_CONFIGURE -U$sym"
1193 done
1194 }
1195
1196 This mostly gains space when linking staticaly, as the functions will
1197 likely not be linked in. The gain for dynamically-linked binaries is
1198 smaller.
1199
1200 Also, this leaves C<gethostbyname> in - not only is it actually used
1201 often, the L<Socket> module also exposes it, so leaving it out usually
1202 gains little. Why Socket exposes a C function that is in the core already
1203 is anybody's guess.
1204
1205 =back
1206
1207 =head1 AUTHOR
1208
1209 Marc Lehmann <schmorp@schmorp.de>
1210 http://software.schmorp.de/pkg/staticperl.html