ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/App-Staticperl/staticperl.pod
Revision: 1.20
Committed: Fri Dec 10 20:29:17 2010 UTC (13 years, 5 months ago) by root
Branch: MAIN
CVS Tags: rel-0_91, rel-0_911
Changes since 1.19: +45 -9 lines
Log Message:
*** empty log message ***

File Contents

# Content
1 =head1 NAME
2
3 staticperl - perl, libc, 100 modules, all in one 500kb file
4
5 =head1 SYNOPSIS
6
7 staticperl help # print the embedded documentation
8 staticperl fetch # fetch and unpack perl sources
9 staticperl configure # fetch and then configure perl
10 staticperl build # configure and then build perl
11 staticperl install # build and then install perl
12 staticperl clean # clean most intermediate files (restart at configure)
13 staticperl distclean # delete everything installed by this script
14 staticperl cpan # invoke CPAN shell
15 staticperl instmod path... # install unpacked modules
16 staticperl instcpan modulename... # install modules from CPAN
17 staticperl mkbundle <bundle-args...> # see documentation
18 staticperl mkperl <bundle-args...> # see documentation
19 staticperl mkapp appname <bundle-args...> # see documentation
20
21 Typical Examples:
22
23 staticperl install # fetch, configure, build and install perl
24 staticperl cpan # run interactive cpan shell
25 staticperl mkperl -M '"Config_heavy.pl"' # build a perl that supports -V
26 staticperl mkperl -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI -MURI::http
27 # build a perl with the above modules linked in
28 staticperl mkapp myapp --boot mainprog mymodules
29 # build a binary "myapp" from mainprog and mymodules
30
31 =head1 DESCRIPTION
32
33 This script helps you to create single-file perl interpreters
34 or applications, or embedding a perl interpreter in your
35 applications. Single-file means that it is fully self-contained - no
36 separate shared objects, no autoload fragments, no .pm or .pl files are
37 needed. And when linking statically, you can create (or embed) a single
38 file that contains perl interpreter, libc, all the modules you need, all
39 the libraries you need and of course your actual program.
40
41 With F<uClibc> and F<upx> on x86, you can create a single 500kb binary
42 that contains perl and 100 modules such as POSIX, AnyEvent, EV, IO::AIO,
43 Coro and so on. Or any other choice of modules.
44
45 To see how this turns out, you can try out smallperl and bigperl, two
46 pre-built static and compressed perl binaries with many and even more
47 modules: just follow the links at L<http://staticperl.schmorp.de/>.
48
49 The created files do not need write access to the file system (like PAR
50 does). In fact, since this script is in many ways similar to PAR::Packer,
51 here are the differences:
52
53 =over 4
54
55 =item * The generated executables are much smaller than PAR created ones.
56
57 Shared objects and the perl binary contain a lot of extra info, while
58 the static nature of F<staticperl> allows the linker to remove all
59 functionality and meta-info not required by the final executable. Even
60 extensions statically compiled into perl at build time will only be
61 present in the final executable when needed.
62
63 In addition, F<staticperl> can strip perl sources much more effectively
64 than PAR.
65
66 =item * The generated executables start much faster.
67
68 There is no need to unpack files, or even to parse Zip archives (which is
69 slow and memory-consuming business).
70
71 =item * The generated executables don't need a writable filesystem.
72
73 F<staticperl> loads all required files directly from memory. There is no
74 need to unpack files into a temporary directory.
75
76 =item * More control over included files, more burden.
77
78 PAR tries to be maintenance and hassle-free - it tries to include more
79 files than necessary to make sure everything works out of the box. It
80 mostly succeeds at this, but he extra files (such as the unicode database)
81 can take substantial amounts of memory and file size.
82
83 With F<staticperl>, the burden is mostly with the developer - only direct
84 compile-time dependencies and L<AutoLoader> are handled automatically.
85 This means the modules to include often need to be tweaked manually.
86
87 All this does not preclude more permissive modes to be implemented in
88 the future, but right now, you have to resolve state hidden dependencies
89 manually.
90
91 =item * PAR works out of the box, F<staticperl> does not.
92
93 Maintaining your own custom perl build can be a pain in the ass, and while
94 F<staticperl> tries to make this easy, it still requires a custom perl
95 build and possibly fiddling with some modules. PAR is likely to produce
96 results faster.
97
98 Ok, PAR never has worked for me out of the box, and for some people,
99 F<staticperl> does work out of the box, as they don't count "fiddling with
100 module use lists" against it, but nevertheless, F<staticperl> is certainly
101 a bit more difficult to use.
102
103 =back
104
105 =head1 HOW DOES IT WORK?
106
107 Simple: F<staticperl> downloads, compile and installs a perl version of
108 your choice in F<~/.staticperl>. You can add extra modules either by
109 letting F<staticperl> install them for you automatically, or by using CPAN
110 and doing it interactively. This usually takes 5-10 minutes, depending on
111 the speed of your computer and your internet connection.
112
113 It is possible to do program development at this stage, too.
114
115 Afterwards, you create a list of files and modules you want to include,
116 and then either build a new perl binary (that acts just like a normal perl
117 except everything is compiled in), or you create bundle files (basically C
118 sources you can use to embed all files into your project).
119
120 This step is very fast (a few seconds if PPI is not used for stripping, or
121 the stripped files are in the cache), and can be tweaked and repeated as
122 often as necessary.
123
124 =head1 THE F<STATICPERL> SCRIPT
125
126 This module installs a script called F<staticperl> into your perl
127 binary directory. The script is fully self-contained, and can be used
128 without perl (for example, in an uClibc chroot environment). In fact,
129 it can be extracted from the C<App::Staticperl> distribution tarball as
130 F<bin/staticperl>, without any installation.
131
132 F<staticperl> interprets the first argument as a command to execute,
133 optionally followed by any parameters.
134
135 There are two command categories: the "phase 1" commands which deal with
136 installing perl and perl modules, and the "phase 2" commands, which deal
137 with creating binaries and bundle files.
138
139 =head2 PHASE 1 COMMANDS: INSTALLING PERL
140
141 The most important command is F<install>, which does basically
142 everything. The default is to download and install perl 5.12.2 and a few
143 modules required by F<staticperl> itself, but all this can (and should) be
144 changed - see L<CONFIGURATION>, below.
145
146 The command
147
148 staticperl install
149
150 Is normally all you need: It installs the perl interpreter in
151 F<~/.staticperl/perl>. It downloads, configures, builds and installs the
152 perl interpreter if required.
153
154 Most of the following commands simply run one or more steps of this
155 sequence.
156
157 To force recompilation or reinstallation, you need to run F<staticperl
158 distclean> first.
159
160 =over 4
161
162 =item F<staticperl version>
163
164 Prints some info about the version of the F<staticperl> script you are using.
165
166 =item F<staticperl fetch>
167
168 Runs only the download and unpack phase, unless this has already happened.
169
170 =item F<staticperl configure>
171
172 Configures the unpacked perl sources, potentially after downloading them first.
173
174 =item F<staticperl build>
175
176 Builds the configured perl sources, potentially after automatically
177 configuring them.
178
179 =item F<staticperl install>
180
181 Wipes the perl installation directory (usually F<~/.staticperl/perl>) and
182 installs the perl distribution, potentially after building it first.
183
184 =item F<staticperl cpan> [args...]
185
186 Starts an interactive CPAN shell that you can use to install further
187 modules. Installs the perl first if necessary, but apart from that,
188 no magic is involved: you could just as well run it manually via
189 F<~/.staticperl/perl/bin/cpan>.
190
191 Any additional arguments are simply passed to the F<cpan> command.
192
193 =item F<staticperl instcpan> module...
194
195 Tries to install all the modules given and their dependencies, using CPAN.
196
197 Example:
198
199 staticperl instcpan EV AnyEvent::HTTPD Coro
200
201 =item F<staticperl instsrc> directory...
202
203 In the unlikely case that you have unpacked perl modules around and want
204 to install from these instead of from CPAN, you can do this using this
205 command by specifying all the directories with modules in them that you
206 want to have built.
207
208 =item F<staticperl clean>
209
210 Deletes the perl source directory (and potentially cleans up other
211 intermediate files). This can be used to clean up files only needed for
212 building perl, without removing the installed perl interpreter, or to
213 force a re-build from scratch.
214
215 At the moment, it doesn't delete downloaded tarballs.
216
217 =item F<staticperl distclean>
218
219 This wipes your complete F<~/.staticperl> directory. Be careful with this,
220 it nukes your perl download, perl sources, perl distribution and any
221 installed modules. It is useful if you wish to start over "from scratch"
222 or when you want to uninstall F<staticperl>.
223
224 =back
225
226 =head2 PHASE 2 COMMANDS: BUILDING PERL BUNDLES
227
228 Building (linking) a new F<perl> binary is handled by a separate
229 script. To make it easy to use F<staticperl> from a F<chroot>, the script
230 is embedded into F<staticperl>, which will write it out and call for you
231 with any arguments you pass:
232
233 staticperl mkbundle mkbundle-args...
234
235 In the oh so unlikely case of something not working here, you
236 can run the script manually as well (by default it is written to
237 F<~/.staticperl/mkbundle>).
238
239 F<mkbundle> is a more conventional command and expect the argument
240 syntax commonly used on UNIX clones. For example, this command builds
241 a new F<perl> binary and includes F<Config.pm> (for F<perl -V>),
242 F<AnyEvent::HTTPD>, F<URI> and a custom F<httpd> script (from F<eg/httpd>
243 in this distribution):
244
245 # first make sure we have perl and the required modules
246 staticperl instcpan AnyEvent::HTTPD
247
248 # now build the perl
249 staticperl mkperl -M'"Config_heavy.pl"' -MAnyEvent::Impl::Perl \
250 -MAnyEvent::HTTPD -MURI::http \
251 --add 'eg/httpd httpd.pm'
252
253 # finally, invoke it
254 ./perl -Mhttpd
255
256 As you can see, things are not quite as trivial: the L<Config> module has
257 a hidden dependency which is not even a perl module (F<Config_heavy.pl>),
258 L<AnyEvent> needs at least one event loop backend that we have to
259 specify manually (here L<AnyEvent::Impl::Perl>), and the F<URI> module
260 (required by L<AnyEvent::HTTPD>) implements various URI schemes as extra
261 modules - since L<AnyEvent::HTTPD> only needs C<http> URIs, we only need
262 to include that module. I found out about these dependencies by carefully
263 watching any error messages about missing modules...
264
265 Instead of building a new perl binary, you can also build a standalone
266 application:
267
268 # build the app
269 staticperl mkapp app --boot eg/httpd \
270 -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI::http
271
272 # run it
273 ./app
274
275 =head3 OPTION PROCESSING
276
277 All options can be given as arguments on the command line (typically
278 using long (e.g. C<--verbose>) or short option (e.g. C<-v>) style). Since
279 specifying a lot of modules can make the command line very cumbersome,
280 you can put all long options into a "bundle specification file" (with or
281 without C<--> prefix) and specify this bundle file instead.
282
283 For example, the command given earlier could also look like this:
284
285 staticperl mkperl httpd.bundle
286
287 And all options could be in F<httpd.bundle>:
288
289 use "Config_heavy.pl"
290 use AnyEvent::Impl::Perl
291 use AnyEvent::HTTPD
292 use URI::http
293 add eg/httpd httpd.pm
294
295 All options that specify modules or files to be added are processed in the
296 order given on the command line (that affects the C<--use> and C<--eval>
297 options at the moment).
298
299 =head3 PACKAGE SELECTION WORKFLOW
300
301 F<staticperl mkbundle> has a number of options to control package
302 selection. This section describes how they interact with each other. Also,
303 since I am still a newbie w.r.t. these issues, maybe future versions of
304 F<staticperl> will change this, so watch out :)
305
306 The idiom "in order" means "in order that they are specified on the
307 commandline". If you use a bundle specification file, then the options
308 will be processed as if they were given in place of the bundle file name.
309
310 =over 4
311
312 =item 1. apply all C<--use>, C<--eval>, C<--add>, C<--addbin> and
313 C<--incglob> options, in order.
314
315 In addition, C<--use> and C<--eval> dependencies will be added when the
316 options are processed.
317
318 =item 2. apply all C<--include> and C<--exclude> options, in order.
319
320 All this step does is potentially reduce the number of files already
321 selected or found in phase 1.
322
323 =item 3. find all modules (== F<.pm> files), gather their static archives
324 (F<.a>) and AutoLoader splitfiles (F<.ix> and F<.al> files), find any
325 extra libraries they need for linking (F<extralibs.ld>) and optionally
326 evaluate any F<.packlist> files.
327
328 This step is required to link against XS extensions and also adds files
329 required for L<AutoLoader> to do it's job.
330
331 =back
332
333 After this, all the files selected for bundling will be read and processed
334 (stripped), the bundle files will be written, and optionally a new F<perl>
335 or application binary will be linked.
336
337 =head3 MKBUNDLE OPTIONS
338
339 =over 4
340
341 =item --verbose | -v
342
343 Increases the verbosity level by one (the default is C<1>).
344
345 =item --quiet | -q
346
347 Decreases the verbosity level by one.
348
349 =item --strip none|pod|ppi
350
351 Specify the stripping method applied to reduce the file of the perl
352 sources included.
353
354 The default is C<pod>, which uses the L<Pod::Strip> module to remove all
355 pod documentation, which is very fast and reduces file size a lot.
356
357 The C<ppi> method uses L<PPI> to parse and condense the perl sources. This
358 saves a lot more than just L<Pod::Strip>, and is generally safer,
359 but is also a lot slower (some files take almost a minute to strip -
360 F<staticperl> maintains a cache of stripped files to speed up subsequent
361 runs for this reason). Note that this method doesn't optimise for raw file
362 size, but for best compression (that means that the uncompressed file size
363 is a bit larger, but the files compress better, e.g. with F<upx>).
364
365 Last not least, if you need accurate line numbers in error messages,
366 or in the unlikely case where C<pod> is too slow, or some module gets
367 mistreated, you can specify C<none> to not mangle included perl sources in
368 any way.
369
370 =item --perl
371
372 After writing out the bundle files, try to link a new perl interpreter. It
373 will be called F<perl> and will be left in the current working
374 directory. The bundle files will be removed.
375
376 This switch is automatically used when F<staticperl> is invoked with the
377 C<mkperl> command (instead of C<mkbundle>):
378
379 # build a new ./perl with only common::sense in it - very small :)
380 staticperl mkperl -Mcommon::sense
381
382 =item --app name
383
384 After writing out the bundle files, try to link a new standalone
385 program. It will be called C<name>, and the bundle files get removed after
386 linking it.
387
388 The difference to the (mutually exclusive) C<--perl> option is that the
389 binary created by this option will not try to act as a perl interpreter -
390 instead it will simply initialise the perl interpreter, clean it up and
391 exit.
392
393 This switch is automatically used when F<staticperl> is invoked with the
394 C<mkapp> command (instead of C<mkbundle>):
395
396 To let it do something useful you I<must> add some boot code, e.g. with
397 the C<--boot> option.
398
399 Example: create a standalone perl binary that will execute F<appfile> when
400 it is started.
401
402 staticperl mkbundle --app myexe --boot appfile
403
404 =item --use module | -Mmodule
405
406 Include the named module and all direct dependencies. This is done by
407 C<require>'ing the module in a subprocess and tracing which other modules
408 and files it actually loads. If the module uses L<AutoLoader>, then all
409 splitfiles will be included as well.
410
411 Example: include AnyEvent and AnyEvent::Impl::Perl.
412
413 staticperl mkbundle --use AnyEvent --use AnyEvent::Impl::Perl
414
415 Sometimes you want to load old-style "perl libraries" (F<.pl> files), or
416 maybe other weirdly named files. To do that, you need to quote the name in
417 single or double quotes. When given on the command line, you probably need
418 to quote once more to avoid your shell interpreting it. Common cases that
419 need this are F<Config_heavy.pl> and F<utf8_heavy.pl>.
420
421 Example: include the required files for F<perl -V> to work in all its
422 glory (F<Config.pm> is included automatically by this).
423
424 # bourne shell
425 staticperl mkbundle --use '"Config_heavy.pl"'
426
427 # bundle specification file
428 use "Config_heavy.pl"
429
430 The C<-Mmodule> syntax is included as an alias that might be easier to
431 remember than C<use>. Or maybe it confuses people. Time will tell. Or
432 maybe not. Argh.
433
434 =item --eval "perl code" | -e "perl code"
435
436 Sometimes it is easier (or necessary) to specify dependencies using perl
437 code, or maybe one of the modules you use need a special use statement. In
438 that case, you can use C<eval> to execute some perl snippet or set some
439 variables or whatever you need. All files C<require>'d or C<use>'d in the
440 script are included in the final bundle.
441
442 Keep in mind that F<mkbundle> will only C<require> the modules named
443 by the C<--use> option, so do not expect the symbols from modules you
444 C<--use>'d earlier on the command line to be available.
445
446 Example: force L<AnyEvent> to detect a backend and therefore include it
447 in the final bundle.
448
449 staticperl mkbundle --eval 'use AnyEvent; AnyEvent::detect'
450
451 # or like this
452 staticperl mkbundle -MAnyEvent --eval 'use AnyEvent; AnyEvent::detect'
453
454 Example: use a separate "bootstrap" script that C<use>'s lots of modules
455 and include this in the final bundle, to be executed automatically.
456
457 staticperl mkbundle --eval 'do "bootstrap"' --boot bootstrap
458
459 =item --boot filename
460
461 Include the given file in the bundle and arrange for it to be executed
462 (using a C<require>) before anything else when the new perl is
463 initialised. This can be used to modify C<@INC> or anything else before
464 the perl interpreter executes scripts given on the command line (or via
465 C<-e>). This works even in an embedded interpreter.
466
467 =item --usepacklist
468
469 Read F<.packlist> files for each distribution that happens to match a
470 module name you specified. Sounds weird, and it is, so expect semantics to
471 change somehow in the future.
472
473 The idea is that most CPAN distributions have a F<.pm> file that matches
474 the name of the distribution (which is rather reasonable after all).
475
476 If this switch is enabled, then if any of the F<.pm> files that have been
477 selected match an install distribution, then all F<.pm>, F<.pl>, F<.al>
478 and F<.ix> files installed by this distribution are also included.
479
480 For example, using this switch, when the L<URI> module is specified, then
481 all L<URI> submodules that have been installed via the CPAN distribution
482 are included as well, so you don't have to manually specify them.
483
484 =item --incglob pattern
485
486 This goes through all library directories and tries to match any F<.pm>
487 and F<.pl> files against the extended glob pattern (see below). If a file
488 matches, it is added. This switch will automatically detect L<AutoLoader>
489 files and the required link libraries for XS modules, but it will I<not>
490 scan the file for dependencies (at the moment).
491
492 This is mainly useful to include "everything":
493
494 --incglob '*'
495
496 Or to include perl libraries, or trees of those, such as the unicode
497 database files needed by many other modules:
498
499 --incglob '/unicore/**.pl'
500
501 =item --add file | --add "file alias"
502
503 Adds the given (perl) file into the bundle (and optionally call it
504 "alias"). This is useful to include any custom files into the bundle.
505
506 Example: embed the file F<httpd> as F<httpd.pm> when creating the bundle.
507
508 staticperl mkperl --add "httpd httpd.pm"
509
510 It is also a great way to add any custom modules:
511
512 # specification file
513 add file1 myfiles/file1
514 add file2 myfiles/file2
515 add file3 myfiles/file3
516
517 =item --binadd file | --add "file alias"
518
519 Just like C<--add>, except that it treats the file as binary and adds it
520 without any processing.
521
522 You should probably add a C</> prefix to avoid clashing with embedded
523 perl files (whose paths do not start with C</>), and/or use a special
524 directory, such as C</res/name>.
525
526 You can later get a copy of these files by calling C<staticperl::find
527 "alias">.
528
529 =item --include pattern | -i pattern | --exclude pattern | -x pattern
530
531 These two options define an include/exclude filter that is used after all
532 files selected by the other options have been found. Each include/exclude
533 is applied to all files found so far - an include makes sure that the
534 given files will be part of the resulting file set, an exclude will
535 exclude files. The patterns are "extended glob patterns" (see below).
536
537 For example, to include everything, except C<Devel> modules, but still
538 include F<Devel::PPPort>, you could use this:
539
540 --incglob '*' -i '/Devel/PPPort.pm' -x '/Devel/**'
541
542 =item --static
543
544 When C<--perl> is also given, link statically instead of dynamically. The
545 default is to link the new perl interpreter fully dynamic (that means all
546 perl modules are linked statically, but all external libraries are still
547 referenced dynamically).
548
549 Keep in mind that Solaris doesn't support static linking at all, and
550 systems based on GNU libc don't really support it in a usable fashion
551 either. Try uClibc if you want to create fully statically linked
552 executables, or try the C<--staticlibs> option to link only some libraries
553 statically.
554
555 =item --staticlib libname
556
557 When not linking fully statically, this option allows you to link specific
558 libraries statically. What it does is simply replace all occurances of
559 C<-llibname> with the GCC-specific C<-Wl,-Bstatic -llibname -Wl,-Bdynamic>
560 option.
561
562 This will have no effect unless the library is actually linked against,
563 specifically, C<--staticlib> will not link against the named library
564 unless it would be linked against anyway.
565
566 Example: link libcrypt statically into the binary.
567
568 staticperl mkperl -MIO::AIO --staticlib crypt
569
570 # ldopts might nwo contain:
571 # -lm -Wl,-Bstatic -lcrypt -Wl,-Bdynamic -lpthread
572
573 =item any other argument
574
575 Any other argument is interpreted as a bundle specification file, which
576 supports most long options (without extra quoting), one option per line.
577
578 =back
579
580 =head3 EXTENDED GLOB PATTERNS
581
582 Some options of F<staticperl mkbundle> expect an I<extended glob
583 pattern>. This is neither a normal shell glob nor a regex, but something
584 in between. The idea has been copied from rsync, and there are the current
585 matching rules:
586
587 =over 4
588
589 =item Patterns starting with F</> will be a anchored at the root of the library tree.
590
591 That is, F</unicore> will match the F<unicore> directory in C<@INC>, but
592 nothing inside, and neither any other file or directory called F<unicore>
593 anywhere else in the hierarchy.
594
595 =item Patterns not starting with F</> will be anchored at the end of the path.
596
597 That is, F<idna.pl> will match any file called F<idna.pl> anywhere in the
598 hierarchy, but not any directories of the same name.
599
600 =item A F<*> matches any single component.
601
602 That is, F</unicore/*.pl> would match all F<.pl> files directly inside
603 C</unicore>, not any deeper level F<.pl> files. Or in other words, F<*>
604 will not match slashes.
605
606 =item A F<**> matches anything.
607
608 That is, F</unicore/**.pl> would match all F<.pl> files under F</unicore>,
609 no matter how deeply nested they are inside subdirectories.
610
611 =item A F<?> matches a single character within a component.
612
613 That is, F</Encode/??.pm> matches F</Encode/JP.pm>, but not the
614 hypothetical F</Encode/J/.pm>, as F<?> does not match F</>.
615
616 =back
617
618 =head2 F<STATICPERL> CONFIGURATION AND HOOKS
619
620 During (each) startup, F<staticperl> tries to source some shell files to
621 allow you to fine-tune/override configuration settings.
622
623 In them you can override shell variables, or define shell functions
624 ("hooks") to be called at specific phases during installation. For
625 example, you could define a C<postinstall> hook to install additional
626 modules from CPAN each time you start from scratch.
627
628 If the env variable C<$STATICPERLRC> is set, then F<staticperl> will try
629 to source the file named with it only. Otherwise, it tries the following
630 shell files in order:
631
632 /etc/staticperlrc
633 ~/.staticperlrc
634 $STATICPERL/rc
635
636 Note that the last file is erased during F<staticperl distclean>, so
637 generally should not be used.
638
639 =head3 CONFIGURATION VARIABLES
640
641 =head4 Variables you I<should> override
642
643 =over 4
644
645 =item C<EMAIL>
646
647 The e-mail address of the person who built this binary. Has no good
648 default, so should be specified by you.
649
650 =item C<CPAN>
651
652 The URL of the CPAN mirror to use (e.g. L<http://mirror.netcologne.de/cpan/>).
653
654 =item C<EXTRA_MODULES>
655
656 Additional modules installed during F<staticperl install>. Here you can
657 set which modules you want have to installed from CPAN.
658
659 Example: I really really need EV, AnyEvent, Coro and AnyEvent::AIO.
660
661 EXTRA_MODULES="EV AnyEvent Coro AnyEvent::AIO"
662
663 Note that you can also use a C<postinstall> hook to achieve this, and
664 more.
665
666 =back
667
668 =head4 Variables you might I<want> to override
669
670 =over 4
671
672 =item C<STATICPERL>
673
674 The directory where staticperl stores all its files
675 (default: F<~/.staticperl>).
676
677 =item C<PERL_MM_USE_DEFAULT>, C<EV_EXTRA_DEFS>, ...
678
679 Usually set to C<1> to make modules "less inquisitive" during their
680 installation, you can set any environment variable you want - some modules
681 (such as L<Coro> or L<EV>) use environment variables for further tweaking.
682
683 =item C<PERL_VERSION>
684
685 The perl version to install - default is currently C<5.12.2>, but C<5.8.9>
686 is also a good choice (5.8.9 is much smaller than 5.12.2, while 5.10.1 is
687 about as big as 5.12.2).
688
689 =item C<PERL_PREFIX>
690
691 The prefix where perl gets installed (default: F<$STATICPERL/perl>),
692 i.e. where the F<bin> and F<lib> subdirectories will end up.
693
694 =item C<PERL_CONFIGURE>
695
696 Additional Configure options - these are simply passed to the perl
697 Configure script. For example, if you wanted to enable dynamic loading,
698 you could pass C<-Dusedl>. To enable ithreads (Why would you want that
699 insanity? Don't! Use L<forks> instead!) you would pass C<-Duseithreads>
700 and so on.
701
702 More commonly, you would either activate 64 bit integer support
703 (C<-Duse64bitint>), or disable large files support (-Uuselargefiles), to
704 reduce filesize further.
705
706 =item C<PERL_CPPFLAGS>, C<PERL_OPTIMIZE>, C<PERL_LDFLAGS>, C<PERL_LIBS>
707
708 These flags are passed to perl's F<Configure> script, and are generally
709 optimised for small size (at the cost of performance). Since they also
710 contain subtle workarounds around various build issues, changing these
711 usually requires understanding their default values - best look at the top
712 of the F<staticperl> script for more info on these.
713
714 =back
715
716 =head4 Variables you probably I<do not want> to override
717
718 =over 4
719
720 =item C<MKBUNDLE>
721
722 Where F<staticperl> writes the C<mkbundle> command to
723 (default: F<$STATICPERL/mkbundle>).
724
725 =item C<STATICPERL_MODULES>
726
727 Additional modules needed by C<mkbundle> - should therefore not be changed
728 unless you know what you are doing.
729
730 =back
731
732 =head3 OVERRIDABLE HOOKS
733
734 In addition to environment variables, it is possible to provide some
735 shell functions that are called at specific times. To provide your own
736 commands, just define the corresponding function.
737
738 Example: install extra modules from CPAN and from some directories
739 at F<staticperl install> time.
740
741 postinstall() {
742 rm -rf lib/threads* # weg mit Schaden
743 instcpan IO::AIO EV
744 instsrc ~/src/AnyEvent
745 instsrc ~/src/XML-Sablotron-1.0100001
746 instcpan Anyevent::AIO AnyEvent::HTTPD
747 }
748
749 =over 4
750
751 =item preconfigure
752
753 Called just before running F<./Configur> in the perl source
754 directory. Current working directory is the perl source directory.
755
756 This can be used to set any C<PERL_xxx> variables, which might be costly
757 to compute.
758
759 =item postconfigure
760
761 Called after configuring, but before building perl. Current working
762 directory is the perl source directory.
763
764 Could be used to tailor/patch config.sh (followed by F<sh Configure -S>)
765 or do any other modifications.
766
767 =item postbuild
768
769 Called after building, but before installing perl. Current working
770 directory is the perl source directory.
771
772 I have no clue what this could be used for - tell me.
773
774 =item postinstall
775
776 Called after perl and any extra modules have been installed in C<$PREFIX>,
777 but before setting the "installation O.K." flag.
778
779 The current working directory is C<$PREFIX>, but maybe you should not rely
780 on that.
781
782 This hook is most useful to customise the installation, by deleting files,
783 or installing extra modules using the C<instcpan> or C<instsrc> functions.
784
785 The script must return with a zero exit status, or the installation will
786 fail.
787
788 =back
789
790 =head1 ANATOMY OF A BUNDLE
791
792 When not building a new perl binary, C<mkbundle> will leave a number of
793 files in the current working directory, which can be used to embed a perl
794 interpreter in your program.
795
796 Intimate knowledge of L<perlembed> and preferably some experience with
797 embedding perl is highly recommended.
798
799 C<mkperl> (or the C<--perl> option) basically does this to link the new
800 interpreter (it also adds a main program to F<bundle.>):
801
802 $Config{cc} $(cat bundle.ccopts) -o perl bundle.c $(cat bundle.ldopts)
803
804 =over 4
805
806 =item bundle.h
807
808 A header file that contains the prototypes of the few symbols "exported"
809 by bundle.c, and also exposes the perl headers to the application.
810
811 =over 4
812
813 =item staticperl_init ()
814
815 Initialises the perl interpreter. You can use the normal perl functions
816 after calling this function, for example, to define extra functions or
817 to load a .pm file that contains some initialisation code, or the main
818 program function:
819
820 XS (xsfunction)
821 {
822 dXSARGS;
823
824 // now we have items, ST(i) etc.
825 }
826
827 static void
828 run_myapp(void)
829 {
830 staticperl_init ();
831 newXSproto ("myapp::xsfunction", xsfunction, __FILE__, "$$;$");
832 eval_pv ("require myapp::main", 1); // executes "myapp/main.pm"
833 }
834
835 =item staticperl_xs_init (pTHX)
836
837 Sometimes you need direct control over C<perl_parse> and C<perl_run>, in
838 which case you do not want to use C<staticperl_init> but call them on your
839 own.
840
841 Then you need this function - either pass it directly as the C<xs_init>
842 function to C<perl_parse>, or call it from your own C<xs_init> function.
843
844 =item staticperl_cleanup ()
845
846 In the unlikely case that you want to destroy the perl interpreter, here
847 is the corresponding function.
848
849 =item PerlInterpreter *staticperl
850
851 The perl interpreter pointer used by staticperl. Not normally so useful,
852 but there it is.
853
854 =back
855
856 =item bundle.ccopts
857
858 Contains the compiler options required to compile at least F<bundle.c> and
859 any file that includes F<bundle.h> - you should probably use it in your
860 C<CFLAGS>.
861
862 =item bundle.ldopts
863
864 The linker options needed to link the final program.
865
866 =back
867
868 =head1 RUNTIME FUNCTIONALITY
869
870 Binaries created with C<mkbundle>/C<mkperl> contain extra functions, which
871 are required to access the bundled perl sources, but might be useful for
872 other purposes.
873
874 In addition, for the embedded loading of perl files to work, F<staticperl>
875 overrides the C<@INC> array.
876
877 =over 4
878
879 =item $file = staticperl::find $path
880
881 Returns the data associated with the given C<$path>
882 (e.g. C<Digest/MD5.pm>, C<auto/POSIX/autosplit.ix>), which is basically
883 the UNIX path relative to the perl library directory.
884
885 Returns C<undef> if the file isn't embedded.
886
887 =item @paths = staticperl::list
888
889 Returns the list of all paths embedded in this binary.
890
891 =back
892
893 =head1 FULLY STATIC BINARIES - BUILDROOT
894
895 To make truly static (Linux-) libraries, you might want to have a look at
896 buildroot (L<http://buildroot.uclibc.org/>).
897
898 Buildroot is primarily meant to set up a cross-compile environment (which
899 is not so useful as perl doesn't quite like cross compiles), but it can also compile
900 a chroot environment where you can use F<staticperl>.
901
902 To do so, download buildroot, and enable "Build options => development
903 files in target filesystem" and optionally "Build options => gcc
904 optimization level (optimize for size)". At the time of writing, I had
905 good experiences with GCC 4.4.x but not GCC 4.5.
906
907 To minimise code size, I used C<-pipe -ffunction-sections -fdata-sections
908 -finline-limit=8 -fno-builtin-strlen -mtune=i386>. The C<-mtune=i386>
909 doesn't decrease codesize much, but it makes the file much more
910 compressible.
911
912 If you don't need Coro or threads, you can go with "linuxthreads.old" (or
913 no thread support). For Coro, it is highly recommended to switch to a
914 uClibc newer than 0.9.31 (at the time of this writing, I used the 20101201
915 snapshot) and enable NPTL, otherwise Coro needs to be configured with the
916 ultra-slow pthreads backend to work around linuxthreads bugs (it also uses
917 twice the address space needed for stacks).
918
919 If you use C<linuxthreads.old>, then you should also be aware that
920 uClibc shares C<errno> between all threads when statically linking. See
921 L<http://lists.uclibc.org/pipermail/uclibc/2010-June/044157.html> for a
922 workaround (And L<https://bugs.uclibc.org/2089> for discussion).
923
924 C<ccache> support is also recommended, especially if you want
925 to play around with buildroot options. Enabling the C<miniperl>
926 package will probably enable all options required for a successful
927 perl build. F<staticperl> itself additionally needs either C<wget>
928 (recommended, for CPAN) or C<curl>.
929
930 As for shells, busybox should provide all that is needed, but the default
931 busybox configuration doesn't include F<comm> which is needed by perl -
932 either make a custom busybox config, or compile coreutils.
933
934 For the latter route, you might find that bash has some bugs that keep
935 it from working properly in a chroot - either use dash (and link it to
936 F</bin/sh> inside the chroot) or link busybox to F</bin/sh>, using it's
937 built-in ash shell.
938
939 Finally, you need F</dev/null> inside the chroot for many scripts to work
940 - F<cp /dev/null output/target/dev> or bind-mounting your F</dev> will
941 both provide this.
942
943 After you have compiled and set up your buildroot target, you can copy
944 F<staticperl> from the C<App::Staticperl> distribution or from your
945 perl f<bin> directory (if you installed it) into the F<output/target>
946 filesystem, chroot inside and run it.
947
948 =head1 RECIPES / SPECIFIC MODULES
949
950 This section contains some common(?) recipes and information about
951 problems with some common modules or perl constructs that require extra
952 files to be included.
953
954 =head2 MODULES
955
956 =over 4
957
958 =item utf8
959
960 Some functionality in the utf8 module, such as swash handling (used
961 for unicode character ranges in regexes) is implemented in the
962 C<"utf8_heavy.pl"> library:
963
964 -M'"utf8_heavy.pl"'
965
966 Many Unicode properties in turn are defined in separate modules,
967 such as C<"unicore/Heavy.pl"> and more specific data tables such as
968 C<"unicore/To/Digit.pl"> or C<"unicore/lib/Perl/Word.pl">. These tables
969 are big (7MB uncompressed, although F<staticperl> contains special
970 handling for those files), so including them on demand by your application
971 only might pay off.
972
973 To simply include the whole unicode database, use:
974
975 --incglob '/unicore/*.pl'
976
977 =item AnyEvent
978
979 AnyEvent needs a backend implementation that it will load in a delayed
980 fashion. The L<AnyEvent::Impl::Perl> backend is the default choice
981 for AnyEvent if it can't find anything else, and is usually a safe
982 fallback. If you plan to use e.g. L<EV> (L<POE>...), then you need to
983 include the L<AnyEvent::Impl::EV> (L<AnyEvent::Impl::POE>...) backend as
984 well.
985
986 If you want to handle IRIs or IDNs (L<AnyEvent::Util> punycode and idn
987 functions), you also need to include C<"AnyEvent/Util/idna.pl"> and
988 C<"AnyEvent/Util/uts46data.pl">.
989
990 Or you can use C<--usepacklist> and specify C<-MAnyEvent> to include
991 everything.
992
993 =item Carp
994
995 Carp had (in older versions of perl) a dependency on L<Carp::Heavy>. As of
996 perl 5.12.2 (maybe earlier), this dependency no longer exists.
997
998 =item Config
999
1000 The F<perl -V> switch (as well as many modules) needs L<Config>, which in
1001 turn might need L<"Config_heavy.pl">. Including the latter gives you
1002 both.
1003
1004 =item Term::ReadLine::Perl
1005
1006 Also needs L<Term::ReadLine::readline>, or C<--usepacklist>.
1007
1008 =item URI
1009
1010 URI implements schemes as separate modules - the generic URL scheme is
1011 implemented in L<URI::_generic>, HTTP is implemented in L<URI::http>. If
1012 you need to use any of these schemes, you should include these manually,
1013 or use C<--usepacklist>.
1014
1015 =back
1016
1017 =head2 RECIPES
1018
1019 =over 4
1020
1021 =item Linking everything in
1022
1023 To link just about everything installed in the perl library into a new
1024 perl, try this:
1025
1026 staticperl mkperl --strip ppi --incglob '*'
1027
1028 =item Getting rid of netdb function
1029
1030 The perl core has lots of netdb functions (C<getnetbyname>, C<getgrent>
1031 and so on) that few applications use. You can avoid compiling them in by
1032 putting the following fragment into a C<preconfigure> hook:
1033
1034 preconfigure() {
1035 for sym in \
1036 d_getgrnam_r d_endgrent d_endgrent_r d_endhent \
1037 d_endhostent_r d_endnent d_endnetent_r d_endpent \
1038 d_endprotoent_r d_endpwent d_endpwent_r d_endsent \
1039 d_endservent_r d_getgrent d_getgrent_r d_getgrgid_r \
1040 d_getgrnam_r d_gethbyaddr d_gethent d_getsbyport \
1041 d_gethostbyaddr_r d_gethostbyname_r d_gethostent_r \
1042 d_getlogin_r d_getnbyaddr d_getnbyname d_getnent \
1043 d_getnetbyaddr_r d_getnetbyname_r d_getnetent_r \
1044 d_getpent d_getpbyname d_getpbynumber d_getprotobyname_r \
1045 d_getprotobynumber_r d_getprotoent_r d_getpwent \
1046 d_getpwent_r d_getpwnam_r d_getpwuid_r d_getsent \
1047 d_getservbyname_r d_getservbyport_r d_getservent_r \
1048 d_getspnam_r d_getsbyname
1049 # d_gethbyname
1050 do
1051 PERL_CONFIGURE="$PERL_CONFIGURE -U$sym"
1052 done
1053 }
1054
1055 This mostly gains space when linking staticaly, as the functions will
1056 liekly not be linked in. The gain for dynamically-linked binaries is
1057 smaller.
1058
1059 Also, this leaves C<gethostbyname> in - not only is it actually used
1060 often, the L<Socket> module also exposes it, so leaving it out usually
1061 gains little. Why Socket exposes a C function that is in the core already
1062 is anybody's guess.
1063
1064 =back
1065
1066 =head1 AUTHOR
1067
1068 Marc Lehmann <schmorp@schmorp.de>
1069 http://software.schmorp.de/pkg/staticperl.html