ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/cvsroot/App-Staticperl/staticperl.pod
Revision: 1.25
Committed: Tue Dec 21 12:59:29 2010 UTC (13 years, 6 months ago) by root
Branch: MAIN
Changes since 1.24: +36 -5 lines
Log Message:
*** empty log message ***

File Contents

# Content
1 =head1 NAME
2
3 staticperl - perl, libc, 100 modules, all in one 500kb file
4
5 =head1 SYNOPSIS
6
7 staticperl help # print the embedded documentation
8 staticperl fetch # fetch and unpack perl sources
9 staticperl configure # fetch and then configure perl
10 staticperl build # configure and then build perl
11 staticperl install # build and then install perl
12 staticperl clean # clean most intermediate files (restart at configure)
13 staticperl distclean # delete everything installed by this script
14 staticperl cpan # invoke CPAN shell
15 staticperl instmod path... # install unpacked modules
16 staticperl instcpan modulename... # install modules from CPAN
17 staticperl mkbundle <bundle-args...> # see documentation
18 staticperl mkperl <bundle-args...> # see documentation
19 staticperl mkapp appname <bundle-args...> # see documentation
20
21 Typical Examples:
22
23 staticperl install # fetch, configure, build and install perl
24 staticperl cpan # run interactive cpan shell
25 staticperl mkperl -M '"Config_heavy.pl"' # build a perl that supports -V
26 staticperl mkperl -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI -MURI::http
27 # build a perl with the above modules linked in
28 staticperl mkapp myapp --boot mainprog mymodules
29 # build a binary "myapp" from mainprog and mymodules
30
31 =head1 DESCRIPTION
32
33 This script helps you to create single-file perl interpreters
34 or applications, or embedding a perl interpreter in your
35 applications. Single-file means that it is fully self-contained - no
36 separate shared objects, no autoload fragments, no .pm or .pl files are
37 needed. And when linking statically, you can create (or embed) a single
38 file that contains perl interpreter, libc, all the modules you need, all
39 the libraries you need and of course your actual program.
40
41 With F<uClibc> and F<upx> on x86, you can create a single 500kb binary
42 that contains perl and 100 modules such as POSIX, AnyEvent, EV, IO::AIO,
43 Coro and so on. Or any other choice of modules.
44
45 To see how this turns out, you can try out smallperl and bigperl, two
46 pre-built static and compressed perl binaries with many and even more
47 modules: just follow the links at L<http://staticperl.schmorp.de/>.
48
49 The created files do not need write access to the file system (like PAR
50 does). In fact, since this script is in many ways similar to PAR::Packer,
51 here are the differences:
52
53 =over 4
54
55 =item * The generated executables are much smaller than PAR created ones.
56
57 Shared objects and the perl binary contain a lot of extra info, while
58 the static nature of F<staticperl> allows the linker to remove all
59 functionality and meta-info not required by the final executable. Even
60 extensions statically compiled into perl at build time will only be
61 present in the final executable when needed.
62
63 In addition, F<staticperl> can strip perl sources much more effectively
64 than PAR.
65
66 =item * The generated executables start much faster.
67
68 There is no need to unpack files, or even to parse Zip archives (which is
69 slow and memory-consuming business).
70
71 =item * The generated executables don't need a writable filesystem.
72
73 F<staticperl> loads all required files directly from memory. There is no
74 need to unpack files into a temporary directory.
75
76 =item * More control over included files, more burden.
77
78 PAR tries to be maintenance and hassle-free - it tries to include more
79 files than necessary to make sure everything works out of the box. It
80 mostly succeeds at this, but he extra files (such as the unicode database)
81 can take substantial amounts of memory and file size.
82
83 With F<staticperl>, the burden is mostly with the developer - only direct
84 compile-time dependencies and L<AutoLoader> are handled automatically.
85 This means the modules to include often need to be tweaked manually.
86
87 All this does not preclude more permissive modes to be implemented in
88 the future, but right now, you have to resolve state hidden dependencies
89 manually.
90
91 =item * PAR works out of the box, F<staticperl> does not.
92
93 Maintaining your own custom perl build can be a pain in the ass, and while
94 F<staticperl> tries to make this easy, it still requires a custom perl
95 build and possibly fiddling with some modules. PAR is likely to produce
96 results faster.
97
98 Ok, PAR never has worked for me out of the box, and for some people,
99 F<staticperl> does work out of the box, as they don't count "fiddling with
100 module use lists" against it, but nevertheless, F<staticperl> is certainly
101 a bit more difficult to use.
102
103 =back
104
105 =head1 HOW DOES IT WORK?
106
107 Simple: F<staticperl> downloads, compile and installs a perl version of
108 your choice in F<~/.staticperl>. You can add extra modules either by
109 letting F<staticperl> install them for you automatically, or by using CPAN
110 and doing it interactively. This usually takes 5-10 minutes, depending on
111 the speed of your computer and your internet connection.
112
113 It is possible to do program development at this stage, too.
114
115 Afterwards, you create a list of files and modules you want to include,
116 and then either build a new perl binary (that acts just like a normal perl
117 except everything is compiled in), or you create bundle files (basically C
118 sources you can use to embed all files into your project).
119
120 This step is very fast (a few seconds if PPI is not used for stripping, or
121 the stripped files are in the cache), and can be tweaked and repeated as
122 often as necessary.
123
124 =head1 THE F<STATICPERL> SCRIPT
125
126 This module installs a script called F<staticperl> into your perl
127 binary directory. The script is fully self-contained, and can be
128 used without perl (for example, in an uClibc chroot environment). In
129 fact, it can be extracted from the C<App::Staticperl> distribution
130 tarball as F<bin/staticperl>, without any installation. The
131 newest (possibly alpha) version can also be downloaded from
132 L<http://staticperl.schmorp.de/staticperl>.
133
134 F<staticperl> interprets the first argument as a command to execute,
135 optionally followed by any parameters.
136
137 There are two command categories: the "phase 1" commands which deal with
138 installing perl and perl modules, and the "phase 2" commands, which deal
139 with creating binaries and bundle files.
140
141 =head2 PHASE 1 COMMANDS: INSTALLING PERL
142
143 The most important command is F<install>, which does basically
144 everything. The default is to download and install perl 5.12.2 and a few
145 modules required by F<staticperl> itself, but all this can (and should) be
146 changed - see L<CONFIGURATION>, below.
147
148 The command
149
150 staticperl install
151
152 is normally all you need: It installs the perl interpreter in
153 F<~/.staticperl/perl>. It downloads, configures, builds and installs the
154 perl interpreter if required.
155
156 Most of the following F<staticperl> subcommands simply run one or more
157 steps of this sequence.
158
159 If it fails, then most commonly because the compiler options I selected
160 are not supported by your compiler - either edit the F<staticperl> script
161 yourself or create F<~/.staticperl> shell script where your set working
162 C<PERL_CCFLAGS> etc. variables.
163
164 To force recompilation or reinstallation, you need to run F<staticperl
165 distclean> first.
166
167 =over 4
168
169 =item F<staticperl version>
170
171 Prints some info about the version of the F<staticperl> script you are using.
172
173 =item F<staticperl fetch>
174
175 Runs only the download and unpack phase, unless this has already happened.
176
177 =item F<staticperl configure>
178
179 Configures the unpacked perl sources, potentially after downloading them first.
180
181 =item F<staticperl build>
182
183 Builds the configured perl sources, potentially after automatically
184 configuring them.
185
186 =item F<staticperl install>
187
188 Wipes the perl installation directory (usually F<~/.staticperl/perl>) and
189 installs the perl distribution, potentially after building it first.
190
191 =item F<staticperl cpan> [args...]
192
193 Starts an interactive CPAN shell that you can use to install further
194 modules. Installs the perl first if necessary, but apart from that,
195 no magic is involved: you could just as well run it manually via
196 F<~/.staticperl/perl/bin/cpan>.
197
198 Any additional arguments are simply passed to the F<cpan> command.
199
200 =item F<staticperl instcpan> module...
201
202 Tries to install all the modules given and their dependencies, using CPAN.
203
204 Example:
205
206 staticperl instcpan EV AnyEvent::HTTPD Coro
207
208 =item F<staticperl instsrc> directory...
209
210 In the unlikely case that you have unpacked perl modules around and want
211 to install from these instead of from CPAN, you can do this using this
212 command by specifying all the directories with modules in them that you
213 want to have built.
214
215 =item F<staticperl clean>
216
217 Deletes the perl source directory (and potentially cleans up other
218 intermediate files). This can be used to clean up files only needed for
219 building perl, without removing the installed perl interpreter.
220
221 At the moment, it doesn't delete downloaded tarballs.
222
223 The exact semantics of this command will probably change.
224
225 =item F<staticperl distclean>
226
227 This wipes your complete F<~/.staticperl> directory. Be careful with this,
228 it nukes your perl download, perl sources, perl distribution and any
229 installed modules. It is useful if you wish to start over "from scratch"
230 or when you want to uninstall F<staticperl>.
231
232 =back
233
234 =head2 PHASE 2 COMMANDS: BUILDING PERL BUNDLES
235
236 Building (linking) a new F<perl> binary is handled by a separate
237 script. To make it easy to use F<staticperl> from a F<chroot>, the script
238 is embedded into F<staticperl>, which will write it out and call for you
239 with any arguments you pass:
240
241 staticperl mkbundle mkbundle-args...
242
243 In the oh so unlikely case of something not working here, you
244 can run the script manually as well (by default it is written to
245 F<~/.staticperl/mkbundle>).
246
247 F<mkbundle> is a more conventional command and expect the argument
248 syntax commonly used on UNIX clones. For example, this command builds
249 a new F<perl> binary and includes F<Config.pm> (for F<perl -V>),
250 F<AnyEvent::HTTPD>, F<URI> and a custom F<httpd> script (from F<eg/httpd>
251 in this distribution):
252
253 # first make sure we have perl and the required modules
254 staticperl instcpan AnyEvent::HTTPD
255
256 # now build the perl
257 staticperl mkperl -M'"Config_heavy.pl"' -MAnyEvent::Impl::Perl \
258 -MAnyEvent::HTTPD -MURI::http \
259 --add 'eg/httpd httpd.pm'
260
261 # finally, invoke it
262 ./perl -Mhttpd
263
264 As you can see, things are not quite as trivial: the L<Config> module has
265 a hidden dependency which is not even a perl module (F<Config_heavy.pl>),
266 L<AnyEvent> needs at least one event loop backend that we have to
267 specify manually (here L<AnyEvent::Impl::Perl>), and the F<URI> module
268 (required by L<AnyEvent::HTTPD>) implements various URI schemes as extra
269 modules - since L<AnyEvent::HTTPD> only needs C<http> URIs, we only need
270 to include that module. I found out about these dependencies by carefully
271 watching any error messages about missing modules...
272
273 Instead of building a new perl binary, you can also build a standalone
274 application:
275
276 # build the app
277 staticperl mkapp app --boot eg/httpd \
278 -MAnyEvent::Impl::Perl -MAnyEvent::HTTPD -MURI::http
279
280 # run it
281 ./app
282
283 Here are the three phase 2 commands:
284
285 =over 4
286
287 =item F<staticperl mkbundle> args...
288
289 The "default" bundle command - it interprets the given bundle options and
290 writes out F<bundle.h>, F<bundle.c>, F<bundle.ccopts> and F<bundle.ldopts>
291 files, useful for embedding.
292
293 =item F<staticperl mkperl> args...
294
295 Creates a bundle just like F<staticperl mkbundle> (in fact, it's the same
296 as invoking F<staticperl mkbundle --perl> args...), but then compiles and
297 links a new perl interpreter that embeds the created bundle, then deletes
298 all intermediate files.
299
300 =item F<staticperl mkapp> filename args...
301
302 Does the same as F<staticperl mkbundle> (in fact, it's the same as
303 invoking F<staticperl mkbundle --app> filename args...), but then compiles
304 and links a new standalone application that simply initialises the perl
305 interpreter.
306
307 The difference to F<staticperl mkperl> is that the standalone application
308 does not act like a perl interpreter would - in fact, by default it would
309 just do nothing and exit immediately, so you should specify some code to
310 be executed via the F<--boot> option.
311
312 =back
313
314 =head3 OPTION PROCESSING
315
316 All options can be given as arguments on the command line (typically
317 using long (e.g. C<--verbose>) or short option (e.g. C<-v>) style). Since
318 specifying a lot of modules can make the command line very cumbersome, you
319 can put all long options into a "bundle specification file" (one option
320 per line, with or without C<--> prefix) and specify this bundle file
321 instead.
322
323 For example, the command given earlier could also look like this:
324
325 staticperl mkperl httpd.bundle
326
327 And all options could be in F<httpd.bundle>:
328
329 use "Config_heavy.pl"
330 use AnyEvent::Impl::Perl
331 use AnyEvent::HTTPD
332 use URI::http
333 add eg/httpd httpd.pm
334
335 All options that specify modules or files to be added are processed in the
336 order given on the command line.
337
338 =head3 PACKAGE SELECTION WORKFLOW
339
340 F<staticperl mkbundle> has a number of options to control package
341 selection. This section describes how they interact with each other. Also,
342 since I am still a newbie w.r.t. these issues, maybe future versions of
343 F<staticperl> will change this, so watch out :)
344
345 The idiom "in order" means "in order that they are specified on the
346 commandline". If you use a bundle specification file, then the options
347 will be processed as if they were given in place of the bundle file name.
348
349 =over 4
350
351 =item 1. apply all C<--use>, C<--eval>, C<--add>, C<--addbin> and
352 C<--incglob> options, in order.
353
354 In addition, C<--use> and C<--eval> dependencies will be added when the
355 options are processed.
356
357 =item 2. apply all C<--include> and C<--exclude> options, in order.
358
359 All this step does is potentially reduce the number of files already
360 selected or found in phase 1.
361
362 =item 3. find all modules (== F<.pm> files), gather their static archives
363 (F<.a>) and AutoLoader splitfiles (F<.ix> and F<.al> files), find any
364 extra libraries they need for linking (F<extralibs.ld>) and optionally
365 evaluate any F<.packlist> files.
366
367 This step is required to link against XS extensions and also adds files
368 required for L<AutoLoader> to do it's job.
369
370 =back
371
372 After this, all the files selected for bundling will be read and processed
373 (stripped), the bundle files will be written, and optionally a new F<perl>
374 or application binary will be linked.
375
376 =head3 MKBUNDLE OPTIONS
377
378 =over 4
379
380 =item --verbose | -v
381
382 Increases the verbosity level by one (the default is C<1>).
383
384 =item --quiet | -q
385
386 Decreases the verbosity level by one.
387
388 =item --strip none|pod|ppi
389
390 Specify the stripping method applied to reduce the file of the perl
391 sources included.
392
393 The default is C<pod>, which uses the L<Pod::Strip> module to remove all
394 pod documentation, which is very fast and reduces file size a lot.
395
396 The C<ppi> method uses L<PPI> to parse and condense the perl sources. This
397 saves a lot more than just L<Pod::Strip>, and is generally safer,
398 but is also a lot slower (some files take almost a minute to strip -
399 F<staticperl> maintains a cache of stripped files to speed up subsequent
400 runs for this reason). Note that this method doesn't optimise for raw file
401 size, but for best compression (that means that the uncompressed file size
402 is a bit larger, but the files compress better, e.g. with F<upx>).
403
404 Last not least, if you need accurate line numbers in error messages,
405 or in the unlikely case where C<pod> is too slow, or some module gets
406 mistreated, you can specify C<none> to not mangle included perl sources in
407 any way.
408
409 =item --perl
410
411 After writing out the bundle files, try to link a new perl interpreter. It
412 will be called F<perl> and will be left in the current working
413 directory. The bundle files will be removed.
414
415 This switch is automatically used when F<staticperl> is invoked with the
416 C<mkperl> command (instead of C<mkbundle>):
417
418 # build a new ./perl with only common::sense in it - very small :)
419 staticperl mkperl -Mcommon::sense
420
421 =item --app name
422
423 After writing out the bundle files, try to link a new standalone
424 program. It will be called C<name>, and the bundle files get removed after
425 linking it.
426
427 The difference to the (mutually exclusive) C<--perl> option is that the
428 binary created by this option will not try to act as a perl interpreter -
429 instead it will simply initialise the perl interpreter, clean it up and
430 exit.
431
432 This switch is automatically used when F<staticperl> is invoked with the
433 C<mkapp> command (instead of C<mkbundle>):
434
435 To let it do something useful you I<must> add some boot code, e.g. with
436 the C<--boot> option.
437
438 Example: create a standalone perl binary that will execute F<appfile> when
439 it is started.
440
441 staticperl mkbundle --app myexe --boot appfile
442
443 =item --use module | -Mmodule
444
445 Include the named module and all direct dependencies. This is done by
446 C<require>'ing the module in a subprocess and tracing which other modules
447 and files it actually loads. If the module uses L<AutoLoader>, then all
448 splitfiles will be included as well.
449
450 Example: include AnyEvent and AnyEvent::Impl::Perl.
451
452 staticperl mkbundle --use AnyEvent --use AnyEvent::Impl::Perl
453
454 Sometimes you want to load old-style "perl libraries" (F<.pl> files), or
455 maybe other weirdly named files. To do that, you need to quote the name in
456 single or double quotes. When given on the command line, you probably need
457 to quote once more to avoid your shell interpreting it. Common cases that
458 need this are F<Config_heavy.pl> and F<utf8_heavy.pl>.
459
460 Example: include the required files for F<perl -V> to work in all its
461 glory (F<Config.pm> is included automatically by this).
462
463 # bourne shell
464 staticperl mkbundle --use '"Config_heavy.pl"'
465
466 # bundle specification file
467 use "Config_heavy.pl"
468
469 The C<-Mmodule> syntax is included as an alias that might be easier to
470 remember than C<use>. Or maybe it confuses people. Time will tell. Or
471 maybe not. Argh.
472
473 =item --eval "perl code" | -e "perl code"
474
475 Sometimes it is easier (or necessary) to specify dependencies using perl
476 code, or maybe one of the modules you use need a special use statement. In
477 that case, you can use C<eval> to execute some perl snippet or set some
478 variables or whatever you need. All files C<require>'d or C<use>'d in the
479 script are included in the final bundle.
480
481 Keep in mind that F<mkbundle> will only C<require> the modules named
482 by the C<--use> option, so do not expect the symbols from modules you
483 C<--use>'d earlier on the command line to be available.
484
485 Example: force L<AnyEvent> to detect a backend and therefore include it
486 in the final bundle.
487
488 staticperl mkbundle --eval 'use AnyEvent; AnyEvent::detect'
489
490 # or like this
491 staticperl mkbundle -MAnyEvent --eval 'use AnyEvent; AnyEvent::detect'
492
493 Example: use a separate "bootstrap" script that C<use>'s lots of modules
494 and include this in the final bundle, to be executed automatically.
495
496 staticperl mkbundle --eval 'do "bootstrap"' --boot bootstrap
497
498 =item --boot filename
499
500 Include the given file in the bundle and arrange for it to be executed
501 (using a C<require>) before anything else when the new perl is
502 initialised. This can be used to modify C<@INC> or anything else before
503 the perl interpreter executes scripts given on the command line (or via
504 C<-e>). This works even in an embedded interpreter.
505
506 =item --usepacklist
507
508 Read F<.packlist> files for each distribution that happens to match a
509 module name you specified. Sounds weird, and it is, so expect semantics to
510 change somehow in the future.
511
512 The idea is that most CPAN distributions have a F<.pm> file that matches
513 the name of the distribution (which is rather reasonable after all).
514
515 If this switch is enabled, then if any of the F<.pm> files that have been
516 selected match an install distribution, then all F<.pm>, F<.pl>, F<.al>
517 and F<.ix> files installed by this distribution are also included.
518
519 For example, using this switch, when the L<URI> module is specified, then
520 all L<URI> submodules that have been installed via the CPAN distribution
521 are included as well, so you don't have to manually specify them.
522
523 =item --incglob pattern
524
525 This goes through all library directories and tries to match any F<.pm>
526 and F<.pl> files against the extended glob pattern (see below). If a file
527 matches, it is added. This switch will automatically detect L<AutoLoader>
528 files and the required link libraries for XS modules, but it will I<not>
529 scan the file for dependencies (at the moment).
530
531 This is mainly useful to include "everything":
532
533 --incglob '*'
534
535 Or to include perl libraries, or trees of those, such as the unicode
536 database files needed by many other modules:
537
538 --incglob '/unicore/**.pl'
539
540 =item --add file | --add "file alias"
541
542 Adds the given (perl) file into the bundle (and optionally call it
543 "alias"). This is useful to include any custom files into the bundle.
544
545 Example: embed the file F<httpd> as F<httpd.pm> when creating the bundle.
546
547 staticperl mkperl --add "httpd httpd.pm"
548
549 It is also a great way to add any custom modules:
550
551 # specification file
552 add file1 myfiles/file1
553 add file2 myfiles/file2
554 add file3 myfiles/file3
555
556 =item --binadd file | --add "file alias"
557
558 Just like C<--add>, except that it treats the file as binary and adds it
559 without any processing.
560
561 You should probably add a C</> prefix to avoid clashing with embedded
562 perl files (whose paths do not start with C</>), and/or use a special
563 directory, such as C</res/name>.
564
565 You can later get a copy of these files by calling C<staticperl::find
566 "alias">.
567
568 =item --include pattern | -i pattern | --exclude pattern | -x pattern
569
570 These two options define an include/exclude filter that is used after all
571 files selected by the other options have been found. Each include/exclude
572 is applied to all files found so far - an include makes sure that the
573 given files will be part of the resulting file set, an exclude will
574 exclude files. The patterns are "extended glob patterns" (see below).
575
576 For example, to include everything, except C<Devel> modules, but still
577 include F<Devel::PPPort>, you could use this:
578
579 --incglob '*' -i '/Devel/PPPort.pm' -x '/Devel/**'
580
581 =item --static
582
583 When C<--perl> is also given, link statically instead of dynamically. The
584 default is to link the new perl interpreter fully dynamic (that means all
585 perl modules are linked statically, but all external libraries are still
586 referenced dynamically).
587
588 Keep in mind that Solaris doesn't support static linking at all, and
589 systems based on GNU libc don't really support it in a usable fashion
590 either. Try uClibc if you want to create fully statically linked
591 executables, or try the C<--staticlibs> option to link only some libraries
592 statically.
593
594 =item --staticlib libname
595
596 When not linking fully statically, this option allows you to link specific
597 libraries statically. What it does is simply replace all occurances of
598 C<-llibname> with the GCC-specific C<-Wl,-Bstatic -llibname -Wl,-Bdynamic>
599 option.
600
601 This will have no effect unless the library is actually linked against,
602 specifically, C<--staticlib> will not link against the named library
603 unless it would be linked against anyway.
604
605 Example: link libcrypt statically into the binary.
606
607 staticperl mkperl -MIO::AIO --staticlib crypt
608
609 # ldopts might nwo contain:
610 # -lm -Wl,-Bstatic -lcrypt -Wl,-Bdynamic -lpthread
611
612 =item any other argument
613
614 Any other argument is interpreted as a bundle specification file, which
615 supports most long options (without extra quoting), one option per line.
616
617 =back
618
619 =head3 EXTENDED GLOB PATTERNS
620
621 Some options of F<staticperl mkbundle> expect an I<extended glob
622 pattern>. This is neither a normal shell glob nor a regex, but something
623 in between. The idea has been copied from rsync, and there are the current
624 matching rules:
625
626 =over 4
627
628 =item Patterns starting with F</> will be a anchored at the root of the library tree.
629
630 That is, F</unicore> will match the F<unicore> directory in C<@INC>, but
631 nothing inside, and neither any other file or directory called F<unicore>
632 anywhere else in the hierarchy.
633
634 =item Patterns not starting with F</> will be anchored at the end of the path.
635
636 That is, F<idna.pl> will match any file called F<idna.pl> anywhere in the
637 hierarchy, but not any directories of the same name.
638
639 =item A F<*> matches any single component.
640
641 That is, F</unicore/*.pl> would match all F<.pl> files directly inside
642 C</unicore>, not any deeper level F<.pl> files. Or in other words, F<*>
643 will not match slashes.
644
645 =item A F<**> matches anything.
646
647 That is, F</unicore/**.pl> would match all F<.pl> files under F</unicore>,
648 no matter how deeply nested they are inside subdirectories.
649
650 =item A F<?> matches a single character within a component.
651
652 That is, F</Encode/??.pm> matches F</Encode/JP.pm>, but not the
653 hypothetical F</Encode/J/.pm>, as F<?> does not match F</>.
654
655 =back
656
657 =head2 F<STATICPERL> CONFIGURATION AND HOOKS
658
659 During (each) startup, F<staticperl> tries to source some shell files to
660 allow you to fine-tune/override configuration settings.
661
662 In them you can override shell variables, or define shell functions
663 ("hooks") to be called at specific phases during installation. For
664 example, you could define a C<postinstall> hook to install additional
665 modules from CPAN each time you start from scratch.
666
667 If the env variable C<$STATICPERLRC> is set, then F<staticperl> will try
668 to source the file named with it only. Otherwise, it tries the following
669 shell files in order:
670
671 /etc/staticperlrc
672 ~/.staticperlrc
673 $STATICPERL/rc
674
675 Note that the last file is erased during F<staticperl distclean>, so
676 generally should not be used.
677
678 =head3 CONFIGURATION VARIABLES
679
680 =head4 Variables you I<should> override
681
682 =over 4
683
684 =item C<EMAIL>
685
686 The e-mail address of the person who built this binary. Has no good
687 default, so should be specified by you.
688
689 =item C<CPAN>
690
691 The URL of the CPAN mirror to use (e.g. L<http://mirror.netcologne.de/cpan/>).
692
693 =item C<EXTRA_MODULES>
694
695 Additional modules installed during F<staticperl install>. Here you can
696 set which modules you want have to installed from CPAN.
697
698 Example: I really really need EV, AnyEvent, Coro and AnyEvent::AIO.
699
700 EXTRA_MODULES="EV AnyEvent Coro AnyEvent::AIO"
701
702 Note that you can also use a C<postinstall> hook to achieve this, and
703 more.
704
705 =back
706
707 =head4 Variables you might I<want> to override
708
709 =over 4
710
711 =item C<STATICPERL>
712
713 The directory where staticperl stores all its files
714 (default: F<~/.staticperl>).
715
716 =item C<PERL_MM_USE_DEFAULT>, C<EV_EXTRA_DEFS>, ...
717
718 Usually set to C<1> to make modules "less inquisitive" during their
719 installation, you can set any environment variable you want - some modules
720 (such as L<Coro> or L<EV>) use environment variables for further tweaking.
721
722 =item C<PERL_VERSION>
723
724 The perl version to install - default is currently C<5.12.2>, but C<5.8.9>
725 is also a good choice (5.8.9 is much smaller than 5.12.2, while 5.10.1 is
726 about as big as 5.12.2).
727
728 =item C<PERL_PREFIX>
729
730 The prefix where perl gets installed (default: F<$STATICPERL/perl>),
731 i.e. where the F<bin> and F<lib> subdirectories will end up.
732
733 =item C<PERL_CONFIGURE>
734
735 Additional Configure options - these are simply passed to the perl
736 Configure script. For example, if you wanted to enable dynamic loading,
737 you could pass C<-Dusedl>. To enable ithreads (Why would you want that
738 insanity? Don't! Use L<forks> instead!) you would pass C<-Duseithreads>
739 and so on.
740
741 More commonly, you would either activate 64 bit integer support
742 (C<-Duse64bitint>), or disable large files support (-Uuselargefiles), to
743 reduce filesize further.
744
745 =item C<PERL_CC>, C<PERL_CCFLAGS>, C<PERL_OPTIMIZE>, C<PERL_LDFLAGS>, C<PERL_LIBS>
746
747 These flags are passed to perl's F<Configure> script, and are generally
748 optimised for small size (at the cost of performance). Since they also
749 contain subtle workarounds around various build issues, changing these
750 usually requires understanding their default values - best look at
751 the top of the F<staticperl> script for more info on these, and use a
752 F<~/.staticperlrc> to override them.
753
754 Most of the variables override (or modify) the corresponding F<Configure>
755 variable, except C<PERL_CCFLAGS>, which gets appended.
756
757 =back
758
759 =head4 Variables you probably I<do not want> to override
760
761 =over 4
762
763 =item C<MAKE>
764
765 The make command to use - default is C<make>.
766
767 =item C<MKBUNDLE>
768
769 Where F<staticperl> writes the C<mkbundle> command to
770 (default: F<$STATICPERL/mkbundle>).
771
772 =item C<STATICPERL_MODULES>
773
774 Additional modules needed by C<mkbundle> - should therefore not be changed
775 unless you know what you are doing.
776
777 =back
778
779 =head3 OVERRIDABLE HOOKS
780
781 In addition to environment variables, it is possible to provide some
782 shell functions that are called at specific times. To provide your own
783 commands, just define the corresponding function.
784
785 Example: install extra modules from CPAN and from some directories
786 at F<staticperl install> time.
787
788 postinstall() {
789 rm -rf lib/threads* # weg mit Schaden
790 instcpan IO::AIO EV
791 instsrc ~/src/AnyEvent
792 instsrc ~/src/XML-Sablotron-1.0100001
793 instcpan Anyevent::AIO AnyEvent::HTTPD
794 }
795
796 =over 4
797
798 =item preconfigure
799
800 Called just before running F<./Configur> in the perl source
801 directory. Current working directory is the perl source directory.
802
803 This can be used to set any C<PERL_xxx> variables, which might be costly
804 to compute.
805
806 =item postconfigure
807
808 Called after configuring, but before building perl. Current working
809 directory is the perl source directory.
810
811 Could be used to tailor/patch config.sh (followed by F<sh Configure -S>)
812 or do any other modifications.
813
814 =item postbuild
815
816 Called after building, but before installing perl. Current working
817 directory is the perl source directory.
818
819 I have no clue what this could be used for - tell me.
820
821 =item postinstall
822
823 Called after perl and any extra modules have been installed in C<$PREFIX>,
824 but before setting the "installation O.K." flag.
825
826 The current working directory is C<$PREFIX>, but maybe you should not rely
827 on that.
828
829 This hook is most useful to customise the installation, by deleting files,
830 or installing extra modules using the C<instcpan> or C<instsrc> functions.
831
832 The script must return with a zero exit status, or the installation will
833 fail.
834
835 =back
836
837 =head1 ANATOMY OF A BUNDLE
838
839 When not building a new perl binary, C<mkbundle> will leave a number of
840 files in the current working directory, which can be used to embed a perl
841 interpreter in your program.
842
843 Intimate knowledge of L<perlembed> and preferably some experience with
844 embedding perl is highly recommended.
845
846 C<mkperl> (or the C<--perl> option) basically does this to link the new
847 interpreter (it also adds a main program to F<bundle.>):
848
849 $Config{cc} $(cat bundle.ccopts) -o perl bundle.c $(cat bundle.ldopts)
850
851 =over 4
852
853 =item bundle.h
854
855 A header file that contains the prototypes of the few symbols "exported"
856 by bundle.c, and also exposes the perl headers to the application.
857
858 =over 4
859
860 =item staticperl_init ()
861
862 Initialises the perl interpreter. You can use the normal perl functions
863 after calling this function, for example, to define extra functions or
864 to load a .pm file that contains some initialisation code, or the main
865 program function:
866
867 XS (xsfunction)
868 {
869 dXSARGS;
870
871 // now we have items, ST(i) etc.
872 }
873
874 static void
875 run_myapp(void)
876 {
877 staticperl_init ();
878 newXSproto ("myapp::xsfunction", xsfunction, __FILE__, "$$;$");
879 eval_pv ("require myapp::main", 1); // executes "myapp/main.pm"
880 }
881
882 =item staticperl_xs_init (pTHX)
883
884 Sometimes you need direct control over C<perl_parse> and C<perl_run>, in
885 which case you do not want to use C<staticperl_init> but call them on your
886 own.
887
888 Then you need this function - either pass it directly as the C<xs_init>
889 function to C<perl_parse>, or call it from your own C<xs_init> function.
890
891 =item staticperl_cleanup ()
892
893 In the unlikely case that you want to destroy the perl interpreter, here
894 is the corresponding function.
895
896 =item PerlInterpreter *staticperl
897
898 The perl interpreter pointer used by staticperl. Not normally so useful,
899 but there it is.
900
901 =back
902
903 =item bundle.ccopts
904
905 Contains the compiler options required to compile at least F<bundle.c> and
906 any file that includes F<bundle.h> - you should probably use it in your
907 C<CFLAGS>.
908
909 =item bundle.ldopts
910
911 The linker options needed to link the final program.
912
913 =back
914
915 =head1 RUNTIME FUNCTIONALITY
916
917 Binaries created with C<mkbundle>/C<mkperl> contain extra functions, which
918 are required to access the bundled perl sources, but might be useful for
919 other purposes.
920
921 In addition, for the embedded loading of perl files to work, F<staticperl>
922 overrides the C<@INC> array.
923
924 =over 4
925
926 =item $file = staticperl::find $path
927
928 Returns the data associated with the given C<$path>
929 (e.g. C<Digest/MD5.pm>, C<auto/POSIX/autosplit.ix>), which is basically
930 the UNIX path relative to the perl library directory.
931
932 Returns C<undef> if the file isn't embedded.
933
934 =item @paths = staticperl::list
935
936 Returns the list of all paths embedded in this binary.
937
938 =back
939
940 =head1 FULLY STATIC BINARIES - BUILDROOT
941
942 To make truly static (Linux-) libraries, you might want to have a look at
943 buildroot (L<http://buildroot.uclibc.org/>).
944
945 Buildroot is primarily meant to set up a cross-compile environment (which
946 is not so useful as perl doesn't quite like cross compiles), but it can also compile
947 a chroot environment where you can use F<staticperl>.
948
949 To do so, download buildroot, and enable "Build options => development
950 files in target filesystem" and optionally "Build options => gcc
951 optimization level (optimize for size)". At the time of writing, I had
952 good experiences with GCC 4.4.x but not GCC 4.5.
953
954 To minimise code size, I used C<-pipe -ffunction-sections -fdata-sections
955 -finline-limit=8 -fno-builtin-strlen -mtune=i386>. The C<-mtune=i386>
956 doesn't decrease codesize much, but it makes the file much more
957 compressible.
958
959 If you don't need Coro or threads, you can go with "linuxthreads.old" (or
960 no thread support). For Coro, it is highly recommended to switch to a
961 uClibc newer than 0.9.31 (at the time of this writing, I used the 20101201
962 snapshot) and enable NPTL, otherwise Coro needs to be configured with the
963 ultra-slow pthreads backend to work around linuxthreads bugs (it also uses
964 twice the address space needed for stacks).
965
966 If you use C<linuxthreads.old>, then you should also be aware that
967 uClibc shares C<errno> between all threads when statically linking. See
968 L<http://lists.uclibc.org/pipermail/uclibc/2010-June/044157.html> for a
969 workaround (And L<https://bugs.uclibc.org/2089> for discussion).
970
971 C<ccache> support is also recommended, especially if you want
972 to play around with buildroot options. Enabling the C<miniperl>
973 package will probably enable all options required for a successful
974 perl build. F<staticperl> itself additionally needs either C<wget>
975 (recommended, for CPAN) or C<curl>.
976
977 As for shells, busybox should provide all that is needed, but the default
978 busybox configuration doesn't include F<comm> which is needed by perl -
979 either make a custom busybox config, or compile coreutils.
980
981 For the latter route, you might find that bash has some bugs that keep
982 it from working properly in a chroot - either use dash (and link it to
983 F</bin/sh> inside the chroot) or link busybox to F</bin/sh>, using it's
984 built-in ash shell.
985
986 Finally, you need F</dev/null> inside the chroot for many scripts to work
987 - F<cp /dev/null output/target/dev> or bind-mounting your F</dev> will
988 both provide this.
989
990 After you have compiled and set up your buildroot target, you can copy
991 F<staticperl> from the C<App::Staticperl> distribution or from your
992 perl f<bin> directory (if you installed it) into the F<output/target>
993 filesystem, chroot inside and run it.
994
995 =head1 RECIPES / SPECIFIC MODULES
996
997 This section contains some common(?) recipes and information about
998 problems with some common modules or perl constructs that require extra
999 files to be included.
1000
1001 =head2 MODULES
1002
1003 =over 4
1004
1005 =item utf8
1006
1007 Some functionality in the utf8 module, such as swash handling (used
1008 for unicode character ranges in regexes) is implemented in the
1009 C<"utf8_heavy.pl"> library:
1010
1011 -M'"utf8_heavy.pl"'
1012
1013 Many Unicode properties in turn are defined in separate modules,
1014 such as C<"unicore/Heavy.pl"> and more specific data tables such as
1015 C<"unicore/To/Digit.pl"> or C<"unicore/lib/Perl/Word.pl">. These tables
1016 are big (7MB uncompressed, although F<staticperl> contains special
1017 handling for those files), so including them on demand by your application
1018 only might pay off.
1019
1020 To simply include the whole unicode database, use:
1021
1022 --incglob '/unicore/*.pl'
1023
1024 =item AnyEvent
1025
1026 AnyEvent needs a backend implementation that it will load in a delayed
1027 fashion. The L<AnyEvent::Impl::Perl> backend is the default choice
1028 for AnyEvent if it can't find anything else, and is usually a safe
1029 fallback. If you plan to use e.g. L<EV> (L<POE>...), then you need to
1030 include the L<AnyEvent::Impl::EV> (L<AnyEvent::Impl::POE>...) backend as
1031 well.
1032
1033 If you want to handle IRIs or IDNs (L<AnyEvent::Util> punycode and idn
1034 functions), you also need to include C<"AnyEvent/Util/idna.pl"> and
1035 C<"AnyEvent/Util/uts46data.pl">.
1036
1037 Or you can use C<--usepacklist> and specify C<-MAnyEvent> to include
1038 everything.
1039
1040 =item Carp
1041
1042 Carp had (in older versions of perl) a dependency on L<Carp::Heavy>. As of
1043 perl 5.12.2 (maybe earlier), this dependency no longer exists.
1044
1045 =item Config
1046
1047 The F<perl -V> switch (as well as many modules) needs L<Config>, which in
1048 turn might need L<"Config_heavy.pl">. Including the latter gives you
1049 both.
1050
1051 =item Term::ReadLine::Perl
1052
1053 Also needs L<Term::ReadLine::readline>, or C<--usepacklist>.
1054
1055 =item URI
1056
1057 URI implements schemes as separate modules - the generic URL scheme is
1058 implemented in L<URI::_generic>, HTTP is implemented in L<URI::http>. If
1059 you need to use any of these schemes, you should include these manually,
1060 or use C<--usepacklist>.
1061
1062 =back
1063
1064 =head2 RECIPES
1065
1066 =over 4
1067
1068 =item Linking everything in
1069
1070 To link just about everything installed in the perl library into a new
1071 perl, try this:
1072
1073 staticperl mkperl --strip ppi --incglob '*'
1074
1075 =item Getting rid of netdb function
1076
1077 The perl core has lots of netdb functions (C<getnetbyname>, C<getgrent>
1078 and so on) that few applications use. You can avoid compiling them in by
1079 putting the following fragment into a C<preconfigure> hook:
1080
1081 preconfigure() {
1082 for sym in \
1083 d_getgrnam_r d_endgrent d_endgrent_r d_endhent \
1084 d_endhostent_r d_endnent d_endnetent_r d_endpent \
1085 d_endprotoent_r d_endpwent d_endpwent_r d_endsent \
1086 d_endservent_r d_getgrent d_getgrent_r d_getgrgid_r \
1087 d_getgrnam_r d_gethbyaddr d_gethent d_getsbyport \
1088 d_gethostbyaddr_r d_gethostbyname_r d_gethostent_r \
1089 d_getlogin_r d_getnbyaddr d_getnbyname d_getnent \
1090 d_getnetbyaddr_r d_getnetbyname_r d_getnetent_r \
1091 d_getpent d_getpbyname d_getpbynumber d_getprotobyname_r \
1092 d_getprotobynumber_r d_getprotoent_r d_getpwent \
1093 d_getpwent_r d_getpwnam_r d_getpwuid_r d_getsent \
1094 d_getservbyname_r d_getservbyport_r d_getservent_r \
1095 d_getspnam_r d_getsbyname
1096 # d_gethbyname
1097 do
1098 PERL_CONFIGURE="$PERL_CONFIGURE -U$sym"
1099 done
1100 }
1101
1102 This mostly gains space when linking staticaly, as the functions will
1103 likely not be linked in. The gain for dynamically-linked binaries is
1104 smaller.
1105
1106 Also, this leaves C<gethostbyname> in - not only is it actually used
1107 often, the L<Socket> module also exposes it, so leaving it out usually
1108 gains little. Why Socket exposes a C function that is in the core already
1109 is anybody's guess.
1110
1111 =back
1112
1113 =head1 AUTHOR
1114
1115 Marc Lehmann <schmorp@schmorp.de>
1116 http://software.schmorp.de/pkg/staticperl.html