ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.24 by root, Tue May 31 21:10:37 2011 UTC vs.
Revision 1.43 by root, Tue May 29 14:09:49 2012 UTC

54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then 54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
55the corresponding function relies on C to implement the correct types, and 55the corresponding function relies on C to implement the correct types, and
56is usually implemented as a macro. Specifically, a "bool" in this manual 56is usually implemented as a macro. Specifically, a "bool" in this manual
57refers to any kind of boolean value, not a specific type. 57refers to any kind of boolean value, not a specific type.
58 58
59=head2 TYPES / TYPE SUPPORT
60
61ecb.h makes sure that the following types are defined (in the expected way):
62
63 int8_t uint8_t int16_t uint16_t
64 int32_t uint32_t int64_t uint64_t
65 intptr_t uintptr_t ptrdiff_t
66
67The macro C<ECB_PTRSIZE> is defined to the size of a pointer on this
68platform (currently C<4> or C<8>).
69
70=head2 LANGUAGE/COMPILER VERSIONS
71
72=over 4
73
74=item ECB_C99
75
76Expands to a true value (suitable for testing in by the preprocessor)
77if the environment claims to be C99 compliant.
78
79=item ECB_C11
80
81Expands to a true value (suitable for testing in by the preprocessor)
82if the environment claims to be C11 compliant.
83
84=item ECB_GCC_VERSION(major,minor)
85
86Expands to a true value (suitable for testing in by the preprocessor)
87if the compiler used is GNU C and the version is the givne version, or
88higher.
89
90This macro tries to return false on compilers that claim to be GCC
91compatible but aren't.
92
93=back
94
59=head2 GCC ATTRIBUTES 95=head2 GCC ATTRIBUTES
60 96
61A major part of libecb deals with GCC attributes. These are additional 97A major part of libecb deals with GCC attributes. These are additional
62attributes that you cna assign to functions, variables and sometimes even 98attributes that you can assign to functions, variables and sometimes even
63types - much like C<const> or C<volatile> in C. 99types - much like C<const> or C<volatile> in C.
64 100
65While GCC allows declarations to show up in many surprising places, 101While GCC allows declarations to show up in many surprising places,
66but not in many expeted places, the safest way is to put attribute 102but not in many expected places, the safest way is to put attribute
67declarations before the whole declaration: 103declarations before the whole declaration:
68 104
69 ecb_const int mysqrt (int a); 105 ecb_const int mysqrt (int a);
70 ecb_unused int i; 106 ecb_unused int i;
71 107
101 #else 137 #else
102 return 0; 138 return 0;
103 #endif 139 #endif
104 } 140 }
105 141
142=item ecb_inline
143
144This is not actually an attribute, but you use it like one. It expands
145either to C<static inline> or to just C<static>, if inline isn't
146supported. It should be used to declare functions that should be inlined,
147for code size or speed reasons.
148
149Example: inline this function, it surely will reduce codesize.
150
151 ecb_inline int
152 negmul (int a, int b)
153 {
154 return - (a * b);
155 }
156
106=item ecb_noinline 157=item ecb_noinline
107 158
108Prevent a function from being inlined - it might be optimised away, but 159Prevent a function from being inlined - it might be optimised away, but
109not inlined into other functions. This is useful if you know your function 160not inlined into other functions. This is useful if you know your function
110is rarely called and large enough for inlining not to be helpful. 161is rarely called and large enough for inlining not to be helpful.
184 235
185In addition to placing cold functions together (or at least away from hot 236In addition to placing cold functions together (or at least away from hot
186functions), this knowledge can be used in other ways, for example, the 237functions), this knowledge can be used in other ways, for example, the
187function will be optimised for size, as opposed to speed, and codepaths 238function will be optimised for size, as opposed to speed, and codepaths
188leading to calls to those functions can automatically be marked as if 239leading to calls to those functions can automatically be marked as if
189C<ecb_unlikely> had been used to reach them. 240C<ecb_expect_false> had been used to reach them.
190 241
191Good examples for such functions would be error reporting functions, or 242Good examples for such functions would be error reporting functions, or
192functions only called in exceptional or rare cases. 243functions only called in exceptional or rare cases.
193 244
194=item ecb_artificial 245=item ecb_artificial
256 307
257Evaluates C<expr> and returns it. In addition, it tells the compiler that 308Evaluates C<expr> and returns it. In addition, it tells the compiler that
258the C<expr> evaluates to C<value> a lot, which can be used for static 309the C<expr> evaluates to C<value> a lot, which can be used for static
259branch optimisations. 310branch optimisations.
260 311
261Usually, you want to use the more intuitive C<ecb_likely> and 312Usually, you want to use the more intuitive C<ecb_expect_true> and
262C<ecb_unlikely> functions instead. 313C<ecb_expect_false> functions instead.
263 314
315=item bool ecb_expect_true (cond)
316
264=item bool ecb_likely (cond) 317=item bool ecb_expect_false (cond)
265
266=item bool ecb_unlikely (cond)
267 318
268These two functions expect a expression that is true or false and return 319These two functions expect a expression that is true or false and return
269C<1> or C<0>, respectively, so when used in the condition of an C<if> or 320C<1> or C<0>, respectively, so when used in the condition of an C<if> or
270other conditional statement, it will not change the program: 321other conditional statement, it will not change the program:
271 322
272 /* these two do the same thing */ 323 /* these two do the same thing */
273 if (some_condition) ...; 324 if (some_condition) ...;
274 if (ecb_likely (some_condition)) ...; 325 if (ecb_expect_true (some_condition)) ...;
275 326
276However, by using C<ecb_likely>, you tell the compiler that the condition 327However, by using C<ecb_expect_true>, you tell the compiler that the
277is likely to be true (and for C<ecb_unlikely>, that it is unlikely to be 328condition is likely to be true (and for C<ecb_expect_false>, that it is
278true). 329unlikely to be true).
279 330
280For example, when you check for a null pointer and expect this to be a 331For example, when you check for a null pointer and expect this to be a
281rare, exceptional, case, then use C<ecb_unlikely>: 332rare, exceptional, case, then use C<ecb_expect_false>:
282 333
283 void my_free (void *ptr) 334 void my_free (void *ptr)
284 { 335 {
285 if (ecb_unlikely (ptr == 0)) 336 if (ecb_expect_false (ptr == 0))
286 return; 337 return;
287 } 338 }
288 339
289Consequent use of these functions to mark away exceptional cases or to 340Consequent use of these functions to mark away exceptional cases or to
290tell the compiler what the hot path through a function is can increase 341tell the compiler what the hot path through a function is can increase
291performance considerably. 342performance considerably.
343
344You might know these functions under the name C<likely> and C<unlikely>
345- while these are common aliases, we find that the expect name is easier
346to understand when quickly skimming code. If you wish, you can use
347C<ecb_likely> instead of C<ecb_expect_true> and C<ecb_unlikely> instead of
348C<ecb_expect_false> - these are simply aliases.
292 349
293A very good example is in a function that reserves more space for some 350A very good example is in a function that reserves more space for some
294memory block (for example, inside an implementation of a string stream) - 351memory block (for example, inside an implementation of a string stream) -
295each time something is added, you have to check for a buffer overrun, but 352each time something is added, you have to check for a buffer overrun, but
296you expect that most checks will turn out to be false: 353you expect that most checks will turn out to be false:
297 354
298 /* make sure we have "size" extra room in our buffer */ 355 /* make sure we have "size" extra room in our buffer */
299 ecb_inline void 356 ecb_inline void
300 reserve (int size) 357 reserve (int size)
301 { 358 {
302 if (ecb_unlikely (current + size > end)) 359 if (ecb_expect_false (current + size > end))
303 real_reserve_method (size); /* presumably noinline */ 360 real_reserve_method (size); /* presumably noinline */
304 } 361 }
305 362
306=item bool ecb_assume (cond) 363=item bool ecb_assume (cond)
307 364
310 367
311This can be used to teach the compiler about invariants or other 368This can be used to teach the compiler about invariants or other
312conditions that might improve code generation, but which are impossible to 369conditions that might improve code generation, but which are impossible to
313deduce form the code itself. 370deduce form the code itself.
314 371
315For example, the example reservation function from the C<ecb_unlikely> 372For example, the example reservation function from the C<ecb_expect_false>
316description could be written thus (only C<ecb_assume> was added): 373description could be written thus (only C<ecb_assume> was added):
317 374
318 ecb_inline void 375 ecb_inline void
319 reserve (int size) 376 reserve (int size)
320 { 377 {
321 if (ecb_unlikely (current + size > end)) 378 if (ecb_expect_false (current + size > end))
322 real_reserve_method (size); /* presumably noinline */ 379 real_reserve_method (size); /* presumably noinline */
323 380
324 ecb_assume (current + size <= end); 381 ecb_assume (current + size <= end);
325 } 382 }
326 383
375After processing the node, (part of) the next node might already be in 432After processing the node, (part of) the next node might already be in
376cache. 433cache.
377 434
378=back 435=back
379 436
380=head2 BIT FIDDLING / BITSTUFFS 437=head2 BIT FIDDLING / BIT WIZARDRY
381 438
382=over 4 439=over 4
383 440
384=item bool ecb_big_endian () 441=item bool ecb_big_endian ()
385 442
391 448
392On systems that are neither, their return values are unspecified. 449On systems that are neither, their return values are unspecified.
393 450
394=item int ecb_ctz32 (uint32_t x) 451=item int ecb_ctz32 (uint32_t x)
395 452
453=item int ecb_ctz64 (uint64_t x)
454
396Returns the index of the least significant bit set in C<x> (or 455Returns the index of the least significant bit set in C<x> (or
397equivalently the number of bits set to 0 before the least significant bit 456equivalently the number of bits set to 0 before the least significant bit
398set), starting from 0. If C<x> is 0 the result is undefined. A common use 457set), starting from 0. If C<x> is 0 the result is undefined.
399case is to compute the integer binary logarithm, i.e., C<floor (log2 458
459For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>.
460
400(n))>. For example: 461For example:
401 462
402 ecb_ctz32 (3) = 0 463 ecb_ctz32 (3) = 0
403 ecb_ctz32 (6) = 1 464 ecb_ctz32 (6) = 1
404 465
466=item bool ecb_is_pot32 (uint32_t x)
467
468=item bool ecb_is_pot64 (uint32_t x)
469
470Return true iff C<x> is a power of two or C<x == 0>.
471
472For smaller types then C<uint32_t> you can safely use C<ecb_is_pot32>.
473
474=item int ecb_ld32 (uint32_t x)
475
476=item int ecb_ld64 (uint64_t x)
477
478Returns the index of the most significant bit set in C<x>, or the number
479of digits the number requires in binary (so that C<< 2**ld <= x <
4802**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is
481to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for
482example to see how many bits a certain number requires to be encoded.
483
484This function is similar to the "count leading zero bits" function, except
485that that one returns how many zero bits are "in front" of the number (in
486the given data type), while C<ecb_ld> returns how many bits the number
487itself requires.
488
489For smaller types than C<uint32_t> you can safely use C<ecb_ld32>.
490
405=item int ecb_popcount32 (uint32_t x) 491=item int ecb_popcount32 (uint32_t x)
406 492
493=item int ecb_popcount64 (uint64_t x)
494
407Returns the number of bits set to 1 in C<x>. For example: 495Returns the number of bits set to 1 in C<x>.
496
497For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>.
498
499For example:
408 500
409 ecb_popcount32 (7) = 3 501 ecb_popcount32 (7) = 3
410 ecb_popcount32 (255) = 8 502 ecb_popcount32 (255) = 8
411 503
504=item uint8_t ecb_bitrev8 (uint8_t x)
505
506=item uint16_t ecb_bitrev16 (uint16_t x)
507
508=item uint32_t ecb_bitrev32 (uint32_t x)
509
510Reverses the bits in x, i.e. the MSB becomes the LSB, MSB-1 becomes LSB+1
511and so on.
512
513Example:
514
515 ecb_bitrev8 (0xa7) = 0xea
516 ecb_bitrev32 (0xffcc4411) = 0x882233ff
517
412=item uint32_t ecb_bswap16 (uint32_t x) 518=item uint32_t ecb_bswap16 (uint32_t x)
413 519
414=item uint32_t ecb_bswap32 (uint32_t x) 520=item uint32_t ecb_bswap32 (uint32_t x)
415 521
522=item uint64_t ecb_bswap64 (uint64_t x)
523
416These two functions return the value of the 16-bit (32-bit) value C<x> 524These functions return the value of the 16-bit (32-bit, 64-bit) value
417after reversing the order of bytes (0x11223344 becomes 0x44332211). 525C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in
526C<ecb_bswap32>).
527
528=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count)
529
530=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count)
531
532=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
533
534=item uint64_t ecb_rotl64 (uint64_t x, unsigned int count)
535
536=item uint8_t ecb_rotr8 (uint8_t x, unsigned int count)
537
538=item uint16_t ecb_rotr16 (uint16_t x, unsigned int count)
418 539
419=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count) 540=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count)
420 541
421=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count) 542=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count)
422 543
423These two functions return the value of C<x> after rotating all the bits 544These two families of functions return the value of C<x> after rotating
424by C<count> positions to the right or left respectively. 545all the bits by C<count> positions to the right (C<ecb_rotr>) or left
546(C<ecb_rotl>).
425 547
426Current GCC versions understand these functions and usually compile them 548Current GCC versions understand these functions and usually compile them
427to "optimal" code (e.g. a single C<roll> on x86). 549to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on
550x86).
428 551
429=back 552=back
430 553
431=head2 ARITHMETIC 554=head2 ARITHMETIC
432 555
433=over 4 556=over 4
434 557
435=item x = ecb_mod (m, n) 558=item x = ecb_mod (m, n)
436 559
437Returns the positive remainder of the modulo operation between C<m> and 560Returns C<m> modulo C<n>, which is the same as the positive remainder
438C<n>, using floored division. Unlike the C modulo operator C<%>, this 561of the division operation between C<m> and C<n>, using floored
439function ensures that the return value is always positive and that the two 562division. Unlike the C remainder operator C<%>, this function ensures that
563the return value is always positive and that the two numbers I<m> and
440numbers I<m> and I<m' = m + i * n> result in the same value modulo I<n> - 564I<m' = m + i * n> result in the same value modulo I<n> - in other words,
441the C<%> operator usually has a behaviour change at C<m = 0>. 565C<ecb_mod> implements the mathematical modulo operation, which is missing
566in the language.
442 567
443C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be 568C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
444negatable, that is, both C<m> and C<-m> must be representable in its 569negatable, that is, both C<m> and C<-m> must be representable in its
445type. 570type (this typically excludes the minimum signed integer value, the same
571limitation as for C</> and C<%> in C).
446 572
447Current GCC versions compile this into an efficient branchless sequence on 573Current GCC versions compile this into an efficient branchless sequence on
448many systems. 574almost all CPUs.
449 575
450For example, when you want to rotate forward through the members of an 576For example, when you want to rotate forward through the members of an
451array for increasing C<m> (which might be negative), then you should use 577array for increasing C<m> (which might be negative), then you should use
452C<ecb_mod>, as the C<%> operator might give either negative results, or 578C<ecb_mod>, as the C<%> operator might give either negative results, or
453change direction for negative values: 579change direction for negative values:
454 580
455 for (m = -100; m <= 100; ++m) 581 for (m = -100; m <= 100; ++m)
456 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))]; 582 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))];
457 583
584=item x = ecb_div_rd (val, div)
585
586=item x = ecb_div_ru (val, div)
587
588Returns C<val> divided by C<div> rounded down or up, respectively.
589C<val> and C<div> must have integer types and C<div> must be strictly
590positive. Note that these functions are implemented with macros in C
591and with function templates in C++.
592
458=back 593=back
459 594
460=head2 UTILITY 595=head2 UTILITY
461 596
462=over 4 597=over 4
471 for (i = 0; i < ecb_array_length (primes); i++) 606 for (i = 0; i < ecb_array_length (primes); i++)
472 sum += primes [i]; 607 sum += primes [i];
473 608
474=back 609=back
475 610
611=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
476 612
613These symbols need to be defined before including F<ecb.h> the first time.
614
615=over 4
616
617=item ECB_NO_THRADS
618
619If F<ecb.h> is never used from multiple threads, then this symbol can
620be defined, in which case memory fences (and similar constructs) are
621completely removed, leading to more efficient code and fewer dependencies.
622
623Setting this symbol to a true value implies C<ECB_NO_SMP>.
624
625=item ECB_NO_SMP
626
627The weaker version of C<ECB_NO_THREADS> - if F<ecb.h> is used from
628multiple threads, but never concurrently (e.g. if the system the program
629runs on has only a single CPU with a single core, no hyperthreading and so
630on), then this symbol can be defined, leading to more efficient code and
631fewer dependencies.
632
633=back
634
635

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines