ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.25 by root, Tue May 31 21:52:31 2011 UTC vs.
Revision 1.49 by root, Thu Jun 28 14:50:01 2012 UTC

54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then 54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
55the corresponding function relies on C to implement the correct types, and 55the corresponding function relies on C to implement the correct types, and
56is usually implemented as a macro. Specifically, a "bool" in this manual 56is usually implemented as a macro. Specifically, a "bool" in this manual
57refers to any kind of boolean value, not a specific type. 57refers to any kind of boolean value, not a specific type.
58 58
59=head2 TYPES / TYPE SUPPORT
60
61ecb.h makes sure that the following types are defined (in the expected way):
62
63 int8_t uint8_t int16_t uint16_t
64 int32_t uint32_t int64_t uint64_t
65 intptr_t uintptr_t
66
67The macro C<ECB_PTRSIZE> is defined to the size of a pointer on this
68platform (currently C<4> or C<8>) and can be used in preprocessor
69expressions.
70
71For C<ptrdiff_t> and C<size_t> use C<stddef.h>.
72
73=head2 LANGUAGE/COMPILER VERSIONS
74
75All the following symbols expand to an expression that can be tested in
76preprocessor instructions as well as treated as a boolean (use C<!!> to
77ensure it's either C<0> or C<1> if you need that).
78
79=over 4
80
81=item ECB_C
82
83True if the implementation defines the C<__STDC__> macro to a true value,
84which is typically true for both C and C++ compilers.
85
86=item ECB_C99
87
88True if the implementation claims to be compliant to C99 (ISO/IEC
899899:1999) or any later version.
90
91Note that later versions (ECB_C11) remove core features again (for
92example, variable length arrays).
93
94=item ECB_C11
95
96True if the implementation claims to be compliant to C11 (ISO/IEC
979899:2011) or any later version.
98
99=item ECB_CPP
100
101True if the implementation defines the C<__cplusplus__> macro to a true
102value, which is typically true for C++ compilers.
103
104=item ECB_CPP11
105
106True if the implementation claims to be compliant to ISO/IEC 14882:2011
107(C++11) or any later version.
108
109=item ECB_GCC_VERSION(major,minor)
110
111Expands to a true value (suitable for testing in by the preprocessor)
112if the compiler used is GNU C and the version is the given version, or
113higher.
114
115This macro tries to return false on compilers that claim to be GCC
116compatible but aren't.
117
118=back
119
59=head2 GCC ATTRIBUTES 120=head2 GCC ATTRIBUTES
60 121
61A major part of libecb deals with GCC attributes. These are additional 122A major part of libecb deals with GCC attributes. These are additional
62attributes that you cna assign to functions, variables and sometimes even 123attributes that you can assign to functions, variables and sometimes even
63types - much like C<const> or C<volatile> in C. 124types - much like C<const> or C<volatile> in C.
64 125
65While GCC allows declarations to show up in many surprising places, 126While GCC allows declarations to show up in many surprising places,
66but not in many expeted places, the safest way is to put attribute 127but not in many expected places, the safest way is to put attribute
67declarations before the whole declaration: 128declarations before the whole declaration:
68 129
69 ecb_const int mysqrt (int a); 130 ecb_const int mysqrt (int a);
70 ecb_unused int i; 131 ecb_unused int i;
71 132
101 #else 162 #else
102 return 0; 163 return 0;
103 #endif 164 #endif
104 } 165 }
105 166
167=item ecb_inline
168
169This is not actually an attribute, but you use it like one. It expands
170either to C<static inline> or to just C<static>, if inline isn't
171supported. It should be used to declare functions that should be inlined,
172for code size or speed reasons.
173
174Example: inline this function, it surely will reduce codesize.
175
176 ecb_inline int
177 negmul (int a, int b)
178 {
179 return - (a * b);
180 }
181
106=item ecb_noinline 182=item ecb_noinline
107 183
108Prevent a function from being inlined - it might be optimised away, but 184Prevent a function from being inlined - it might be optimised away, but
109not inlined into other functions. This is useful if you know your function 185not inlined into other functions. This is useful if you know your function
110is rarely called and large enough for inlining not to be helpful. 186is rarely called and large enough for inlining not to be helpful.
184 260
185In addition to placing cold functions together (or at least away from hot 261In addition to placing cold functions together (or at least away from hot
186functions), this knowledge can be used in other ways, for example, the 262functions), this knowledge can be used in other ways, for example, the
187function will be optimised for size, as opposed to speed, and codepaths 263function will be optimised for size, as opposed to speed, and codepaths
188leading to calls to those functions can automatically be marked as if 264leading to calls to those functions can automatically be marked as if
189C<ecb_unlikely> had been used to reach them. 265C<ecb_expect_false> had been used to reach them.
190 266
191Good examples for such functions would be error reporting functions, or 267Good examples for such functions would be error reporting functions, or
192functions only called in exceptional or rare cases. 268functions only called in exceptional or rare cases.
193 269
194=item ecb_artificial 270=item ecb_artificial
256 332
257Evaluates C<expr> and returns it. In addition, it tells the compiler that 333Evaluates C<expr> and returns it. In addition, it tells the compiler that
258the C<expr> evaluates to C<value> a lot, which can be used for static 334the C<expr> evaluates to C<value> a lot, which can be used for static
259branch optimisations. 335branch optimisations.
260 336
261Usually, you want to use the more intuitive C<ecb_likely> and 337Usually, you want to use the more intuitive C<ecb_expect_true> and
262C<ecb_unlikely> functions instead. 338C<ecb_expect_false> functions instead.
263 339
340=item bool ecb_expect_true (cond)
341
264=item bool ecb_likely (cond) 342=item bool ecb_expect_false (cond)
265
266=item bool ecb_unlikely (cond)
267 343
268These two functions expect a expression that is true or false and return 344These two functions expect a expression that is true or false and return
269C<1> or C<0>, respectively, so when used in the condition of an C<if> or 345C<1> or C<0>, respectively, so when used in the condition of an C<if> or
270other conditional statement, it will not change the program: 346other conditional statement, it will not change the program:
271 347
272 /* these two do the same thing */ 348 /* these two do the same thing */
273 if (some_condition) ...; 349 if (some_condition) ...;
274 if (ecb_likely (some_condition)) ...; 350 if (ecb_expect_true (some_condition)) ...;
275 351
276However, by using C<ecb_likely>, you tell the compiler that the condition 352However, by using C<ecb_expect_true>, you tell the compiler that the
277is likely to be true (and for C<ecb_unlikely>, that it is unlikely to be 353condition is likely to be true (and for C<ecb_expect_false>, that it is
278true). 354unlikely to be true).
279 355
280For example, when you check for a null pointer and expect this to be a 356For example, when you check for a null pointer and expect this to be a
281rare, exceptional, case, then use C<ecb_unlikely>: 357rare, exceptional, case, then use C<ecb_expect_false>:
282 358
283 void my_free (void *ptr) 359 void my_free (void *ptr)
284 { 360 {
285 if (ecb_unlikely (ptr == 0)) 361 if (ecb_expect_false (ptr == 0))
286 return; 362 return;
287 } 363 }
288 364
289Consequent use of these functions to mark away exceptional cases or to 365Consequent use of these functions to mark away exceptional cases or to
290tell the compiler what the hot path through a function is can increase 366tell the compiler what the hot path through a function is can increase
291performance considerably. 367performance considerably.
368
369You might know these functions under the name C<likely> and C<unlikely>
370- while these are common aliases, we find that the expect name is easier
371to understand when quickly skimming code. If you wish, you can use
372C<ecb_likely> instead of C<ecb_expect_true> and C<ecb_unlikely> instead of
373C<ecb_expect_false> - these are simply aliases.
292 374
293A very good example is in a function that reserves more space for some 375A very good example is in a function that reserves more space for some
294memory block (for example, inside an implementation of a string stream) - 376memory block (for example, inside an implementation of a string stream) -
295each time something is added, you have to check for a buffer overrun, but 377each time something is added, you have to check for a buffer overrun, but
296you expect that most checks will turn out to be false: 378you expect that most checks will turn out to be false:
297 379
298 /* make sure we have "size" extra room in our buffer */ 380 /* make sure we have "size" extra room in our buffer */
299 ecb_inline void 381 ecb_inline void
300 reserve (int size) 382 reserve (int size)
301 { 383 {
302 if (ecb_unlikely (current + size > end)) 384 if (ecb_expect_false (current + size > end))
303 real_reserve_method (size); /* presumably noinline */ 385 real_reserve_method (size); /* presumably noinline */
304 } 386 }
305 387
306=item bool ecb_assume (cond) 388=item bool ecb_assume (cond)
307 389
310 392
311This can be used to teach the compiler about invariants or other 393This can be used to teach the compiler about invariants or other
312conditions that might improve code generation, but which are impossible to 394conditions that might improve code generation, but which are impossible to
313deduce form the code itself. 395deduce form the code itself.
314 396
315For example, the example reservation function from the C<ecb_unlikely> 397For example, the example reservation function from the C<ecb_expect_false>
316description could be written thus (only C<ecb_assume> was added): 398description could be written thus (only C<ecb_assume> was added):
317 399
318 ecb_inline void 400 ecb_inline void
319 reserve (int size) 401 reserve (int size)
320 { 402 {
321 if (ecb_unlikely (current + size > end)) 403 if (ecb_expect_false (current + size > end))
322 real_reserve_method (size); /* presumably noinline */ 404 real_reserve_method (size); /* presumably noinline */
323 405
324 ecb_assume (current + size <= end); 406 ecb_assume (current + size <= end);
325 } 407 }
326 408
375After processing the node, (part of) the next node might already be in 457After processing the node, (part of) the next node might already be in
376cache. 458cache.
377 459
378=back 460=back
379 461
380=head2 BIT FIDDLING / BITSTUFFS 462=head2 BIT FIDDLING / BIT WIZARDRY
381 463
382=over 4 464=over 4
383 465
384=item bool ecb_big_endian () 466=item bool ecb_big_endian ()
385 467
391 473
392On systems that are neither, their return values are unspecified. 474On systems that are neither, their return values are unspecified.
393 475
394=item int ecb_ctz32 (uint32_t x) 476=item int ecb_ctz32 (uint32_t x)
395 477
478=item int ecb_ctz64 (uint64_t x)
479
396Returns the index of the least significant bit set in C<x> (or 480Returns the index of the least significant bit set in C<x> (or
397equivalently the number of bits set to 0 before the least significant bit 481equivalently the number of bits set to 0 before the least significant bit
398set), starting from 0. If C<x> is 0 the result is undefined. A common use 482set), starting from 0. If C<x> is 0 the result is undefined.
399case is to compute the integer binary logarithm, i.e., C<floor (log2 483
484For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>.
485
400(n))>. For example: 486For example:
401 487
402 ecb_ctz32 (3) = 0 488 ecb_ctz32 (3) = 0
403 ecb_ctz32 (6) = 1 489 ecb_ctz32 (6) = 1
404 490
491=item bool ecb_is_pot32 (uint32_t x)
492
493=item bool ecb_is_pot64 (uint32_t x)
494
495Return true iff C<x> is a power of two or C<x == 0>.
496
497For smaller types then C<uint32_t> you can safely use C<ecb_is_pot32>.
498
499=item int ecb_ld32 (uint32_t x)
500
501=item int ecb_ld64 (uint64_t x)
502
503Returns the index of the most significant bit set in C<x>, or the number
504of digits the number requires in binary (so that C<< 2**ld <= x <
5052**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is
506to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for
507example to see how many bits a certain number requires to be encoded.
508
509This function is similar to the "count leading zero bits" function, except
510that that one returns how many zero bits are "in front" of the number (in
511the given data type), while C<ecb_ld> returns how many bits the number
512itself requires.
513
514For smaller types than C<uint32_t> you can safely use C<ecb_ld32>.
515
405=item int ecb_popcount32 (uint32_t x) 516=item int ecb_popcount32 (uint32_t x)
406 517
518=item int ecb_popcount64 (uint64_t x)
519
407Returns the number of bits set to 1 in C<x>. For example: 520Returns the number of bits set to 1 in C<x>.
521
522For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>.
523
524For example:
408 525
409 ecb_popcount32 (7) = 3 526 ecb_popcount32 (7) = 3
410 ecb_popcount32 (255) = 8 527 ecb_popcount32 (255) = 8
411 528
529=item uint8_t ecb_bitrev8 (uint8_t x)
530
531=item uint16_t ecb_bitrev16 (uint16_t x)
532
533=item uint32_t ecb_bitrev32 (uint32_t x)
534
535Reverses the bits in x, i.e. the MSB becomes the LSB, MSB-1 becomes LSB+1
536and so on.
537
538Example:
539
540 ecb_bitrev8 (0xa7) = 0xea
541 ecb_bitrev32 (0xffcc4411) = 0x882233ff
542
412=item uint32_t ecb_bswap16 (uint32_t x) 543=item uint32_t ecb_bswap16 (uint32_t x)
413 544
414=item uint32_t ecb_bswap32 (uint32_t x) 545=item uint32_t ecb_bswap32 (uint32_t x)
415 546
547=item uint64_t ecb_bswap64 (uint64_t x)
548
416These two functions return the value of the 16-bit (32-bit) value C<x> 549These functions return the value of the 16-bit (32-bit, 64-bit) value
417after reversing the order of bytes (0x11223344 becomes 0x44332211). 550C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in
551C<ecb_bswap32>).
552
553=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count)
554
555=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count)
556
557=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
558
559=item uint64_t ecb_rotl64 (uint64_t x, unsigned int count)
560
561=item uint8_t ecb_rotr8 (uint8_t x, unsigned int count)
562
563=item uint16_t ecb_rotr16 (uint16_t x, unsigned int count)
418 564
419=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count) 565=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count)
420 566
421=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count) 567=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count)
422 568
423These two functions return the value of C<x> after rotating all the bits 569These two families of functions return the value of C<x> after rotating
424by C<count> positions to the right or left respectively. 570all the bits by C<count> positions to the right (C<ecb_rotr>) or left
571(C<ecb_rotl>).
425 572
426Current GCC versions understand these functions and usually compile them 573Current GCC versions understand these functions and usually compile them
427to "optimal" code (e.g. a single C<roll> on x86). 574to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on
575x86).
428 576
429=back 577=back
430 578
431=head2 ARITHMETIC 579=head2 ARITHMETIC
432 580
442C<ecb_mod> implements the mathematical modulo operation, which is missing 590C<ecb_mod> implements the mathematical modulo operation, which is missing
443in the language. 591in the language.
444 592
445C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be 593C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
446negatable, that is, both C<m> and C<-m> must be representable in its 594negatable, that is, both C<m> and C<-m> must be representable in its
447type (this typically includes the minimum signed integer value, the same 595type (this typically excludes the minimum signed integer value, the same
448limitation as for C</> and C<%> in C). 596limitation as for C</> and C<%> in C).
449 597
450Current GCC versions compile this into an efficient branchless sequence on 598Current GCC versions compile this into an efficient branchless sequence on
451many systems. 599almost all CPUs.
452 600
453For example, when you want to rotate forward through the members of an 601For example, when you want to rotate forward through the members of an
454array for increasing C<m> (which might be negative), then you should use 602array for increasing C<m> (which might be negative), then you should use
455C<ecb_mod>, as the C<%> operator might give either negative results, or 603C<ecb_mod>, as the C<%> operator might give either negative results, or
456change direction for negative values: 604change direction for negative values:
457 605
458 for (m = -100; m <= 100; ++m) 606 for (m = -100; m <= 100; ++m)
459 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))]; 607 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))];
460 608
609=item x = ecb_div_rd (val, div)
610
611=item x = ecb_div_ru (val, div)
612
613Returns C<val> divided by C<div> rounded down or up, respectively.
614C<val> and C<div> must have integer types and C<div> must be strictly
615positive. Note that these functions are implemented with macros in C
616and with function templates in C++.
617
461=back 618=back
462 619
463=head2 UTILITY 620=head2 UTILITY
464 621
465=over 4 622=over 4
474 for (i = 0; i < ecb_array_length (primes); i++) 631 for (i = 0; i < ecb_array_length (primes); i++)
475 sum += primes [i]; 632 sum += primes [i];
476 633
477=back 634=back
478 635
636=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
479 637
638These symbols need to be defined before including F<ecb.h> the first time.
639
640=over 4
641
642=item ECB_NO_THRADS
643
644If F<ecb.h> is never used from multiple threads, then this symbol can
645be defined, in which case memory fences (and similar constructs) are
646completely removed, leading to more efficient code and fewer dependencies.
647
648Setting this symbol to a true value implies C<ECB_NO_SMP>.
649
650=item ECB_NO_SMP
651
652The weaker version of C<ECB_NO_THREADS> - if F<ecb.h> is used from
653multiple threads, but never concurrently (e.g. if the system the program
654runs on has only a single CPU with a single core, no hyperthreading and so
655on), then this symbol can be defined, leading to more efficient code and
656fewer dependencies.
657
658=back
659
660

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines