ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.18 by root, Fri May 27 00:01:28 2011 UTC vs.
Revision 1.45 by root, Tue May 29 14:35:43 2012 UTC

15It mainly provides a number of wrappers around GCC built-ins, together 15It mainly provides a number of wrappers around GCC built-ins, together
16with replacement functions for other compilers. In addition to this, 16with replacement functions for other compilers. In addition to this,
17it provides a number of other lowlevel C utilities, such as endianness 17it provides a number of other lowlevel C utilities, such as endianness
18detection, byte swapping or bit rotations. 18detection, byte swapping or bit rotations.
19 19
20Or in other words, things that should be built-in into any standard C 20Or in other words, things that should be built into any standard C system,
21system, but aren't. 21but aren't, implemented as efficient as possible with GCC, and still
22correct with other compilers.
22 23
23More might come. 24More might come.
24 25
25=head2 ABOUT THE HEADER 26=head2 ABOUT THE HEADER
26 27
53only a generic name is used (C<expr>, C<cond>, C<value> and so on), then 54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
54the corresponding function relies on C to implement the correct types, and 55the corresponding function relies on C to implement the correct types, and
55is usually implemented as a macro. Specifically, a "bool" in this manual 56is usually implemented as a macro. Specifically, a "bool" in this manual
56refers to any kind of boolean value, not a specific type. 57refers to any kind of boolean value, not a specific type.
57 58
59=head2 TYPES / TYPE SUPPORT
60
61ecb.h makes sure that the following types are defined (in the expected way):
62
63 int8_t uint8_t int16_t uint16_t
64 int32_t uint32_t int64_t uint64_t
65 intptr_t uintptr_t ptrdiff_t
66
67The macro C<ECB_PTRSIZE> is defined to the size of a pointer on this
68platform (currently C<4> or C<8>) and can be used in preprocessor
69expressions.
70
71=head2 LANGUAGE/COMPILER VERSIONS
72
73All the following symbols expand to an expressionb that cna be tested in
74preprocessor instructions as well as treated as a boolean (use C<!!> to
75ensure it's either C<0> or C<1> if you need that).
76
77=over 4
78
79=item ECB_C
80
81True if the implementation defines the C<__STDC__> macro to a true value,
82which is typically true for both C and C++ compilers.
83
84=item ECB_C99
85
86True if the implementation claims to be C99 compliant.
87
88=item ECB_C11
89
90True if the implementation claims to be C11 compliant.
91
92=item ECB_CPP
93
94True if the implementation defines the C<__cplusplus__> macro to a true
95value, which is typically true for C++ compilers.
96
97=item ECB_CPP98
98
99True if the implementation claims to be compliant to ISO/IEC 14882:1998
100(the first C++ ISO standard) or any later vwersion. Typically true for all
101C++ compilers.
102
103=item ECB_CPP11
104
105True if the implementation claims to be compliant to ISO/IEC 14882:2011
106(C++11) or any later vwersion.
107
108=item ECB_GCC_VERSION(major,minor)
109
110Expands to a true value (suitable for testing in by the preprocessor)
111if the compiler used is GNU C and the version is the givne version, or
112higher.
113
114This macro tries to return false on compilers that claim to be GCC
115compatible but aren't.
116
117=back
118
58=head2 GCC ATTRIBUTES 119=head2 GCC ATTRIBUTES
59 120
60blabla where to put, what others 121A major part of libecb deals with GCC attributes. These are additional
122attributes that you can assign to functions, variables and sometimes even
123types - much like C<const> or C<volatile> in C.
124
125While GCC allows declarations to show up in many surprising places,
126but not in many expected places, the safest way is to put attribute
127declarations before the whole declaration:
128
129 ecb_const int mysqrt (int a);
130 ecb_unused int i;
131
132For variables, it is often nicer to put the attribute after the name, and
133avoid multiple declarations using commas:
134
135 int i ecb_unused;
61 136
62=over 4 137=over 4
63 138
64=item ecb_attribute ((attrs...)) 139=item ecb_attribute ((attrs...))
65 140
86 #else 161 #else
87 return 0; 162 return 0;
88 #endif 163 #endif
89 } 164 }
90 165
166=item ecb_inline
167
168This is not actually an attribute, but you use it like one. It expands
169either to C<static inline> or to just C<static>, if inline isn't
170supported. It should be used to declare functions that should be inlined,
171for code size or speed reasons.
172
173Example: inline this function, it surely will reduce codesize.
174
175 ecb_inline int
176 negmul (int a, int b)
177 {
178 return - (a * b);
179 }
180
91=item ecb_noinline 181=item ecb_noinline
92 182
93Prevent a function from being inlined - it might be optimised away, but 183Prevent a function from being inlined - it might be optimised away, but
94not inlined into other functions. This is useful if you know your function 184not inlined into other functions. This is useful if you know your function
95is rarely called and large enough for inlining not to be helpful. 185is rarely called and large enough for inlining not to be helpful.
105 { 195 {
106 puts (errline); 196 puts (errline);
107 abort (); 197 abort ();
108 } 198 }
109 199
110In this case, the compiler would probbaly be smart enough to decude it on 200In this case, the compiler would probably be smart enough to deduce it on
111it's own, so this is mainly useful for declarations. 201its own, so this is mainly useful for declarations.
112 202
113=item ecb_const 203=item ecb_const
114 204
115Declares that the function only depends on the values of it's arguments, 205Declares that the function only depends on the values of its arguments,
116much like a mathematical function. It specifically does not read or write 206much like a mathematical function. It specifically does not read or write
117any memory any arguments might point to, global variables, or call any 207any memory any arguments might point to, global variables, or call any
118non-const functions. It also must not have any side effects. 208non-const functions. It also must not have any side effects.
119 209
120Such a function can be optimised much more aggressively by the compiler - 210Such a function can be optimised much more aggressively by the compiler -
121for example, multiple calls with the same arguments can be optimised into 211for example, multiple calls with the same arguments can be optimised into
122a single call, which wouldn't be possible if the compiler would have to 212a single call, which wouldn't be possible if the compiler would have to
123expect any side effects. 213expect any side effects.
124 214
125It is best suited for functions in the sense of mathematical functions, 215It is best suited for functions in the sense of mathematical functions,
126such as a function return the square root of its input argument. 216such as a function returning the square root of its input argument.
127 217
128Not suited would be a function that calculates the hash of some memory 218Not suited would be a function that calculates the hash of some memory
129area you pass in, prints some messages or looks at a global variable to 219area you pass in, prints some messages or looks at a global variable to
130decide on rounding. 220decide on rounding.
131 221
154possible. 244possible.
155 245
156The compiler reacts by trying to place hot functions near to each other in 246The compiler reacts by trying to place hot functions near to each other in
157memory. 247memory.
158 248
159Whether a function is hot or not often depend son the whole program, 249Whether a function is hot or not often depends on the whole program,
160and less on the function itself. C<ecb_cold> is likely more useful in 250and less on the function itself. C<ecb_cold> is likely more useful in
161practise. 251practise.
162 252
163=item ecb_cold 253=item ecb_cold
164 254
169 259
170In addition to placing cold functions together (or at least away from hot 260In addition to placing cold functions together (or at least away from hot
171functions), this knowledge can be used in other ways, for example, the 261functions), this knowledge can be used in other ways, for example, the
172function will be optimised for size, as opposed to speed, and codepaths 262function will be optimised for size, as opposed to speed, and codepaths
173leading to calls to those functions can automatically be marked as if 263leading to calls to those functions can automatically be marked as if
174C<ecb_unlikel> had been used to reach them. 264C<ecb_expect_false> had been used to reach them.
175 265
176Good examples for such functions would be error reporting functions, or 266Good examples for such functions would be error reporting functions, or
177functions only called in exceptional or rare cases. 267functions only called in exceptional or rare cases.
178 268
179=item ecb_artificial 269=item ecb_artificial
241 331
242Evaluates C<expr> and returns it. In addition, it tells the compiler that 332Evaluates C<expr> and returns it. In addition, it tells the compiler that
243the C<expr> evaluates to C<value> a lot, which can be used for static 333the C<expr> evaluates to C<value> a lot, which can be used for static
244branch optimisations. 334branch optimisations.
245 335
246Usually, you want to use the more intuitive C<ecb_likely> and 336Usually, you want to use the more intuitive C<ecb_expect_true> and
247C<ecb_unlikely> functions instead. 337C<ecb_expect_false> functions instead.
248 338
339=item bool ecb_expect_true (cond)
340
249=item bool ecb_likely (cond) 341=item bool ecb_expect_false (cond)
250
251=item bool ecb_unlikely (cond)
252 342
253These two functions expect a expression that is true or false and return 343These two functions expect a expression that is true or false and return
254C<1> or C<0>, respectively, so when used in the condition of an C<if> or 344C<1> or C<0>, respectively, so when used in the condition of an C<if> or
255other conditional statement, it will not change the program: 345other conditional statement, it will not change the program:
256 346
257 /* these two do the same thing */ 347 /* these two do the same thing */
258 if (some_condition) ...; 348 if (some_condition) ...;
259 if (ecb_likely (some_condition)) ...; 349 if (ecb_expect_true (some_condition)) ...;
260 350
261However, by using C<ecb_likely>, you tell the compiler that the condition 351However, by using C<ecb_expect_true>, you tell the compiler that the
262is likely to be true (and for C<ecb_unlikely>, that it is unlikely to be 352condition is likely to be true (and for C<ecb_expect_false>, that it is
263true). 353unlikely to be true).
264 354
265For example, when you check for a null pointer and expect this to be a 355For example, when you check for a null pointer and expect this to be a
266rare, exceptional, case, then use C<ecb_unlikely>: 356rare, exceptional, case, then use C<ecb_expect_false>:
267 357
268 void my_free (void *ptr) 358 void my_free (void *ptr)
269 { 359 {
270 if (ecb_unlikely (ptr == 0)) 360 if (ecb_expect_false (ptr == 0))
271 return; 361 return;
272 } 362 }
273 363
274Consequent use of these functions to mark away exceptional cases or to 364Consequent use of these functions to mark away exceptional cases or to
275tell the compiler what the hot path through a function is can increase 365tell the compiler what the hot path through a function is can increase
276performance considerably. 366performance considerably.
367
368You might know these functions under the name C<likely> and C<unlikely>
369- while these are common aliases, we find that the expect name is easier
370to understand when quickly skimming code. If you wish, you can use
371C<ecb_likely> instead of C<ecb_expect_true> and C<ecb_unlikely> instead of
372C<ecb_expect_false> - these are simply aliases.
277 373
278A very good example is in a function that reserves more space for some 374A very good example is in a function that reserves more space for some
279memory block (for example, inside an implementation of a string stream) - 375memory block (for example, inside an implementation of a string stream) -
280each time something is added, you have to check for a buffer overrun, but 376each time something is added, you have to check for a buffer overrun, but
281you expect that most checks will turn out to be false: 377you expect that most checks will turn out to be false:
282 378
283 /* make sure we have "size" extra room in our buffer */ 379 /* make sure we have "size" extra room in our buffer */
284 ecb_inline void 380 ecb_inline void
285 reserve (int size) 381 reserve (int size)
286 { 382 {
287 if (ecb_unlikely (current + size > end)) 383 if (ecb_expect_false (current + size > end))
288 real_reserve_method (size); /* presumably noinline */ 384 real_reserve_method (size); /* presumably noinline */
289 } 385 }
290 386
291=item bool ecb_assume (cond) 387=item bool ecb_assume (cond)
292 388
295 391
296This can be used to teach the compiler about invariants or other 392This can be used to teach the compiler about invariants or other
297conditions that might improve code generation, but which are impossible to 393conditions that might improve code generation, but which are impossible to
298deduce form the code itself. 394deduce form the code itself.
299 395
300For example, the example reservation function from the C<ecb_unlikely> 396For example, the example reservation function from the C<ecb_expect_false>
301description could be written thus (only C<ecb_assume> was added): 397description could be written thus (only C<ecb_assume> was added):
302 398
303 ecb_inline void 399 ecb_inline void
304 reserve (int size) 400 reserve (int size)
305 { 401 {
306 if (ecb_unlikely (current + size > end)) 402 if (ecb_expect_false (current + size > end))
307 real_reserve_method (size); /* presumably noinline */ 403 real_reserve_method (size); /* presumably noinline */
308 404
309 ecb_assume (current + size <= end); 405 ecb_assume (current + size <= end);
310 } 406 }
311 407
360After processing the node, (part of) the next node might already be in 456After processing the node, (part of) the next node might already be in
361cache. 457cache.
362 458
363=back 459=back
364 460
365=head2 BIT FIDDLING / BITSTUFFS 461=head2 BIT FIDDLING / BIT WIZARDRY
366 462
367=over 4 463=over 4
368 464
369=item bool ecb_big_endian () 465=item bool ecb_big_endian ()
370 466
372 468
373These two functions return true if the byte order is big endian 469These two functions return true if the byte order is big endian
374(most-significant byte first) or little endian (least-significant byte 470(most-significant byte first) or little endian (least-significant byte
375first) respectively. 471first) respectively.
376 472
473On systems that are neither, their return values are unspecified.
474
377=item int ecb_ctz32 (uint32_t x) 475=item int ecb_ctz32 (uint32_t x)
378 476
477=item int ecb_ctz64 (uint64_t x)
478
379Returns the index of the least significant bit set in C<x> (or 479Returns the index of the least significant bit set in C<x> (or
380equivalently the number of bits set to 0 before the least significant 480equivalently the number of bits set to 0 before the least significant bit
381bit set), starting from 0. If C<x> is 0 the result is undefined. A 481set), starting from 0. If C<x> is 0 the result is undefined.
382common use case is to compute the integer binary logarithm, i.e., 482
383floor(log2(n)). For example: 483For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>.
484
485For example:
384 486
385 ecb_ctz32 (3) = 0 487 ecb_ctz32 (3) = 0
386 ecb_ctz32 (6) = 1 488 ecb_ctz32 (6) = 1
387 489
490=item bool ecb_is_pot32 (uint32_t x)
491
492=item bool ecb_is_pot64 (uint32_t x)
493
494Return true iff C<x> is a power of two or C<x == 0>.
495
496For smaller types then C<uint32_t> you can safely use C<ecb_is_pot32>.
497
498=item int ecb_ld32 (uint32_t x)
499
500=item int ecb_ld64 (uint64_t x)
501
502Returns the index of the most significant bit set in C<x>, or the number
503of digits the number requires in binary (so that C<< 2**ld <= x <
5042**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is
505to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for
506example to see how many bits a certain number requires to be encoded.
507
508This function is similar to the "count leading zero bits" function, except
509that that one returns how many zero bits are "in front" of the number (in
510the given data type), while C<ecb_ld> returns how many bits the number
511itself requires.
512
513For smaller types than C<uint32_t> you can safely use C<ecb_ld32>.
514
388=item int ecb_popcount32 (uint32_t x) 515=item int ecb_popcount32 (uint32_t x)
389 516
517=item int ecb_popcount64 (uint64_t x)
518
390Returns the number of bits set to 1 in C<x>. For example: 519Returns the number of bits set to 1 in C<x>.
520
521For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>.
522
523For example:
391 524
392 ecb_popcount32 (7) = 3 525 ecb_popcount32 (7) = 3
393 ecb_popcount32 (255) = 8 526 ecb_popcount32 (255) = 8
394 527
528=item uint8_t ecb_bitrev8 (uint8_t x)
529
530=item uint16_t ecb_bitrev16 (uint16_t x)
531
532=item uint32_t ecb_bitrev32 (uint32_t x)
533
534Reverses the bits in x, i.e. the MSB becomes the LSB, MSB-1 becomes LSB+1
535and so on.
536
537Example:
538
539 ecb_bitrev8 (0xa7) = 0xea
540 ecb_bitrev32 (0xffcc4411) = 0x882233ff
541
395=item uint32_t ecb_bswap16 (uint32_t x) 542=item uint32_t ecb_bswap16 (uint32_t x)
396 543
397=item uint32_t ecb_bswap32 (uint32_t x) 544=item uint32_t ecb_bswap32 (uint32_t x)
398 545
546=item uint64_t ecb_bswap64 (uint64_t x)
547
399These two functions return the value of the 16-bit (32-bit) variable 548These functions return the value of the 16-bit (32-bit, 64-bit) value
400C<x> after reversing the order of bytes. 549C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in
550C<ecb_bswap32>).
551
552=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count)
553
554=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count)
555
556=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
557
558=item uint64_t ecb_rotl64 (uint64_t x, unsigned int count)
559
560=item uint8_t ecb_rotr8 (uint8_t x, unsigned int count)
561
562=item uint16_t ecb_rotr16 (uint16_t x, unsigned int count)
401 563
402=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count) 564=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count)
403 565
404=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count) 566=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count)
405 567
406These two functions return the value of C<x> after shifting all the bits 568These two families of functions return the value of C<x> after rotating
407by C<count> positions to the right or left respectively. 569all the bits by C<count> positions to the right (C<ecb_rotr>) or left
570(C<ecb_rotl>).
571
572Current GCC versions understand these functions and usually compile them
573to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on
574x86).
408 575
409=back 576=back
410 577
411=head2 ARITHMETIC 578=head2 ARITHMETIC
412 579
413=over 4 580=over 4
414 581
415=item x = ecb_mod (m, n) 582=item x = ecb_mod (m, n)
416 583
417Returns the positive remainder of the modulo operation between C<m> and 584Returns C<m> modulo C<n>, which is the same as the positive remainder
585of the division operation between C<m> and C<n>, using floored
418C<n>. Unlike the C modulo operator C<%>, this function ensures that the 586division. Unlike the C remainder operator C<%>, this function ensures that
419return value is always positive). 587the return value is always positive and that the two numbers I<m> and
588I<m' = m + i * n> result in the same value modulo I<n> - in other words,
589C<ecb_mod> implements the mathematical modulo operation, which is missing
590in the language.
420 591
421C<n> must be strictly positive (i.e. C<< >1 >>), while C<m> must be 592C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
422negatable, that is, both C<m> and C<-m> must be representable in its 593negatable, that is, both C<m> and C<-m> must be representable in its
423type. 594type (this typically excludes the minimum signed integer value, the same
595limitation as for C</> and C<%> in C).
596
597Current GCC versions compile this into an efficient branchless sequence on
598almost all CPUs.
599
600For example, when you want to rotate forward through the members of an
601array for increasing C<m> (which might be negative), then you should use
602C<ecb_mod>, as the C<%> operator might give either negative results, or
603change direction for negative values:
604
605 for (m = -100; m <= 100; ++m)
606 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))];
607
608=item x = ecb_div_rd (val, div)
609
610=item x = ecb_div_ru (val, div)
611
612Returns C<val> divided by C<div> rounded down or up, respectively.
613C<val> and C<div> must have integer types and C<div> must be strictly
614positive. Note that these functions are implemented with macros in C
615and with function templates in C++.
424 616
425=back 617=back
426 618
427=head2 UTILITY 619=head2 UTILITY
428 620
429=over 4 621=over 4
430 622
431=item element_count = ecb_array_length (name) [MACRO] 623=item element_count = ecb_array_length (name)
432 624
433Returns the number of elements in the array C<name>. For example: 625Returns the number of elements in the array C<name>. For example:
434 626
435 int primes[] = { 2, 3, 5, 7, 11 }; 627 int primes[] = { 2, 3, 5, 7, 11 };
436 int sum = 0; 628 int sum = 0;
438 for (i = 0; i < ecb_array_length (primes); i++) 630 for (i = 0; i < ecb_array_length (primes); i++)
439 sum += primes [i]; 631 sum += primes [i];
440 632
441=back 633=back
442 634
635=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
443 636
637These symbols need to be defined before including F<ecb.h> the first time.
638
639=over 4
640
641=item ECB_NO_THRADS
642
643If F<ecb.h> is never used from multiple threads, then this symbol can
644be defined, in which case memory fences (and similar constructs) are
645completely removed, leading to more efficient code and fewer dependencies.
646
647Setting this symbol to a true value implies C<ECB_NO_SMP>.
648
649=item ECB_NO_SMP
650
651The weaker version of C<ECB_NO_THREADS> - if F<ecb.h> is used from
652multiple threads, but never concurrently (e.g. if the system the program
653runs on has only a single CPU with a single core, no hyperthreading and so
654on), then this symbol can be defined, leading to more efficient code and
655fewer dependencies.
656
657=back
658
659

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines