ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.14 by root, Thu May 26 23:23:08 2011 UTC vs.
Revision 1.45 by root, Tue May 29 14:35:43 2012 UTC

3=head2 ABOUT LIBECB 3=head2 ABOUT LIBECB
4 4
5Libecb is currently a simple header file that doesn't require any 5Libecb is currently a simple header file that doesn't require any
6configuration to use or include in your project. 6configuration to use or include in your project.
7 7
8It's part of the e-suite of libraries, other memembers of which include 8It's part of the e-suite of libraries, other members of which include
9libev and libeio. 9libev and libeio.
10 10
11Its homepage can be found here: 11Its homepage can be found here:
12 12
13 http://software.schmorp.de/pkg/libecb 13 http://software.schmorp.de/pkg/libecb
14 14
15It mainly provides a number of wrappers around GCC built-ins, together 15It mainly provides a number of wrappers around GCC built-ins, together
16with replacement functions for other compilers. In addition to this, 16with replacement functions for other compilers. In addition to this,
17it provides a number of other lowlevel C utilities, such endienness 17it provides a number of other lowlevel C utilities, such as endianness
18detection, byte swapping or bit rotations. 18detection, byte swapping or bit rotations.
19
20Or in other words, things that should be built into any standard C system,
21but aren't, implemented as efficient as possible with GCC, and still
22correct with other compilers.
19 23
20More might come. 24More might come.
21 25
22=head2 ABOUT THE HEADER 26=head2 ABOUT THE HEADER
23 27
27 #include <ecb.h> 31 #include <ecb.h>
28 32
29The header should work fine for both C and C++ compilation, and gives you 33The header should work fine for both C and C++ compilation, and gives you
30all of F<inttypes.h> in addition to the ECB symbols. 34all of F<inttypes.h> in addition to the ECB symbols.
31 35
32There are currently no objetc files to link to - future versions might 36There are currently no object files to link to - future versions might
33come with an (optional) object code library to link against, to reduce 37come with an (optional) object code library to link against, to reduce
34code size or gain access to additional features. 38code size or gain access to additional features.
35 39
36It also currently includes everything from F<inttypes.h>. 40It also currently includes everything from F<inttypes.h>.
37 41
50only a generic name is used (C<expr>, C<cond>, C<value> and so on), then 54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
51the corresponding function relies on C to implement the correct types, and 55the corresponding function relies on C to implement the correct types, and
52is usually implemented as a macro. Specifically, a "bool" in this manual 56is usually implemented as a macro. Specifically, a "bool" in this manual
53refers to any kind of boolean value, not a specific type. 57refers to any kind of boolean value, not a specific type.
54 58
59=head2 TYPES / TYPE SUPPORT
60
61ecb.h makes sure that the following types are defined (in the expected way):
62
63 int8_t uint8_t int16_t uint16_t
64 int32_t uint32_t int64_t uint64_t
65 intptr_t uintptr_t ptrdiff_t
66
67The macro C<ECB_PTRSIZE> is defined to the size of a pointer on this
68platform (currently C<4> or C<8>) and can be used in preprocessor
69expressions.
70
71=head2 LANGUAGE/COMPILER VERSIONS
72
73All the following symbols expand to an expressionb that cna be tested in
74preprocessor instructions as well as treated as a boolean (use C<!!> to
75ensure it's either C<0> or C<1> if you need that).
76
77=over 4
78
79=item ECB_C
80
81True if the implementation defines the C<__STDC__> macro to a true value,
82which is typically true for both C and C++ compilers.
83
84=item ECB_C99
85
86True if the implementation claims to be C99 compliant.
87
88=item ECB_C11
89
90True if the implementation claims to be C11 compliant.
91
92=item ECB_CPP
93
94True if the implementation defines the C<__cplusplus__> macro to a true
95value, which is typically true for C++ compilers.
96
97=item ECB_CPP98
98
99True if the implementation claims to be compliant to ISO/IEC 14882:1998
100(the first C++ ISO standard) or any later vwersion. Typically true for all
101C++ compilers.
102
103=item ECB_CPP11
104
105True if the implementation claims to be compliant to ISO/IEC 14882:2011
106(C++11) or any later vwersion.
107
108=item ECB_GCC_VERSION(major,minor)
109
110Expands to a true value (suitable for testing in by the preprocessor)
111if the compiler used is GNU C and the version is the givne version, or
112higher.
113
114This macro tries to return false on compilers that claim to be GCC
115compatible but aren't.
116
117=back
118
55=head2 GCC ATTRIBUTES 119=head2 GCC ATTRIBUTES
56 120
57blabla where to put, what others 121A major part of libecb deals with GCC attributes. These are additional
122attributes that you can assign to functions, variables and sometimes even
123types - much like C<const> or C<volatile> in C.
124
125While GCC allows declarations to show up in many surprising places,
126but not in many expected places, the safest way is to put attribute
127declarations before the whole declaration:
128
129 ecb_const int mysqrt (int a);
130 ecb_unused int i;
131
132For variables, it is often nicer to put the attribute after the name, and
133avoid multiple declarations using commas:
134
135 int i ecb_unused;
58 136
59=over 4 137=over 4
60 138
61=item ecb_attribute ((attrs...)) 139=item ecb_attribute ((attrs...))
62 140
63A simple wrapper that expands to C<__attribute__((attrs))> on GCC, and 141A simple wrapper that expands to C<__attribute__((attrs))> on GCC, and to
64to nothing on other compilers, so the effect is that only GCC sees these. 142nothing on other compilers, so the effect is that only GCC sees these.
143
144Example: use the C<deprecated> attribute on a function.
145
146 ecb_attribute((__deprecated__)) void
147 do_not_use_me_anymore (void);
65 148
66=item ecb_unused 149=item ecb_unused
67 150
68Marks a function or a variable as "unused", which simply suppresses a 151Marks a function or a variable as "unused", which simply suppresses a
69warning by GCC when it detects it as unused. This is useful when you e.g. 152warning by GCC when it detects it as unused. This is useful when you e.g.
70declare a variable but do not always use it: 153declare a variable but do not always use it:
71 154
155 {
156 int var ecb_unused;
157
158 #ifdef SOMECONDITION
159 var = ...;
160 return var;
161 #else
162 return 0;
163 #endif
164 }
165
166=item ecb_inline
167
168This is not actually an attribute, but you use it like one. It expands
169either to C<static inline> or to just C<static>, if inline isn't
170supported. It should be used to declare functions that should be inlined,
171for code size or speed reasons.
172
173Example: inline this function, it surely will reduce codesize.
174
175 ecb_inline int
176 negmul (int a, int b)
72 { 177 {
73 int var ecb_unused; 178 return - (a * b);
74
75 #ifdef SOMECONDITION
76 var = ...;
77 return var;
78 #else
79 return 0;
80 #endif
81 } 179 }
82 180
83=item ecb_noinline 181=item ecb_noinline
84 182
85Prevent a function from being inlined - it might be optimised away, but 183Prevent a function from being inlined - it might be optimised away, but
86not inlined into other functions. This is useful if you know your function 184not inlined into other functions. This is useful if you know your function
87is rarely called and large enough for inlining not to be helpful. 185is rarely called and large enough for inlining not to be helpful.
88 186
89=item ecb_noreturn 187=item ecb_noreturn
90 188
189Marks a function as "not returning, ever". Some typical functions that
190don't return are C<exit> or C<abort> (which really works hard to not
191return), and now you can make your own:
192
193 ecb_noreturn void
194 my_abort (const char *errline)
195 {
196 puts (errline);
197 abort ();
198 }
199
200In this case, the compiler would probably be smart enough to deduce it on
201its own, so this is mainly useful for declarations.
202
91=item ecb_const 203=item ecb_const
92 204
205Declares that the function only depends on the values of its arguments,
206much like a mathematical function. It specifically does not read or write
207any memory any arguments might point to, global variables, or call any
208non-const functions. It also must not have any side effects.
209
210Such a function can be optimised much more aggressively by the compiler -
211for example, multiple calls with the same arguments can be optimised into
212a single call, which wouldn't be possible if the compiler would have to
213expect any side effects.
214
215It is best suited for functions in the sense of mathematical functions,
216such as a function returning the square root of its input argument.
217
218Not suited would be a function that calculates the hash of some memory
219area you pass in, prints some messages or looks at a global variable to
220decide on rounding.
221
222See C<ecb_pure> for a slightly less restrictive class of functions.
223
93=item ecb_pure 224=item ecb_pure
94 225
226Similar to C<ecb_const>, declares a function that has no side
227effects. Unlike C<ecb_const>, the function is allowed to examine global
228variables and any other memory areas (such as the ones passed to it via
229pointers).
230
231While these functions cannot be optimised as aggressively as C<ecb_const>
232functions, they can still be optimised away in many occasions, and the
233compiler has more freedom in moving calls to them around.
234
235Typical examples for such functions would be C<strlen> or C<memcmp>. A
236function that calculates the MD5 sum of some input and updates some MD5
237state passed as argument would I<NOT> be pure, however, as it would modify
238some memory area that is not the return value.
239
95=item ecb_hot 240=item ecb_hot
96 241
242This declares a function as "hot" with regards to the cache - the function
243is used so often, that it is very beneficial to keep it in the cache if
244possible.
245
246The compiler reacts by trying to place hot functions near to each other in
247memory.
248
249Whether a function is hot or not often depends on the whole program,
250and less on the function itself. C<ecb_cold> is likely more useful in
251practise.
252
97=item ecb_cold 253=item ecb_cold
98 254
255The opposite of C<ecb_hot> - declares a function as "cold" with regards to
256the cache, or in other words, this function is not called often, or not at
257speed-critical times, and keeping it in the cache might be a waste of said
258cache.
259
260In addition to placing cold functions together (or at least away from hot
261functions), this knowledge can be used in other ways, for example, the
262function will be optimised for size, as opposed to speed, and codepaths
263leading to calls to those functions can automatically be marked as if
264C<ecb_expect_false> had been used to reach them.
265
266Good examples for such functions would be error reporting functions, or
267functions only called in exceptional or rare cases.
268
99=item ecb_artificial 269=item ecb_artificial
270
271Declares the function as "artificial", in this case meaning that this
272function is not really mean to be a function, but more like an accessor
273- many methods in C++ classes are mere accessor functions, and having a
274crash reported in such a method, or single-stepping through them, is not
275usually so helpful, especially when it's inlined to just a few instructions.
276
277Marking them as artificial will instruct the debugger about just this,
278leading to happier debugging and thus happier lives.
279
280Example: in some kind of smart-pointer class, mark the pointer accessor as
281artificial, so that the whole class acts more like a pointer and less like
282some C++ abstraction monster.
283
284 template<typename T>
285 struct my_smart_ptr
286 {
287 T *value;
288
289 ecb_artificial
290 operator T *()
291 {
292 return value;
293 }
294 };
100 295
101=back 296=back
102 297
103=head2 OPTIMISATION HINTS 298=head2 OPTIMISATION HINTS
104 299
136 331
137Evaluates C<expr> and returns it. In addition, it tells the compiler that 332Evaluates C<expr> and returns it. In addition, it tells the compiler that
138the C<expr> evaluates to C<value> a lot, which can be used for static 333the C<expr> evaluates to C<value> a lot, which can be used for static
139branch optimisations. 334branch optimisations.
140 335
141Usually, you want to use the more intuitive C<ecb_likely> and 336Usually, you want to use the more intuitive C<ecb_expect_true> and
142C<ecb_unlikely> functions instead. 337C<ecb_expect_false> functions instead.
143 338
144=item bool ecb_likely (bool) 339=item bool ecb_expect_true (cond)
145 340
146=item bool ecb_unlikely (bool) 341=item bool ecb_expect_false (cond)
147 342
148These two functions expect a expression that is true or false and return 343These two functions expect a expression that is true or false and return
149C<1> or C<0>, respectively, so when used in the condition of an C<if> or 344C<1> or C<0>, respectively, so when used in the condition of an C<if> or
150other conditional statement, it will not change the program: 345other conditional statement, it will not change the program:
151 346
152 /* these two do the same thing */ 347 /* these two do the same thing */
153 if (some_condition) ...; 348 if (some_condition) ...;
154 if (ecb_likely (some_condition)) ...; 349 if (ecb_expect_true (some_condition)) ...;
155 350
156However, by using C<ecb_likely>, you tell the compiler that the condition 351However, by using C<ecb_expect_true>, you tell the compiler that the
157is likely to be true (and for C<ecb_unlikely>, that it is unlikely to be 352condition is likely to be true (and for C<ecb_expect_false>, that it is
158true). 353unlikely to be true).
159 354
160For example, when you check for a null pointer and expect this to be a 355For example, when you check for a null pointer and expect this to be a
161rare, exceptional, case, then use C<ecb_unlikely>: 356rare, exceptional, case, then use C<ecb_expect_false>:
162 357
163 void my_free (void *ptr) 358 void my_free (void *ptr)
164 { 359 {
165 if (ecb_unlikely (ptr == 0)) 360 if (ecb_expect_false (ptr == 0))
166 return; 361 return;
167 } 362 }
168 363
169Consequent use of these functions to mark away exceptional cases or to 364Consequent use of these functions to mark away exceptional cases or to
170tell the compiler what the hot path through a function is can increase 365tell the compiler what the hot path through a function is can increase
171performance considerably. 366performance considerably.
367
368You might know these functions under the name C<likely> and C<unlikely>
369- while these are common aliases, we find that the expect name is easier
370to understand when quickly skimming code. If you wish, you can use
371C<ecb_likely> instead of C<ecb_expect_true> and C<ecb_unlikely> instead of
372C<ecb_expect_false> - these are simply aliases.
172 373
173A very good example is in a function that reserves more space for some 374A very good example is in a function that reserves more space for some
174memory block (for example, inside an implementation of a string stream) - 375memory block (for example, inside an implementation of a string stream) -
175each time something is added, you have to check for a buffer overrun, but 376each time something is added, you have to check for a buffer overrun, but
176you expect that most checks will turn out to be false: 377you expect that most checks will turn out to be false:
177 378
178 /* make sure we have "size" extra room in our buffer */ 379 /* make sure we have "size" extra room in our buffer */
179 ecb_inline void 380 ecb_inline void
180 reserve (int size) 381 reserve (int size)
181 { 382 {
182 if (ecb_unlikely (current + size > end)) 383 if (ecb_expect_false (current + size > end))
183 real_reserve_method (size); /* presumably noinline */ 384 real_reserve_method (size); /* presumably noinline */
184 } 385 }
185 386
186=item bool ecb_assume (cond) 387=item bool ecb_assume (cond)
187 388
190 391
191This can be used to teach the compiler about invariants or other 392This can be used to teach the compiler about invariants or other
192conditions that might improve code generation, but which are impossible to 393conditions that might improve code generation, but which are impossible to
193deduce form the code itself. 394deduce form the code itself.
194 395
195For example, the example reservation function from the C<ecb_unlikely> 396For example, the example reservation function from the C<ecb_expect_false>
196description could be written thus (only C<ecb_assume> was added): 397description could be written thus (only C<ecb_assume> was added):
197 398
198 ecb_inline void 399 ecb_inline void
199 reserve (int size) 400 reserve (int size)
200 { 401 {
201 if (ecb_unlikely (current + size > end)) 402 if (ecb_expect_false (current + size > end))
202 real_reserve_method (size); /* presumably noinline */ 403 real_reserve_method (size); /* presumably noinline */
203 404
204 ecb_assume (current + size <= end); 405 ecb_assume (current + size <= end);
205 } 406 }
206 407
255After processing the node, (part of) the next node might already be in 456After processing the node, (part of) the next node might already be in
256cache. 457cache.
257 458
258=back 459=back
259 460
260=head2 BIT FIDDLING / BITSTUFFS 461=head2 BIT FIDDLING / BIT WIZARDRY
261 462
262=over 4 463=over 4
263 464
264=item bool ecb_big_endian () 465=item bool ecb_big_endian ()
265 466
267 468
268These two functions return true if the byte order is big endian 469These two functions return true if the byte order is big endian
269(most-significant byte first) or little endian (least-significant byte 470(most-significant byte first) or little endian (least-significant byte
270first) respectively. 471first) respectively.
271 472
473On systems that are neither, their return values are unspecified.
474
272=item int ecb_ctz32 (uint32_t x) 475=item int ecb_ctz32 (uint32_t x)
273 476
477=item int ecb_ctz64 (uint64_t x)
478
274Returns the index of the least significant bit set in C<x> (or 479Returns the index of the least significant bit set in C<x> (or
275equivalently the number of bits set to 0 before the least significant 480equivalently the number of bits set to 0 before the least significant bit
276bit set), starting from 0. If C<x> is 0 the result is undefined. A 481set), starting from 0. If C<x> is 0 the result is undefined.
277common use case is to compute the integer binary logarithm, i.e.,
278floor(log2(n)). For example:
279 482
483For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>.
484
485For example:
486
280 ecb_ctz32(3) = 0 487 ecb_ctz32 (3) = 0
281 ecb_ctz32(6) = 1 488 ecb_ctz32 (6) = 1
489
490=item bool ecb_is_pot32 (uint32_t x)
491
492=item bool ecb_is_pot64 (uint32_t x)
493
494Return true iff C<x> is a power of two or C<x == 0>.
495
496For smaller types then C<uint32_t> you can safely use C<ecb_is_pot32>.
497
498=item int ecb_ld32 (uint32_t x)
499
500=item int ecb_ld64 (uint64_t x)
501
502Returns the index of the most significant bit set in C<x>, or the number
503of digits the number requires in binary (so that C<< 2**ld <= x <
5042**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is
505to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for
506example to see how many bits a certain number requires to be encoded.
507
508This function is similar to the "count leading zero bits" function, except
509that that one returns how many zero bits are "in front" of the number (in
510the given data type), while C<ecb_ld> returns how many bits the number
511itself requires.
512
513For smaller types than C<uint32_t> you can safely use C<ecb_ld32>.
282 514
283=item int ecb_popcount32 (uint32_t x) 515=item int ecb_popcount32 (uint32_t x)
284 516
517=item int ecb_popcount64 (uint64_t x)
518
285Returns the number of bits set to 1 in C<x>. For example: 519Returns the number of bits set to 1 in C<x>.
286 520
521For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>.
522
523For example:
524
287 ecb_popcount32(7) = 3 525 ecb_popcount32 (7) = 3
288 ecb_popcount32(255) = 8 526 ecb_popcount32 (255) = 8
527
528=item uint8_t ecb_bitrev8 (uint8_t x)
529
530=item uint16_t ecb_bitrev16 (uint16_t x)
531
532=item uint32_t ecb_bitrev32 (uint32_t x)
533
534Reverses the bits in x, i.e. the MSB becomes the LSB, MSB-1 becomes LSB+1
535and so on.
536
537Example:
538
539 ecb_bitrev8 (0xa7) = 0xea
540 ecb_bitrev32 (0xffcc4411) = 0x882233ff
289 541
290=item uint32_t ecb_bswap16 (uint32_t x) 542=item uint32_t ecb_bswap16 (uint32_t x)
291 543
292=item uint32_t ecb_bswap32 (uint32_t x) 544=item uint32_t ecb_bswap32 (uint32_t x)
293 545
546=item uint64_t ecb_bswap64 (uint64_t x)
547
294These two functions return the value of the 16-bit (32-bit) variable 548These functions return the value of the 16-bit (32-bit, 64-bit) value
295C<x> after reversing the order of bytes. 549C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in
550C<ecb_bswap32>).
551
552=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count)
553
554=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count)
555
556=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
557
558=item uint64_t ecb_rotl64 (uint64_t x, unsigned int count)
559
560=item uint8_t ecb_rotr8 (uint8_t x, unsigned int count)
561
562=item uint16_t ecb_rotr16 (uint16_t x, unsigned int count)
296 563
297=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count) 564=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count)
298 565
299=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count) 566=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count)
300 567
301These two functions return the value of C<x> after shifting all the bits 568These two families of functions return the value of C<x> after rotating
302by C<count> positions to the right or left respectively. 569all the bits by C<count> positions to the right (C<ecb_rotr>) or left
570(C<ecb_rotl>).
571
572Current GCC versions understand these functions and usually compile them
573to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on
574x86).
303 575
304=back 576=back
305 577
306=head2 ARITHMETIC 578=head2 ARITHMETIC
307 579
308=over 4 580=over 4
309 581
310=item x = ecb_mod (m, n) 582=item x = ecb_mod (m, n)
311 583
312Returns the positive remainder of the modulo operation between C<m> and 584Returns C<m> modulo C<n>, which is the same as the positive remainder
585of the division operation between C<m> and C<n>, using floored
313C<n>. Unlike the C moduloe operator C<%>, this function ensures that the 586division. Unlike the C remainder operator C<%>, this function ensures that
314return value is always positive). 587the return value is always positive and that the two numbers I<m> and
588I<m' = m + i * n> result in the same value modulo I<n> - in other words,
589C<ecb_mod> implements the mathematical modulo operation, which is missing
590in the language.
315 591
316C<n> must be strictly positive (i.e. C<< >1 >>), while C<m> must be 592C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
317negatable, that is, both C<m> and C<-m> must be representable in its 593negatable, that is, both C<m> and C<-m> must be representable in its
318type. 594type (this typically excludes the minimum signed integer value, the same
595limitation as for C</> and C<%> in C).
596
597Current GCC versions compile this into an efficient branchless sequence on
598almost all CPUs.
599
600For example, when you want to rotate forward through the members of an
601array for increasing C<m> (which might be negative), then you should use
602C<ecb_mod>, as the C<%> operator might give either negative results, or
603change direction for negative values:
604
605 for (m = -100; m <= 100; ++m)
606 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))];
607
608=item x = ecb_div_rd (val, div)
609
610=item x = ecb_div_ru (val, div)
611
612Returns C<val> divided by C<div> rounded down or up, respectively.
613C<val> and C<div> must have integer types and C<div> must be strictly
614positive. Note that these functions are implemented with macros in C
615and with function templates in C++.
319 616
320=back 617=back
321 618
322=head2 UTILITY 619=head2 UTILITY
323 620
324=over 4 621=over 4
325 622
326=item element_count = ecb_array_length (name) [MACRO] 623=item element_count = ecb_array_length (name)
327 624
328Returns the number of elements in the array C<name>. For example: 625Returns the number of elements in the array C<name>. For example:
329 626
330 int primes[] = { 2, 3, 5, 7, 11 }; 627 int primes[] = { 2, 3, 5, 7, 11 };
331 int sum = 0; 628 int sum = 0;
333 for (i = 0; i < ecb_array_length (primes); i++) 630 for (i = 0; i < ecb_array_length (primes); i++)
334 sum += primes [i]; 631 sum += primes [i];
335 632
336=back 633=back
337 634
635=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
338 636
637These symbols need to be defined before including F<ecb.h> the first time.
638
639=over 4
640
641=item ECB_NO_THRADS
642
643If F<ecb.h> is never used from multiple threads, then this symbol can
644be defined, in which case memory fences (and similar constructs) are
645completely removed, leading to more efficient code and fewer dependencies.
646
647Setting this symbol to a true value implies C<ECB_NO_SMP>.
648
649=item ECB_NO_SMP
650
651The weaker version of C<ECB_NO_THREADS> - if F<ecb.h> is used from
652multiple threads, but never concurrently (e.g. if the system the program
653runs on has only a single CPU with a single core, no hyperthreading and so
654on), then this symbol can be defined, leading to more efficient code and
655fewer dependencies.
656
657=back
658
659

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines