ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.1 by root, Thu May 26 19:39:40 2011 UTC vs.
Revision 1.18 by root, Fri May 27 00:01:28 2011 UTC

1=head1 LIBECB - e-C-Builtins
2
3=head2 ABOUT LIBECB
4
5Libecb is currently a simple header file that doesn't require any
6configuration to use or include in your project.
7
8It's part of the e-suite of libraries, other members of which include
9libev and libeio.
10
11Its homepage can be found here:
12
13 http://software.schmorp.de/pkg/libecb
14
15It mainly provides a number of wrappers around GCC built-ins, together
16with replacement functions for other compilers. In addition to this,
17it provides a number of other lowlevel C utilities, such as endianness
18detection, byte swapping or bit rotations.
19
20Or in other words, things that should be built-in into any standard C
21system, but aren't.
22
23More might come.
24
25=head2 ABOUT THE HEADER
26
27At the moment, all you have to do is copy F<ecb.h> somewhere where your
28compiler can find it and include it:
29
30 #include <ecb.h>
31
32The header should work fine for both C and C++ compilation, and gives you
33all of F<inttypes.h> in addition to the ECB symbols.
34
35There are currently no object files to link to - future versions might
36come with an (optional) object code library to link against, to reduce
37code size or gain access to additional features.
38
39It also currently includes everything from F<inttypes.h>.
40
41=head2 ABOUT THIS MANUAL / CONVENTIONS
42
43This manual mainly describes each (public) function available after
44including the F<ecb.h> header. The header might define other symbols than
45these, but these are not part of the public API, and not supported in any
46way.
47
48When the manual mentions a "function" then this could be defined either as
49as inline function, a macro, or an external symbol.
50
51When functions use a concrete standard type, such as C<int> or
52C<uint32_t>, then the corresponding function works only with that type. If
53only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
54the corresponding function relies on C to implement the correct types, and
55is usually implemented as a macro. Specifically, a "bool" in this manual
56refers to any kind of boolean value, not a specific type.
1 57
2=head2 GCC ATTRIBUTES 58=head2 GCC ATTRIBUTES
3 59
60blabla where to put, what others
61
4=over 4 62=over 4
5 63
6=item ecb_attribute(attrlist) 64=item ecb_attribute ((attrs...))
7=item ecb_noinline ecb_attribute ((noinline))
8=item ecb_noreturn ecb_attribute ((noreturn))
9=item ecb_unused ecb_attribute ((unused))
10=item ecb_const ecb_attribute ((const))
11=item ecb_pure ecb_attribute ((pure))
12=item ecb_hot ecb_attribute ((hot)) /* 4.3 */
13=item ecb_cold ecb_attribute ((cold)) /* 4.3 */
14 65
66A simple wrapper that expands to C<__attribute__((attrs))> on GCC, and to
67nothing on other compilers, so the effect is that only GCC sees these.
68
69Example: use the C<deprecated> attribute on a function.
70
71 ecb_attribute((__deprecated__)) void
72 do_not_use_me_anymore (void);
73
74=item ecb_unused
75
76Marks a function or a variable as "unused", which simply suppresses a
77warning by GCC when it detects it as unused. This is useful when you e.g.
78declare a variable but do not always use it:
79
80 {
81 int var ecb_unused;
82
83 #ifdef SOMECONDITION
84 var = ...;
85 return var;
86 #else
87 return 0;
88 #endif
89 }
90
91=item ecb_noinline
92
93Prevent a function from being inlined - it might be optimised away, but
94not inlined into other functions. This is useful if you know your function
95is rarely called and large enough for inlining not to be helpful.
96
97=item ecb_noreturn
98
99Marks a function as "not returning, ever". Some typical functions that
100don't return are C<exit> or C<abort> (which really works hard to not
101return), and now you can make your own:
102
103 ecb_noreturn void
104 my_abort (const char *errline)
105 {
106 puts (errline);
107 abort ();
108 }
109
110In this case, the compiler would probbaly be smart enough to decude it on
111it's own, so this is mainly useful for declarations.
112
113=item ecb_const
114
115Declares that the function only depends on the values of it's arguments,
116much like a mathematical function. It specifically does not read or write
117any memory any arguments might point to, global variables, or call any
118non-const functions. It also must not have any side effects.
119
120Such a function can be optimised much more aggressively by the compiler -
121for example, multiple calls with the same arguments can be optimised into
122a single call, which wouldn't be possible if the compiler would have to
123expect any side effects.
124
125It is best suited for functions in the sense of mathematical functions,
126such as a function return the square root of its input argument.
127
128Not suited would be a function that calculates the hash of some memory
129area you pass in, prints some messages or looks at a global variable to
130decide on rounding.
131
132See C<ecb_pure> for a slightly less restrictive class of functions.
133
134=item ecb_pure
135
136Similar to C<ecb_const>, declares a function that has no side
137effects. Unlike C<ecb_const>, the function is allowed to examine global
138variables and any other memory areas (such as the ones passed to it via
139pointers).
140
141While these functions cannot be optimised as aggressively as C<ecb_const>
142functions, they can still be optimised away in many occasions, and the
143compiler has more freedom in moving calls to them around.
144
145Typical examples for such functions would be C<strlen> or C<memcmp>. A
146function that calculates the MD5 sum of some input and updates some MD5
147state passed as argument would I<NOT> be pure, however, as it would modify
148some memory area that is not the return value.
149
150=item ecb_hot
151
152This declares a function as "hot" with regards to the cache - the function
153is used so often, that it is very beneficial to keep it in the cache if
154possible.
155
156The compiler reacts by trying to place hot functions near to each other in
157memory.
158
159Whether a function is hot or not often depend son the whole program,
160and less on the function itself. C<ecb_cold> is likely more useful in
161practise.
162
163=item ecb_cold
164
165The opposite of C<ecb_hot> - declares a function as "cold" with regards to
166the cache, or in other words, this function is not called often, or not at
167speed-critical times, and keeping it in the cache might be a waste of said
168cache.
169
170In addition to placing cold functions together (or at least away from hot
171functions), this knowledge can be used in other ways, for example, the
172function will be optimised for size, as opposed to speed, and codepaths
173leading to calls to those functions can automatically be marked as if
174C<ecb_unlikel> had been used to reach them.
175
176Good examples for such functions would be error reporting functions, or
177functions only called in exceptional or rare cases.
178
179=item ecb_artificial
180
181Declares the function as "artificial", in this case meaning that this
182function is not really mean to be a function, but more like an accessor
183- many methods in C++ classes are mere accessor functions, and having a
184crash reported in such a method, or single-stepping through them, is not
185usually so helpful, especially when it's inlined to just a few instructions.
186
187Marking them as artificial will instruct the debugger about just this,
188leading to happier debugging and thus happier lives.
189
190Example: in some kind of smart-pointer class, mark the pointer accessor as
191artificial, so that the whole class acts more like a pointer and less like
192some C++ abstraction monster.
193
194 template<typename T>
195 struct my_smart_ptr
196 {
197 T *value;
198
199 ecb_artificial
200 operator T *()
201 {
202 return value;
203 }
204 };
205
15 =back 206=back
16 207
17=head2 OPTIMISATION HINTS 208=head2 OPTIMISATION HINTS
18 209
19=over 4 210=over 4
20 211
21=item bool ecb_is_constant(expr) 212=item bool ecb_is_constant(expr)
22 213
214Returns true iff the expression can be deduced to be a compile-time
215constant, and false otherwise.
216
217For example, when you have a C<rndm16> function that returns a 16 bit
218random number, and you have a function that maps this to a range from
2190..n-1, then you could use this inline function in a header file:
220
221 ecb_inline uint32_t
222 rndm (uint32_t n)
223 {
224 return (n * (uint32_t)rndm16 ()) >> 16;
225 }
226
227However, for powers of two, you could use a normal mask, but that is only
228worth it if, at compile time, you can detect this case. This is the case
229when the passed number is a constant and also a power of two (C<n & (n -
2301) == 0>):
231
232 ecb_inline uint32_t
233 rndm (uint32_t n)
234 {
235 return is_constant (n) && !(n & (n - 1))
236 ? rndm16 () & (num - 1)
237 : (n * (uint32_t)rndm16 ()) >> 16;
238 }
239
23=item bool ecb_expect(expr,value) 240=item bool ecb_expect (expr, value)
24 241
242Evaluates C<expr> and returns it. In addition, it tells the compiler that
243the C<expr> evaluates to C<value> a lot, which can be used for static
244branch optimisations.
245
246Usually, you want to use the more intuitive C<ecb_likely> and
247C<ecb_unlikely> functions instead.
248
249=item bool ecb_likely (cond)
250
25=item bool ecb_unlikely(bool) 251=item bool ecb_unlikely (cond)
26 252
27=item bool ecb_likely(bool) 253These two functions expect a expression that is true or false and return
254C<1> or C<0>, respectively, so when used in the condition of an C<if> or
255other conditional statement, it will not change the program:
28 256
257 /* these two do the same thing */
258 if (some_condition) ...;
259 if (ecb_likely (some_condition)) ...;
260
261However, by using C<ecb_likely>, you tell the compiler that the condition
262is likely to be true (and for C<ecb_unlikely>, that it is unlikely to be
263true).
264
265For example, when you check for a null pointer and expect this to be a
266rare, exceptional, case, then use C<ecb_unlikely>:
267
268 void my_free (void *ptr)
269 {
270 if (ecb_unlikely (ptr == 0))
271 return;
272 }
273
274Consequent use of these functions to mark away exceptional cases or to
275tell the compiler what the hot path through a function is can increase
276performance considerably.
277
278A very good example is in a function that reserves more space for some
279memory block (for example, inside an implementation of a string stream) -
280each time something is added, you have to check for a buffer overrun, but
281you expect that most checks will turn out to be false:
282
283 /* make sure we have "size" extra room in our buffer */
284 ecb_inline void
285 reserve (int size)
286 {
287 if (ecb_unlikely (current + size > end))
288 real_reserve_method (size); /* presumably noinline */
289 }
290
29=item bool ecb_assume(cond) 291=item bool ecb_assume (cond)
30 292
293Try to tell the compiler that some condition is true, even if it's not
294obvious.
295
296This can be used to teach the compiler about invariants or other
297conditions that might improve code generation, but which are impossible to
298deduce form the code itself.
299
300For example, the example reservation function from the C<ecb_unlikely>
301description could be written thus (only C<ecb_assume> was added):
302
303 ecb_inline void
304 reserve (int size)
305 {
306 if (ecb_unlikely (current + size > end))
307 real_reserve_method (size); /* presumably noinline */
308
309 ecb_assume (current + size <= end);
310 }
311
312If you then call this function twice, like this:
313
314 reserve (10);
315 reserve (1);
316
317Then the compiler I<might> be able to optimise out the second call
318completely, as it knows that C<< current + 1 > end >> is false and the
319call will never be executed.
320
31=item bool ecb_unreachable() 321=item bool ecb_unreachable ()
32 322
323This function does nothing itself, except tell the compiler that it will
324never be executed. Apart from suppressing a warning in some cases, this
325function can be used to implement C<ecb_assume> or similar functions.
326
33=item bool ecb_prefetch(addr,rw,locality) 327=item bool ecb_prefetch (addr, rw, locality)
34 328
329Tells the compiler to try to prefetch memory at the given C<addr>ess
330for either reading (C<rw> = 0) or writing (C<rw> = 1). A C<locality> of
331C<0> means that there will only be one access later, C<3> means that
332the data will likely be accessed very often, and values in between mean
333something... in between. The memory pointed to by the address does not
334need to be accessible (it could be a null pointer for example), but C<rw>
335and C<locality> must be compile-time constants.
336
337An obvious way to use this is to prefetch some data far away, in a big
338array you loop over. This prefetches memory some 128 array elements later,
339in the hope that it will be ready when the CPU arrives at that location.
340
341 int sum = 0;
342
343 for (i = 0; i < N; ++i)
344 {
345 sum += arr [i]
346 ecb_prefetch (arr + i + 128, 0, 0);
347 }
348
349It's hard to predict how far to prefetch, and most CPUs that can prefetch
350are often good enough to predict this kind of behaviour themselves. It
351gets more interesting with linked lists, especially when you do some fair
352processing on each list element:
353
354 for (node *n = start; n; n = n->next)
355 {
356 ecb_prefetch (n->next, 0, 0);
357 ... do medium amount of work with *n
358 }
359
360After processing the node, (part of) the next node might already be in
361cache.
362
35 =back 363=back
36 364
37=head2 BIT FIDDLING / BITSTUFFS 365=head2 BIT FIDDLING / BITSTUFFS
38 366
367=over 4
368
39bool ecb_big_endian (); 369=item bool ecb_big_endian ()
370
40bool ecb_little_endian (); 371=item bool ecb_little_endian ()
372
373These two functions return true if the byte order is big endian
374(most-significant byte first) or little endian (least-significant byte
375first) respectively.
376
41int ecb_ctz32 (uint32_t x); 377=item int ecb_ctz32 (uint32_t x)
378
379Returns the index of the least significant bit set in C<x> (or
380equivalently the number of bits set to 0 before the least significant
381bit set), starting from 0. If C<x> is 0 the result is undefined. A
382common use case is to compute the integer binary logarithm, i.e.,
383floor(log2(n)). For example:
384
385 ecb_ctz32 (3) = 0
386 ecb_ctz32 (6) = 1
387
42int ecb_popcount32 (uint32_t x); 388=item int ecb_popcount32 (uint32_t x)
43uint32_t ecb_bswap32 (uint32_t x); 389
390Returns the number of bits set to 1 in C<x>. For example:
391
392 ecb_popcount32 (7) = 3
393 ecb_popcount32 (255) = 8
394
44uint32_t ecb_bswap16 (uint32_t x); 395=item uint32_t ecb_bswap16 (uint32_t x)
396
397=item uint32_t ecb_bswap32 (uint32_t x)
398
399These two functions return the value of the 16-bit (32-bit) variable
400C<x> after reversing the order of bytes.
401
45uint32_t ecb_rotr32 (uint32_t x, unsigned int count); 402=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count)
403
46uint32_t ecb_rotl32 (uint32_t x, unsigned int count); 404=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
405
406These two functions return the value of C<x> after shifting all the bits
407by C<count> positions to the right or left respectively.
408
409=back
47 410
48=head2 ARITHMETIC 411=head2 ARITHMETIC
49 412
413=over 4
414
50x = ecb_mod (m, n) 415=item x = ecb_mod (m, n)
416
417Returns the positive remainder of the modulo operation between C<m> and
418C<n>. Unlike the C modulo operator C<%>, this function ensures that the
419return value is always positive).
420
421C<n> must be strictly positive (i.e. C<< >1 >>), while C<m> must be
422negatable, that is, both C<m> and C<-m> must be representable in its
423type.
424
425=back
51 426
52=head2 UTILITY 427=head2 UTILITY
53 428
54ecb_array_length (name) 429=over 4
55 430
431=item element_count = ecb_array_length (name) [MACRO]
56 432
433Returns the number of elements in the array C<name>. For example:
434
435 int primes[] = { 2, 3, 5, 7, 11 };
436 int sum = 0;
437
438 for (i = 0; i < ecb_array_length (primes); i++)
439 sum += primes [i];
440
441=back
442
443

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines