ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.14 by root, Thu May 26 23:23:08 2011 UTC vs.
Revision 1.56 by root, Mon Jun 9 17:31:57 2014 UTC

3=head2 ABOUT LIBECB 3=head2 ABOUT LIBECB
4 4
5Libecb is currently a simple header file that doesn't require any 5Libecb is currently a simple header file that doesn't require any
6configuration to use or include in your project. 6configuration to use or include in your project.
7 7
8It's part of the e-suite of libraries, other memembers of which include 8It's part of the e-suite of libraries, other members of which include
9libev and libeio. 9libev and libeio.
10 10
11Its homepage can be found here: 11Its homepage can be found here:
12 12
13 http://software.schmorp.de/pkg/libecb 13 http://software.schmorp.de/pkg/libecb
14 14
15It mainly provides a number of wrappers around GCC built-ins, together 15It mainly provides a number of wrappers around GCC built-ins, together
16with replacement functions for other compilers. In addition to this, 16with replacement functions for other compilers. In addition to this,
17it provides a number of other lowlevel C utilities, such endienness 17it provides a number of other lowlevel C utilities, such as endianness
18detection, byte swapping or bit rotations. 18detection, byte swapping or bit rotations.
19
20Or in other words, things that should be built into any standard C system,
21but aren't, implemented as efficient as possible with GCC, and still
22correct with other compilers.
19 23
20More might come. 24More might come.
21 25
22=head2 ABOUT THE HEADER 26=head2 ABOUT THE HEADER
23 27
27 #include <ecb.h> 31 #include <ecb.h>
28 32
29The header should work fine for both C and C++ compilation, and gives you 33The header should work fine for both C and C++ compilation, and gives you
30all of F<inttypes.h> in addition to the ECB symbols. 34all of F<inttypes.h> in addition to the ECB symbols.
31 35
32There are currently no objetc files to link to - future versions might 36There are currently no object files to link to - future versions might
33come with an (optional) object code library to link against, to reduce 37come with an (optional) object code library to link against, to reduce
34code size or gain access to additional features. 38code size or gain access to additional features.
35 39
36It also currently includes everything from F<inttypes.h>. 40It also currently includes everything from F<inttypes.h>.
37 41
50only a generic name is used (C<expr>, C<cond>, C<value> and so on), then 54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
51the corresponding function relies on C to implement the correct types, and 55the corresponding function relies on C to implement the correct types, and
52is usually implemented as a macro. Specifically, a "bool" in this manual 56is usually implemented as a macro. Specifically, a "bool" in this manual
53refers to any kind of boolean value, not a specific type. 57refers to any kind of boolean value, not a specific type.
54 58
59=head2 TYPES / TYPE SUPPORT
60
61ecb.h makes sure that the following types are defined (in the expected way):
62
63 int8_t uint8_t int16_t uint16_t
64 int32_t uint32_t int64_t uint64_t
65 intptr_t uintptr_t
66
67The macro C<ECB_PTRSIZE> is defined to the size of a pointer on this
68platform (currently C<4> or C<8>) and can be used in preprocessor
69expressions.
70
71For C<ptrdiff_t> and C<size_t> use C<stddef.h>.
72
73=head2 LANGUAGE/COMPILER VERSIONS
74
75All the following symbols expand to an expression that can be tested in
76preprocessor instructions as well as treated as a boolean (use C<!!> to
77ensure it's either C<0> or C<1> if you need that).
78
79=over 4
80
81=item ECB_C
82
83True if the implementation defines the C<__STDC__> macro to a true value,
84while not claiming to be C++.
85
86=item ECB_C99
87
88True if the implementation claims to be compliant to C99 (ISO/IEC
899899:1999) or any later version, while not claiming to be C++.
90
91Note that later versions (ECB_C11) remove core features again (for
92example, variable length arrays).
93
94=item ECB_C11
95
96True if the implementation claims to be compliant to C11 (ISO/IEC
979899:2011) or any later version, while not claiming to be C++.
98
99=item ECB_CPP
100
101True if the implementation defines the C<__cplusplus__> macro to a true
102value, which is typically true for C++ compilers.
103
104=item ECB_CPP11
105
106True if the implementation claims to be compliant to ISO/IEC 14882:2011
107(C++11) or any later version.
108
109=item ECB_GCC_VERSION(major,minor)
110
111Expands to a true value (suitable for testing in by the preprocessor)
112if the compiler used is GNU C and the version is the given version, or
113higher.
114
115This macro tries to return false on compilers that claim to be GCC
116compatible but aren't.
117
118=item ECB_EXTERN_C
119
120Expands to C<extern "C"> in C++, and a simple C<extern> in C.
121
122This can be used to declare a single external C function:
123
124 ECB_EXTERN_C int printf (const char *format, ...);
125
126=item ECB_EXTERN_C_BEG / ECB_EXTERN_C_END
127
128These two macros can be used to wrap multiple C<extern "C"> definitions -
129they expand to nothing in C.
130
131They are most useful in header files:
132
133 ECB_EXTERN_C_BEG
134
135 int mycfun1 (int x);
136 int mycfun2 (int x);
137
138 ECB_EXTERN_C_END
139
140=item ECB_STDFP
141
142If this evaluates to a true value (suitable for testing in by the
143preprocessor), then C<float> and C<double> use IEEE 754 single/binary32
144and double/binary64 representations internally I<and> the endianness of
145both types match the endianness of C<uint32_t> and C<uint64_t>.
146
147This means you can just copy the bits of a C<float> (or C<double>) to an
148C<uint32_t> (or C<uint64_t>) and get the raw IEEE 754 bit representation
149without having to think about format or endianness.
150
151This is true for basically all modern platforms, although F<ecb.h> might
152not be able to deduce this correctly everywhere and might err on the safe
153side.
154
155=item ECB_AMD64, ECB_AMD64_X32
156
157These two macros are defined to C<1> on the x86_64/amd64 ABI and the X32
158ABI, respectively, and undefined elsewhere.
159
160The designers of the new X32 ABI for some inexplicable reason decided to
161make it look exactly like amd64, even though it's completely incompatible
162to that ABI, breaking about every piece of software that assumed that
163C<__x86_64> stands for, well, the x86-64 ABI, making these macros
164necessary.
165
166=back
167
55=head2 GCC ATTRIBUTES 168=head2 GCC ATTRIBUTES
56 169
57blabla where to put, what others 170A major part of libecb deals with GCC attributes. These are additional
171attributes that you can assign to functions, variables and sometimes even
172types - much like C<const> or C<volatile> in C.
173
174While GCC allows declarations to show up in many surprising places,
175but not in many expected places, the safest way is to put attribute
176declarations before the whole declaration:
177
178 ecb_const int mysqrt (int a);
179 ecb_unused int i;
180
181For variables, it is often nicer to put the attribute after the name, and
182avoid multiple declarations using commas:
183
184 int i ecb_unused;
58 185
59=over 4 186=over 4
60 187
61=item ecb_attribute ((attrs...)) 188=item ecb_attribute ((attrs...))
62 189
63A simple wrapper that expands to C<__attribute__((attrs))> on GCC, and 190A simple wrapper that expands to C<__attribute__((attrs))> on GCC, and to
64to nothing on other compilers, so the effect is that only GCC sees these. 191nothing on other compilers, so the effect is that only GCC sees these.
192
193Example: use the C<deprecated> attribute on a function.
194
195 ecb_attribute((__deprecated__)) void
196 do_not_use_me_anymore (void);
65 197
66=item ecb_unused 198=item ecb_unused
67 199
68Marks a function or a variable as "unused", which simply suppresses a 200Marks a function or a variable as "unused", which simply suppresses a
69warning by GCC when it detects it as unused. This is useful when you e.g. 201warning by GCC when it detects it as unused. This is useful when you e.g.
70declare a variable but do not always use it: 202declare a variable but do not always use it:
71 203
204 {
205 int var ecb_unused;
206
207 #ifdef SOMECONDITION
208 var = ...;
209 return var;
210 #else
211 return 0;
212 #endif
213 }
214
215=item ecb_deprecated
216
217Similar to C<ecb_unused>, but marks a function, variable or type as
218deprecated. This makes some compilers warn when the type is used.
219
220=item ecb_inline
221
222This is not actually an attribute, but you use it like one. It expands
223either to C<static inline> or to just C<static>, if inline isn't
224supported. It should be used to declare functions that should be inlined,
225for code size or speed reasons.
226
227Example: inline this function, it surely will reduce codesize.
228
229 ecb_inline int
230 negmul (int a, int b)
72 { 231 {
73 int var ecb_unused; 232 return - (a * b);
74
75 #ifdef SOMECONDITION
76 var = ...;
77 return var;
78 #else
79 return 0;
80 #endif
81 } 233 }
82 234
83=item ecb_noinline 235=item ecb_noinline
84 236
85Prevent a function from being inlined - it might be optimised away, but 237Prevent a function from being inlined - it might be optimised away, but
86not inlined into other functions. This is useful if you know your function 238not inlined into other functions. This is useful if you know your function
87is rarely called and large enough for inlining not to be helpful. 239is rarely called and large enough for inlining not to be helpful.
88 240
89=item ecb_noreturn 241=item ecb_noreturn
90 242
243Marks a function as "not returning, ever". Some typical functions that
244don't return are C<exit> or C<abort> (which really works hard to not
245return), and now you can make your own:
246
247 ecb_noreturn void
248 my_abort (const char *errline)
249 {
250 puts (errline);
251 abort ();
252 }
253
254In this case, the compiler would probably be smart enough to deduce it on
255its own, so this is mainly useful for declarations.
256
257=item ecb_restrict
258
259Expands to the C<restrict> keyword or equivalent on compilers that support
260them, and to nothing on others. Must be specified on a pointer type or
261an array index to indicate that the memory doesn't alias with any other
262restricted pointer in the same scope.
263
264Example: multiply a vector, and allow the compiler to parallelise the
265loop, because it knows it doesn't overwrite input values.
266
267 void
268 multiply (float *ecb_restrict src,
269 float *ecb_restrict dst,
270 int len, float factor)
271 {
272 int i;
273
274 for (i = 0; i < len; ++i)
275 dst [i] = src [i] * factor;
276 }
277
91=item ecb_const 278=item ecb_const
92 279
280Declares that the function only depends on the values of its arguments,
281much like a mathematical function. It specifically does not read or write
282any memory any arguments might point to, global variables, or call any
283non-const functions. It also must not have any side effects.
284
285Such a function can be optimised much more aggressively by the compiler -
286for example, multiple calls with the same arguments can be optimised into
287a single call, which wouldn't be possible if the compiler would have to
288expect any side effects.
289
290It is best suited for functions in the sense of mathematical functions,
291such as a function returning the square root of its input argument.
292
293Not suited would be a function that calculates the hash of some memory
294area you pass in, prints some messages or looks at a global variable to
295decide on rounding.
296
297See C<ecb_pure> for a slightly less restrictive class of functions.
298
93=item ecb_pure 299=item ecb_pure
94 300
301Similar to C<ecb_const>, declares a function that has no side
302effects. Unlike C<ecb_const>, the function is allowed to examine global
303variables and any other memory areas (such as the ones passed to it via
304pointers).
305
306While these functions cannot be optimised as aggressively as C<ecb_const>
307functions, they can still be optimised away in many occasions, and the
308compiler has more freedom in moving calls to them around.
309
310Typical examples for such functions would be C<strlen> or C<memcmp>. A
311function that calculates the MD5 sum of some input and updates some MD5
312state passed as argument would I<NOT> be pure, however, as it would modify
313some memory area that is not the return value.
314
95=item ecb_hot 315=item ecb_hot
96 316
317This declares a function as "hot" with regards to the cache - the function
318is used so often, that it is very beneficial to keep it in the cache if
319possible.
320
321The compiler reacts by trying to place hot functions near to each other in
322memory.
323
324Whether a function is hot or not often depends on the whole program,
325and less on the function itself. C<ecb_cold> is likely more useful in
326practise.
327
97=item ecb_cold 328=item ecb_cold
98 329
330The opposite of C<ecb_hot> - declares a function as "cold" with regards to
331the cache, or in other words, this function is not called often, or not at
332speed-critical times, and keeping it in the cache might be a waste of said
333cache.
334
335In addition to placing cold functions together (or at least away from hot
336functions), this knowledge can be used in other ways, for example, the
337function will be optimised for size, as opposed to speed, and codepaths
338leading to calls to those functions can automatically be marked as if
339C<ecb_expect_false> had been used to reach them.
340
341Good examples for such functions would be error reporting functions, or
342functions only called in exceptional or rare cases.
343
99=item ecb_artificial 344=item ecb_artificial
345
346Declares the function as "artificial", in this case meaning that this
347function is not really meant to be a function, but more like an accessor
348- many methods in C++ classes are mere accessor functions, and having a
349crash reported in such a method, or single-stepping through them, is not
350usually so helpful, especially when it's inlined to just a few instructions.
351
352Marking them as artificial will instruct the debugger about just this,
353leading to happier debugging and thus happier lives.
354
355Example: in some kind of smart-pointer class, mark the pointer accessor as
356artificial, so that the whole class acts more like a pointer and less like
357some C++ abstraction monster.
358
359 template<typename T>
360 struct my_smart_ptr
361 {
362 T *value;
363
364 ecb_artificial
365 operator T *()
366 {
367 return value;
368 }
369 };
100 370
101=back 371=back
102 372
103=head2 OPTIMISATION HINTS 373=head2 OPTIMISATION HINTS
104 374
136 406
137Evaluates C<expr> and returns it. In addition, it tells the compiler that 407Evaluates C<expr> and returns it. In addition, it tells the compiler that
138the C<expr> evaluates to C<value> a lot, which can be used for static 408the C<expr> evaluates to C<value> a lot, which can be used for static
139branch optimisations. 409branch optimisations.
140 410
141Usually, you want to use the more intuitive C<ecb_likely> and 411Usually, you want to use the more intuitive C<ecb_expect_true> and
142C<ecb_unlikely> functions instead. 412C<ecb_expect_false> functions instead.
143 413
144=item bool ecb_likely (bool) 414=item bool ecb_expect_true (cond)
145 415
146=item bool ecb_unlikely (bool) 416=item bool ecb_expect_false (cond)
147 417
148These two functions expect a expression that is true or false and return 418These two functions expect a expression that is true or false and return
149C<1> or C<0>, respectively, so when used in the condition of an C<if> or 419C<1> or C<0>, respectively, so when used in the condition of an C<if> or
150other conditional statement, it will not change the program: 420other conditional statement, it will not change the program:
151 421
152 /* these two do the same thing */ 422 /* these two do the same thing */
153 if (some_condition) ...; 423 if (some_condition) ...;
154 if (ecb_likely (some_condition)) ...; 424 if (ecb_expect_true (some_condition)) ...;
155 425
156However, by using C<ecb_likely>, you tell the compiler that the condition 426However, by using C<ecb_expect_true>, you tell the compiler that the
157is likely to be true (and for C<ecb_unlikely>, that it is unlikely to be 427condition is likely to be true (and for C<ecb_expect_false>, that it is
158true). 428unlikely to be true).
159 429
160For example, when you check for a null pointer and expect this to be a 430For example, when you check for a null pointer and expect this to be a
161rare, exceptional, case, then use C<ecb_unlikely>: 431rare, exceptional, case, then use C<ecb_expect_false>:
162 432
163 void my_free (void *ptr) 433 void my_free (void *ptr)
164 { 434 {
165 if (ecb_unlikely (ptr == 0)) 435 if (ecb_expect_false (ptr == 0))
166 return; 436 return;
167 } 437 }
168 438
169Consequent use of these functions to mark away exceptional cases or to 439Consequent use of these functions to mark away exceptional cases or to
170tell the compiler what the hot path through a function is can increase 440tell the compiler what the hot path through a function is can increase
171performance considerably. 441performance considerably.
442
443You might know these functions under the name C<likely> and C<unlikely>
444- while these are common aliases, we find that the expect name is easier
445to understand when quickly skimming code. If you wish, you can use
446C<ecb_likely> instead of C<ecb_expect_true> and C<ecb_unlikely> instead of
447C<ecb_expect_false> - these are simply aliases.
172 448
173A very good example is in a function that reserves more space for some 449A very good example is in a function that reserves more space for some
174memory block (for example, inside an implementation of a string stream) - 450memory block (for example, inside an implementation of a string stream) -
175each time something is added, you have to check for a buffer overrun, but 451each time something is added, you have to check for a buffer overrun, but
176you expect that most checks will turn out to be false: 452you expect that most checks will turn out to be false:
177 453
178 /* make sure we have "size" extra room in our buffer */ 454 /* make sure we have "size" extra room in our buffer */
179 ecb_inline void 455 ecb_inline void
180 reserve (int size) 456 reserve (int size)
181 { 457 {
182 if (ecb_unlikely (current + size > end)) 458 if (ecb_expect_false (current + size > end))
183 real_reserve_method (size); /* presumably noinline */ 459 real_reserve_method (size); /* presumably noinline */
184 } 460 }
185 461
186=item bool ecb_assume (cond) 462=item bool ecb_assume (cond)
187 463
190 466
191This can be used to teach the compiler about invariants or other 467This can be used to teach the compiler about invariants or other
192conditions that might improve code generation, but which are impossible to 468conditions that might improve code generation, but which are impossible to
193deduce form the code itself. 469deduce form the code itself.
194 470
195For example, the example reservation function from the C<ecb_unlikely> 471For example, the example reservation function from the C<ecb_expect_false>
196description could be written thus (only C<ecb_assume> was added): 472description could be written thus (only C<ecb_assume> was added):
197 473
198 ecb_inline void 474 ecb_inline void
199 reserve (int size) 475 reserve (int size)
200 { 476 {
201 if (ecb_unlikely (current + size > end)) 477 if (ecb_expect_false (current + size > end))
202 real_reserve_method (size); /* presumably noinline */ 478 real_reserve_method (size); /* presumably noinline */
203 479
204 ecb_assume (current + size <= end); 480 ecb_assume (current + size <= end);
205 } 481 }
206 482
255After processing the node, (part of) the next node might already be in 531After processing the node, (part of) the next node might already be in
256cache. 532cache.
257 533
258=back 534=back
259 535
260=head2 BIT FIDDLING / BITSTUFFS 536=head2 BIT FIDDLING / BIT WIZARDRY
261 537
262=over 4 538=over 4
263 539
264=item bool ecb_big_endian () 540=item bool ecb_big_endian ()
265 541
267 543
268These two functions return true if the byte order is big endian 544These two functions return true if the byte order is big endian
269(most-significant byte first) or little endian (least-significant byte 545(most-significant byte first) or little endian (least-significant byte
270first) respectively. 546first) respectively.
271 547
548On systems that are neither, their return values are unspecified.
549
272=item int ecb_ctz32 (uint32_t x) 550=item int ecb_ctz32 (uint32_t x)
273 551
552=item int ecb_ctz64 (uint64_t x)
553
274Returns the index of the least significant bit set in C<x> (or 554Returns the index of the least significant bit set in C<x> (or
275equivalently the number of bits set to 0 before the least significant 555equivalently the number of bits set to 0 before the least significant bit
276bit set), starting from 0. If C<x> is 0 the result is undefined. A 556set), starting from 0. If C<x> is 0 the result is undefined.
277common use case is to compute the integer binary logarithm, i.e.,
278floor(log2(n)). For example:
279 557
558For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>.
559
560For example:
561
280 ecb_ctz32(3) = 0 562 ecb_ctz32 (3) = 0
281 ecb_ctz32(6) = 1 563 ecb_ctz32 (6) = 1
564
565=item bool ecb_is_pot32 (uint32_t x)
566
567=item bool ecb_is_pot64 (uint32_t x)
568
569Return true iff C<x> is a power of two or C<x == 0>.
570
571For smaller types then C<uint32_t> you can safely use C<ecb_is_pot32>.
572
573=item int ecb_ld32 (uint32_t x)
574
575=item int ecb_ld64 (uint64_t x)
576
577Returns the index of the most significant bit set in C<x>, or the number
578of digits the number requires in binary (so that C<< 2**ld <= x <
5792**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is
580to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for
581example to see how many bits a certain number requires to be encoded.
582
583This function is similar to the "count leading zero bits" function, except
584that that one returns how many zero bits are "in front" of the number (in
585the given data type), while C<ecb_ld> returns how many bits the number
586itself requires.
587
588For smaller types than C<uint32_t> you can safely use C<ecb_ld32>.
282 589
283=item int ecb_popcount32 (uint32_t x) 590=item int ecb_popcount32 (uint32_t x)
284 591
592=item int ecb_popcount64 (uint64_t x)
593
285Returns the number of bits set to 1 in C<x>. For example: 594Returns the number of bits set to 1 in C<x>.
286 595
596For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>.
597
598For example:
599
287 ecb_popcount32(7) = 3 600 ecb_popcount32 (7) = 3
288 ecb_popcount32(255) = 8 601 ecb_popcount32 (255) = 8
602
603=item uint8_t ecb_bitrev8 (uint8_t x)
604
605=item uint16_t ecb_bitrev16 (uint16_t x)
606
607=item uint32_t ecb_bitrev32 (uint32_t x)
608
609Reverses the bits in x, i.e. the MSB becomes the LSB, MSB-1 becomes LSB+1
610and so on.
611
612Example:
613
614 ecb_bitrev8 (0xa7) = 0xea
615 ecb_bitrev32 (0xffcc4411) = 0x882233ff
289 616
290=item uint32_t ecb_bswap16 (uint32_t x) 617=item uint32_t ecb_bswap16 (uint32_t x)
291 618
292=item uint32_t ecb_bswap32 (uint32_t x) 619=item uint32_t ecb_bswap32 (uint32_t x)
293 620
621=item uint64_t ecb_bswap64 (uint64_t x)
622
294These two functions return the value of the 16-bit (32-bit) variable 623These functions return the value of the 16-bit (32-bit, 64-bit) value
295C<x> after reversing the order of bytes. 624C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in
625C<ecb_bswap32>).
626
627=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count)
628
629=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count)
630
631=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
632
633=item uint64_t ecb_rotl64 (uint64_t x, unsigned int count)
634
635=item uint8_t ecb_rotr8 (uint8_t x, unsigned int count)
636
637=item uint16_t ecb_rotr16 (uint16_t x, unsigned int count)
296 638
297=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count) 639=item uint32_t ecb_rotr32 (uint32_t x, unsigned int count)
298 640
299=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count) 641=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count)
300 642
301These two functions return the value of C<x> after shifting all the bits 643These two families of functions return the value of C<x> after rotating
302by C<count> positions to the right or left respectively. 644all the bits by C<count> positions to the right (C<ecb_rotr>) or left
645(C<ecb_rotl>).
646
647Current GCC versions understand these functions and usually compile them
648to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on
649x86).
303 650
304=back 651=back
305 652
653=head2 FLOATING POINT FIDDLING
654
655=over 4
656
657=item uint32_t ecb_float_to_binary32 (float x) [-UECB_NO_LIBM]
658
659=item uint64_t ecb_double_to_binary64 (double x) [-UECB_NO_LIBM]
660
661These functions each take an argument in the native C<float> or C<double>
662type and return the IEEE 754 bit representation of it.
663
664The bit representation is just as IEEE 754 defines it, i.e. the sign bit
665will be the most significant bit, followed by exponent and mantissa.
666
667This function should work even when the native floating point format isn't
668IEEE compliant, of course at a speed and code size penalty, and of course
669also within reasonable limits (it tries to convert NaNs, infinities and
670denormals, but will likely convert negative zero to positive zero).
671
672On all modern platforms (where C<ECB_STDFP> is true), the compiler should
673be able to optimise away this function completely.
674
675These functions can be helpful when serialising floats to the network - you
676can serialise the return value like a normal uint32_t/uint64_t.
677
678Another use for these functions is to manipulate floating point values
679directly.
680
681Silly example: toggle the sign bit of a float.
682
683 /* On gcc-4.7 on amd64, */
684 /* this results in a single add instruction to toggle the bit, and 4 extra */
685 /* instructions to move the float value to an integer register and back. */
686
687 x = ecb_binary32_to_float (ecb_float_to_binary32 (x) ^ 0x80000000U)
688
689=item float ecb_binary32_to_float (uint32_t x) [-UECB_NO_LIBM]
690
691=item double ecb_binary32_to_double (uint64_t x) [-UECB_NO_LIBM]
692
693The reverse operation of the previos function - takes the bit representation
694of an IEEE binary32 or binary64 number and converts it to the native C<float>
695or C<double> format.
696
697This function should work even when the native floating point format isn't
698IEEE compliant, of course at a speed and code size penalty, and of course
699also within reasonable limits (it tries to convert normals and denormals,
700and might be lucky for infinities, and with extraordinary luck, also for
701negative zero).
702
703On all modern platforms (where C<ECB_STDFP> is true), the compiler should
704be able to optimise away this function completely.
705
706=back
707
306=head2 ARITHMETIC 708=head2 ARITHMETIC
307 709
308=over 4 710=over 4
309 711
310=item x = ecb_mod (m, n) 712=item x = ecb_mod (m, n)
311 713
312Returns the positive remainder of the modulo operation between C<m> and 714Returns C<m> modulo C<n>, which is the same as the positive remainder
715of the division operation between C<m> and C<n>, using floored
313C<n>. Unlike the C moduloe operator C<%>, this function ensures that the 716division. Unlike the C remainder operator C<%>, this function ensures that
314return value is always positive). 717the return value is always positive and that the two numbers I<m> and
718I<m' = m + i * n> result in the same value modulo I<n> - in other words,
719C<ecb_mod> implements the mathematical modulo operation, which is missing
720in the language.
315 721
316C<n> must be strictly positive (i.e. C<< >1 >>), while C<m> must be 722C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
317negatable, that is, both C<m> and C<-m> must be representable in its 723negatable, that is, both C<m> and C<-m> must be representable in its
318type. 724type (this typically excludes the minimum signed integer value, the same
725limitation as for C</> and C<%> in C).
726
727Current GCC versions compile this into an efficient branchless sequence on
728almost all CPUs.
729
730For example, when you want to rotate forward through the members of an
731array for increasing C<m> (which might be negative), then you should use
732C<ecb_mod>, as the C<%> operator might give either negative results, or
733change direction for negative values:
734
735 for (m = -100; m <= 100; ++m)
736 int elem = myarray [ecb_mod (m, ecb_array_length (myarray))];
737
738=item x = ecb_div_rd (val, div)
739
740=item x = ecb_div_ru (val, div)
741
742Returns C<val> divided by C<div> rounded down or up, respectively.
743C<val> and C<div> must have integer types and C<div> must be strictly
744positive. Note that these functions are implemented with macros in C
745and with function templates in C++.
319 746
320=back 747=back
321 748
322=head2 UTILITY 749=head2 UTILITY
323 750
324=over 4 751=over 4
325 752
326=item element_count = ecb_array_length (name) [MACRO] 753=item element_count = ecb_array_length (name)
327 754
328Returns the number of elements in the array C<name>. For example: 755Returns the number of elements in the array C<name>. For example:
329 756
330 int primes[] = { 2, 3, 5, 7, 11 }; 757 int primes[] = { 2, 3, 5, 7, 11 };
331 int sum = 0; 758 int sum = 0;
333 for (i = 0; i < ecb_array_length (primes); i++) 760 for (i = 0; i < ecb_array_length (primes); i++)
334 sum += primes [i]; 761 sum += primes [i];
335 762
336=back 763=back
337 764
765=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
338 766
767These symbols need to be defined before including F<ecb.h> the first time.
768
769=over 4
770
771=item ECB_NO_THREADS
772
773If F<ecb.h> is never used from multiple threads, then this symbol can
774be defined, in which case memory fences (and similar constructs) are
775completely removed, leading to more efficient code and fewer dependencies.
776
777Setting this symbol to a true value implies C<ECB_NO_SMP>.
778
779=item ECB_NO_SMP
780
781The weaker version of C<ECB_NO_THREADS> - if F<ecb.h> is used from
782multiple threads, but never concurrently (e.g. if the system the program
783runs on has only a single CPU with a single core, no hyperthreading and so
784on), then this symbol can be defined, leading to more efficient code and
785fewer dependencies.
786
787=item ECB_NO_LIBM
788
789When defined to C<1>, do not export any functions that might introduce
790dependencies on the math library (usually called F<-lm>) - these are
791marked with [-UECB_NO_LIBM].
792
793=back
794
795

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines