ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.38 by sf-exg, Thu Aug 25 16:06:08 2011 UTC vs.
Revision 1.98 by root, Fri Aug 20 20:06:47 2021 UTC

10 10
11Its homepage can be found here: 11Its homepage can be found here:
12 12
13 http://software.schmorp.de/pkg/libecb 13 http://software.schmorp.de/pkg/libecb
14 14
15It mainly provides a number of wrappers around GCC built-ins, together 15It mainly provides a number of wrappers around many compiler built-ins,
16with replacement functions for other compilers. In addition to this, 16together with replacement functions for other compilers. In addition
17it provides a number of other lowlevel C utilities, such as endianness 17to this, it provides a number of other lowlevel C utilities, such as
18detection, byte swapping or bit rotations. 18endianness detection, byte swapping or bit rotations.
19 19
20Or in other words, things that should be built into any standard C system, 20Or in other words, things that should be built into any standard C
21but aren't, implemented as efficient as possible with GCC, and still 21system, but aren't, implemented as efficient as possible with GCC (clang,
22correct with other compilers. 22msvc...), and still correct with other compilers.
23 23
24More might come. 24More might come.
25 25
26=head2 ABOUT THE HEADER 26=head2 ABOUT THE HEADER
27 27
54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then 54only a generic name is used (C<expr>, C<cond>, C<value> and so on), then
55the corresponding function relies on C to implement the correct types, and 55the corresponding function relies on C to implement the correct types, and
56is usually implemented as a macro. Specifically, a "bool" in this manual 56is usually implemented as a macro. Specifically, a "bool" in this manual
57refers to any kind of boolean value, not a specific type. 57refers to any kind of boolean value, not a specific type.
58 58
59=head2 TYPES / TYPE SUPPORT
60
61ecb.h makes sure that the following types are defined (in the expected way):
62
63 int8_t uint8_
64 int16_t uint16_t
65 int32_t uint32_
66 int64_t uint64_t
67 int_fast8_t uint_fast8_t
68 int_fast16_t uint_fast16_t
69 int_fast32_t uint_fast32_t
70 int_fast64_t uint_fast64_t
71 intptr_t uintptr_t
72
73The macro C<ECB_PTRSIZE> is defined to the size of a pointer on this
74platform (currently C<4> or C<8>) and can be used in preprocessor
75expressions.
76
77For C<ptrdiff_t> and C<size_t> use C<stddef.h>/C<cstddef>.
78
79=head2 LANGUAGE/ENVIRONMENT/COMPILER VERSIONS
80
81All the following symbols expand to an expression that can be tested in
82preprocessor instructions as well as treated as a boolean (use C<!!> to
83ensure it's either C<0> or C<1> if you need that).
84
85=over
86
87=item ECB_C
88
89True if the implementation defines the C<__STDC__> macro to a true value,
90while not claiming to be C++, i..e C, but not C++.
91
92=item ECB_C99
93
94True if the implementation claims to be compliant to C99 (ISO/IEC
959899:1999) or any later version, while not claiming to be C++.
96
97Note that later versions (ECB_C11) remove core features again (for
98example, variable length arrays).
99
100=item ECB_C11, ECB_C17
101
102True if the implementation claims to be compliant to C11/C17 (ISO/IEC
1039899:2011, :20187) or any later version, while not claiming to be C++.
104
105=item ECB_CPP
106
107True if the implementation defines the C<__cplusplus__> macro to a true
108value, which is typically true for C++ compilers.
109
110=item ECB_CPP11, ECB_CPP14, ECB_CPP17
111
112True if the implementation claims to be compliant to C++11/C++14/C++17
113(ISO/IEC 14882:2011, :2014, :2017) or any later version.
114
115Note that many C++20 features will likely have their own feature test
116macros (see e.g. L<http://eel.is/c++draft/cpp.predefined#1.8>).
117
118=item ECB_OPTIMIZE_SIZE
119
120Is C<1> when the compiler optimizes for size, C<0> otherwise. This symbol
121can also be defined before including F<ecb.h>, in which case it will be
122unchanged.
123
124=item ECB_GCC_VERSION (major, minor)
125
126Expands to a true value (suitable for testing by the preprocessor) if the
127compiler used is GNU C and the version is the given version, or higher.
128
129This macro tries to return false on compilers that claim to be GCC
130compatible but aren't.
131
132=item ECB_EXTERN_C
133
134Expands to C<extern "C"> in C++, and a simple C<extern> in C.
135
136This can be used to declare a single external C function:
137
138 ECB_EXTERN_C int printf (const char *format, ...);
139
140=item ECB_EXTERN_C_BEG / ECB_EXTERN_C_END
141
142These two macros can be used to wrap multiple C<extern "C"> definitions -
143they expand to nothing in C.
144
145They are most useful in header files:
146
147 ECB_EXTERN_C_BEG
148
149 int mycfun1 (int x);
150 int mycfun2 (int x);
151
152 ECB_EXTERN_C_END
153
154=item ECB_STDFP
155
156If this evaluates to a true value (suitable for testing by the
157preprocessor), then C<float> and C<double> use IEEE 754 single/binary32
158and double/binary64 representations internally I<and> the endianness of
159both types match the endianness of C<uint32_t> and C<uint64_t>.
160
161This means you can just copy the bits of a C<float> (or C<double>) to an
162C<uint32_t> (or C<uint64_t>) and get the raw IEEE 754 bit representation
163without having to think about format or endianness.
164
165This is true for basically all modern platforms, although F<ecb.h> might
166not be able to deduce this correctly everywhere and might err on the safe
167side.
168
169=item ECB_64BIT_NATIVE
170
171Evaluates to a true value (suitable for both preprocessor and C code
172testing) if 64 bit integer types on this architecture are evaluated
173"natively", that is, with similar speeds as 32 bit integers. While 64 bit
174integer support is very common (and in fact required by libecb), 32 bit
175cpus have to emulate operations on them, so you might want to avoid them.
176
177=item ECB_AMD64, ECB_AMD64_X32
178
179These two macros are defined to C<1> on the x86_64/amd64 ABI and the X32
180ABI, respectively, and undefined elsewhere.
181
182The designers of the new X32 ABI for some inexplicable reason decided to
183make it look exactly like amd64, even though it's completely incompatible
184to that ABI, breaking about every piece of software that assumed that
185C<__x86_64> stands for, well, the x86-64 ABI, making these macros
186necessary.
187
188=back
189
190=head2 MACRO TRICKERY
191
192=over
193
194=item ECB_CONCAT (a, b)
195
196Expands any macros in C<a> and C<b>, then concatenates the result to form
197a single token. This is mainly useful to form identifiers from components,
198e.g.:
199
200 #define S1 str
201 #define S2 cpy
202
203 ECB_CONCAT (S1, S2)(dst, src); // == strcpy (dst, src);
204
205=item ECB_STRINGIFY (arg)
206
207Expands any macros in C<arg> and returns the stringified version of
208it. This is mainly useful to get the contents of a macro in string form,
209e.g.:
210
211 #define SQL_LIMIT 100
212 sql_exec ("select * from table limit " ECB_STRINGIFY (SQL_LIMIT));
213
214=item ECB_STRINGIFY_EXPR (expr)
215
216Like C<ECB_STRINGIFY>, but additionally evaluates C<expr> to make sure it
217is a valid expression. This is useful to catch typos or cases where the
218macro isn't available:
219
220 #include <errno.h>
221
222 ECB_STRINGIFY (EDOM); // "33" (on my system at least)
223 ECB_STRINGIFY_EXPR (EDOM); // "33"
224
225 // now imagine we had a typo:
226
227 ECB_STRINGIFY (EDAM); // "EDAM"
228 ECB_STRINGIFY_EXPR (EDAM); // error: EDAM undefined
229
230=back
231
59=head2 GCC ATTRIBUTES 232=head2 ATTRIBUTES
60 233
61A major part of libecb deals with GCC attributes. These are additional 234A major part of libecb deals with additional attributes that can be
62attributes that you can assign to functions, variables and sometimes even 235assigned to functions, variables and sometimes even types - much like
63types - much like C<const> or C<volatile> in C. 236C<const> or C<volatile> in C. They are implemented using either GCC
64 237attributes or other compiler/language specific features. Attributes
65While GCC allows declarations to show up in many surprising places,
66but not in many expected places, the safest way is to put attribute
67declarations before the whole declaration: 238declarations must be put before the whole declaration:
68 239
69 ecb_const int mysqrt (int a); 240 ecb_const int mysqrt (int a);
70 ecb_unused int i; 241 ecb_unused int i;
71 242
72For variables, it is often nicer to put the attribute after the name, and
73avoid multiple declarations using commas:
74
75 int i ecb_unused;
76
77=over 4 243=over
78
79=item ecb_attribute ((attrs...))
80
81A simple wrapper that expands to C<__attribute__((attrs))> on GCC, and to
82nothing on other compilers, so the effect is that only GCC sees these.
83
84Example: use the C<deprecated> attribute on a function.
85
86 ecb_attribute((__deprecated__)) void
87 do_not_use_me_anymore (void);
88 244
89=item ecb_unused 245=item ecb_unused
90 246
91Marks a function or a variable as "unused", which simply suppresses a 247Marks a function or a variable as "unused", which simply suppresses a
92warning by GCC when it detects it as unused. This is useful when you e.g. 248warning by the compiler when it detects it as unused. This is useful when
93declare a variable but do not always use it: 249you e.g. declare a variable but do not always use it:
94 250
95 { 251 {
96 int var ecb_unused; 252 ecb_unused int var;
97 253
98 #ifdef SOMECONDITION 254 #ifdef SOMECONDITION
99 var = ...; 255 var = ...;
100 return var; 256 return var;
101 #else 257 #else
102 return 0; 258 return 0;
103 #endif 259 #endif
104 } 260 }
105 261
262=item ecb_deprecated
263
264Similar to C<ecb_unused>, but marks a function, variable or type as
265deprecated. This makes some compilers warn when the type is used.
266
267=item ecb_deprecated_message (message)
268
269Same as C<ecb_deprecated>, but if possible, the specified diagnostic is
270used instead of a generic depreciation message when the object is being
271used.
272
106=item ecb_inline 273=item ecb_inline
107 274
108This is not actually an attribute, but you use it like one. It expands 275Expands either to (a compiler-specific equivalent of) C<static inline> or
109either to C<static inline> or to just C<static>, if inline isn't 276to just C<static>, if inline isn't supported. It should be used to declare
110supported. It should be used to declare functions that should be inlined, 277functions that should be inlined, for code size or speed reasons.
111for code size or speed reasons.
112 278
113Example: inline this function, it surely will reduce codesize. 279Example: inline this function, it surely will reduce codesize.
114 280
115 ecb_inline int 281 ecb_inline int
116 negmul (int a, int b) 282 negmul (int a, int b)
118 return - (a * b); 284 return - (a * b);
119 } 285 }
120 286
121=item ecb_noinline 287=item ecb_noinline
122 288
123Prevent a function from being inlined - it might be optimised away, but 289Prevents a function from being inlined - it might be optimised away, but
124not inlined into other functions. This is useful if you know your function 290not inlined into other functions. This is useful if you know your function
125is rarely called and large enough for inlining not to be helpful. 291is rarely called and large enough for inlining not to be helpful.
126 292
127=item ecb_noreturn 293=item ecb_noreturn
128 294
138 } 304 }
139 305
140In this case, the compiler would probably be smart enough to deduce it on 306In this case, the compiler would probably be smart enough to deduce it on
141its own, so this is mainly useful for declarations. 307its own, so this is mainly useful for declarations.
142 308
309=item ecb_restrict
310
311Expands to the C<restrict> keyword or equivalent on compilers that support
312them, and to nothing on others. Must be specified on a pointer type or
313an array index to indicate that the memory doesn't alias with any other
314restricted pointer in the same scope.
315
316Example: multiply a vector, and allow the compiler to parallelise the
317loop, because it knows it doesn't overwrite input values.
318
319 void
320 multiply (ecb_restrict float *src,
321 ecb_restrict float *dst,
322 int len, float factor)
323 {
324 int i;
325
326 for (i = 0; i < len; ++i)
327 dst [i] = src [i] * factor;
328 }
329
143=item ecb_const 330=item ecb_const
144 331
145Declares that the function only depends on the values of its arguments, 332Declares that the function only depends on the values of its arguments,
146much like a mathematical function. It specifically does not read or write 333much like a mathematical function. It specifically does not read or write
147any memory any arguments might point to, global variables, or call any 334any memory any arguments might point to, global variables, or call any
207functions only called in exceptional or rare cases. 394functions only called in exceptional or rare cases.
208 395
209=item ecb_artificial 396=item ecb_artificial
210 397
211Declares the function as "artificial", in this case meaning that this 398Declares the function as "artificial", in this case meaning that this
212function is not really mean to be a function, but more like an accessor 399function is not really meant to be a function, but more like an accessor
213- many methods in C++ classes are mere accessor functions, and having a 400- many methods in C++ classes are mere accessor functions, and having a
214crash reported in such a method, or single-stepping through them, is not 401crash reported in such a method, or single-stepping through them, is not
215usually so helpful, especially when it's inlined to just a few instructions. 402usually so helpful, especially when it's inlined to just a few instructions.
216 403
217Marking them as artificial will instruct the debugger about just this, 404Marking them as artificial will instruct the debugger about just this,
235 422
236=back 423=back
237 424
238=head2 OPTIMISATION HINTS 425=head2 OPTIMISATION HINTS
239 426
240=over 4 427=over
241 428
242=item bool ecb_is_constant(expr) 429=item bool ecb_is_constant (expr)
243 430
244Returns true iff the expression can be deduced to be a compile-time 431Returns true iff the expression can be deduced to be a compile-time
245constant, and false otherwise. 432constant, and false otherwise.
246 433
247For example, when you have a C<rndm16> function that returns a 16 bit 434For example, when you have a C<rndm16> function that returns a 16 bit
265 return is_constant (n) && !(n & (n - 1)) 452 return is_constant (n) && !(n & (n - 1))
266 ? rndm16 () & (num - 1) 453 ? rndm16 () & (num - 1)
267 : (n * (uint32_t)rndm16 ()) >> 16; 454 : (n * (uint32_t)rndm16 ()) >> 16;
268 } 455 }
269 456
270=item bool ecb_expect (expr, value) 457=item ecb_expect (expr, value)
271 458
272Evaluates C<expr> and returns it. In addition, it tells the compiler that 459Evaluates C<expr> and returns it. In addition, it tells the compiler that
273the C<expr> evaluates to C<value> a lot, which can be used for static 460the C<expr> evaluates to C<value> a lot, which can be used for static
274branch optimisations. 461branch optimisations.
275 462
322 { 509 {
323 if (ecb_expect_false (current + size > end)) 510 if (ecb_expect_false (current + size > end))
324 real_reserve_method (size); /* presumably noinline */ 511 real_reserve_method (size); /* presumably noinline */
325 } 512 }
326 513
327=item bool ecb_assume (cond) 514=item ecb_assume (cond)
328 515
329Try to tell the compiler that some condition is true, even if it's not 516Tries to tell the compiler that some condition is true, even if it's not
330obvious. 517obvious. This is not a function, but a statement: it cannot be used in
518another expression.
331 519
332This can be used to teach the compiler about invariants or other 520This can be used to teach the compiler about invariants or other
333conditions that might improve code generation, but which are impossible to 521conditions that might improve code generation, but which are impossible to
334deduce form the code itself. 522deduce form the code itself.
335 523
352 540
353Then the compiler I<might> be able to optimise out the second call 541Then the compiler I<might> be able to optimise out the second call
354completely, as it knows that C<< current + 1 > end >> is false and the 542completely, as it knows that C<< current + 1 > end >> is false and the
355call will never be executed. 543call will never be executed.
356 544
357=item bool ecb_unreachable () 545=item ecb_unreachable ()
358 546
359This function does nothing itself, except tell the compiler that it will 547This function does nothing itself, except tell the compiler that it will
360never be executed. Apart from suppressing a warning in some cases, this 548never be executed. Apart from suppressing a warning in some cases, this
361function can be used to implement C<ecb_assume> or similar functions. 549function can be used to implement C<ecb_assume> or similar functionality.
362 550
363=item bool ecb_prefetch (addr, rw, locality) 551=item ecb_prefetch (addr, rw, locality)
364 552
365Tells the compiler to try to prefetch memory at the given C<addr>ess 553Tells the compiler to try to prefetch memory at the given C<addr>ess
366for either reading (C<rw> = 0) or writing (C<rw> = 1). A C<locality> of 554for either reading (C<rw> = 0) or writing (C<rw> = 1). A C<locality> of
367C<0> means that there will only be one access later, C<3> means that 555C<0> means that there will only be one access later, C<3> means that
368the data will likely be accessed very often, and values in between mean 556the data will likely be accessed very often, and values in between mean
369something... in between. The memory pointed to by the address does not 557something... in between. The memory pointed to by the address does not
370need to be accessible (it could be a null pointer for example), but C<rw> 558need to be accessible (it could be a null pointer for example), but C<rw>
371and C<locality> must be compile-time constants. 559and C<locality> must be compile-time constants.
372 560
561This is a statement, not a function: you cannot use it as part of an
562expression.
563
373An obvious way to use this is to prefetch some data far away, in a big 564An obvious way to use this is to prefetch some data far away, in a big
374array you loop over. This prefetches memory some 128 array elements later, 565array you loop over. This prefetches memory some 128 array elements later,
375in the hope that it will be ready when the CPU arrives at that location. 566in the hope that it will be ready when the CPU arrives at that location.
376 567
377 int sum = 0; 568 int sum = 0;
398 589
399=back 590=back
400 591
401=head2 BIT FIDDLING / BIT WIZARDRY 592=head2 BIT FIDDLING / BIT WIZARDRY
402 593
403=over 4 594=over
404 595
405=item bool ecb_big_endian () 596=item bool ecb_big_endian ()
406 597
407=item bool ecb_little_endian () 598=item bool ecb_little_endian ()
408 599
414 605
415=item int ecb_ctz32 (uint32_t x) 606=item int ecb_ctz32 (uint32_t x)
416 607
417=item int ecb_ctz64 (uint64_t x) 608=item int ecb_ctz64 (uint64_t x)
418 609
610=item int ecb_ctz (T x) [C++]
611
419Returns the index of the least significant bit set in C<x> (or 612Returns the index of the least significant bit set in C<x> (or
420equivalently the number of bits set to 0 before the least significant bit 613equivalently the number of bits set to 0 before the least significant bit
421set), starting from 0. If C<x> is 0 the result is undefined. 614set), starting from 0. If C<x> is 0 the result is undefined.
422 615
423For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>. 616For smaller types than C<uint32_t> you can safely use C<ecb_ctz32>.
424 617
618The overloaded C++ C<ecb_ctz> function supports C<uint8_t>, C<uint16_t>,
619C<uint32_t> and C<uint64_t> types.
620
425For example: 621For example:
426 622
427 ecb_ctz32 (3) = 0 623 ecb_ctz32 (3) = 0
428 ecb_ctz32 (6) = 1 624 ecb_ctz32 (6) = 1
429 625
626=item bool ecb_is_pot32 (uint32_t x)
627
628=item bool ecb_is_pot64 (uint32_t x)
629
630=item bool ecb_is_pot (T x) [C++]
631
632Returns true iff C<x> is a power of two or C<x == 0>.
633
634For smaller types than C<uint32_t> you can safely use C<ecb_is_pot32>.
635
636The overloaded C++ C<ecb_is_pot> function supports C<uint8_t>, C<uint16_t>,
637C<uint32_t> and C<uint64_t> types.
638
430=item int ecb_ld32 (uint32_t x) 639=item int ecb_ld32 (uint32_t x)
431 640
432=item int ecb_ld64 (uint64_t x) 641=item int ecb_ld64 (uint64_t x)
642
643=item int ecb_ld64 (T x) [C++]
433 644
434Returns the index of the most significant bit set in C<x>, or the number 645Returns the index of the most significant bit set in C<x>, or the number
435of digits the number requires in binary (so that C<< 2**ld <= x < 646of digits the number requires in binary (so that C<< 2**ld <= x <
4362**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is 6472**(ld+1) >>). If C<x> is 0 the result is undefined. A common use case is
437to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for 648to compute the integer binary logarithm, i.e. C<floor (log2 (n))>, for
442the given data type), while C<ecb_ld> returns how many bits the number 653the given data type), while C<ecb_ld> returns how many bits the number
443itself requires. 654itself requires.
444 655
445For smaller types than C<uint32_t> you can safely use C<ecb_ld32>. 656For smaller types than C<uint32_t> you can safely use C<ecb_ld32>.
446 657
658The overloaded C++ C<ecb_ld> function supports C<uint8_t>, C<uint16_t>,
659C<uint32_t> and C<uint64_t> types.
660
447=item int ecb_popcount32 (uint32_t x) 661=item int ecb_popcount32 (uint32_t x)
448 662
449=item int ecb_popcount64 (uint64_t x) 663=item int ecb_popcount64 (uint64_t x)
450 664
665=item int ecb_popcount (T x) [C++]
666
451Returns the number of bits set to 1 in C<x>. 667Returns the number of bits set to 1 in C<x>.
452 668
453For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>. 669For smaller types than C<uint32_t> you can safely use C<ecb_popcount32>.
670
671The overloaded C++ C<ecb_popcount> function supports C<uint8_t>, C<uint16_t>,
672C<uint32_t> and C<uint64_t> types.
454 673
455For example: 674For example:
456 675
457 ecb_popcount32 (7) = 3 676 ecb_popcount32 (7) = 3
458 ecb_popcount32 (255) = 8 677 ecb_popcount32 (255) = 8
459 678
679=item uint8_t ecb_bitrev8 (uint8_t x)
680
681=item uint16_t ecb_bitrev16 (uint16_t x)
682
683=item uint32_t ecb_bitrev32 (uint32_t x)
684
685=item T ecb_bitrev (T x) [C++]
686
687Reverses the bits in x, i.e. the MSB becomes the LSB, MSB-1 becomes LSB+1
688and so on.
689
690The overloaded C++ C<ecb_bitrev> function supports C<uint8_t>, C<uint16_t> and C<uint32_t> types.
691
692Example:
693
694 ecb_bitrev8 (0xa7) = 0xea
695 ecb_bitrev32 (0xffcc4411) = 0x882233ff
696
697=item T ecb_bitrev (T x) [C++]
698
699Overloaded C++ bitrev function.
700
701C<T> must be one of C<uint8_t>, C<uint16_t> or C<uint32_t>.
702
460=item uint32_t ecb_bswap16 (uint32_t x) 703=item uint32_t ecb_bswap16 (uint32_t x)
461 704
462=item uint32_t ecb_bswap32 (uint32_t x) 705=item uint32_t ecb_bswap32 (uint32_t x)
463 706
464=item uint64_t ecb_bswap64 (uint64_t x) 707=item uint64_t ecb_bswap64 (uint64_t x)
708
709=item T ecb_bswap (T x)
465 710
466These functions return the value of the 16-bit (32-bit, 64-bit) value 711These functions return the value of the 16-bit (32-bit, 64-bit) value
467C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in 712C<x> after reversing the order of bytes (0x11223344 becomes 0x44332211 in
468C<ecb_bswap32>). 713C<ecb_bswap32>).
469 714
715The overloaded C++ C<ecb_bswap> function supports C<uint8_t>, C<uint16_t>,
716C<uint32_t> and C<uint64_t> types.
717
470=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count) 718=item uint8_t ecb_rotl8 (uint8_t x, unsigned int count)
471 719
472=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count) 720=item uint16_t ecb_rotl16 (uint16_t x, unsigned int count)
473 721
474=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count) 722=item uint32_t ecb_rotl32 (uint32_t x, unsigned int count)
483 731
484=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count) 732=item uint64_t ecb_rotr64 (uint64_t x, unsigned int count)
485 733
486These two families of functions return the value of C<x> after rotating 734These two families of functions return the value of C<x> after rotating
487all the bits by C<count> positions to the right (C<ecb_rotr>) or left 735all the bits by C<count> positions to the right (C<ecb_rotr>) or left
488(C<ecb_rotl>). 736(C<ecb_rotl>). There are no restrictions on the value C<count>, i.e. both
737zero and values equal or larger than the word width work correctly. Also,
738notwithstanding C<count> being unsigned, negative numbers work and shift
739to the opposite direction.
489 740
490Current GCC versions understand these functions and usually compile them 741Current GCC/clang versions understand these functions and usually compile
491to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on 742them to "optimal" code (e.g. a single C<rol> or a combination of C<shld>
492x86). 743on x86).
744
745=item T ecb_rotl (T x, unsigned int count) [C++]
746
747=item T ecb_rotr (T x, unsigned int count) [C++]
748
749Overloaded C++ rotl/rotr functions.
750
751C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t>.
752
753=back
754
755=head2 BIT MIXING, HASHING
756
757Sometimes you have an integer and want to distribute its bits well, for
758example, to use it as a hash in a hashtable. A common example is pointer
759values, which often only have a limited range (e.g. low and high bits are
760often zero).
761
762The following functions try to mix the bits to get a good bias-free
763distribution. They were mainly made for pointers, but the underlying
764integer functions are exposed as well.
765
766As an added benefit, the functions are reversible, so if you find it
767convenient to store only the hash value, you can recover the original
768pointer from the hash ("unmix"), as long as your pinters are 32 or 64 bit
769(if this isn't the case on your platform, drop us a note and we will add
770functions for other bit widths).
771
772The unmix functions are very slightly slower than the mix functions, so
773it is equally very slightly preferable to store the original values wehen
774convenient.
775
776The underlying algorithm if subject to change, so currently these
777functions are not suitable for persistent hash tables, as their result
778value can change between diferent versions of libecb.
779
780=over
781
782=item uintptr_t ecb_ptrmix (void *ptr)
783
784Mixes the bits of a pointer so the result is suitable for hash table
785lookups. In other words, this hashes the pointer value.
786
787=item uintptr_t ecb_ptrmix (T *ptr) [C++]
788
789Overload the C<ecb_ptrmix> function to work for any pointer in C++.
790
791=item void *ecb_ptrunmix (uintptr_t v)
792
793Unmix the hash value into the original pointer. This only works as long
794as the hash value is not truncated, i.e. you used C<uintptr_t> (or
795equivalent) throughout to store it.
796
797=item T *ecb_ptrunmix<T> (uintptr_t v) [C++]
798
799The somewhat less useful template version of C<ecb_ptrunmix> for
800C++. Example:
801
802 sometype *myptr;
803 uintptr_t hash = ecb_ptrmix (myptr);
804 sometype *orig = ecb_ptrunmix<sometype> (hash);
805
806=item uint32_t ecb_mix32 (uint32_t v)
807
808=item uint64_t ecb_mix64 (uint64_t v)
809
810Sometimes you don't have a pointer but an integer whose values are very
811badly distributed. In this case you cna sue these integer versions of the
812mixing function. No C++ template is provided currently.
813
814=item uint32_t ecb_unmix32 (uint32_t v)
815
816=item uint64_t ecb_unmix64 (uint64_t v)
817
818The reverse of the C<ecb_mix> functions - they take a mixed/hashed value
819and recover the original value.
820
821=back
822
823=head2 HOST ENDIANNESS CONVERSION
824
825=over
826
827=item uint_fast16_t ecb_be_u16_to_host (uint_fast16_t v)
828
829=item uint_fast32_t ecb_be_u32_to_host (uint_fast32_t v)
830
831=item uint_fast64_t ecb_be_u64_to_host (uint_fast64_t v)
832
833=item uint_fast16_t ecb_le_u16_to_host (uint_fast16_t v)
834
835=item uint_fast32_t ecb_le_u32_to_host (uint_fast32_t v)
836
837=item uint_fast64_t ecb_le_u64_to_host (uint_fast64_t v)
838
839Convert an unsigned 16, 32 or 64 bit value from big or little endian to host byte order.
840
841The naming convention is C<ecb_>(C<be>|C<le>)C<_u>C<16|32|64>C<_to_host>,
842where C<be> and C<le> stand for big endian and little endian, respectively.
843
844=item uint_fast16_t ecb_host_to_be_u16 (uint_fast16_t v)
845
846=item uint_fast32_t ecb_host_to_be_u32 (uint_fast32_t v)
847
848=item uint_fast64_t ecb_host_to_be_u64 (uint_fast64_t v)
849
850=item uint_fast16_t ecb_host_to_le_u16 (uint_fast16_t v)
851
852=item uint_fast32_t ecb_host_to_le_u32 (uint_fast32_t v)
853
854=item uint_fast64_t ecb_host_to_le_u64 (uint_fast64_t v)
855
856Like above, but converts I<from> host byte order to the specified
857endianness.
858
859=back
860
861In C++ the following additional template functions are supported:
862
863=over
864
865=item T ecb_be_to_host (T v)
866
867=item T ecb_le_to_host (T v)
868
869=item T ecb_host_to_be (T v)
870
871=item T ecb_host_to_le (T v)
872
873=back
874
875These functions work like their C counterparts, above, but use templates,
876which make them useful in generic code.
877
878C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t>
879(so unlike their C counterparts, there is a version for C<uint8_t>, which
880again can be useful in generic code).
881
882=head2 UNALIGNED LOAD/STORE
883
884These function load or store unaligned multi-byte values.
885
886=over
887
888=item uint_fast16_t ecb_peek_u16_u (const void *ptr)
889
890=item uint_fast32_t ecb_peek_u32_u (const void *ptr)
891
892=item uint_fast64_t ecb_peek_u64_u (const void *ptr)
893
894These functions load an unaligned, unsigned 16, 32 or 64 bit value from
895memory.
896
897=item uint_fast16_t ecb_peek_be_u16_u (const void *ptr)
898
899=item uint_fast32_t ecb_peek_be_u32_u (const void *ptr)
900
901=item uint_fast64_t ecb_peek_be_u64_u (const void *ptr)
902
903=item uint_fast16_t ecb_peek_le_u16_u (const void *ptr)
904
905=item uint_fast32_t ecb_peek_le_u32_u (const void *ptr)
906
907=item uint_fast64_t ecb_peek_le_u64_u (const void *ptr)
908
909Like above, but additionally convert from big endian (C<be>) or little
910endian (C<le>) byte order to host byte order while doing so.
911
912=item ecb_poke_u16_u (void *ptr, uint16_t v)
913
914=item ecb_poke_u32_u (void *ptr, uint32_t v)
915
916=item ecb_poke_u64_u (void *ptr, uint64_t v)
917
918These functions store an unaligned, unsigned 16, 32 or 64 bit value to
919memory.
920
921=item ecb_poke_be_u16_u (void *ptr, uint_fast16_t v)
922
923=item ecb_poke_be_u32_u (void *ptr, uint_fast32_t v)
924
925=item ecb_poke_be_u64_u (void *ptr, uint_fast64_t v)
926
927=item ecb_poke_le_u16_u (void *ptr, uint_fast16_t v)
928
929=item ecb_poke_le_u32_u (void *ptr, uint_fast32_t v)
930
931=item ecb_poke_le_u64_u (void *ptr, uint_fast64_t v)
932
933Like above, but additionally convert from host byte order to big endian
934(C<be>) or little endian (C<le>) byte order while doing so.
935
936=back
937
938In C++ the following additional template functions are supported:
939
940=over
941
942=item T ecb_peek<T> (const void *ptr)
943
944=item T ecb_peek_be<T> (const void *ptr)
945
946=item T ecb_peek_le<T> (const void *ptr)
947
948=item T ecb_peek_u<T> (const void *ptr)
949
950=item T ecb_peek_be_u<T> (const void *ptr)
951
952=item T ecb_peek_le_u<T> (const void *ptr)
953
954Similarly to their C counterparts, these functions load an unsigned 8, 16,
95532 or 64 bit value from memory, with optional conversion from big/little
956endian.
957
958Since the type cannot be deduced, it has to be specified explicitly, e.g.
959
960 uint_fast16_t v = ecb_peek<uint16_t> (ptr);
961
962C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t>.
963
964Unlike their C counterparts, these functions support 8 bit quantities
965(C<uint8_t>) and also have an aligned version (without the C<_u> prefix),
966all of which hopefully makes them more useful in generic code.
967
968=item ecb_poke (void *ptr, T v)
969
970=item ecb_poke_be (void *ptr, T v)
971
972=item ecb_poke_le (void *ptr, T v)
973
974=item ecb_poke_u (void *ptr, T v)
975
976=item ecb_poke_be_u (void *ptr, T v)
977
978=item ecb_poke_le_u (void *ptr, T v)
979
980Again, similarly to their C counterparts, these functions store an
981unsigned 8, 16, 32 or z64 bit value to memory, with optional conversion to
982big/little endian.
983
984C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t>.
985
986Unlike their C counterparts, these functions support 8 bit quantities
987(C<uint8_t>) and also have an aligned version (without the C<_u> prefix),
988all of which hopefully makes them more useful in generic code.
989
990=back
991
992=head2 FAST INTEGER TO STRING
993
994Libecb defines a set of very fast integer to decimal string (or integer
995to ascii, short C<i2a>) functions. These work by converting the integer
996to a fixed point representation and then successively multiplying out
997the topmost digits. Unlike some other, also very fast, libraries, ecb's
998algorithm should be completely branchless per digit, and does not rely on
999the presence of special cpu functions (such as clz).
1000
1001There is a high level API that takes an C<int32_t>, C<uint32_t>,
1002C<int64_t> or C<uint64_t> as argument, and a low-level API, which is
1003harder to use but supports slightly more formatting options.
1004
1005=head3 HIGH LEVEL API
1006
1007The high level API consists of four functions, one each for C<int32_t>,
1008C<uint32_t>, C<int64_t> and C<uint64_t>:
1009
1010Example:
1011
1012 char buf[ECB_I2A_MAX_DIGITS + 1];
1013 char *end = ecb_i2a_i32 (buf, 17262);
1014 *end = 0;
1015 // buf now contains "17262"
1016
1017=over
1018
1019=item ECB_I2A_I32_DIGITS (=11)
1020
1021=item char *ecb_i2a_u32 (char *ptr, uint32_t value)
1022
1023Takes an C<uint32_t> I<value> and formats it as a decimal number starting
1024at I<ptr>, using at most C<ECB_I2A_I32_DIGITS> characters. Returns a
1025pointer to just after the generated string, where you would normally put
1026the terminating C<0> character. This function outputs the minimum number
1027of digits.
1028
1029=item ECB_I2A_U32_DIGITS (=10)
1030
1031=item char *ecb_i2a_i32 (char *ptr, int32_t value)
1032
1033Same as C<ecb_i2a_u32>, but formats a C<int32_t> value, including a minus
1034sign if needed.
1035
1036=item ECB_I2A_I64_DIGITS (=20)
1037
1038=item char *ecb_i2a_u64 (char *ptr, uint64_t value)
1039
1040=item ECB_I2A_U64_DIGITS (=21)
1041
1042=item char *ecb_i2a_i64 (char *ptr, int64_t value)
1043
1044Similar to their 32 bit counterparts, these take a 64 bit argument.
1045
1046=item ECB_I2A_MAX_DIGITS (=21)
1047
1048Instead of using a type specific length macro, you can just use
1049C<ECB_I2A_MAX_DIGITS>, which is good enough for any C<ecb_i2a> function.
1050
1051=back
1052
1053=head3 LOW-LEVEL API
1054
1055The functions above use a number of low-level APIs which have some strict
1056limitations, but can be used as building blocks (studying C<ecb_i2a_i32>
1057and related functions is recommended).
1058
1059There are three families of functions: functions that convert a number
1060to a fixed number of digits with leading zeroes (C<ecb_i2a_0N>, C<0>
1061for "leading zeroes"), functions that generate up to N digits, skipping
1062leading zeroes (C<_N>), and functions that can generate more digits, but
1063the leading digit has limited range (C<_xN>).
1064
1065None of the functions deal with negative numbers.
1066
1067Example: convert an IP address in an u32 into dotted-quad:
1068
1069 uint32_t ip = 0x0a000164; // 10.0.1.100
1070 char ips[3 * 4 + 3 + 1];
1071 char *ptr = ips;
1072 ptr = ecb_i2a_3 (ptr, ip >> 24 ); *ptr++ = '.';
1073 ptr = ecb_i2a_3 (ptr, (ip >> 16) & 0xff); *ptr++ = '.';
1074 ptr = ecb_i2a_3 (ptr, (ip >> 8) & 0xff); *ptr++ = '.';
1075 ptr = ecb_i2a_3 (ptr, ip & 0xff); *ptr++ = 0;
1076 printf ("ip: %s\n", ips); // prints "ip: 10.0.1.100"
1077
1078=over
1079
1080=item char *ecb_i2a_02 (char *ptr, uint32_t value) // 32 bit
1081
1082=item char *ecb_i2a_03 (char *ptr, uint32_t value) // 32 bit
1083
1084=item char *ecb_i2a_04 (char *ptr, uint32_t value) // 32 bit
1085
1086=item char *ecb_i2a_05 (char *ptr, uint32_t value) // 64 bit
1087
1088=item char *ecb_i2a_06 (char *ptr, uint32_t value) // 64 bit
1089
1090=item char *ecb_i2a_07 (char *ptr, uint32_t value) // 64 bit
1091
1092=item char *ecb_i2a_08 (char *ptr, uint32_t value) // 64 bit
1093
1094=item char *ecb_i2a_09 (char *ptr, uint32_t value) // 64 bit
1095
1096The C<< ecb_i2a_0I<N> > functions take an unsigned I<value> and convert
1097them to exactly I<N> digits, returning a pointer to the first character
1098after the digits. The I<value> must be in range. The functions marked with
1099I<32 bit> do their calculations internally in 32 bit, the ones marked with
1100I<64 bit> internally use 64 bit integers, which might be slow on 32 bit
1101architectures (the high level API decides on 32 vs. 64 bit versions using
1102C<ECB_64BIT_NATIVE>).
1103
1104=item char *ecb_i2a_2 (char *ptr, uint32_t value) // 32 bit
1105
1106=item char *ecb_i2a_3 (char *ptr, uint32_t value) // 32 bit
1107
1108=item char *ecb_i2a_4 (char *ptr, uint32_t value) // 32 bit
1109
1110=item char *ecb_i2a_5 (char *ptr, uint32_t value) // 64 bit
1111
1112=item char *ecb_i2a_6 (char *ptr, uint32_t value) // 64 bit
1113
1114=item char *ecb_i2a_7 (char *ptr, uint32_t value) // 64 bit
1115
1116=item char *ecb_i2a_8 (char *ptr, uint32_t value) // 64 bit
1117
1118=item char *ecb_i2a_9 (char *ptr, uint32_t value) // 64 bit
1119
1120Similarly, the C<< ecb_i2a_I<N> > functions take an unsigned I<value>
1121and convert them to at most I<N> digits, suppressing leading zeroes, and
1122returning a pointer to the first character after the digits.
1123
1124=item ECB_I2A_MAX_X5 (=59074)
1125
1126=item char *ecb_i2a_x5 (char *ptr, uint32_t value) // 32 bit
1127
1128=item ECB_I2A_MAX_X10 (=2932500665)
1129
1130=item char *ecb_i2a_x10 (char *ptr, uint32_t value) // 64 bit
1131
1132The C<< ecb_i2a_xI<N> >> functions are similar to the C<< ecb_i2a_I<N> >
1133functions, but they can generate one digit more, as long as the number
1134is within range, which is given by the symbols C<ECB_I2A_MAX_X5> (almost
113516 bit range) and C<ECB_I2A_MAX_X10> (a bit more than 31 bit range),
1136respectively.
1137
1138For example, the digit part of a 32 bit signed integer just fits into the
1139C<ECB_I2A_MAX_X10> range, so while C<ecb_i2a_x10> cannot convert a 10
1140digit number, it can convert all 32 bit signed numbers. Sadly, it's not
1141good enough for 32 bit unsigned numbers.
1142
1143=back
1144
1145=head2 FLOATING POINT FIDDLING
1146
1147=over
1148
1149=item ECB_INFINITY [-UECB_NO_LIBM]
1150
1151Evaluates to positive infinity if supported by the platform, otherwise to
1152a truly huge number.
1153
1154=item ECB_NAN [-UECB_NO_LIBM]
1155
1156Evaluates to a quiet NAN if supported by the platform, otherwise to
1157C<ECB_INFINITY>.
1158
1159=item float ecb_ldexpf (float x, int exp) [-UECB_NO_LIBM]
1160
1161Same as C<ldexpf>, but always available.
1162
1163=item uint32_t ecb_float_to_binary16 (float x) [-UECB_NO_LIBM]
1164
1165=item uint32_t ecb_float_to_binary32 (float x) [-UECB_NO_LIBM]
1166
1167=item uint64_t ecb_double_to_binary64 (double x) [-UECB_NO_LIBM]
1168
1169These functions each take an argument in the native C<float> or C<double>
1170type and return the IEEE 754 bit representation of it (binary16/half,
1171binary32/single or binary64/double precision).
1172
1173The bit representation is just as IEEE 754 defines it, i.e. the sign bit
1174will be the most significant bit, followed by exponent and mantissa.
1175
1176This function should work even when the native floating point format isn't
1177IEEE compliant, of course at a speed and code size penalty, and of course
1178also within reasonable limits (it tries to convert NaNs, infinities and
1179denormals, but will likely convert negative zero to positive zero).
1180
1181On all modern platforms (where C<ECB_STDFP> is true), the compiler should
1182be able to optimise away this function completely.
1183
1184These functions can be helpful when serialising floats to the network - you
1185can serialise the return value like a normal uint16_t/uint32_t/uint64_t.
1186
1187Another use for these functions is to manipulate floating point values
1188directly.
1189
1190Silly example: toggle the sign bit of a float.
1191
1192 /* On gcc-4.7 on amd64, */
1193 /* this results in a single add instruction to toggle the bit, and 4 extra */
1194 /* instructions to move the float value to an integer register and back. */
1195
1196 x = ecb_binary32_to_float (ecb_float_to_binary32 (x) ^ 0x80000000U)
1197
1198=item float ecb_binary16_to_float (uint16_t x) [-UECB_NO_LIBM]
1199
1200=item float ecb_binary32_to_float (uint32_t x) [-UECB_NO_LIBM]
1201
1202=item double ecb_binary64_to_double (uint64_t x) [-UECB_NO_LIBM]
1203
1204The reverse operation of the previous function - takes the bit
1205representation of an IEEE binary16, binary32 or binary64 number (half,
1206single or double precision) and converts it to the native C<float> or
1207C<double> format.
1208
1209This function should work even when the native floating point format isn't
1210IEEE compliant, of course at a speed and code size penalty, and of course
1211also within reasonable limits (it tries to convert normals and denormals,
1212and might be lucky for infinities, and with extraordinary luck, also for
1213negative zero).
1214
1215On all modern platforms (where C<ECB_STDFP> is true), the compiler should
1216be able to optimise away this function completely.
1217
1218=item uint16_t ecb_binary32_to_binary16 (uint32_t x)
1219
1220=item uint32_t ecb_binary16_to_binary32 (uint16_t x)
1221
1222Convert a IEEE binary32/single precision to binary16/half format, and vice
1223versa, handling all details (round-to-nearest-even, subnormals, infinity
1224and NaNs) correctly.
1225
1226These are functions are available under C<-DECB_NO_LIBM>, since
1227they do not rely on the platform floating point format. The
1228C<ecb_float_to_binary16> and C<ecb_binary16_to_float> functions are
1229usually what you want.
493 1230
494=back 1231=back
495 1232
496=head2 ARITHMETIC 1233=head2 ARITHMETIC
497 1234
498=over 4 1235=over
499 1236
500=item x = ecb_mod (m, n) 1237=item x = ecb_mod (m, n)
501 1238
502Returns C<m> modulo C<n>, which is the same as the positive remainder 1239Returns C<m> modulo C<n>, which is the same as the positive remainder
503of the division operation between C<m> and C<n>, using floored 1240of the division operation between C<m> and C<n>, using floored
510C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be 1247C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
511negatable, that is, both C<m> and C<-m> must be representable in its 1248negatable, that is, both C<m> and C<-m> must be representable in its
512type (this typically excludes the minimum signed integer value, the same 1249type (this typically excludes the minimum signed integer value, the same
513limitation as for C</> and C<%> in C). 1250limitation as for C</> and C<%> in C).
514 1251
515Current GCC versions compile this into an efficient branchless sequence on 1252Current GCC/clang versions compile this into an efficient branchless
516almost all CPUs. 1253sequence on almost all CPUs.
517 1254
518For example, when you want to rotate forward through the members of an 1255For example, when you want to rotate forward through the members of an
519array for increasing C<m> (which might be negative), then you should use 1256array for increasing C<m> (which might be negative), then you should use
520C<ecb_mod>, as the C<%> operator might give either negative results, or 1257C<ecb_mod>, as the C<%> operator might give either negative results, or
521change direction for negative values: 1258change direction for negative values:
534 1271
535=back 1272=back
536 1273
537=head2 UTILITY 1274=head2 UTILITY
538 1275
539=over 4 1276=over
540 1277
541=item element_count = ecb_array_length (name) 1278=item element_count = ecb_array_length (name)
542 1279
543Returns the number of elements in the array C<name>. For example: 1280Returns the number of elements in the array C<name>. For example:
544 1281
548 for (i = 0; i < ecb_array_length (primes); i++) 1285 for (i = 0; i < ecb_array_length (primes); i++)
549 sum += primes [i]; 1286 sum += primes [i];
550 1287
551=back 1288=back
552 1289
1290=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
553 1291
1292These symbols need to be defined before including F<ecb.h> the first time.
1293
1294=over
1295
1296=item ECB_NO_THREADS
1297
1298If F<ecb.h> is never used from multiple threads, then this symbol can
1299be defined, in which case memory fences (and similar constructs) are
1300completely removed, leading to more efficient code and fewer dependencies.
1301
1302Setting this symbol to a true value implies C<ECB_NO_SMP>.
1303
1304=item ECB_NO_SMP
1305
1306The weaker version of C<ECB_NO_THREADS> - if F<ecb.h> is used from
1307multiple threads, but never concurrently (e.g. if the system the program
1308runs on has only a single CPU with a single core, no hyperthreading and so
1309on), then this symbol can be defined, leading to more efficient code and
1310fewer dependencies.
1311
1312=item ECB_NO_LIBM
1313
1314When defined to C<1>, do not export any functions that might introduce
1315dependencies on the math library (usually called F<-lm>) - these are
1316marked with [-UECB_NO_LIBM].
1317
1318=back
1319
1320=head1 UNDOCUMENTED FUNCTIONALITY
1321
1322F<ecb.h> is full of undocumented functionality as well, some of which is
1323intended to be internal-use only, some of which we forgot to document, and
1324some of which we hide because we are not sure we will keep the interface
1325stable.
1326
1327While you are welcome to rummage around and use whatever you find useful
1328(we can't stop you), keep in mind that we will change undocumented
1329functionality in incompatible ways without thinking twice, while we are
1330considerably more conservative with documented things.
1331
1332=head1 AUTHORS
1333
1334C<libecb> is designed and maintained by:
1335
1336 Emanuele Giaquinta <e.giaquinta@glauco.it>
1337 Marc Alexander Lehmann <schmorp@schmorp.de>
1338
1339

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines