ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libecb/ecb.pod
(Generate patch)

Comparing libecb/ecb.pod (file contents):
Revision 1.81 by root, Mon Jan 20 21:01:29 2020 UTC vs.
Revision 1.89 by root, Mon Jun 21 23:59:58 2021 UTC

10 10
11Its homepage can be found here: 11Its homepage can be found here:
12 12
13 http://software.schmorp.de/pkg/libecb 13 http://software.schmorp.de/pkg/libecb
14 14
15It mainly provides a number of wrappers around GCC built-ins, together 15It mainly provides a number of wrappers around many compiler built-ins,
16with replacement functions for other compilers. In addition to this, 16together with replacement functions for other compilers. In addition
17it provides a number of other lowlevel C utilities, such as endianness 17to this, it provides a number of other lowlevel C utilities, such as
18detection, byte swapping or bit rotations. 18endianness detection, byte swapping or bit rotations.
19 19
20Or in other words, things that should be built into any standard C system, 20Or in other words, things that should be built into any standard C
21but aren't, implemented as efficient as possible with GCC, and still 21system, but aren't, implemented as efficient as possible with GCC (clang,
22correct with other compilers. 22msvc...), and still correct with other compilers.
23 23
24More might come. 24More might come.
25 25
26=head2 ABOUT THE HEADER 26=head2 ABOUT THE HEADER
27 27
80 80
81All the following symbols expand to an expression that can be tested in 81All the following symbols expand to an expression that can be tested in
82preprocessor instructions as well as treated as a boolean (use C<!!> to 82preprocessor instructions as well as treated as a boolean (use C<!!> to
83ensure it's either C<0> or C<1> if you need that). 83ensure it's either C<0> or C<1> if you need that).
84 84
85=over 4 85=over
86 86
87=item ECB_C 87=item ECB_C
88 88
89True if the implementation defines the C<__STDC__> macro to a true value, 89True if the implementation defines the C<__STDC__> macro to a true value,
90while not claiming to be C++. 90while not claiming to be C++, i..e C, but not C++.
91 91
92=item ECB_C99 92=item ECB_C99
93 93
94True if the implementation claims to be compliant to C99 (ISO/IEC 94True if the implementation claims to be compliant to C99 (ISO/IEC
959899:1999) or any later version, while not claiming to be C++. 959899:1999) or any later version, while not claiming to be C++.
109 109
110=item ECB_CPP11, ECB_CPP14, ECB_CPP17 110=item ECB_CPP11, ECB_CPP14, ECB_CPP17
111 111
112True if the implementation claims to be compliant to C++11/C++14/C++17 112True if the implementation claims to be compliant to C++11/C++14/C++17
113(ISO/IEC 14882:2011, :2014, :2017) or any later version. 113(ISO/IEC 14882:2011, :2014, :2017) or any later version.
114
115Note that many C++20 features will likely have their own feature test
116macros (see e.g. L<http://eel.is/c++draft/cpp.predefined#1.8>).
114 117
115=item ECB_OPTIMIZE_SIZE 118=item ECB_OPTIMIZE_SIZE
116 119
117Is C<1> when the compiler optimizes for size, C<0> otherwise. This symbol 120Is C<1> when the compiler optimizes for size, C<0> otherwise. This symbol
118can also be defined before including F<ecb.h>, in which case it will be 121can also be defined before including F<ecb.h>, in which case it will be
119unchanged. 122unchanged.
120 123
121=item ECB_GCC_VERSION (major, minor) 124=item ECB_GCC_VERSION (major, minor)
122 125
123Expands to a true value (suitable for testing in by the preprocessor) 126Expands to a true value (suitable for testing by the preprocessor) if the
124if the compiler used is GNU C and the version is the given version, or 127compiler used is GNU C and the version is the given version, or higher.
125higher.
126 128
127This macro tries to return false on compilers that claim to be GCC 129This macro tries to return false on compilers that claim to be GCC
128compatible but aren't. 130compatible but aren't.
129 131
130=item ECB_EXTERN_C 132=item ECB_EXTERN_C
149 151
150 ECB_EXTERN_C_END 152 ECB_EXTERN_C_END
151 153
152=item ECB_STDFP 154=item ECB_STDFP
153 155
154If this evaluates to a true value (suitable for testing in by the 156If this evaluates to a true value (suitable for testing by the
155preprocessor), then C<float> and C<double> use IEEE 754 single/binary32 157preprocessor), then C<float> and C<double> use IEEE 754 single/binary32
156and double/binary64 representations internally I<and> the endianness of 158and double/binary64 representations internally I<and> the endianness of
157both types match the endianness of C<uint32_t> and C<uint64_t>. 159both types match the endianness of C<uint32_t> and C<uint64_t>.
158 160
159This means you can just copy the bits of a C<float> (or C<double>) to an 161This means you can just copy the bits of a C<float> (or C<double>) to an
161without having to think about format or endianness. 163without having to think about format or endianness.
162 164
163This is true for basically all modern platforms, although F<ecb.h> might 165This is true for basically all modern platforms, although F<ecb.h> might
164not be able to deduce this correctly everywhere and might err on the safe 166not be able to deduce this correctly everywhere and might err on the safe
165side. 167side.
168
169=item ECB_64BIT_NATIVE
170
171Evaluates to a true value (suitable for both preprocessor and C code
172testing) if 64 bit integer types on this architecture are evaluated
173"natively", that is, with similar speeds as 32 bit integerss. While 64 bit
174integer support is very common (and in fatc required by libecb), 32 bit
175cpus have to emulate operations on them, so you might want to avoid them.
166 176
167=item ECB_AMD64, ECB_AMD64_X32 177=item ECB_AMD64, ECB_AMD64_X32
168 178
169These two macros are defined to C<1> on the x86_64/amd64 ABI and the X32 179These two macros are defined to C<1> on the x86_64/amd64 ABI and the X32
170ABI, respectively, and undefined elsewhere. 180ABI, respectively, and undefined elsewhere.
177 187
178=back 188=back
179 189
180=head2 MACRO TRICKERY 190=head2 MACRO TRICKERY
181 191
182=over 4 192=over
183 193
184=item ECB_CONCAT (a, b) 194=item ECB_CONCAT (a, b)
185 195
186Expands any macros in C<a> and C<b>, then concatenates the result to form 196Expands any macros in C<a> and C<b>, then concatenates the result to form
187a single token. This is mainly useful to form identifiers from components, 197a single token. This is mainly useful to form identifiers from components,
228declarations must be put before the whole declaration: 238declarations must be put before the whole declaration:
229 239
230 ecb_const int mysqrt (int a); 240 ecb_const int mysqrt (int a);
231 ecb_unused int i; 241 ecb_unused int i;
232 242
233=over 4 243=over
234 244
235=item ecb_unused 245=item ecb_unused
236 246
237Marks a function or a variable as "unused", which simply suppresses a 247Marks a function or a variable as "unused", which simply suppresses a
238warning by GCC when it detects it as unused. This is useful when you e.g. 248warning by the compiler when it detects it as unused. This is useful when
239declare a variable but do not always use it: 249you e.g. declare a variable but do not always use it:
240 250
241 { 251 {
242 ecb_unused int var; 252 ecb_unused int var;
243 253
244 #ifdef SOMECONDITION 254 #ifdef SOMECONDITION
412 422
413=back 423=back
414 424
415=head2 OPTIMISATION HINTS 425=head2 OPTIMISATION HINTS
416 426
417=over 4 427=over
418 428
419=item bool ecb_is_constant (expr) 429=item bool ecb_is_constant (expr)
420 430
421Returns true iff the expression can be deduced to be a compile-time 431Returns true iff the expression can be deduced to be a compile-time
422constant, and false otherwise. 432constant, and false otherwise.
579 589
580=back 590=back
581 591
582=head2 BIT FIDDLING / BIT WIZARDRY 592=head2 BIT FIDDLING / BIT WIZARDRY
583 593
584=over 4 594=over
585 595
586=item bool ecb_big_endian () 596=item bool ecb_big_endian ()
587 597
588=item bool ecb_little_endian () 598=item bool ecb_little_endian ()
589 599
723 733
724These two families of functions return the value of C<x> after rotating 734These two families of functions return the value of C<x> after rotating
725all the bits by C<count> positions to the right (C<ecb_rotr>) or left 735all the bits by C<count> positions to the right (C<ecb_rotr>) or left
726(C<ecb_rotl>). 736(C<ecb_rotl>).
727 737
728Current GCC versions understand these functions and usually compile them 738Current GCC/clang versions understand these functions and usually compile
729to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on 739them to "optimal" code (e.g. a single C<rol> or a combination of C<shld>
730x86). 740on x86).
731 741
732=item T ecb_rotl (T x, unsigned int count) [C++] 742=item T ecb_rotl (T x, unsigned int count) [C++]
733 743
734=item T ecb_rotr (T x, unsigned int count) [C++] 744=item T ecb_rotr (T x, unsigned int count) [C++]
735 745
739 749
740=back 750=back
741 751
742=head2 HOST ENDIANNESS CONVERSION 752=head2 HOST ENDIANNESS CONVERSION
743 753
744=over 4 754=over
745 755
746=item uint_fast16_t ecb_be_u16_to_host (uint_fast16_t v) 756=item uint_fast16_t ecb_be_u16_to_host (uint_fast16_t v)
747 757
748=item uint_fast32_t ecb_be_u32_to_host (uint_fast32_t v) 758=item uint_fast32_t ecb_be_u32_to_host (uint_fast32_t v)
749 759
777 787
778=back 788=back
779 789
780In C++ the following additional template functions are supported: 790In C++ the following additional template functions are supported:
781 791
782=over 4 792=over
783 793
784=item T ecb_be_to_host (T v) 794=item T ecb_be_to_host (T v)
785 795
786=item T ecb_le_to_host (T v) 796=item T ecb_le_to_host (T v)
787 797
788=item T ecb_host_to_be (T v) 798=item T ecb_host_to_be (T v)
789 799
790=item T ecb_host_to_le (T v) 800=item T ecb_host_to_le (T v)
801
802=back
791 803
792These functions work like their C counterparts, above, but use templates, 804These functions work like their C counterparts, above, but use templates,
793which make them useful in generic code. 805which make them useful in generic code.
794 806
795C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t> 807C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t>
798 810
799=head2 UNALIGNED LOAD/STORE 811=head2 UNALIGNED LOAD/STORE
800 812
801These function load or store unaligned multi-byte values. 813These function load or store unaligned multi-byte values.
802 814
803=over 4 815=over
804 816
805=item uint_fast16_t ecb_peek_u16_u (const void *ptr) 817=item uint_fast16_t ecb_peek_u16_u (const void *ptr)
806 818
807=item uint_fast32_t ecb_peek_u32_u (const void *ptr) 819=item uint_fast32_t ecb_peek_u32_u (const void *ptr)
808 820
852 864
853=back 865=back
854 866
855In C++ the following additional template functions are supported: 867In C++ the following additional template functions are supported:
856 868
857=over 4 869=over
858 870
859=item T ecb_peek<T> (const void *ptr) 871=item T ecb_peek<T> (const void *ptr)
860 872
861=item T ecb_peek_be<T> (const void *ptr) 873=item T ecb_peek_be<T> (const void *ptr)
862 874
904(C<uint8_t>) and also have an aligned version (without the C<_u> prefix), 916(C<uint8_t>) and also have an aligned version (without the C<_u> prefix),
905all of which hopefully makes them more useful in generic code. 917all of which hopefully makes them more useful in generic code.
906 918
907=back 919=back
908 920
921=head2 FAST INTEGER TO STRING
922
923Libecb defines a set of very fast integer to decimal string (or integer
924to ascii, short C<i2a>) functions. These work by converting the integer
925to a fixed point representation and then successively multiplying out
926the topmost digits. Unlike some other, also very fast, libraries, ecb's
927algorithm should be completely branchless per digit, and does not rely on
928the presence of special cpu functions (such as clz).
929
930There is a high level API that takes an C<int32_t>, C<uint32_t>,
931C<int64_t> or C<uint64_t> as argument, and a low-level API, which is
932harder to use but supports slightly more formatting options.
933
934=head3 HIGH LEVEL API
935
936The high level API consists of four functions, one each for C<int32_t>,
937C<uint32_t>, C<int64_t> and C<uint64_t>:
938
939=over
940
941=item ECB_I2A_I32_DIGITS (=11)
942
943=item char *ecb_i2a_u32 (char *ptr, uint32_t value)
944
945Takes an C<uint32_t> I<value> and formats it as a decimal number starting
946at I<ptr>, using at most C<ECB_I2A_I32_DIGITS> characters. Returns a
947pointer to just after the generated string, where you would normally put
948the temrinating C<0> character. This function outputs the minimum number
949of digits.
950
951=item ECB_I2A_U32_DIGITS (=10)
952
953=item char *ecb_i2a_i32 (char *ptr, int32_t value)
954
955Same as C<ecb_i2a_u32>, but formats a C<int32_t> value, including a minus
956sign if needed.
957
958=item ECB_I2A_I64_DIGITS (=20)
959
960=item char *ecb_i2a_u64 (char *ptr, uint64_t value)
961
962=item ECB_I2A_U64_DIGITS (=21)
963
964=item char *ecb_i2a_i64 (char *ptr, int64_t value)
965
966Similar to their 32 bit counterparts, these take a 64 bit argument.
967
968=item ECB_I2A_DIGITS (=21)
969
970Instead of using a type specific length macro, youi can just use
971C<ECB_I2A_DIGITS>, which is good enough for any C<ecb_i2a> function.
972
973=back
974
975=head3 LOW-LEVEL API
976
977The functions above use a number of low-level APIs which have some strict
978limitaitons, but cna be used as building blocks (study of C<ecb_i2a_i32>
979and related cunctions is recommended).
980
981There are three families of functions: functions that convert a number
982to a fixed number of digits with leading zeroes (C<ecb_i2a_0N>, C<0>
983for "leading zeroes"), functions that generate up to N digits, skipping
984leading zeroes (C<_N>), and functions that can generate more digits, but
985the leading digit has limited range (C<_xN>).
986
987None of the functions deal with negative numbera.
988
989=over
990
991=item char *ecb_i2a_02 (char *ptr, uint32_t value) // 32 bit
992
993=item char *ecb_i2a_03 (char *ptr, uint32_t value) // 32 bit
994
995=item char *ecb_i2a_04 (char *ptr, uint32_t value) // 32 bit
996
997=item char *ecb_i2a_05 (char *ptr, uint32_t value) // 64 bit
998
999=item char *ecb_i2a_06 (char *ptr, uint32_t value) // 64 bit
1000
1001=item char *ecb_i2a_07 (char *ptr, uint32_t value) // 64 bit
1002
1003=item char *ecb_i2a_08 (char *ptr, uint32_t value) // 64 bit
1004
1005=item char *ecb_i2a_09 (char *ptr, uint32_t value) // 64 bit
1006
1007The C<< ecb_i2a_0I<N> > functions take an unsigned I<value> and convert
1008them to exactly I<N> digits, returning a pointer to the first character
1009after the digits. The I<value> must be in range. The functions marked with
1010I<32 bit> do their calculations internally in 32 bit, the ones marked with
1011I<64 bit> internally use 64 bit integers, which might be slow on 32 bit
1012architectures (the high level API decides on 32 vs. 64 bit versions using
1013C<ECB_64BIT_NATIVE>).
1014
1015=item char *ecb_i2a_2 (char *ptr, uint32_t value) // 32 bit
1016
1017=item char *ecb_i2a_3 (char *ptr, uint32_t value) // 32 bit
1018
1019=item char *ecb_i2a_4 (char *ptr, uint32_t value) // 32 bit
1020
1021=item char *ecb_i2a_5 (char *ptr, uint32_t value) // 64 bit
1022
1023=item char *ecb_i2a_6 (char *ptr, uint32_t value) // 64 bit
1024
1025=item char *ecb_i2a_7 (char *ptr, uint32_t value) // 64 bit
1026
1027=item char *ecb_i2a_8 (char *ptr, uint32_t value) // 64 bit
1028
1029=item char *ecb_i2a_9 (char *ptr, uint32_t value) // 64 bit
1030
1031Similarly, the C<< ecb_i2a_I<N> > functions take an unsigned I<value>
1032and convert them to at most I<N> digits, suppressing leading zeroes, and
1033returning a pointer to the first character after the digits.
1034
1035=item ECB_I2A_MAX_X5 (=59074)
1036
1037=item char *ecb_i2a_x5 (char *ptr, uint32_t value) // 32 bit
1038
1039=item ECB_I2A_MAX_X10 (=2932500665)
1040
1041=item char *ecb_i2a_x10 (char *ptr, uint32_t value) // 64 bit
1042
1043The C<< ecb_i2a_xI<N> >> functions are similar to the C<< ecb_i2a_I<N> >
1044functions, but they can generate one digit more, as long as the number
1045is within range, which is given by the symbols C<ECB_I2A_MAX_X5> (almost
104616 bit range) and C<ECB_I2A_MAX_X10> (a bit more than 31 bit range),
1047respectively.
1048
1049For example, the sigit part of a 32 bit signed integer just fits into the
1050C<ECB_I2A_MAX_X10> range, so while C<ecb_i2a_x10> cannot convert a 10
1051digit number, it can convert all 32 bit signed numbers. Sadly, it's not
1052good enough for 32 bit unsigned numbers.
1053
1054=back
1055
909=head2 FLOATING POINT FIDDLING 1056=head2 FLOATING POINT FIDDLING
910 1057
911=over 4 1058=over
912 1059
913=item ECB_INFINITY [-UECB_NO_LIBM] 1060=item ECB_INFINITY [-UECB_NO_LIBM]
914 1061
915Evaluates to positive infinity if supported by the platform, otherwise to 1062Evaluates to positive infinity if supported by the platform, otherwise to
916a truly huge number. 1063a truly huge number.
994 1141
995=back 1142=back
996 1143
997=head2 ARITHMETIC 1144=head2 ARITHMETIC
998 1145
999=over 4 1146=over
1000 1147
1001=item x = ecb_mod (m, n) 1148=item x = ecb_mod (m, n)
1002 1149
1003Returns C<m> modulo C<n>, which is the same as the positive remainder 1150Returns C<m> modulo C<n>, which is the same as the positive remainder
1004of the division operation between C<m> and C<n>, using floored 1151of the division operation between C<m> and C<n>, using floored
1011C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be 1158C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
1012negatable, that is, both C<m> and C<-m> must be representable in its 1159negatable, that is, both C<m> and C<-m> must be representable in its
1013type (this typically excludes the minimum signed integer value, the same 1160type (this typically excludes the minimum signed integer value, the same
1014limitation as for C</> and C<%> in C). 1161limitation as for C</> and C<%> in C).
1015 1162
1016Current GCC versions compile this into an efficient branchless sequence on 1163Current GCC/clang versions compile this into an efficient branchless
1017almost all CPUs. 1164sequence on almost all CPUs.
1018 1165
1019For example, when you want to rotate forward through the members of an 1166For example, when you want to rotate forward through the members of an
1020array for increasing C<m> (which might be negative), then you should use 1167array for increasing C<m> (which might be negative), then you should use
1021C<ecb_mod>, as the C<%> operator might give either negative results, or 1168C<ecb_mod>, as the C<%> operator might give either negative results, or
1022change direction for negative values: 1169change direction for negative values:
1035 1182
1036=back 1183=back
1037 1184
1038=head2 UTILITY 1185=head2 UTILITY
1039 1186
1040=over 4 1187=over
1041 1188
1042=item element_count = ecb_array_length (name) 1189=item element_count = ecb_array_length (name)
1043 1190
1044Returns the number of elements in the array C<name>. For example: 1191Returns the number of elements in the array C<name>. For example:
1045 1192
1053 1200
1054=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF 1201=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
1055 1202
1056These symbols need to be defined before including F<ecb.h> the first time. 1203These symbols need to be defined before including F<ecb.h> the first time.
1057 1204
1058=over 4 1205=over
1059 1206
1060=item ECB_NO_THREADS 1207=item ECB_NO_THREADS
1061 1208
1062If F<ecb.h> is never used from multiple threads, then this symbol can 1209If F<ecb.h> is never used from multiple threads, then this symbol can
1063be defined, in which case memory fences (and similar constructs) are 1210be defined, in which case memory fences (and similar constructs) are

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines