ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/cvsroot/libecb/ecb.pod
(Generate patch)

Comparing cvsroot/libecb/ecb.pod (file contents):
Revision 1.84 by root, Mon Jan 20 21:10:16 2020 UTC vs.
Revision 1.91 by root, Tue Jun 22 01:07:29 2021 UTC

10 10
11Its homepage can be found here: 11Its homepage can be found here:
12 12
13 http://software.schmorp.de/pkg/libecb 13 http://software.schmorp.de/pkg/libecb
14 14
15It mainly provides a number of wrappers around GCC built-ins, together 15It mainly provides a number of wrappers around many compiler built-ins,
16with replacement functions for other compilers. In addition to this, 16together with replacement functions for other compilers. In addition
17it provides a number of other lowlevel C utilities, such as endianness 17to this, it provides a number of other lowlevel C utilities, such as
18detection, byte swapping or bit rotations. 18endianness detection, byte swapping or bit rotations.
19 19
20Or in other words, things that should be built into any standard C system, 20Or in other words, things that should be built into any standard C
21but aren't, implemented as efficient as possible with GCC, and still 21system, but aren't, implemented as efficient as possible with GCC (clang,
22correct with other compilers. 22msvc...), and still correct with other compilers.
23 23
24More might come. 24More might come.
25 25
26=head2 ABOUT THE HEADER 26=head2 ABOUT THE HEADER
27 27
80 80
81All the following symbols expand to an expression that can be tested in 81All the following symbols expand to an expression that can be tested in
82preprocessor instructions as well as treated as a boolean (use C<!!> to 82preprocessor instructions as well as treated as a boolean (use C<!!> to
83ensure it's either C<0> or C<1> if you need that). 83ensure it's either C<0> or C<1> if you need that).
84 84
85=over 4 85=over
86 86
87=item ECB_C 87=item ECB_C
88 88
89True if the implementation defines the C<__STDC__> macro to a true value, 89True if the implementation defines the C<__STDC__> macro to a true value,
90while not claiming to be C++, i..e C, but not C++. 90while not claiming to be C++, i..e C, but not C++.
163without having to think about format or endianness. 163without having to think about format or endianness.
164 164
165This is true for basically all modern platforms, although F<ecb.h> might 165This is true for basically all modern platforms, although F<ecb.h> might
166not be able to deduce this correctly everywhere and might err on the safe 166not be able to deduce this correctly everywhere and might err on the safe
167side. 167side.
168
169=item ECB_64BIT_NATIVE
170
171Evaluates to a true value (suitable for both preprocessor and C code
172testing) if 64 bit integer types on this architecture are evaluated
173"natively", that is, with similar speeds as 32 bit integerss. While 64 bit
174integer support is very common (and in fatc required by libecb), 32 bit
175cpus have to emulate operations on them, so you might want to avoid them.
168 176
169=item ECB_AMD64, ECB_AMD64_X32 177=item ECB_AMD64, ECB_AMD64_X32
170 178
171These two macros are defined to C<1> on the x86_64/amd64 ABI and the X32 179These two macros are defined to C<1> on the x86_64/amd64 ABI and the X32
172ABI, respectively, and undefined elsewhere. 180ABI, respectively, and undefined elsewhere.
179 187
180=back 188=back
181 189
182=head2 MACRO TRICKERY 190=head2 MACRO TRICKERY
183 191
184=over 4 192=over
185 193
186=item ECB_CONCAT (a, b) 194=item ECB_CONCAT (a, b)
187 195
188Expands any macros in C<a> and C<b>, then concatenates the result to form 196Expands any macros in C<a> and C<b>, then concatenates the result to form
189a single token. This is mainly useful to form identifiers from components, 197a single token. This is mainly useful to form identifiers from components,
230declarations must be put before the whole declaration: 238declarations must be put before the whole declaration:
231 239
232 ecb_const int mysqrt (int a); 240 ecb_const int mysqrt (int a);
233 ecb_unused int i; 241 ecb_unused int i;
234 242
235=over 4 243=over
236 244
237=item ecb_unused 245=item ecb_unused
238 246
239Marks a function or a variable as "unused", which simply suppresses a 247Marks a function or a variable as "unused", which simply suppresses a
240warning by GCC when it detects it as unused. This is useful when you e.g. 248warning by the compiler when it detects it as unused. This is useful when
241declare a variable but do not always use it: 249you e.g. declare a variable but do not always use it:
242 250
243 { 251 {
244 ecb_unused int var; 252 ecb_unused int var;
245 253
246 #ifdef SOMECONDITION 254 #ifdef SOMECONDITION
414 422
415=back 423=back
416 424
417=head2 OPTIMISATION HINTS 425=head2 OPTIMISATION HINTS
418 426
419=over 4 427=over
420 428
421=item bool ecb_is_constant (expr) 429=item bool ecb_is_constant (expr)
422 430
423Returns true iff the expression can be deduced to be a compile-time 431Returns true iff the expression can be deduced to be a compile-time
424constant, and false otherwise. 432constant, and false otherwise.
581 589
582=back 590=back
583 591
584=head2 BIT FIDDLING / BIT WIZARDRY 592=head2 BIT FIDDLING / BIT WIZARDRY
585 593
586=over 4 594=over
587 595
588=item bool ecb_big_endian () 596=item bool ecb_big_endian ()
589 597
590=item bool ecb_little_endian () 598=item bool ecb_little_endian ()
591 599
725 733
726These two families of functions return the value of C<x> after rotating 734These two families of functions return the value of C<x> after rotating
727all the bits by C<count> positions to the right (C<ecb_rotr>) or left 735all the bits by C<count> positions to the right (C<ecb_rotr>) or left
728(C<ecb_rotl>). 736(C<ecb_rotl>).
729 737
730Current GCC versions understand these functions and usually compile them 738Current GCC/clang versions understand these functions and usually compile
731to "optimal" code (e.g. a single C<rol> or a combination of C<shld> on 739them to "optimal" code (e.g. a single C<rol> or a combination of C<shld>
732x86). 740on x86).
733 741
734=item T ecb_rotl (T x, unsigned int count) [C++] 742=item T ecb_rotl (T x, unsigned int count) [C++]
735 743
736=item T ecb_rotr (T x, unsigned int count) [C++] 744=item T ecb_rotr (T x, unsigned int count) [C++]
737 745
741 749
742=back 750=back
743 751
744=head2 HOST ENDIANNESS CONVERSION 752=head2 HOST ENDIANNESS CONVERSION
745 753
746=over 4 754=over
747 755
748=item uint_fast16_t ecb_be_u16_to_host (uint_fast16_t v) 756=item uint_fast16_t ecb_be_u16_to_host (uint_fast16_t v)
749 757
750=item uint_fast32_t ecb_be_u32_to_host (uint_fast32_t v) 758=item uint_fast32_t ecb_be_u32_to_host (uint_fast32_t v)
751 759
779 787
780=back 788=back
781 789
782In C++ the following additional template functions are supported: 790In C++ the following additional template functions are supported:
783 791
784=over 4 792=over
785 793
786=item T ecb_be_to_host (T v) 794=item T ecb_be_to_host (T v)
787 795
788=item T ecb_le_to_host (T v) 796=item T ecb_le_to_host (T v)
789 797
790=item T ecb_host_to_be (T v) 798=item T ecb_host_to_be (T v)
791 799
792=item T ecb_host_to_le (T v) 800=item T ecb_host_to_le (T v)
801
802=back
793 803
794These functions work like their C counterparts, above, but use templates, 804These functions work like their C counterparts, above, but use templates,
795which make them useful in generic code. 805which make them useful in generic code.
796 806
797C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t> 807C<T> must be one of C<uint8_t>, C<uint16_t>, C<uint32_t> or C<uint64_t>
800 810
801=head2 UNALIGNED LOAD/STORE 811=head2 UNALIGNED LOAD/STORE
802 812
803These function load or store unaligned multi-byte values. 813These function load or store unaligned multi-byte values.
804 814
805=over 4 815=over
806 816
807=item uint_fast16_t ecb_peek_u16_u (const void *ptr) 817=item uint_fast16_t ecb_peek_u16_u (const void *ptr)
808 818
809=item uint_fast32_t ecb_peek_u32_u (const void *ptr) 819=item uint_fast32_t ecb_peek_u32_u (const void *ptr)
810 820
854 864
855=back 865=back
856 866
857In C++ the following additional template functions are supported: 867In C++ the following additional template functions are supported:
858 868
859=over 4 869=over
860 870
861=item T ecb_peek<T> (const void *ptr) 871=item T ecb_peek<T> (const void *ptr)
862 872
863=item T ecb_peek_be<T> (const void *ptr) 873=item T ecb_peek_be<T> (const void *ptr)
864 874
906(C<uint8_t>) and also have an aligned version (without the C<_u> prefix), 916(C<uint8_t>) and also have an aligned version (without the C<_u> prefix),
907all of which hopefully makes them more useful in generic code. 917all of which hopefully makes them more useful in generic code.
908 918
909=back 919=back
910 920
921=head2 FAST INTEGER TO STRING
922
923Libecb defines a set of very fast integer to decimal string (or integer
924to ascii, short C<i2a>) functions. These work by converting the integer
925to a fixed point representation and then successively multiplying out
926the topmost digits. Unlike some other, also very fast, libraries, ecb's
927algorithm should be completely branchless per digit, and does not rely on
928the presence of special cpu functions (such as clz).
929
930There is a high level API that takes an C<int32_t>, C<uint32_t>,
931C<int64_t> or C<uint64_t> as argument, and a low-level API, which is
932harder to use but supports slightly more formatting options.
933
934=head3 HIGH LEVEL API
935
936The high level API consists of four functions, one each for C<int32_t>,
937C<uint32_t>, C<int64_t> and C<uint64_t>:
938
939Example:
940
941 char buf[ECB_I2A_MAX_DIGITS + 1];
942 char *end = ecb_i2a_i32 (buf, 17262);
943 *end = 0;
944 // buf now contains "17262"
945
946=over
947
948=item ECB_I2A_I32_DIGITS (=11)
949
950=item char *ecb_i2a_u32 (char *ptr, uint32_t value)
951
952Takes an C<uint32_t> I<value> and formats it as a decimal number starting
953at I<ptr>, using at most C<ECB_I2A_I32_DIGITS> characters. Returns a
954pointer to just after the generated string, where you would normally put
955the temrinating C<0> character. This function outputs the minimum number
956of digits.
957
958=item ECB_I2A_U32_DIGITS (=10)
959
960=item char *ecb_i2a_i32 (char *ptr, int32_t value)
961
962Same as C<ecb_i2a_u32>, but formats a C<int32_t> value, including a minus
963sign if needed.
964
965=item ECB_I2A_I64_DIGITS (=20)
966
967=item char *ecb_i2a_u64 (char *ptr, uint64_t value)
968
969=item ECB_I2A_U64_DIGITS (=21)
970
971=item char *ecb_i2a_i64 (char *ptr, int64_t value)
972
973Similar to their 32 bit counterparts, these take a 64 bit argument.
974
975=item ECB_I2A_MAX_DIGITS (=21)
976
977Instead of using a type specific length macro, youi can just use
978C<ECB_I2A_MAX_DIGITS>, which is good enough for any C<ecb_i2a> function.
979
980=back
981
982=head3 LOW-LEVEL API
983
984The functions above use a number of low-level APIs which have some strict
985limitaitons, but cna be used as building blocks (study of C<ecb_i2a_i32>
986and related cunctions is recommended).
987
988There are three families of functions: functions that convert a number
989to a fixed number of digits with leading zeroes (C<ecb_i2a_0N>, C<0>
990for "leading zeroes"), functions that generate up to N digits, skipping
991leading zeroes (C<_N>), and functions that can generate more digits, but
992the leading digit has limited range (C<_xN>).
993
994None of the functions deal with negative numbera.
995
996Example: convert an IP address in an u32 into dotted-quad:
997
998 uint32_t ip = 0x0a000164; // 10.0.1.100
999 char ips[3 * 4 + 3 + 1];
1000 char *ptr = ips;
1001 ptr = ecb_i2a_3 (ptr, ip >> 24 ); *ptr++ = '.';
1002 ptr = ecb_i2a_3 (ptr, (ip >> 16) & 0xff); *ptr++ = '.';
1003 ptr = ecb_i2a_3 (ptr, (ip >> 8) & 0xff); *ptr++ = '.';
1004 ptr = ecb_i2a_3 (ptr, ip & 0xff); *ptr++ = 0;
1005 printf ("ip: %s\n", ips); // prints "ip: 10.0.1.100"
1006
1007=over
1008
1009=item char *ecb_i2a_02 (char *ptr, uint32_t value) // 32 bit
1010
1011=item char *ecb_i2a_03 (char *ptr, uint32_t value) // 32 bit
1012
1013=item char *ecb_i2a_04 (char *ptr, uint32_t value) // 32 bit
1014
1015=item char *ecb_i2a_05 (char *ptr, uint32_t value) // 64 bit
1016
1017=item char *ecb_i2a_06 (char *ptr, uint32_t value) // 64 bit
1018
1019=item char *ecb_i2a_07 (char *ptr, uint32_t value) // 64 bit
1020
1021=item char *ecb_i2a_08 (char *ptr, uint32_t value) // 64 bit
1022
1023=item char *ecb_i2a_09 (char *ptr, uint32_t value) // 64 bit
1024
1025The C<< ecb_i2a_0I<N> > functions take an unsigned I<value> and convert
1026them to exactly I<N> digits, returning a pointer to the first character
1027after the digits. The I<value> must be in range. The functions marked with
1028I<32 bit> do their calculations internally in 32 bit, the ones marked with
1029I<64 bit> internally use 64 bit integers, which might be slow on 32 bit
1030architectures (the high level API decides on 32 vs. 64 bit versions using
1031C<ECB_64BIT_NATIVE>).
1032
1033=item char *ecb_i2a_2 (char *ptr, uint32_t value) // 32 bit
1034
1035=item char *ecb_i2a_3 (char *ptr, uint32_t value) // 32 bit
1036
1037=item char *ecb_i2a_4 (char *ptr, uint32_t value) // 32 bit
1038
1039=item char *ecb_i2a_5 (char *ptr, uint32_t value) // 64 bit
1040
1041=item char *ecb_i2a_6 (char *ptr, uint32_t value) // 64 bit
1042
1043=item char *ecb_i2a_7 (char *ptr, uint32_t value) // 64 bit
1044
1045=item char *ecb_i2a_8 (char *ptr, uint32_t value) // 64 bit
1046
1047=item char *ecb_i2a_9 (char *ptr, uint32_t value) // 64 bit
1048
1049Similarly, the C<< ecb_i2a_I<N> > functions take an unsigned I<value>
1050and convert them to at most I<N> digits, suppressing leading zeroes, and
1051returning a pointer to the first character after the digits.
1052
1053=item ECB_I2A_MAX_X5 (=59074)
1054
1055=item char *ecb_i2a_x5 (char *ptr, uint32_t value) // 32 bit
1056
1057=item ECB_I2A_MAX_X10 (=2932500665)
1058
1059=item char *ecb_i2a_x10 (char *ptr, uint32_t value) // 64 bit
1060
1061The C<< ecb_i2a_xI<N> >> functions are similar to the C<< ecb_i2a_I<N> >
1062functions, but they can generate one digit more, as long as the number
1063is within range, which is given by the symbols C<ECB_I2A_MAX_X5> (almost
106416 bit range) and C<ECB_I2A_MAX_X10> (a bit more than 31 bit range),
1065respectively.
1066
1067For example, the sigit part of a 32 bit signed integer just fits into the
1068C<ECB_I2A_MAX_X10> range, so while C<ecb_i2a_x10> cannot convert a 10
1069digit number, it can convert all 32 bit signed numbers. Sadly, it's not
1070good enough for 32 bit unsigned numbers.
1071
1072=back
1073
911=head2 FLOATING POINT FIDDLING 1074=head2 FLOATING POINT FIDDLING
912 1075
913=over 4 1076=over
914 1077
915=item ECB_INFINITY [-UECB_NO_LIBM] 1078=item ECB_INFINITY [-UECB_NO_LIBM]
916 1079
917Evaluates to positive infinity if supported by the platform, otherwise to 1080Evaluates to positive infinity if supported by the platform, otherwise to
918a truly huge number. 1081a truly huge number.
996 1159
997=back 1160=back
998 1161
999=head2 ARITHMETIC 1162=head2 ARITHMETIC
1000 1163
1001=over 4 1164=over
1002 1165
1003=item x = ecb_mod (m, n) 1166=item x = ecb_mod (m, n)
1004 1167
1005Returns C<m> modulo C<n>, which is the same as the positive remainder 1168Returns C<m> modulo C<n>, which is the same as the positive remainder
1006of the division operation between C<m> and C<n>, using floored 1169of the division operation between C<m> and C<n>, using floored
1013C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be 1176C<n> must be strictly positive (i.e. C<< >= 1 >>), while C<m> must be
1014negatable, that is, both C<m> and C<-m> must be representable in its 1177negatable, that is, both C<m> and C<-m> must be representable in its
1015type (this typically excludes the minimum signed integer value, the same 1178type (this typically excludes the minimum signed integer value, the same
1016limitation as for C</> and C<%> in C). 1179limitation as for C</> and C<%> in C).
1017 1180
1018Current GCC versions compile this into an efficient branchless sequence on 1181Current GCC/clang versions compile this into an efficient branchless
1019almost all CPUs. 1182sequence on almost all CPUs.
1020 1183
1021For example, when you want to rotate forward through the members of an 1184For example, when you want to rotate forward through the members of an
1022array for increasing C<m> (which might be negative), then you should use 1185array for increasing C<m> (which might be negative), then you should use
1023C<ecb_mod>, as the C<%> operator might give either negative results, or 1186C<ecb_mod>, as the C<%> operator might give either negative results, or
1024change direction for negative values: 1187change direction for negative values:
1037 1200
1038=back 1201=back
1039 1202
1040=head2 UTILITY 1203=head2 UTILITY
1041 1204
1042=over 4 1205=over
1043 1206
1044=item element_count = ecb_array_length (name) 1207=item element_count = ecb_array_length (name)
1045 1208
1046Returns the number of elements in the array C<name>. For example: 1209Returns the number of elements in the array C<name>. For example:
1047 1210
1055 1218
1056=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF 1219=head2 SYMBOLS GOVERNING COMPILATION OF ECB.H ITSELF
1057 1220
1058These symbols need to be defined before including F<ecb.h> the first time. 1221These symbols need to be defined before including F<ecb.h> the first time.
1059 1222
1060=over 4 1223=over
1061 1224
1062=item ECB_NO_THREADS 1225=item ECB_NO_THREADS
1063 1226
1064If F<ecb.h> is never used from multiple threads, then this symbol can 1227If F<ecb.h> is never used from multiple threads, then this symbol can
1065be defined, in which case memory fences (and similar constructs) are 1228be defined, in which case memory fences (and similar constructs) are

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines