Floating-point extensions part 1: binary floating-point arithmetic
Floating-point extensions for C - Part 1: Binary floating-point arithmetic, ISO/IEC TS 18661-1:2014, defines the following new components for the C standard library, as recommended by ISO/IEC/IEEE 60559:2011 (the current revision of IEEE-754)
__STDC_IEC_60559_BFP__ |
integer constant of type long and value 201ymmL, replaces __STDC_IEC_559__ (macro constant) |
__STDC_IEC_60559_COMPLEX__ |
integer constant of type long and value 201ymmL, replaces __STDC_IEC_559_COMPLEX__ (macro constant) |
Defined in header
<limits.h> | |
CHAR_WIDTH SCHAR_WIDTH UCHAR_WIDTHSHRT_WIDTH USHRT_WIDTHINT_WIDTH UINT_WIDTHLONG_WIDTH ULONG_WIDTHLLONG_WIDTH ULLONG_WIDTH (FP Ext 1 TS) |
width, in bits, of the corresponding type (macro constant) |
Defined in header
<float.h> | |
(FP Ext 1 TS) |
conversions between all supported binary floating-point types and character sequences with at most CR_DECIMAL_DIG significant decimal digits are correctly rounded (this is at least DECIMAL_DIG + 3) (macro constant) |
Defined in header
<fenv.h> | |
femode_t (FP Ext 1 TS) |
collection of dynamic floating-point control modes supported by the implementation, including the dynamic rounding direction mode (typedef) |
FE_DFL_MODE (FP Ext 1 TS) |
pointer to the default femode_t (macro constant) |
FE_SNANS_ALWAYS_SIGNAL (FP Ext 1 TS) |
defined (as integer constant 1) if sNaN arguments cause the functions that suppress qNaNs, like hypot or fmax, to raise FE_INVALID and return a qNaN (macro constant) |
(FP Ext 1 TS) |
sets the specified floating-point exception flags without causing any side-effects that raising them would (function) |
(FP Ext 1 TS) |
tests if given flags are in a saved representation of the floating-point exception flags (function) |
(FP Ext 1 TS) |
gets and sets all the implementations dynamic floating-point control modes collectively (function) |
Defined in header
<stdint.h> | |
INTn_WIDTH UINTn_WIDTHINT_LEASTn_WIDTH UINT_LEASTn_WIDTHINT_FASTn_WIDTH UINT_FASTn_WIDTHINTPTR_WIDTH UINTPTR_WIDTHINTMAX_WIDTH UINTMAX_WIDTHPTRDIFF_WIDTHSIG_ATOMIC_WIDTHSIZE_WIDTHWCHAR_WIDTH WINT_WIDTH (FP Ext 1 TS) |
width, in bits, of the corresponding type (macro constant) |
Defined in header
<stdlib.h> | |
(FP Ext 1 TS) |
convert a single foating-point number to string using the specified snprintf format (function) |
Defined in header
<math.h> | |
FP_INT_UPWARDFP_INT_DOWNWARDFP_INT_TOWARDZERO
FP_INT_TONEARESTFROMZEROFP_INT_TONEAREST (FP Ext 1 TS) |
rounding direction for the functions ceil, floor, trunc, round, and roundeven, suitable for use with fromfp family of functions (macro constant) |
FP_LLOGB0 (FP Ext 1 TS) |
value returned by llogb if the argument is zero (macro constant) |
FP_LLOGBNAN (FP Ext 1 TS) |
value returned by llogb if the argument is NaN (macro constant) |
(FP Ext 1 TS) |
represents a signalling NaN for float, double, long double respectively (macro constant) |
FP_FAST_FADD FP_FAST_FADDL FP_FAST_DADDLFP_FAST_FSUB FP_FAST_FSUBL FP_FAST_DSUBLFP_FAST_FMUL FP_FAST_FMULL FP_FAST_DMULLFP_FAST_FDIV FP_FAST_FDIVL FP_FAST_DDIVLFP_FAST_FFMA FP_FAST_FFMAL FP_FAST_DFMALFP_FAST_FSQRT FP_FAST_FSQRTL FP_FAST_DSQRTL (FP Ext 1 TS) |
if defined, indicates that the corresponding function executes faster than the equivalent function in a larger type followed by a cast to target type (macro constant) |
iseqsig (FP Ext 1 TS) |
(function macro) |
iscanonical (FP Ext 1 TS) |
tests if the floating-point value is canonical (function macro) |
issignaling (FP Ext 1 TS) |
tests if the floating-point value is a signalling NaN (function macro) |
issubnormal (FP Ext 1 TS) |
tests if the floating-point value is subnormal (function macro) |
iszero (FP Ext 1 TS) |
tests if the floating-point value is a zero (positive, negative, unsigned) (function macro) |
(FP Ext 1 TS) |
round to signed integer using the specified rounding direction (function) |
(FP Ext 1 TS) |
round to unsigned integer using the specified rounding direction (function) |
(FP Ext 1 TS) |
round to signed integer using the specified rounding direction, reporting inexactness (function) |
(FP Ext 1 TS) |
round to unsigned integer using the specified rounding direction, reporting inexactness (function) |
(FP Ext 1 TS) |
rounds to nearest, halfway cases to even (function) |
(FP Ext 1 TS) |
equivalent to logb except the return type is long (function) |
(FP Ext 1 TS) |
returns the value of their argument of maximum magnitude (function) |
(FP Ext 1 TS) |
returns the value of their argument of minimum magnitude (function) |
(FP Ext 1 TS) |
returns the next greater representable floating-point value (function) |
(FP Ext 1 TS) |
returns the next smaller representable floating-point value (function) |
(FP Ext 1 TS) |
calculates x+y as if in infinite precision and rounded once to target type (function) |
(FP Ext 1 TS) |
calculates x-y as if in infinite precision and rounded once to target type (function) |
(FP Ext 1 TS) |
calculates x*y as if in infinite precision and rounded once to target type (function) |
(FP Ext 1 TS) |
calculates x/y as if in infinite precision and rounded once to target type (function) |
(FP Ext 1 TS) |
calculates the same as fma as if in infinite precision and rounded once to target type (function) |
(FP Ext 1 TS) |
calculates the same as sqrt as if in infinite precision and rounded once to target type (function) |
(FP Ext 1 TS) |
orders two floating-point values using the ISO 60559 total order relation (function) |
(FP Ext 1 TS) |
orders the magnitudes of two floating-point values using the ISO 60559 total order relation (function) |
(FP Ext 1 TS) |
obtains the ISO 60559 canonical binary encoding of the given floating-point value (function) |
(FP Ext 1 TS) |
extracts the payload from the given NaN value (function) |
(FP Ext 1 TS) |
creates a quiet NaN with the specified payload (function) |
(FP Ext 1 TS) |
creates a signalling NaN with the specified payload (function) |
Defined in header
<tgmath.h> | |
roundeven (FP Ext 1 TS) |
generic overload of roundeven (function) |
llogb (FP Ext 1 TS) |
generic overload of llogb (function) |
fmaxmag (FP Ext 1 TS) |
generic overload of fmaxmag (function) |
fminmag (FP Ext 1 TS) |
generic overload of fminmag (function) |
nextup (FP Ext 1 TS) |
generic overload of nextup (function) |
nextdown (FP Ext 1 TS) |
generic overload of nextdown (function) |
fromfp (FP Ext 1 TS) |
generic overload of fromfp (function) |
ufromfp (FP Ext 1 TS) |
generic overload of ufromfp (function) |
fromfpx (FP Ext 1 TS) |
generic overload of fromfpx (function) |
ufromfpx (FP Ext 1 TS) |
generic overload of ufromfpx (function) |
nextdown (FP Ext 1 TS) |
generic overload of nextdown (function) |
totalorder (FP Ext 1 TS) |
generic overload of totalorder (function) |
totalordermag (FP Ext 1 TS) |
generic overload of totalordermag (function) |
fadd (FP Ext 1 TS) |
generic overload of fadd (function) |
dadd (FP Ext 1 TS) |
generic overload of dadd (function) |
fsub (FP Ext 1 TS) |
generic overload of fsub (function) |
dsub (FP Ext 1 TS) |
generic overload of dsub (function) |
fmul (FP Ext 1 TS) |
generic overload of fmul (function) |
dmul (FP Ext 1 TS) |
generic overload of dmul (function) |
fdiv (FP Ext 1 TS) |
generic overload of fdiv (function) |
ddiv (FP Ext 1 TS) |
generic overload of ddiv (function) |
ffma (FP Ext 1 TS) |
generic overload of ffma (function) |
dfma (FP Ext 1 TS) |
generic overload of dfma (function) |
fsqrt (FP Ext 1 TS) |
generic overload of fsqrt (function) |
dsqrt (FP Ext 1 TS) |
generic overload of dsqrt (function) |
Notes
The standard C macros __STDC_IEC_559__ and __STDC_IEC_559_COMPLEX__ are made obsolete by this technical specification.
All functions and macros added to the C library by this extension are only declared if a macro __STDC_WANT_IEC_60559_BFP_EXT__ is defined before the corresponding header is included.
Besides additions to the standard library, ISO/IEC TS 18661-1:2014 makes a number of changes to the core language, notably splitting floating-point control between static (controlled by the new #pragma STDC FENV_ROUND), and dynamic (controlled by fesetround). Most math.h functions respect the static rounding mode, if set, over the dynamic rounding mode.
This section is incomplete Reason: add to the pragma page or describe the pragma in full here? |