rust/rustc_span at 63a91db022f8df0ed78bfca4633e6f626eae040d - rust

mirror of https://github.com/rust-lang/rust.git synced 2024-11-25 16:24:46 +00:00

History

Jed Brown 0d8a978e8a intrinsics.fmuladdf{16,32,64,128}: expose llvm.fmuladd.* semantics Add intrinsics `fmuladd{f16,f32,f64,f128}`. This computes `(a * b) + c`, to be fused if the code generator determines that (i) the target instruction set has support for a fused operation, and (ii) that the fused operation is more efficient than the equivalent, separate pair of `mul` and `add` instructions. https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic MIRI support is included for f32 and f64. The codegen_cranelift uses the `fma` function from libc, which is a correct implementation, but without the desired performance semantic. I think this requires an update to cranelift to expose a suitable instruction in its IR. I have not tested with codegen_gcc, but it should behave the same way (using `fma` from libc).	2024-10-11 15:32:56 -06:00
..
src	intrinsics.fmuladdf{16,32,64,128}: expose llvm.fmuladd.* semantics	2024-10-11 15:32:56 -06:00
Cargo.toml	add unstable support for outputting file checksums for use in cargo	2024-10-01 21:23:20 -06:00

Jed Brown 0d8a978e8a intrinsics.fmuladdf{16,32,64,128}: expose llvm.fmuladd.* semantics

Add intrinsics `fmuladd{f16,f32,f64,f128}`. This computes `(a * b) +
c`, to be fused if the code generator determines that (i) the target
instruction set has support for a fused operation, and (ii) that the
fused operation is more efficient than the equivalent, separate pair
of `mul` and `add` instructions.

https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic

MIRI support is included for f32 and f64.

The codegen_cranelift uses the `fma` function from libc, which is a
correct implementation, but without the desired performance semantic. I
think this requires an update to cranelift to expose a suitable
instruction in its IR.

I have not tested with codegen_gcc, but it should behave the same
way (using `fma` from libc).

2024-10-11 15:32:56 -06:00

src

intrinsics.fmuladdf{16,32,64,128}: expose llvm.fmuladd.* semantics

2024-10-11 15:32:56 -06:00

Cargo.toml

add unstable support for outputting file checksums for use in cargo

2024-10-01 21:23:20 -06:00