nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-06-21 03:57:38 +00:00

Author	SHA1	Message	Date
Jubilee	5d0f52efa4	Rollup merge of #131375 - klensy:clone_on_ref_ptr, r=cjgillot compiler: apply clippy::clone_on_ref_ptr for CI Apply lint https://rust-lang.github.io/rust-clippy/master/index.html#/clone_on_ref_ptr for compiler, also see https://github.com/rust-lang/rust/pull/131225#discussion_r1790109443. Some Arc's can be misplaced with Lrc's, sorry. https://rust-lang.zulipchat.com/#narrow/channel/131828-t-compiler/topic/enable.20more.20clippy.20lints.20for.20compiler.20.28and.5Cor.20std.29	2024-10-29 03:11:39 -07:00
klensy	17636374de	correct LLVMRustCreateThinLTOData arg types	2024-10-29 00:47:20 +03:00
Jubilee	6fd4a76d3b	Rollup merge of #132261 - ChrisCho-H:refactor/cleaner-check-none, r=compiler-errors refactor: cleaner check to return None It's very nit change. Refactor to shorten verbose check when returning None for `backend_feature_name`.	2024-10-28 10:18:52 -07:00
Jubilee	bd43f8e9fd	Rollup merge of #132260 - Zalathar:type-safe-cast, r=compiler-errors cg_llvm: Use a type-safe helper to cast `&str` and `&[u8]` to `const c_char` In `rustc_codegen_llvm` there are many uses of `.as_ptr().cast()` to convert a string or byte-slice to `const c_char`, which then gets passed through FFI. This works, but is fragile, because there's nothing constraining the pointer cast to actually be from `u8` to `c_char`. If the original value changes to something else that has an `as_ptr` method, or the context changes to expect something other than `c_char`, the cast will silently do the wrong thing. By making the cast more explicit via a helper method, we can be sure that it will either perform the intended cast, or fail at compile time.	2024-10-28 10:18:52 -07:00
Jubilee Young	88a9edc091	compiler: Add `is_uninhabited` and use LayoutS accessors This reduces the need of the compiler to peek on the fields of LayoutS.	2024-10-28 09:58:30 -07:00
Alex Crichton	f534974037	Add a new `wide-arithmetic` feature for WebAssembly This commit adds a new rustc target feature named `wide-arithmetic` for WebAssembly targets. This corresponds to the [wide-arithmetic] proposal for WebAssembly which adds new instructions catered towards accelerating integer arithmetic larger than 64-bits. This proposal to WebAssembly is not standard yet so this new feature is flagged as an unstable target feature. Additionally Rust's LLVM version doesn't support this new feature yet since support will first be added in LLVM 20, so the feature filtering logic for LLVM is updated to handle this. I'll also note that I'm not currently planning to add wasm-specific intrinsics to `std::arch::wasm32` at this time. The currently proposed instructions are all accessible through `i128` or `u128`-based operations which Rust already supports, so intrinsic shouldn't be necessary to get access to these new instructions. [wide-arithmetic]: https://github.com/WebAssembly/wide-arithmetic	2024-10-28 08:11:47 -07:00
klensy	746b675c5a	fix clippy::clone_on_ref_ptr for compiler	2024-10-28 18:05:08 +03:00
ChrisCho-H	82bfe05309	refactor: cleaner check to return None	2024-10-28 20:16:35 +09:00
Zalathar	4bd84b23a8	Use a type-safe helper to cast `&str` and `&[u8]` to `*const c_char`	2024-10-28 21:31:32 +11:00
bors	be33e4f3d6	Auto merge of #132167 - Zalathar:llvm-wrappers, r=jieyouxu Replace some LLVMRust wrappers with calls to the LLVM C API This PR removes the LLVMRust wrapper functions for getting/setting linkage and visibility, and replaces them with direct calls to the corresponding functions in LLVM's C API. To make this convenient and sound, two pieces of supporting code have also been added: - A simple proc-macro that derives `TryFrom<u32>` for fieldless enums - A wrapper type for C enum values returned by LLVM functions, to ensure soundness if LLVM returns an enum value we don't know about In a few places, the use of safe wrapper functions means that an `unsafe` block is no longer needed, so the affected code has changed its indentation level.	2024-10-27 03:24:54 +00:00
bors	f7cf41c973	Auto merge of #131900 - mrkajetanp:target-feature-pauth-lr, r=Amanieu rustc_target: Add pauth-lr aarch64 target feature Add the pauth-lr target feature, corresponding to aarch64 FEAT_PAuth_LR. This feature has been added in LLVM 19. It is currently not supported by the Linux hwcap and so we cannot add runtime feature detection for it at this time. r? `@Amanieu`	2024-10-27 00:09:49 +00:00
Zalathar	d976ca8701	Use LLVM-C APIs for getting/setting visibility	2024-10-27 11:05:33 +11:00
许杰友 Jieyou Xu (Joe)	c26280a8ba	Rollup merge of #132124 - Zalathar:consolidate-covstar, r=jieyouxu coverage: Consolidate creation of covmap/covfun records This code for creating covmap/covfun records during codegen was split across multiple functions and files for dubious historical reasons. Having it all in one place makes it easier to follow. This PR also includes two semi-related cleanups: - Getting the codegen context's `coverage_cx` state is made infallible, since it should always exist when running the code paths that need it. - The value of `covfun_section_name` is saved in the codegen context, since it never changes at runtime, and the code that needs it has access to the context anyway. --- Background: Coverage instrumentation generates two kinds of metadata that are embedded in the final binary. There is per-CGU information that goes in the `__llvm_covmap` linker section, and per-function information that goes in the `__llvm_covfun` section (except on Windows, where slightly different section names are used).	2024-10-26 22:01:12 +08:00
Zalathar	96993a9b5e	Use LLVM-C APIs for getting/setting linkage	2024-10-26 20:20:20 +11:00
Zalathar	ec41e6d1b0	Add a wrapper type for raw enum values returned by LLVM	2024-10-26 20:20:20 +11:00
Zalathar	b114040afb	Use safe wrappers `get_visibility` and `set_visibility`	2024-10-26 20:20:20 +11:00
Zalathar	983d258be3	Use safe wrappers `get_linkage` and `set_linkage`	2024-10-26 20:20:18 +11:00
Zalathar	0d653a5866	coverage: Add links to LLVM docs for the coverage mapping format	2024-10-26 17:37:04 +11:00
Deadbeef	f6fea83342	Effects cleanup - removed extra bits from predicates queries that are no longer needed in the new system - removed the need for `non_erasable_generics` to take in tcx and DefId, removed unused arguments in callers	2024-10-26 10:19:07 +08:00
Zalathar	8f07514520	coverage: SSA doesn't need to know about `instrprof_increment`	2024-10-25 14:24:05 +11:00
Zalathar	b3d65852c3	coverage: Emit MC/DC intrinsics using the normal helper method	2024-10-25 14:01:36 +11:00
Zalathar	4923e856be	coverage: Emit `llvm.instrprof.increment` using the normal helper method	2024-10-25 13:55:44 +11:00
Zalathar	0356908cf5	coverage: Store `covfun_section_name` in the codegen context Adding an extra `OnceCell` to `CrateCoverageContext` is much nicer than trying to thread this string through multiple layers of function calls that already have access to the context.	2024-10-25 12:42:42 +11:00
Zalathar	0a96176533	coverage: Make obtaining the codegen coverage context infallible In all the situations where this context is needed, it should always be available.	2024-10-25 12:42:42 +11:00
Zalathar	9f8a6be221	coverage: Consolidate creation of covmap/covfun records There is no need for this code to be split across multiple functions in multiple files.	2024-10-25 12:42:23 +11:00
Stuart Cook	8f354fc94a	Rollup merge of #131956 - Zalathar:llvm-counters, r=compiler-errors,Swatinem coverage: Pass coverage mappings to LLVM as separate structs Instead of trying to cram N different kinds of coverage mapping data into a single list for FFI, pass N different lists of simpler structs. This avoids the need to fill unused fields with dummy values, and avoids the need to tag structs with their underlying kind. It also lets us call the dedicated LLVM constructors for each different mapping type, instead of having to go through the complex general-purpose constructor. Even though this adds multiple new structs to the FFI surface area, the resulting C++ code is simpler and shorter. --- I've structured this mostly as a single atomic patch, rather than a series of incremental changes, because that avoids the need to make fiddly fixes to code that is about to be deleted anyway.	2024-10-24 14:19:57 +11:00
Josh Triplett	ecdc2441b6	"innermost", "outermost", "leftmost", and "rightmost" don't need hyphens These are all standard dictionary words and don't require hyphenation.	2024-10-23 02:45:24 -07:00
bors	f2ba41113d	Auto merge of #130950 - compiler-errors:yeet-eval, r=BoxyUwU Continue to get rid of `ty::Const::{try_}eval` This PR mostly does: Removes all of the `try_eval_` and `eval_` helpers from `ty::Const`, and replace their usages with `try_to_`. Remove `ty::Const::eval`. * Rename `ty::Const::normalize` to `ty::Const::normalize_internal`. This function is still used in the normalization code itself. * Fix some weirdness around the `TransmuteFrom` goal. I'm happy to split it out further; for example, I could probably land the first part which removes the helpers, or the changes to codegen which are more obvious than the changes to tools. r? BoxyUwU Part of https://github.com/rust-lang/rust/issues/130704	2024-10-21 03:46:28 +00:00
Zalathar	3310419d35	Make `llvm::set_section` take a `&CStr`	2024-10-20 17:08:05 +11:00
Zalathar	d1bf77eb34	Pass coverage mappings to LLVM as separate structs	2024-10-20 13:29:34 +11:00
Zalathar	98c4d96957	Reduce visibility of coverage FFI functions/types	2024-10-20 10:55:47 +11:00
Michael Goulet	e83e4e8112	Get rid of const eval_* and try_eval_* helpers	2024-10-19 18:07:35 +00:00
Jubilee Young	45d61b0d26	cg_llvm: Reuse LLVM-C Comdat support Migrate `llvm::set_comdat` and `llvm::SetUniqueComdat` to LLVM-C FFI. Note, now we can call `llvm::set_comdat` only when the target actually supports adding comdat. As this has no convenient LLVM-C API, we implement this as `TargetOptions::supports_comdat`. Co-authored-by: Stuart Cook <Zalathar@users.noreply.github.com>	2024-10-19 10:46:10 -07:00
Jubilee Young	888efe74a3	cg_llvm: Switch `llvm::add_global` to `&CStr`	2024-10-18 17:46:33 -07:00
Kajetan Puchalski	f641c32aad	rustc_target: Add pauth-lr aarch64 target feature Add the pauth-lr target feature, corresponding to aarch64 FEAT_PAuth_LR. This feature has been added in LLVM 19. It is currently not supported by the Linux hwcap and so we cannot add runtime feature detection for it at this time.	2024-10-16 18:00:51 +01:00
Matthew Maurer	e985396145	llvm: Match aarch64 data layout to new LLVM layout LLVM has added 3 new address spaces to support special Windows use cases. These shouldn't trouble us for now, but LLVM requires matching data layouts. See llvm/llvm-project#111879 for details	2024-10-16 01:16:44 +00:00
Taiki Endo	67ebb6c20b	Fix AArch64InlineAsmReg::emit	2024-10-14 06:04:07 +09:00
Trevor Gross	39071fdc58	Rollup merge of #131626 - matthiaskrgr:dont_string, r=lqd remove a couple of redundant String to String conversion	2024-10-12 21:38:38 -05:00
Matthias Krüger	4bc21e318c	remove a couple of redundant String to String conversion	2024-10-12 22:07:46 +02:00
DianQK	1efffe720d	`LLVMConstInt` only allows integer types	2024-10-12 23:02:15 +08:00
Trevor Gross	3f9aa50b70	Rollup merge of #124874 - jedbrown:float-mul-add-fast, r=saethlin intrinsics fmuladdf{32,64}: expose llvm.fmuladd.* semantics Add intrinsics `fmuladd{f32,f64}`. This computes `(a * b) + c`, to be fused if the code generator determines that (i) the target instruction set has support for a fused operation, and (ii) that the fused operation is more efficient than the equivalent, separate pair of `mul` and `add` instructions. https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic The codegen_cranelift uses the `fma` function from libc, which is a correct implementation, but without the desired performance semantic. I think this requires an update to cranelift to expose a suitable instruction in its IR. I have not tested with codegen_gcc, but it should behave the same way (using `fma` from libc). --- This topic has been discussed a few times on Zulip and was suggested, for example, by `@workingjubilee` in [Effect of fma disabled](https://rust-lang.zulipchat.com/#narrow/stream/122651-general/topic/Effect.20of.20fma.20disabled/near/274179331).	2024-10-11 23:57:44 -04:00
Trevor Gross	2c385ba329	Rollup merge of #131543 - Zalathar:goodbye-llvm-17, r=petrochenkov coverage: Remove code related to LLVM 17 In-tree LLVM is 19, and the minimum external LLVM was increased to 18 in #130487.	2024-10-11 16:53:49 -05:00
Jed Brown	0d8a978e8a	intrinsics.fmuladdf{16,32,64,128}: expose llvm.fmuladd.* semantics Add intrinsics `fmuladd{f16,f32,f64,f128}`. This computes `(a * b) + c`, to be fused if the code generator determines that (i) the target instruction set has support for a fused operation, and (ii) that the fused operation is more efficient than the equivalent, separate pair of `mul` and `add` instructions. https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic MIRI support is included for f32 and f64. The codegen_cranelift uses the `fma` function from libc, which is a correct implementation, but without the desired performance semantic. I think this requires an update to cranelift to expose a suitable instruction in its IR. I have not tested with codegen_gcc, but it should behave the same way (using `fma` from libc).	2024-10-11 15:32:56 -06:00
Matthias Krüger	33b1264540	Rollup merge of #131519 - davidlattimore:intrinsics-default-vis, r=Urgau Use Default visibility for rustc-generated C symbol declarations Non-default visibilities should only be used for definitions, not declarations, otherwise linking can fail. This is based on https://github.com/rust-lang/rust/pull/123994. Issue https://github.com/rust-lang/rust/issues/123427 When I changed `default-hidden-visibility` to `default-visibility` in https://github.com/rust-lang/rust/pull/130005, I updated all places in the code that used `default-hidden-visibility`, replicating the hidden-visibility bug to also happen for protected visibility. Without this change, trying to build rustc with `-Z default-visibility=protected` fails with a link error.	2024-10-11 15:36:52 +02:00
Zalathar	9357277de7	coverage: Remove code related to LLVM 17	2024-10-11 21:44:36 +11:00
David Lattimore	42c0494499	Use Default visibility for rustc-generated C symbol declarations Non-default visibilities should only be used for definitions, not declarations, otherwise linking can fail. Co-authored-by: Collin Baker <collinbaker@chromium.org>	2024-10-11 08:43:27 +11:00
Matthias Krüger	edb669350a	Rollup merge of #130741 - mrkajetanp:detect-b16b16, r=Amanieu rustc_target: Add sme-b16b16 as an explicit aarch64 target feature LLVM 20 split out what used to be called b16b16 and correspond to aarch64 FEAT_SVE_B16B16 into sve-b16b16 and sme-b16b16. Add sme-b16b16 as an explicit feature and update the codegen accordingly. Resolves https://github.com/rust-lang/rust/pull/129894.	2024-10-10 22:00:48 +02:00
Matthias Krüger	13976f1f25	Rollup merge of #130308 - davidtwco:tied-target-consolidation, r=wesleywiser codegen_ssa: consolidate tied target checks Fixes #105110. Fixes #105111. `rustc_codegen_llvm` and `rustc_codegen_gcc` duplicated logic for checking if tied target features were partially enabled. This PR consolidates these checks into `rustc_codegen_ssa` in the `codegen_fn_attrs` query, which also is run pre-monomorphisation for each function, which ensures that this check is run for unused functions, as would be expected. Also adds a test confirming that enabling one tied feature doesn't imply another - the appropriate error for this was already being emitted. I did a bisect and narrowed it down to two patches it was likely to be - something in #128796, probably #128221 or #128679.	2024-10-10 22:00:45 +02:00
Kajetan Puchalski	335f67b652	rustc_target: Add sme-b16b16 as an explicit aarch64 target feature LLVM 20 split out what used to be called b16b16 and correspond to aarch64 FEAT_SVE_B16B16 into sve-b16b16 and sme-b16b16. Add sme-b16b16 as an explicit feature and update the codegen accordingly.	2024-10-10 10:24:57 +00:00
Matthias Krüger	e642442f12	Rollup merge of #131424 - workingjubilee:stem-the-tyde-of-glob-imports, r=jieyouxu compiler: Stop reexporting enum-globs from `rustc_target::abi` Three enums had all their variants glob-exported into a distressingly large amount of the tree. Cease to do that, and also cease to glob import the contents of the module that contained them. Redirect relevant imports to their actual source, the `rustc_abi` crate. No functional changes.	2024-10-09 20:27:24 +02:00
Jubilee Young	1379ef592a	compiler: Factor rustc_target::abi out of cg_llvm	2024-10-08 18:24:56 -07:00
Michael Goulet	17eca60c24	Dont ICE when encountering post-mono layout cycle error	2024-10-08 16:46:16 -04:00
bors	cf24c73141	Auto merge of #126733 - ZhuUx:llvm-19-adapt, r=Zalathar [Coverage][MCDC] Adapt mcdc to llvm 19 Related issue: #126672 Also finish task 4 at #124144 [llvm #82448](https://github.com/llvm/llvm-project/pull/82448) has introduced some break changes into mcdc, causing incompatibility between llvm 18 and 19. This draft adapts to that change and gives up supporting for llvm-18.	2024-10-08 07:08:41 +00:00
zhuyunxing	6e3e19f714	coverage. Adapt to mcdc mapping formats introduced by llvm 19	2024-10-08 11:15:24 +08:00
zhuyunxing	99bd601df5	coverage. MCDC ConditionId start from 0 to keep with llvm 19	2024-10-08 10:50:18 +08:00
zhuyunxing	911ac56e95	coverage. Disable supporting mcdc on llvm-18	2024-10-08 10:50:18 +08:00
Stuart Cook	4d63896018	Rollup merge of #130824 - Darksonn:fix-function-return, r=wesleywiser Add missing module flags for `-Zfunction-return=thunk-extern` This fixes a bug in the `-Zfunction-return=thunk-extern` flag. The flag needs to be passed onto LLVM to ensure that functions such as `asan.module_ctor` and `asan.module_dtor` that are created internally in LLVM have the mitigation applied to them. This was originally discovered [in the Linux kernel](https://lore.kernel.org/all/CANiq72myZL4_poCMuNFevtpYYc0V0embjSuKb7y=C+m3vVA_8g@mail.gmail.com/). Original flag PR: #116892 PR for similar issue: #129373 Tracking issue: #116853 cc ``@ojeda`` r? ``@wesleywiser``	2024-10-08 13:19:43 +11:00
Urgau	018ba0528f	Use wide pointers consistenly across the compiler	2024-10-04 14:06:48 +02:00
Jacob Kiesel	bb5a8276be	add unstable support for outputting file checksums for use in cargo	2024-10-01 21:23:20 -06:00
bors	06bb8364aa	Auto merge of #131111 - matthiaskrgr:rollup-n6do187, r=matthiaskrgr Rollup of 4 pull requests Successful merges: - #130005 (Replace -Z default-hidden-visibility with -Z default-visibility) - #130229 (ptr::add/sub: do not claim equivalence with `offset(c as isize)`) - #130773 (Update Unicode escapes in `/library/core/src/char/methods.rs`) - #130933 (rustdoc: lists items that contain multiple paragraphs are more clear) r? `@ghost` `@rustbot` modify labels: rollup	2024-10-01 19:29:26 +00:00
Matthias Krüger	389a399a50	Rollup merge of #130005 - davidlattimore:protected-vis-flag, r=Urgau Replace -Z default-hidden-visibility with -Z default-visibility Issue #105518	2024-10-01 21:09:18 +02:00
Guillaume Gomez	344b6a1668	Rollup merge of #130630 - taiki-e:s390x-clobber-abi, r=Amanieu Support clobber_abi and vector/access registers (clobber-only) in s390x inline assembly This supports `clobber_abi` which is one of the requirements of stabilization mentioned in #93335. This also supports vector registers (as `vreg`) and access registers (as `areg`) as clobber-only, which need to support clobbering of them to implement clobber_abi. Refs: - "1.2.1.1. Register Preservation Rules" section in ELF Application Binary Interface s390x Supplement, Version 1.6.1 (lzsabi_s390x.pdf in https://github.com/IBM/s390x-abi/releases/tag/v1.6.1) - Register definition in LLVM: - Vector registers https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/SystemZ/SystemZRegisterInfo.td#L249 - Access registers https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/SystemZ/SystemZRegisterInfo.td#L332 I have three questions: - ~~ELF Application Binary Interface s390x Supplement says that `cc` (condition code, bits 18-19 of PSW) is "Volatile". However, we do not have a register class for `cc` and instead mark `cc` as clobbered unless `preserves_flags` is specified (https://github.com/rust-lang/rust/pull/111331). Therefore, in the current implementation, if both `preserves_flags` and `clobber_abi` are specified, `cc` is not marked as clobbered. Is this okay? Or even if `preserves_flags` is used, should `cc` be marked as clobbered if `clobber_abi` is used?~~ UPDATE: resolved https://github.com/rust-lang/rust/pull/130630#issuecomment-2367923121 - ~~ELF Application Binary Interface s390x Supplement says that `pm` (program mask, bits 20-23 of PSW) is "Cleared". There does not appear to be any registers associated with this in either [LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/SystemZ/SystemZRegisterInfo.td) or [GCC](`33ccc1314d/gcc/config/s390/s390.h (L407-L431)`), so at this point I don't see any way other than to just ignore it. Is this okay as-is?~~ UPDATE: resolved https://github.com/rust-lang/rust/pull/130630#issuecomment-2367923121 - Is "areg" a good name for register class name for access registers? It may be a bit confusing between that and `reg_addr`, which uses the “a” constraint (https://github.com/rust-lang/rust/pull/119431)... Note: - GCC seems to [recognize only `a0` and `a1`](`33ccc1314d/gcc/config/s390/s390.h (L428-L429)`), and using `a[2-15]` [causes errors](https://godbolt.org/z/a46vx8jjn). Given that cg_gcc has a similar problem with other architecture (https://github.com/rust-lang/rustc_codegen_gcc/issues/485), I don't feel this is a blocker for this PR, but it is worth mentioning here. - `vreg` should be able to accept `#[repr(simd)]` types as input if the `vector` target feature added in https://github.com/rust-lang/rust/pull/127506 is enabled, but core_arch has no s390x vector type and both `#[repr(simd)]` and `core::simd` are unstable, so I have not implemented it in this PR. EDIT: And supporting it is probably more complex than doing the equivalent on other architectures... https://github.com/rust-lang/rust/pull/88245#issuecomment-905559591 cc `@uweigand` r? `@Amanieu` `@rustbot` label +O-SystemZ	2024-10-01 17:32:07 +02:00
David Lattimore	f48194ea55	Replace -Z default-hidden-visibility with -Z default-visibility MCP: https://github.com/rust-lang/compiler-team/issues/782 Co-authored-by: bjorn3 <17426603+bjorn3@users.noreply.github.com>	2024-10-01 22:32:13 +10:00
Trevor Gross	acaa6cee07	Rollup merge of #130877 - taiki-e:riscv-atomic, r=Amanieu rustc_target: Add RISC-V atomic-related features This adds the following three target features to unstable riscv_target_feature. - `zaamo` (Zaamo Extension 1.0.0): Atomic Memory Operations (`amo.{w,d}{,.aq,.rl,.aqrl}`) ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/RISCV/RISCVFeatures.td#L229-L231), [available since LLVM 19](`8be079cddd`)) - `zabha` (Zabha Extension 1.0.0): Byte and Halfword Atomic Memory Operations (`amo.{b,h}{,.aq,.rl,.aqrl}`) ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/RISCV/RISCVFeatures.td#L238-L240), [available since LLVM 19](`6b7444964a`)) - `zalrsc` (Zalrsc Extension 1.0.0): Load-Reserved/Store-Conditional Instructions (`lr.{w,d}{,.aq,.rl,.aqrl}` and `sc.{w,d}{,.aq,.rl,.aqrl}`) ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/RISCV/RISCVFeatures.td#L261-L263), [available since LLVM 19](`8be079cddd`)) (Zacas Extension is not included here because it is still marked as experimental in LLVM 19 `70e7d26e56` and will become non-experimental in LLVM 20 `614aeda93b`) `a` implies `zaamo` and `zalrsc`, and `zabha` implies `zaamo`: - After Zaamo and Zalrsc Extensions are frozen, riscv-isa-manual says "The A extension comprises instructions provided by the Zaamo and Zalrsc extensions" (`e87412e621`), and [`a` implies `zaamo` and `zalrsc` in GCC](`08693e29ec/gcc/config/riscv/arch-canonicalize (L44)`). However, in LLVM, [`a` does not define them as implying `zaamo` and `zalrsc`](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/RISCV/RISCVFeatures.td#L206). - Zabha and Zaamo are in a similar situation, [riscv-isa-manual](https://github.com/riscv/riscv-isa-manual/blob/main/src/zabha.adoc) says "The Zabha extension depends upon the Zaamo standard extension", and [`zabha` implies `zaamo` in GCC](`08693e29ec/gcc/config/riscv/arch-canonicalize (L45-L46)`), but [does not in LLVM (but enabling `zabha` without `zaamo` or `a` is not allowed)](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/TargetParser/RISCVISAInfo.cpp#L776-L778). r? `@Amanieu` `@rustbot` label +O-riscv +A-target-feature	2024-09-30 19:18:49 -04:00
Ralf Jung	a78fd694d4	extend comment in global_llvm_features regarding target-cpu=native handling	2024-09-29 12:16:35 +02:00
Taiki Endo	62612af372	rustc_target: Add RISC-V atomic-related features	2024-09-28 11:26:09 +09:00
Josh Stone	4160a54dc5	Use `&raw` in the compiler Like #130865 did for the standard library, we can use `&raw` in the compiler now that stage0 supports it. Also like the other issue, I did not make any doc or test changes at this time.	2024-09-26 20:33:26 -07:00
Alice Ryhl	540e41f8b3	Add missing module flags for function-return=thunk-extern	2024-09-25 15:53:53 +02:00
Josh Stone	0999b019f8	Dogfood `feature(file_buffered)`	2024-09-24 14:25:16 -07:00
Daniel Paoliello	b2fd8a0192	Test fixing raw-dylib	2024-09-24 10:10:31 -07:00
David Wood	207bc77e15	codegen_ssa: consolidate tied feature checking `rustc_codegen_llvm` and `rustc_codegen_gcc` duplicated logic for checking if tied target features were partially enabled. This commit consolidates these checks into `rustc_codegen_ssa` in the `codegen_fn_attrs` query, which also is run pre-monomorphisation for each function, which ensures that this check is run for unused functions, as would be expected.	2024-09-24 15:48:49 +01:00
bors	4cbfcf1b7f	Auto merge of #130389 - Luv-Ray:LLVMMDNodeInContext2, r=nikic llvm: replace some deprecated functions `LLVMMDStringInContext` and `LLVMMDNodeInContext` are deprecated, replace them with `LLVMMDStringInContext2` and `LLVMMDNodeInContext2`. Also replace `Value` with `Metadata` in some function signatures for better consistency.	2024-09-24 12:07:48 +00:00
Michael Goulet	702a644b74	Check vtable projections for validity in miri	2024-09-23 19:38:26 -04:00
Luv-Ray	d7ebf9e541	format	2024-09-23 23:45:13 +08:00
bors	66b0b29e65	Auto merge of #130724 - compiler-errors:bump, r=Mark-Simulacrum Bump stage0 to beta-2024-09-22 and rustfmt to nightly-2024-09-22 I'm doing this to apply the changes to version sorting (https://github.com/rust-lang/rustfmt/pull/6284) that have occurred since rustfmt last upgraded (and a few other miscellaneous changes, like changes to expression overflowing: https://github.com/rust-lang/rustfmt/pull/6260). Eagerly updating rustfmt and formatting-the-world will ideally move some of the pressure off of the beta bump which will happen at the beginning of the next release cycle. You can verify this is correct by checking out the changes, reverting the last commit, reapplying them, and diffing the changes: ``` git fetch git@github.com:compiler-errors/rust.git bump git checkout -b bump FETCH_HEAD git reset --hard HEAD~5 ./x.py fmt --all git diff FETCH_HEAD # ignore the changes to stage0, and rustfmt.toml, # and test file changes in rustdoc-js-std, run-make. ``` Or just take my word for it? Up to the reviewer. r? release	2024-09-23 02:02:22 +00:00
bors	d14c1c75ab	Auto merge of #130680 - saethlin:module-name-to-str, r=jieyouxu Call module_name_to_str instead of just unwrapping This makes the ICE message in https://github.com/rust-lang/rust/issues/130678 more clear. It looks like not calling this function was just an oversight in https://github.com/rust-lang/rust/pull/76859, but clearly not a major one because it's taken us 4 years to notice. try-job: i686-msvc	2024-09-22 23:14:12 +00:00
Michael Goulet	c682aa162b	Reformat using the new identifier sorting from rustfmt	2024-09-22 19:11:29 -04:00
Ben Kimock	6419aeb1ec	Call module_name_to_str instead of just unwrapping	2024-09-21 18:42:51 -04:00
Folkert	5722a80782	remove `#[cmse_nonsecure_entry]`	2024-09-21 13:05:21 +02:00
Folkert de Vries	1ddd67a79a	add `C-cmse-nonsecure-entry` ABI	2024-09-21 13:04:14 +02:00
Michael Goulet	914193c8f4	Do not unnecessarily eval consts in codegen	2024-09-20 20:38:11 -04:00
Guillaume Gomez	bf6389f077	Rollup merge of #128209 - beetrees:no-macos-10.10, r=jieyouxu Remove macOS 10.10 dynamic linker bug workaround Rust's current minimum macOS version is 10.12, so the hack can be removed. This PR also updates the `remove_dir_all` docs to reflect that all supported macOS versions are protected against TOCTOU race conditions (the fallback implementation was already removed in #127683). try-job: dist-x86_64-apple try-job: dist-aarch64-apple try-job: dist-apple-various try-job: aarch64-apple try-job: x86_64-apple-1	2024-09-20 19:46:37 +02:00
Taiki Endo	fa125e2be6	Support clobber_abi and vector/access registers (clobber-only) in s390x inline assembly	2024-09-21 01:51:26 +09:00
Luv-Ray	6da2d6e026	MetadataType type cast	2024-09-19 18:56:02 +08:00
Luv-Ray	e2ec83ced9	move place	2024-09-19 18:52:09 +08:00
Luv-Ray	632342a135	wrap `LLVMSetMetadata`	2024-09-19 18:45:23 +08:00
Nicholas Nethercote	1f359405cb	Reformat some comments. So they are less than 100 chars.	2024-09-19 20:11:28 +10:00
Nicholas Nethercote	5fd16dffdc	Merge adjacent `unsafe extern "C"` blocks.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	c5af8b2722	Avoid heavy repetition in `llvm/ffi.rs`. Through judicious use of `use` and `Self`.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	3b071692cb	Remove a low-value local variable.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	ccd6c6102d	Fix a comment. I'm pretty sure `CodegenCx` applies to codegen units, rather than compilation units.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	badd8cc8f4	Reduce visibility.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	bfef2611d9	Reorder `ConstMethods`. It's crazy to have the integer methods in something close to random order. The reordering makes the gaps clear: `const_i64`, `const_i128`, `const_isize`, and `const_u16`. I guess they just aren't needed.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	fda530d729	Streamline `hidden` visibility setting. In `get_fn` there is a complicated set of if/elses to determine if `hidden` visibility should be applied. There are five calls to `LLVMRustSetVisibility` and some repetition in the comments. This commit streamlines it a bit: - Computes `hidden` and then uses it to determine if a single call to `LLVMRustSetVisibility` occurs. - Converts some of the if/elses into boolean expressions. - Removes the repetitive comments. Overall this makes it quite a bit shorter, and I find it easier to read.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	eb575506f2	Remove a low-value comment. We rarely use parameter comments, and these ones don't tell us anything interesting.	2024-09-19 20:10:42 +10:00
Nicholas Nethercote	4ce010efcf	Use a macro to factor out some repetitive code. Similar to the existing macro just above.	2024-09-19 20:10:41 +10:00
Nicholas Nethercote	0d78f1e86b	Reduce repetition in `target_is_apple`.	2024-09-19 20:10:41 +10:00
Nicholas Nethercote	9429e64c24	Streamline `report_inline_asm`. By using `use`.	2024-09-19 20:10:41 +10:00
Nicholas Nethercote	63210bd68c	Rename a parameter. This seems to be a typo. `singletree` doesn't make sense, and everywhere else it is `singlethread`.	2024-09-19 20:10:41 +10:00
Nicholas Nethercote	785a26af03	Streamline register methods. These can be made more concise, mostly through appropriate use of `use` declarations.	2024-09-19 20:10:41 +10:00
Luv-Ray	b7c5656713	replace some deprecated functions	2024-09-19 09:39:28 +08:00
Josh Stone	6fd8a50680	Update the minimum external LLVM to 18	2024-09-18 13:53:31 -07:00
Matthias Krüger	21313d7947	Rollup merge of #130457 - nnethercote:cleanup-codegen-traits, r=bjorn3 Cleanup codegen traits The traits governing codegen are quite complicated and hard to follow. This PR cleans them up a bit. r? `@bjorn3`	2024-09-18 17:49:43 +02:00
Nicholas Nethercote	acb832d640	Use associative type defaults in `{Layout,FnAbi}OfHelpers`. This avoids some repetitive boilerplate code.	2024-09-17 10:25:06 +10:00
Nicholas Nethercote	a8d22eb39e	Rename supertraits of `CodegenMethods`. Supertraits of `BuilderMethods` are all called `XyzBuilderMethods`. Supertraits of `CodegenMethods` are all called `XyzMethods`. This commit changes the latter to `XyzCodegenMethods`, for consistency.	2024-09-17 10:24:43 +10:00
Nicholas Nethercote	410a2de0c0	Rename `{ArgAbi,IntrinsicCall}Methods`. They both are part of `BuilderMethods`, and so should have `Builder` in their name like all the other traits in `BuilderMethods`.	2024-09-17 10:24:43 +10:00
Nicholas Nethercote	5f98943b5a	Merge `HasCodegen` into `BuilderMethods`. It has `Backend` and `Deref` boudns, plus an associated type `CodegenCx`, and it has a single use. This commit "inlines" it into `BuilderMethods`, which makes the complicated backend trait situation a little simpler.	2024-09-17 10:24:43 +10:00
Jubilee	68758c0560	Rollup merge of #130325 - workingjubilee:plus-minus-zero-redux, r=RalfJung,jieyouxu Use -0.0 in `intrinsics::simd::reduce_add_unordered` -0.0 is the actual neutral additive float, not +0.0, and this matters to codegen. try-job: aarch64-gnu	2024-09-15 23:51:25 -07:00
Jubilee Young	ab8c202527	Use -0.0 in `intrinsics::simd::reduce_add_unordered` -0.0 is the actual neutral additive float, not +0.0, and this matters to codegen.	2024-09-15 16:40:23 -07:00
Matthias Krüger	0daa636b93	Rollup merge of #129897 - RalfJung:soft-float-ignored, r=Urgau deprecate -Csoft-float because it is unsound (and not fixable) See https://github.com/rust-lang/rust/issues/129893 for details. The general sentiment there seems to be that this flag has no use and sound alternatives exist, so let's add this warning and see if anyone out there disagrees. Also show a different warning on targets where it does nothing (as documented since https://github.com/rust-lang/rust/pull/36261): it seems to correspond to `-mfloat-abi` in GCC/clang, which is an ARM-specific option. To be really sure it does nothing, only forward the flag to LLVM for eabihf targets. This should not change behavior but makes me sleep better ;)	2024-09-15 20:55:12 +02:00
Ralf Jung	60ee1b7ac6	simd_shuffle: require index argument to be a vector	2024-09-14 14:43:24 +02:00
bors	5e842953cc	Auto merge of #130052 - khuey:clear-dilocation-after-const-emission, r=michaelwoerister Don't leave debug locations for constants sitting on the builder indefinitely Because constants are currently emitted before the prologue, leaving the debug location on the IRBuilder spills onto other instructions in the prologue and messes up both line numbers as well as the point LLVM chooses to be the prologue end. Example LLVM IR (irrelevant IR elided): Before: ``` define internal { i64, i64 } `@_ZN3tmp3Foo18var_return_opt_try17he02116165b0fc08cE(ptr` align 8 %self) !dbg !347 { start: %self.dbg.spill = alloca [8 x i8], align 8 %_0 = alloca [16 x i8], align 8 %residual.dbg.spill = alloca [0 x i8], align 1 #dbg_declare(ptr %residual.dbg.spill, !353, !DIExpression(), !357) store ptr %self, ptr %self.dbg.spill, align 8, !dbg !357 #dbg_declare(ptr %self.dbg.spill, !350, !DIExpression(), !358) ``` After: ``` define internal { i64, i64 } `@_ZN3tmp3Foo18var_return_opt_try17h00b17d08874ddd90E(ptr` align 8 %self) !dbg !347 { start: %self.dbg.spill = alloca [8 x i8], align 8 %_0 = alloca [16 x i8], align 8 %residual.dbg.spill = alloca [0 x i8], align 1 #dbg_declare(ptr %residual.dbg.spill, !353, !DIExpression(), !357) store ptr %self, ptr %self.dbg.spill, align 8 #dbg_declare(ptr %self.dbg.spill, !350, !DIExpression(), !358) ``` Note in particular how !357 from %residual.dbg.spill's dbg_declare no longer falls through onto the store to %self.dbg.spill. This fixes argument values at entry when the constant is a ZST (e.g. `<Option as Try>::Residual`). This fixes #130003 (but note that it does not fix issues with argument values and non-ZST constants, which emit their own stores that have debug info on them, like #128945). r? `@michaelwoerister`	2024-09-13 08:57:41 +00:00
Stuart Cook	3ba12756d3	Rollup merge of #130235 - compiler-errors:nested-if, r=michaelwoerister Simplify some nested `if` statements Applies some but not all instances of `clippy::collapsible_if`. Some ended up looking worse afterwards, though, so I left those out. Also applies instances of `clippy::collapsible_else_if` Review with whitespace disabled please.	2024-09-12 20:37:16 +10:00
bors	1f51450c68	Auto merge of #117465 - paulmenage:small-data-limit, r=compiler-errors Add -Z small-data-threshold This flag allows specifying the threshold size above which LLVM should not consider placing small objects in a `.sdata` or `.sbss` section. Support is indicated in the target options via the small-data-threshold-support target option, which can indicate either an LLVM argument or an LLVM module flag. To avoid duplicate specifications in a large number of targets, the default value for support is DefaultForArch, which is translated to a concrete value according to the target's architecture.	2024-09-12 04:27:08 +00:00
Jubilee	a31a8fe0cf	Rollup merge of #130114 - eduardosm:needless-returns, r=compiler-errors Remove needless returns detected by clippy in the compiler	2024-09-11 15:53:22 -07:00
Michael Goulet	954419aab0	Simplify some nested if statements	2024-09-11 13:45:23 -04:00
Paul Menage	3810386bbe	Add -Z small-data-threshold This flag allows specifying the threshold size above which LLVM should not consider placing small objects in a .sdata or .sbss section. Support is indicated in the target options via the small-data-threshold-support target option, which can indicate either an LLVM argument or an LLVM module flag. To avoid duplicate specifications in a large number of targets, the default value for support is DefaultForArch, which is translated to a concrete value according to the target's architecture.	2024-09-10 12:19:16 -07:00
Jubilee	88a2c62652	Rollup merge of #129981 - nnethercote:rm-serialize_bitcode, r=antoyo,tmiasko Remove `serialized_bitcode` from `LtoModuleCodegen`. It's unused. r? ``@bjorn3``	2024-09-09 19:20:36 -07:00
Eduardo Sánchez Muñoz	0b20ffcb63	Remove needless returns detected by clippy in the compiler	2024-09-09 13:32:22 +02:00
Nicholas Nethercote	bbe28cf1d9	Remove `serialized_bitcode` from `LtoModuleCodegen`. It's unused.	2024-09-09 09:00:50 +10:00
bors	12b26c13fb	Auto merge of #129941 - BoxyUwU:bump-boostrap, r=albertlarsan68 Bump boostrap compiler to new beta Accidentally left some comments on the update cfgs commit directly xd	2024-09-07 20:37:30 +00:00
Michael Goulet	bc2244f027	Rollup merge of #129940 - liushuyu:s390x-target-features, r=RalfJung s390x: Fix a regression related to backchain feature In #127506, we introduced a new IBM Z-specific target feature, `backchain`. This particular `target-feature` was available as a function-level attribute in LLVM 17 and below, so some hacks were used to avoid blowing up LLVM when querying the supported LLVM features. This led to an unfortunate regression where `cfg!(target-feature = "backchain")` will always return true. This pull request aims to fix this issue, and a test has been introduced to ensure it will never happen again. Fixes #129927. r? `@RalfJung`	2024-09-07 14:21:22 +03:00
Michael Goulet	6dd07e4e26	Rollup merge of #129891 - nikic:naked-no-san, r=jackh726 Do not request sanitizers for naked functions Naked functions can only contain inline asm, so any instrumentation inserted by sanitizers is illegal. Don't request it. Fixes https://github.com/rust-lang/rust/issues/129224.	2024-09-07 14:21:21 +03:00
Kyle Huey	7ed9f945a2	Don't leave debug locations for constants sitting on the builder indefinitely. Because constants are currently emitted before the prologue, leaving the debug location on the IRBuilder spills onto other instructions in the prologue and messes up both line numbers as well as the point LLVM chooses to be the prologue end. Example LLVM IR (irrelevant IR elided): Before: define internal { i64, i64 } @_ZN3tmp3Foo18var_return_opt_try17he02116165b0fc08cE(ptr align 8 %self) !dbg !347 { start: %self.dbg.spill = alloca [8 x i8], align 8 %_0 = alloca [16 x i8], align 8 %residual.dbg.spill = alloca [0 x i8], align 1 #dbg_declare(ptr %residual.dbg.spill, !353, !DIExpression(), !357) store ptr %self, ptr %self.dbg.spill, align 8, !dbg !357 #dbg_declare(ptr %self.dbg.spill, !350, !DIExpression(), !358) After: define internal { i64, i64 } @_ZN3tmp3Foo18var_return_opt_try17h00b17d08874ddd90E(ptr align 8 %self) !dbg !347 { start: %self.dbg.spill = alloca [8 x i8], align 8 %_0 = alloca [16 x i8], align 8 %residual.dbg.spill = alloca [0 x i8], align 1 #dbg_declare(ptr %residual.dbg.spill, !353, !DIExpression(), !357) store ptr %self, ptr %self.dbg.spill, align 8 #dbg_declare(ptr %self.dbg.spill, !350, !DIExpression(), !358) Note in particular how !357 from %residual.dbg.spill's dbg_declare no longer falls through onto the store to %self.dbg.spill. This fixes argument values at entry when the constant is a ZST (e.g. <Option as Try>::Residual). This fixes #130003 (but note that it does not fix issues with argument values and non-ZST constants, which emit their own stores that have debug info on them, like #128945).	2024-09-06 23:12:18 +00:00
Nikita Popov	54ebb9d489	Do not request sanitizers for naked functions Naked functions can only contain inline asm, so any instrumentation inserted by sanitizers is illegal. Don't request it. Fixes https://github.com/rust-lang/rust/issues/129224.	2024-09-06 14:11:13 +02:00
Matthias Krüger	0180b8fff0	Rollup merge of #129969 - GrigorenkoPV:boxed-ty, r=compiler-errors Make `Ty::boxed_ty` return an `Option` Looks like a good place to use Rust's type system. --- Most of `4ac7bcbaad/compiler/rustc_middle/src/ty/sty.rs (L971-L1963)` looks like it could be moved to `TyKind` (then I guess `Ty` should be made to deref to `TyKind`).	2024-09-06 07:33:58 +02:00
bors	54fdef7799	Auto merge of #121614 - clubby789:no-expect, r=saethlin Don't emit `expect`/`assume` in opt-level=0 LLVM does not make use of expect/assume calls in `opt-level=0`, so we can simplify IR by not emitting them in this case.	2024-09-06 00:42:58 +00:00
Pavel Grigorenko	f6e8a84eea	Make `Ty::boxed_ty` return an `Option`	2024-09-06 00:30:36 +03:00
Matthias Krüger	b89ee99d57	Rollup merge of #128820 - LYF1999:yf/dev, r=nikic fix: get llvm type of global val using `LLVMTypeOf` on a global var always return ptr. so create a new function to access the value type of a global	2024-09-05 18:58:53 +02:00
Boxy	0091b8ab2a	update cfgs	2024-09-05 17:24:01 +01:00
beetrees	0444056aa3	Remove macOS 10.10 dynamic linker bug workaround	2024-09-04 13:13:48 +01:00
clubby789	5b96ae7106	Don't codegen `expect` in opt-level=0	2024-09-04 11:49:00 +00:00
liushuyu	e98e88bfdf	rustc_codegen_llvm: fix a regression where backchain feature ... ... can not be correctly gated using #[cfg] macro	2024-09-03 12:42:57 -06:00
Ralf Jung	df38e644ce	deprecate -Csoft-float because it is unsound (and not fixable)	2024-09-03 12:19:50 +02:00
Alexander Cyon	ac69544a17	chore: Fix typos in 'compiler' (batch 1)	2024-09-02 07:42:38 +02:00
Ralf Jung	d0aedfbb90	interpret, codegen: tweak some comments and checks regarding Box with custom allocator	2024-08-31 11:29:02 +02:00
Guillaume Gomez	d5c40d03dc	Rollup merge of #128970 - DianQK:lint-llvm-ir, r=nikic Add `-Zlint-llvm-ir` This flag is similar to `-Zverify-llvm-ir` and allows us to lint the generated IR. r? compiler	2024-08-29 16:21:47 +02:00
DianQK	9589eb95d2	Add `-Zlint-llvm-ir`	2024-08-29 18:12:31 +08:00
Jubilee	4c8c9e092d	Rollup merge of #128192 - mrkajetanp:feature-detect, r=Amanieu rustc_target: Add various aarch64 features Add various aarch64 features already supported by LLVM and Linux. Additionally include some comment fixes to ensure consistency of feature names with the Arm ARM. Compiler support for features added to stdarch by https://github.com/rust-lang/stdarch/pull/1614. Tracking issue for unstable aarch64 features is https://github.com/rust-lang/rust/issues/127764. List of added features: - FEAT_CSSC - FEAT_ECV - FEAT_FAMINMAX - FEAT_FLAGM2 - FEAT_FP8 - FEAT_FP8DOT2 - FEAT_FP8DOT4 - FEAT_FP8FMA - FEAT_HBC - FEAT_LSE128 - FEAT_LSE2 - FEAT_LUT - FEAT_MOPS - FEAT_LRCPC3 - FEAT_SVE_B16B16 - FEAT_SVE2p1 - FEAT_WFxT - FEAT_SME - FEAT_SME_F16F16 - FEAT_SME_F64F64 - FEAT_SME_F8F16 - FEAT_SME_F8F32 - FEAT_SME_FA64 - FEAT_SME_I16I64 - FEAT_SME_LUTv2 - FEAT_SME2 - FEAT_SME2p1 - FEAT_SSVE_FP8DOT2 - FEAT_SSVE_FP8DOT4 - FEAT_SSVE_FP8FMA FEAT_FPMR is added in the first commit and then removed in a separate one to highlight it being removed from upstream LLVM 19. The intention is for it to be detectable at runtime through stdarch but not have a corresponding Rust compile-time feature.	2024-08-28 19:12:49 -07:00
Zalathar	46e1b5b6dd	coverage: Rename `CodeRegion` to `SourceRegion` LLVM uses the word "code" to refer to a particular kind of coverage mapping. This unrelated usage of the word is confusing, and makes it harder to introduce types whose names correspond to the LLVM classification of coverage kinds.	2024-08-28 22:17:42 +10:00
Matthias Krüger	3299e30abc	Rollup merge of #129635 - compiler-errors:unsafe-blocks, r=spastorino Use unsafe extern blocks throughout the compiler Making this change in preparation for edition 2024. r? spastorino	2024-08-27 18:59:28 +02:00
Kajetan Puchalski	3a0fbb5d4e	rustc_codegen_llvm: Filter out unavailable LLVM features Convert to_llvm_features to return Option<LLVMFeature> so that it can return None if the requested feature is not available for the current LLVM version. Add match rules to filter out aarch64 features not available in LLVM 17.	2024-08-27 11:13:01 +01:00
Kajetan Puchalski	4fc4019cbc	rustc_target: Remove fpmr target feature FEAT_FPMR has been removed from upstream LLVM as of LLVM 19. Remove the feature from the target features list and temporarily hack the LLVM codegen to always enable it until the minimum LLVM version is bumped to 19.	2024-08-27 11:11:47 +01:00
Kajetan Puchalski	4f847bd326	rustc_target: Add various aarch64 features Add various aarch64 features already supported by LLVM and Linux. The features are marked as unstable using a newly added symbol, i.e. aarch64_unstable_target_feature. Additionally include some comment fixes to ensure consistency of feature names with the Arm ARM and support for architecture version target features up to v9.5a. This commit adds compiler support for the following features: - FEAT_CSSC - FEAT_ECV - FEAT_FAMINMAX - FEAT_FLAGM2 - FEAT_FP8 - FEAT_FP8DOT2 - FEAT_FP8DOT4 - FEAT_FP8FMA - FEAT_FPMR - FEAT_HBC - FEAT_LSE128 - FEAT_LSE2 - FEAT_LUT - FEAT_MOPS - FEAT_LRCPC3 - FEAT_SVE_B16B16 - FEAT_SVE2p1 - FEAT_WFxT	2024-08-27 11:11:47 +01:00
Trevor Gross	8ea70e9537	Rollup merge of #129536 - beetrees:f16-f128-inline-asm-aarch64, r=Amanieu Add `f16` and `f128` inline ASM support for `aarch64` Adds `f16` and `f128` inline ASM support for `aarch64`. SIMD vector types are taken from [the ARM intrinsics list](https://developer.arm.com/architectures/instruction-sets/intrinsics/#f:`@navigationhierarchiesreturnbasetype=[float]&f:@navigationhierarchieselementbitsize=[16]&f:@navigationhierarchiesarchitectures=[A64]).` Based on the work of `@lengrongfu` in #127043. Relevant issue: #125398 Tracking issue: #116909 `@rustbot` label +F-f16_and_f128 try-job: aarch64-gnu try-job: aarch64-apple	2024-08-27 01:46:53 -05:00
Trevor Gross	d2ff033302	Rollup merge of #128731 - RalfJung:simd-shuffle-vector, r=workingjubilee simd_shuffle intrinsic: allow argument to be passed as vector See https://github.com/rust-lang/rust/issues/128738 for context. I'd like to get rid of [this hack](`6c0b89dfac/compiler/rustc_codegen_ssa/src/mir/block.rs (L922-L935)`). https://github.com/rust-lang/rust/pull/128537 almost lets us do that since constant SIMD vectors will then be passed as immediate arguments. However, simd_shuffle for some reason actually takes an array as argument, not a vector, so the hack is still required to ensure that the array becomes an immediate (which then later stages of codegen convert into a vector, as that's what LLVM needs). This PR prepares simd_shuffle to also support a vector as the `idx` argument. Once this lands, stdarch can hopefully be updated to pass `idx` as a vector, and then support for arrays can be removed, which finally lets us get rid of that hack.	2024-08-27 01:46:50 -05:00
Trevor Gross	9c26ebe32e	Rollup merge of #126985 - Mrmaxmeier:dwarf-embed-source, r=davidtwco Implement `-Z embed-source` (DWARFv5 source code embedding extension) Implement https://github.com/rust-lang/compiler-team/issues/764 MCP which adds an unstable flag that exposes LLVM's [DWARFv5 source code embedding](https://dwarfstd.org/issues/180201.1.html) support.	2024-08-27 01:46:49 -05:00
Michael Goulet	38e62b9841	Use unsafe extern blocks throughout the compiler	2024-08-26 19:51:05 -04:00
Matthias Krüger	110c3df7fd	Rollup merge of #126013 - nnethercote:unreachable_pub, r=Urgau Add `#[warn(unreachable_pub)]` to a bunch of compiler crates By default `unreachable_pub` identifies things that need not be `pub` and tells you to make them `pub(crate)`. But sometimes those things don't need any kind of visibility. So they way I did these was to remove the visibility entirely for each thing the lint identifies, and then add `pub(crate)` back in everywhere the compiler said it was necessary. (Or occasionally `pub(super)` when context suggested that was appropriate.) Tedious, but results in more `pub` removal. There are plenty more crates to do but this seems like enough for a first PR. r? `@compiler-errors`	2024-08-27 00:41:57 +02:00
beetrees	abd44fc5f4	Add `f16` and `f128` inline ASM support for `aarch64`	2024-08-25 00:13:25 +01:00
Sami Tolvanen	40f1d9d154	Add missing module flags for CFI and KCFI sanitizers Set the cfi-normalize-integers and kcfi-offset module flags when Control-Flow Integrity sanitizers are used, so functions generated by the LLVM backend use the same CFI/KCFI options as rustc. cfi-normalize-integers tells LLVM to also use integer normalization for generated functions when -Zsanitizer-cfi-normalize-integers is used. kcfi-offset specifies the number of prefix nops between the KCFI type hash and the function entry when -Z patchable-function-entry is used. Note that LLVM assumes all indirectly callable functions use the same number of prefix NOPs with -Zsanitizer=kcfi.	2024-08-21 20:23:56 +00:00
Matthias Krüger	e961d6b204	Rollup merge of #129332 - cuviper:cstr-cast, r=compiler-errors Avoid extra `cast()`s after `CStr::as_ptr()` These used to be `&str` literals that did need a pointer cast, but that became a no-op after switching to `c""` literals in #118566.	2024-08-21 18:15:04 +02:00
Matthias Krüger	dea325e583	Rollup merge of #128627 - khuey:DUMMY_SP-line-no, r=nnethercote Special case DUMMY_SP to emit line 0/column 0 locations on DWARF platforms. Line 0 has a special meaning in DWARF. From the version 5 spec: The compiler may emit the value 0 in cases where an instruction cannot be attributed to any source line. DUMMY_SP spans cannot be attributed to any line. However, because rustc internally stores line numbers starting at zero, lookup_debug_loc() adjusts every line number by one. Special casing DUMMY_SP to actually emit line 0 ensures rustc communicates to the debugger that there's no meaningful source code for this instruction, rather than telling the debugger to jump to line 1 randomly.	2024-08-21 18:15:01 +02:00
Josh Stone	e424e7fcaa	Avoid extra `cast()`s after `CStr::as_ptr()` These used to be `&str` literals that did need a pointer cast, but that became a no-op after switching to `c""` literals in #118566.	2024-08-20 14:04:48 -07:00
Kyle Huey	4e9725cd2f	Add a comment.	2024-08-19 17:13:30 -07:00
Trevor Gross	f69e74e2f5	Update some dependency versions that allow better licensing With the new resolver, a few dependencies get brought in twice with different licenses. For example, all dependencies from `wasm-tools` gained Apache-2.0 and MIT options, and with the v2 resolver we were using one version from before and one version from after this change. This made tidy's license check difficult. Update some minimum versions to remove duplicate dependencies and smooth out license checking.	2024-08-18 13:59:27 -05:00
许杰友 Jieyou Xu (Joe)	42b54a98b6	Rollup merge of #129173 - beetrees:statically-known-float, r=compiler-errors Fix `is_val_statically_known` for floats The LLVM intrinsic name for floats differs from the LLVM type name, so handle them explicitly. Also adds support for `f16` and `f128`. `f16`/`f128` tracking issue: #116909	2024-08-18 14:55:22 +08:00
Chris Denton	0156eb57a1	Always use ar_archive_writer for import libs	2024-08-17 19:10:46 +00:00
beetrees	9bc7cea412	Fix `is_val_statically_known` for floats	2024-08-17 02:14:23 +01:00
Nicholas Nethercote	61627438eb	Add `warn(unreachable_pub)` to `rustc_codegen_llvm`.	2024-08-16 08:46:57 +10:00
bors	d2b5aa6552	Auto merge of #128936 - bjorn3:fix_thin_archive_reading, r=jieyouxu Support reading thin archives in ArArchiveBuilder And switch to using ArArchiveBuilder with the LLVM backend too now that all regressions are fixed. Fixes https://github.com/rust-lang/rust/issues/107407 Fixes https://github.com/rust-lang/rust/issues/107162 https://github.com/rust-lang/rust/issues/107495 has been fixed in a previous PR already.	2024-08-15 14:13:52 +00:00
bors	3139ff09e9	Auto merge of #128861 - khuey:mir-inlining-parameters-debuginfo, r=wesleywiser Rework MIR inlining debuginfo so function parameters show up in debuggers. Line numbers of multiply-inlined functions were fixed in #114643 by using a single DISubprogram. That, however, triggered assertions because parameters weren't deduplicated. The "solution" to that in #115417 was to insert a DILexicalScope below the DISubprogram and parent all of the parameters to that scope. That fixed the assertion, but debuggers (including gdb and lldb) don't recognize variables that are not parented to the subprogram itself as parameters, even if they are emitted with DW_TAG_formal_parameter. Consider the program: ```rust use std::env; #[inline(always)] fn square(n: i32) -> i32 { n * n } #[inline(never)] fn square_no_inline(n: i32) -> i32 { n * n } fn main() { let x = square(env::vars().count() as i32); let y = square_no_inline(env::vars().count() as i32); println!("{x} == {y}"); } ``` When making a release build with debug=2 and rustc 1.82.0-nightly (`8b3870784` 2024-08-07) ``` (gdb) r Starting program: /ephemeral/tmp/target/release/tmp [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Breakpoint 1, tmp::square () at src/main.rs:5 5 n * n (gdb) info args No arguments. (gdb) info locals n = 31 (gdb) c Continuing. Breakpoint 2, tmp::square_no_inline (n=31) at src/main.rs:10 10 n * n (gdb) info args n = 31 (gdb) info locals No locals. ``` This issue is particularly annoying because it removes arguments from stack traces. The DWARF for the inlined function looks like this: ``` < 2><0x00002132 GOFF=0x00002132> DW_TAG_subprogram DW_AT_linkage_name _ZN3tmp6square17hc507052ff3d2a488E DW_AT_name square DW_AT_decl_file 0x0000000f /ephemeral/tmp/src/main.rs DW_AT_decl_line 0x00000004 DW_AT_type 0x00001a56<.debug_info+0x00001a56> DW_AT_inline DW_INL_inlined < 3><0x00002142 GOFF=0x00002142> DW_TAG_lexical_block < 4><0x00002143 GOFF=0x00002143> DW_TAG_formal_parameter DW_AT_name n DW_AT_decl_file 0x0000000f /ephemeral/tmp/src/main.rs DW_AT_decl_line 0x00000004 DW_AT_type 0x00001a56<.debug_info+0x00001a56> < 4><0x0000214e GOFF=0x0000214e> DW_TAG_null < 3><0x0000214f GOFF=0x0000214f> DW_TAG_null ``` That DW_TAG_lexical_block inhibits every debugger I've tested from recognizing 'n' as a parameter. This patch removes the additional lexical scope. Parameters can be easily deduplicated by a tuple of their scope and the argument index, at the trivial cost of taking a Hash + Eq bound on DIScope.	2024-08-15 11:42:15 +00:00
bors	026e9ed3f0	Auto merge of #128037 - beetrees:repr128-c-style-use-natvis, r=michaelwoerister Use the `enum2$` Natvis visualiser for repr128 C-style enums Use the preexisting `enum2$` Natvis visualiser to allow PDB debuggers to display fieldless `#[repr(u128)]]`/`#[repr(i128)]]` enums correctly. Tracking issue: #56071 try-job: x86_64-msvc	2024-08-15 09:17:24 +00:00
bjorn3	9de0d147f4	Unconditionally use the LLVM symbol reader This may fix a linker error on MSVC	2024-08-14 16:50:48 +00:00
bors	e9c965df7b	Auto merge of #128812 - nnethercote:shrink-TyKind-FnPtr, r=compiler-errors Shrink `TyKind::FnPtr`. By splitting the `FnSig` within `TyKind::FnPtr` into `FnSigTys` and `FnHeader`, which can be packed more efficiently. This reduces the size of the hot `TyKind` type from 32 bytes to 24 bytes on 64-bit platforms. This reduces peak memory usage by a few percent on some benchmarks. It also reduces cache misses and page faults similarly, though this doesn't translate to clear cycles or wall-time improvements on CI. r? `@compiler-errors`	2024-08-14 00:56:53 +00:00
beetrees	fe4fa2f1da	Use the `enum2$` Natvis visualiser for repr128 C-style enums	2024-08-13 19:53:21 +01:00
Ralf Jung	194baa820d	simd_shuffle intrinsic: allow argument to be passed as vector (not just as array)	2024-08-13 07:51:17 +02:00
Kyle Huey	1c5e3c90cf	Rework MIR inlining debuginfo so function parameters show up in debuggers. Line numbers of multiply-inlined functions were fixed in #114643 by using a single DISubprogram. That, however, triggered assertions because parameters weren't deduplicated. The "solution" to that in #115417 was to insert a DILexicalScope below the DISubprogram and parent all of the parameters to that scope. That fixed the assertion, but debuggers (including gdb and lldb) don't recognize variables that are not parented to the subprogram itself as parameters, even if they are emitted with DW_TAG_formal_parameter. Consider the program: use std::env; fn square(n: i32) -> i32 { n * n } fn square_no_inline(n: i32) -> i32 { n * n } fn main() { let x = square(env::vars().count() as i32); let y = square_no_inline(env::vars().count() as i32); println!("{x} == {y}"); } When making a release build with debug=2 and rustc 1.82.0-nightly (`8b3870784` 2024-08-07) (gdb) r Starting program: /ephemeral/tmp/target/release/tmp [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Breakpoint 1, tmp::square () at src/main.rs:5 5 n * n (gdb) info args No arguments. (gdb) info locals n = 31 (gdb) c Continuing. Breakpoint 2, tmp::square_no_inline (n=31) at src/main.rs:10 10 n * n (gdb) info args n = 31 (gdb) info locals No locals. This issue is particularly annoying because it removes arguments from stack traces. The DWARF for the inlined function looks like this: < 2><0x00002132 GOFF=0x00002132> DW_TAG_subprogram DW_AT_linkage_name _ZN3tmp6square17hc507052ff3d2a488E DW_AT_name square DW_AT_decl_file 0x0000000f /ephemeral/tmp/src/main.rs DW_AT_decl_line 0x00000004 DW_AT_type 0x00001a56<.debug_info+0x00001a56> DW_AT_inline DW_INL_inlined < 3><0x00002142 GOFF=0x00002142> DW_TAG_lexical_block < 4><0x00002143 GOFF=0x00002143> DW_TAG_formal_parameter DW_AT_name n DW_AT_decl_file 0x0000000f /ephemeral/tmp/src/main.rs DW_AT_decl_line 0x00000004 DW_AT_type 0x00001a56<.debug_info+0x00001a56> < 4><0x0000214e GOFF=0x0000214e> DW_TAG_null < 3><0x0000214f GOFF=0x0000214f> DW_TAG_null That DW_TAG_lexical_block inhibits every debugger I've tested from recognizing 'n' as a parameter. This patch removes the additional lexical scope. Parameters can be easily deduplicated by a tuple of their scope and the argument index, at the trivial cost of taking a Hash + Eq bound on DIScope.	2024-08-12 19:20:00 -07:00
Guillaume Gomez	7c6dca9050	Rollup merge of #128978 - compiler-errors:assert-matches, r=jieyouxu Use `assert_matches` around the compiler more It's a useful assertion, especially since it actually prints out the LHS.	2024-08-12 17:09:19 +02:00
Guillaume Gomez	aea5087964	Rollup merge of #128537 - Jamesbarford:118980-const-vector, r=RalfJung,nikic const vector passed through to codegen This allows constant vectors using a repr(simd) type to be propagated through to the backend by reusing the functionality used to do a similar thing for the simd_shuffle intrinsic #118209 r? RalfJung	2024-08-12 17:09:15 +02:00
Guillaume Gomez	095ca33bb6	Rollup merge of #128149 - RalfJung:nontemporal_store, r=jieyouxu,Amanieu,Jubilee nontemporal_store: make sure that the intrinsic is truly just a hint The `!nontemporal` flag for stores in LLVM sounds like it is just a hint, but actually, it is not -- at least on x86, non-temporal stores need very special treatment by the programmer or else the Rust memory model breaks down. LLVM still treats these stores as-if they were normal stores for optimizations, which is [highly dubious](https://github.com/llvm/llvm-project/issues/64521). Let's avoid all that dubiousness by making our own non-temporal stores be truly just a hint, which is possible on some targets (e.g. ARM). On all other targets, non-temporal stores become regular stores. ~~Blocked on https://github.com/rust-lang/stdarch/pull/1541 propagating to the rustc repo, to make sure the `_mm_stream` intrinsics are unaffected by this change.~~ Fixes https://github.com/rust-lang/rust/issues/114582 Cc `@Amanieu` `@workingjubilee`	2024-08-12 17:09:14 +02:00
bors	e08b80c0fb	Auto merge of #128371 - andjo403:rangeAttribute, r=nikic Add range attribute to scalar function results and arguments as LLVM 19 adds the range attribute this starts to use it for better optimization. hade been interesting to see a perf run with the https://github.com/rust-lang/rust/pull/127513 closes https://github.com/rust-lang/rust/issues/50156 cc https://github.com/rust-lang/rust/issues/49572 shall be fixed but not possible to see as there is asserts that already trigger the optimization.	2024-08-12 10:20:00 +00:00
Ralf Jung	75743dc5a0	make the codegen test also cover an ill-behaved arch, and add links	2024-08-12 11:42:38 +02:00
Andreas Jonson	cfadfabfcd	Add range attribute to scalar function results and arguments	2024-08-11 19:40:44 +02:00
Michael Goulet	c361c924a0	Use assert_matches around the compiler	2024-08-11 12:25:39 -04:00
bjorn3	db68a19b61	Fix review comments and other improvements	2024-08-11 10:29:32 +00:00
bjorn3	d63a067bfd	Add fixme for removing LlvmArchiveBuilder in the future	2024-08-10 18:49:36 +00:00
bjorn3	c1f5350df5	Use ArArchiveBuilder with the LLVM backend too All regressions that were blocking usage of ArArchiveBuilder should now be fixed.	2024-08-10 17:45:39 +00:00
Nicholas Nethercote	c4717cc9d1	Shrink `TyKind::FnPtr`. By splitting the `FnSig` within `TyKind::FnPtr` into `FnSigTys` and `FnHeader`, which can be packed more efficiently. This reduces the size of the hot `TyKind` type from 32 bytes to 24 bytes on 64-bit platforms. This reduces peak memory usage by a few percent on some benchmarks. It also reduces cache misses and page faults similarly, though this doesn't translate to clear cycles or wall-time improvements on CI.	2024-08-09 14:33:25 +10:00
yifei	27f92b6c10	fix: get llvm type of global val	2024-08-09 10:58:32 +08:00
Michael Goulet	b916431976	Rename struct_tail_erasing_lifetimes to struct_tail_for_codegen	2024-08-08 12:15:16 -04:00
Michael Goulet	85b5e42d5e	Do normalize when computing struct tails in codegen	2024-08-08 11:58:11 -04:00
James Barford-Evans	27ca35aa1b	const vector passed to codegen	2024-08-08 11:15:03 +01:00
Matthias Krüger	8f39b86a6a	Rollup merge of #128679 - RalfJung:codegen-fn-attrs, r=nikic codegen: better centralize function declaration attribute computation For some reason, the codegen backend has two functions that compute which attributes a function declaration gets: `apply_attrs_llfn` and `attributes::from_fn_attrs`. They are called in different places, on entirely different layers of abstraction. To me the code seems cleaner if we centralize this entirely in `apply_attrs_llfn`, so that's what this PR does.	2024-08-07 20:28:18 +02:00
Matthias Krüger	904f5795a0	Rollup merge of #128221 - calebzulawski:implied-target-features, r=Amanieu Add implied target features to target_feature attribute See [zulip](https://rust-lang.zulipchat.com/#narrow/stream/208962-t-libs.2Fstdarch/topic/Why.20would.20target-feature.20include.20implied.20features.3F) for some context. Adds implied target features, e.g. `#[target_feature(enable = "avx2")]` acts like `#[target_feature(enable = "avx2,avx,sse4.2,sse4.1...")]`. Fixes #128125, fixes #128426 The implied feature sets are taken from [the rust reference](https://doc.rust-lang.org/reference/attributes/codegen.html?highlight=target-fea#x86-or-x86_64), there are certainly more features and targets to add. Please feel free to reassign this to whoever should review it. r? ``@Amanieu``	2024-08-07 20:28:16 +02:00
Ralf Jung	273c67db83	codegen: better centralize function attribute computation	2024-08-07 19:49:48 +02:00
Guillaume Gomez	355eb9c79f	Rollup merge of #128206 - bjorn3:import_lib_writing_refactor, r=jieyouxu Make create_dll_import_lib easier to implement This will make it easier to implement raw-dylib support in cg_clif and cg_gcc. This PR doesn't yet include an create_dll_import_lib implementation for cg_clif as I need to correctly implement dllimport in cg_clif first before raw-dylib can work at all with cg_clif. Required for https://github.com/rust-lang/rustc_codegen_cranelift/issues/1345	2024-08-07 15:59:35 +02:00
Caleb Zulawski	8818c95528	Disallow enabling features without their implied features	2024-08-07 00:45:00 -04:00
Caleb Zulawski	83276f5680	Hide implicit target features from diagnostics when possible	2024-08-07 00:43:52 -04:00
Caleb Zulawski	6b96a60611	Add implied features to non-target-feature functions	2024-08-07 00:41:48 -04:00
Caleb Zulawski	5006711744	Remove redundant implied features	2024-08-07 00:41:48 -04:00
Caleb Zulawski	a25da077cf	Don't use LLVM to compute -Ctarget-feature	2024-08-07 00:41:48 -04:00
Caleb Zulawski	484aca8857	Don't use LLVM's target features	2024-08-07 00:41:48 -04:00
Caleb Zulawski	fbd618d4aa	Refactor and fill out target feature lists	2024-08-07 00:41:48 -04:00
Caleb Zulawski	74653b61a6	Add implied target features to target_feature attribute	2024-08-07 00:41:48 -04:00
Trevor Gross	b3bfd66627	Rollup merge of #128417 - tgross35:f16-f128-math, r=dtolnay Add `f16` and `f128` math functions This adds intrinsics and math functions for `f16` and `f128` floating point types. Support is quite limited and some things are broken so tests don't run on many platforms, but this provides a starting point.	2024-08-06 22:17:32 -05:00
Ralf Jung	697787a92d	RISC-V also has sane nontemporal stores	2024-08-05 10:57:14 +02:00
Ralf Jung	28e0907111	nontemporal_store: make sure that the intrinsic is truly just a hint	2024-08-05 10:57:14 +02:00
Kyle Huey	5dc4a1969c	Fix warning.	2024-08-04 06:09:55 -07:00
Kyle Huey	e587855538	Use Span::is_dummy().	2024-08-04 05:26:50 -07:00
daxpedda	80b74d397f	Implement a implicit target feature mechanism	2024-08-04 08:44:23 +02:00
Kyle Huey	78caecf8f3	Special case DUMMY_SP to emit line 0/column 0 locations on DWARF platforms. Line 0 has a special meaning in DWARF. From the version 5 spec: The compiler may emit the value 0 in cases where an instruction cannot be attributed to any source line. DUMMY_SP spans cannot be attributed to any line. However, because rustc internally stores line numbers starting at zero, lookup_debug_loc() adjusts every line number by one. Special casing DUMMY_SP to actually emit line 0 ensures rustc communicates to the debugger that there's no meaningful source code for this instruction, rather than telling the debugger to jump to line 1 randomly.	2024-08-03 21:18:52 -07:00
Trevor Gross	e6d570241f	Specify the integer type of the `powi` LLVM intrinsic Since LLVM <https://reviews.llvm.org/D99439> (4c7f820b2b20, "Update @llvm.powi to handle different int sizes for the exponent"), the size of the integer can be specified for the `powi` intrinsic. Make use of this so it is more obvious that integer size is consistent across all float types. This feature is available since LLVM 13 (October 2021). Based on bootstrap we currently support >= 17.0, so there should be no support problems.	2024-08-01 15:36:15 -04:00
Matthias Krüger	75dfe1e63d	Rollup merge of #127830 - tgross35:archive-failure-message, r=BoxyUwU When an archive fails to build, print the path Currently the output on failure is as follows: Compiling block-buffer v0.10.4 Compiling crypto-common v0.1.6 Compiling digest v0.10.7 Compiling sha2 v0.10.8 Compiling xz2 v0.1.7 error: failed to build archive: No such file or directory error: could not compile `bootstrap` (lib) due to 1 previous error Change this to print which file is being constructed, to give some hint about what is going on. error: failed to build archive at `path/to/output`: No such file or directory	2024-07-31 15:36:30 +02:00
bjorn3	216686bfa5	Move mingw dlltool invocation to cg_ssa	2024-07-30 10:33:33 +00:00
bjorn3	3c987cbe02	Move computation of decorated names out of the create_dll_import_lib method	2024-07-30 10:32:32 +00:00
bjorn3	bb764bd406	Move is_mingw_gnu_toolchain and i686_decorated_name to cg_ssa	2024-07-30 10:30:09 +00:00
bjorn3	ee89db9b17	Move temp file name generation out of the create_dll_import_lib method	2024-07-30 10:10:41 +00:00
bors	7e3a971870	Auto merge of #128378 - matthiaskrgr:rollup-i3qz9uo, r=matthiaskrgr Rollup of 4 pull requests Successful merges: - #127574 (elaborate unknowable goals) - #128141 (Set branch protection function attributes) - #128315 (Fix vita build of std and forbid unsafe in unsafe in the os/vita module) - #128339 ([rustdoc] Make the buttons remain when code example is clicked) r? `@ghost` `@rustbot` modify labels: rollup	2024-07-30 05:50:05 +00:00
bors	710ce90fbe	Auto merge of #128250 - Amanieu:select_unpredictable, r=nikic Add `select_unpredictable` to force LLVM to use CMOV Since https://reviews.llvm.org/D118118, LLVM will no longer turn CMOVs into branches if it comes from a `select` marked with an `unpredictable` metadata attribute. This PR introduces `core::intrinsics::select_unpredictable` which emits such a `select` and uses it in the implementation of `binary_search_by`.	2024-07-30 03:22:27 +00:00
Matthias Krüger	6b23cb5cdf	Rollup merge of #128141 - nikic:aarch64-bti, r=DianQK,cuviper Set branch protection function attributes Since LLVM 19, it is necessary to set not only module flags, but also function attributes for branch protection on aarch64. See `e15d67cfc2` for the relevant LLVM change. Fixes https://github.com/rust-lang/rust/issues/127829.	2024-07-30 04:31:54 +02:00
Mrmaxmeier	0b87af9d4f	Add `-Z embed-source=yes` to embed source code in DWARF debug info	2024-07-29 12:35:36 +02:00
bors	80d8270d84	Auto merge of #125016 - nicholasbishop:bishop-cb-112, r=tgross35 Update compiler_builtins to 0.1.114 The `weak-intrinsics` feature was removed from compiler_builtins in https://github.com/rust-lang/compiler-builtins/pull/598, so dropped the `compiler-builtins-weak-intrinsics` feature from alloc/std/sysroot. In https://github.com/rust-lang/compiler-builtins/pull/593, some builtins for f16/f128 were added. These don't work for all compiler backends, so add a `compiler-builtins-no-f16-f128` feature and disable it for cranelift and gcc.	2024-07-29 07:41:33 +00:00
Nicholas Nethercote	84ac80f192	Reformat `use` declarations. The previous commit updated `rustfmt.toml` appropriately. This commit is the outcome of running `x fmt --all` with the new formatting options.	2024-07-29 08:26:52 +10:00
Guillaume Gomez	19feb90d69	Rollup merge of #127860 - klensy:dedup, r=Mark-Simulacrum deps: dedup object, wasmparser, wasm-encoder * dedups one `object`, additional dupe will be removed, with next `thorin-dwp` update * `wasmparser` pinned to minor versions, so full merge isn't possible * same with `wasm-encoder` Turned off some features for `wasmparser` (see features https://github.com/bytecodealliance/wasm-tools/blob/v1.208.1/crates/wasmparser/Cargo.toml) in `run-make-support`, looks working?	2024-07-28 20:07:45 +02:00
Amanieu d'Antras	4f78f9fbb0	Force LLVM to use CMOV for binary search Since https://reviews.llvm.org/D118118, LLVM will no longer turn CMOVs into branches if it comes from a `select` marked with an `unpredictable` metadata attribute. This PR introduces `core::intrinsics::select_unpredictable` which emits such a `select` and uses it in the implementation of `binary_search_by`.	2024-07-28 17:24:57 +01:00
klensy	58c9999f25	dedup object waiting on thorin-dwp update dedup one wasmparser run-make-support: drop some features for wasmparser dedupe wasm-encoder	2024-07-28 17:21:07 +03:00
Matthew Maurer	38931cd227	LLVM: LLVM-20.0 removes MMX types See llvm/llvm-project#98505	2024-07-25 17:58:37 +00:00
bors	28e684b470	Auto merge of #127995 - workingjubilee:say-turings-prayer, r=BoxyUwU compiler: Never debug_assert in codegen In the name of Turing and his Hoarey heralds, assert our truths before creating a monster! The `rustc_codegen_llvm` and `rustc_codegen_ssa` crates are fairly critical for rustc's correctness. Small mistakes here can easily result in undefined behavior, since a "small mistake" can mean something like "link and execute the wrong code". We should probably run any and all asserts in these modules unconditionally on whether this is a "debug build", and damn the costs in performance. ...Especially because the costs in performance seem to be nothing. It is not clear how much correctness we gain here, but I'll take free correctness improvements.	2024-07-25 07:52:31 +00:00
Nikita Popov	ea7625f426	Set branch protection function attributes Since LLVM 19, it is necessary to set not only module flags, but also function attributes for branch protection on aarch64. See `e15d67cfc2` for the relevant LLVM change.	2024-07-24 17:13:25 +02:00
Trevor Gross	5e8e46cbd2	Rollup merge of #127506 - liushuyu:s390x-target-features, r=davidtwco rustc_target: add known safe s390x target features This pull request adds known safe target features for s390x (aka IBM Z systems). Currently, these features are unstable since stabilizing the target features requires submitting proposals. The `vector` feature was added in IBM Z13 (`arch11`), and this is a SIMD feature for the newer IBM Z systems. The `backchain` attribute is the IBM Z way of adding frame pointers like unwinding capabilities (the "frame-pointer" switch on IBM Z and IBM POWER platforms will add _emulated_ frame pointers to the binary, which profilers can't use for unwinding the stack). Both attributes can be applied at the LLVM module or function levels. However, the `backchain` attribute has to be enabled for all the functions in the call stack to get a successful unwind process.	2024-07-22 11:40:19 -05:00
Jubilee Young	ce7b069fd8	compiler: Never debug_assert in codegen The gains in performance are not worth the costs in correctness. This is partly because the gains are zero and the costs are unknown.	2024-07-20 00:16:44 -07:00
Trevor Gross	986d6bf9fb	Rollup merge of #121533 - ratmice:wasm_init_fini_array, r=nnethercote Handle .init_array link_section specially on wasm Given that wasm-ld now has support for [.init_array](`8f2bd8ae68/llvm/lib/MC/WasmObjectWriter.cpp (L1852)`), it appears we can easily implement that section by falling through to the normal path rather than taking the typical custom_section path for wasm. The wasm-ld appears to have a bunch of limitations. Only one static with the `link_section` in a crate or else you hit the fatal error in the link above "only one .init_array section fragment supported". They do not get merged. You can still call multiple constructors by setting it to an array. ``` unsafe extern "C" fn ctor() { println!("foo"); } #[used] #[link_section = ".init_array"] static FOO: [unsafe extern "C" fn(); 2] = [ctor, ctor]; ``` Another issue appears to be that if crate A depends on crate B, but A doesn't call any symbols from B and B doesn't `#[export_name = ...]` any symbols, then crate B's constructor will not be called. The workaround to this is to provide an exported symbol in crate B.	2024-07-19 03:27:46 -05:00
liushuyu	01e6e60bf3	rustc_codegen_llvm: properly passing backchain attribute to LLVM ... ... this is a special attribute that was made to be a target-feature in LLVM 18+, but in all previous versions, this "feature" is a naked attribute. We will have to handle this situation differently than all other target-features.	2024-07-17 07:56:00 +08:00
Trevor Gross	63f239c89f	Rollup merge of #124033 - bjorn3:ar_archive_writer_0_3_0, r=davidtwco Sync ar_archive_writer to LLVM 18.1.3 From LLVM 15.0.0-rc3. This adds support for COFF archives containing Arm64EC object files and has various fixes for AIX big archive files.	2024-07-16 16:15:13 -05:00
Trevor Gross	e0af3c61cd	When an archive fails to build, print the path Currently the output on failure is as follows: Compiling block-buffer v0.10.4 Compiling crypto-common v0.1.6 Compiling digest v0.10.7 Compiling sha2 v0.10.8 Compiling xz2 v0.1.7 error: failed to build archive: No such file or directory error: could not compile `bootstrap` (lib) due to 1 previous error Print which file is being constructed to give some hint about what is going on.	2024-07-16 15:44:54 -05:00
Michael Goulet	28503d69ac	Fix unsafe_op_in_unsafe_fn in compiler	2024-07-16 00:02:44 -04:00
Zalathar	d4f1f92426	coverage: Restrict `ExpressionUsed` simplification to `Code` mappings In the future, branch and MC/DC mappings might have expressions that don't correspond to any single point in the control-flow graph. That makes it trickier to keep track of which expressions should expect an `ExpressionUsed` node. We therefore sidestep that complexity by only performing `ExpressionUsed` simplification for expressions associated directly with ordinary `Code` mappings.	2024-07-15 20:54:28 +10:00
Jacob Pratt	cb1ccc1842	Rollup merge of #127654 - nikic:llvm-ndebug-fix, r=cuviper Fix incorrect NDEBUG handling in LLVM bindings We currently compile our LLVM bindings using `-DNDEBUG` if debuginfo for LLVM is disabled. However, `NDEBUG` doesn't have any relation to debuginfo, it controls whether assertions are enabled. Split the LLVM_NDEBUG environment variable into two, so that assertions and debuginfo are controlled independently. After this change, `LLVMRustDIBuilderInsertDeclareAtEnd` triggers an assertion failure on LLVM 19 due to an incorrect cast. Fix it by removing the unused return value entirely. r? `@cuviper`	2024-07-13 00:24:35 -04:00
Nikita Popov	8dfd3a455d	Remove LLVMRustDIBuilderInsertDeclareAtEnd return value The return value changed from an Instruction to a DbgRecord in LLVM 19. As we don't actually use the result, drop the return value entirely to support both.	2024-07-12 22:15:10 +02:00
bors	5e311f933d	Auto merge of #127614 - matthiaskrgr:rollup-8geziwi, r=matthiaskrgr Rollup of 8 pull requests Successful merges: - #124599 (Suggest borrowing on fn argument that is `impl AsRef`) - #127572 (Don't mark `DEBUG_EVENT` struct as `repr(packed)`) - #127588 (core: Limit remaining f16 doctests to x86_64 linux) - #127591 (Make sure that labels are defined after the primary span in diagnostics) - #127598 (Allows `#[diagnostic::do_not_recommend]` to supress trait impls in suggestions as well) - #127599 (Rename `lazy_cell_consume` to `lazy_cell_into_inner`) - #127601 (check is_ident before parse_ident) - #127605 (Remove extern "wasm" ABI) r? `@ghost` `@rustbot` modify labels: rollup	2024-07-11 22:56:52 +00:00
bors	bcf1f6db45	Auto merge of #127487 - tgross35:f16-f128-simd, r=Amanieu Add `f16` and `f128` as simd types in LLVM `@sayantn` is working on adding SIMD for `f16` and hitting the `FloatingPointVector` error. This should fix it and unblock adding support for `simd_fma` and `simd_fabs` in stdarch.	2024-07-11 14:08:58 +00:00
Nikita Popov	8a50bcbdce	Remove extern "wasm" ABI Remove the unstable `extern "wasm"` ABI (`wasm_abi` feature tracked in #83788). As discussed in https://github.com/rust-lang/rust/pull/127513#issuecomment-2220410679 and following, this ABI is a failed experiment that did not end up being used for anything. Keeping support for this ABI in LLVM 19 would require us to switch wasm targets to the `experimental-mv` ABI, which we do not want to do. It should be noted that `Abi::Wasm` was internally used for two things: The `-Z wasm-c-abi=legacy` ABI that is still used by default on some wasm targets, and the `extern "wasm"` ABI. Despite both being `Abi::Wasm` internally, they were not the same. An explicit `extern "wasm"` additionally enabled the `+multivalue` feature. I've opted to remove `Abi::Wasm` in this patch entirely, instead of keeping it as an ABI with only internal usage. Both `-Z wasm-c-abi` variants are now treated as part of the normal C ABI, just with different different treatment in adjust_for_foreign_abi.	2024-07-11 12:20:26 +02:00
bors	8672b2b763	Auto merge of #127001 - beetrees:f16-debuginfo, r=michaelwoerister Add Natvis visualiser and debuginfo tests for `f16` To render `f16`s in debuggers on MSVC targets, this PR changes the compiler to output `f16`s as `struct f16 { bits: u16 }`, and includes a Natvis visualiser that manually converts the `f16`'s bits to a `float` which is can then be displayed by debuggers. `gdb`, `lldb` and `cdb` tests are also included for `f16` . `f16`/`f128` MSVC debug info issue: #121837 Tracking issue: #116909	2024-07-09 09:07:42 +00:00
beetrees	b058de90a3	Add Natvis visualiser and debuginfo tests for `f16`	2024-07-09 03:47:50 +01:00
Trevor Gross	04caa5555a	Add `f16` and `f128` as SIMD types in LLVM	2024-07-08 13:59:39 -04:00
bjorn3	58e551433d	Sync ar_archive_writer to LLVM 18.1.3 From LLVM 15.0.0-rc3. This adds support for COFF archives containing Arm64EC object files and has various fixes for AIX big archive files.	2024-07-07 16:56:35 +00:00
bors	51917ba8f2	Auto merge of #126171 - RalfJung:simd_bitmask_multibyte, r=workingjubilee simd_bitmask intrinsic: add a non-power-of-2 multi-byte example r? `@calebzulawski` `@workingjubilee`	2024-07-05 01:58:22 +00:00
bors	489233170a	Auto merge of #123781 - RalfJung:miri-fn-identity, r=oli-obk Miri function identity hack: account for possible inlining Having a non-lifetime generic is not the only reason a function can be duplicated. Another possibility is that the function may be eligible for cross-crate inlining. So also take into account the inlining attribute in this Miri hack for function pointer identity. That said, `cross_crate_inlinable` will still sometimes return true even for `inline(never)` functions: - when they are `DefKind::Ctor(..) \| DefKind::Closure` -- I assume those cannot be `InlineAttr::Never` anyway? - when `cross_crate_inline_threshold == InliningThreshold::Always` so maybe this is still not quite the right criterion to use for function pointer identity.	2024-07-04 23:45:56 +00:00
Matthias Krüger	c74d620635	Rollup merge of #126803 - tgross35:verbose-asm, r=Amanieu Change `asm-comments` to `verbose-asm`, always emit user comments Implements what is described in https://github.com/rust-lang/compiler-team/issues/762 Tracking issue: https://github.com/rust-lang/rust/issues/126802	2024-07-03 17:26:53 +02:00
bors	c872a1418a	Auto merge of #125507 - compiler-errors:type-length-limit, r=lcnr Re-implement a type-size based limit r? lcnr This PR reintroduces the type length limit added in #37789, which was accidentally made practically useless by the caching changes to `Ty::walk` in #72412, which caused the `walk` function to no longer walk over identical elements. Hitting this length limit is not fatal unless we are in codegen -- so it shouldn't affect passes like the mir inliner which creates potentially very large types (which we observed, for example, when the new trait solver compiles `itertools` in `--release` mode). This also increases the type length limit from `1048576 == 2 20` to `2 24`, which covers all of the code that can be reached with craterbot-check. Individual crates can increase the length limit further if desired. Perf regression is mild and I think we should accept it -- reinstating this limit is important for the new trait solver and to make sure we don't accidentally hit more type-size related regressions in the future. Fixes #125460	2024-07-03 11:56:36 +00:00
Trevor Gross	c15a698f56	Rename the `asm-comments` compiler flag to `verbose-asm` Since this codegen flag now only controls LLVM-generated comments rather than all assembly comments, make the name more accurate (and also match Clang).	2024-07-02 21:42:01 -04:00
Michael Goulet	3273ccea4b	Fix spans	2024-07-02 15:48:48 -04:00
Michael Goulet	9dc129ae82	Give Instance::expect_resolve a span	2024-07-02 15:48:48 -04:00
Ralf Jung	41b98da42d	Miri function identity hack: account for possible inlining	2024-07-02 21:05:30 +02:00
Matthias Krüger	33b0238586	Rollup merge of #127168 - DianQK:cast-size, r=workingjubilee Use the aligned size for alloca at args/ret when the pass mode is cast Fixes #75839. Fixes #121028. The `load` and `store` instructions in LLVM access the aligned size. For example, `load { i64, i32 }` accesses 16 bytes on x86_64: https://alive2.llvm.org/ce/z/n8CHAp. BTW, this example is expected to be optimized to immediate UB by Alive2: https://rust.godbolt.org/z/b7xK7hv1c and https://alive2.llvm.org/ce/z/vZDtZH. r? compiler	2024-07-02 17:47:48 +02:00
DianQK	c453dcd62a	Use the aligned size for alloca at args when the pass mode is cast. The `load` and `store` instructions in LLVM access the aligned size.	2024-07-02 06:33:35 +08:00
Ralf Jung	e9dd39cda4	fix simd_bitmask return type for non-power-of-two inputs, and add tests	2024-07-01 17:25:14 +02:00
Matthias Krüger	5b90824433	Rollup merge of #123237 - bjorn3:debuginfo_refactor, r=compiler-errors Various rustc_codegen_ssa cleanups	2024-06-29 22:10:55 +02:00
Matthew Maurer	9b0ae75ecc	Support `#[patchable_function_entries]` See [RFC](https://github.com/maurer/rust-rfcs/blob/patchable-function-entry/text/0000-patchable-function-entry.md) (yet to be numbered) TODO before submission: * Needs an RFC * Improve error reporting for malformed attributes	2024-06-25 18:23:41 +02:00
Matthew Maurer	ac7595fdb1	Support for -Z patchable-function-entry `-Z patchable-function-entry` works like `-fpatchable-function-entry` on clang/gcc. The arguments are total nop count and function offset. See MCP rust-lang/compiler-team#704	2024-06-25 18:21:42 +02:00
Michael Goulet	faa28be2f1	Rollup merge of #124712 - Enselic:deprecate-inline-threshold, r=pnkfelix Deprecate no-op codegen option `-Cinline-threshold=...` This deprecates `-Cinline-threshold` since using it has no effect. This has been the case since the new LLVM pass manager started being used, more than 2 years ago. Recommend using `-Cllvm-args=--inline-threshold=...` instead. Closes #89742 which is E-help-wanted.	2024-06-24 15:51:00 -04:00
Jubilee Young	b3a1975cdc	compiler(nfc): -Cforce-frame-pointers is a FramePointer	2024-06-23 00:36:33 -07:00
Guillaume Gomez	07e8b3ac01	Rollup merge of #126555 - beetrees:f16-inline-asm-arm, r=Amanieu Add `f16` inline ASM support for 32-bit ARM Adds `f16` inline ASM support for 32-bit ARM. SIMD vector types are taken from [here](https://developer.arm.com/architectures/instruction-sets/intrinsics/#f:`@navigationhierarchiesreturnbasetype=[float]&f:@navigationhierarchieselementbitsize=[16]&f:@navigationhierarchiesarchitectures=[A32]).` Relevant issue: #125398 Tracking issue: #116909 `@rustbot` label +F-f16_and_f128	2024-06-22 12:57:18 +02:00
Jubilee	e7956cd994	Rollup merge of #126530 - beetrees:f16-inline-asm-riscv, r=Amanieu Add `f16` inline ASM support for RISC-V This PR adds `f16` inline ASM support for RISC-V. A `FIXME` is left for `f128` support as LLVM does not support the required `Q` (Quad-Precision Floating-Point) extension yet. Relevant issue: #125398 Tracking issue: #116909 `@rustbot` label +F-f16_and_f128	2024-06-21 21:02:26 -07:00
bjorn3	887f57ff0b	Remove type_i1 and type_struct from cg_ssa They are not representable by Cranelift	2024-06-21 19:30:26 +00:00
bjorn3	aacdce38f7	Remove check_overflow method from MiscMethods It can be retrieved from the Session too.	2024-06-21 19:30:26 +00:00
bjorn3	98e8601ac3	Remove const_bitcast from ConstMethods	2024-06-21 19:26:07 +00:00
bjorn3	7f445329ec	Remove PrintBackendInfo trait It is only implemented for a single type. Directly passing this type is simpler and avoids overhead from indirect calls.	2024-06-21 19:26:06 +00:00
bjorn3	e9ea578147	Move vcall_visibility_metadata optimization hint out of a debuginfo generation method	2024-06-21 19:26:06 +00:00
beetrees	771e44ebd3	Add `f16` inline ASM support for RISC-V	2024-06-21 18:48:20 +01:00
beetrees	753fb070bb	Add `f16` inline ASM support for 32-bit ARM	2024-06-21 18:26:42 +01:00
Oli Scherer	3f34196839	Remove redundant argument from `subdiagnostic` method	2024-06-18 15:42:11 +00:00
Oli Scherer	7ba82d61eb	Use a dedicated type instead of a reference for the diagnostic context This paves the way for tracking more state (e.g. error tainting) in the diagnostic context handle	2024-06-18 15:42:11 +00:00
许杰友 Jieyou Xu (Joe)	1af0e6e0c3	Rollup merge of #126365 - Dirbaio:collapse-debuginfo-statics, r=workingjubilee Honor collapse_debuginfo for statics. fixes #126363 <!-- If this PR is related to an unstable feature or an otherwise tracked effort, please link to the relevant tracking issue here. If you don't know of a related tracking issue or there are none, feel free to ignore this. This PR will get automatically assigned to a reviewer. In case you would like a specific user to review your work, you can assign it to them by using r? <reviewer name> -->	2024-06-16 21:14:41 +01:00
Matthias Krüger	21de99258f	Rollup merge of #126424 - Enselic:sort-target-features, r=lqd Also sort `crt-static` in `--print target-features` output I didn't find `crt-static` at first (for `x86_64-unknown-linux-gnu`), because it was put at the bottom of the large and otherwise sorted list. Fully sort the list before we print it. Note that `llvm_target_features` starts out and remains sorted and does not need to be sorted an extra time. On my machine the diff is just: ```diff $ diff -u /tmp/before2.txt /tmp/after2.txt --- /tmp/before2.txt 2024-06-13 20:40:27.091636592 +0200 +++ /tmp/after2.txt 2024-06-13 20:39:54.584894891 +0200 ``@@`` -20,6 +20,7 ``@@`` bmi1 - Support BMI instructions. bmi2 - Support BMI2 instructions. cmpxchg16b - 64-bit with cmpxchg16b (this is true for most x86-64 chips, but not the first AMD chips). + crt-static - Enables C Run-time Libraries to be statically linked. ermsb - REP MOVS/STOS are fast. f16c - Support 16-bit floating point conversion instructions. fma - Enable three-operand fused multiple-add. ``@@`` -49,7 +50,6 ``@@`` xsavec - Support xsavec instructions. xsaveopt - Support xsaveopt instructions. xsaves - Support xsaves instructions. - crt-static - Enables C Run-time Libraries to be statically linked. Code-generation features supported by LLVM for this target: 16bit-mode - 16-bit mode (i8086). ``` I couldn't find a ui test that tested this output. Let's see if CI finds a regression tests.	2024-06-15 14:40:49 +02:00
Martin Nordholts	f5f067bf9d	Deprecate no-op codegen option `-Cinline-threshold=...` This deprecates `-Cinline-threshold` since using it has no effect. This has been the case since the new LLVM pass manager started being used, more than 2 years ago.	2024-06-14 20:25:17 +02:00
Martin Nordholts	04af37170c	Also sort `crt-static` in `--print target-features` output I didn't find `crt-static` at first (for `x86_64-unknown-linux-gnu`), because it was put at the bottom the large and otherwise sorted list. Fully sort the list before we print it. Note that `llvm_target_features` starts out sorted and does not need to be sorted an extra time.	2024-06-14 19:29:23 +02:00
beetrees	dfc5514527	Add `f16` and `f128` inline ASM support for `x86` and `x86-64`	2024-06-13 16:12:23 +01:00
Dario Nieuwenhuis	9c25d40784	Honor collapse_debuginfo for statics. fixes #126363	2024-06-13 02:44:14 +02:00
Michael Goulet	754b26d882	Rollup merge of #126324 - zmodem:loongarch, r=nikic Adjust LoongArch64 data layouts for LLVM update The data layout was changed in LLVM 19: llvm/llvm-project#93814	2024-06-12 14:26:28 -04:00
Hans Wennborg	4a06a5bc7a	Adjust LoongArch64 data layouts for LLVM update The data layout was changed in LLVM 19: llvm/llvm-project#93814	2024-06-12 12:39:09 +02:00
Nicholas Nethercote	75b164d836	Use `tidy` to sort crate attributes for all compiler crates. We already do this for a number of crates, e.g. `rustc_middle`, `rustc_span`, `rustc_metadata`, `rustc_span`, `rustc_errors`. For the ones we don't, in many cases the attributes are a mess. - There is no consistency about order of attribute kinds (e.g. `allow`/`deny`/`feature`). - Within attribute kind groups (e.g. the `feature` attributes), sometimes the order is alphabetical, and sometimes there is no particular order. - Sometimes the attributes of a particular kind aren't even grouped all together, e.g. there might be a `feature`, then an `allow`, then another `feature`. This commit extends the existing sorting to all compiler crates, increasing consistency. If any new attribute line is added there is now only one place it can go -- no need for arbitrary decisions. Exceptions: - `rustc_log`, `rustc_next_trait_solver` and `rustc_type_ir_macros`, because they have no crate attributes. - `rustc_codegen_gcc`, because it's quasi-external to rustc (e.g. it's ignored in `rustfmt.toml`).	2024-06-12 15:49:10 +10:00
Matthias Krüger	2d7f7ffba5	Rollup merge of #126159 - RalfJung:scalarint-size-mismatch, r=oli-obk ScalarInt: size mismatches are a bug, do not delay the panic Cc [Zulip](https://rust-lang.zulipchat.com/#narrow/stream/146212-t-compiler.2Fconst-eval/topic/Why.20are.20ScalarInt.20to.20iN.2FuN.20methods.20fallible.3F) r? ``@oli-obk``	2024-06-10 21:12:25 +02:00
Ralf Jung	3c57ea0df7	ScalarInt: size mismatches are a bug, do not delay the panic	2024-06-10 13:43:16 +02:00
Ralf Jung	2f2031d2b2	simd packed types: update outdated check, extend codegen test	2024-06-08 21:38:32 +02:00
bors	67caf52fbc	Auto merge of #125406 - tbu-:pr_rm_path_with_extension, r=Nadrieril Directly add extension instead of using `Path::with_extension` `Path::with_extension` has a nice footgun when the original path doesn't contain an extension: Anything after the last dot gets removed.	2024-06-06 10:24:24 +00:00
Boxy	a9702a6668	Add `Ty` to `ConstKind::Value`	2024-06-05 22:25:41 +01:00
Tobias Bucher	f7c51a2d2f	Directly add extension instead of using `Path::with_extension` `Path::with_extension` has a nice footgun when the original path doesn't contain an extension: Anything after the last dot gets removed.	2024-06-04 22:12:31 +02:00
Jubilee	ca9dd62c05	Rollup merge of #125311 - calebzulawski:repr-packed-simd-intrinsics, r=workingjubilee Make repr(packed) vectors work with SIMD intrinsics In #117116 I fixed `#[repr(packed, simd)]` by doing the expected thing and removing padding from the layout. This should be the last step in providing a solution to rust-lang/portable-simd#319	2024-06-02 05:06:47 -07:00
Caleb Zulawski	9bdc5b2455	Improve documentation	2024-06-01 14:17:16 -04:00
Michael Goulet	333458c2cb	Uplift TypeRelation and Relate	2024-06-01 12:50:58 -04:00
Nicholas Bishop	99e6a28804	Add f16/f128 handling in a couple places	2024-05-30 18:33:50 -04:00
Zalathar	c671eaaaff	coverage: Rename MC/DC `conditions_num` to `num_conditions` This value represents a quantity of conditions, not an ID, so the new spelling is more appropriate.	2024-05-30 13:16:07 +10:00
Matthias Krüger	d0311c1303	Rollup merge of #124655 - Darksonn:fixed-x18, r=lqd,estebank Add `-Zfixed-x18` This PR is a follow-up to #124323 that proposes a different implementation. Please read the description of that PR for motivation. See the equivalent flag in [the clang docs](https://clang.llvm.org/docs/ClangCommandLineReference.html#cmdoption-clang-ffixed-x18). MCP: https://github.com/rust-lang/compiler-team/issues/748 Fixes https://github.com/rust-lang/rust/issues/121970 r? rust-lang/compiler	2024-05-29 20:12:32 +02:00
bors	7601adcc76	Auto merge of #125463 - GuillaumeGomez:rollup-287wx4y, r=GuillaumeGomez Rollup of 6 pull requests Successful merges: - #125263 (rust-lld: fallback to rustc's sysroot if there's no path to the linker in the target sysroot) - #125345 (rustc_codegen_llvm: add support for writing summary bitcode) - #125362 (Actually use TAIT instead of emulating it) - #125412 (Don't suggest adding the unexpected cfgs to the build-script it-self) - #125445 (Migrate `run-make/rustdoc-with-short-out-dir-option` to `rmake.rs`) - #125452 (Cleanup check-cfg handling in core and std) r? `@ghost` `@rustbot` modify labels: rollup	2024-05-24 03:04:06 +00:00
Guillaume Gomez	4ee97fc3db	Rollup merge of #125345 - durin42:thin-link-bitcode, r=bjorn3 rustc_codegen_llvm: add support for writing summary bitcode Typical uses of ThinLTO don't have any use for this as a standalone file, but distributed ThinLTO uses this to make the linker phase more efficient. With clang you'd do something like `clang -flto=thin -fthin-link-bitcode=foo.indexing.o -c foo.c` and then get both foo.o (full of bitcode) and foo.indexing.o (just the summary or index part of the bitcode). That's then usable by a two-stage linking process that's more friendly to distributed build systems like bazel, which is why I'm working on this area. I talked some to `@teresajohnson` about naming in this area, as things seem to be a little confused between various blog posts and build systems. "bitcode index" and "bitcode summary" tend to be a little too ambiguous, and she tends to use "thin link bitcode" and "minimized bitcode" (which matches the descriptions in LLVM). Since the clang option is thin-link-bitcode, I went with that to try and not add a new spelling in the world. Per `@dtolnay,` you can work around the lack of this by using `lld --thinlto-index-only` to do the indexing on regular .o files of bitcode, but that is a bit wasteful on actions when we already have all the information in rustc and could just write out the matching minimized bitcode. I didn't test that at all in our infrastructure, because by the time I learned that I already had this patch largely written.	2024-05-23 23:39:26 +02:00
Augie Fackler	a0581b5b7f	cleanup: run rustfmt	2024-05-23 15:10:04 -04:00
Augie Fackler	3ea494190f	cleanup: standardize on summary over index in names I did this in the user-facing logic, but I noticed while fixing a minor defect that I had missed it in a few places in the internal details.	2024-05-23 15:07:43 -04:00
Augie Fackler	de8200c5a4	thinlto: only build summary file if needed If we don't do this, some versions of LLVM (at least 17, experimentally) will double-emit some error messages, which is how I noticed this. Given that it seems to be costing some extra work, let's only request the summary bitcode production if we'll actually bother writing it down, otherwise skip it.	2024-05-23 14:58:30 -04:00
Nicholas Nethercote	8e94226e61	Remove `#[macro_use] extern crate tracing` from `rustc_codegen_llvm`.	2024-05-23 18:02:40 +10:00
Augie Fackler	03d5556ced	cleanup: remove leftover extra block This was needed in an older version of this patch, but never got edited out when it became obsolete.	2024-05-22 14:04:22 -04:00
Augie Fackler	aa91871539	rustc_codegen_llvm: add support for writing summary bitcode Typical uses of ThinLTO don't have any use for this as a standalone file, but distributed ThinLTO uses this to make the linker phase more efficient. With clang you'd do something like `clang -flto=thin -fthin-link-bitcode=foo.indexing.o -c foo.c` and then get both foo.o (full of bitcode) and foo.indexing.o (just the summary or index part of the bitcode). That's then usable by a two-stage linking process that's more friendly to distributed build systems like bazel, which is why I'm working on this area. I talked some to @teresajohnson about naming in this area, as things seem to be a little confused between various blog posts and build systems. "bitcode index" and "bitcode summary" tend to be a little too ambiguous, and she tends to use "thin link bitcode" and "minimized bitcode" (which matches the descriptions in LLVM). Since the clang option is thin-link-bitcode, I went with that to try and not add a new spelling in the world. Per @dtolnay, you can work around the lack of this by using `lld --thinlto-index-only` to do the indexing on regular .o files of bitcode, but that is a bit wasteful on actions when we already have all the information in rustc and could just write out the matching minimized bitcode. I didn't test that at all in our infrastructure, because by the time I learned that I already had this patch largely written.	2024-05-22 14:04:22 -04:00
Scott McMurray	8ee3d29cd9	Stop using `to_hir_binop` in codegen	2024-05-22 01:34:26 -07:00
Matthias Krüger	fd975f75fa	Rollup merge of #125266 - workingjubilee:stream-plastic-love, r=RalfJung,nikic compiler: add simd_ctpop intrinsic Fairly straightforward addition. cc `@rust-lang/opsem` new (extremely boring) intrinsic	2024-05-21 12:47:06 +02:00
Tobias Bucher	fa1b7f2d78	Remove some `Path::to_str` from `rustc_codegen_llvm` Unnecessary panic paths when there's a better option.	2024-05-20 23:17:11 +02:00
Caleb Zulawski	86158f581d	Make repr(packed) vectors work with SIMD intrinsics	2024-05-20 01:09:29 -04:00
Jubilee Young	213351ae9e	clarify the second arg to llvm.ctlz and cttz	2024-05-19 19:12:38 -07:00
Jubilee Young	1914c722b5	compiler: add simd_ctpop intrinsic	2024-05-18 18:11:20 -07:00
Santiago Pastorino	6b46a919e1	Rename Unsafe to Safety	2024-05-17 18:33:37 -03:00
Alice Ryhl	b780fa9219	Use an error struct instead of a panic	2024-05-15 11:14:45 +02:00
Alice Ryhl	518becf5ea	Fail on non-aarch64 targets	2024-05-14 21:09:42 +02:00
Zalathar	bfadc3a9b9	coverage: `CoverageIdsInfo::mcdc_bitmap_bytes` is never needed This code for recalculating `mcdc_bitmap_bytes` doesn't provide any benefit, because its result won't have changed from the value in `FunctionCoverageInfo` that was computed during the MIR instrumentation pass.	2024-05-14 16:41:04 +10:00
bors	6a19a87097	Auto merge of #124972 - matthiaskrgr:rollup-3fablim, r=matthiaskrgr Rollup of 5 pull requests Successful merges: - #124615 (coverage: Further simplify extraction of mapping info from MIR) - #124778 (Fix parse error message for meta items) - #124797 (Refactor float `Primitive`s to a separate `Float` type) - #124888 (Migrate `run-make/rustdoc-output-path` to rmake) - #124957 (Make `Ty::builtin_deref` just return a `Ty`) r? `@ghost` `@rustbot` modify labels: rollup	2024-05-10 16:04:26 +00:00
Matthias Krüger	9a9ec90567	Rollup merge of #124957 - compiler-errors:builtin-deref, r=michaelwoerister Make `Ty::builtin_deref` just return a `Ty` Nowhere in the compiler are we using the mutability part of the `TyAndMut` that we used to return.	2024-05-10 16:10:47 +02:00
Matthias Krüger	1ae0d90b72	Rollup merge of #124797 - beetrees:primitive-float, r=davidtwco Refactor float `Primitive`s to a separate `Float` type Now there are 4 of them, it makes sense to refactor `F16`, `F32`, `F64` and `F128` out of `Primitive` and into a separate `Float` type (like integers already are). This allows patterns like `F16 \| F32 \| F64 \| F128` to be simplified into `Float(_)`, and is consistent with `ty::FloatTy`. As a side effect, this PR also makes the `Ty::primitive_size` method work with `f16` and `f128`. Tracking issue: #116909 `@rustbot` label +F-f16_and_f128	2024-05-10 16:10:46 +02:00
bors	66f877007d	Auto merge of #124932 - RalfJung:temporal, r=compiler-errors codegen: memmove/memset cannot be non-temporal non-temporal memset is not a thing. And for memmove, since the LLVM backend doesn't support this, surely we don't need it in the GCC backend.	2024-05-10 13:55:59 +00:00
Michael Goulet	d50c2b0a52	Make builtin_deref just return a Ty	2024-05-09 22:55:00 -04:00
Michael Goulet	1c19b6ad60	Rename Generics::params to Generics::own_params	2024-05-09 20:58:46 -04:00
Ralf Jung	95582e6fcb	codegen: memmove/memset cannot be non-temporal	2024-05-09 18:59:00 +02:00
Matthew Maurer	4d397d33da	Adjust 64-bit ARM data layouts for LLVM update LLVM has updated data layouts to specify `Fn32` on 64-bit ARM to avoid C++ accidentally underaligning functions when trying to comply with member function ABIs. This should only affect Rust in cases where we had a similar bug (I don't believe we have one), but our data layout must match to generate code. As a compatibility adaptatation, if LLVM is not version 19 yet, `Fn32` gets voided from the data layout. See llvm/llvm-project#90415	2024-05-06 16:53:17 +00:00
beetrees	3769fddba2	Refactor float `Primitive`s to a separate `Float` type	2024-05-06 14:56:10 +01:00
bors	befabbc9e5	Auto merge of #124675 - matthiaskrgr:rollup-x6n79ua, r=matthiaskrgr Rollup of 7 pull requests Successful merges: - #122492 (Implement ptr_as_ref_unchecked) - #123815 (Fix cannot usage in time.rs) - #124059 (default_alloc_error_hook: explain difference to default __rdl_oom in alloc) - #124510 (Add raw identifier in a typo suggestion) - #124555 (coverage: Clean up creation of MC/DC condition bitmaps) - #124593 (Describe and use CStr literals in CStr and CString docs) - #124630 (CI: remove `env-x86_64-apple-tests` YAML anchor) r? `@ghost` `@rustbot` modify labels: rollup	2024-05-03 19:46:04 +00:00
Matthias Krüger	613bccc4ca	Rollup merge of #124555 - Zalathar:init-coverage, r=nnethercote coverage: Clean up creation of MC/DC condition bitmaps This PR improves the code for creating and initializing [MC/DC](https://en.wikipedia.org/wiki/Modified_condition/decision_coverage) condition bitmap variables, as introduced by #123409 and modified by #124255. - The condition bitmap variables are now created eagerly at the start of per-function codegen, via a new `init_coverage` method in `CoverageInfoBuilderMethods`. This avoids having to retroactively create the bitmaps while doing codegen for an individual coverage statement. - As a result, we can now create and initialize those bitmaps using existing safe APIs, instead of having to perform our own unsafe call to `llvm::LLVMBuildAlloca`. - This PR also tweaks the way we count the number of condition bitmaps needed, by tracking the total number of bitmaps needed (max depth + 1), instead of only tracking the maximum depth. This reduces the potential for subtle off-by-one confusion.	2024-05-03 20:33:46 +02:00
bors	0d7b2fb797	Auto merge of #123441 - saethlin:fixed-len-file-names, r=oli-obk Stabilize the size of incr comp object file names The current implementation does not produce stable-length paths, and we create the paths in a way that makes our allocation behavior is nondeterministic. I think `@eddyb` fixed a number of other cases like this in the past, and this PR fixes another one. Whether that actually matters I have no idea, but we still have bimodal behavior in rustc-perf and the non-uniformity in `find` and `ls` was bothering me. I've also removed the truncation of the mangled CGU names. Before this PR incr comp paths look like this: ``` target/debug/incremental/scratch-38izrrq90cex7/s-gux6gz0ow8-1ph76gg-ewe1xj434l26w9up5bedsojpd/261xgo1oqnd90ry5.o ``` And after, they look like this: ``` target/debug/incremental/scratch-035omutqbfkbw/s-gux6borni0-16r3v1j-6n64tmwqzchtgqzwwim5amuga/55v2re42sztc8je9bva6g8ft3.o ``` On the one hand, I'm sure this will break some people's builds because they're on Windows and only a few bytes from the path length limit. But if we're that seriously worried about the length of our file names, I have some other ideas on how to make them smaller. And last time I deleted some hash truncations from the compiler, there was a huge drop in the number if incremental compilation ICEs that were reported: https://github.com/rust-lang/rust/pull/110367https://github.com/rust-lang/rust/pull/110367 --- Upon further reading, this PR actually fixes a bug. This comment says the CGU names are supposed to be a fixed-length hash, and before this PR they aren't: `ca7d34efa9/compiler/rustc_monomorphize/src/partitioning.rs (L445-L448)`	2024-05-03 17:41:48 +00:00
Alice Ryhl	40f0172c6a	Add -Zfixed-x18 Signed-off-by: Alice Ryhl <aliceryhl@google.com>	2024-05-03 14:32:08 +02:00
Waffle Lapkin	698d7a031e	Inline & delete `Ty::new_unit`, since it's just a field access	2024-05-02 17:49:23 +02:00
Zalathar	de972b7321	coverage: Replace `max_decision_depth` with `num_condition_bitmaps` This clearly distinguishes individual decision-depth indices from the total number of condition bitmaps to allocate.	2024-05-01 09:55:22 +10:00
Zalathar	0b3a47900e	coverage: Set up MC/DC bitmaps without additional unsafe code Because this now always takes place at the start of the function, we can just use the normal `alloca` method and then initialize each bitmap immediately. This patch also moves bitmap setup out of the `mcdc_parameters` method, because there is no longer any particular reason for it to be there.	2024-05-01 09:55:22 +10:00
Zalathar	52d608b560	coverage: Eagerly do start-of-function codegen for coverage	2024-05-01 09:06:53 +10:00
Matthias Krüger	784316eadc	Rollup merge of #124511 - nnethercote:rm-extern-crates, r=fee1-dead Remove many `#[macro_use] extern crate foo` items This requires the addition of more `use` items, which often make the code more verbose. But they also make the code easier to read, because `#[macro_use]` obscures where macros are defined. r? `@fee1-dead`	2024-04-30 15:04:08 +02:00
bors	7a58674259	Auto merge of #124255 - RenjiSann:renji/mcdc-nested-expressions, r=Zalathar MCDC coverage: support nested decision coverage #123409 provided the initial MCDC coverage implementation. As referenced in #124144, it does not currently support "nested" decisions, like the following example : ```rust fn nested_if_in_condition(a: bool, b: bool, c: bool) { if a && if b \|\| c { true } else { false } { say("yes"); } else { say("no"); } } ``` Note that there is an if-expression (`if b \|\| c ...`) embedded inside a boolean expression in the decision of an outer if-expression. This PR proposes a workaround for this cases, by introducing a Decision context stack, and by handing several `temporary condition bitmaps` instead of just one. When instrumenting boolean expressions, if the current node is a leaf condition (i.e. not a `\|\|`/`&&` logical operator nor a `!` not operator), we insert a new decision context, such that if there are more boolean expressions inside the condition, they are handled as separate expressions. On the codegen LLVM side, we allocate as many `temp_cond_bitmap`s as necessary to handle the maximum encountered decision depth.	2024-04-29 11:54:49 +00:00
Dorian Péron	60ca9b6e29	mcdc-coverage: Get decision_depth from THIR lowering Use decision context stack to handle nested decisions: - Introduce MCDCDecisionCtx - Use a stack of MCDCDecisionCtx to handle nested decisions	2024-04-29 09:13:40 +00:00
Dorian Péron	ae8c023983	mcdc-coverage: Add decision_depth field in structs Add decision_depth field to TVBitmapUpdate/CondBitmapUpdate statements Add decision_depth field to BcbMappingKinds MCDCBranch and MCDCDecision Add decision_depth field to MCDCBranchSpan and MCDCDecisionSpan	2024-04-29 09:13:40 +00:00
Dorian Péron	3c2f48ede9	mcdc-coverage: Add possibility for codegen llvm to handle several condition bitmaps	2024-04-29 08:41:15 +00:00
Nicholas Nethercote	4814fd0a4b	Remove `extern crate rustc_macros` from numerous crates.	2024-04-29 10:21:54 +10:00
bors	284f94f9c0	Auto merge of #121298 - nikic:writable, r=cuviper Set writable and dead_on_unwind attributes for sret arguments Set the `writable` and `dead_on_unwind` attributes for `sret` arguments. This allows call slot optimization to remove more memcpy's. See https://llvm.org/docs/LangRef.html#parameter-attributes for the specification of these attributes. In short, the statement we're making here is that: * The return slot is writable. * The return slot will not be read if the function unwinds. Fixes https://github.com/rust-lang/rust/issues/90595.	2024-04-25 04:31:56 +00:00
Nikita Popov	3695af697e	Set writable and dead_on_unwind attributes for sret arguments	2024-04-25 11:43:47 +09:00
bors	29a56a3b1c	Auto merge of #122053 - erikdesjardins:alloca, r=nikic Stop using LLVM struct types for alloca The alloca type has no semantic meaning, only the size (and alignment, but we specify it explicitly) matter. Using `[N x i8]` is a more direct way to specify that we want `N` bytes, and avoids relying on LLVM's struct layout. It is likely that a future LLVM version will change to an untyped alloca representation. Split out from #121577. r? `@ghost`	2024-04-24 03:00:44 +00:00
Matthias Krüger	918304b190	Rollup merge of #124003 - WaffleLapkin:dellvmization, r=scottmcm,RalfJung,antoyo Dellvmize some intrinsics (use `u32` instead of `Self` in some integer intrinsics) This implements https://github.com/rust-lang/compiler-team/issues/693 minus what was implemented in #123226. Note: I decided to _not_ change `shl`/... builder methods, as it just doesn't seem worth it. r? ``@scottmcm``	2024-04-23 20:17:51 +02:00
Guillaume Gomez	1a12ec41e9	Rollup merge of #124178 - GuillaumeGomez:llvm-backend, r=oli-obk [cleanup] [llvm backend] Prevent creating the same `Instance::mono` multiple times Just a little thing I came across while going through the code. r? ```@oli-obk```	2024-04-22 20:25:58 +02:00
Ben Kimock	6ee3713b08	Stabilize the size of incr comp object file names	2024-04-22 10:50:07 -04:00
Guillaume Gomez	d34be935ec	Prevent creating the same `Instance::mono` multiple times	2024-04-19 23:13:58 +02:00
zhuyunxing	439dbfa1ec	coverage. Lowering MC/DC statements to llvm-ir	2024-04-20 00:34:40 +08:00
zhuyunxing	cf6b6cb2b4	coverage. Generate Mappings of decisions and conditions for MC/DC	2024-04-19 17:09:26 +08:00
bors	c5de414865	Auto merge of #123144 - dpaoliello:arm64eclib, r=GuillaumeGomez,ChrisDenton,wesleywiser Add support for Arm64EC to the Standard Library Adds the final pieces so that the standard library can be built for arm64ec-pc-windows-msvc (initially added in #119199) * Bumps `windows-sys` to 0.56.0, which adds support for Arm64EC. * Correctly set the `isEC` parameter for LLVM's `writeArchive` function. * Add `#![feature(asm_experimental_arch)]` to library crates where Arm64EC inline assembly is used, as it is currently unstable.	2024-04-18 12:22:52 +00:00
Matthias Krüger	90013ff5ad	Rollup merge of #124090 - durin42:llvm-19-riscv-feature, r=cuviper llvm: update riscv target feature to match LLVM 19 In llvm/llvm-project@9067070d91 they ended up largely reverting llvm/llvm-project@e817966718. This means the change we did in rust-lang/rust@b378059e6b is now only corrct for LLVM 18...so we have to adjust again. ``@rustbot`` label: +llvm-main	2024-04-18 08:37:49 +02:00
Augie Fackler	22b704bac4	llvm: update riscv target feature to match LLVM 19 In llvm/llvm-project@9067070d91 they ended up largely reverting llvm/llvm-project@e817966718. This means the change we did in rust-lang/rust@b378059e6b is now only corrct for LLVM 18...so we have to adjust again. @rustbot label: +llvm-main	2024-04-17 16:15:24 -04:00
Mark Rousskov	649e80184b	Codegen ZSTs without an allocation This makes sure that &[] is just as efficient as indirecting through unsafe code (from_raw_parts). No new stable guarantee is intended about whether or not we do this, this is just an optimization. Co-authored-by: Ralf Jung <post@ralfj.de>	2024-04-16 21:13:21 -04:00
Maybe Waffle	ceead1bda6	Change intrinsic types to use `u32` instead of `T` to match stable reexports	2024-04-16 11:53:04 +00:00
Daniel Paoliello	32f5ca4be7	Add support for Arm64EC to the Standard Library	2024-04-15 16:05:16 -07:00
bors	5dcb678ad8	Auto merge of #122917 - saethlin:atomicptr-to-int, r=nikic Add the missing inttoptr when we ptrtoint in ptr atomics Ralf noticed this here: https://github.com/rust-lang/rust/pull/122220#discussion_r1535172094 Our previous codegen forgot to add the cast back to integer type. The code compiles anyway, because of course all locals are in-memory to start with, so previous codegen would do the integer atomic, store the integer to a local, then load a pointer from that local. Which is definitely _not_ what we wanted: That's an integer-to-pointer transmute, so all pointers returned by these `AtomicPtr` methods didn't have provenance. Yikes. Here's the IR for `AtomicPtr::fetch_byte_add` on 1.76: https://godbolt.org/z/8qTEjeraY ```llvm define noundef ptr `@atomicptr_fetch_byte_add(ptr` noundef nonnull align 8 %a, i64 noundef %v) unnamed_addr #0 !dbg !7 { start: %0 = alloca ptr, align 8, !dbg !12 %val = inttoptr i64 %v to ptr, !dbg !12 call void `@llvm.lifetime.start.p0(i64` 8, ptr %0), !dbg !28 %1 = ptrtoint ptr %val to i64, !dbg !28 %2 = atomicrmw add ptr %a, i64 %1 monotonic, align 8, !dbg !28 store i64 %2, ptr %0, align 8, !dbg !28 %self = load ptr, ptr %0, align 8, !dbg !28 call void `@llvm.lifetime.end.p0(i64` 8, ptr %0), !dbg !28 ret ptr %self, !dbg !33 } ``` r? `@RalfJung` cc `@nikic`	2024-04-15 08:07:47 +00:00
Matthias Krüger	f4f644182b	Rollup merge of #123775 - scottmcm:place-val, r=cjgillot Make `PlaceRef` and `OperandValue::Ref` share a common `PlaceValue` type Both `PlaceRef` and `OperandValue::Ref` need the triple of the backend pointer immediate, the optional backend metadata for DSTs, and the actual alignment of the place (since it can differ from the ABI alignment). This PR introduces a new `PlaceValue` type for those three values, leaving [`PlaceRef`](https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/mir/place/struct.PlaceRef.html) with the `TyAndLayout` and a `PlaceValue`, just like how [`OperandRef`](https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/mir/operand/struct.OperandRef.html) is a `TyAndLayout` and an `OperandValue`. This means that various places that use `Ref`s as places can just pass the `PlaceValue` along, like in the below excerpt from the diff: ```diff match operand.val { - OperandValue::Ref(ptr, meta, align) => { - debug_assert_eq!(meta, None); + OperandValue::Ref(source_place_val) => { + debug_assert_eq!(source_place_val.llextra, None); debug_assert!(matches!(operand_kind, OperandValueKind::Ref)); - let fake_place = PlaceRef::new_sized_aligned(ptr, cast, align); + let fake_place = PlaceRef { val: source_place_val, layout: cast }; Some(bx.load_operand(fake_place).val) } ``` There's more refactoring that I'd like to do after this, but I wanted to stop the PR here where it's hopefully easy (albeit probably not quick) to review since I tried to keep every change line-by-line clear. (Most are just adding `.val` to get to a field.) You can also go commit-at-a-time if you'd like. Each passed tidy and the codegen tests on my machine (though I didn't run the cg_gcc ones).	2024-04-12 04:38:21 +02:00
Erik Desjardins	f4426c189f	use [N x i8] for alloca types	2024-04-11 21:42:35 -04:00
bors	05ccc49a44	Auto merge of #123507 - dpaoliello:arm64ecasm, r=Amanieu Add support for Arm64EC inline assembly (as unstable) Compiler support for Arm64EC assembly mostly reuses the existing AArch64 support, except that it needs to block registers that are not permitted: <https://learn.microsoft.com/en-us/windows/arm/arm64ec-abi#register-mapping-and-blocked-registers> For assembly authors there are several caveats and differences that need to be considered, I've provided documentation for this as part of the "Standard Library Support" PR: <https://github.com/rust-lang/rust/pull/123144/files#diff-6b08532480943c8b82f5dbda7ee1521afa74c9f626466aeb308dfa6956397edd> r? rust-lang/compiler	2024-04-11 07:15:04 +00:00
Scott McMurray	d0ae76848a	Add load/store helpers that take `PlaceValue`	2024-04-11 00:10:10 -07:00
Scott McMurray	3596098823	Put `PlaceValue` into `OperandValue::Ref`, rather than 3 tuple fields	2024-04-11 00:10:10 -07:00
Scott McMurray	89502e584b	Make `PlaceRef` hold a `PlaceValue` for the non-layout fields (like `OperandRef` does)	2024-04-11 00:10:10 -07:00
Daniel Paoliello	2e44d29460	Add support for Arm64EC inline assembly	2024-04-10 10:06:44 -07:00
bors	c2239bca5b	Auto merge of #123185 - scottmcm:more-typed-copy, r=compiler-errors Remove my `scalar_copy_backend_type` optimization attempt I added this back in https://github.com/rust-lang/rust/pull/111999 , but I no longer think it's a good idea - It had to get scaled back to only power-of-two things to not break a bunch of targets - LLVM seems to be getting better at memcpy removal anyway - Introducing vector instructions has seemed to sometimes (https://github.com/rust-lang/rust/pull/115515#issuecomment-1750069529) make autovectorization worse So this removes it from the codegen crates entirely, and instead just tries to use <https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/traits/builder/trait.BuilderMethods.html#method.typed_place_copy> instead of direct `memcpy` so things will still use load/store when a type isn't `OperandValue::Ref`.	2024-04-10 16:32:41 +00:00

... 5 6 7 8 9 ...

2498 Commits