nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-06-18 10:38:11 +00:00

Author	SHA1	Message	Date
bors	65899c06f1	Auto merge of #138893 - klensy:thorin-0.9, r=Mark-Simulacrum bump thorin to 0.9 to drop duped deps Bumps `thorin`, removing duped deps. This also changes features for hashbrown: ``` hashbrown v0.15.2 `-- indexmap v2.7.0 \|-- object v0.36.7 \|-- wasmparser v0.219.1 \|-- wasmparser v0.223.0 `-- wit-component v0.223.0 \|-- indexmap feature "default" \|-- indexmap feature "serde" `-- indexmap feature "std" \|-- hashbrown feature "default-hasher" \| \|-- object v0.36.7 () \| `-- wasmparser v0.223.0 () \|-- hashbrown feature "nightly" \| \|-- rustc_data_structures v0.0.0 \| `-- rustc_query_system v0.0.0 `-- hashbrown feature "serde" `-- wasmparser feature "serde" ``` to ``` hashbrown v0.15.2 `-- indexmap v2.7.0 \|-- object v0.36.7 \|-- wasmparser v0.219.1 \|-- wasmparser v0.223.0 `-- wit-component v0.223.0 \|-- indexmap feature "default" \|-- indexmap feature "serde" `-- indexmap feature "std" \|-- hashbrown feature "allocator-api2" \| `-- hashbrown feature "default" \|-- hashbrown feature "default" () \|-- hashbrown feature "default-hasher" \| \|-- object v0.36.7 () \| `-- wasmparser v0.223.0 () \| `-- hashbrown feature "default" () \|-- hashbrown feature "equivalent" \| `-- hashbrown feature "default" () \|-- hashbrown feature "inline-more" \| `-- hashbrown feature "default" () \|-- hashbrown feature "nightly" \| \|-- rustc_data_structures v0.0.0 \| `-- rustc_query_system v0.0.0 \|-- hashbrown feature "raw-entry" \| `-- hashbrown feature "default" (*) `-- hashbrown feature "serde" `-- wasmparser feature "serde" ``` To be safe, as this can be perf-sensitive: `@bors` rollup=never	2025-03-26 07:54:26 +00:00
Mads Marquart	328846c6eb	Rename `is_like_osx` to `is_like_darwin`	2025-03-25 21:53:52 +01:00
Matthias Krüger	b66e9320c5	Rollup merge of #137247 - dpaoliello:cleanllvm, r=Zalathar cg_llvm: Reduce the visibility of types, modules and using declarations in `rustc_codegen_llvm`. Final part of #135502 Reduces the visibility of types, modules and using declarations in the `rustc_codegen_llvm` to private or `pub(crate)` where possible, and marks unused fields and enum entries with `#[expect(dead_code)]`. r? Zalathar	2025-03-25 18:09:03 +01:00
Daniel Paoliello	79b9664091	Reduce visibility of most items in `rustc_codegen_llvm`	2025-03-25 16:36:47 +11:00
bors	1df5affaca	Auto merge of #133984 - DaniPopes:scmp-ucmp, r=scottmcm Lower BinOp::Cmp to llvm.{s,u}cmp.* intrinsics Lowers `mir::BinOp::Cmp` (`three_way_compare` intrinsic) to the corresponding LLVM `llvm.{s,u}cmp.i8.*` intrinsics. These are the intrinsics mentioned in https://github.com/rust-lang/rust/pull/118310, which are now available in LLVM 19. I couldn't find any follow-up PRs/discussions about this, please let me know if I missed something. r? `@scottmcm`	2025-03-24 22:53:12 +00:00
klensy	724a5a430b	bump thorin to drop duped deps	2025-03-24 19:38:16 +03:00
Matthias Krüger	0c594da55f	Rollup merge of #138627 - EnzymeAD:autodiff-cleanups, r=oli-obk Autodiff cleanups Splitting out some cleanups to reduce the size of my batching PR and simplify ``@haenoe`` 's [PR](https://github.com/rust-lang/rust/pull/138314). r? ``@oli-obk`` Tracking: - https://github.com/rust-lang/rust/issues/124509	2025-03-21 15:48:55 +01:00
Taiki Endo	55add8fce3	rustc_target: Add more RISC-V vector-related features	2025-03-20 19:47:57 +09:00
Zalathar	2e36990881	coverage: Convert and check span coordinates without a local file ID For expansion region support, we will want to be able to convert and check spans before creating a corresponding local file ID. If we create local file IDs eagerly, but some expansion turns out to have no successfully-converted spans, LLVM will complain about that expansion's file ID having no regions.	2025-03-20 13:29:32 +11:00
Zalathar	d07ef5b0e1	coverage: Add LLVM plumbing for expansion regions This is currently unused, but paves the way for future work on expansion regions without having to worry about the FFI parts.	2025-03-20 12:40:36 +11:00
Matthias Krüger	5661e98058	Rollup merge of #138674 - oli-obk:llvm-cleanups, r=compiler-errors Various codegen_llvm cleanups Mostly just adding safe wrappers and deduplicating code	2025-03-19 08:17:19 +01:00
Oli Scherer	f4b0984854	Create a safe wrapper around `LLVMRustDIBuilderCreateMemberType`	2025-03-18 17:15:02 +00:00
Oli Scherer	1f34b19596	Avoid splitting up a layout	2025-03-18 17:01:09 +00:00
Zalathar	cc8336b6c1	coverage: Don't store a body span in `FunctionCoverageInfo`	2025-03-18 23:18:24 +11:00
Zalathar	cd2b978433	coverage: Don't refer to the body span when enlarging empty spans Given that we now only enlarge empty spans to "{" or "}", there shouldn't be any danger of enlarging beyond a function body.	2025-03-18 23:18:23 +11:00
Manuel Drehwald	47c07ed963	[NFC] simplify matching	2025-03-17 19:13:09 -04:00
Manuel Drehwald	f4c297802f	[NFC] extract autodiff call lowering in cg_llvm into own function	2025-03-17 18:58:51 -04:00
bors	493c38ba37	Auto merge of #127173 - bjorn3:mangle_rustc_std_internal_symbol, r=wesleywiser,jieyouxu Mangle rustc_std_internal_symbols functions This reduces the risk of issues when using a staticlib or rust dylib compiled with a different rustc version in a rust program. Currently this will either (in the case of staticlib) cause a linker error due to duplicate symbol definitions, or (in the case of rust dylibs) cause rustc_std_internal_symbols functions to be silently overridden. As rust gets more commonly used inside the implementation of libraries consumed with a C interface (like Spidermonkey, Ruby YJIT (currently has to do partial linking of all rust code to hide all symbols not part of the C api), the Rusticl OpenCL implementation in mesa) this is becoming much more of an issue. With this PR the only symbols remaining with an unmangled name are rust_eh_personality (LLVM doesn't allow renaming it) and `__rust_no_alloc_shim_is_unstable`. Helps mitigate https://github.com/rust-lang/rust/issues/104707 try-job: aarch64-gnu-debug try-job: aarch64-apple try-job: x86_64-apple-1 try-job: x86_64-mingw-1 try-job: i686-mingw-1 try-job: x86_64-msvc-1 try-job: i686-msvc-1 try-job: test-various try-job: armhf-gnu	2025-03-17 22:16:22 +00:00
Oli Scherer	018032c682	Create a safe wrapper around `LLVMRustDIBuilderCreateBasicType`	2025-03-17 16:58:44 +00:00
Oli Scherer	cc41dd4fa1	Create a safe wrapper function around `LLVMRustDIBuilderCreateFile`	2025-03-17 16:58:21 +00:00
Oli Scherer	e19e4e3a4b	Create a safe wrapper around `LLVMRustDIBuilderCreateSubroutineType`	2025-03-17 16:39:52 +00:00
Oli Scherer	6adc2c1fd6	Deduplicate template parameter creation	2025-03-17 16:32:21 +00:00
Oli Scherer	b4acf7a51e	Immediately create an `Option` instead of reallocating for it later	2025-03-17 16:17:48 +00:00
Oli Scherer	eef70a9db5	Create a safe wrapper around LLVMRustDIBuilderCreateTemplateTypeParameter	2025-03-17 15:56:48 +00:00
Matthias Krüger	8f5c09b37c	Rollup merge of #138349 - 1c3t3a:external-weak-cfi, r=rcvalle Emit function declarations for functions with `#[linkage="extern_weak"]` Currently, when declaring an extern weak function in Rust, we use the following syntax: ```rust unsafe extern "C" { #[linkage = "extern_weak"] static FOO: Option<unsafe extern "C" fn() -> ()>; } ``` This allows runtime-checking the extern weak symbol through the Option. When emitting LLVM-IR, the Rust compiler currently emits this static as an i8, and a pointer that is initialized with the value of the global i8 and represents the nullability e.g. ``` @FOO = extern_weak global i8 @_rust_extern_with_linkage_FOO = internal global ptr @FOO ``` This approach does not work well with CFI, where we need to attach CFI metadata to a concrete function declaration, which was pointed out in https://github.com/rust-lang/rust/issues/115199. This change switches to emitting a proper function declaration instead of a global i8. This allows CFI to work for extern_weak functions. Example: ``` @_rust_extern_with_linkage_FOO = internal global ptr @FOO ... declare !type !61 !type !62 !type !63 !type !64 extern_weak void @FOO(double) unnamed_addr #6 ``` We keep initializing the Rust internal symbol with the function declaration, which preserves the correct behavior for runtime checking the Option. r? `@rcvalle` cc `@jakos-sec` try-job: test-various	2025-03-17 16:34:50 +01:00
bjorn3	b754ef727c	Remove implicit #[no_mangle] for #[rustc_std_internal_symbol]	2025-03-17 14:08:09 +00:00
Adwin White	8e235258f3	fix(debuginfo): avoid overflow when handling expanding recursive type	2025-03-17 18:33:40 +08:00
Bastian Kersting	b30cf11b96	Emit function declarations for functions with #[linkage="extern_weak"] Currently, when declaring an extern weak function in Rust, we use the following syntax: ```rust unsafe extern "C" { #[linkage = "extern_weak"] static FOO: Option<unsafe extern "C" fn() -> ()>; } ``` This allows runtime-checking the extern weak symbol through the Option. When emitting LLVM-IR, the Rust compiler currently emits this static as an i8, and a pointer that is initialized with the value of the global i8 and represents the nullability e.g. ``` @FOO = extern_weak global i8 @_rust_extern_with_linkage_FOO = internal global ptr @FOO ``` This approach does not work well with CFI, where we need to attach CFI metadata to a concrete function declaration, which was pointed out in https://github.com/rust-lang/rust/issues/115199. This change switches to emitting a proper function declaration instead of a global i8. This allows CFI to work for extern_weak functions. We keep initializing the Rust internal symbol with the function declaration, which preserves the correct behavior for runtime checking the Option. Co-authored-by: Jakob Koschel <jakobkoschel@google.com>	2025-03-17 08:27:53 +00:00
bors	227690a258	Auto merge of #137011 - LuuuXXX:promote-ohos-with-host-tools, r=Amanieu Promote ohos targets to tier2 with host tools. ### What does this PR try to resolve? Try to promote the following [Tier 2 without Host Tools](https://doc.rust-lang.org/rustc/platform-support.html#tier-2-without-host-tools) targets to [Tier 2 with Host Tools](https://doc.rust-lang.org/rustc/platform-support.html#tier-2-with-host-tools): - `aarch64-unknown-linux-ohos` - `armv7-unknown-linux-ohos` - `x86_64-unknown-linux-ohos` ### More Information? see MCP: https://github.com/rust-lang/compiler-team/issues/811 ### Blockage to be solved? - [x] Submit an MCP - [x] Submit the code to promote the ohos targets - [x] Resolve related dependencies (`measureme`) The modified `measureme` code has been merged (see https://github.com/rust-lang/measureme/pull/238). [done] The new version was released (https://github.com/rust-lang/measureme/pull/240). [done]	2025-03-16 18:42:18 +00:00
Matthias Krüger	d93ef397ce	Rollup merge of #138331 - nnethercote:use-RUSTC_LINT_FLAGS-more, r=onur-ozkan,jieyouxu Use `RUSTC_LINT_FLAGS` more An alternative to the failed #138084. Fixes #138106. r? ````@jieyouxu````	2025-03-12 17:59:08 +01:00
bors	ebf0cf75d3	Auto merge of #137586 - nnethercote:SetImpliedBits, r=bjorn3 Speed up target feature computation The LLVM backend calls `LLVMRustHasFeature` twice for every feature. In short-running rustc invocations, this accounts for a surprising amount of work. r? `@bjorn3`	2025-03-11 12:05:16 +00:00
Nicholas Nethercote	ff0a5fe975	Remove `#![warn(unreachable_pub)]` from all `compiler/` crates. It's no longer necessary now that `-Wunreachable_pub` is being passed.	2025-03-11 13:14:21 +11:00
许杰友 Jieyou Xu (Joe)	063ef18fdc	Revert "Use workspace lints for crates in `compiler/` #138084 " Revert <https://github.com/rust-lang/rust/pull/138084> to buy time to consider options that avoid breaking downstream usages of cargo on distributed `rustc-src` artifacts, where such cargo invocations fail due to inability to inherit `lints` from workspace root manifest's `workspace.lints` (this is only valid for the source rust-lang/rust workspace, but not really the distributed `rustc-src` artifacts). This breakage was reported in <https://github.com/rust-lang/rust/issues/138304>. This reverts commit `48caf81484`, reversing changes made to `c6662879b2`.	2025-03-10 18:12:47 +08:00
Matthias Krüger	827bb5e27b	Rollup merge of #122790 - Zoxc:dllimp-rev, r=ChrisDenton Apply dllimport in ThinLTO This partially reverts https://github.com/rust-lang/rust/pull/103353 by properly applying `dllimport` if `-Z dylib-lto` is passed. That PR should probably fully be reverted as it looks quite sketchy. We don't know locally if the entire crate graph would be statically linked. This should hopefully be sufficient to make ThinLTO work for rustc on Windows. r? ``@wesleywiser`` --- Edit: This PR is changed to just generally revert https://github.com/rust-lang/rust/pull/103353.	2025-03-09 16:41:48 +01:00
Matthias Krüger	48caf81484	Rollup merge of #138084 - nnethercote:workspace-lints, r=jieyouxu Use workspace lints for crates in `compiler/` This is nicer and hopefully less error prone than specifying lints via bootstrap. r? ``@jieyouxu``	2025-03-09 10:34:50 +01:00
Nicholas Nethercote	8a3e03392e	Remove `#![warn(unreachable_pub)]` from all `compiler/` crates. (Except for `rustc_codegen_cranelift`.) It's no longer necessary now that `unreachable_pub` is in the workspace lints.	2025-03-08 08:41:43 +11:00
Nicholas Nethercote	beba32cebb	Specify rust lints for `compiler/` crates via Cargo. By naming them in `[workspace.lints.rust]` in the top-level `Cargo.toml`, and then making all `compiler/` crates inherit them with `[lints] workspace = true`. (I omitted `rustc_codegen_{cranelift,gcc}`, because they're a bit different.) The advantages of this over the current approach: - It uses a standard Cargo feature, rather than special handling in bootstrap. So, easier to understand, and less likely to get accidentally broken in the future. - It works for proc macro crates. It's a shame it doesn't work for rustc-specific lints, as the comments explain.	2025-03-08 08:41:09 +11:00
Matthias Krüger	63c548d82c	Rollup merge of #137549 - oli-obk:llvm-ffi, r=davidtwco Clean up various LLVM FFI things in codegen_llvm cc ```@ZuseZ4``` I touched some autodiff parts The major change of this PR is [`bfd88ce`](`bfd88cead0`) which makes `CodegenCx` generic just like `GenericBuilder` The other commits mostly took advantage of the new feature of making extern functions safe, but also just used some wrappers that were already there and shrunk unsafe blocks. best reviewed commit-by-commit	2025-03-07 19:15:34 +01:00
DaniPopes	58c10c66c1	Lower BinOp::Cmp to llvm.{s,u}cmp.* intrinsics Lowers `mir::BinOp::Cmp` (`three_way_compare` intrinsic) to the corresponding LLVM `llvm.{s,u}cmp.i8.*` intrinsics, added in LLVM 19.	2025-03-06 22:29:05 +08:00
sayantn	7c2434c52c	Add the `movrs` target feature and `movrs_target_feature` feature gate	2025-03-05 05:34:37 +05:30
sayantn	0ec1d460bb	Add the new `amx` target features	2025-03-05 05:34:37 +05:30
Nicholas Nethercote	cee3114544	Remove out of date comment. No smallvecs here.	2025-03-05 09:52:28 +11:00
Nicholas Nethercote	35b7994ea8	Use `collect` to initialize `features`.	2025-03-05 09:52:26 +11:00
Nicholas Nethercote	936a8232df	Change signature of `target_features_cfg`. Currently it is called twice, once with `allow_unstable` set to true and once with it set to false. This results in some duplicated work. Most notably, for the LLVM backend, `LLVMRustHasFeature` is called twice for every feature, and it's moderately slow. For very short running compilations on platforms with many features (e.g. a `check` build of hello-world on x86) this is a significant fraction of runtime. This commit changes `target_features_cfg` so it is only called once, and it now returns a pair of feature sets. This halves the number of `LLVMRustHasFeature` calls.	2025-03-05 09:49:17 +11:00
Nicholas Nethercote	2df8e657f2	Simplify `implied_target_features`. Currently its argument is an iterator, but in practice it's always a singleton.	2025-03-05 09:20:28 +11:00
Nicholas Nethercote	1df93fd6a7	Avoid double interning of feature names. Also improve some comments.	2025-03-05 09:20:27 +11:00
LuuuXXX	7279acf202	use measureme-12.0.1	2025-03-04 17:13:46 +08:00
LuuuXXX	6324b39873	promote ohos targets to tier 2 with host tools	2025-03-04 17:13:46 +08:00
bors	fd17deacce	Auto merge of #137959 - matthiaskrgr:rollup-62vjvwr, r=matthiaskrgr Rollup of 12 pull requests Successful merges: - #135767 (Future incompatibility warning `unsupported_fn_ptr_calling_conventions`: Also warn in dependencies) - #137852 (Remove layouting dead code for non-array SIMD types.) - #137863 (Fix pretty printing of unsafe binders) - #137882 (do not build additional stage on compiler paths) - #137894 (Revert "store ScalarPair via memset when one side is undef and the other side can be memset") - #137902 (Make `ast::TokenKind` more like `lexer::TokenKind`) - #137921 (Subtree update of `rust-analyzer`) - #137922 (A few cleanups after the removal of `cfg(not(parallel))`) - #137939 (fix order on shl impl) - #137946 (Fix docker run-local docs) - #137955 (Always allow rustdoc-json tests to contain long lines) - #137958 (triagebot.toml: Don't label `test/rustdoc-json` as A-rustdoc-search) r? `@ghost` `@rustbot` modify labels: rollup	2025-03-04 02:27:56 +00:00
Matthias Krüger	70b9968d1e	Rollup merge of #137894 - compiler-errors:no-scalar-pair-opt, r=oli-obk Revert "store ScalarPair via memset when one side is undef and the other side can be memset" cc #137892 reverts #135335 r? oli-obk	2025-03-03 20:47:12 +01:00
John Kåre Alsaker	cc39e5f266	Apply dllimport in ThinLTO	2025-03-03 13:44:53 +01:00
Matthias Krüger	fd4bf82264	Rollup merge of #137741 - cuviper:const_str-raw_entry, r=Mark-Simulacrum Stop using `hash_raw_entry` in `CodegenCx::const_str` That unstable feature (#56167) completed fcp-close, so the compiler needs to be migrated away to allow its removal. In this case, `cg_llvm` and `cg_gcc` were using raw entries to optimize their `const_str_cache` lookup and insertion. We can change that to separate `get` and (on miss) `insert` calls, so we still have the fast path avoiding string allocation when the cache hits.	2025-03-03 10:41:00 +01:00
Michael Goulet	a59a8f9e75	Revert "Auto merge of #135335 - oli-obk:push-zxwssomxxtnq, r=saethlin" This reverts commit `a7a6c64a65`, reversing changes made to `ebbe63891f`.	2025-03-02 18:52:48 +00:00
Matthias Krüger	3bf976542a	Rollup merge of #137804 - RalfJung:backend-repr-simd-vector, r=workingjubilee rename BackendRepr::Vector → SimdVector For many Rustaceans, "vector" does not imply "SIMD", so let's be more clear in this type that is used pervasively in the compiler. r? `@workingjubilee`	2025-03-01 16:03:10 +01:00
bors	0c72c0d11a	Auto merge of #133250 - DianQK:embed-bitcode-pgo, r=nikic The embedded bitcode should always be prepared for LTO/ThinLTO Fixes #115344. Fixes #117220. There are currently two methods for generating bitcode that is used for LTO. One method involves using `-C linker-plugin-lto` to emit object files as bitcode, which is the typical setting used by cargo. The other method is through `-C embed-bitcode=yes`. When used with `-C embed-bitcode=yes -C lto=no`, we run a complete non-LTO LLVM pipeline to obtain bitcode, then the bitcode is used for LTO. We run the Call Graph Profile Pass twice on the same module. This PR is doing something similar to LLVM's `buildFatLTODefaultPipeline`, obtaining the bitcode for embedding after running `buildThinLTOPreLinkDefaultPipeline`. r? nikic	2025-03-01 08:22:18 +00:00
bors	30508faeb3	Auto merge of #137796 - jieyouxu:rollup-qt9yr1g, r=jieyouxu Rollup of 10 pull requests Successful merges: - #134943 (Add FileCheck annotations to mir-opt/issues) - #137017 (Don't error when adding a staticlib with bitcode files compiled by newer LLVM) - #137197 (Update some comparison codegen tests now that they pass in LLVM20) - #137540 (Fix (more) test directives that were accidentally ignored) - #137551 (import `simd_` intrinsics) - #137599 (tests: use minicore more) - #137673 (Fix Windows `Command` search path bug) - #137676 (linker: Fix escaping style for response files on Windows) - #137693 (Re-enable `--generate-link-to-defintion` for tools internal rustdoc) - #137770 (Fix sized constraint for unsafe binder) r? `@ghost` `@rustbot` modify labels: rollup	2025-03-01 00:53:19 +00:00
Ralf Jung	aac65f562b	rename BackendRepr::Vector → SimdVector	2025-02-28 17:17:45 +01:00
许杰友 Jieyou Xu (Joe)	61e90040db	Rollup merge of #137017 - bjorn3:ignore_invalid_bitcode, r=oli-obk Don't error when adding a staticlib with bitcode files compiled by newer LLVM cc https://github.com/rust-lang/rust/issues/128955#issuecomment-2657811196	2025-02-28 22:29:49 +08:00
许杰友 Jieyou Xu (Joe)	d65f568302	Rollup merge of #137713 - vayunbiyani:fix-enzyme-build-errors, r=oli-obk Fix enzyme build errors After [this PR](https://github.com/rust-lang/rust/pull/136428) was merged, I switched to master and attempted building `./x.py build --stage 1 library` with the config mentioned in the enzyme rustbook but it resulted in some errors tho the config.example.toml build succeeded The errors were re: ### 1. Use of ref in match patterns The errors were related to match ergonomics in Rust 2024, where ref is no longer needed when matching on references. Examples: ``` error: binding modifiers may only be written when the default binding mode is `move` --> compiler/rustc_builtin_macros/src/autodiff.rs:136:31 \| 136 \| Annotatable::Item(ref iitem) => { \| ^^^ binding modifier not allowed under `ref` default binding mode \| = note: for more information, see <https://doc.rust-lang.org/nightly/edition-guide/rust-2024/match-ergonomics.html> note: matching on a reference type with a non-reference pattern changes the default binding mode --> compiler/rustc_builtin_macros/src/autodiff.rs:136:13 \| 136 \| Annotatable::Item(ref iitem) => { \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ this matches on type `&_` help: remove the unnecessary binding modifier \| 136 - Annotatable::Item(ref iitem) => { 136 + Annotatable::Item(iitem) => { \| error: binding modifiers may only be written when the default binding mode is `move` --> compiler/rustc_builtin_macros/src/autodiff.rs:146:36 \| 146 \| Annotatable::AssocItem(ref assoc_item, _) => { \| ^^^ binding modifier not allowed under `ref` default binding mode \| = note: for more information, see <https://doc.rust-lang.org/nightly/edition-guide/rust-2024/match-ergonomics.html> note: matching on a reference type with a non-reference pattern changes the default binding mode --> compiler/rustc_builtin_macros/src/autodiff.rs:146:13 \| 146 \| Annotatable::AssocItem(ref assoc_item, _) => { \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ this 
matches on type `&_` help: remove the unnecessary binding modifier \| 146 - Annotatable::AssocItem(ref assoc_item, _) => { 146 + Annotatable::AssocItem(assoc_item, _) => { \| error: binding modifiers may only be written when the default binding mode is `move` --> compiler/rustc_builtin_macros/src/autodiff.rs:174:31 \| 174 \| ... Annotatable::Item(ref iitem) => (iitem.vis.clone(), iitem.ide... \| ^^^ binding modifier not allowed under `ref` default binding mode \| = note: for more information, see <https://doc.rust-lang.org/nightly/edition-guide/rust-2024/match-ergonomics.html> note: matching on a reference type with a non-reference pattern changes the default binding mode --> compiler/rustc_builtin_macros/src/autodiff.rs:174:13 \| 174 \| ... Annotatable::Item(ref iitem) => (iitem.vis.clone(), iitem.ident.c... \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ this matches on type `&_` help: remove the unnecessary binding modifier \| 174 - Annotatable::Item(ref iitem) => (iitem.vis.clone(), iitem.ident.clone()), 174 + Annotatable::Item(iitem) => (iitem.vis.clone(), iitem.ident.clone()), \| error: binding modifiers may only be written when the default binding mode is `move` --> compiler/rustc_builtin_macros/src/autodiff.rs:175:36 \| 175 \| Annotatable::AssocItem(ref assoc_item, _) => { \| ^^^ binding modifier not allowed under `ref` default binding mode \| = note: for more information, see <https://doc.rust-lang.org/nightly/edition-guide/rust-2024/match-ergonomics.html> note: matching on a reference type with a non-reference pattern changes the default binding mode --> compiler/rustc_builtin_macros/src/autodiff.rs:175:13 \| 175 \| Annotatable::AssocItem(ref assoc_item, _) => { \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ this matches on type `&_` help: remove the unnecessary binding modifier \| 175 - Annotatable::AssocItem(ref assoc_item, _) => { 175 + Annotatable::AssocItem(assoc_item, _) => { \| error: could not compile `rustc_builtin_macros` (lib) due to 4 previous errors warning: 
build failed, waiting for other jobs to finish... Build completed unsuccessfully in 0:19:39 ``` ### 2. the use of external C blocks without unsafe in compiler/rustc_codegen_llvm/src/llvm/enzyme_ffi.rs (I don't have the error message handy) The first commit fixes the errors above --- ## Additional Improvement: `@ZuseZ4` suggested we consolidate the variants under `#[cfg(llvm_enzyme)]` and `#[cfg(not(llvm_enzyme))]` by conditionally checking for `cfg!(llvm_enzyme)` instead. This way, the autodiff code is compiled but not executed avoiding such regressions r? `@ZuseZ4` cc: `@oli-obk`	2025-02-28 21:42:01 +08:00
Josh Stone	396c2a8659	Stop using `hash_raw_entry` in `CodegenCx::const_str` That unstable feature completed fcp-close, so the compiler needs to be migrated away to allow its removal. In this case, `cg_llvm` and `cg_gcc` were using raw entries to optimize their `const_str_cache` lookup and insertion. We can change that to separate `get` and (on miss) `insert` calls, so we still have the fast path avoiding string allocation when the cache hits.	2025-02-27 09:09:52 -08:00
bjorn3	9f190d764f	Restore usage of io::Error	2025-02-26 13:45:35 +00:00
León Orell Valerian Liehr	a579a23a73	Rollup merge of #137603 - davidtwco:extern-types-no-deref, r=lcnr codegen_llvm: avoid `Deref` impls w/ extern type `rustc_codegen_llvm` relied on `Deref` impls where `Deref::Target` was or contained an extern type - in my experimental implementation of rust-lang/rfcs#3729, this isn't possible as the `Target` associated type's `?Sized` bound cannot be relaxed backwards compatibly (unless we come up with some way of doing this). In later pull requests with the rust-lang/rfcs#3729 implementation, breakage like this could only occur for nightly users relying on the `extern_types` feature. Upstreaming this to avoid needing to keep carrying this patch locally, and I think it'll necessarily need to change eventually.	2025-02-26 04:15:06 +01:00
León Orell Valerian Liehr	1511ccd6f8	Rollup merge of #137595 - folkertdev:remove-simd-pow-powi, r=RalfJung remove `simd_fpow` and `simd_fpowi` Discussed in https://github.com/rust-lang/rust/issues/137555 These functions are not exposed from `std::intrinsics::simd`, and not used anywhere outside of the compiler. They also don't lower to particularly good code at least on the major ISAs (I checked x86_64, aarch64, s390x, powerpc), where the vector is just spilled to the stack and scalar functions are used for the actual logic. r? `@RalfJung`	2025-02-25 13:07:40 +01:00
Vayun Biyani	cb53e97870	Fix enzyme build errors	2025-02-25 17:25:50 +05:30
Folkert de Vries	60a268998c	remove `simd_fpow` and `simd_fpowi`	2025-02-25 09:20:10 +01:00
Michael Goulet	6c1f959288	Rollup merge of #137556 - RalfJung:simd_shuffle_const_generic, r=oli-obk rename simd_shuffle_generic → simd_shuffle_const_generic I've been confused by this name one time too often. ;) r? `@oli-obk`	2025-02-24 19:21:51 -05:00
Michael Goulet	828a3a41b3	Rollup merge of #137417 - taiki-e:riscv-atomic, r=Amanieu rustc_target: Add more RISC-V atomic-related features This is a continuation of https://github.com/rust-lang/rust/pull/130877 and adds a few target features, including `zacas`, which was experimental in LLVM 19 and marked non-experimental in LLVM 20. This adds the following target features to unstable riscv_target_feature: - `za64rs` (Za64rs Extension 1.0): Reservation Set Size of at Most 64 Bytes ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-20.1.0-rc2/llvm/lib/Target/RISCV/RISCVFeatures.td#L227-L228), [available since LLVM 18](`8649328060`)) - `za128rs` (Za128rs Extension 1.0): Reservation Set Size of at Most 128 Bytes ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-20.1.0-rc2/llvm/lib/Target/RISCV/RISCVFeatures.td#L230-L231), [available since LLVM 18](`8649328060`)) - IIUC, `za*rs` can be referenced when implementing helpers to reduce contention in synchronization primitives, like [`crossbeam_utils::CachePadded`](https://docs.rs/crossbeam-utils/latest/crossbeam_utils/struct.CachePadded.html). (relevant discussion: https://github.com/riscv/riscv-profiles/issues/79) - `zacas` (Zacas Extension 1.0): Atomic Compare-And-Swap Instructions (`amocas.{w,d,q}{,.aq,.rl,.aqrl}` and `amocas.{b,h}{,.aq,.rl,.aqrl}` when `zabha` is also enabled) ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-20.1.0-rc2/llvm/lib/Target/RISCV/RISCVFeatures.td#L240-L243), [available as non-experimental since LLVM 20](`614aeda93b`)) - This implies `zaamo`. - This is used to optimize CAS in existing atomics and/or implement 64-bit/128-bit atomics on riscv32/riscv64 (e.g., https://github.com/taiki-e/portable-atomic/pull/173). 
- Note that [LLVM does not automatically use this instruction for 64-bit/128-bit atomics on riscv32/riscv64 even if this feature is enabled, because doing it changes the ABI](`876174ffd7/llvm/docs/RISCVUsage.rst (riscv-zacas-note)`). (If the ability to do that is provided by LLVM in the future, it should probably be controlled by another ABI feature similar to `forced-atomics`.) - `zama16b` (Zama16b Extension 1.0): Atomic 16-byte misaligned loads, stores and AMOs ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-20.1.0-rc2/llvm/lib/Target/RISCV/RISCVFeatures.td#L255-L256), [available since LLVM 19](`b090569685`)) - IIUC, unlike AArch64 FEAT_LSE2 which also makes 16-byte aligned ldp ({i,u}128 load) atomic, this extension only affects instructions that already considered atomic if they were naturally aligned. i.e., fld (f64 load) on riscv32 would not be atomic with or without this extension ([relevant QEMU code](`b69801dd6b/target/riscv/insn_trans/trans_rvd.c.inc (L50-L62)`)). - `zawrs` (Zawrs Extension 1.0): Wait on Reservation Set (`wrs.nto` and `wrs.sto`) ([definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-20.1.0-rc2/llvm/lib/Target/RISCV/RISCVFeatures.td#L258), [available as non-experimental since LLVM 17](`d41a73aa94`)) - This is used to optimize synchronization primitives (e.g., Linux uses this for spinlocks (`b8ddb0df30`)). Btw, the question of whether `zaamo` is implied by `zabha` or not, which was discussed in https://github.com/rust-lang/rust/pull/130877, has been resolved in LLVM 20, since LLVM now treats `zaamo` as implied by `zabha`/`zacas` (https://github.com/llvm/llvm-project/pull/115694), just like GCC and rustc. r? `@Amanieu` `@rustbot` label +O-riscv +A-target-feature	2025-02-24 19:21:47 -05:00
Ralf Jung	0362775fb5	rename simd_shuffle_generic → simd_shuffle_const_generic	2025-02-24 19:13:23 +01:00
Oli Scherer	553828c6f4	Mark more LLVM FFI as safe	2025-02-24 15:11:29 +00:00
Oli Scherer	3565603d25	Use a safe wrapper around an LLVM FFI function	2025-02-24 15:11:29 +00:00
Oli Scherer	f16f64b15a	Remove inherent function that has a trait method duplicate of a commonly imported trait	2025-02-24 15:11:29 +00:00
Oli Scherer	241c83f0c7	Deduplicate more functions between `SimpleCx` and `CodegenCx`	2025-02-24 15:11:29 +00:00
Oli Scherer	29440b84a9	Remove an unused lifetime param	2025-02-24 15:11:29 +00:00
Oli Scherer	396baa750e	Make allocator shim creation mostly use safe code	2025-02-24 15:11:29 +00:00
Oli Scherer	840e31b29f	Generalize BaseTypeCodegenMethods	2025-02-24 15:11:29 +00:00
Oli Scherer	75356b7437	Generalize `BackendTypes` over `GenericCx`	2025-02-24 15:11:29 +00:00
Oli Scherer	bfd88cead0	Avoid some duplication between SimpleCx and CodegenCx	2025-02-24 15:11:29 +00:00
Oli Scherer	d4379d2afd	Remove an unnecessary lifetime	2025-02-24 15:05:56 +00:00
Oli Scherer	a54bfcf52b	Use safe FFI for various functions in codegen_llvm	2025-02-24 15:05:56 +00:00
David Wood	a5615d3c62	codegen_llvm: avoid `Deref` impls w/ extern type `rustc_codegen_llvm` relied on `Deref` impls where `Deref::Target` was or contained an extern type - in my experimental implementation of rust-lang/rfcs#3729, this isn't possible as the `Target` associated type's `?Sized` bound cannot be relaxed backwards compatibly (unless we come up with some way of doing this). In later pull requests with the rust-lang/rfcs#3729 implementation, breakage like this could only occur for nightly users relying on the `extern_types` feature. Upstreaming this to avoid needing to keep carrying this patch locally, and I think it'll necessarily need to change eventually.	2025-02-24 08:08:55 +00:00
bors	e0be1a0262	Auto merge of #137271 - nikic:gep-nuw-2, r=scottmcm Emit getelementptr inbounds nuw for pointer::add() Lower pointer::add (via intrinsic::offset with unsigned offset) to getelementptr inbounds nuw on LLVM versions that support it. This lets LLVM make use of the pre-condition that the offset addition does not wrap in an unsigned sense. Together with inbounds, this also implies that the offset is non-negative. Fixes https://github.com/rust-lang/rust/issues/137217.	2025-02-24 03:06:16 +00:00
Trevor Gross	a2bb4d748d	Rollup merge of #136543 - RalfJung:round-ties-even, r=tgross35 intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic LLVM has three intrinsics here that all do the same thing (when used in the default FP environment). There's no reason Rust needs to copy that historically-grown mess -- let's just have one intrinsic and leave it up to the LLVM backend to decide how to lower that. Suggested by `@hanna-kruppe` in https://github.com/rust-lang/rust/issues/136459; Cc `@tgross35` try-job: test-various	2025-02-23 14:30:25 -05:00
DianQK	da50297a6e	Save pre-link bitcode to `ModuleCodegen`	2025-02-23 21:23:38 +08:00
DianQK	9431427cc3	Add `new_regular` and `new_allocator` to `ModuleCodegen`	2025-02-23 21:23:38 +08:00
DianQK	1a99ca8da9	The embedded bitcode should always be prepared for LTO/ThinLTO	2025-02-23 21:23:36 +08:00
bors	15469f8f8a	Auto merge of #137420 - matthiaskrgr:rollup-rr0q37f, r=matthiaskrgr Rollup of 9 pull requests Successful merges: - #136910 (Implement feature `isolate_most_least_significant_one` for integer types) - #137183 (Prune dead regionck code) - #137333 (Use `edition = "2024"` in the compiler (redux)) - #137356 (Ferris 🦀 Identifier naming conventions) - #137362 (Add build step log for `run-make-support`) - #137377 (Always allow reusing cratenum in CrateLoader::load) - #137388 (Fix(lib/fs/tests): Disable rename POSIX semantics FS tests under Windows 7) - #137410 (Use StableHasher + Hash64 for dep_tracking_hash) - #137413 (jubilee cleared out the review queue) r? `@ghost` `@rustbot` modify labels: rollup	2025-02-22 13:32:44 +00:00
Taiki Endo	a343dcb97f	rustc_target: Add more RISC-V atomic-related features	2025-02-22 16:15:14 +09:00
Manuel Drehwald	e2d250c3f6	update autodiff flags	2025-02-21 21:51:20 -05:00
Manuel Drehwald	f4e2218b13	clean up autodiff code/comments	2025-02-21 21:47:48 -05:00
Michael Goulet	e1819a889a	Fix overcapturing, unsafe extern blocks, and new unsafe ops	2025-02-22 00:01:48 +00:00
Michael Goulet	76d341fa09	Upgrade the compiler to edition 2024	2025-02-22 00:01:48 +00:00
Matthias Krüger	636f4f19d8	Rollup merge of #137313 - oli-obk:push-ywvuqkxuqyom, r=petrochenkov Some codegen_llvm cleanups Using some more safe wrappers and thus being able to remove a large unsafe block. As a next step we should probably look into safe extern fns	2025-02-21 12:45:26 +01:00
Zachary S	7ba3d7b54e	Remove `BackendRepr::Uninhabited`, replaced with an `uninhabited: bool` field in `LayoutData`. Also update comments that referred to BackendRepr::Uninhabited.	2025-02-20 13:27:32 -06:00
Oli Scherer	ce7f58bd91	Merge two operations that were always performed together	2025-02-20 11:24:00 +00:00
Oli Scherer	ea7180813b	Create safe helper for LLVMSetDLLStorageClass	2025-02-20 11:15:00 +00:00
Scott McMurray	6f9cfd694d	Rework `OperandRef::extract_field` to stop calling `to_immediate_scalar` on things which are already immediates That means it stops trying to truncate things that are already `i1`s.	2025-02-19 12:03:40 -08:00
Scott McMurray	642a705f71	PR feedback	2025-02-19 11:36:52 -08:00
Scott McMurray	511bf307f0	Emit `trunc nuw` for unchecked shifts and `to_immediate_scalar` - For shifts this shrinks the IR by no longer needing an `assume` while still providing the UB information - Having this on the `i8`→`i1` truncations will hopefully help with some places that have to load `i8`s or pass those in LLVM structs without range information	2025-02-19 11:36:52 -08:00
Nikita Popov	31cc4c074d	Emit getelementptr inbounds nuw for pointer::add()	2025-02-19 11:32:32 +01:00
Nikita Popov	5e9d8a7d55	Switch to the LLVMBuildGEPWithNoWrapFlags API This API allows us to set the nuw flag as well.	2025-02-19 11:32:32 +01:00
Matthias Krüger	2bd65ebede	Rollup merge of #137210 - workingjubilee:fixup-passmode-import, r=RalfJung compiler: Stop reexporting stuff in cg_llvm::abi The reexports confuse tooling like rustdoc into thinking cg_llvm is the source of key types that originate in rustc_target.	2025-02-19 01:30:12 +01:00
Jubilee Young	2d2de18166	compiler: Stop reexporting stuff in cg_llvm::abi The reexports confuse tooling like rustdoc into thinking cg_llvm is the source of key types that originate in rustc_target.	2025-02-18 00:31:29 -08:00
bors	3b022d8cee	Auto merge of #133852 - x17jiri:cold_path, r=saethlin improve cold_path() #120370 added a new intrinsic `cold_path()` and used it to fix `likely` and `unlikely`. However, in order to limit scope, the information about cold code paths is only used in 2-target switch instructions. This is sufficient for `likely` and `unlikely`, but limits usefulness of `cold_path` for idiomatic rust. For example, code like this: ``` if let Some(x) = y { ... } ``` may generate 3-target switch: ``` switch y.discriminator: 0 => true branch 1 => false branch _ => unreachable ``` and therefore marking a branch as cold will have no effect. This PR improves `cold_path()` to work with arbitrary switch instructions. Note that for 2-target switches, we can use `llvm.expect`, but for multiple targets we need to manually emit branch weights. I checked Clang and it also emits weights in this situation. Clang's weight calculation is more complex than this PR, which I believe is mainly because `switch` in `C/C++` can have multiple cases going to the same target.	2025-02-18 07:49:09 +00:00
Nicholas Nethercote	fd7b4bf4e1	Move methods from `Map` to `TyCtxt`, part 2. Continuing the work started in #136466. Every method gains a `hir_` prefix, though for the ones that already have a `par_` or `try_par_` prefix I added the `hir_` after that.	2025-02-18 10:17:44 +11:00
Jiri Bobek	7bb5f4dd78	improve cold_path()	2025-02-17 06:39:58 +01:00
Matthias Krüger	fab38375bc	Rollup merge of #137095 - saethlin:use-hash64-for-hashes, r=workingjubilee Replace some u64 hashes with Hash64 I introduced the Hash64 and Hash128 types in https://github.com/rust-lang/rust/pull/110083, essentially as a mechanism to prevent hashes from landing in our leb128 encoding paths. If you just have a u64 or u128 field in a struct then derive Encodable/Decodable, that number gets leb128 encoding. So if you need to store a hash or some other value which behaves very close to a hash, don't store it as a u64. This reverts part of https://github.com/rust-lang/rust/pull/117603, which turned an encoded Hash64 into a u64. Based on https://github.com/rust-lang/rust/pull/110083, I don't expect this to be perf-sensitive on its own, though I expect that it may help stabilize some of the small rmeta size fluctuations we currently see in perf reports.	2025-02-17 06:38:14 +01:00
Ben Kimock	4cf21866e8	Move hashes from rustc_data_structure to rustc_hashes so they can be shared with rust-analyzer	2025-02-16 16:18:30 -05:00
Jacob Pratt	d3556c6644	Rollup merge of #136545 - durin42:nvptx64-align, r=nikic nvptx64: update default alignment to match LLVM 21 This changed in llvm/llvm-project@91cb8f5d32. The commit itself is mostly about some intrinsic instructions, but as an aside it also mentions something about addrspace for tensor memory, which I believe is what this string is telling us. `@rustbot` label: +llvm-main	2025-02-16 00:51:24 -05:00
bors	bdc97d1046	Auto merge of #136575 - scottmcm:nsuw-math, r=nikic Set both `nuw` and `nsw` in slice size calculation There's an old note in the code to do this, and now that [LLVM-C has an API for it](`f0b8ff1251/llvm/include/llvm-c/Core.h (L4403-L4408)`), we might as well. And it's been there since what looks like LLVM 17 `de9b6aa341` so doesn't even need to be conditional. (There's other places, like `RawVecInner` or `Layout`, that might want to do things like this too, but I'll leave those for a future PR.)	2025-02-14 14:21:29 +00:00
bjorn3	736ef0a4ce	Don't error when adding a staticlib with bitcode files compiled by newer LLVM	2025-02-14 10:54:21 +00:00
bors	905b1bf1cc	Auto merge of #137010 - workingjubilee:rollup-g00c07v, r=workingjubilee Rollup of 9 pull requests Successful merges: - #135439 (Make `-O` mean `OptLevel::Aggressive`) - #136460 (Simplify `rustc_span` `analyze_source_file`) - #136904 (add `IntoBounds` trait) - #136908 ([AIX] expect `EINVAL` for `pthread_mutex_destroy`) - #136924 (Add profiling of bootstrap commands using Chrome events) - #136951 (Use the right binder for rebinding `PolyTraitRef`) - #136981 (ci: switch loongarch jobs to free runners) - #136992 (Update backtrace) - #136993 ([cg_llvm] Remove dead error message) r? `@ghost` `@rustbot` modify labels: rollup	2025-02-14 06:13:42 +00:00
Jubilee	e8d0d00798	Rollup merge of #136993 - dpaoliello:cleanllvm4, r=workingjubilee [cg_llvm] Remove dead error message Part of #135502 Discovered a dead error message in rustc_codegen_llvm, so removing it. r? `@Zalathar`	2025-02-13 21:37:54 -08:00
Scott McMurray	9ad6839f7a	Set both `nuw` and `nsw` in slice size calculation There's an old note in the code to do this, and now that LLVM-C has an API for it, we might as well.	2025-02-13 21:26:48 -08:00
Jubilee	864eba9fb1	Rollup merge of #136895 - maurer:fix-enum-discr, r=nikic debuginfo: Set bitwidth appropriately in enum variant tags Previously, we unconditionally set the bitwidth to 128-bits, the largest an enum would possibly be. Then, LLVM would cut down the constant by chopping off leading zeroes before emitting the DWARF. LLVM only supported 64-bit enumerators, so this would also have occasionally resulted in truncated data. LLVM added support for 128-bit enumerators in llvm/llvm-project#125578 That patchset trusts the constant to describe how wide the variant tag is, so the high 64-bits of zeros are considered potentially load-bearing. As a result, we went from emitting tags that looked like: DW_AT_discr_value (0xfe) (because `dwarf::BestForm` selected `data1`) to emitting tags that looked like: DW_AT_discr_value (<0x10> fe ff ff ff 00 00 00 00 00 00 00 00 00 00 00 00 ) This makes the `DW_AT_discr_value` encode at the bitwidth of the tag, which: 1. Is probably closer to our intentions in terms of describing the data. 2. Doesn't invoke the 128-bit support which may not be supported by all debuggers / downstream tools. 3. Will result in smaller debug information.	2025-02-13 17:46:08 -08:00
Daniel Paoliello	bfdc96114c	[cg_llvm] Remove dead error message	2025-02-13 15:04:39 -08:00
clubby789	2966256133	Make `-O` mean `-C opt-level=3`	2025-02-13 19:47:55 +00:00
Jacob Pratt	f7d5285062	Rollup merge of #136881 - dpaoliello:cleanllvm3, r=Zalathar cg_llvm: Reduce visibility of all functions in the llvm module Next part of #135502 This reduces the visibility of all functions in the `llvm` module to `pub(crate)` and marks the `enzyme_ffi` modules with `#![expect(dead_code)]` (as previously discussed: <https://github.com/rust-lang/rust/pull/135502#discussion_r1915608085>). r? `@Zalathar`	2025-02-13 03:53:31 -05:00
Jacob Pratt	1f669fdc7d	Rollup merge of #136858 - safinaskar:parallel-cleanup-2025-02-11-07-54, r=SparrowLii Parallel-compiler-related cleanup I carefully split changes into commits. Commit messages are self-explanatory. Squashing is not recommended. cc "Parallel Rustc Front-end" https://github.com/rust-lang/rust/issues/113349 r? SparrowLii `@rustbot` label: +WG-compiler-parallel	2025-02-13 03:53:31 -05:00
Daniel Paoliello	e7cef26a3d	cg_llvm: Reduce visibility of all functions in the llvm module	2025-02-13 12:36:25 +11:00
Zalathar	659e20fa75	Remove `LLVMGetModuleContext` This was unused after the removal of `-Zprofile` in #131829.	2025-02-13 12:36:09 +11:00
Jacob Pratt	33c186baf7	Rollup merge of #136807 - workingjubilee:merge-gpus-to-get-the-arcradeongeforce, r=bjorn3 compiler: internally merge `PtxKernel` into `GpuKernel` r? `@bjorn3` for review	2025-02-12 20:10:00 -05:00
Jacob Pratt	0de2341fef	Rollup merge of #136217 - taiki-e:csky-asm-flags, r=Amanieu Mark condition/carry bit as clobbered in C-SKY inline assembly C-SKY's compare and some arithmetic/logical instructions modify condition/carry bit (C) in PSR, but there is currently no way to mark it as clobbered in `asm!`. This PR marks it as clobbered except when [`options(preserves_flags)`](https://doc.rust-lang.org/reference/inline-assembly.html#r-asm.options.supported-options.preserves_flags) is used. Refs: - Section 1.3 "Programming model" and Section 1.3.5 "Condition/carry bit" in CSKY Architecture user_guide: `9f7121f7d4/CSKY%20Architecture%20user_guide.pdf` > Under user mode, condition/carry bit (C) is located in the lowest bit of PSR, and it can be accessed and changed by common user instructions. It is the only data bit that can be visited under user mode in PSR. > Condition or carry bit represents the result after one operation. Condition/carry bit can be clearly set according to the results of compare instructions or unclearly set as some high-precision arithmetic or logical instructions. In addition, special instructions such as DEC[GT,LT,NE] and XTRB[0-3] will influence the value of condition/carry bit. - Register definition in LLVM: https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/CSKY/CSKYRegisterInfo.td#L88 cc `@Dirreke` ([target maintainer](`aa6f5ab18e/src/doc/rustc/src/platform-support/csky-unknown-linux-gnuabiv2.md (target-maintainers)`)) r? `@Amanieu` `@rustbot` label +O-csky +A-inline-assembly	2025-02-12 20:09:58 -05:00
Jacob Pratt	a53cd3c979	Rollup merge of #135025 - Flakebi:alloca-addrspace, r=nikic Cast allocas to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if an `alloca` is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for `alloca`s is not 0. For example the following code compiles incorrectly when not casting the address space to the default one: ```rust fn f(p: *const i8 /* addrspace(0) */) -> *const i8 /* addrspace(0) */ { let local = 0i8; /* addrspace(5) */ let res = if cond { p } else { &raw const local }; res } ``` results in ```llvm %local = alloca addrspace(5) i8 %res = alloca addrspace(5) ptr if: ; Store 64-bit flat pointer store ptr %p, ptr addrspace(5) %res else: ; Store 32-bit scratch pointer store ptr addrspace(5) %local, ptr addrspace(5) %res ret: ; Load and return 64-bit flat pointer %res.load = load ptr, ptr addrspace(5) %res ret ptr %res.load ``` For amdgpu, `addrspace(0)` are 64-bit pointers, `addrspace(5)` are 32-bit pointers. The above code may store a 32-bit pointer and read it back as a 64-bit pointer, which is obviously wrong and cannot work. Instead, we need to `addrspacecast %local to ptr addrspace(0)`, then we store and load the correct type. Tracking issue: #135024	2025-02-12 20:09:56 -05:00
Matthew Maurer	d82219a4fa	debuginfo: Set bitwidth appropriately in enum variant tags Previously, we unconditionally set the bitwidth to 128-bits, the largest a discriminator would possibly be. Then, LLVM would cut down the constant by chopping off leading zeroes before emitting the DWARF. LLVM only supported 64-bit discriminators, so this would also have occasionally resulted in truncated data (or an assert) if more than 64-bits were used. LLVM added support for 128-bit enumerators in llvm/llvm-project#125578 That patchset also trusts the constant to describe how wide the variant tag is. As a result, we went from emitting tags that looked like: DW_AT_discr_value (0xfe) (`form1`) to emitting tags that looked like: DW_AT_discr_value (<0x10> fe ff ff ff 00 00 00 00 00 00 00 00 00 00 00 00 ) This makes the `DW_AT_discr_value` encode at the bitwidth of the tag, which: 1. Is probably closer to our intentions in terms of describing the data. 2. Doesn't invoke the 128-bit support which may not be supported by all debuggers / downstream tools. 3. Will result in smaller debug information.	2025-02-12 18:01:42 +00:00
Matthias Krüger	9e89feefb9	Rollup merge of #135549 - oli-obk:push-tmxtpnrloyqu, r=compiler-errors Document some safety constraints and use more safe wrappers Lots of unsafe codegen_llvm code has safe wrappers already, so I used some of them and added some where applicable. I stopped here because this diff is large enough and should probably be reviewed independently of other changes.	2025-02-12 06:07:35 +01:00
Oli Scherer	dcf1e4d72b	Document some safety constraints and use more safe wrappers	2025-02-11 09:47:13 +00:00
Oli Scherer	4b83038d63	Add a safe wrapper for `WriteBitcodeToFile`	2025-02-11 09:41:22 +00:00
Oli Scherer	b2cd1b8ead	Remove an unsafe closure invariant by inlining the closure wrapper into the called function	2025-02-11 09:41:22 +00:00
Askar Safin	51f49d8464	compiler/rustc_codegen_llvm/src/lib.rs: remove "unsafe impl Send/Sync"	2025-02-11 09:58:53 +03:00
Jacob Pratt	c49ffaf7eb	Rollup merge of #136813 - mrkajetanp:aarch32-fp16-target-feature, r=davidtwco rustc_target: Add the fp16 target feature for AArch32 As in the commit description. The feature is already available in rustc for AArch64.	2025-02-11 01:02:41 -05:00
Jacob Pratt	6153a8dcea	Rollup merge of #136721 - dpaoliello:cleanllvm2, r=Zalathar cg_llvm: Reduce visibility of some items outside the `llvm` module Next piece of #135502 This reduces the visibility of items (other than those in the `llvm` module) so that dead code analysis will correctly identify unused items.	2025-02-11 01:02:40 -05:00
Flakebi	cde7e805ad	Cast allocas to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if an `alloca` is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for `alloca`s is not 0. For example the following code compiles incorrectly when not casting the address space to the default one: ```rust fn f(p: *const i8 /* addrspace(0) */) -> *const i8 /* addrspace(0) */ { let local = 0i8; /* addrspace(5) */ let res = if cond { p } else { &raw const local }; res } ``` results in ```llvm %local = alloca addrspace(5) i8 %res = alloca addrspace(5) ptr if: ; Store 64-bit flat pointer store ptr %p, ptr addrspace(5) %res else: ; Store 32-bit scratch pointer store ptr addrspace(5) %local, ptr addrspace(5) %res ret: ; Load and return 64-bit flat pointer %res.load = load ptr, ptr addrspace(5) %res ret ptr %res.load ``` For amdgpu, `addrspace(0)` are 64-bit pointers, `addrspace(5)` are 32-bit pointers. The above code may store a 32-bit pointer and read it back as a 64-bit pointer, which is obviously wrong and cannot work. Instead, we need to `addrspacecast %local to ptr addrspace(0)`, then we store and load the correct type.	2025-02-10 21:38:44 +01:00
Daniel Paoliello	5f29273921	rustc_codegen_llvm: Mark items as pub(crate) outside of the llvm module	2025-02-10 10:17:25 -08:00
Matthias Krüger	78f5bddd57	Rollup merge of #136419 - EnzymeAD:autodiff-tests, r=onur-ozkan,jieyouxu adding autodiff tests I'd like to get started with upstreaming some tests, even though I'm still waiting for an answer on how to best integrate the enzyme pass. Can we therefore temporarily support the -Z llvm-plugins here without too much effort? And in that case, how would that work? I saw you can do remapping, e.g. `rust-src-base`, but I don't think that will give me the path to libEnzyme.so. Do you have another suggestion? Other than that this test simply checks that the derivative of `x*x` is `2.0 * x`, which in this case is computed as `%0 = fadd fast double %x.0.val, %x.0.val` (I'll add a few more tests and move it to an autodiff folder if we can use the -Z flag) r? `@jieyouxu` Locally at least `-Zllvm-plugins=${PWD}/build/x86_64-unknown-linux-gnu/enzyme/build/Enzyme/libEnzyme-19.so` seems to work if I copy the command I get from x.py test and run it manually. However, running x.py test itself fails. Tracking: - https://github.com/rust-lang/rust/issues/124509 Zulip discussion: https://rust-lang.zulipchat.com/#narrow/channel/326414-t-infra.2Fbootstrap/topic/Enzyme.20build.20changes	2025-02-10 16:38:23 +01:00
Jubilee	7f8108afc8	Rollup merge of #136053 - Zalathar:defer-counters, r=saethlin coverage: Defer part of counter-creation until codegen Follow-up to #135481 and #135873. One of the pleasant properties of the new counter-assignment algorithm is that we can stop partway through the process, store the intermediate state in MIR, and then resume the rest of the algorithm during codegen. This lets it take into account which parts of the control-flow graph were eliminated by MIR opts, resulting in fewer physical counters and simpler counter expressions. Those improvements end up completely obsoleting much larger chunks of code that were previously responsible for cleaning up the coverage metadata after MIR opts, while also doing a more thorough cleanup job. (That change also unlocks some further simplifications that I've kept out of this PR to limit its scope.)	2025-02-10 00:51:49 -08:00
Jubilee Young	e11e2b4d09	compiler: internally merge `Conv::PtxKernel` into `GpuKernel` It is speculated that these two can be conceptually merged, and it can start by ripping out rustc's notion of the PtxKernel call convention. Leave the ExternAbi for now, but the nvptx target now should see it as just a different way to spell Conv::GpuKernel.	2025-02-09 23:14:55 -08:00
Manuel Drehwald	061abbc369	remove outdated *First autodiff variants for higher-order ad	2025-02-10 01:35:53 -05:00
Manuel Drehwald	1221cff551	move second opt run to lto phase and cleanup code	2025-02-10 01:35:22 -05:00
bors	124cc92199	Auto merge of #136751 - bjorn3:update_rustfmt, r=Mark-Simulacrum Update bootstrap compiler and rustfmt The rustfmt version we previously used formats things differently from what the latest nightly rustfmt does. This causes issues for subtrees that get formatted both in-tree and in their own repo. Updating the rustfmt used in-tree solves those issues. Also bumped the bootstrap compiler as the stage0 update command always updates both at the same time.	2025-02-09 15:44:16 +00:00
bors	a26e97be88	Auto merge of #136754 - Urgau:rollup-qlkhjqr, r=Urgau Rollup of 5 pull requests Successful merges: - #134679 (Windows: remove readonly files) - #136213 (Allow Rust to use a number of libc filesystem calls) - #136530 (Implement `x perf` directly in bootstrap) - #136601 (Detect (non-raw) borrows of null ZST pointers in CheckNull) - #136659 (Pick the max DWARF version when LTO'ing modules with different versions) r? `@ghost` `@rustbot` modify labels: rollup	2025-02-09 12:54:26 +00:00
Jubilee	5e4d6278af	Rollup merge of #136706 - workingjubilee:finish-up-rustc-abi-updates, r=compiler-errors compiler: mostly-finish `rustc_abi` updates This almost-finishes all the updates in the compiler to use `rustc_abi` and removes some of the reexports of `rustc_abi` items in `rustc_target` that were previously available. r? `@compiler-errors`	2025-02-08 20:41:21 -08:00
Urgau	5ec56e5fbb	Rollup merge of #136659 - wesleywiser:dwarf_version_lto_merge_behavior, r=jieyouxu Pick the max DWARF version when LTO'ing modules with different versions Currently, when rustc compiles code with `-Clto` enabled that was built with different choices for `-Zdwarf-version`, a warning will be reported. It's very easy to observe this by compiling most anything (eg, "hello world") and specifying `-Clto -Zdwarf-version=5` since the standard library is distributed with `-Zdwarf-version=4`. This behavior isn't actually useful for a few reasons: - From observation, LLVM chooses to pick the highest DWARF version anyway after issuing the warning. - Clang specifies that in this case, the max version should be picked without a warning and as a general principle, we want to support x-lang LTO with Clang which implies using the same module flag merge behaviors. - Debuggers need to be able to handle a variety of versions within the same debugging session as you can easily have some parts of a binary (or some dynamic libraries within an application) all compiled with different DWARF versions. This commit changes the module flag merge behavior to match Clang and use the highest version of DWARF. It also adds a test to ensure this behavior is respected in the case of two crates being LTO'd together and adds a test to ensure no warning is printed. Fixes #130041 which fails due to these warnings being printed cc #103057	2025-02-09 00:37:28 +01:00
bjorn3	1fcae03369	Rustfmt	2025-02-08 22:12:13 +00:00
Wesley Wiser	bbc40e7822	Pick the max DWARF version when LTO'ing modules with different versions Currently, when rustc compiles code with `-Clto` enabled that was built with different choices for `-Zdwarf-version`, a warning will be reported. It's very easy to observe this by compiling most anything (eg, "hello world") and specifying `-Clto -Zdwarf-version=5` since the standard library is distributed with `-Zdwarf-version=4`. This behavior isn't actually useful for a few reasons: - from observation, LLVM chooses to pick the highest DWARF version anyway after issuing the warning - Clang specifies that in this case, the max version should be picked without a warning and as a general principle, we want to support x-lang LTO with Clang which implies using the same module flag merge behaviors - Debuggers need to be able to handle a variety of versions within the same debugging session as you can easily have some parts of a binary (or some dynamic libraries within an application) all compiled with different DWARF versions This commit changes the module flag merge behavior to match Clang and use the highest version of DWARF. It also adds a test to ensure this behavior is respected in the case of two crates being LTO'd together and updates the test added in the previous commit to ensure no warning is printed.	2025-02-08 16:33:36 +00:00
Manuel Drehwald	21d096184e	fix non-enzyme builds	2025-02-07 22:27:46 -05:00
Matthias Krüger	c9771e9590	Rollup merge of #136691 - bjorn3:linkage_cleanup, r=jieyouxu Remove Linkage::Private and Linkage::Appending Neither of them has any use case. Neither known nor theoretical.	2025-02-08 03:58:48 +01:00
Matthias Krüger	93b194516a	Rollup merge of #136640 - Zalathar:debuginfo-align-bits, r=compiler-errors Debuginfo for function ZSTs should have alignment of 8 bits, not 1 bit In #116096, function ZSTs were made to have debuginfo that gives them an alignment of “1”. But because alignment in LLVM debuginfo is denoted in bits, not bytes, this resulted in an alignment specification of 1 bit instead of 1 byte. I don't know whether this has any practical consequences, but I noticed that a test started failing when I accidentally fixed the mistake while working on #136632, so I extracted the fix (and the test adjustment) to this PR.	2025-02-08 03:58:45 +01:00
Jubilee Young	eddfe8f503	compiler: remove reexports from rustc_target::callconv	2025-02-07 11:25:18 -08:00
Kajetan Puchalski	53f9852224	rustc_target: Add the fp16 target feature for AArch32	2025-02-07 18:08:19 +00:00
bjorn3	f68cd90412	Remove Linkage::Appending It can only be used for certain LLVM internal variables like llvm.global_ctors which users are not allowed to define.	2025-02-07 16:02:19 +00:00
bjorn3	382e4031c2	Remove Linkage::Private This is the same as Linkage::Internal except that it doesn't emit any symbol. Some backends may not support it and it isn't all that useful anyway.	2025-02-07 16:02:19 +00:00
Daniel Paoliello	2a6b27444a	Remove dead code from rustc_codegen_llvm and the LLVM wrapper	2025-02-06 16:53:52 -08:00
Zalathar	4385a9e063	Debuginfo for function ZSTs should have alignment of 8 bits, not 1 bit	2025-02-06 23:01:29 +11:00
bors	2f92f050e8	Auto merge of #136471 - safinaskar:parallel, r=SparrowLii tree-wide: parallel: Fully removed all `Lrc`, replaced with `Arc` This is a continuation of https://github.com/rust-lang/rust/pull/132282 . I'm pretty sure I did everything right. In particular, I searched all occurrences of `Lrc` in submodules and made sure that they don't need replacement. There are other possibilities, though. We can define `enum Lrc<T> { Rc(Rc<T>), Arc(Arc<T>) }`. Or we can make `Lrc` a union and on every clone we can read from a special thread-local variable. Or we can add a generic parameter to `Lrc` and, yes, this parameter will be everywhere across the whole codebase. So, if you think we should take some alternative approach, then don't merge this PR. But if it is decided to stick with `Arc`, then, please, merge. cc "Parallel Rustc Front-end" ( https://github.com/rust-lang/rust/issues/113349 ) r? SparrowLii `@rustbot` label WG-compiler-parallel	2025-02-06 10:50:05 +00:00
Zalathar	bd855b6c9e	coverage: Remove the old code for simplifying counters after MIR opts	2025-02-06 21:44:31 +11:00
Zalathar	20d051ec87	coverage: Defer part of counter-creation until codegen	2025-02-06 21:44:31 +11:00
Zalathar	ee7dc06cf1	coverage: Store BCB node IDs in mappings, and resolve them in codegen Even though the coverage graph itself is no longer available during codegen, its nodes can still be used as opaque IDs.	2025-02-06 21:44:29 +11:00
Zalathar	042fd8c24a	Remove some unused glob re-exports These were detected by temporarily making `mod llvm` non-public.	2025-02-06 12:10:45 +11:00
Zalathar	65d7e6937b	Remove the `mod llvm_` hack, which should no longer be necessary	2025-02-06 12:10:42 +11:00
Manuel Drehwald	70b9ba3d6e	fix fwd-mode autodiff case	2025-02-05 18:47:23 -05:00
León Orell Valerian Liehr	75989e98d8	Rollup merge of #136375 - Zalathar:llvm-di-builder, r=workingjubilee cg_llvm: Replace some DIBuilder wrappers with LLVM-C API bindings (part 1) Part of #134001, follow-up to #136326, extracted from #134009. This PR performs an arbitrary subset of the LLVM-C binding migrations from #134009, which should make it less tedious to review. The remaining migrations can occur in one or more subsequent PRs.	2025-02-05 05:03:03 +01:00
bors	3f33b30e19	Auto merge of #135760 - scottmcm:disjoint-bitor, r=WaffleLapkin Add `unchecked_disjoint_bitor` per ACP373 Following the names from libs-api in https://github.com/rust-lang/libs-team/issues/373#issuecomment-2085686057 Includes a fallback implementation so this doesn't have to update cg_clif or cg_gcc, and overrides it in cg_llvm to use `or disjoint`, which [is available in LLVM 18](https://releases.llvm.org/18.1.0/docs/LangRef.html#or-instruction) so hopefully we don't need any version checks.	2025-02-04 17:46:06 +00:00
Augie Fackler	e9cb36bd0f	nvptx64: update default alignment to match LLVM 21 This changed in llvm/llvm-project@91cb8f5d32. The commit itself is mostly about some intrinsic instructions, but as an aside it also mentions something about addrspace for tensor memory, which I believe is what this string is telling us. @rustbot label: +llvm-main	2025-02-04 10:37:07 -05:00
Ralf Jung	04e7a10af6	intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic	2025-02-04 16:27:29 +01:00
Askar Safin	0a21f1d0a2	tree-wide: parallel: Fully removed all `Lrc`, replaced with `Arc`	2025-02-03 13:25:57 +03:00
Scott McMurray	f46e6be190	Handle the case where the `or disjoint` folds immediately to a constant	2025-02-02 21:04:10 -08:00
Matthias Krüger	f5ae630f10	Rollup merge of #136426 - oli-obk:push-nkpuulwurykn, r=compiler-errors Explain why we retroactively change a static initializer to have a different type I keep getting confused about it and in turn confused `@GuillaumeGomez` while trying to explain it badly	2025-02-02 23:06:57 +01:00
Oli Scherer	b89263605a	Explain why we retroactively change a static initializer to have a different type	2025-02-01 22:39:38 +00:00
Scott McMurray	4ee1602eab	Override `disjoint_or` in the LLVM backend	2025-01-31 22:29:08 -08:00
Zalathar	c3f2930edc	Explain why (some) pointer/length strings are `*const c_uchar`	2025-02-01 14:14:40 +11:00
Zalathar	5413d2bd6f	Add FIXME for auditing optional parameters passed to DIBuilder	2025-02-01 14:14:40 +11:00
Zalathar	8ddd9c38f6	Use `LLVMDIBuilderCreateDebugLocation` The LLVM-C binding takes an explicit context, whereas our binding obtained the context from the scope argument.	2025-02-01 14:14:40 +11:00
Zalathar	949b4673ce	Use `LLVMDIBuilderCreateLexicalBlockFile`	2025-02-01 14:14:40 +11:00
Zalathar	70d41bc711	Use `LLVMDIBuilderCreateLexicalBlock`	2025-02-01 14:14:40 +11:00
Zalathar	878ab125a1	Use `LLVMDIBuilderCreateNameSpace`	2025-02-01 14:14:39 +11:00
Zalathar	cd2af2dd9a	Use `LLVMDIBuilderFinalize`	2025-02-01 13:38:12 +11:00
Zalathar	832fcfb64f	Introduce `DIBuilderBox`, an owning pointer to `DIBuilder`	2025-02-01 13:34:14 +11:00
Ben Kimock	ce7cb312fa	Add link attribute for Enzyme's FFI	2025-01-31 21:11:23 -05:00
bors	7f36543a48	Auto merge of #136332 - jhpratt:rollup-aa69d0e, r=jhpratt Rollup of 9 pull requests Successful merges: - #132156 (When encountering unexpected closure return type, point at return type/expression) - #133429 (Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle) - #136281 (`rustc_hir_analysis` cleanups) - #136297 (Fix a typo in profile-guided-optimization.md) - #136300 (atomic: extend compare_and_swap migration docs) - #136310 (normalize `*.long-type.txt` paths for compare-mode tests) - #136312 (Disable `overflow_delimited_expr` in edition 2024) - #136313 (Filter out RPITITs when suggesting unconstrained assoc type on too many generics) - #136323 (Fix a typo in conventions.md) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-31 09:42:28 +00:00
Jacob Pratt	c19c4b91f5	Rollup merge of #133429 - EnzymeAD:autodiff-middle, r=oli-obk Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle This PR should not be merged until the rustc_codegen_llvm part is merged. I will also alter it a little based on what gets shaved off from the cg_llvm PR, and address some of the feedback I received in the other PR (including cleanups). I am putting it already up to 1) Discuss with `@jieyouxu` if there is more work needed to add tests to this and 2) Pray that there is someone reviewing who can tell me why some of my autodiff invocations get lost. Re 1: My tests require fat-lto. I also modify the compilation pipeline. So if there are any other llvm-ir tests in the same compilation unit then I will likely break them. Luckily there are two groups who currently have the same fat-lto requirement for their GPU code which I have for my autodiff code and both groups have some plans to enable support for thin-lto. Once either effort pans out, I'll copy it over for this feature. I will also work on not changing the optimization pipeline for functions not differentiated, but that will require some thoughts and engineering, so I think it would be good to be able to run the autodiff tests isolated from the rest for now. Can you guide me here please? For context, here are some of my tests in the samples folder: https://github.com/EnzymeAD/rustbook Re 2: This is a pretty serious issue, since it effectively prevents publishing libraries making use of autodiff: https://github.com/EnzymeAD/rust/issues/173. For some reason my dummy code persists till the end, so the code which calls autodiff, deletes the dummy, and inserts the code to compute the derivative never gets executed. To me it looks like the rustc_autodiff attribute just gets dropped, but I don't know WHY? Any help would be super appreciated, as rustc queries look a bit voodoo to me. Tracking: - https://github.com/rust-lang/rust/issues/124509 r? `@jieyouxu`	2025-01-31 00:26:30 -05:00
bors	c37fbd873a	Auto merge of #135318 - compiler-errors:vtable-fixes, r=lcnr Fix deduplication mismatches in vtables leading to upcasting unsoundness We currently have two cases where subtleties in supertraits can trigger disagreements in the vtable layout, e.g. leading to a different vtable layout being accessed at a callsite compared to what was prepared during unsizing. Namely: ### #135315 In this example, we were not normalizing supertraits when preparing vtables. In the example, ``` trait Supertrait<T> { fn _print_numbers(&self, mem: &[usize; 100]) { println!("{mem:?}"); } } impl<T> Supertrait<T> for () {} trait Identity { type Selff; } impl<Selff> Identity for Selff { type Selff = Selff; } trait Middle<T>: Supertrait<()> + Supertrait<T> { fn say_hello(&self, _: &usize) { println!("Hello!"); } } impl<T> Middle<T> for () {} trait Trait: Middle<<() as Identity>::Selff> {} impl Trait for () {} fn main() { (&() as &dyn Trait as &dyn Middle<()>).say_hello(&0); } ``` When we prepare `dyn Trait`, we see a supertrait of `Middle<<() as Identity>::Selff>`, which itself has two supertraits `Supertrait<()>` and `Supertrait<<() as Identity>::Selff>`. These two supertraits are identical, but they are not deduplicated because we were using structural equality and not considering normalization. This leads to a vtable layout with two trait pointers. When we upcast to `dyn Middle<()>`, those two supertraits are now the same, leading to a vtable layout with only one trait pointer. This leads to an offset error, and we call the wrong method. ### #135316 This one is a bit more interesting, and is the bulk of the changes in this PR. It's a bit similar, except it uses binder equality instead of normalization to make the compiler get confused about two vtable layouts.
In the example, ``` trait Supertrait<T> { fn _print_numbers(&self, mem: &[usize; 100]) { println!("{mem:?}"); } } impl<T> Supertrait<T> for () {} trait Trait<T, U>: Supertrait<T> + Supertrait<U> { fn say_hello(&self, _: &usize) { println!("Hello!"); } } impl<T, U> Trait<T, U> for () {} fn main() { (&() as &'static dyn for<'a> Trait<&'static (), &'a ()> as &'static dyn Trait<&'static (), &'static ()>) .say_hello(&0); } ``` When we prepare the vtable for `dyn for<'a> Trait<&'static (), &'a ()>`, we currently consider the PolyTraitRef of the vtable as the key for a supertrait. This leads to two supertraits -- `Supertrait<&'static ()>` and `for<'a> Supertrait<&'a ()>`. However, we can upcast[^up] without offsetting the vtable from `dyn for<'a> Trait<&'static (), &'a ()>` to `dyn Trait<&'static (), &'static ()>`. This is just instantiating the principal trait ref for a specific `'a = 'static`. However, when considering those supertraits, we now have only one distinct supertrait -- `Supertrait<&'static ()>` (which is deduplicated since there are two supertraits with the same substitutions). This leads to similar offsetting issues, leading to the wrong method being called. [^up]: I say upcast but this is a cast that is allowed on stable, since it's not changing the vtable at all, just instantiating the binder of the principal trait ref for some lifetime. The solution here is to recognize that a vtable isn't really meaningfully higher ranked, and to just treat a vtable as corresponding to a `TraitRef` so we can do this deduplication more faithfully. That is to say, the vtable for `dyn for<'a> Tr<'a>` and `dyn Tr<'x>` are always identical, since they both would correspond to a set of free regions on an impl... Do note that `Tr<for<'a> fn(&'a ())>` and `Tr<fn(&'static ())>` are still distinct. ---- There's a bit more that can be cleaned up. In codegen, we can stop using `PolyExistentialTraitRef` basically everywhere.
We can also fix SMIR to stop storing `PolyExistentialTraitRef` in its vtable allocations. As for testing, it's difficult to actually turn this into something that can be tested with `rustc_dump_vtable`, since having multiple supertraits that are identical is a recipe for ambiguity errors. Maybe someone else is more creative with getting that attr to work, since the tests I added being run-pass tests is a bit unsatisfying. Miri also doesn't help here, since it doesn't really generate vtables that are offset by an index in the same way as codegen. r? `@lcnr` for the vibe check? Or reassign, idk. Maybe let's talk about whether this makes sense. <sup>(I guess an alternative would also be to not do any deduplication of vtable supertraits (or only a really conservative subset) rather than trying to normalize and deduplicate more faithfully here. Not sure if that works and is sufficient tho.)</sup> cc `@steffahn` -- ty for the minimizations cc `@WaffleLapkin` -- since you're overseeing the feature stabilization :3 Fixes #135315 Fixes #135316	2025-01-31 04:09:11 +00:00
bors	6c1d960d88	Auto merge of #136318 - matthiaskrgr:rollup-a159mzo, r=matthiaskrgr Rollup of 9 pull requests Successful merges: - #135026 (Cast global variables to default address space) - #135475 (uefi: Implement path) - #135852 (Add `AsyncFn*` to `core` prelude) - #136004 (tests: Skip const OOM tests on aarch64-unknown-linux-gnu) - #136157 (override build profile for bootstrap tests) - #136180 (Introduce a wrapper for "typed valtrees" and properly check the type before extracting the value) - #136256 (Add release notes for 1.84.1) - #136271 (Remove minor future footgun in `impl Debug for MaybeUninit`) - #136288 (Improve documentation for file locking) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-30 23:11:38 +00:00
Matthias Krüger	6a66a270b0	Rollup merge of #136180 - lukas-code:typed-valtree, r=oli-obk Introduce a wrapper for "typed valtrees" and properly check the type before extracting the value This PR adds a new wrapper type `ty::Value` to replace the tuple `(Ty, ty::ValTree)` and become the new canonical representation of type-level constant values. The value extraction methods `try_to_bits`/`try_to_bool`/`try_to_target_usize` are moved to this new type. For `try_to_bits` in particular, this avoids some redundant matches on `ty::ConstKind::Value`. Furthermore, these methods will now properly check the type before extracting the value, which fixes some ICEs. The name `ty::Value` was chosen to be consistent with `ty::Expr`. Commit 1 should be non-functional and commit 2 adds the type check. --- fixes https://github.com/rust-lang/rust/issues/131102 supersedes https://github.com/rust-lang/rust/pull/136130 r? `@oli-obk` cc `@FedericoBruzzone` `@BoxyUwU`	2025-01-30 20:47:07 +01:00
Matthias Krüger	89f8abe8b4	Rollup merge of #135026 - Flakebi:global-addrspace, r=saethlin Cast global variables to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if a global variable is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for global variables is not 0. For example `core` does not compile in debug mode when not casting the address space to the default one because it tries to emit the following (simplified) LLVM IR, containing a type mismatch: ```llvm @alloc_0 = addrspace(1) constant <{ [6 x i8] }> <{ [6 x i8] c"bit.rs" }>, align 1 @alloc_1 = addrspace(1) constant <{ ptr }> <{ ptr addrspace(1) @alloc_0 }>, align 8 ; ^ here a struct containing a `ptr` is needed, but it is created using a `ptr addrspace(1)` ``` For this to compile, we need to insert a constant `addrspacecast` before we use a global variable: ```llvm @alloc_0 = addrspace(1) constant <{ [6 x i8] }> <{ [6 x i8] c"bit.rs" }>, align 1 @alloc_1 = addrspace(1) constant <{ ptr }> <{ ptr addrspacecast (ptr addrspace(1) @alloc_0 to ptr) }>, align 8 ``` As vtables are global variables as well, they are also created with an `addrspacecast`. In the SSA backend, after a vtable global is created, metadata is added to it. To add metadata, we need the non-casted global variable. Therefore we strip away an addrspacecast if there is one, to get the underlying global. Tracking issue: #135024	2025-01-30 20:47:02 +01:00
Lukas Markeffsky	10fc0b159e	introduce `ty::Value` Co-authored-by: FedericoBruzzone <federico.bruzzone.i@gmail.com>	2025-01-30 17:47:44 +01:00
Michael Goulet	9dc41a048d	Use ExistentialTraitRef throughout codegen	2025-01-30 15:34:00 +00:00
Michael Goulet	fdc4bd22b7	Do not treat vtable supertraits as distinct when bound with different bound vars	2025-01-30 15:33:58 +00:00
Wesley Wiser	51eaa0d56a	Clean up uses of the unstable `dwarf_version` option - Consolidate calculation of the effective value. - Check the target `DebuginfoKind` instead of using `is_like_msvc`.	2025-01-29 21:44:21 -06:00
Manuel Drehwald	1f30517d40	upstream rustc_codegen_ssa/rustc_middle changes for enzyme/autodiff	2025-01-29 21:31:13 -05:00
León Orell Valerian Liehr	7e123e4940	Rollup merge of #136147 - RalfJung:required-target-features-check-not-add, r=workingjubilee ABI-required target features: warn when they are missing in base CPU Part of https://github.com/rust-lang/rust/pull/135408: instead of adding ABI-required features to the target we build for LLVM, check that they are already there. Crucially we check this after applying `-Ctarget-cpu` and `-Ctarget-feature`, by reading `sess.unstable_target_features`. This means we can tweak the ABI target feature check without changing the behavior for any existing user; they will get warnings but the target features behave as before. The test changes here show that we are un-doing the "add all required target features" part. Without the full #135408, there is no way to take away an ABI-required target feature with `-Ctarget-cpu`, so we cannot yet test that part. Cc `@workingjubilee`	2025-01-29 03:12:21 +01:00
Taiki Endo	93465e6c31	Mark condition/carry bit as clobbered in C-SKY inline assembly	2025-01-29 06:46:05 +09:00
Ralf Jung	93ee180cfa	ABI-required target features: warn when they are missing in base CPU (rather than silently enabling them)	2025-01-28 04:40:42 +01:00
Oli Scherer	b24f674520	Change `collect_and_partition_mono_items` tuple return type to a struct	2025-01-27 09:38:12 +00:00
Jörn Horstmann	3779b8e32e	Consistently use the most significant bit of vector masks This improves the codegen for vector `select`, `gather`, `scatter` and boolean reduction intrinsics and fixes rust-lang/portable-simd#316. The current behavior of most mask operations during llvm codegen is to truncate the mask vector to <N x i1>, telling llvm to use the least significant bit. The exception is the `simd_bitmask` intrinsics, which already used the most significant bit. Since sse/avx instructions are defined to use the most significant bit, truncating means that llvm has to insert a left shift to move the bit into the most significant position, before the mask can actually be used. Similarly on aarch64, mask operations like blend work bit by bit, repeating the least significant bit across the whole lane involves shifting it into the sign position and then comparing against zero. By shifting before truncating to <N x i1>, we tell llvm that we only consider the most significant bit, removing the need for additional shift instructions in the assembly.	2025-01-26 16:44:23 +01:00
bors	6365178a6b	Auto merge of #128657 - clubby789:optimize-none, r=fee1-dead,WaffleLapkin Add `#[optimize(none)]` cc #54882 This extends the `optimize` attribute to add `none`, which corresponds to the LLVM `OptimizeNone` attribute. Not sure if an MCP is required for this, happy to file one if so.	2025-01-25 05:50:36 +00:00
Matthias Krüger	1e454fe725	Rollup merge of #135581 - EnzymeAD:refactor-codgencx, r=oli-obk Separate Builder methods from tcx As part of the autodiff upstreaming we noticed that it would be nice to have various builder methods available without the TypeContext, which prevents the normal CodegenCx from being passed around between threads. We introduce a SimpleCx which just owns the llvm module and llvm context, to encapsulate them. The previous CodegenCx now implements deref and forwards access to the llvm module or context to its SimpleCx sub-struct. This gives us a bit more flexibility, because now we can pass (or construct) the SimpleCx in locations where we don't have enough information to construct a CodegenCx, or are not able to pass it around due to the tcx lifetimes (and it not implementing send/sync). This also introduces an SBuilder, similar to the SimpleCx. The SBuilder uses a SimpleCx, whereas the existing Builder uses the larger CodegenCx. I will push updates to make implementations generic (where possible) to be implemented once and work for either of the two. I'll also clean up the leftover code. `call` is a bit tricky, because it requires a tcx, I probably need to duplicate it after all. Tracking: - https://github.com/rust-lang/rust/issues/124509	2025-01-24 23:25:42 +01:00
Manuel Drehwald	386c233858	Make CodegenCx and Builder generic Co-authored-by: Oli Scherer <github35764891676564198441@oli-obk.de>	2025-01-24 16:05:26 -05:00
clubby789	5ac95a5c47	Rename `OptimizeAttr::None` to `Default`	2025-01-24 19:34:01 +00:00
Zalathar	ff48331588	coverage: Make query `coverage_ids_info` return an Option This reflects the fact that we can't compute meaningful info for a function that wasn't instrumented and therefore doesn't have `function_coverage_info`.	2025-01-24 16:13:11 +11:00
Flakebi	b06e840d9e	Add comments about address spaces	2025-01-24 00:37:05 +01:00
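
Commit 3f33b30e19 in the log above adds `unchecked_disjoint_bitor` and lowers it to LLVM's `or disjoint` in cg_llvm. As a rough sketch only, the property such an operation relies on is that when two operands share no set bits, bitwise OR and addition give the same result. The helper name `checked_disjoint_or` below is hypothetical and is not the standard-library API; it checks the precondition at run time instead of assuming it.

```rust
/// Hypothetical helper for illustration; not the unstable std API.
/// Returns `Some(a | b)` only when `a` and `b` have no set bits in common.
fn checked_disjoint_or(a: u32, b: u32) -> Option<u32> {
    if a & b == 0 {
        // With no overlapping bits there are no carries, so `a | b == a + b`.
        Some(a | b)
    } else {
        // Overlapping bits: the "disjoint" precondition does not hold.
        None
    }
}
```

For instance, `checked_disjoint_or(0b1100, 0b0011)` yields `Some(0b1111)`, while `checked_disjoint_or(0b1100, 0b0100)` yields `None` because bit 2 is set in both operands.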


2757 Commits