nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-04-30 12:07:40 +00:00

Author	SHA1	Message	Date
Camille GILLOT	ac0683b783	Use correct offset when codegening mir::Const::Indirect.	2023-09-23 14:07:10 +00:00
Camille GILLOT	6992405674	Tolerate non-ptr indirect scalars in codegen.	2023-09-23 14:07:10 +00:00
Camille GILLOT	8ec5639bc2	Reuse calculate_debuginfo_offset for fragments.	2023-09-23 13:52:35 +00:00
Camille GILLOT	44ac8dcc71	Remove GeneratorWitness and rename GeneratorWitnessMIR.	2023-09-23 13:47:30 +00:00
Oli Scherer	4ed4913e67	Merge `ExternProviders` into the general `Providers` struct	2023-09-22 20:15:34 +00:00
Oli Scherer	2ba911c832	Have a single struct for queries and hook	2023-09-22 16:26:20 +00:00
Ayush Singh	c7e5f3ca08	Rebase to master - Update Example - Add thread_parking to sys::uefi - Fix unsafe in unsafe errors - Improve docs - Improve os/exit - Some asserts - Switch back to atomics Signed-off-by: Ayush Singh <ayushdevel1325@gmail.com>	2023-09-22 17:23:33 +05:30
Ayush Singh	48c6ae0611	Add Minimal Std implementation for UEFI Implemented modules: 1. alloc 2. os_str 3. env 4. math Tracking Issue: https://github.com/rust-lang/rust/issues/100499 API Change Proposal: https://github.com/rust-lang/libs-team/issues/87 This was originally part of https://github.com/rust-lang/rust/pull/100316. Since that PR was becoming too unwieldy and cluttered, and with suggestion from @dvdhrm, I have extracted a minimal std implementation to this PR. Signed-off-by: Ayush Singh <ayushsingh1325@gmail.com>	2023-09-22 17:23:30 +05:30
Oli Scherer	2157f31731	Add a way to decouple the implementation and the declaration of a TyCtxt method.	2023-09-22 09:23:15 +00:00
Guillaume Gomez	208f6ed95c	Rollup merge of #115972 - RalfJung:const-consistency, r=oli-obk rename mir::Constant -> mir::ConstOperand, mir::ConstKind -> mir::Const Also, be more consistent with the `to/eval_bits` methods... we had some that take a type and some that take a size, and then sometimes the one that takes a type is called `bits_for_ty`. Turns out that `ty::Const`/`mir::ConstKind` carry their type with them, so we don't need to even pass the type to those `eval_bits` functions at all. However this is not properly consistent yet: in `ty` we have most of the methods on `ty::Const`, but in `mir` we have them on `mir::ConstKind`. And indeed those two types are the ones that correspond to each other. So `mir::ConstantKind` should actually be renamed to `mir::Const`. But what to do with `mir::Constant`? It carries around a span, that's really more like a constant operand that appears as a MIR operand... it's more suited for `syntax.rs` than `consts.rs`, but the bigger question is, which name should it get if we want to align the `mir` and `ty` types? `ConstOperand`? `ConstOp`? `Literal`? It's not a literal but it has a field called `literal` so it would at least be consistently wrong-ish... ``@oli-obk`` any ideas?	2023-09-21 13:25:39 +02:00
Ralf Jung	c94410c145	rename mir::Constant -> mir::ConstOperand, mir::ConstKind -> mir::Const	2023-09-21 08:12:30 +02:00
Ralf Jung	a2374e65aa	the Const::eval_bits methods don't need to be given the Ty	2023-09-20 07:27:21 +02:00
Ralf Jung	ea22adbabd	adjust constValue::Slice to work for arbitrary slice types	2023-09-19 20:17:43 +02:00
Ralf Jung	5a0a1ff0cd	move ConstValue into mir this way we have mir::ConstValue and ty::ValTree as reasonably parallel	2023-09-19 11:11:02 +02:00
bors	cebb9cfd4f	Auto merge of #115748 - RalfJung:post-mono, r=oli-obk move required_consts check to general post-mono-check function This factors some code that is common between the interpreter and the codegen backends into shared helper functions. Also as a side-effect the interpreter now uses the same `eval` functions as everyone else to get the evaluated MIR constants. Also this is in preparation for another post-mono check that will be needed for (the current hackfix for) https://github.com/rust-lang/rust/issues/115709: ensuring that all locals are dynamically sized. I didn't expect this to change diagnostics, but it's just cycle errors that change. r? `@oli-obk`	2023-09-18 19:41:21 +00:00
Ralf Jung	89139d4c46	clarify PassMode::Indirect as well	2023-09-15 10:43:44 +02:00
Ralf Jung	7740476a43	explain PassMode::Cast	2023-09-15 10:43:44 +02:00
Ralf Jung	9ac8b363e3	don't point at const usage site for resolution-time errors also share the code that emits the actual error	2023-09-14 22:34:05 +02:00
Ralf Jung	89ac57db4d	move required_consts check to general post-mono-check function	2023-09-14 22:30:42 +02:00
bors	5e71913156	Auto merge of #115817 - fee1-dead-contrib:fix-codegen, r=oli-obk treat host effect params as erased in codegen This fixes the changes brought to codegen tests when effect params are added to libcore, by not attempting to monomorphize functions that get the host param by being `const fn`. r? `@oli-obk`	2023-09-14 13:42:30 +00:00
Deadbeef	a0a801cd38	treat host effect params as erased generics in codegen This fixes the changes brought to codegen tests when effect params are added to libcore, by not attempting to monomorphize functions that get the host param by being `const fn`.	2023-09-14 07:34:35 +00:00
Ralf Jung	430c386821	make it more clear which functions create fresh AllocId	2023-09-14 07:27:31 +02:00
Ralf Jung	0f8908da27	cleanup op_to_const a bit; rename ConstValue::ByRef → Indirect	2023-09-14 07:27:30 +02:00
Ralf Jung	551f481ffb	use AllocId instead of Allocation in ConstValue::ByRef	2023-09-14 07:26:24 +02:00
bors	c728bf3963	Auto merge of #114656 - bossmc:rework-no-coverage-attr, r=oli-obk Rework `no_coverage` to `coverage(off)` As discussed at the tail of https://github.com/rust-lang/rust/issues/84605 this replaces the `no_coverage` attribute with a `coverage` attribute that takes sub-parameters (currently `off` and `on`) to control the coverage instrumentation. Allows future-proofing for things like `coverage(off, reason="Tested live", issue="#12345")` or similar.	2023-09-14 01:05:18 +00:00
Matthias Krüger	565b9c2264	Rollup merge of #115798 - RalfJung:non_1zst_field, r=wesleywiser add helper method for finding the one non-1-ZST field	2023-09-13 18:37:42 +02:00
Ralf Jung	6e4779ab17	make the eval() functions on our const types return the resulting value	2023-09-13 07:29:34 +02:00
Ralf Jung	60091fe924	add helper method for finding the one non-1-ZST field	2023-09-12 20:52:05 +02:00
ouz-a	3ec0165f5f	Remove assert that checks type equality	2023-09-11 23:08:40 +03:00
bors	62ebe3a2b1	Auto merge of #115417 - dpaoliello:fixdi, r=wesleywiser Use the same DISubprogram for each instance of the same inlined function within a caller # Issue Details: The call to `panic` within a function like `Option::unwrap` is translated to LLVM as a `tail call` (as it will never return), when multiple calls to the same function like this are inlined LLVM will notice the common `tail call` block (i.e., loading the same panic string + location info and then calling `panic`) and merge them together. When merging these instructions together, LLVM will also attempt to merge the debug locations as well, but this fails (i.e., debug info is dropped) as Rust emits a new `DISubprogram` at each inline site thus LLVM doesn't recognize that these are actually the same function and so thinks that there isn't a common debug location. As an example of this, consider the following program: ```rust #[no_mangle] fn add_numbers(x: &Option<i32>, y: &Option<i32>) -> i32 { let x1 = x.unwrap(); let y1 = y.unwrap(); x1 + y1 } ``` When building for x86_64 Windows using 1.72 it generates (note the lack of `.cv_loc` before the call to `panic`, thus it will be attributed to the same line at the `addq` instruction): ```llvm .cv_loc 0 1 3 0 # src\lib.rs:3:0 addq $40, %rsp retq leaq .Lalloc_f570dea0a53168780ce9a91e67646421(%rip), %rcx leaq .Lalloc_629ace53b7e5b76aaa810d549cc84ea3(%rip), %r8 movl $43, %edx callq _ZN4core9panicking5panic17h12e60b9063f6dee8E int3 ``` # Fix Details: Cache the `DISubprogram` emitted for each inlined function instance within a caller so that this can be reused if that instance is encountered again. Ideally, we would also deduplicate child scopes and variables, however my attempt to do that with #114643 resulted in asserts when building for Linux (#115156) which would require some deep changes to Rust to fix (#115455). Instead, when using an inlined function as a debug scope, we will also create a new child scope such that subsequent child scopes and variables do not collide (from LLVM's perspective). After this change the above assembly now (with <https://reviews.llvm.org/D159226> as well) shows the `panic!` was inlined from `unwrap` in `option.rs` at line 935 into the current function in `lib.rs` at line 0 (line 0 is emitted since it is ambiguous which line to use as there were two inline sites that lead to this same code): ```llvm .cv_loc 0 1 3 0 # src\lib.rs:3:0 addq $40, %rsp retq .cv_inline_site_id 6 within 0 inlined_at 1 0 0 .cv_loc 6 2 935 0 # library\core\src\option.rs:935:0 leaq .Lalloc_5f55955de67e57c79064b537689facea(%rip), %rcx leaq .Lalloc_e741d4de8cb5801e1fd7a6c6795c1559(%rip), %r8 movl $43, %edx callq _ZN4core9panicking5panic17hde1558f32d5b1c04E int3 ```	2023-09-08 20:56:01 +00:00
Andy Caldwell	726a7b9439	Correct typo	2023-09-08 12:46:10 +01:00
Andy Caldwell	8e03371fc3	Rework no_coverage to coverage(off)	2023-09-08 12:46:06 +01:00
bors	9be4eac264	Auto merge of #113492 - nebulark:pr_96475, r=petrochenkov Add CL and CMD into to pdb debug info Partial fix for https://github.com/rust-lang/rust/issues/96475 The Arg0 and CommandLineArgs of the MCTargetOptions cpp class are not set within `bb548f9645/compiler/rustc_llvm/llvm-wrapper/PassWrapper.cpp (L378)` This causes LLVM to not neither output any compiler path (cl) nor the arguments that were used when invoking it (cmd) in the PDB file. This fix adds the missing information to the target machine so LLVM can use it.	2023-09-08 10:06:40 +00:00
Florian Schmiderer	4cdc633301	Add missing Debuginfo to PDB debug file on windows. Set Arg0 and CommandLineArgs in MCTargetoptions so LLVM outputs correct CL and CMD in LF_DEBUGINFO instead of empty/invalid values.	2023-09-08 00:28:40 +02:00
Camille GILLOT	26c48e6f95	Refactor how MIR represents composite debuginfo.	2023-09-05 17:20:07 +00:00
Daniel Paoliello	06890774ab	Deduplicate inlined function debug info, but create a new lexical scope to child subsequent scopes and variables from colliding	2023-09-01 14:27:21 -07:00
bors	84a9f4c6e6	Auto merge of #114114 - keith:ks/always-add-lc_build_version-for-metadata-object-files, r=wesleywiser Always add LC_BUILD_VERSION for metadata object files As of Xcode 15 Apple's linker has become a bit more strict about the warnings it produces. One of those new warnings requires all valid Mach-O object files in an archive to have a LC_BUILD_VERSION load command: ``` ld: warning: no platform load command found in 'ARCHIVE[arm64][2106](lib.rmeta)', assuming: iOS-simulator ``` This was already being done for Mac Catalyst so this change expands this logic to include it for all Apple platforms. I filed this behavior change as FB12546320 and was told it was the new intentional behavior.	2023-08-29 21:17:13 +00:00
Matthias Krüger	56d7d93a4b	Rollup merge of #111580 - atsuzaki:layout-ice, r=oli-obk Don't ICE on layout computation failure Fixes #111176 regression. r? `@oli-obk`	2023-08-29 20:49:02 +02:00
Ralf Jung	b2ebf1c23f	const_eval and codegen: audit uses of is_zst	2023-08-29 09:03:46 +02:00
Katherine Philip	56b767322b	Don't ICE on layout computation failure	2023-08-28 12:40:39 -07:00
bors	f0727758d1	Auto merge of #115139 - cjgillot:llvm-fragment, r=nikic Do not forget to pass DWARF fragment information to LLVM. Fixes https://github.com/rust-lang/rust/issues/115113 for the rustc part	2023-08-27 14:06:57 +00:00
Camille GILLOT	930b2e72ee	Do not produce fragment for ZST.	2023-08-26 16:54:28 +00:00
Camille GILLOT	b3bbc22cb7	Do not forget to pass DWARF fragment information to LLVM.	2023-08-26 14:21:10 +00:00
Wesley Wiser	d0b2c4f727	Revert "Use the same DISubprogram for each instance of the same inlined function within the caller" This reverts commit `687bffa493`. Reverting to resolve ICEs reported on nightly.	2023-08-25 19:49:10 -04:00
Ralf Jung	114fde6ac7	cache the terminate block with the last reason that we saw	2023-08-24 13:28:26 +02:00
Ralf Jung	4c53783f3c	when terminating during unwinding, show the reason why	2023-08-24 13:28:26 +02:00
bors	154ae32a55	Auto merge of #114643 - dpaoliello:inlinedebuginfo, r=wesleywiser Use the same DISubprogram for each instance of the same inlined function within a caller # Issue Details: The call to `panic` within a function like `Option::unwrap` is translated to LLVM as a `tail call` (as it will never return), when multiple calls to the same function like this is inlined LLVM will notice the common `tail call` block (i.e., loading the same panic string + location info and then calling `panic`) and merge them together. When merging these instructions together, LLVM will also attempt to merge the debug locations as well, but this fails (i.e., debug info is dropped) as Rust emits a new `DISubprogram` at each inline site thus LLVM doesn't recognize that these are actually the same function and so thinks that there isn't a common debug location. As an example of this when building for x86_64 Windows (note the lack of `.cv_loc` before the call to `panic`, thus it will be attributed to the same line at the `addq` instruction): ``` .cv_loc 0 1 23 0 # src\lib.rs:23:0 addq $40, %rsp retq leaq .Lalloc_f570dea0a53168780ce9a91e67646421(%rip), %rcx leaq .Lalloc_629ace53b7e5b76aaa810d549cc84ea3(%rip), %r8 movl $43, %edx callq _ZN4core9panicking5panic17h12e60b9063f6dee8E int3 ``` # Fix Details: Cache the `DISubprogram` emitted for each inlined function instance within a caller so that this can be reused if that instance is encountered again, this also requires caching the `DILexicalBlock` and `DIVariable` objects to avoid creating duplicates. After this change the above assembly now looks like: ``` .cv_loc 0 1 23 0 # src\lib.rs:23:0 addq $40, %rsp retq .cv_inline_site_id 5 within 0 inlined_at 1 0 0 .cv_inline_site_id 6 within 5 inlined_at 1 12 0 .cv_loc 6 2 935 0 # library\core\src\option.rs:935:0 leaq .Lalloc_5f55955de67e57c79064b537689facea(%rip), %rcx leaq .Lalloc_e741d4de8cb5801e1fd7a6c6795c1559(%rip), %r8 movl $43, %edx callq _ZN4core9panicking5panic17hde1558f32d5b1c04E int3 ```	2023-08-22 20:15:29 +00:00
Keith Smiley	d37fdc95d4	Always add LC_BUILD_VERSION for metadata object files As of Xcode 15 Apple's linker has become a bit more strict about the warnings it produces. One of those new warnings requires all valid Mach-O object files in an archive to have a LC_BUILD_VERSION load command: ``` ld: warning: no platform load command found in 'ARCHIVE[arm64][2106](lib.rmeta)', assuming: iOS-simulator ``` This was already being done for Mac Catalyst so this change expands this logic to include it for all Apple platforms. I filed this behavior change as FB12546320 and was told it was the new intentional behavior.	2023-08-21 13:31:57 -07:00
Ralf Jung	818ec8e23a	give some unwind-related terminators a more clear name	2023-08-20 15:52:38 +02:00
Gary Guo	2f68d97b64	Fix ELF flag for RISC-V targets without explicit ABI	2023-08-18 15:08:09 +01:00
Gary Guo	5ed556e84a	Add comment explanining unstable_target_features	2023-08-18 15:08:09 +01:00
Gary Guo	a8a92f5131	Fix ABI flags in RISC-V/LoongArch ELF file generated by rustc	2023-08-18 14:37:39 +01:00
Camille GILLOT	933b618360	Revert "Implement references VarDebugInfo." This reverts commit `2ec0071913`.	2023-08-17 17:02:04 +00:00
Be Wilson	72e29da3ec	stabilize combining +bundle and +whole-archive link modifiers Currently, combining +bundle and +whole-archive works only with #![feature(packed_bundled_libs)] This crate feature is independent of the -Zpacked-bundled-libs command line option. This commit stabilizes the #![feature(packed_bundled_libs)] crate feature and implicitly enables it only when the +bundle and +whole-archive link modifiers are combined. This allows rlib crates to use the +whole-archive link modifier with native libraries and have all symbols included in the linked library to be included in downstream staticlib crates that use the rlib as a dependency. Other cases requiring the packed_bundled_libs behavior still require the -Zpacked-bundled-libs command line option, which can be stabilized independently in the future. Per discussion on https://github.com/rust-lang/rust/issues/108081 there is no risk of regression stabilizing the crate feature in this way because the combination of +bundle,+whole-archive link modifiers was previously not allowed.	2023-08-15 15:51:18 -05:00
Guillaume Gomez	f92974189b	Rollup merge of #114711 - lqd:linker-inference, r=petrochenkov Infer `Lld::No` linker hint when the linker stem is a generic compiler driver This PR basically reverts the temporary solution in https://github.com/rust-lang/rust/pull/113631 to a more long-term solution. r? ``@petrochenkov`` In [this comment](https://github.com/rust-lang/rust/pull/113631#issuecomment-1634598238), you had ideas about a long-term solution: > I wonder what a good non-temporary solution for the inference would look like. > > * If the default is `(Cc::No, Lld::Yes)` (e.g. `rust-lld`) > > * and we switch to some specific platform compiler (e.g. `-C linker=arm-none-eabi-gcc`), should we change to `Lld::No`? Maybe yes? > * and we switch to some non-default but generic compiler `-C linker=clang`? Then maybe not? > > * If the default is `(Cc::Yes, Lld::Yes)` (e.g. future x86_64 linux with default LLD) > > * and we switch to some specific platform compiler (e.g. `-C linker=arm-none-eabi-gcc`), should we change to `Lld::No`? Maybe yes? > * and we switch to some non-default but generic compiler `-C linker=clang`? Then maybe not? > I believe that we should infer the `Lld::No` linker hint for any `-Clinker` override, and all the cases above: - the linker drivers have their own defaults, so in my mind `-Clinker` is a signal to use its default linker / flavor, rather than ours or the target's. In the case of generic compilers, it's more likely than not going to be `Lld::No`. I would expect this to be the case in general, even when including platform-specific compilers. - the guess will be wrong if the linker driver uses lld by default (and we also don't want to search for `-fuse-ld` link args), but will work in the more common cases. And the minority of other cases can fix the wrong guess by opting into the precise linker flavor. - this also ensures backwards-compatibility: today, even on targets with an lld default and overriding the linker, rustc will not use lld. That includes `thumbv6m-none-eabi` where issue #113597 happened. It looks like the simplest option, and the one with least churn: we maintain the current behavior in ambiguous cases. I've tested that this works on #113597, as expected from the failure. (I also have a no-std `run-make` test using a custom target json spec: basically simulating a future `x86_64-unknown-linux-gnu` using an lld flavor by default, to check that e.g. `-Clinker=clang` doesn't use lld. I could add that test to this PR, but IIUC such a custom target requires `cargo -Z build-std` and we have no tests depending on this cargo feature yet. Let me know if you want to add this test of the linker inference for such targets.) What do you think ?	2023-08-15 14:29:45 +02:00
dirreke	74817b7053	Upgrade Object and related deps	2023-08-14 23:05:45 +08:00
Dirreke	184a9afffb	add details for csky-unknown-linux-gnuabiv2 and add docs	2023-08-14 23:02:37 +08:00
Dirreke	8c51e28bd5	add `rustc_codegen_ssa` support for csky and correct some code	2023-08-14 23:02:36 +08:00
Jacob Pratt	62ca5aa8e4	Remove unnecessary feature gates	2023-08-12 00:21:04 -04:00
Daniel Paoliello	687bffa493	Use the same DISubprogram for each instance of the same inlined function within the caller	2023-08-11 10:21:52 -07:00
Rémy Rakic	99c1bcfac5	Revert "make MCP510 behavior explicitly opt-in" This reverts commit `2b61a5e17a`.	2023-08-10 20:35:48 +00:00
Vadim Petrochenkov	907aa440cf	rustc: Move `stable_crate_id` from `Session` to `GlobalCtxt` Removes a piece of mutable state. Follow up to #114578.	2023-08-09 14:35:23 +08:00
Vadim Petrochenkov	0b89aac08d	rustc: Move `crate_types` from `Session` to `GlobalCtxt` Removes a piece of mutable state. Follow up to #114578.	2023-08-09 14:17:54 +08:00
bors	a946c1e017	Auto merge of #114470 - pnkfelix:dont-export-no-mangle-from-proc-macros-issue-99978, r=bjorn3 Restrict linker version script of proc-macro crates to just its two symbols Restrict linker version script of proc-macro crates to just the two symbols of each proc-macro crate. The main known effect of doing this is to stop including `#[no_mangle]` symbols in the linker version script. Background: The combination of a proc-macro crate with an import of another crate that itself exports a no_mangle function was broken for a period of time, because: * In PR #99944 we stopped exporting no_mangle symbols from proc-macro crates; proc-macro crates have a very limited interface and are meant to be treated as a blackbox to everything except rustc itself. However: he constructed linker version script still referred to them, but resolving that discrepancy was left as a FIXME in the code, tagged with issue #99978. * In PR #108017 we started telling the linker to check (via the`--no-undefined-version` linker invocation flag) that every symbol referenced in the "linker version script" is provided as linker input. So the unresolved discrepancy from #99978 started surfacing as a compile-time error (e.g. #111888). Fix #111888 Fix #99978.	2023-08-09 00:38:00 +00:00
Matthias Krüger	28ee1a919e	Rollup merge of #114500 - taiki-e:arm-crypto, r=Amanieu Remove arm crypto target feature Follow-up to https://github.com/rust-lang/stdarch/pull/1407. LLVM has moved away from a combined `crypto` feature on both aarch64 and arm, and we did the same on aarch64, but were deferred from doing the same on arm due to compatibility with older LLVM. As the minimum LLVM version has increased, we can now remove this (unstable) target feature on arm. r? `@Amanieu`	2023-08-08 03:30:55 +02:00
Matthias Krüger	328e9785fb	Rollup merge of #114376 - inferiorhumanorgans:rustc-codegen-ssa-duplicate-export, r=wesleywiser Avoid exporting __rust_alloc_error_handler_should_panic more than once. Exporting `__rust_alloc_error_handler_should_panic` multiple times causes `ld.gold` to balk with: `error: version script assignment of to symbol __rust_alloc_error_handler_should_panic failed: symbol not defined` Specifically this breaks builds of 1.70.0 and newer on DragonFly and YoctoProject with `ld.gold`. Builds with `ld.bfd` and `lld` should be unaffected. http://errors.yoctoproject.org/Errors/Details/708194/	2023-08-08 03:30:54 +02:00
Felix S. Klock II	a2058ddbed	Review feedback: return empty iff !should_codegen, and use simpler unconditional logic otherwise.	2023-08-07 13:31:14 -04:00
Matthias Krüger	13de583583	Rollup merge of #114505 - ouz-a:cleanup_mir, r=RalfJung Add documentation to has_deref Documentation of `has_deref` needed some polish to be more clear about where it should be used and what's it's purpose. cc https://github.com/rust-lang/rust/issues/114401 r? `@RalfJung`	2023-08-06 17:26:29 +02:00
ouz-a	6df546281b	cleanup misinformation regarding has_deref	2023-08-06 17:29:09 +03:00
Taiki Endo	dddd2190b9	Remove arm crypto target feature	2023-08-05 15:23:31 +09:00
bors	e173a8e663	Auto merge of #112117 - bryangarza:track-caller-feature-gate, r=compiler-errors Add separate feature gate for async fn track caller This patch adds a feature gate `async_fn_track_caller` that is separate from `closure_track_caller`. This is to allow enabling `async_fn_track_caller` separately. Fixes #110009	2023-08-04 22:17:59 +00:00
Felix S. Klock II	3000c24afc	special-case proc-macro crates in rustc_codegen_ssa:🔙:linker::exported_symbols to only export the two symbols that proc-macros need.	2023-08-04 11:29:01 -04:00
Matthias Krüger	4fb44e59c4	Rollup merge of #114427 - Enselic:rustc_codegen_ssa-fixme, r=Nilstrieb Handle non-utf8 rpaths (fix FIXME) Removes a FIXME for #9639 which is closed since long ago. Part of #44366 which is E-help-wanted. (Also see https://github.com/rust-lang/rust/pull/114377)	2023-08-04 09:18:59 +02:00
Matthias Krüger	353e26869f	Rollup merge of #114431 - ehuss:ssa-test, r=est31 Enable tests on rustc_codegen_ssa This enables unittests in rustc_codegen_ssa. There are some tests, primarily in [`back/rpath/tests.rs`](https://github.com/rust-lang/rust/blob/HEAD/compiler/rustc_codegen_ssa/src/back/rpath/tests.rs) that haven't ever been running since the unittests are disabled. From what I can tell, this was just a consequence of how things evolved. When testing was initially added in https://github.com/rust-lang/rust/pull/33282, `librustc_trans` had test=false because it didn't have any tests. `rustc_codegen_ssa` eventually split off from that (https://github.com/rust-lang/rust/pull/55627), and the rpath module eventually got merged in too (from `librustc_back` where it used to live). That migration didn't enable the tests. This also includes some fluent diagnostic tests, though I'm not sure what exactly they are testing.	2023-08-04 07:25:48 +02:00
Eric Huss	40729bcb69	Enable tests on rustc_codegen_ssa	2023-08-03 12:48:55 -07:00
Martin Nordholts	ea3b49f2f1	Handle non-utf8 rpaths (fix FIXME)	2023-08-03 20:04:18 +02:00
Oli Scherer	4457ef2c6d	Forbid old-style `simd_shuffleN` intrinsics	2023-08-03 09:29:00 +00:00
Bryan Garza	673ab17c7f	Add separate feature gate for async fn track caller This patch adds a feature gate `async_fn_track_caller` that is separate from `closure_track_caller`. This is to allow enabling `async_fn_track_caller` separately. Fixes #110009	2023-08-02 14:18:21 -07:00
Alex Zepeda	2e29d85f7e	Avoid exporting symbols more than once. Exporting `__rust_alloc_error_handler_should_panic` multiple times causes ld.gold to balk with: `error: version script assignment of to symbol __rust_alloc_error_handler_should_panic failed: symbol not defined` Specifically this breaks builds on DragonFly and YoctoProject with ld.gold. Builds with ld.bfd should be unaffected.	2023-08-02 11:02:23 -07:00
bors	abd3637e42	Auto merge of #105545 - erikdesjardins:ptrclean, r=bjorn3 cleanup: remove pointee types This can't be merged until the oldest LLVM version we support uses opaque pointers, which will be the case after #114148. (Also note `-Cllvm-args="-opaque-pointers=0"` can technically be used in LLVM 15, though I don't think we should support that configuration.) I initially hoped this would provide some minor perf win, but in https://github.com/rust-lang/rust/pull/105412#issuecomment-1341224450 it had very little impact, so this is only valuable as a cleanup. As a followup, this will enable #96242 to be resolved. r? `@ghost` `@rustbot` label S-blocked	2023-08-01 19:44:17 +00:00
Matthias Krüger	58f963fb65	Rollup merge of #113717 - cuishuang:master, r=Nilstrieb remove repetitive words	2023-07-31 22:49:47 +02:00
cui fliter	88c7b16e03	remove repetitive words Signed-off-by: cui fliter <imcusg@gmail.com>	2023-07-31 16:13:02 +08:00
Nicholas Nethercote	c17c8dc78e	Remove unnecessary semicolon.	2023-07-31 16:34:13 +10:00
Nicholas Nethercote	5673f47042	Clean up `generate_lto_work`. This function has some shared code for the thin LTO and fat LTO cases, but those cases have so little in common that it's actually clearer to treat them fully separately.	2023-07-31 16:21:03 +10:00
Nicholas Nethercote	d404699fb1	Fix LLVM thread names on Windows. PR #112946 tweaked the naming of LLVM threads, but messed things up slightly, resulting in threads on Windows having names like `optimize module {} regex.f10ba03eb5ec7975-cgu.0`. This commit removes the extraneous `{} `.	2023-07-31 16:21:03 +10:00
Nicholas Nethercote	90ce358afa	Introduce `running_with_any_token` closure. It makes things a little clearer.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	3b44f5b0eb	Use standard Rust capitalization rules for names containing "LTO".	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	a08220bcab	Tweak structure of the message loop. The main loop has a very complex condition, which includes two mentions of `codegen_state`. The body of the loop then immediately switches on the `codegen_state`. I find it easier to understand if it's a `loop` and we check for exit conditions after switching on `codegen_state`. We end up with a tiny bit of code duplication, but it's clear that (a) we never exit in the `Ongoing` case, (b) we exit in the `Completed` state only if several things are true (and there's interaction with LTO there), and (c) we exit in the `Aborted` state if a couple of things are true. Also, the exit conditions are all simple conjunctions.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	179bf19813	Tweak a loop condition. This loop condition involves `codegen_state`, `work_items`, and `running_with_own_token`. But the body of the loop cannot modify `codegen_state`, so repeatedly checking it is unnecessary.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	d21d31cce7	Move `maybe_start_llvm_timer`'s body into `spawn_work`. The two functions are alway called together. This commit factors out the repeated code.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	3517fe899e	Remove `CodegenContext::worker`. `CodegenContext` is immutable except for the `worker` field - we clone `CodegenContext` in multiple places, changing the `worker` field each time. It's simpler to move the `worker` field out of `CodegenContext`.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	4a120f33f7	Remove `ExtraBackendMethods::spawn_thread`. It's no longer used, and `spawn_named_thread` is preferable, because naming threads is helpful when profiling.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	e78fb95dfa	Give the coordinator thread a name. This is useful when profiling with a profiler like Samply.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	176610c2cd	Remove some unused values in `codegen_crate`.	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	8b9e3f0dd6	Remove an unnecessary `pub`.	2023-07-31 16:21:00 +10:00
Nicholas Nethercote	f81fe9d702	Rename `MainThreadWorkerState`. The `Worker` is unnecessary, and just makes it longer than necessary.	2023-07-31 16:20:18 +10:00
Nicholas Nethercote	5bef04ed38	Rename things related to the main thread's operations. It took me some time to understand how the main thread can lend a jobserver token to an LLVM thread. This commit renames a couple of things to make it clearer. - Rename the `LLVMing` variant as `Lending`, because that is a clearer description of what is happening. - Rename `running` as `running_with_own_token`, which makes it clearer that there might be one additional LLVM thread running (with a loaned token). Also add a comment to its definition.	2023-07-31 16:20:18 +10:00
Nicholas Nethercote	fd017d3c17	Add some assertions. - Thin and fat LTO can't happen together. - `NeedsLink` and (non-allocator) `Compiled` work item results can't happen together.	2023-07-31 16:20:18 +10:00
Nicholas Nethercote	4f598b852c	Add comments to `WorkItemResult`. And rename the `Compiled` variant as `Finished`, because that name makes it clearer there is nothing left to do, contrasting nicely with the `Needs*` variants.	2023-07-31 16:20:18 +10:00
Nicholas Nethercote	a8c71f0a15	Inline and remove `submit_pre_codegened_module_to_llvm`. It has a single callsite, and provides little value.	2023-07-31 16:20:18 +10:00
Matthias Krüger	3ce90b1649	inline format!() args up to and including rustc_codegen_llvm	2023-07-30 14:22:50 +02:00
Erik Desjardins	04303cfb3a	cg_ssa: remove pointee types and pointercast/bitcast-of-ptr	2023-07-29 13:18:20 -04:00
Matthias Krüger	c3cd05198a	Rollup merge of #113872 - nnethercote:tweak-cgu-sorting, r=pnkfelix Tweak CGU sorting in a couple of places. In `base.rs`, tweak how the CGU size interleaving works. Since #113777, it's much more common to have multiple CGUs with identical sizes. With the existing code these same-sized items ended up in the opposite-to-desired order due to the stable sorting. The code now starts with a reverse sort (like is done in `partitioning.rs`) which gives the behaviour we want. This doesn't matter much for perf, but makes profiles in `samply` look more like what we expect. In `partitioning.rs`, we can use `sort_by_key` instead of `sort_by_cached_key` because `CGU::size_estimate()` is cheap. (There is an identical CGU sort earlier in that function that already uses `sort_by_key`.) r? `@pnkfelix`	2023-07-27 06:04:12 +02:00
Oli Scherer	2b444672e1	Use a builder instead of boolean/option arguments	2023-07-25 13:51:15 +00:00
Matthias Krüger	abde841f0a	remove redundant clones	2023-07-23 09:48:07 +02:00
bors	1c44af9b79	Auto merge of #111836 - calebzulawski:target-feature-closure, r=workingjubilee Fix #[inline(always)] on closures with target feature 1.1 Fixes #108655. I think this is the most obvious solution that isn't overly complicated. The comment includes more justification, but I think this is likely better than demoting the `#[inline(always)]` to `#[inline]`, since existing code is unaffected.	2023-07-23 00:16:03 +00:00
bors	d908a5b08e	Auto merge of #113892 - RalfJung:uninit-undef-poison, r=wesleywiser clarify MIR uninit vs LLVM undef/poison In [this LLVM discussion](https://discourse.llvm.org/t/rfc-load-instruction-uninitialized-memory-semantics/67481) I learned that mapping our uninitialized memory in MIR to poison in LLVM would be quite problematic due to the lack of a byte type. I am not sure where to write down this insight but this seems like a reasonable start.	2023-07-21 19:32:17 +00:00
Matthias Krüger	b1d1e99c22	Rollup merge of #113780 - dtolnay:printkindpath, r=b-naber Support `--print KIND=PATH` command line syntax As is already done for `--emit KIND=PATH` and `-L KIND=PATH`. In the discussion of #110785, it was pointed out that `--print KIND=PATH` is nicer than trying to apply the single global `-o` path to `--print`'s output, because in general there can be multiple print requests within a single rustc invocation, and anyway `-o` would already be used for a different meaning in the case of `link-args` and `native-static-libs`. I am interested in using `--print cfg=PATH` in Buck2. Currently Buck2 works around the lack of support for `--print KIND=PATH` by [indirecting through a Python wrapper script](`d43cf3a51a/prelude/rust/tools/get_rustc_cfg.py`) to redirect rustc's stdout into the location dictated by the build system. From skimming Cargo's usages of `--print`, it definitely seems like it would benefit from `--print KIND=PATH` too. Currently it is working around the lack of this by inserting `--crate-name=___ --print=crate-name` so that it can look for a line containing `___` as a delimiter between the 2 other `--print` informations it actually cares about. This is commented as a "HACK" and "abuse". `31eda6f7c3/src/cargo/core/compiler/build_context/target_info.rs (L242)` (FYI `@weihanglo` as you dealt with this recently in https://github.com/rust-lang/cargo/pull/11633.) Mentioning reviewers active in #110785: `@fee1-dead` `@jyn514` `@bjorn3`	2023-07-21 06:52:28 +02:00
Matthias Krüger	2734b5ada9	Rollup merge of #113723 - khei4:khei4/llvm-stats, r=oli-obk,nikic Resurrect: rustc_llvm: Add a -Z `print-codegen-stats` option to expose LLVM statistics. This resurrects PR https://github.com/rust-lang/rust/pull/104000, which has sat idle for a while. And I want to see the effect of stack-move optimizations on LLVM (like https://reviews.llvm.org/D153453) :). I have applied the changes requested by `@oli-obk` and `@nagisa` https://github.com/rust-lang/rust/pull/104000#discussion_r1014625377 and https://github.com/rust-lang/rust/pull/104000#discussion_r1014642482 in the latest commits. r? `@oli-obk` ----- LLVM has a neat [statistics](https://llvm.org/docs/ProgrammersManual.html#the-statistic-class-stats-option) feature that tracks how often optimizations kick in. It's very handy for optimization work. Since we expose the LLVM pass timings, I thought it made sense to expose the LLVM statistics too. ----- (Edit: fix broken link (Edit2: fix segmentation fault and use malloc If `rustc` is built with ```toml [llvm] assertions = true ``` Then you can see like ``` rustc +stage1 -Z print-codegen-stats -C opt-level=3 tmp.rs ===-------------------------------------------------------------------------=== ... Statistics Collected ... ===-------------------------------------------------------------------------=== 3 aa - Number of MayAlias results 193 aa - Number of MustAlias results 531 aa - Number of NoAlias results ... ``` And the current default build emits only ``` $ rustc +stage1 -Z print-codegen-stats -C opt-level=3 tmp.rs ===-------------------------------------------------------------------------=== ... Statistics Collected ... ===-------------------------------------------------------------------------=== $ ``` This might be better to emit the message to tell assertion flag necessity, but now I can't find how to do that...	2023-07-21 06:52:27 +02:00
David Tolnay	26fd6b15b0	Add note about writing native-static-libs to file	2023-07-20 11:04:32 -07:00
David Tolnay	dcfe94a009	Implement printing to file for link-args and native-static-libs	2023-07-20 11:04:31 -07:00
David Tolnay	6e734fce63	Implement printing to file in llvm_util	2023-07-20 11:04:31 -07:00
David Tolnay	c80cbe4bae	Implement printing to file in codegen_backend.print	2023-07-20 11:04:31 -07:00
David Tolnay	c0dc0c6875	Store individual output file name with every PrintRequest	2023-07-20 11:04:30 -07:00
Ralf Jung	41a73d8251	clarify MIR uninit vs LLVM undef/poison	2023-07-20 18:43:54 +02:00
Matthias Krüger	8c17e0701e	Rollup merge of #113529 - oli-obk:simd_shuffle_evaluated, r=wesleywiser Permit pre-evaluated constants in simd_shuffle fixes https://github.com/rust-lang/rust/issues/113500	2023-07-20 17:19:32 +02:00
bors	b14fd2359f	Auto merge of #113695 - bjorn3:fix_rlib_cdylib_metadata_handling, r=pnkfelix,petrochenkov Verify that all crate sources are in sync This ensures that rustc will not attempt to link against a cdylib as if it is a rust dylib when an rlib for the same crate is available. Previously rustc didn't actually check if any further formats of a crate which has been loaded are of the same version and if they are actually valid. This caused a cdylib to be interpreted as rust dylib as soon as the corresponding rlib was loaded. As cdylibs don't export any rust symbols, linking would fail if rustc decides to link against the cdylib rather than the rlib. Two crates depended on the previous behavior by separately compiling a test crate as both rlib and dylib. These have been changed to capture their original spirit to the best of my ability while still working when rustc verifies that all crates are in sync. It is unlikely that build systems depend on the current behavior and in any case we are taking a lot of measures to ensure that any change to either the source or the compilation options (including crate type) results in rustc rejecting it as incompatible. We merely didn't do this check here for now obsolete perf reasons. Fixes https://github.com/rust-lang/rust/issues/10786 Fixes https://github.com/rust-lang/rust/issues/82151 Fixes https://github.com/rust-lang/rust/issues/82972 Closes https://github.com/bevy-cheatbook/bevy-cheatbook/issues/114	2023-07-20 09:00:10 +00:00
Oli Scherer	c7428d5052	Monomorphize constants before inspecting them	2023-07-20 08:53:09 +00:00
bors	a6cdd81eff	Auto merge of #108714 - estebank:ice_dump, r=oli-obk On nightly, dump ICE backtraces to disk Implement rust-lang/compiler-team#578. When an ICE is encountered on nightly releases, the new rustc panic handler will also write the contents of the backtrace to disk. If any `delay_span_bug`s are encountered, their backtrace is also added to the file. The platform and rustc version will also be collected. <img width="1032" alt="Screenshot 2023-03-03 at 2 13 25 PM" src="https://user-images.githubusercontent.com/1606434/222842420-8e039740-4042-4563-b31d-599677171acf.png"> The current behavior will always write to disk on nightly builds, regardless of whether the backtrace is printed to the terminal, unless the environment variable `RUSTC_ICE_DISK_DUMP` is set to `0`. This is a compromise and can be changed.	2023-07-20 01:29:17 +00:00
Nicholas Nethercote	8c31219d5c	Tweak CGU sorting in a couple of places. In `base.rs`, tweak how the CGU size interleaving works. Since #113777, it's much more common to have multiple CGUs with identical sizes. With the existing code these same-sized items ended up in the opposite-to-desired order due to the stable sorting. The code now starts with a reverse sort (like is done in `partitioning.rs`) which gives the behaviour we want. This doesn't matter much for perf, but makes profiles in `samply` look more like what we expect. In `partitioning.rs`, we can use `sort_by_key` instead of `sort_by_cached_key` because `CGU::size_estimate()` is cheap. (There is an identical CGU sort earlier in that function that already uses `sort_by_key`.)	2023-07-20 09:58:13 +10:00
Dylan DPC	c1d6d322f4	Rollup merge of #113716 - DianQK:add-no_builtins-to-function, r=pnkfelix Add the `no-builtins` attribute to functions when `no_builtins` is applied at the crate level. When `no_builtins` is applied at the crate level, we should add the `no-builtins` attribute to each function to ensure it takes effect in LTO. This is also the reason why no_builtins does not take effect in LTO as mentioned in #35540. Now, `#![no_builtins]` should be similar to `-fno-builtin` in clang/gcc, see https://clang.godbolt.org/z/z4j6Wsod5. Next, we should make `#![no_builtins]` participate in LTO again. That makes sense, as LTO also takes into consideration function-level instruction optimizations, such as the MachineOutliner. More importantly, when a user writes a large `#![no_builtins]` crate, they would like this crate to participate in LTO as well. We should also add a function-level no_builtins attribute to allow users to have more control over it. This is similar to Clang's `__attribute__((no_builtin))` feature, see https://clang.godbolt.org/z/Wod6KK6eq. Before implementing this feature, maybe we should discuss whether to support more fine-grained control, such as `__attribute__((no_builtin("memcpy")))`. Related discussions: - #109821 - #35540 Next (a separate pull request?): - [ ] Revert #35637 - [ ] Add a function-level `no_builtin` attribute?	2023-07-19 22:37:06 +05:30
bjorn3	8c9a8b63c9	Fix review comments	2023-07-19 14:53:26 +00:00
bjorn3	52853c2694	Don't compress dylib metadata	2023-07-19 14:47:06 +00:00
Esteban Küber	8eb5843a59	On nightly, dump ICE backtraces to disk Implement rust-lang/compiler-team#578. When an ICE is encountered on nightly releases, the new rustc panic handler will also write the contents of the backtrace to disk. If any `delay_span_bug`s are encountered, their backtrace is also added to the file. The platform and rustc version will also be collected.	2023-07-19 14:10:07 +00:00
DianQK	cc08749df2	Add the `no-builtins` attribute to functions when `no_builtins` is applied at the crate level. When `no_builtins` is applied at the crate level, we should add the `no-builtins` attribute to each function to ensure it takes effect in LTO.	2023-07-18 22:15:47 +08:00
chenx97	d3727148a0	support for mips32r6 as a target_arch value	2023-07-18 18:58:18 +08:00
chenx97	a132b3ec03	merge patterns	2023-07-18 18:58:18 +08:00
chenx97	c6e03cd951	support for mips64r6 as a target_arch value	2023-07-18 18:58:18 +08:00
Oli Scherer	9e5a67e57f	Permit pre-evaluated constants in simd_shuffle	2023-07-18 08:13:55 +00:00
Nicholas Nethercote	b52f9eb6ca	Introduce `MonoItemData`. It replaces `(Linkage, Visibility)`, making the code nicer. Plus the next commit will add another field.	2023-07-17 08:44:48 +10:00
Patrick Walton	2d47816cba	rustc_llvm: Add a `-Z print-llvm-stats` option to expose LLVM statistics. LLVM has a neat [statistics] feature that tracks how often optimizations kick in. It's very handy for optimization work. Since we expose the LLVM pass timings, I thought it made sense to expose the LLVM statistics too. [statistics]: https://llvm.org/docs/ProgrammersManual.html#the-statistic-class-stats-option	2023-07-16 22:56:04 +09:00
bors	55be59d2ce	Auto merge of #113626 - Urgau:dedup-native-static-libs, r=petrochenkov De-duplicate consecutive libs when printing native-static-libs This PR adds a de-duplicate step just before printing the `native-static-libs`. This step de-duplicates all the consecutive libs based only on the relevant comparison elements (this exclude spans, ast elements, ...). Fixes https://github.com/rust-lang/rust/issues/113209	2023-07-16 10:59:45 +00:00
bors	7a17f577b3	Auto merge of #112157 - erikdesjardins:align, r=nikic Resurrect: rustc_target: Add alignment to indirectly-passed by-value types, correcting the alignment of byval on x86 in the process. Same as #111551, which I [accidentally closed](https://github.com/rust-lang/rust/pull/111551#issuecomment-1571222612) :/ --- This resurrects PR #103830, which has sat idle for a while. Beyond #103830, this also: - fixes byval alignment for types containing vectors on Darwin (see `tests/codegen/align-byval-vector.rs`) - fixes byval alignment for overaligned types on x86 Windows (see `tests/codegen/align-byval.rs`) - fixes ABI for types with 128bit requested alignment on ARM64 Linux (see `tests/codegen/aarch64-struct-align-128.rs`) r? `@nikic` --- `@pcwalton's` original PR description is reproduced below: Commit `88e4d2c` from five years ago removed support for alignment on indirectly-passed arguments because of problems with the `i686-pc-windows-msvc` target. Unfortunately, the `memcpy` optimizations I recently added to LLVM 16 depend on this to forward `memcpy`s. This commit attempts to fix the problems with `byval` parameters on that target and now correctly adds the `align` attribute. The problem is summarized in [this comment] by `@eddyb.` Briefly, 32-bit x86 has special alignment rules for `byval` parameters: for the most part, their alignment is forced to 4. This is not well-documented anywhere but in the Clang source. I looked at the logic in Clang `TargetInfo.cpp` and tried to replicate it here. The relevant methods in that file are `X86_32ABIInfo::getIndirectResult()` and `X86_32ABIInfo::getTypeStackAlignInBytes()`. The `align` parameter attribute for `byval` parameters in LLVM must match the platform ABI, or miscompilations will occur. Note that this doesn't use the approach suggested by eddyb, because I felt it was overkill to store the alignment in `on_stack` when special handling is really only needed for 32-bit x86. As a side effect, this should fix #80127, because it will make the `align` parameter attribute for `byval` parameters match the platform ABI on LLVM x86-64. [this comment]: #80822 (comment)	2023-07-15 15:39:53 +00:00
Mahdi Dibaiee	e55583c4b8	refactor(rustc_middle): Substs -> GenericArg	2023-07-14 13:27:35 +01:00
Matthias Krüger	fc1cb0459d	Rollup merge of #113631 - lqd:fix-113597, r=petrochenkov make MCP510 behavior opt-in to avoid conflicts between the CLI and target flavors Fixes #113597, which contains more details on how this happens through the code, and showcases an unexpected `Gnu(Cc::Yes, Lld::Yes)` flavor. #112910 added support to use `lld` when the flavor requests it, but didn't explicitly do so only when using `-Clink-self-contained=+linker` or one of the unstable `-Clinker-flavor`s. The problem: some targets have a `lld` linker and flavor, e.g. `thumbv6m-none-eabi` from that issue. Users can override the linker but there are no linker flavors precise enough to describe the linker opting out of lld: when using `-Clinker=arm-none-eabi-gcc`, we infer this is a `Cc::Yes` linker flavor, but the `lld` component is unknown and therefore defaulted to the target's linker flavor, `Lld::Yes`. <details> <summary>Walkthrough of how this happens</summary> The linker flavor used is a mix between what can be inferred from the CLI (`-C linker`) and the target's default linker flavor: - there is no linker flavor on the CLI (and that also offers another workaround on nightly: `-C linker-flavor=gnu-cc -Zunstable-options`), so it will have to be inferred [from here](`5dac6b320b/compiler/rustc_codegen_ssa/src/back/link.rs (L1334-L1336)`) to [here](`5dac6b320b/compiler/rustc_codegen_ssa/src/back/link.rs (L1321-L1327)`). - in [`infer_linker_hints`](`5dac6b320b/compiler/rustc_target/src/spec/mod.rs (L320-L352)`) `-C linker=arm-none-eabi-gcc` infers a `Some(Cc::Yes)` cc hint, and no hint about lld. - the target's `linker_flavor` is combined in `with_cli_hints` with these hints. We have our `Cc::Yes`, but there is no hint about lld, [so the target's flavor `lld` component is used](`5dac6b320b/compiler/rustc_target/src/spec/mod.rs (L356-L358)`). It's [`Gnu(Cc::No, Lld::Yes)`](`993deaa0bf/compiler/rustc_target/src/spec/thumb_base.rs (L35)`). - so we now have our `Gnu(Cc::Yes, Lld::Yes)` flavor </details> This results in a `Gnu(Cc::Yes, Lld::Yes)` flavor on a non-lld linker, causing an additional unexpected `-fuse-ld=lld` argument to be passed. I don't know if this target defaulting to `rust-lld` is expected, but until MCP510's new linker flavor are stable, when people will be able to describe their linker/flavor accurately, this PR keeps the stable behavior of not doing anything when the linker/flavor on the CLI unexpectedly conflict with the target's. I've tested this on a `no_std` `-C linker=arm-none-eabi-gcc -C link-arg=-nostartfiles --target thumbv6m-none-eabi` example, trying to simulate one of `cortex-m`'s test mentioned in issue #113597 (I don't know how to build a local complete `thumbv6m-none-eabi` toolchain to run the exact test), and checked that `-fuse-lld` was indeed gone and the error disappeared. r? `````@petrochenkov`````	2023-07-13 22:33:25 +02:00
Mark Rousskov	cc907f80b9	Re-format let-else per rustfmt update	2023-07-12 21:49:27 -04:00
Rémy Rakic	2b61a5e17a	make MCP510 behavior explicitly opt-in because sometimes users can't opt out	2023-07-12 20:17:10 +00:00
Urgau	ad16606471	De-duplicate consecutive libs when printing native-static-libs	2023-07-12 20:04:30 +02:00
Charisee	650243977b	Use constants from object crate Replace hard-coded values with GNU_PROPERTY_{X86\|AARCH64}_FEATURE_1_AND from the object crate.	2023-07-11 23:48:18 +00:00
Matthias Krüger	685ba08693	Rollup merge of #113497 - xSetech:mips_32_abi, r=davidtwco Support explicit 32-bit MIPS ABI for the synthetic object PR #95604 introduced a "synthetic object file to ensure all exported and used symbols participate in the linking". One constraint on this file is that for MIPS-based targets, its architecture-specific ELF flags must be the same as all other object files passed to the linker. That's enforced by LLD, here: https://github.com/llvm/llvm-project/blob/llvmorg-16.0.6/lld/ELF/Arch/MipsArchTree.cpp#L77 The current approach to determining e_flags for 32-bit was implemented in PR #96930, which links to this issue that summarizes the problem well: https://github.com/ayrtonm/psx-sdk-rs/issues/9 > ... the temporary object file is created with an e_flags which is > invalid for 32-bit MIPS targets. The main issue is that it omits the ABI > bits (EF_MIPS_ABI_O32) which implies it uses the N64 ABI. To enable the N32 MIPS ABI (which succeeded O32), this patch enables setting the synthetic object's ABI based on the target "llvm-abiname" field, if it's given; otherwise, the O32 ABI is assumed for 32-bit MIPS targets. More information about the N32 ABI can be found here: https://web.archive.org/web/20160121005457/http://techpubs.sgi.com/library/manuals/2000/007-2816-005/pdf/007-2816-005.pdf	2023-07-11 17:46:19 +02:00
Erik Desjardins	0e76446a9f	ensure byval allocas are sufficiently aligned	2023-07-10 19:19:38 -04:00
Seth Junot	329e099400	Support explicit 32-bit MIPS ABI for the synthetic object PR #95604 introduced a "synthetic object file to ensure all exported and used symbols participate in the linking". One constraint on this file is that for MIPS-based targets, its architecture-specific ELF flags must be the same as all other object files passed to the linker. That's enforced by LLD, here: https://github.com/llvm/llvm-project/blob/llvmorg-16.0.6/lld/ELF/Arch/MipsArchTree.cpp#L77 The current approach to determining e_flags for 32-bit was implemented in PR #96930, which links to this issue that summarizes the problem well: https://github.com/ayrtonm/psx-sdk-rs/issues/9 > ... the temporary object file is created with an e_flags which is > invalid for 32-bit MIPS targets. The main issue is that it omits the ABI > bits (EF_MIPS_ABI_O32) which implies it uses the N64 ABI. To enable the N32 MIPS ABI (which succeeded O32), this patch enables setting the synthetic object's ABI based on the target "llvm-abiname" field, if it's given; otherwise, the O32 ABI is assumed for 32-bit MIPS targets. More information about the N32 ABI can be found here: https://web.archive.org/web/20160121005457/http://techpubs.sgi.com/library/manuals/2000/007-2816-005/pdf/007-2816-005.pdf	2023-07-09 11:17:37 -07:00
Matthias Krüger	4406a92cd1	Rollup merge of #111618 - cjgillot:name-return-place, r=tmiasko Always name the return place. MIR opts more and more consider `_0` as just another local, so there is no point in keeping the special case in debug-info logic.	2023-07-09 16:33:35 +02:00
Camille GILLOT	d7983a2f23	Always name the return place.	2023-07-08 15:38:40 +02:00
Nilstrieb	2beabbbf6f	Rename `adjustment::PointerCast` and variants using it to `PointerCoercion` It makes it sound like the `ExprKind` and `Rvalue` are supposed to represent all pointer related casts, when in reality their just used to share a some enum variants. Make it clear there these are only coercion to make it clear why only some pointer related "casts" are in the enum.	2023-07-07 18:17:16 +02:00
Boxy	12138b8e5e	Move `TyCtxt::mk_x` to `Ty::new_x` where applicable	2023-07-05 20:27:07 +01:00
Zalathar	cb570d6bc1	Move `coverageinfo::ffi` and `coverageinfo::map` out of SSA	2023-07-05 20:40:40 +10:00
Zalathar	9c430d38cf	Remove trait `CoverageInfoMethods`, since non-LLVM backends don't need it These methods are only ever called from within `rustc_codegen_llvm`, so they can just be declared there as well.	2023-07-05 20:40:40 +10:00
Zalathar	4169d0f756	Narrow trait `CoverageInfoBuilderMethods` down to just one method This effectively inlines most of `FunctionCx::codegen_coverage` into the LLVM implementation of `CoverageInfoBuilderMethods`.	2023-07-05 20:40:39 +10:00
bors	131a03664e	Auto merge of #113040 - Kobzol:llvm-remark-streamer, r=tmiasko Add `-Zremark-dir` unstable flag to write LLVM optimization remarks to YAML This PR adds an option for `rustc` to emit LLVM optimization remarks to a set of YAML files, which can then be digested by existing tools, like https://github.com/OfekShilon/optview2. When `-Cremark-dir` is passed, and remarks are enabled (`-Cremark=all`), the remarks will be now written to the specified directory, instead of being printed to standard error output. The files are named based on the CGU from which they are being generated. Currently, the remarks are written using the LLVM streaming machinery, directly in the diagnostics handler. It seemed easier than going back to Rust and then form there back to C++ to use the streamer from the diagnostics handler. But there are many ways to implement this, of course, so I'm open to suggestions :) I included some comments with questions into the code. Also, I'm not sure how to test this. r? `@tmiasko`	2023-07-02 12:48:44 +00:00
Jakub Beránek	62728c7aaf	Add `rustc` option to output LLVM optimization remarks to YAML files	2023-07-02 13:41:36 +02:00
bors	72b2101434	Auto merge of #112718 - oli-obk:SIMD-destructure_mir_const, r=cjgillot Make simd_shuffle_indices use valtrees This removes the second-to-last user of the `destructure_mir_constant` query. So in a follow-up we can remove the query and just move the query provider function directly into pretty printing (which is the last user). cc `@rust-lang/clippy` there's a small functional change, but I think it is correct?	2023-07-02 07:43:36 +00:00
bors	8e2d5e3a58	Auto merge of #112910 - lqd:mcp510, r=petrochenkov Implement most of MCP510 This implements most of what remains to be done for MCP510: - turns `-C link-self-contained` into a `+`/`-` list of components, like `-C link-self-contained=+linker,+crto,+libc,+unwind,+sanitizers,+mingw`. The scaffolding is present for all these expected components to be implemented and stabilized in the future on their own time. This PR only handles the `-Zgcc-ld=lld` subset of these link-self-contained components as `-Clink-self-contained=+linker` - handles `-C link-self-contained=y\|n` as-is today, for compatibility with `rustc_codegen_ssa:🔙🔗:self_contained`'s [explicit opt-in and opt-out](`9eee230cd0/compiler/rustc_codegen_ssa/src/back/link.rs (L1671-L1676)`). - therefore supports our plan to opt out of `rust-lld` (when it's enabled by default) even for current `-Clink-self-contained` users, with e.g. `-Clink-self-contained -Clink-self-contained=-linker` - turns `add_gcc_ld_path` into its expected final form, by using the `-C link-self-contained=+linker` CLI flag, and whether the `LinkerFlavor` has the expected `Cc::Yes` and `Lld::Yes` shape (this is not yet the case in practice for any CLI linker flavor) - makes the [new clean linker flavors](https://github.com/rust-lang/rust/pull/96827#issuecomment-1208441595) selectable in the CLI in addition to the legacy ones, in order to opt-in to using `cc` and `lld` to emulate `-Zgcc-ld=lld` - ensure the new `-C link-self-contained` components, and `-C linker-flavor`s are unstable, and require `-Z unstable-options` to be used The up-to-date set of flags for the future stable CLI version of `-Zgcc-ld=lld` is currently: `-Clink-self-contained=+linker -Clinker-flavor=gnu-lld-cc -Zunstable-options`. It's possible we'll also need to do something for distros that don't ship `rust-lld`, but maybe there are already no tool search paths to be added to `cc` in this situation anyways. r? `@petrochenkov`	2023-07-02 02:25:01 +00:00
bors	7905eff5f7	Auto merge of #112550 - loongarch-rs:fix-eflags, r=cjgillot loongarch: Fix ELF header flags This patch changes the ELF header flags so that the ABI matches the floating-point features. It also updates the link to the new official documentation.	2023-07-01 09:31:35 +00:00
Rémy Rakic	1da271b6d0	refactor `add_gcc_ld_path` into its final form	2023-06-30 21:07:05 +00:00
Rémy Rakic	0fb80715bb	use `LinkSelfContained` for `-C link-self-contained`	2023-06-30 21:01:38 +00:00
bors	af9df2fd91	Auto merge of #106619 - agausmann:avr-object-file, r=nagisa Fix unset e_flags in ELF files generated for AVR targets Closes #106576 ~~Sort-of blocked by gimli-rs/object#500~~ (merged) I'm not sure whether the list of AVR CPU names is okay here. Maybe it could be moved out-of-line to improve the readability of the function.	2023-06-30 08:55:56 +00:00
Matthias Krüger	4696a92183	Rollup merge of #111322 - mirkootter:master, r=davidtwco Support for native WASM exceptions ### Motivation Currently, rustc does not support native WASM exceptions. It does support JavaScript based exceptions for the wasm32-emscripten-target, but this requires back&forth with javascript for many calls, which is very slow. Native wasm support for exceptions is quite common: Clang+LLVM implemented them years ago, and all major browsers support them by now. They enable zero-cost exceptions, at least with regard to runtime-performance-cost. They may increase startup-time and code size, though. ### Important: This PR does not change default behaviour Exceptions usually add a lot of code in form of unwinding blocks, increasing the binary size. Most users probably do not want that, especially which regard to web development. Therefore, wasm exceptions play a similar role as WASM-threads: rustc should support them, like clang does, but users who want to use it have to use some command-line magic like rustflags to opt in. ### What does this PR do? As stated above, the default behaviour is not changed. It is already possible to opt-in into wasm exceptions using the command line. Unfortunately, the LLVM IR is invalid and the LLVM backend crashes. ``` rustc <sourcefile> --target wasm32-unknown-unknown -C panic=unwind -C llvm-args=-wasm-enable-eh -C target-feature=+exception-handling ``` As it turns out, LLVM is quite picky when it comes to IR for exception handling. If the IR does not look exactly like it should, some LLVM-assertions fail and the code generation crashes. This PR adds the necessary modifications to the code generator to make it work. It also adds `exception-handling` as a wasm target feature. ### What this PR does not / what is missing This PR is not a full fledges solution. It is the first step. A few parts are still missing; however, it is already useable (see next section). Currently missing: * The std library has to be adapted. Currently, only [no_std] crates work * Usually, nested exceptions abort the program (i.e. a panic during the cleanup of another panic). This is currently not done yet. - Currently, code inside cleanup handlers does not unwind - To fix this requires a little more work: The code generator currently maintains a single terminate block per function for this. Unfortunately, WASM requires funclet based exception handling. Therefore, we need to create a terminate block per funclet. This is probably not a big problem, but I want to keep this PR simple. ### How to use the compiler given this PR? This PR does not add any command line flags or features. It uses those which are already there. To compile with exceptions enabled, you need * to set the panic strategy to unwind, i.e. `-C panic=unwind` * to enable the exception-handling target feature, i.e. `-C target-feature=+exception-handling` * to tell LLVM about the exception handling, i.e. `-C llvm-args=-wasm-enable-eh` Since the standard library has not been adapted, you can only use it in [no_std] crates as of now. The intrinsic `core::intrinsics::r#try` works. To throw exceptions, you need the ```@llvm.wasm.throw``` intrinsic. I created a sample application which works for me: https://github.com/mirkootter/rust-wasm-demos This example can be run at https://webassembly.sh	2023-06-29 16:36:30 +02:00
Takayuki Maeda	bad0688563	Rollup merge of #112946 - nnethercote:improve-cgu-naming-and-ordering, r=wesleywiser Improve cgu naming and ordering Some quality of life improvements when debugging and profiling CGU formation. r? `@wesleywiser`	2023-06-29 03:29:32 +09:00
bors	bb95b7dcd6	Auto merge of #112307 - lcnr:operand-ref, r=compiler-errors mir opt + codegen: handle subtyping fixes #107205 the same issue was caused in multiple places: - mir opts: both copy and destination propagation - codegen: assigning operands to locals (which also propagates values) I changed codegen to always update the type in the operands used for locals which should guard against any new occurrences of this bug going forward. I don't know how to make mir optimizations more resilient here. Hopefully the added tests will be enough to detect any trivially wrong optimizations going forward.	2023-06-28 00:41:37 +00:00
Matthias Krüger	1880e83ae3	Rollup merge of #112207 - qwandor:virt_feature, r=davidtwco Add trustzone and virtualization target features for aarch32. These are LLVM target features which allow the `smc` and `hvc` instructions respectively to be used in inline assembly.	2023-06-27 22:10:12 +02:00
Oli Scherer	acdfec6061	Move mir const to valtree conversion to its own method.	2023-06-26 09:34:52 +00:00
Oli Scherer	168de14ac9	Make simd_shuffle_indices use valtrees	2023-06-26 09:34:52 +00:00
Nicholas Nethercote	666b1b68a7	Tweak thread names for CGU processing. For non-incremental builds on Unix, currently all the thread names look like `opt regex.f10ba03eb5ec7975-cgu.0`. But they are truncated by `pthread_setname` to `opt regex.f10ba`, hiding the numeric suffix that distinguishes them. This is really annoying when using a profiler like Samply. This commit changes these thread names to a form like `opt cgu.0`, which is much better.	2023-06-26 09:14:45 +10:00
Nicholas Nethercote	fae4f45214	Remove unused fields from `CodegenContext`.	2023-06-22 09:07:19 +10:00
Nicholas Nethercote	3bbf9f0128	Introduce `CodegenState`. The codegen main loop has two bools, `codegen_done` and `codegen_aborted`. There are only three valid combinations: `(false, false)`, `(true, false)`, `(true, true)`. This commit replaces them with a single tri-state enum, which makes things clearer.	2023-06-22 09:07:15 +10:00
Nicholas Nethercote	a521ba400d	Add comments to `Message` and `WorkItem`. This is particularly useful for `Message`.	2023-06-22 08:07:59 +10:00
Nicholas Nethercote	88cd8f9324	Simplify `Message`. `Message` is an enum with multiple variants. Four of those variants map directly onto the four variants of `WorkItemResult`. This commit reduces those four `Message` variants to a single variant containing a `WorkItemResult`. This requires increasing `WorkItemResult`'s visibility to `pub(crate)` visibility, but `WorkItem` and `Message` can also have their visibility reduced to `pub(crate)`. This change avoids some boilerplate enum translation code, and makes `Message` easier to understand.	2023-06-22 08:07:59 +10:00
Nicholas Nethercote	757c290fba	Move `Message::CodegenItem` to a separate type. `Message` is an enum with multiple variants, for messages sent to the coordinator thread. Except for `Message::CodegenItem`, which is entirely disjoint, being for messages sent from the coordinator thread to the main thread. This commit move `Message::CodegenItem` into a separate type, `CguMessage`, which makes the code much clearer.	2023-06-22 08:07:59 +10:00
Nilstrieb	904994e101	Rollup merge of #112830 - nnethercote:more-codegen-cleanups, r=oli-obk More codegen cleanups Some additional cleanups I found while looking closely at this code, following up from #112827. r= `@oli-obk`	2023-06-21 07:37:03 +02:00
Nicholas Nethercote	c696307a87	Inline and remove `WorkItem::start_profiling` and `execute_work_item`. They both match on a `WorkItem`. It's simpler to do it all in one place.	2023-06-21 10:56:19 +10:00
Guillaume Gomez	0688182f9b	Rollup merge of #112794 - bjorn3:fix_lib_global_alloc, r=oli-obk Fix linker failures when #[global_allocator] is used in a dependency Fixes https://github.com/rust-lang/rust/issues/112715	2023-06-20 14:23:41 +02:00
Michael Goulet	31d1fbf8d2	Rollup merge of #112232 - fee1-dead-contrib:match-eq-const-msg, r=b-naber Better error for non const `PartialEq` call generated by `match` Resolves #90237	2023-06-19 17:53:33 -07:00
bjorn3	206b951803	Fix linker failures when #[global_allocator] is used in a dependency	2023-06-19 17:31:54 +00:00
Scott McMurray	3fd8501823	Remove unchecked_add/sub/mul/shl/shr from CTFE/cg_ssa/cg_clif	2023-06-19 01:47:03 -07:00
Scott McMurray	39788e07ba	Promote unchecked_add/sub/mul/shl/shr to mir::BinOp	2023-06-19 01:47:03 -07:00
lcnr	46973c9c8a	add FIXME's for a later refactoring	2023-06-19 09:16:26 +02:00
lcnr	46af169ec5	codegen: fix `OperandRef` subtype handling	2023-06-19 09:06:42 +02:00
Deadbeef	89c24af133	Better error for non const `PartialEq` call generated by `match`	2023-06-18 05:24:38 +00:00
Michael Goulet	3eb8c2ae10	Rollup merge of #112474 - ldm0:ldm_enum_debuginfo_128_support, r=compiler-errors Support 128-bit enum variant in debuginfo codegen fixes #111600	2023-06-16 12:53:22 -07:00
bors	99b334696f	Auto merge of #112597 - danakj:map-linker-paths, r=michaelwoerister Use the relative sysroot path in the linker command line to specify sysroot rlibs This addresses https://github.com/rust-lang/rust/issues/112586	2023-06-16 09:02:50 +00:00
Guillaume Gomez	05d5449522	Rollup merge of #112669 - Nilstrieb:typo, r=jyn514 Fix comment for ptr alignment checks in codegen	2023-06-15 22:04:58 +02:00
Nilstrieb	465e4d9c9c	Fix comment for ptr alignment checks in codegen	2023-06-15 19:27:31 +02:00
danakj	c340325ebf	Remap dylib paths into the sysroot to be relative to the sysroot Like for rlibs, the paths on the linker command line need to be relative paths if the sysroot was specified by the user to be a relative path. Dylibs put the path in /LIBPATH instead of into the file path of the library itself, so we rehome the libpath and adjust the rehoming function to be able to support both use cases, rlibs and dylibs.	2023-06-15 11:13:03 -04:00
danakj	4b83156a5c	Move the comment on fixing paths to where it belongs	2023-06-14 10:58:08 -04:00
danakj	7e07271eaf	Avoid absolute sysroot paths in the MSVC linker command line When the `--sysroot` is specified as relative to the current working directory, the sysroot's rlibs should also be specified as relative paths. Otherwise, the current working directory ends up in the absolute paths, and in the linker command line. And the entire linker command line appears in the PDB file generated by the MSVC linker. When adding an rlib to the linker command line, if the rlib's canonical path is in the sysroot's canonical path, then use the current sysroot path + filename instead of the full absolute path to the rlib. This means that when `--sysroot=foo` is specified, the linker command line will contain `foo/rustlib/target/lib/lib*.rlib` instead of the full absolute path to the same. This addresses https://github.com/rust-lang/rust/issues/112586	2023-06-14 10:44:00 -04:00
Nicholas Nethercote	7c3ce02a11	Introduce a minimum CGU size in non-incremental builds. Because tiny CGUs make compilation less efficient and result in worse generated code. We don't do this when the number of CGUs is explicitly given, because there are times when the requested number is very important, as described in some comments within the commit. So the commit also introduces a `CodegenUnits` type that distinguishes between default values and user-specified values. This change has a roughly neutral effect on walltimes across the rustc-perf benchmarks; there are some speedups and some slowdowns. But it has significant wins for most other metrics on numerous benchmarks, including instruction counts, cycles, binary size, and max-rss. It also reduces parallelism, which is good for reducing jobserver competition when multiple rustc processes are running at the same time. It's smaller benchmarks that benefit the most; larger benchmarks already have CGUs that are all larger than the minimum size. Here are some example before/after CGU sizes for opt builds. - html5ever - CGUs: 16, mean size: 1196.1, sizes: [3908, 2992, 1706, 1652, 1572, 1136, 1045, 948, 946, 938, 579, 471, 443, 327, 286, 189] - CGUs: 4, mean size: 4396.0, sizes: [6706, 3908, 3490, 3480] - libc - CGUs: 12, mean size: 35.3, sizes: [163, 93, 58, 53, 37, 8, 2 (x6)] - CGUs: 1, mean size: 424.0, sizes: [424] - tt-muncher - CGUs: 5, mean size: 1819.4, sizes: [8508, 350, 198, 34, 7] - CGUs: 1, mean size: 9075.0, sizes: [9075] Note that CGUs of size 100,000+ aren't unusual in larger programs.	2023-06-14 10:57:44 +10:00
WANG Rui	aa8e8642d9	loongarch: Fix ELF header flags	2023-06-12 19:50:47 +08:00
DonoughLiu	204bfb6a8c	Support 128-bit enum variant in debuginfo codegen	2023-06-10 03:39:24 +08:00
bors	343ad6f059	Auto merge of #111626 - pjhades:output, r=b-naber Write to stdout if `-` is given as output file With this PR, if `-o -` or `--emit KIND=-` is provided, output will be written to stdout instead. Binary output (those of type `obj`, `llvm-bc`, `link` and `metadata`) being written this way will result in an error unless stdout is not a tty. Multiple output types going to stdout will trigger an error too, as they will all be mixded together. This implements https://github.com/rust-lang/compiler-team/issues/431 The idea behind the changes is to introduce an `OutFileName` enum that represents the output - be it a real path or stdout - and to use this enum along the code paths that handle different output types.	2023-06-09 09:45:40 +00:00
bors	e7409258db	Auto merge of #112415 - GuillaumeGomez:rollup-5pa9frd, r=GuillaumeGomez Rollup of 9 pull requests Successful merges: - #112034 (Migrate `item_opaque_ty` to Askama) - #112179 (Avoid passing --cpu-features when empty) - #112309 (bootstrap: remove dependency `is-terminal`) - #112388 (Migrate GUI colors test to original CSS color format) - #112389 (Add a test for #105709) - #112392 (Fix ICE for while loop with assignment condition with LHS place expr) - #112394 (Remove accidental comment) - #112396 (Track more diagnostics in `rustc_expand`) - #112401 (Don't `use compile_error as print`) r? `@ghost` `@rustbot` modify labels: rollup	2023-06-08 10:31:52 +00:00
Guillaume Gomez	ad9d7e3ed5	Rollup merge of #112179 - tamird:no-empty-cpu-features, r=petrochenkov Avoid passing --cpu-features when empty Added in `12ac719b99`, this logic always passed --cpu-features, even when the value was the empty string.	2023-06-08 10:15:09 +02:00
bors	a0df04c0f2	Auto merge of #110040 - ndrewxie:issue-84447-partial-1, r=lcnr,michaelwoerister Removed use of iteration through a HashMap/HashSet in rustc_incremental and replaced with IndexMap/IndexSet This allows for the `#[allow(rustc::potential_query_instability)]` in rustc_incremental to be removed, moving towards fixing #84447 (although a LOT more modules have to be changed to fully resolve it). Only HashMaps/HashSets that are being iterated through have been modified (although many structs and traits outside of rustc_incremental had to be modified as well, as they had fields/methods that involved a HashMap/HashSet that would be iterated through) I'm making a PR for just 1 module changed to test for performance regressions and such, for future changes I'll either edit this PR to reflect additional modules being converted, or batch multiple modules of changes together and make a PR for each group of modules.	2023-06-08 07:30:03 +00:00
bors	f383703e32	Auto merge of #111698 - Amanieu:force-static-lib, r=petrochenkov Force all native libraries to be statically linked when linking a static binary Previously, `#[link]` without an explicit `kind = "static"` would confuse the linker and end up producing a dynamically linked library because of the `-Bdynamic` flag. However this binary would not work correctly anyways since it was linked with startup code for a static binary. This PR solves this by forcing all native libraries to be statically linked when the output is a static binary that cannot link to dynamic libraries anyways. Fixes #108878 Fixes #102993	2023-06-07 22:02:24 +00:00
Amanieu d'Antras	0304e0a5b0	Force all native libraries to be statically linked when linking a static binary	2023-06-07 19:30:37 +01:00
Jan-Mirko Otter	82730b4521	wasm exception handling	2023-06-07 17:48:28 +02:00
Jan-Mirko Otter	35cdb28c84	add comment	2023-06-07 17:46:34 +02:00
Jan-Mirko Otter	82336c1311	wasm target feature: exception handling	2023-06-07 17:46:34 +02:00
Jing Peng	9b1a1e1d95	Write to stdout if `-` is given as output file If `-o -` or `--emit KIND=-` is provided, output will be written to stdout instead. Binary output (`obj`, `llvm-bc`, `link` and `metadata`) being written this way will result in an error unless stdout is not a tty. Multiple output types going to stdout will trigger an error too, as they will all be mixded together.	2023-06-06 17:53:29 -04:00
bors	fd9bf59436	Auto merge of #111999 - scottmcm:codegen-less-memcpy, r=compiler-errors Use `load`+`store` instead of `memcpy` for small integer arrays I was inspired by #98892 to see whether, rather than making `mem::swap` do something smart in the library, we could update MIR assignments like `_1 = _2` to do something smarter than `memcpy` for sufficiently-small types that doing it inline is going to be better than a `memcpy` call in assembly anyway. After all, special code may help `mem::swap`, but if the "obvious" MIR can just result in the correct thing that helps everything -- other code like `mem::replace`, people doing it manually, and just passing around by value in general -- as well as makes MIR inlining happier since it doesn't need to deal with all the complicated library code if it just sees a couple assignments. LLVM will turn the short, known-length `memcpy`s into direct instructions in the backend, but that's too late for it to be able to remove `alloca`s. In general, replacing `memcpy`s with typed instructions is hard in the middle-end -- even for `memcpy.inline` where it knows it won't be a function call -- is hard [due to poison propagation issues](https://rust-lang.zulipchat.com/#narrow/stream/187780-t-compiler.2Fwg-llvm/topic/memcpy.20vs.20load-store.20for.20MIR.20assignments/near/360376712). So because we know more about the type invariants -- these are typed copies -- rustc can emit something more specific, allowing LLVM to `mem2reg` away the `alloca`s in some situations. #52051 previously did something like this in the library for `mem::swap`, but it ended up regressing during enabling mir inlining (`cbbf06b0cd`), so this has been suboptimal on stable for ≈5 releases now. The code in this PR is narrowly targeted at just integer arrays in LLVM, but works via a new method on the [`LayoutTypeMethods`](https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/traits/trait.LayoutTypeMethods.html) trait, so specific backends based on cg_ssa can enable this for more situations over time, as we find them. I don't want to try to bite off too much in this PR, though. (Transparent newtypes and simple things like the 3×usize `String` would be obvious candidates for a follow-up.) Codegen demonstrations: <https://llvm.godbolt.org/z/fK8hT9aqv> Before: ```llvm define void `@swap_rgb48_old(ptr` noalias nocapture noundef align 2 dereferenceable(6) %x, ptr noalias nocapture noundef align 2 dereferenceable(6) %y) unnamed_addr #1 { %a.i = alloca [3 x i16], align 2 call void `@llvm.lifetime.start.p0(i64` 6, ptr nonnull %a.i) call void `@llvm.memcpy.p0.p0.i64(ptr` noundef nonnull align 2 dereferenceable(6) %a.i, ptr noundef nonnull align 2 dereferenceable(6) %x, i64 6, i1 false) tail call void `@llvm.memcpy.p0.p0.i64(ptr` noundef nonnull align 2 dereferenceable(6) %x, ptr noundef nonnull align 2 dereferenceable(6) %y, i64 6, i1 false) call void `@llvm.memcpy.p0.p0.i64(ptr` noundef nonnull align 2 dereferenceable(6) %y, ptr noundef nonnull align 2 dereferenceable(6) %a.i, i64 6, i1 false) call void `@llvm.lifetime.end.p0(i64` 6, ptr nonnull %a.i) ret void } ``` Note it going to stack: ```nasm swap_rgb48_old: # `@swap_rgb48_old` movzx eax, word ptr [rdi + 4] mov word ptr [rsp - 4], ax mov eax, dword ptr [rdi] mov dword ptr [rsp - 8], eax movzx eax, word ptr [rsi + 4] mov word ptr [rdi + 4], ax mov eax, dword ptr [rsi] mov dword ptr [rdi], eax movzx eax, word ptr [rsp - 4] mov word ptr [rsi + 4], ax mov eax, dword ptr [rsp - 8] mov dword ptr [rsi], eax ret ``` Now: ```llvm define void `@swap_rgb48(ptr` noalias nocapture noundef align 2 dereferenceable(6) %x, ptr noalias nocapture noundef align 2 dereferenceable(6) %y) unnamed_addr #0 { start: %0 = load <3 x i16>, ptr %x, align 2 %1 = load <3 x i16>, ptr %y, align 2 store <3 x i16> %1, ptr %x, align 2 store <3 x i16> %0, ptr %y, align 2 ret void } ``` still lowers to `dword`+`word` operations, but has no stack traffic: ```nasm swap_rgb48: # `@swap_rgb48` mov eax, dword ptr [rdi] movzx ecx, word ptr [rdi + 4] movzx edx, word ptr [rsi + 4] mov r8d, dword ptr [rsi] mov dword ptr [rdi], r8d mov word ptr [rdi + 4], dx mov word ptr [rsi + 4], cx mov dword ptr [rsi], eax ret ``` And as a demonstration that this isn't just `mem::swap`, a `mem::replace` on a small array (since replace doesn't use swap since #83022), which used to be `memcpy`s in LLVM changes in IR ```llvm define void `@replace_short_array(ptr` noalias nocapture noundef sret([3 x i32]) dereferenceable(12) %0, ptr noalias noundef align 4 dereferenceable(12) %r, ptr noalias nocapture noundef readonly dereferenceable(12) %v) unnamed_addr #0 { start: %1 = load <3 x i32>, ptr %r, align 4 store <3 x i32> %1, ptr %0, align 4 %2 = load <3 x i32>, ptr %v, align 4 store <3 x i32> %2, ptr %r, align 4 ret void } ``` but that lowers to reasonable `dword`+`qword` instructions still ```nasm replace_short_array: # `@replace_short_array` mov rax, rdi mov rcx, qword ptr [rsi] mov edi, dword ptr [rsi + 8] mov dword ptr [rax + 8], edi mov qword ptr [rax], rcx mov rcx, qword ptr [rdx] mov edx, dword ptr [rdx + 8] mov dword ptr [rsi + 8], edx mov qword ptr [rsi], rcx ret ```	2023-06-06 01:50:28 +00:00

... 2 3 4 5 6 ...

1926 Commits