nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2024-11-27 01:04:03 +00:00

Author	SHA1	Message	Date
Scott McMurray	1de2257c3f	Add `intrinsics::transmute_unchecked` This takes a whole 3 lines in `compiler/` since it lowers to `CastKind::Transmute` in MIR exactly the same as the existing `intrinsics::transmute` does, it just doesn't have the fancy checking in `hir_typeck`. Added to enable experimenting with the request in <https://github.com/rust-lang/rust/pull/106281#issuecomment-1496648190> and because the portable-simd folks might be interested for dependently-sized array-vector conversions. It also simplifies a couple places in `core`.	2023-04-22 17:22:03 -07:00
Wesley Wiser	4e8b642646	Turn on ConstDebugInfo pass.	2023-04-22 23:41:48 +02:00
bors	7e23d180c1	Auto merge of #109993 - scottmcm:transmute-niches, r=oli-obk `assume` value ranges in `transmute` Fixes #109958	2023-04-20 10:46:13 +00:00
Scott McMurray	baf98e7515	Add transmute optimization tests and some extra comments	2023-04-19 23:17:35 -07:00
Guillaume Gomez	e6b607335a	Rollup merge of #110441 - kadiwa4:typos, r=thomcc 5 little typos	2023-04-18 14:50:51 +02:00
bors	5fe3528be5	Auto merge of #110242 - cuviper:vanilla-llvm-16, r=Mark-Simulacrum ci: add a runner for vanilla LLVM 16 Like #107044, this will let us track compatibility with LLVM 16 going forward, especially after we eventually upgrade our own to the next. This also drops `tidy` here and in `x86_64-gnu-llvm-15`, syncing with that change in #106085.	2023-04-18 08:38:04 +00:00
Matthias Krüger	c81e8b8e18	Rollup merge of #110455 - durin42:tls-D148269-fix, r=nikic tests: adapt for LLVM change 5b386b864c7619897c51a1da97d78f1cf6f3eff6 The above-mentioned change modified the output of thread-local.rs by changing some variable names. Rather than assume things get put in %0, we capture the variable so the test passes in both the old and new version.	2023-04-17 18:13:37 +02:00
Matthias Krüger	eb0524615c	Rollup merge of #110313 - fee1-dead-contrib:repr_align_method, r=WaffleLapkin allow `repr(align = x)` on inherent methods Discussion: https://github.com/rust-lang/rust/issues/82232#issuecomment-905929314	2023-04-17 18:13:34 +02:00
Augie Fackler	bef3502dba	tests: adapt for LLVM change 5b386b864c7619897c51a1da97d78f1cf6f3eff6 The above-mentioned change modified the output of thread-local.rs by changing some variable names. Rather than assume things get put in %0, we capture the variable so the test passes in both the old and new version.	2023-04-17 10:53:18 -04:00
kadiwa	85653831f7	typos	2023-04-17 09:16:07 +02:00
bors	5546cb64f6	Auto merge of #109247 - saethlin:inline-without-inline, r=oli-obk Permit MIR inlining without #[inline] I noticed that there are at least a handful of portable-simd functions that have no `#[inline]` but compile to an assign + return. I locally benchmarked inlining thresholds between 0 and 50 in increments of 5, and 50 seems to be the best. Interesting. That didn't include check builds though, ~maybe perf will have something to say about that~. Perf has little useful to say about this. We generally regress all the check builds, as best as I can tell, due to a number of small codegen changes in a particular hot function in the compiler. Probably this is because we've nudged the inlining outcomes all over, and uses of `#[inline(always)]`/`#[inline(never)]` might need to be adjusted.	2023-04-17 02:36:38 +00:00
Josh Stone	33036159a4	ci: add a runner for vanilla LLVM 16 Like #107044, this will let us track compatibility with LLVM 16 going forward, especially after we eventually upgrade our own to the next. This also drops `tidy` here and in `x86_64-gnu-llvm-15`, syncing with that change in #106085.	2023-04-16 11:50:20 -07:00
Deadbeef	dda89945b7	Allow all associated functions and add test	2023-04-16 06:31:08 +00:00
Camille GILLOT	4a1ff5e04d	Bless codegen test.	2023-04-15 07:46:46 +00:00
Camille GILLOT	700084aa97	Update codegen test.	2023-04-14 16:26:11 +00:00
Deadbeef	b59ec166ad	allow `repr(align = x)` on inherent methods	2023-04-14 06:39:48 +00:00
Scott McMurray	1bcb0ec28c	`assume` value ranges in `transmute` Fixes #109958	2023-04-13 00:12:39 -07:00
bors	d8fc819247	Auto merge of #109466 - davidlattimore:inline-arg-via-var-debug-info, r=wesleywiser Preserve argument indexes when inlining MIR We store argument indexes on VarDebugInfo. Unlike the previous method of relying on the variable index to know whether a variable is an argument, this survives MIR inlining. We also no longer check if var.source_info.scope is the outermost scope. When a function gets inlined, the arguments to the inner function will no longer be in the outermost scope. What we care about though is whether they were in the outermost scope prior to inlining, which we know by whether we assigned an argument index. Fixes #83217 I considered using `Option<NonZeroU16>` instead of `Option<u16>` to store the index. I didn't because `TypeFoldable` isn't implemented for `NonZeroU16` and because it looks like due to padding, it currently wouldn't make any difference. But I indexed from 1 anyway because (a) it'll make it easier if later it becomes worthwhile to use a `NonZeroU16` and because the arguments were previously indexed from 1, so it made for a smaller change. This is my first PR on rust-lang/rust, so apologies if I've gotten anything not quite right.	2023-04-13 01:51:27 +00:00
bors	13d1802b88	Auto merge of #109895 - nikic:llvm-16-tests, r=cuviper Add codegen tests for issues fixed by LLVM 16 Fixes #75978. Fixes #99960. Fixes #101048. Fixes #101082. Fixes #101814. Fixes #103132. Fixes #103327.	2023-04-12 02:30:21 +00:00
Nikita Popov	83f525cc28	Make test compatible with 32-bit	2023-04-11 17:19:07 +02:00
Nikita Popov	ec635c002b	Add ignore-debug to two tests These don't optimize with debug assertions. For one of them, this is due to the new alignment checks, for the other I'm not sure what specifically blocks it.	2023-04-11 11:22:15 +02:00
David Lattimore	a6292676eb	Preserve argument indexes when inlining MIR We store argument indexes on VarDebugInfo. Unlike the previous method of relying on the variable index to know whether a variable is an argument, this survives MIR inlining. We also no longer check if var.source_info.scope is the outermost scope. When a function gets inlined, the arguments to the inner function will no longer be in the outermost scope. What we care about though is whether they were in the outermost scope prior to inlining, which we know by whether we assigned an argument index.	2023-04-11 11:07:48 +10:00
Scott McMurray	d757c4b904	Handle not all immediates having `abi::Scalar`s	2023-04-09 11:16:50 -07:00
Ben Kimock	e88e2af959	Give the cross-crate generic some work to do	2023-04-07 15:46:45 -04:00
Ben Kimock	e3126b1084	Permit MIR inlining without #[inline]	2023-04-07 15:46:43 -04:00
Scott McMurray	454bca514a	Check `CastKind::Transmute` sizes in a better way Fixes #110005	2023-04-06 13:53:10 -07:00
bors	2e486be8d2	Auto merge of #107925 - thomcc:sip13, r=cjgillot Use SipHash-1-3 instead of SipHash-2-4 for StableHasher Noticed this, and it seems easy and likely a perf win. IIUC we don't need DDOS resistance (just collision) so we ideally would have an even faster hash, but it's hard to beat this SipHash impl here, since it's been so highly tuned for the interface. It wouldn't surprise me if there's some subtle reason changing this sucks, as it's so obvious it seems likely to have been done. Still, SipHash-1-3 seems to still have the guarantees StableHasher should need (and seemingly more), and is clearly less work. So it's worth a shot. Not fully tested locally.	2023-04-05 18:35:34 +00:00
bors	b2b676d886	Auto merge of #108905 - ferrocene:pa-compiletest-ignore, r=ehuss Validate `ignore` and `only` compiletest directive, and add human-readable ignore reasons This PR adds strict validation for the `ignore` and `only` compiletest directives, failing if an unknown value is provided to them. Doing so uncovered 79 tests in `tests/ui` that had invalid directives, so this PR also fixes them. Finally, this PR adds human-readable ignore reasons when tests are ignored due to `ignore` or `only` directives, like "only executed when the architecture is aarch64" or "ignored when the operative system is windows". This was the original reason why I started working on this PR and #108659, as we need both of them for Ferrocene. The PR is a draft because the code is extremely inefficient: it calls `rustc --print=cfg --target $target` for every rustc target (to gather the list of allowed ignore values), which on my system takes between 4s and 5s, and performs a lot of allocations of constant values. I'll fix both of them in the coming days. r? `@ehuss`	2023-04-05 16:15:25 +00:00
Rémy Rakic	931fd8539e	Fix codegen tests with hard-coded hashes	2023-04-05 15:59:29 +00:00
Thom Chiovoloni	36ca32c1ed	Fix a codegen test with some hard-coded hashes	2023-04-05 15:59:29 +00:00
bors	8d321f7a88	Auto merge of #109843 - scottmcm:better-transmute, r=WaffleLapkin Allow `transmute`s to produce `OperandValue`s instead of needing `alloca`s LLVM can usually optimize these away, but especially for things like transmutes of newtypes it's silly to generate the `alloc`+`store`+`load` at all when it's actually a nop at LLVM level.	2023-04-05 03:26:38 +00:00
Scott McMurray	9aa9a846b6	Allow `transmute`s to produce `OperandValue`s instead of always using `alloca`s LLVM can usually optimize these away, but especially for things like transmutes of newtypes it's silly to generate the `alloc`+`store`+`load` at all when it's actually a nop at LLVM level.	2023-04-04 18:44:29 -07:00
bors	700938c078	Auto merge of #109808 - jyn514:debuginfo-options, r=michaelwoerister Extend -Cdebuginfo with new options and named aliases This is a rebase of https://github.com/rust-lang/rust/pull/83947, along with my best guess at what the new options mean. I tried to follow the LLVM source code to get a better idea but ran into quite a lot of trouble (https://rust-lang.zulipchat.com/#narrow/stream/187780-t-compiler.2Fwg-llvm/topic/go-to-definition.20in.20src.2Fllvm-project.3F). The description for the original PR follows below. Note that the changes in this PR have already been through FCP: https://github.com/rust-lang/rust/pull/83947#issuecomment-878384979 Closes https://github.com/rust-lang/rust/pull/109311. Helps with https://github.com/rust-lang/rust/pull/104968. r? `@michaelwoerister` cc `@cuviper` --- The -Cdebuginfo=1 option was never line tables only and can't be due to backwards compatibility issues. This was clarified and an option for emitting line tables only was added. Additionally an option for emitting line info directives only was added, which is needed for some targets, i.e. nvptx. The debug info options should now behave similarly to clang's debug info options. Fix https://github.com/rust-lang/rust/issues/60020 Fix https://github.com/rust-lang/rust/issues/64405	2023-04-04 20:01:05 +00:00
Nikita Popov	73f40d4293	Add codegen tests for issues fixed by LLVM 16 Fixes #75978. Fixes #99960. Fixes #101048. Fixes #101082. Fixes #101814. Fixes #103132. Fixes #103327.	2023-04-03 17:02:57 +02:00
The 8472	7a70647f19	llvm 16 finally reconizes some additional vec in-place conversions as noops	2023-04-03 15:29:46 +02:00
Pietro Albini	8f8873e386	remove unknown xcore arch	2023-04-03 10:23:09 +02:00
Pietro Albini	3602200d50	make 32bit ignore more accurate	2023-04-03 10:23:08 +02:00
Pietro Albini	e045598c68	remove a bunch of unknown archs from the global_asm tests	2023-04-03 09:30:37 +02:00
Pietro Albini	e592aaa705	remove invalid ignore-powerpc64le	2023-04-03 09:24:12 +02:00
Julia Tatz	7b453b9f5a	More in-depth documentation for the new debuginfo options	2023-03-31 07:28:39 -04:00
Julia Tatz	0504a33383	Preserve, clarify, and extend debug information `-Cdebuginfo=1` was never line tables only and can't be due to backwards compatibility issues. This was clarified and an option for line tables only was added. Additionally an option for line info directives only was added, which is well needed for some targets. The debug info options should now behave the same as clang's debug info options.	2023-03-31 07:28:39 -04:00
bors	22a7a19f93	Auto merge of #98112 - saethlin:mir-alignment-checks, r=oli-obk Insert alignment checks for pointer dereferences when debug assertions are enabled Closes https://github.com/rust-lang/rust/issues/54915 - [x] Jake tells me this sounds like a place to use `MirPatch`, but I can't figure out how to insert a new basic block with a new terminator in the middle of an existing basic block, using `MirPatch`. (if nobody else backs up this point I'm checking this as "not actually a good idea" because the code looks pretty clean to me after rearranging it a bit) - [x] Using `CastKind::PointerExposeAddress` is definitely wrong, we don't want to expose. Calling a function to get the pointer address seems quite excessive. ~I'll see if I can add a new `CastKind`.~ `CastKind::Transmute` to the rescue! - [x] Implement a more helpful panic message like slice bounds checking. r? `@oli-obk`	2023-03-31 08:50:35 +00:00
Rémy Rakic	9f16a81bc8	update codegen test expectations Changing the layout of the InitMask changed the const allocations' hashes.	2023-03-27 17:44:33 +00:00
bors	0c61c7a978	Auto merge of #109474 - nikic:llvm-16-again, r=cuviper Upgrade to LLVM 16, again Relative to the previous attempt in https://github.com/rust-lang/rust/pull/107224: * Update to GCC 8.5 on dist-x86_64-linux, to avoid std::optional ABI-incompatibility between libstdc++ 7 and 8. * Cherry-pick `96df79af02`. * Cherry-pick `6fc670e5e3`. r? `@cuviper`	2023-03-25 19:55:10 +00:00
bors	31d74fb24b	Auto merge of #109220 - nikic:poison, r=cuviper Use poison instead of undef In cases where it is legal, we should prefer poison values over undef values. This replaces undef with poison for aggregate construction and for uninhabited types. There are more places where we can likely use poison, but I wanted to stay conservative to start with. In particular the aggregate case is important for newer LLVM versions, which are not able to handle an undef base value during early optimization due to poison-propagation concerns. r? `@cuviper`	2023-03-24 15:39:40 +00:00
Ben Kimock	8ccf53332e	A MIR transform that checks pointers are aligned	2023-03-23 18:23:06 -04:00
bors	e216300876	Auto merge of #108442 - scottmcm:mir-transmute, r=oli-obk Add `CastKind::Transmute` to MIR ~~Nothing actually produces it in this commit, so I don't know how to test it, but it also means it shouldn't be possible for it to break anything.~~ Includes lowering `transmute` calls to it, so it's used. Zulip Conversation: <https://rust-lang.zulipchat.com/#narrow/stream/189540-t-compiler.2Fwg-mir-opt/topic/Good.20first.20isssue/near/321849610>	2023-03-23 18:43:04 +00:00
bors	cf811810fe	Auto merge of #109172 - scottmcm:move-codegen-issues-tests, r=WaffleLapkin mv tests/codegen/issue-* tests/codegen/issues/ No changes to the contents; just a move. Like how there's a <https://github.com/rust-lang/rust/tree/master/tests/ui/issues> folder.	2023-03-23 04:11:47 +00:00
Scott McMurray	64cce5fc7d	Add `CastKind::Transmute` to MIR Updates `interpret`, `codegen_ssa`, and `codegen_cranelift` to consume the new cast instead of the intrinsic. Includes `CastTransmute` for custom MIR building, to be able to test the extra UB.	2023-03-22 15:15:41 -07:00
Matthias Krüger	44942ad10f	Rollup merge of #109394 - krasimirgg:llvm-17-vec-panic, r=nikic adapt tests/codegen/vec-shrink-panik for LLVM 17 After `0d4a709bb8` LLVM now doesn't generate references to panic_cannot_unwind: https://buildkite.com/llvm-project/rust-llvm-integrate-prototype/builds/17978#0186ff55-ca6f-4bc5-b1ec-2622c77d0ed5/744-746 Adapted as suggested by ````@nikic```` on Zulip: https://rust-lang.zulipchat.com/#narrow/stream/187780-t-compiler.2Fwg-llvm/topic/a.20couple.20codegen.20test.20failures.20after.20llvm.200d4a709bb876824a/near/342664944 >Okay, so LLVM now realizes that double panic is not possible, so that's fine.	2023-03-22 20:08:01 +01:00
Nikita Popov	58ac25b453	Increase array size in array-map.rs Make sure that the loop is not fully unrolled (which allows eliminating the allocas) in LLVM 16 either.	2023-03-22 09:30:37 +01:00
bors	ef03fda339	Auto merge of #106967 - saethlin:remove-vec-as-ptr-assume, r=thomcc Remove the assume(!is_null) from Vec::as_ptr At a guess, this code is leftover from LLVM was worse at keeping track of the niche information here. In any case, we don't need this anymore: Removing this `assume` doesn't get rid of the `nonnull` attribute on the return type.	2023-03-21 08:44:17 +00:00
Krasimir Georgiev	e4a4064480	adapt tests/codegen/vec-shrink-panik for LLVM 17 After `0d4a709bb8` LLVM now doesn't generate references to panic_cannot_unwind: @nikic: https://rust-lang.zulipchat.com/#narrow/stream/187780-t-compiler.2Fwg-llvm/topic/a.20couple.20codegen.20test.20failures.20after.20llvm.200d4a709bb876824a/near/342664944 >Okay, so LLVM now realizes that double panic is not possible, so that's fine.	2023-03-20 15:33:57 +00:00
Scott McMurray	48011e2aa4	Also move the auxiliary file	2023-03-20 10:25:29 +00:00
Scott McMurray	5dfe37a504	mv tests/codegen/issue-* tests/codegen/issues/	2023-03-20 10:25:29 +00:00
Nikita Popov	4192743ab7	Revert "Auto merge of #107224 - nikic:llvm-16, r=cuviper" This reverts commit `4a04d086ca`, reversing changes made to `2d0a7def33`.	2023-03-18 23:49:24 +01:00
bors	4a04d086ca	Auto merge of #107224 - nikic:llvm-16, r=cuviper Upgrade to LLVM 16 This updates Rust to LLVM 16. It also updates our host compiler for dist-x86_64-linux to LLVM 16. The reason for that is that Bolt from LLVM 15 is not capable of compiling LLVM 16 (https://github.com/llvm/llvm-project/issues/61114). LLVM 16.0.0 has been [released](https://discourse.llvm.org/t/llvm-16-0-0-release/69326) on March 18, while Rust 1.70 will become stable on June 1. Tested images: `dist-x86_64-linux`, `dist-riscv64-linux` (alt), `dist-x86_64-illumos`, `dist-various-1`, `dist-various-2`, `dist-powerpc-linux`, `wasm32`, `armhf-gnu` Tested images until the usual IPv6 failures: `test-various`	2023-03-18 18:14:35 +00:00
Nikita Popov	b238a76f65	Increase array size in array-map.rs Make sure that the loop is not fully unrolled (which allows eliminating the allocas) in LLVM 16 either.	2023-03-17 09:43:24 +01:00
Matthias Krüger	edd6b42565	Rollup merge of #109181 - durin42:v0-mangle-inherit_overflow, r=Nilstrieb inherit_overflow: adapt pattern to also work with v0 mangling This test was failing under new-symbol-mangling = true. Adapt pattern to work in both cases. Related to #106002 from December.	2023-03-17 08:42:39 +01:00
bors	511364e787	Auto merge of #108944 - cjgillot:clear-local-info, r=oli-obk Wrap the whole LocalInfo in ClearCrossCrate. MIR contains a lot of information about locals. The primary purpose of this information is the quality of borrowck diagnostics. This PR aims to drop this information after MIR analyses are finished, ie. starting from post-cleanup runtime MIR.	2023-03-16 19:59:56 +00:00
Nikita Popov	30331828cb	Use poison instead of undef In cases where it is legal, we should prefer poison values over undef values. This replaces undef with poison for aggregate construction and for uninhabited types. There are more places where we can likely use poison, but I wanted to stay conservative to start with. In particular the aggregate case is important for newer LLVM versions, which are not able to handle an undef base value during early optimization due to poison-propagation concerns.	2023-03-16 15:07:04 +01:00
Augie Fackler	0b9b7dd5c6	inherit_overflow: adapt pattern to also work with v0 mangling This test was failing under new-symbol-mangling = true. Adapt pattern to work in both cases. Related to #106002 from December.	2023-03-15 14:22:26 -04:00
bors	e4b9f86054	Auto merge of #109035 - scottmcm:ptr-read-should-know-undef, r=WaffleLapkin,JakobDegen Ensure `ptr::read` gets all the same LLVM `load` metadata that dereferencing does I was looking into `array::IntoIter` optimization, and noticed that it wasn't annotating the loads with `noundef` for simple things like `array::IntoIter<i32, N>`. Trying to narrow it down, it seems that was because `MaybeUninit::assume_init_read` isn't marking the load as initialized (<https://rust.godbolt.org/z/Mxd8TPTnv>), which is unfortunate since that's basically its reason to exist. The root cause is that `ptr::read` is currently implemented via the untyped `copy_nonoverlapping`, and thus the `load` doesn't get any type-aware metadata: no `noundef`, no `!range`. This PR solves that by lowering `ptr::read(p)` to `copy p` in MIR, for which the backends already do the right thing. Fortuitiously, this also improves the IR we give to LLVM for things like `mem::replace`, and fixes a couple of long-standing bugs where `ptr::read` on `Copy` types was worse than ``ing them. Zulip conversation: <https://rust-lang.zulipchat.com/#narrow/stream/219381-t-libs/topic/Move.20array.3A.3AIntoIter.20to.20ManuallyDrop/near/341189936> cc `@erikdesjardins` `@JakobDegen` `@workingjubilee` `@the8472` Fixes #106369 Fixes #73258	2023-03-15 11:44:12 +00:00
Scott McMurray	dfc3377954	Split the mem-replace codegen test Apparently in CI it's getting generated in the opposite order, one function per file will make the test pass either way.	2023-03-15 00:57:08 -07:00
Scott McMurray	e7c6ad89cf	Improved implementation and comments after code review feedback	2023-03-14 22:24:28 -07:00
Camille GILLOT	526a2c7521	ICE when checking LocalInfo on runtime MIR.	2023-03-14 20:52:42 +01:00
Matthias Krüger	39e1f810a9	Rollup merge of #109081 - krasimirgg:llvm-17-simd-wide-sum, r=nikic simd-wide-sum test: adapt for LLVM 17 codegen change After `0d4a709bb8` LLVM becomes more clever and turns ```@wider_reduce_loop``` into an alias: https://buildkite.com/llvm-project/rust-llvm-integrate-prototype/builds/17806#0186da6b-582c-46bf-a227-1565fa0859ac/743-766 This adapts the test to prevent this.	2023-03-13 21:55:38 +01:00
Krasimir Georgiev	ed8dc5d817	simd-wide-sum test: adapt for LLVM 17 codegen change After `0d4a709bb8` LLVM becomes more clever and turns `@wider_reduce_loop` into an alias: https://buildkite.com/llvm-project/rust-llvm-integrate-prototype/builds/17806#0186da6b-582c-46bf-a227-1565fa0859ac/743-766 This adapts the test to prevent this.	2023-03-13 15:07:16 +00:00
bors	cf8d98b227	Auto merge of #108623 - scottmcm:try-different-as-slice-impl, r=the8472 Move `Option::as_slice` to an always-sound implementation This approach depends on CSE to not have any branches or selects when the guessed offset is correct -- which it always will be right now -- but to also be sound (just less efficient) if the layout algorithms change such that the guess is incorrect. The codegen test confirms that CSE handles this as expected, leaving the optimal codegen. cc JakobDegen #108545	2023-03-13 13:53:24 +00:00
Scott McMurray	1f70bb8c43	Add a codegen test to confirm this fixes 73258	2023-03-12 13:23:22 -07:00
Scott McMurray	0b96fee343	Add a codegen test to confirm this fixes 106369	2023-03-12 12:57:40 -07:00
Scott McMurray	f6a57c1955	Move `Option::as_slice` to an always-sound implementation This approach depends on CSE to not have any branches or selects when the guessed offset is correct -- which it always will be right now -- but to also be sound (just less efficient) if the layout algorithms change such that the guess is incorrect.	2023-03-11 20:29:26 -08:00
Scott McMurray	b2c717fa33	`MaybeUninit::assume_init_read` should have `noundef` load metadata I was looking into `array::IntoIter` optimization, and noticed that it wasn't annotating the loads with `noundef` for simple things like `array::IntoIter<i32, N>`. Turned out to be a more general problem as `MaybeUninit::assume_init_read` isn't marking the load as initialized (<https://rust.godbolt.org/z/Mxd8TPTnv>), which is unfortunate since that's basically its reason to exist. This PR lowers `ptr::read(p)` to `copy *p` in MIR, which fortuitiously also improves the IR we give to LLVM for things like `mem::replace`.	2023-03-11 17:44:43 -08:00
bors	160c2ebeca	Auto merge of #108763 - scottmcm:indexing-nuw-lengths, r=cuviper Use `nuw` when calculating slice lengths from `Range`s An `assume` would definitely not be worth it, but since the flag is almost free we might as well tell LLVM this, especially on `_unchecked` calls where there's no obvious way for it to deduce it. (Today neither safe nor unsafe indexing gets it: <https://rust.godbolt.org/z/G1jYT548s>)	2023-03-07 13:17:59 +00:00
Scott McMurray	3554036280	Use `nuw` when calculating slice lengths from `Range`s An `assume` would definitely not be worth it, but since the flag is almost free we might as well tell LLVM this, especially on `_unchecked` calls where there's no obvious way for it to deduce it. (Today neither safe nor unsafe indexing gets it: <https://rust.godbolt.org/z/G1jYT548s>)	2023-03-05 15:15:22 -08:00
bors	816f958ac3	Auto merge of #108157 - scottmcm:tuple-gt-via-partialcmp, r=dtolnay Use `partial_cmp` to implement tuple `lt`/`le`/`ge`/`gt` In today's implementation, `(A, B)::gt` contains calls to both `A::eq` and `A::gt`. That's fine for primitives, but for things like `String`s it's kinda weird -- `(String, usize)::gt` has a call to both `bcmp` and `memcmp` (<https://rust.godbolt.org/z/7jbbPMesf>) because when `bcmp` says the `String`s aren't equal, it turns around and calls `memcmp` to find out which one's bigger. This PR changes the implementation to instead implement `(A, …, C, Z)::gt` using `A::partial_cmp`, `…::partial_cmp`, `C::partial_cmp`, and `Z::gt`. (And analogously for `lt`, `le`, and `ge`.) That way expensive comparisons don't need to be repeated. Technically this is an observable change on stable, so I've marked it `needs-fcp` + `T-libs-api` and will r? rust-lang/libs-api I'm hoping that this will be non-controversial, however, since it's very similar to the observable changes that were made to the derives (#81384 #98655) -- like those, this only changes behaviour if a type overrode behaviour in a way inconsistent with the rules for the various traits involved. (The first commit here is #108156, adding the codegen test, which I used to make sure this doesn't regress behaviour for primitives.) Zulip conversation about this change: <https://rust-lang.zulipchat.com/#narrow/stream/219381-t-libs/topic/.60.3E.60.20on.20Tuples/near/328392927>.	2023-03-05 22:02:26 +00:00
bors	864b6258fc	Auto merge of #106673 - flba-eb:add_qnx_nto_stdlib, r=workingjubilee Add support for QNX Neutrino to standard library This change: - adds standard library support for QNX Neutrino (7.1). - upgrades `libc` to version `0.2.139` which supports QNX Neutrino `@gh-tr` ⚠️ Backtraces on QNX require https://github.com/rust-lang/backtrace-rs/pull/507 which is not yet merged! (But everything else works without these changes) ⚠️ Tested mainly with a x86_64 virtual machine (see qnx-nto.md) and partially with an aarch64 hardware (some tests fail due to constrained resources).	2023-03-02 02:41:42 +00:00
bors	0b4ba4cf0e	Auto merge of #108483 - scottmcm:unify-bytewise-eq-traits, r=the8472 Merge two different equality specialization traits in `core` Arrays and slices each had their own version of this, without a matching set of `impl`s. Merge them into one (still-`pub(crate)`) `cmp::BytewiseEq` trait, so we can stop doing all these things twice. And that means that the `[T]::eq` → `memcmp` specialization picks up a bunch of types where that previously only worked for arrays, so examples like <https://rust.godbolt.org/z/KjsG8MGGT> will use it now instead of emitting loops. r? the8472	2023-03-01 23:34:37 +00:00
Scott McMurray	44eec1d9b0	Merge two different equality specialization traits in `core`	2023-03-01 14:42:06 -08:00
bors	609496eecf	Auto merge of #108446 - Zoxc:named-allocs, r=oli-obk Name LLVM anonymous constants by a hash of their contents This makes the names stable between different versions of a crate unlike the `AllocId` naming, making LLVM IR comparisons with `llvm-diff` more practical.	2023-03-01 15:36:15 +00:00
Andre Bogus	41da875fae	Add `Option::as_slice`(`_mut`) This adds the following functions: * `Option<T>::as_slice(&self) -> &[T]` * `Option<T>::as_slice_mut(&mut self) -> &[T]` The `as_slice` and `as_slice_mut` functions benefit from an optimization that makes them completely branch-free. Note that the optimization's soundness hinges on the fact that either the niche optimization makes the offset of the `Some(_)` contents zero or the mempory layout of `Option<T>` is equal to that of `Option<MaybeUninit<T>>`.	2023-03-01 00:05:31 +01:00
Florian Bartels	3ce2cd059f	Add QNX Neutrino support to libstd Co-authored-by: gh-tr <troach@qnx.com>	2023-02-28 15:59:47 +01:00
John Kåre Alsaker	b897b2d65c	Update tests	2023-02-25 21:43:25 +01:00
Ben Kimock	738c8b08d5	Remove the assume(!is_null) from Vec::as_ptr	2023-02-19 14:30:21 -05:00
bors	7aa413d592	Auto merge of #107921 - cjgillot:codegen-overflow-check, r=tmiasko Make codegen choose whether to emit overflow checks ConstProp and DataflowConstProp currently have a specific code path not to propagate constants when they overflow. This is meant to have the correct behaviour when inlining from a crate with overflow checks (like `core`) into a crate compiled without. This PR shifts the behaviour change to the `Assert(Overflow*)` MIR terminators: if the crate is compiled without overflow checks, just skip emitting the assertions. This is already what happens with `OverflowNeg`. This allows ConstProp and DataflowConstProp to transform `CheckedBinaryOp(Add, u8::MAX, 1)` into `const (0, true)`, and let codegen ignore the `true`. The interpreter is modified to conform to this behaviour. Fixes #35310	2023-02-19 18:17:26 +00:00
Camille GILLOT	c107e0e945	Fix codegen test.	2023-02-18 21:35:02 +00:00
Camille GILLOT	86dbcb5390	Add codegen test.	2023-02-18 21:35:02 +00:00
Michael Goulet	e82cc656c8	Make dyn* have the same scalar pair ABI as corresponding fat pointer	2023-02-18 19:47:34 +00:00
Michael Goulet	1f11d841b5	Add codegen test	2023-02-18 19:47:34 +00:00
bors	fabfd1fd93	Auto merge of #99679 - repnop:kernel-address-sanitizer, r=cuviper Add `kernel-address` sanitizer support for freestanding targets This PR adds support for KASan (kernel address sanitizer) instrumentation in freestanding targets. I included the minimal set of `x86_64-unknown-none`, `riscv64{imac, gc}-unknown-none-elf`, and `aarch64-unknown-none` but there's likely other targets it can be added to. (`linux_kernel_base.rs`?) KASan uses the address sanitizer attributes but has the `CompileKernel` parameter set to `true` in the pass creation.	2023-02-18 03:05:11 +00:00
Scott McMurray	680e21687d	Use `partial_cmp` to implement tuple `lt`/`le`/`ge`/`gt`	2023-02-16 23:59:13 -08:00
Scott McMurray	dc37e37329	Add a codegen test for comparisons of 2-tuples of primitives The operators are all overridden in full for tuples, so those parts pass easily, but they're worth pinning. Going via `Ord::cmp`, though, doesn't optimize away for anything but `cmp`+`is_le`. So this leaves `FIXME`s in the tests for the others.	2023-02-16 21:36:14 -08:00
bors	639377ed73	Auto merge of #107449 - saethlin:enable-copyprop, r=oli-obk Enable CopyProp r? `@tmiasko` `@rustbot` label +A-mir-opt	2023-02-16 03:44:37 +00:00
Wesley Norris	19714385e0	Add `kernel-address` sanitizer support for freestanding targets	2023-02-14 20:54:25 -05:00
Ben Kimock	37a875cbdb	Try to fix codegen tests for ??? LLVM 14 ???	2023-02-14 19:49:49 -05:00
Ben Kimock	a82adf0125	Fix codegen tests	2023-02-14 19:21:58 -05:00
Matthias Krüger	a1ba861190	Rollup merge of #107573 - cuviper:drop-llvm-13, r=nagisa Update the minimum external LLVM to 14 With this change, we'll have stable support for LLVM 14 through 16 (pending release). For reference, the previous increase to LLVM 13 was #100460.	2023-02-14 18:24:40 +01:00
bors	2d91939bb7	Auto merge of #107634 - scottmcm:array-drain, r=thomcc Improve the `array::map` codegen The `map` method on arrays [is documented as sometimes performing poorly](https://doc.rust-lang.org/std/primitive.array.html#note-on-performance-and-stack-usage), and after [a question on URLO](https://users.rust-lang.org/t/try-trait-residual-o-trait-and-try-collect-into-array/88510?u=scottmcm) prompted me to take another look at the core [`try_collect_into_array`](`7c46fb2111/library/core/src/array/mod.rs (L865-L912)`) function, I had some ideas that ended up working better than I'd expected. There's three main ideas in here, split over three commits: 1. Don't use `array::IntoIter` when we can avoid it, since that seems to not get SRoA'd, meaning that every step writes things like loop counters into the stack unnecessarily 2. Don't return arrays in `Result`s unnecessarily, as that doesn't seem to optimize away even with `unwrap_unchecked` (perhaps because it needs to get moved into a new LLVM type to account for the discriminant) 3. Don't distract LLVM with all the `Option` dances when we know for sure we have enough items (like in `map` and `zip`). This one's a larger commit as to do it I ended up adding a new `pub(crate)` trait, but hopefully those changes are still straight-forward. (No libs-api changes; everything should be completely implementation-detail-internal.) It's still not completely fixed -- I think it needs pcwalton's `memcpy` optimizations still (#103830) to get further -- but this seems to go much better than before. And the remaining `memcpy`s are just `transmute`-equivalent (`[T; N] -> ManuallyDrop<[T; N]>` and `[MaybeUninit<T>; N] -> [T; N]`), so hopefully those will be easier to remove with LLVM16 than the previous subobject copies 🤞 r? `@thomcc` As a simple example, this test ```rust pub fn long_integer_map(x: [u32; 64]) -> [u32; 64] { x.map(\|x\| 13 * x + 7) } ``` On nightly <https://rust.godbolt.org/z/xK7548TGj> takes `sub rsp, 808` ```llvm start: %array.i.i.i.i = alloca [64 x i32], align 4 %_3.sroa.5.i.i.i = alloca [65 x i32], align 4 %_5.i = alloca %"core::iter::adapters::map::Map<core::array::iter::IntoIter<u32, 64>, [closure@/app/example.rs:2:11: 2:14]>", align 8 ``` (and yes, that's a 65-element array `alloca` despite 64-element input and output) But with this PR it's only `sub rsp, 520` ```llvm start: %array.i.i.i.i.i.i = alloca [64 x i32], align 4 %array1.i.i.i = alloca %"core::mem::manually_drop::ManuallyDrop<[u32; 64]>", align 4 ``` Similarly, the loop it emits on nightly is scalar-only and horrifying ```nasm .LBB0_1: mov esi, 64 mov edi, 0 cmp rdx, 64 je .LBB0_3 lea rsi, [rdx + 1] mov qword ptr [rsp + 784], rsi mov r8d, dword ptr [rsp + 4rdx + 528] mov edi, 1 lea edx, [r8 + 2r8] lea r8d, [r8 + 4rdx] add r8d, 7 .LBB0_3: test edi, edi je .LBB0_11 mov dword ptr [rsp + 4rcx + 272], r8d cmp rsi, 64 jne .LBB0_6 xor r8d, r8d mov edx, 64 test r8d, r8d jne .LBB0_8 jmp .LBB0_11 .LBB0_6: lea rdx, [rsi + 1] mov qword ptr [rsp + 784], rdx mov edi, dword ptr [rsp + 4rsi + 528] mov r8d, 1 lea esi, [rdi + 2rdi] lea edi, [rdi + 4rsi] add edi, 7 test r8d, r8d je .LBB0_11 .LBB0_8: mov dword ptr [rsp + 4rcx + 276], edi add rcx, 2 cmp rcx, 64 jne .LBB0_1 ``` whereas with this PR it's unrolled and vectorized ```nasm vpmulld ymm1, ymm0, ymmword ptr [rsp + 64] vpaddd ymm1, ymm1, ymm2 vmovdqu ymmword ptr [rsp + 328], ymm1 vpmulld ymm1, ymm0, ymmword ptr [rsp + 96] vpaddd ymm1, ymm1, ymm2 vmovdqu ymmword ptr [rsp + 360], ymm1 ``` (though sadly still stack-to-stack)	2023-02-13 10:18:48 +00:00
Ben Kimock	640ede7b0a	Enable CopyProp by default, tune the impl a bit	2023-02-12 13:23:53 -05:00
Josh Stone	a06aaa4a9e	Update the minimum external LLVM to 14	2023-02-10 16:06:25 -08:00
Matthias Krüger	8fc9ed51f0	Rollup merge of #107043 - Nilstrieb:true-and-false-is-false, r=wesleywiser Support `true` and `false` as boolean flag params Implements [MCP 577](https://github.com/rust-lang/compiler-team/issues/577).	2023-02-10 06:09:56 +01:00
Oleksii Lozovskyi	54b26f49e6	Test XRay only for supported targets Now that the compiler accepts "-Z instrument-xray" option only when targeting one of the supported targets, make sure to not run the codegen tests where the compiler will fail. Like with other compiletests, we don't have access to internals, so simply hardcode a list of supported architectures here.	2023-02-09 12:29:43 +09:00
Oleksii Lozovskyi	0fef658ffe	Codegen tests for -Z instrument-xray Let's add at least some tests to verify that this option is accepted and produces expected LLVM attributes. More tests can be added later with attribute support.	2023-02-09 12:28:00 +09:00
Ralf Jung	1ef16874b5	also do not add noalias on not-Unpin Box	2023-02-06 12:17:41 +01:00
Ralf Jung	ea541bc2ee	make &mut !Unpin not dereferenceable See https://github.com/rust-lang/unsafe-code-guidelines/issues/381 for discussion.	2023-02-06 11:46:37 +01:00
Ralf Jung	201ae73872	make PointerKind directly reflect pointer types The code that consumes PointerKind (`adjust_for_rust_scalar` in rustc_ty_utils) ended up using PointerKind variants to talk about Rust reference types (& and &mut) anyway, making the old code structure quite confusing: one always had to keep in mind which PointerKind corresponds to which type. So this changes PointerKind to directly reflect the type. This does not change behavior.	2023-02-06 11:46:32 +01:00
Scott McMurray	bb77860d9c	Add another autovectorization codegen test using array zip-map	2023-02-04 16:44:53 -08:00
Scott McMurray	5bc328fdef	Allow canonicalizing the `array::map` loop in trusted cases	2023-02-04 16:44:51 -08:00
Scott McMurray	52df0558ea	Stop forcing `array::map` through an unnecessary `Result`	2023-02-04 16:41:35 -08:00
Scott McMurray	5a7342c3dd	Stop using `into_iter` in `array::map`	2023-02-04 16:41:35 -08:00
Matthias Krüger	c89bb159f6	Rollup merge of #107373 - michaelwoerister:dont-merge-vtables-when-debuginfo, r=WaffleLapkin Don't merge vtables when full debuginfo is enabled. This PR makes the compiler not emit the `unnamed_addr` attribute for vtables when full debuginfo is enabled, so that they don't get merged even if they have the same contents. This allows debuggers to more reliably map from a dyn pointer to the self-type of a trait object by looking at the vtable's debuginfo. The PR only changes the behavior of the LLVM backend as other backends don't emit vtable debuginfo (as far as I can tell). The performance impact of this change should be small as [measured](https://github.com/rust-lang/rust/pull/103514#issuecomment-1290833854) in a previous PR.	2023-01-28 05:20:19 +01:00
Matthias Krüger	7b78b6a78d	Rollup merge of #107022 - scottmcm:ordering-option-eq, r=m-ou-se Implement `SpecOptionPartialEq` for `cmp::Ordering` Noticed as I continue to explore options for having code using `partial_cmp` optimize better. Before: ```llvm ; Function Attrs: mustprogress nofree nosync nounwind willreturn uwtable define noundef zeroext i1 `@ordering_eq(i8` noundef %0, i8 noundef %1) unnamed_addr #0 { start: %2 = icmp eq i8 %0, 2 br i1 %2, label %bb1.i, label %bb3.i bb1.i: ; preds = %start %3 = icmp eq i8 %1, 2 br label %"_ZN55_$LT$T$u20$as$u20$core..option..SpecOptionPartialEq$GT$2eq17hb7e7beacecde585fE.exit" bb3.i: ; preds = %start %.not.i = icmp ne i8 %1, 2 %4 = icmp eq i8 %0, %1 %spec.select.i = and i1 %.not.i, %4 br label %"_ZN55_$LT$T$u20$as$u20$core..option..SpecOptionPartialEq$GT$2eq17hb7e7beacecde585fE.exit" "_ZN55_$LT$T$u20$as$u20$core..option..SpecOptionPartialEq$GT$2eq17hb7e7beacecde585fE.exit": ; preds = %bb1.i, %bb3.i %.0.i = phi i1 [ %3, %bb1.i ], [ %spec.select.i, %bb3.i ] ret i1 %.0.i } ``` After: ```llvm ; Function Attrs: mustprogress nofree norecurse nosync nounwind readnone willreturn uwtable define noundef zeroext i1 `@ordering_eq(i8` noundef %0, i8 noundef %1) unnamed_addr #1 { start: %2 = icmp eq i8 %0, %1 ret i1 %2 } ``` (Which <https://alive2.llvm.org/ce/z/-rop5r> says LLVM could just do itself, but there's probably an issue already open for that problem from when this was originally looked at for `Option<NonZeroU8>` and friends.)	2023-01-28 05:20:15 +01:00
Michael Woerister	e5995e6168	Don't merge vtables when full debuginfo is enabled.	2023-01-27 15:29:04 +00:00
Erik Desjardins	009192b01b	abi: add `AddressSpace` field to `Primitive::Pointer` ...and remove it from `PointeeInfo`, which isn't meant for this. There are still various places (marked with FIXMEs) that assume all pointers have the same size and alignment. Fixing this requires parsing non-default address spaces in the data layout string, which will be done in a followup.	2023-01-22 23:41:39 -05:00
bors	705a96d39b	Auto merge of #106989 - clubby789:is-zero-num, r=scottmcm Implement `alloc::vec::IsZero` for `Option<$NUM>` types Fixes #106911 Mirrors the `NonZero$NUM` implementations with an additional `assert_zero_valid`. `None::<i32>` doesn't stricly satisfy `IsZero` but for the purpose of allocating we can produce more efficient codegen.	2023-01-19 08:04:26 +00:00
Scott McMurray	3122db7d03	Implement `SpecOptionPartialEq` for `cmp::Ordering`	2023-01-18 19:19:28 -08:00
Nilstrieb	a6fda3ee7f	Support `true` and `false` as boolean flag params Implements MCP 577.	2023-01-18 20:46:36 +01:00
clubby789	b94a29a25f	Implement `alloc::vec::IsZero` for `Option<$NUM>` types	2023-01-18 15:15:15 +00:00
Matthias Krüger	c96dac16c3	Rollup merge of #106995 - lukas-code:align_offset_assembly_test, r=cuviper bump failing assembly & codegen tests from LLVM 14 to LLVM 15 These tests need LLVM 15. Found by ```@Robert-Cunningham``` in https://github.com/rust-lang/rust/pull/100601#issuecomment-1385400008 Passed tests at 006506e93fc80318ebfd7939fe1fd4dc19ecd8cb in https://github.com/rust-lang/rust/actions/runs/3942442730/jobs/6746104740.	2023-01-18 06:59:21 +01:00
Lukas Markeffsky	1216cc7f1c	bump failing assembly & codegen tests from LLVM 14 to LLVM 15	2023-01-17 20:02:01 +01:00
Nilstrieb	f1255380ac	Add more codegen tests	2023-01-17 16:23:22 +01:00
Nilstrieb	af23ad93cd	Improve comments	2023-01-17 08:14:35 +01:00
Nilstrieb	645c0fddd2	Put `noundef` on all scalars that don't allow uninit Previously, it was only put on scalars with range validity invariants like bool, was uninit was obviously invalid for those. Since then, we have normatively declared all uninit primitives to be undefined behavior and can therefore put `noundef` on them. The remaining concern was the `mem::uninitialized` function, which cause quite a lot of UB in the older parts of the ecosystem. This function now doesn't return uninit values anymore, making users of it safe from this change. The only real sources of UB where people could encounter uninit primitives are `MaybeUninit::uninit().assume_init()`, which has always be clear in the docs about being UB and from heap allocations (like reading from the spare capacity of a vec. This is hopefully rare enough to not break anything.	2023-01-17 08:14:35 +01:00
The 8472	9db0134018	replace manual ptr arithmetic with ptr_sub	2023-01-15 17:38:05 +01:00
Nicholas Bishop	46f9e878f6	Stabilize `abi_efiapi` feature Tracking issue: https://github.com/rust-lang/rust/issues/65815	2023-01-11 20:42:13 -05:00
Ben Kimock	13eec69e1c	Add a regression test for argument copies with DestinationPropagation	2023-01-11 10:27:06 -05:00
Albert Larsan	40ba0e84d5	Change `src/test` to `tests` in source files, fix tidy and tests	2023-01-11 09:32:13 +00:00
Albert Larsan	cf2dff2b1e	Move /src/test to /tests	2023-01-11 09:32:08 +00:00

... 3 4 5 6 7

328 Commits