mirror of https://github.com/rust-lang/rust.git synced 2025-06-18 18:47:29 +00:00

Author	SHA1	Message	Date
Nikita Popov	31cc4c074d	Emit getelementptr inbounds nuw for pointer::add()	2025-02-19 11:32:32 +01:00
Nikita Popov	5e9d8a7d55	Switch to the LLVMBuildGEPWithNoWrapFlags API This API allows us to set the nuw flag as well.	2025-02-19 11:32:32 +01:00
Matthias Krüger	2bd65ebede	Rollup merge of #137210 - workingjubilee:fixup-passmode-import, r=RalfJung compiler: Stop reexporting stuff in cg_llvm::abi The reexports confuse tooling like rustdoc into thinking cg_llvm is the source of key types that originate in rustc_target.	2025-02-19 01:30:12 +01:00
Jubilee Young	2d2de18166	compiler: Stop reexporting stuff in cg_llvm::abi The reexports confuse tooling like rustdoc into thinking cg_llvm is the source of key types that originate in rustc_target.	2025-02-18 00:31:29 -08:00
bors	3b022d8cee	Auto merge of #133852 - x17jiri:cold_path, r=saethlin improve cold_path() #120370 added a new intrinsic `cold_path()` and used it to fix `likely` and `unlikely`. However, in order to limit scope, the information about cold code paths is only used in 2-target switch instructions. This is sufficient for `likely` and `unlikely`, but limits the usefulness of `cold_path` for idiomatic Rust. For example, code like this: ``` if let Some(x) = y { ... } ``` may generate a 3-target switch: ``` switch y.discriminator: 0 => true branch 1 => false branch _ => unreachable ``` and therefore marking a branch as cold will have no effect. This PR improves `cold_path()` to work with arbitrary switch instructions. Note that for 2-target switches, we can use `llvm.expect`, but for multiple targets we need to manually emit branch weights. I checked Clang and it also emits weights in this situation. Clang's weight calculation is more complex than this PR's, which I believe is mainly because `switch` in `C/C++` can have multiple cases going to the same target.	2025-02-18 07:49:09 +00:00
Nicholas Nethercote	fd7b4bf4e1	Move methods from `Map` to `TyCtxt`, part 2. Continuing the work started in #136466. Every method gains a `hir_` prefix, though for the ones that already have a `par_` or `try_par_` prefix I added the `hir_` after that.	2025-02-18 10:17:44 +11:00
Jiri Bobek	7bb5f4dd78	improve cold_path()	2025-02-17 06:39:58 +01:00
Matthias Krüger	fab38375bc	Rollup merge of #137095 - saethlin:use-hash64-for-hashes, r=workingjubilee Replace some u64 hashes with Hash64 I introduced the Hash64 and Hash128 types in https://github.com/rust-lang/rust/pull/110083, essentially as a mechanism to prevent hashes from landing in our leb128 encoding paths. If you just have a u64 or u128 field in a struct then derive Encodable/Decodable, that number gets leb128 encoding. So if you need to store a hash or some other value which behaves very close to a hash, don't store it as a u64. This reverts part of https://github.com/rust-lang/rust/pull/117603, which turned an encoded Hash64 into a u64. Based on https://github.com/rust-lang/rust/pull/110083, I don't expect this to be perf-sensitive on its own, though I expect that it may help stabilize some of the small rmeta size fluctuations we currently see in perf reports.	2025-02-17 06:38:14 +01:00
Ben Kimock	4cf21866e8	Move hashes from rustc_data_structure to rustc_hashes so they can be shared with rust-analyzer	2025-02-16 16:18:30 -05:00
Jacob Pratt	d3556c6644	Rollup merge of #136545 - durin42:nvptx64-align, r=nikic nvptx64: update default alignment to match LLVM 21 This changed in llvm/llvm-project@91cb8f5d32. The commit itself is mostly about some intrinsic instructions, but as an aside it also mentions something about addrspace for tensor memory, which I believe is what this string is telling us. `@rustbot` label: +llvm-main	2025-02-16 00:51:24 -05:00
bors	bdc97d1046	Auto merge of #136575 - scottmcm:nsuw-math, r=nikic Set both `nuw` and `nsw` in slice size calculation There's an old note in the code to do this, and now that [LLVM-C has an API for it](`f0b8ff1251/llvm/include/llvm-c/Core.h (L4403-L4408)`), we might as well. And it's been there since what looks like LLVM 17 `de9b6aa341` so doesn't even need to be conditional. (There's other places, like `RawVecInner` or `Layout`, that might want to do things like this too, but I'll leave those for a future PR.)	2025-02-14 14:21:29 +00:00
bjorn3	736ef0a4ce	Don't error when adding a staticlib with bitcode files compiled by newer LLVM	2025-02-14 10:54:21 +00:00
bors	905b1bf1cc	Auto merge of #137010 - workingjubilee:rollup-g00c07v, r=workingjubilee Rollup of 9 pull requests Successful merges: - #135439 (Make `-O` mean `OptLevel::Aggressive`) - #136460 (Simplify `rustc_span` `analyze_source_file`) - #136904 (add `IntoBounds` trait) - #136908 ([AIX] expect `EINVAL` for `pthread_mutex_destroy`) - #136924 (Add profiling of bootstrap commands using Chrome events) - #136951 (Use the right binder for rebinding `PolyTraitRef`) - #136981 (ci: switch loongarch jobs to free runners) - #136992 (Update backtrace) - #136993 ([cg_llvm] Remove dead error message) r? `@ghost` `@rustbot` modify labels: rollup	2025-02-14 06:13:42 +00:00
Jubilee	e8d0d00798	Rollup merge of #136993 - dpaoliello:cleanllvm4, r=workingjubilee [cg_llvm] Remove dead error message Part of #135502 Discovered a dead error message in rustc_codegen_llvm, so removing it. r? ``@Zalathar``	2025-02-13 21:37:54 -08:00
Scott McMurray	9ad6839f7a	Set both `nuw` and `nsw` in slice size calculation There's an old note in the code to do this, and now that LLVM-C has an API for it, we might as well.	2025-02-13 21:26:48 -08:00
Jubilee	864eba9fb1	Rollup merge of #136895 - maurer:fix-enum-discr, r=nikic debuginfo: Set bitwidth appropriately in enum variant tags Previously, we unconditionally set the bitwidth to 128-bits, the largest an enum would possibly be. Then, LLVM would cut down the constant by chopping off leading zeroes before emitting the DWARF. LLVM only supported 64-bit enumerators, so this would also have occasionally resulted in truncated data. LLVM added support for 128-bit enumerators in llvm/llvm-project#125578 That patchset trusts the constant to describe how wide the variant tag is, so the high 64-bits of zeros are considered potentially load-bearing. As a result, we went from emitting tags that looked like: DW_AT_discr_value (0xfe) (because `dwarf::BestForm` selected `data1`) to emitting tags that looked like: DW_AT_discr_value (<0x10> fe ff ff ff 00 00 00 00 00 00 00 00 00 00 00 00 ) This makes the `DW_AT_discr_value` encode at the bitwidth of the tag, which: 1. Is probably closer to our intentions in terms of describing the data. 2. Doesn't invoke the 128-bit support which may not be supported by all debuggers / downstream tools. 3. Will result in smaller debug information.	2025-02-13 17:46:08 -08:00
Daniel Paoliello	bfdc96114c	[cg_llvm] Remove dead error message	2025-02-13 15:04:39 -08:00
clubby789	2966256133	Make `-O` mean `-C opt-level=3`	2025-02-13 19:47:55 +00:00
Jacob Pratt	f7d5285062	Rollup merge of #136881 - dpaoliello:cleanllvm3, r=Zalathar cg_llvm: Reduce visibility of all functions in the llvm module Next part of #135502 This reduces the visibility of all functions in the `llvm` module to `pub(crate)` and marks the `enzyme_ffi` modules with `#![expect(dead_code)]` (as previously discussed: <https://github.com/rust-lang/rust/pull/135502#discussion_r1915608085>). r? ``@Zalathar``	2025-02-13 03:53:31 -05:00
Jacob Pratt	1f669fdc7d	Rollup merge of #136858 - safinaskar:parallel-cleanup-2025-02-11-07-54, r=SparrowLii Parallel-compiler-related cleanup I carefully split changes into commits. Commit messages are self-explanatory. Squashing is not recommended. cc "Parallel Rustc Front-end" https://github.com/rust-lang/rust/issues/113349 r? SparrowLii ``@rustbot`` label: +WG-compiler-parallel	2025-02-13 03:53:31 -05:00
Daniel Paoliello	e7cef26a3d	cg_llvm: Reduce visibility of all functions in the llvm module	2025-02-13 12:36:25 +11:00
Zalathar	659e20fa75	Remove `LLVMGetModuleContext` This was unused after the removal of `-Zprofile` in #131829.	2025-02-13 12:36:09 +11:00
Jacob Pratt	33c186baf7	Rollup merge of #136807 - workingjubilee:merge-gpus-to-get-the-arcradeongeforce, r=bjorn3 compiler: internally merge `PtxKernel` into `GpuKernel` r? ``@bjorn3`` for review	2025-02-12 20:10:00 -05:00
Jacob Pratt	0de2341fef	Rollup merge of #136217 - taiki-e:csky-asm-flags, r=Amanieu Mark condition/carry bit as clobbered in C-SKY inline assembly C-SKY's compare and some arithmetic/logical instructions modify condition/carry bit (C) in PSR, but there is currently no way to mark it as clobbered in `asm!`. This PR marks it as clobbered except when [`options(preserves_flags)`](https://doc.rust-lang.org/reference/inline-assembly.html#r-asm.options.supported-options.preserves_flags) is used. Refs: - Section 1.3 "Programming model" and Section 1.3.5 "Condition/carry bit" in CSKY Architecture user_guide: `9f7121f7d4/CSKY%20Architecture%20user_guide.pdf` > Under user mode, condition/carry bit (C) is located in the lowest bit of PSR, and it can be accessed and changed by common user instructions. It is the only data bit that can be visited under user mode in PSR. > Condition or carry bit represents the result after one operation. Condition/carry bit can be clearly set according to the results of compare instructions or unclearly set as some high-precision arithmetic or logical instructions. In addition, special instructions such as DEC[GT,LT,NE] and XTRB[0-3] will influence the value of condition/carry bit. - Register definition in LLVM: https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/CSKY/CSKYRegisterInfo.td#L88 cc ```@Dirreke``` ([target maintainer](`aa6f5ab18e/src/doc/rustc/src/platform-support/csky-unknown-linux-gnuabiv2.md (target-maintainers)`)) r? ```@Amanieu``` ```@rustbot``` label +O-csky +A-inline-assembly	2025-02-12 20:09:58 -05:00
Jacob Pratt	a53cd3c979	Rollup merge of #135025 - Flakebi:alloca-addrspace, r=nikic Cast allocas to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if an `alloca` is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for `alloca`s is not 0. For example the following code compiles incorrectly when not casting the address space to the default one: ```rust fn f(p: *const i8 /* addrspace(0) */) -> *const i8 /* addrspace(0) */ { let local = 0i8; /* addrspace(5) */ let res = if cond { p } else { &raw const local }; res } ``` results in ```llvm %local = alloca addrspace(5) i8 %res = alloca addrspace(5) ptr if: ; Store 64-bit flat pointer store ptr %p, ptr addrspace(5) %res else: ; Store 32-bit scratch pointer store ptr addrspace(5) %local, ptr addrspace(5) %res ret: ; Load and return 64-bit flat pointer %res.load = load ptr, ptr addrspace(5) %res ret ptr %res.load ``` For amdgpu, `addrspace(0)` are 64-bit pointers, `addrspace(5)` are 32-bit pointers. The above code may store a 32-bit pointer and read it back as a 64-bit pointer, which is obviously wrong and cannot work. Instead, we need to `addrspacecast %local to ptr addrspace(0)`, then we store and load the correct type. Tracking issue: #135024	2025-02-12 20:09:56 -05:00
Matthew Maurer	d82219a4fa	debuginfo: Set bitwidth appropriately in enum variant tags Previously, we unconditionally set the bitwidth to 128-bits, the largest a discriminator would possibly be. Then, LLVM would cut down the constant by chopping off leading zeroes before emitting the DWARF. LLVM only supported 64-bit discriminators, so this would also have occasionally resulted in truncated data (or an assert) if more than 64-bits were used. LLVM added support for 128-bit enumerators in llvm/llvm-project#125578 That patchset also trusts the constant to describe how wide the variant tag is. As a result, we went from emitting tags that looked like: DW_AT_discr_value (0xfe) (`form1`) to emitting tags that looked like: DW_AT_discr_value (<0x10> fe ff ff ff 00 00 00 00 00 00 00 00 00 00 00 00 ) This makes the `DW_AT_discr_value` encode at the bitwidth of the tag, which: 1. Is probably closer to our intentions in terms of describing the data. 2. Doesn't invoke the 128-bit support which may not be supported by all debuggers / downstream tools. 3. Will result in smaller debug information.	2025-02-12 18:01:42 +00:00
Matthias Krüger	9e89feefb9	Rollup merge of #135549 - oli-obk:push-tmxtpnrloyqu, r=compiler-errors Document some safety constraints and use more safe wrappers Lots of unsafe codegen_llvm code has safe wrappers already, so I used some of them and added some where applicable. I stopped here because this diff is large enough and should probably be reviewed independently of other changes.	2025-02-12 06:07:35 +01:00
Oli Scherer	dcf1e4d72b	Document some safety constraints and use more safe wrappers	2025-02-11 09:47:13 +00:00
Oli Scherer	4b83038d63	Add a safe wrapper for `WriteBitcodeToFile`	2025-02-11 09:41:22 +00:00
Oli Scherer	b2cd1b8ead	Remove an unsafe closure invariant by inlining the closure wrapper into the called function	2025-02-11 09:41:22 +00:00
Askar Safin	51f49d8464	compiler/rustc_codegen_llvm/src/lib.rs: remove "unsafe impl Send/Sync"	2025-02-11 09:58:53 +03:00
Jacob Pratt	c49ffaf7eb	Rollup merge of #136813 - mrkajetanp:aarch32-fp16-target-feature, r=davidtwco rustc_target: Add the fp16 target feature for AArch32 As in the commit description. The feature is already available in rustc for AArch64.	2025-02-11 01:02:41 -05:00
Jacob Pratt	6153a8dcea	Rollup merge of #136721 - dpaoliello:cleanllvm2, r=Zalathar cg_llvm: Reduce visibility of some items outside the `llvm` module Next piece of #135502 This reduces the visibility of items (other than those in the `llvm` module) so that dead code analysis will correctly identify unused items.	2025-02-11 01:02:40 -05:00
Flakebi	cde7e805ad	Cast allocas to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if an `alloca` is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for `alloca`s is not 0. For example the following code compiles incorrectly when not casting the address space to the default one: ```rust fn f(p: *const i8 /* addrspace(0) */) -> *const i8 /* addrspace(0) */ { let local = 0i8; /* addrspace(5) */ let res = if cond { p } else { &raw const local }; res } ``` results in ```llvm %local = alloca addrspace(5) i8 %res = alloca addrspace(5) ptr if: ; Store 64-bit flat pointer store ptr %p, ptr addrspace(5) %res else: ; Store 32-bit scratch pointer store ptr addrspace(5) %local, ptr addrspace(5) %res ret: ; Load and return 64-bit flat pointer %res.load = load ptr, ptr addrspace(5) %res ret ptr %res.load ``` For amdgpu, `addrspace(0)` are 64-bit pointers, `addrspace(5)` are 32-bit pointers. The above code may store a 32-bit pointer and read it back as a 64-bit pointer, which is obviously wrong and cannot work. Instead, we need to `addrspacecast %local to ptr addrspace(0)`, then we store and load the correct type.	2025-02-10 21:38:44 +01:00
Daniel Paoliello	5f29273921	rustc_codegen_llvm: Mark items as pub(crate) outside of the llvm module	2025-02-10 10:17:25 -08:00
Matthias Krüger	78f5bddd57	Rollup merge of #136419 - EnzymeAD:autodiff-tests, r=onur-ozkan,jieyouxu adding autodiff tests I'd like to get started with upstreaming some tests, even though I'm still waiting for an answer on how to best integrate the enzyme pass. Can we therefore temporarily support the -Z llvm-plugins here without too much effort? And in that case, how would that work? I saw you can do remapping, e.g. `rust-src-base`, but I don't think that will give me the path to libEnzyme.so. Do you have another suggestion? Other than that this test simply checks that the derivative of `x * x` is `2.0 * x`, which in this case is computed as `%0 = fadd fast double %x.0.val, %x.0.val` (I'll add a few more tests and move it to an autodiff folder if we can use the -Z flag) r? ``@jieyouxu`` Locally at least `-Zllvm-plugins=${PWD}/build/x86_64-unknown-linux-gnu/enzyme/build/Enzyme/libEnzyme-19.so` seems to work if I copy the command I get from x.py test and run it manually. However, running x.py test itself fails. Tracking: - https://github.com/rust-lang/rust/issues/124509 Zulip discussion: https://rust-lang.zulipchat.com/#narrow/channel/326414-t-infra.2Fbootstrap/topic/Enzyme.20build.20changes	2025-02-10 16:38:23 +01:00
Jubilee	7f8108afc8	Rollup merge of #136053 - Zalathar:defer-counters, r=saethlin coverage: Defer part of counter-creation until codegen Follow-up to #135481 and #135873. One of the pleasant properties of the new counter-assignment algorithm is that we can stop partway through the process, store the intermediate state in MIR, and then resume the rest of the algorithm during codegen. This lets it take into account which parts of the control-flow graph were eliminated by MIR opts, resulting in fewer physical counters and simpler counter expressions. Those improvements end up completely obsoleting much larger chunks of code that were previously responsible for cleaning up the coverage metadata after MIR opts, while also doing a more thorough cleanup job. (That change also unlocks some further simplifications that I've kept out of this PR to limit its scope.)	2025-02-10 00:51:49 -08:00
Jubilee Young	e11e2b4d09	compiler: internally merge `Conv::PtxKernel` into `GpuKernel` It is speculated that these two can be conceptually merged, and it can start by ripping out rustc's notion of the PtxKernel call convention. Leave the ExternAbi for now, but the nvptx target now should see it as just a different way to spell Conv::GpuKernel.	2025-02-09 23:14:55 -08:00
Manuel Drehwald	061abbc369	remove outdated *First autodiff variants for higher-order ad	2025-02-10 01:35:53 -05:00
Manuel Drehwald	1221cff551	move second opt run to lto phase and cleanup code	2025-02-10 01:35:22 -05:00
bors	124cc92199	Auto merge of #136751 - bjorn3:update_rustfmt, r=Mark-Simulacrum Update bootstrap compiler and rustfmt The rustfmt version we previously used formats things differently from what the latest nightly rustfmt does. This causes issues for subtrees that get formatted both in-tree and in their own repo. Updating the rustfmt used in-tree solves those issues. Also bumped the bootstrap compiler as the stage0 update command always updates both at the same time.	2025-02-09 15:44:16 +00:00
bors	a26e97be88	Auto merge of #136754 - Urgau:rollup-qlkhjqr, r=Urgau Rollup of 5 pull requests Successful merges: - #134679 (Windows: remove readonly files) - #136213 (Allow Rust to use a number of libc filesystem calls) - #136530 (Implement `x perf` directly in bootstrap) - #136601 (Detect (non-raw) borrows of null ZST pointers in CheckNull) - #136659 (Pick the max DWARF version when LTO'ing modules with different versions ) r? `@ghost` `@rustbot` modify labels: rollup	2025-02-09 12:54:26 +00:00
Jubilee	5e4d6278af	Rollup merge of #136706 - workingjubilee:finish-up-rustc-abi-updates, r=compiler-errors compiler: mostly-finish `rustc_abi` updates This almost-finishes all the updates in the compiler to use `rustc_abi` and removes some of the reexports of `rustc_abi` items in `rustc_target` that were previously available. r? ```@compiler-errors```	2025-02-08 20:41:21 -08:00
Urgau	5ec56e5fbb	Rollup merge of #136659 - wesleywiser:dwarf_version_lto_merge_behavior, r=jieyouxu Pick the max DWARF version when LTO'ing modules with different versions Currently, when rustc compiles code with `-Clto` enabled that was built with different choices for `-Zdwarf-version`, a warning will be reported. It's very easy to observe this by compiling most anything (eg, "hello world") and specifying `-Clto -Zdwarf-version=5` since the standard library is distributed with `-Zdwarf-version=4`. This behavior isn't actually useful for a few reasons: - From observation, LLVM chooses to pick the highest DWARF version anyway after issuing the warning. - Clang specifies that in this case, the max version should be picked without a warning and as a general principle, we want to support x-lang LTO with Clang which implies using the same module flag merge behaviors. - Debuggers need to be able to handle a variety of versions within the same debugging session as you can easily have some parts of a binary (or some dynamic libraries within an application) all compiled with different DWARF versions. This commit changes the module flag merge behavior to match Clang and use the highest version of DWARF. It also adds a test to ensure this behavior is respected in the case of two crates being LTO'd together and adds a test to ensure no warning is printed. Fixes #130041 which fails due to these warnings being printed cc #103057	2025-02-09 00:37:28 +01:00
bjorn3	1fcae03369	Rustfmt	2025-02-08 22:12:13 +00:00
Wesley Wiser	bbc40e7822	Pick the max DWARF version when LTO'ing modules with different versions Currently, when rustc compiles code with `-Clto` enabled that was built with different choices for `-Zdwarf-version`, a warning will be reported. It's very easy to observe this by compiling most anything (eg, "hello world") and specifying `-Clto -Zdwarf-version=5` since the standard library is distributed with `-Zdwarf-version=4`. This behavior isn't actually useful for a few reasons: - from observation, LLVM chooses to pick the highest DWARF version anyway after issuing the warning - Clang specifies that in this case, the max version should be picked without a warning and as a general principle, we want to support x-lang LTO with Clang which implies using the same module flag merge behaviors - Debuggers need to be able to handle a variety of versions within the same debugging session as you can easily have some parts of a binary (or some dynamic libraries within an application) all compiled with different DWARF versions This commit changes the module flag merge behavior to match Clang and use the highest version of DWARF. It also adds a test to ensure this behavior is respected in the case of two crates being LTO'd together and updates the test added in the previous commit to ensure no warning is printed.	2025-02-08 16:33:36 +00:00
Manuel Drehwald	21d096184e	fix non-enzyme builds	2025-02-07 22:27:46 -05:00
Matthias Krüger	c9771e9590	Rollup merge of #136691 - bjorn3:linkage_cleanup, r=jieyouxu Remove Linkage::Private and Linkage::Appending Neither of them has any use case. Neither known nor theoretical.	2025-02-08 03:58:48 +01:00
Matthias Krüger	93b194516a	Rollup merge of #136640 - Zalathar:debuginfo-align-bits, r=compiler-errors Debuginfo for function ZSTs should have alignment of 8 bits, not 1 bit In #116096, function ZSTs were made to have debuginfo that gives them an alignment of “1”. But because alignment in LLVM debuginfo is denoted in bits, not bytes, this resulted in an alignment specification of 1 bit instead of 1 byte. I don't know whether this has any practical consequences, but I noticed that a test started failing when I accidentally fixed the mistake while working on #136632, so I extracted the fix (and the test adjustment) to this PR.	2025-02-08 03:58:45 +01:00
Jubilee Young	eddfe8f503	compiler: remove reexports from rustc_target::callconv	2025-02-07 11:25:18 -08:00
Kajetan Puchalski	53f9852224	rustc_target: Add the fp16 target feature for AArch32	2025-02-07 18:08:19 +00:00
bjorn3	f68cd90412	Remove Linkage::Appending It can only be used for certain LLVM internal variables like llvm.global_ctors which users are not allowed to define.	2025-02-07 16:02:19 +00:00
bjorn3	382e4031c2	Remove Linkage::Private This is the same as Linkage::Internal except that it doesn't emit any symbol. Some backends may not support it and it isn't all that useful anyway.	2025-02-07 16:02:19 +00:00
Daniel Paoliello	2a6b27444a	Remove dead code from rustc_codegen_llvm and the LLVM wrapper	2025-02-06 16:53:52 -08:00
Zalathar	4385a9e063	Debuginfo for function ZSTs should have alignment of 8 bits, not 1 bit	2025-02-06 23:01:29 +11:00
bors	2f92f050e8	Auto merge of #136471 - safinaskar:parallel, r=SparrowLii tree-wide: parallel: Fully removed all `Lrc`, replaced with `Arc` This is a continuation of https://github.com/rust-lang/rust/pull/132282 . I'm pretty sure I did everything right. In particular, I searched all occurrences of `Lrc` in submodules and made sure that they don't need replacement. There are other possibilities, though. We can define `enum Lrc<T> { Rc(Rc<T>), Arc(Arc<T>) }`. Or we can make `Lrc` a union and on every clone we can read from a special thread-local variable. Or we can add a generic parameter to `Lrc` and, yes, this parameter will be everywhere across the whole codebase. So, if you think we should take some alternative approach, then don't merge this PR. But if it is decided to stick with `Arc`, then, please, merge. cc "Parallel Rustc Front-end" ( https://github.com/rust-lang/rust/issues/113349 ) r? SparrowLii `@rustbot` label WG-compiler-parallel	2025-02-06 10:50:05 +00:00
Zalathar	bd855b6c9e	coverage: Remove the old code for simplifying counters after MIR opts	2025-02-06 21:44:31 +11:00
Zalathar	20d051ec87	coverage: Defer part of counter-creation until codegen	2025-02-06 21:44:31 +11:00
Zalathar	ee7dc06cf1	coverage: Store BCB node IDs in mappings, and resolve them in codegen Even though the coverage graph itself is no longer available during codegen, its nodes can still be used as opaque IDs.	2025-02-06 21:44:29 +11:00
Zalathar	042fd8c24a	Remove some unused glob re-exports These were detected by temporarily making `mod llvm` non-public.	2025-02-06 12:10:45 +11:00
Zalathar	65d7e6937b	Remove the `mod llvm_` hack, which should no longer be necessary	2025-02-06 12:10:42 +11:00
Manuel Drehwald	70b9ba3d6e	fix fwd-mode autodiff case	2025-02-05 18:47:23 -05:00
León Orell Valerian Liehr	75989e98d8	Rollup merge of #136375 - Zalathar:llvm-di-builder, r=workingjubilee cg_llvm: Replace some DIBuilder wrappers with LLVM-C API bindings (part 1) Part of #134001, follow-up to #136326, extracted from #134009. This PR performs an arbitrary subset of the LLVM-C binding migrations from #134009, which should make it less tedious to review. The remaining migrations can occur in one or more subsequent PRs.	2025-02-05 05:03:03 +01:00
bors	3f33b30e19	Auto merge of #135760 - scottmcm:disjoint-bitor, r=WaffleLapkin Add `unchecked_disjoint_bitor` per ACP373 Following the names from libs-api in https://github.com/rust-lang/libs-team/issues/373#issuecomment-2085686057 Includes a fallback implementation so this doesn't have to update cg_clif or cg_gcc, and overrides it in cg_llvm to use `or disjoint`, which [is available in LLVM 18](https://releases.llvm.org/18.1.0/docs/LangRef.html#or-instruction) so hopefully we don't need any version checks.	2025-02-04 17:46:06 +00:00
Augie Fackler	e9cb36bd0f	nvptx64: update default alignment to match LLVM 21 This changed in llvm/llvm-project@91cb8f5d32. The commit itself is mostly about some intrinsic instructions, but as an aside it also mentions something about addrspace for tensor memory, which I believe is what this string is telling us. @rustbot label: +llvm-main	2025-02-04 10:37:07 -05:00
Ralf Jung	04e7a10af6	intrinsics: unify rint, roundeven, nearbyint in a single round_ties_even intrinsic	2025-02-04 16:27:29 +01:00
Askar Safin	0a21f1d0a2	tree-wide: parallel: Fully removed all `Lrc`, replaced with `Arc`	2025-02-03 13:25:57 +03:00
Scott McMurray	f46e6be190	Handle the case where the `or disjoint` folds immediately to a constant	2025-02-02 21:04:10 -08:00
Matthias Krüger	f5ae630f10	Rollup merge of #136426 - oli-obk:push-nkpuulwurykn, r=compiler-errors Explain why we retroactively change a static initializer to have a different type I keep getting confused about it and in turn confused `@GuillaumeGomez` while trying to explain it badly	2025-02-02 23:06:57 +01:00
Oli Scherer	b89263605a	Explain why we retroactively change a static initializer to have a different type	2025-02-01 22:39:38 +00:00
Scott McMurray	4ee1602eab	Override `disjoint_or` in the LLVM backend	2025-01-31 22:29:08 -08:00
Zalathar	c3f2930edc	Explain why (some) pointer/length strings are `*const c_uchar`	2025-02-01 14:14:40 +11:00
Zalathar	5413d2bd6f	Add FIXME for auditing optional parameters passed to DIBuilder	2025-02-01 14:14:40 +11:00
Zalathar	8ddd9c38f6	Use `LLVMDIBuilderCreateDebugLocation` The LLVM-C binding takes an explicit context, whereas our binding obtained the context from the scope argument.	2025-02-01 14:14:40 +11:00
Zalathar	949b4673ce	Use `LLVMDIBuilderCreateLexicalBlockFile`	2025-02-01 14:14:40 +11:00
Zalathar	70d41bc711	Use `LLVMDIBuilderCreateLexicalBlock`	2025-02-01 14:14:40 +11:00
Zalathar	878ab125a1	Use `LLVMDIBuilderCreateNameSpace`	2025-02-01 14:14:39 +11:00
Zalathar	cd2af2dd9a	Use `LLVMDIBuilderFinalize`	2025-02-01 13:38:12 +11:00
Zalathar	832fcfb64f	Introduce `DIBuilderBox`, an owning pointer to `DIBuilder`	2025-02-01 13:34:14 +11:00
Ben Kimock	ce7cb312fa	Add link attribute for Enzyme's FFI	2025-01-31 21:11:23 -05:00
bors	7f36543a48	Auto merge of #136332 - jhpratt:rollup-aa69d0e, r=jhpratt Rollup of 9 pull requests Successful merges: - #132156 (When encountering unexpected closure return type, point at return type/expression) - #133429 (Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle) - #136281 (`rustc_hir_analysis` cleanups) - #136297 (Fix a typo in profile-guided-optimization.md) - #136300 (atomic: extend compare_and_swap migration docs) - #136310 (normalize `*.long-type.txt` paths for compare-mode tests) - #136312 (Disable `overflow_delimited_expr` in edition 2024) - #136313 (Filter out RPITITs when suggesting unconstrained assoc type on too many generics) - #136323 (Fix a typo in conventions.md) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-31 09:42:28 +00:00
Jacob Pratt	c19c4b91f5	Rollup merge of #133429 - EnzymeAD:autodiff-middle, r=oli-obk Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle This PR should not be merged until the rustc_codegen_llvm part is merged. I will also alter it a little based on what gets shaved off from the cg_llvm PR, and address some of the feedback I received in the other PR (including cleanups). I am putting it up already to 1) Discuss with `@jieyouxu` if there is more work needed to add tests to this and 2) Pray that there is someone reviewing who can tell me why some of my autodiff invocations get lost. Re 1: My tests require fat-lto. I also modify the compilation pipeline. So if there are any other llvm-ir tests in the same compilation unit then I will likely break them. Luckily there are two groups who currently have the same fat-lto requirement for their GPU code which I have for my autodiff code and both groups have some plans to enable support for thin-lto. Once either of those efforts pans out, I'll copy it over for this feature. I will also work on not changing the optimization pipeline for functions not differentiated, but that will require some thought and engineering, so I think it would be good to be able to run the autodiff tests isolated from the rest for now. Can you guide me here please? For context, here are some of my tests in the samples folder: https://github.com/EnzymeAD/rustbook Re 2: This is a pretty serious issue, since it effectively prevents publishing libraries making use of autodiff: https://github.com/EnzymeAD/rust/issues/173. For some reason my dummy code persists till the end, so the code which calls autodiff, deletes the dummy, and inserts the code to compute the derivative never gets executed. To me it looks like the rustc_autodiff attribute just gets dropped, but I don't know WHY? Any help would be super appreciated, as rustc queries look a bit voodoo to me. Tracking: - https://github.com/rust-lang/rust/issues/124509 r? `@jieyouxu`	2025-01-31 00:26:30 -05:00
bors	c37fbd873a	Auto merge of #135318 - compiler-errors:vtable-fixes, r=lcnr Fix deduplication mismatches in vtables leading to upcasting unsoundness We currently have two cases where subtleties in supertraits can trigger disagreements in the vtable layout, e.g. leading to a different vtable layout being accessed at a callsite compared to what was prepared during unsizing. Namely: ### #135315 In this example, we were not normalizing supertraits when preparing vtables. In the example, ``` trait Supertrait<T> { fn _print_numbers(&self, mem: &[usize; 100]) { println!("{mem:?}"); } } impl<T> Supertrait<T> for () {} trait Identity { type Selff; } impl<Selff> Identity for Selff { type Selff = Selff; } trait Middle<T>: Supertrait<()> + Supertrait<T> { fn say_hello(&self, _: &usize) { println!("Hello!"); } } impl<T> Middle<T> for () {} trait Trait: Middle<<() as Identity>::Selff> {} impl Trait for () {} fn main() { (&() as &dyn Trait as &dyn Middle<()>).say_hello(&0); } ``` When we prepare `dyn Trait`, we see a supertrait of `Middle<<() as Identity>::Selff>`, which itself has two supertraits `Supertrait<()>` and `Supertrait<<() as Identity>::Selff>`. These two supertraits are identical, but they are not deduplicated because we were using structural equality and not considering normalization. This leads to a vtable layout with two trait pointers. When we upcast to `dyn Middle<()>`, those two supertraits are now the same, leading to a vtable layout with only one trait pointer. This leads to an offset error, and we call the wrong method. ### #135316 This one is a bit more interesting, and is the bulk of the changes in this PR. It's a bit similar, except it uses binder equality instead of normalization to confuse the compiler about two vtable layouts. 
In the example, ``` trait Supertrait<T> { fn _print_numbers(&self, mem: &[usize; 100]) { println!("{mem:?}"); } } impl<T> Supertrait<T> for () {} trait Trait<T, U>: Supertrait<T> + Supertrait<U> { fn say_hello(&self, _: &usize) { println!("Hello!"); } } impl<T, U> Trait<T, U> for () {} fn main() { (&() as &'static dyn for<'a> Trait<&'static (), &'a ()> as &'static dyn Trait<&'static (), &'static ()>) .say_hello(&0); } ``` When we prepare the vtable for `dyn for<'a> Trait<&'static (), &'a ()>`, we currently consider the PolyTraitRef of the vtable as the key for a supertrait. This leads to two supertraits -- `Supertrait<&'static ()>` and `for<'a> Supertrait<&'a ()>`. However, we can upcast[^up] without offsetting the vtable from `dyn for<'a> Trait<&'static (), &'a ()>` to `dyn Trait<&'static (), &'static ()>`. This is just instantiating the principal trait ref for a specific `'a = 'static`. However, when considering those supertraits, we now have only one distinct supertrait -- `Supertrait<&'static ()>` (which is deduplicated since there are two supertraits with the same substitutions). This leads to similar offsetting issues, leading to the wrong method being called. [^up]: I say upcast but this is a cast that is allowed on stable, since it's not changing the vtable at all, just instantiating the binder of the principal trait ref for some lifetime. The solution here is to recognize that a vtable isn't really meaningfully higher ranked, and to just treat a vtable as corresponding to a `TraitRef` so we can do this deduplication more faithfully. That is to say, the vtable for `dyn for<'a> Tr<'a>` and `dyn Tr<'x>` are always identical, since they both would correspond to a set of free regions on an impl... Do note that `Tr<for<'a> fn(&'a ())>` and `Tr<fn(&'static ())>` are still distinct. ---- There's a bit more that can be cleaned up. In codegen, we can stop using `PolyExistentialTraitRef` basically everywhere. 
We can also fix SMIR to stop storing `PolyExistentialTraitRef` in its vtable allocations. As for testing, it's difficult to actually turn this into something that can be tested with `rustc_dump_vtable`, since having multiple supertraits that are identical is a recipe for ambiguity errors. Maybe someone else is more creative with getting that attr to work, since the tests I added being run-pass tests is a bit unsatisfying. Miri also doesn't help here, since it doesn't really generate vtables that are offset by an index in the same way as codegen. r? `@lcnr` for the vibe check? Or reassign, idk. Maybe let's talk about whether this makes sense. <sup>(I guess an alternative would also be to not do any deduplication of vtable supertraits (or only a really conservative subset) rather than trying to normalize and deduplicate more faithfully here. Not sure if that works and is sufficient tho.)</sup> cc `@steffahn` -- ty for the minimizations cc `@WaffleLapkin` -- since you're overseeing the feature stabilization :3 Fixes #135315 Fixes #135316	2025-01-31 04:09:11 +00:00
bors	6c1d960d88	Auto merge of #136318 - matthiaskrgr:rollup-a159mzo, r=matthiaskrgr Rollup of 9 pull requests Successful merges: - #135026 (Cast global variables to default address space) - #135475 (uefi: Implement path) - #135852 (Add `AsyncFn*` to `core` prelude) - #136004 (tests: Skip const OOM tests on aarch64-unknown-linux-gnu) - #136157 (override build profile for bootstrap tests) - #136180 (Introduce a wrapper for "typed valtrees" and properly check the type before extracting the value) - #136256 (Add release notes for 1.84.1) - #136271 (Remove minor future footgun in `impl Debug for MaybeUninit`) - #136288 (Improve documentation for file locking) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-30 23:11:38 +00:00
Matthias Krüger	6a66a270b0	Rollup merge of #136180 - lukas-code:typed-valtree, r=oli-obk Introduce a wrapper for "typed valtrees" and properly check the type before extracting the value This PR adds a new wrapper type `ty::Value` to replace the tuple `(Ty, ty::ValTree)` and become the new canonical representation of type-level constant values. The value extraction methods `try_to_bits`/`try_to_bool`/`try_to_target_usize` are moved to this new type. For `try_to_bits` in particular, this avoids some redundant matches on `ty::ConstKind::Value`. Furthermore, these methods will now properly check the type before extracting the value, which fixes some ICEs. The name `ty::Value` was chosen to be consistent with `ty::Expr`. Commit 1 should be non-functional and commit 2 adds the type check. --- fixes https://github.com/rust-lang/rust/issues/131102 supersedes https://github.com/rust-lang/rust/pull/136130 r? `@oli-obk` cc `@FedericoBruzzone` `@BoxyUwU`	2025-01-30 20:47:07 +01:00
Matthias Krüger	89f8abe8b4	Rollup merge of #135026 - Flakebi:global-addrspace, r=saethlin Cast global variables to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if a global variable is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for global variables is not 0. For example `core` does not compile in debug mode when not casting the address space to the default one because it tries to emit the following (simplified) LLVM IR, containing a type mismatch: ```llvm @alloc_0 = addrspace(1) constant <{ [6 x i8] }> <{ [6 x i8] c"bit.rs" }>, align 1 @alloc_1 = addrspace(1) constant <{ ptr }> <{ ptr addrspace(1) @alloc_0 }>, align 8 ; ^ here a struct containing a `ptr` is needed, but it is created using a `ptr addrspace(1)` ``` For this to compile, we need to insert a constant `addrspacecast` before we use a global variable: ```llvm @alloc_0 = addrspace(1) constant <{ [6 x i8] }> <{ [6 x i8] c"bit.rs" }>, align 1 @alloc_1 = addrspace(1) constant <{ ptr }> <{ ptr addrspacecast (ptr addrspace(1) @alloc_0 to ptr) }>, align 8 ``` As vtables are global variables as well, they are also created with an `addrspacecast`. In the SSA backend, after a vtable global is created, metadata is added to it. To add metadata, we need the uncast global variable. Therefore we strip away an addrspacecast if there is one, to get the underlying global. Tracking issue: #135024	2025-01-30 20:47:02 +01:00
Lukas Markeffsky	10fc0b159e	introduce `ty::Value` Co-authored-by: FedericoBruzzone <federico.bruzzone.i@gmail.com>	2025-01-30 17:47:44 +01:00
Michael Goulet	9dc41a048d	Use ExistentialTraitRef throughout codegen	2025-01-30 15:34:00 +00:00
Michael Goulet	fdc4bd22b7	Do not treat vtable supertraits as distinct when bound with different bound vars	2025-01-30 15:33:58 +00:00
Wesley Wiser	51eaa0d56a	Clean up uses of the unstable `dwarf_version` option - Consolidate calculation of the effective value. - Check the target `DebuginfoKind` instead of using `is_like_msvc`.	2025-01-29 21:44:21 -06:00
Manuel Drehwald	1f30517d40	upstream rustc_codegen_ssa/rustc_middle changes for enzyme/autodiff	2025-01-29 21:31:13 -05:00
León Orell Valerian Liehr	7e123e4940	Rollup merge of #136147 - RalfJung:required-target-features-check-not-add, r=workingjubilee ABI-required target features: warn when they are missing in base CPU Part of https://github.com/rust-lang/rust/pull/135408: instead of adding ABI-required features to the target we build for LLVM, check that they are already there. Crucially we check this after applying `-Ctarget-cpu` and `-Ctarget-feature`, by reading `sess.unstable_target_features`. This means we can tweak the ABI target feature check without changing the behavior for any existing user; they will get warnings but the target features behave as before. The test changes here show that we are un-doing the "add all required target features" part. Without the full #135408, there is no way to take away an ABI-required target feature with `-Ctarget-cpu`, so we cannot yet test that part. Cc `@workingjubilee`	2025-01-29 03:12:21 +01:00
Taiki Endo	93465e6c31	Mark condition/carry bit as clobbered in C-SKY inline assembly	2025-01-29 06:46:05 +09:00
Ralf Jung	93ee180cfa	ABI-required target features: warn when they are missing in base CPU (rather than silently enabling them)	2025-01-28 04:40:42 +01:00
Oli Scherer	b24f674520	Change `collect_and_partition_mono_items` tuple return type to a struct	2025-01-27 09:38:12 +00:00
Jörn Horstmann	3779b8e32e	Consistently use the most significant bit of vector masks This improves the codegen for vector `select`, `gather`, `scatter` and boolean reduction intrinsics and fixes rust-lang/portable-simd#316. The current behavior of most mask operations during llvm codegen is to truncate the mask vector to <N x i1>, telling llvm to use the least significant bit. The exception is the `simd_bitmask` intrinsics, which already used the most significant bit. Since sse/avx instructions are defined to use the most significant bit, truncating means that llvm has to insert a left shift to move the bit into the most significant position, before the mask can actually be used. Similarly on aarch64, mask operations like blend work bit by bit, so repeating the least significant bit across the whole lane involves shifting it into the sign position and then comparing against zero. By shifting before truncating to <N x i1>, we tell llvm that we only consider the most significant bit, removing the need for additional shift instructions in the assembly.	2025-01-26 16:44:23 +01:00
bors	6365178a6b	Auto merge of #128657 - clubby789:optimize-none, r=fee1-dead,WaffleLapkin Add `#[optimize(none)]` cc #54882 This extends the `optimize` attribute to add `none`, which corresponds to the LLVM `OptimizeNone` attribute. Not sure if an MCP is required for this, happy to file one if so.	2025-01-25 05:50:36 +00:00
Matthias Krüger	1e454fe725	Rollup merge of #135581 - EnzymeAD:refactor-codgencx, r=oli-obk Separate Builder methods from tcx As part of the autodiff upstreaming we noticed that it would be nice to have various builder methods available without the TypeContext, which prevents the normal CodegenCx from being passed around between threads. We introduce a SimpleCx which just owns the llvm module and llvm context, to encapsulate them. The previous CodegenCx now implements deref and forwards access to the llvm module or context to its SimpleCx sub-struct. This gives us a bit more flexibility, because now we can pass (or construct) the SimpleCx in locations where we don't have enough information to construct a CodegenCx, or are not able to pass it around due to the tcx lifetimes (and it not implementing send/sync). This also introduces an SBuilder, similar to the SimpleCx. The SBuilder uses a SimpleCx, whereas the existing Builder uses the larger CodegenCx. I will push updates to make implementations generic (where possible) to be implemented once and work for either of the two. I'll also clean up the leftover code. `call` is a bit tricky, because it requires a tcx, I probably need to duplicate it after all. Tracking: - https://github.com/rust-lang/rust/issues/124509	2025-01-24 23:25:42 +01:00
Manuel Drehwald	386c233858	Make CodegenCx and Builder generic Co-authored-by: Oli Scherer <github35764891676564198441@oli-obk.de>	2025-01-24 16:05:26 -05:00
clubby789	5ac95a5c47	Rename `OptimizeAttr::None` to `Default`	2025-01-24 19:34:01 +00:00
Zalathar	ff48331588	coverage: Make query `coverage_ids_info` return an Option This reflects the fact that we can't compute meaningful info for a function that wasn't instrumented and therefore doesn't have `function_coverage_info`.	2025-01-24 16:13:11 +11:00
Flakebi	b06e840d9e	Add comments about address spaces	2025-01-24 00:37:05 +01:00
clubby789	cd848c9f3e	Implement `optimize(none)` attribute	2025-01-23 17:19:53 +00:00
Ken Matsui	44e8c43976	rustc_codegen_llvm: remove outdated asm-to-obj codegen note Remove comment about missing integrated assembler handling, which was removed in commit `02840ca`.	2025-01-22 17:58:50 -05:00
Matthias Krüger	e0d74c0667	Rollup merge of #135156 - Zalathar:debuginfo-flags, r=cuviper Make our `DIFlags` match `LLVMDIFlags` in the LLVM-C API In order to be able to use a mixture of LLVM-C and C++ bindings for debuginfo, our Rust-side `DIFlags` needs to have the same layout as LLVM-C's `LLVMDIFlags`, and we also need to be able to convert it to the `DIFlags` accepted by LLVM's C++ API. Internally, LLVM converts between the two types with a simple cast. We can't necessarily rely on that always being true, and LLVM doesn't expose a conversion function, so we have two potential options: - Convert each bit/subvalue individually - Statically assert that doing a cast is actually fine As long as both types do remain the same under the hood (which seems likely), the static-assert-and-cast approach is easier and faster. If the static assertions ever start failing against some future version of LLVM, we'll have to switch over to the convert-each-subvalue approach, which is a bit more error-prone. --- Extracted from #134009, though this PR ended up choosing the static-assert-and-cast approach over the convert-each-subvalue approach.	2025-01-22 19:29:39 +01:00
bors	ed43cbcb88	Auto merge of #134299 - RalfJung:remove-start, r=compiler-errors remove support for the (unstable) #[start] attribute As explained by `@Noratrieb:` `#[start]` should be deleted. It's nothing but an accidentally leaked implementation detail that's a not very useful mix between "portable" entrypoint logic and bad abstraction. I think the way the stable user-facing entrypoint should work (and works today on stable) is pretty simple: - `std`-using cross-platform programs should use `fn main()`. the compiler, together with `std`, will then ensure that code ends up at `main` (by having a platform-specific entrypoint that gets directed through `lang_start` in `std` to `main` - but that's just an implementation detail) - `no_std` platform-specific programs should use `#![no_main]` and define their own platform-specific entrypoint symbol with `#[no_mangle]`, like `main`, `_start`, `WinMain` or `my_embedded_platform_wants_to_start_here`. most of them only support a single platform anyways, and need cfg for the different platform's ways of passing arguments or other things anyways `#[start]` is in a super weird position of being neither of those two. It tries to pretend that it's cross-platform, but its signature is a total lie. Those arguments are just stubbed out to zero on ~~Windows~~ wasm, for example. It also only handles the platform-specific entrypoints for a few platforms that are supported by `std`, like Windows or Unix-likes. `my_embedded_platform_wants_to_start_here` can't use it, and neither could a libc-less Linux program. So we have an attribute that only works in some cases anyways, that has a signature that's a total lie (and a signature that, as I might want to add, has changed recently, and that I definitely would not be comfortable giving any stability guarantees on), and where there's a pretty easy way to get things working without it in the first place. Note that this feature has not been RFCed in the first place. 
This comment was posted [in May](https://github.com/rust-lang/rust/issues/29633#issuecomment-2088596042) and so far nobody spoke up in that issue with a usecase that would require keeping the attribute. Closes https://github.com/rust-lang/rust/issues/29633 try-job: x86_64-gnu-nopt try-job: x86_64-msvc-1 try-job: x86_64-msvc-2 try-job: test-various	2025-01-21 19:46:20 +00:00
Ralf Jung	56c90dc31e	remove support for the #[start] attribute	2025-01-21 06:59:15 -07:00
Oli Scherer	dfa4c01b2e	Treat undef bytes as equal to any other byte	2025-01-21 08:27:21 +00:00
Zalathar	d10bdafa26	Note that cg_llvm's gimli should match the version used elsewhere	2025-01-21 14:41:44 +11:00
Zalathar	32f1c1d85e	Make our `DIFlags` match `LLVMDIFlags` in the LLVM-C API	2025-01-21 14:41:44 +11:00
bors	6a64e3b897	Auto merge of #135643 - khuey:135332, r=jieyouxu When LLVM's location discriminator value limit is exceeded, emit locations with dummy spans instead of dropping them entirely Dropping them fails `-Zverify-llvm-ir`. Fixes #135332. r? `@jieyouxu`	2025-01-20 14:16:22 +00:00
Yotam Ofek	264fa0fc54	Run `clippy --fix` for `unnecessary_map_or` lint	2025-01-19 19:15:00 +00:00
Kyle Huey	45ef92731b	When LLVM's location discriminator value limit is exceeded, emit locations with dummy spans instead of dropping them entirely Revert most of #133194 (except the test and the comment fixes). Then refix not emitting locations at all when the correct location discriminator value exceeds LLVM's capacity.	2025-01-19 07:17:33 -08:00
bors	0c2c096e1a	Auto merge of #135047 - Flakebi:amdgpu-kernel-cc, r=workingjubilee Add gpu-kernel calling convention The amdgpu-kernel calling convention was reverted in commit `f6b21e90d1` (#120495 and https://github.com/rust-lang/rust-analyzer/pull/16463) due to inactivity in the amdgpu target. Introduce a `gpu-kernel` calling convention that translates to `ptx_kernel` or `amdgpu_kernel`, depending on the target that rust compiles for. Tracking issue: #135467 amdgpu target tracking issue: #135024	2025-01-17 04:36:09 +00:00
Flakebi	e7e5202978	Add gpu-kernel calling convention The amdgpu-kernel calling convention was reverted in commit `f6b21e90d1` due to inactivity in the amdgpu target. Introduce a `gpu-kernel` calling convention that translates to `ptx_kernel` or `amdgpu_kernel`, depending on the target that rust compiles for.	2025-01-16 00:26:55 +01:00
Matthias Krüger	448bad9eba	Rollup merge of #133752 - klensy:cp, r=davidtwco replace copypasted ModuleLlvm::parse The replaced code is the same as in `bd36e69d25/compiler/rustc_codegen_llvm/src/lib.rs (L426-L445)`, except that the error message was previously emitted via `write::llvm_err`, which returned a different error kind; is that still OK?	2025-01-13 15:56:55 +01:00
Matthias Krüger	0bb0f0412f	Rollup merge of #135205 - lqd:bitsets, r=Mark-Simulacrum Rename `BitSet` to `DenseBitSet` r? `@Mark-Simulacrum` as you requested this in https://github.com/rust-lang/rust/pull/134438#discussion_r1890659739 after such a confusion. This PR renames `BitSet` to `DenseBitSet` to make it less obvious as the go-to solution for bitmap needs, as well as make its representation (and positives/negatives) clearer. It also expands the comments there to hopefully make it clearer when it's not a good fit, with some alternative bitsets types. (This migrates the subtrees cg_gcc and clippy to use the new name in separate commits, for easier review by their respective owners, but they can obvs be squashed)	2025-01-11 18:13:47 +01:00
Matthias Krüger	b8e230a824	Rollup merge of #134030 - folkertdev:min-fn-align, r=workingjubilee add `-Zmin-function-alignment` tracking issue: https://github.com/rust-lang/rust/issues/82232 This PR adds the `-Zmin-function-alignment=<align>` flag, that specifies a minimum alignment for all* functions. ### Motivation This feature is requested by RfL [here](https://github.com/rust-lang/rust/issues/128830): > i.e. the equivalents of `-fmin-function-alignment` ([GCC](https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html#index-fmin-function-alignment_003dn), Clang does not support it) / `-falign-functions` ([GCC](https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html#index-falign-functions), [Clang](https://clang.llvm.org/docs/ClangCommandLineReference.html#cmdoption-clang1-falign-functions)). > > For the Linux kernel, the behavior wanted is that of GCC's `-fmin-function-alignment` and Clang's `-falign-functions`, i.e. align all functions, including cold functions. > > There is [`feature(fn_align)`](https://github.com/rust-lang/rust/issues/82232), but we need to do it globally. ### Behavior The `fn_align` feature does not have an RFC. It was decided at the time that it would not be necessary, but maybe we feel differently about that now? In any case, here are the semantics of this flag: - `-Zmin-function-alignment=<align>` specifies the minimum alignment of all* functions - the `#[repr(align(<align>))]` attribute can be used to override the function alignment on a per-function basis: when `-Zmin-function-alignment` is specified, the attribute's value is only used when it is higher than the value passed to `-Zmin-function-alignment`. - the target may decide to use a higher value (e.g. on x86_64 the minimum that LLVM generates is 16) - The highest supported alignment in rust is `2^29`: I checked a bunch of targets, and they all emit the `.p2align 29` directive for targets that align functions at all (some GPU stuff does not have function alignment). 
*: Only with `build-std` would the minimum alignment also be applied to `std` functions. --- cc `@ojeda` r? `@workingjubilee` you were active on the tracking issue	2025-01-11 18:13:45 +01:00
Rémy Rakic	a13354bea0	rename `BitSet` to `DenseBitSet` This should make it clearer that this bitset is dense, with the advantages and disadvantages that it entails.	2025-01-11 11:34:01 +00:00
Folkert de Vries	47573bf61e	add `-Zmin-function-alignment`	2025-01-10 22:53:54 +01:00
David Wood	f86169a58f	mir_transform: implement forced inlining Adds `#[rustc_force_inline]` which is similar to always inlining but reports an error if the inlining was not possible, and which always attempts to inline annotated items, regardless of optimisation levels. It can only be applied to free functions to guarantee that the MIR inliner will be able to resolve calls.	2025-01-10 18:37:54 +00:00
Guillaume Gomez	020d8758f4	Rollup merge of #135177 - maurer:rename-module, r=nikic llvm: Ignore error value that is always false See llvm/llvm-project#121851 For LLVM 20+, this function (`renameModuleForThinLTO`) has no return value. For prior versions of LLVM, this never failed, but had a signature which allowed an error value people were handling. `@rustbot` label: +llvm-main r? `@nikic` Wait a moment before approving while the llvm-main infrastructure picks it up.	2025-01-07 15:30:25 +01:00
Jacob Pratt	4e4a93c2dd	Rollup merge of #131830 - hoodmane:emscripten-wasm-eh, r=workingjubilee Add support for wasm exception handling to Emscripten target This is a draft because we need some additional setting for the Emscripten target to select between the old exception handling and the new exception handling. I don't know how to add a setting like that, would appreciate advice from Rust folks. We could maybe choose to use the new exception handling if `Ctarget-feature=+exception-handling` is passed? I tried this but I get errors from llvm so I'm not doing it right.	2025-01-06 22:04:13 -05:00
Matthew Maurer	fc32dd49cb	llvm: Ignore error value that is always false See llvm/llvm-project#121851 For LLVM 20+, this function (`renameModuleForThinLTO`) has no return value. For prior versions of LLVM, this never failed, but had a signature which allowed an error value people were handling.	2025-01-07 01:02:22 +00:00
Hood Chatham	49c74234a7	Add support for wasm exception handling to Emscripten target Gated behind an unstable `-Z emscripten-wasm-eh` flag	2025-01-06 10:29:54 +01:00
bors	56f9e6f935	Auto merge of #135140 - jhpratt:rollup-pn2gi84, r=jhpratt Rollup of 3 pull requests Successful merges: - #135115 (cg_llvm: Use constants for DWARF opcodes, instead of FFI calls) - #135118 (Clarified the documentation on `core::iter::from_fn` and `core::iter::successors`) - #135121 (Mark `slice::reverse` unstably const) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-06 02:30:55 +00:00
bors	feb32c6546	Auto merge of #134794 - RalfJung:abi-required-target-features, r=workingjubilee Add a notion of "some ABIs require certain target features" I think I finally found the right shape for the data and checks that I recently added in https://github.com/rust-lang/rust/pull/133099, https://github.com/rust-lang/rust/pull/133417, https://github.com/rust-lang/rust/pull/134337: we have a notion of "this ABI requires the following list of target features, and it is incompatible with the following list of target features". Both `-Ctarget-feature` and `#[target_feature]` are updated to ensure we follow the rules of the ABI. This removes all the "toggleability" stuff introduced before, though we do keep the notion of a fully "forbidden" target feature -- this is needed to deal with target features that are actual ABI switches, and hence are needed to even compute the list of required target features. We always explicitly (un)set all required and in-conflict features, just to avoid potential trouble caused by the default features of whatever the base CPU is. We do this before applying `-Ctarget-feature` to maintain backward compatibility; this poses a slight risk of missing some implicit feature dependencies in LLVM but has the advantage of not breaking users that deliberately toggle ABI-relevant target features. They get a warning but the feature does get toggled the way they requested. For now, our logic supports x86, ARM, and RISC-V (just like the previous logic did). Unsurprisingly, RISC-V is the nicest. ;) As a side-effect this also (unstably) allows enabling `x87` when that is harmless. I used the opportunity to mark SSE2 as required on x86-64, to better match the actual logic in LLVM and because all x86-64 chips do have SSE2. This infrastructure also prepares us for requiring SSE on x86-32 when we want to use that for our ABI (and for float semantics sanity), see https://github.com/rust-lang/rust/issues/133611, but no such change is happening in this PR. r? 
`@workingjubilee`	2025-01-05 23:21:06 +00:00
Zalathar	f50721ebad	Explain why the `DW_TAG_*` constants remain as-is for now	2025-01-05 22:16:49 +11:00
Zalathar	1b62645418	Use constants for DWARF opcodes, instead of FFI calls	2025-01-05 22:16:25 +11:00
Zalathar	e267106104	Use gimli to get the values of DWARF constants needed by codegen The `gimli` crate is already a dependency of `thorin-dwp`, which is already a dependency of `rustc_codegen_ssa`.	2025-01-05 22:07:48 +11:00
Ralf Jung	2e64b5352b	add dedicated type for ABI target feature constraints	2025-01-05 10:46:30 +01:00
bors	3dc3c524f7	Auto merge of #133990 - Walnut356:static_const, r=workingjubilee [Debuginfo] Force enum `DISCR_` to `static const u64` to allow for inspection via LLDB see [here](https://rust-lang.zulipchat.com/#narrow/channel/317568-t-compiler.2Fwg-debugging/topic/Revamping.20Debuginfo/near/486614878) for more info. This change mainly helps `-msvc` debugged with LLDB. Currently, LLDB cannot inspect `static` struct fields, so the intended visualization for enums is only borderline functional, and niche enums with ranges of discriminants cannot be determined at all. LLDB can inspect `static const` values (though for whatever reason, non-enum/non-u64 consts don't work). This change adds the `LLVMRustDIBuilderCreateQualifiedType` to the rust FFI layer to wrap the discr type with a `const` modifier, as well as forcing all generated integer enum `DISCR_*` values to be u64's. Those values will only ever be used by debugger visualizers anyway, so it shouldn't be a huge deal, but I left a fixme comment for it just in case. The `tag` also still properly reflects the discriminant type, so no information is lost.	2025-01-04 23:56:29 +00:00
Flakebi	56bf673f0a	Remove range-metadata amdgpu workaround Range metadata was disabled for amdgpu due to a backend bug. I did not encounter any problems when removing the workaround to enable range metadata (tried compiling `core` and `alloc`), so I assume this has been fixed in LLVM in recent years. Remove the workaround to re-enable range metadata.	2025-01-02 15:45:04 +01:00
Flakebi	436e4fb647	Cast global variables to default address space Pointers for variables all need to be in the same address space for correct compilation. Therefore ensure that even if a global variable is created in a different address space, it is cast to the default address space before its value is used. This is necessary for the amdgpu target and others where the default address space for global variables is not 0. For example `core` does not compile in debug mode when not casting the address space to the default one because it tries to emit the following (simplified) LLVM IR, containing a type mismatch: ```llvm @alloc_0 = addrspace(1) constant <{ [6 x i8] }> <{ [6 x i8] c"bit.rs" }>, align 1 @alloc_1 = addrspace(1) constant <{ ptr }> <{ ptr addrspace(1) @alloc_0 }>, align 8 ; ^ here a struct containing a `ptr` is needed, but it is created using a `ptr addrspace(1)` ``` For this to compile, we need to insert a constant `addrspacecast` before we use a global variable: ```llvm @alloc_0 = addrspace(1) constant <{ [6 x i8] }> <{ [6 x i8] c"bit.rs" }>, align 1 @alloc_1 = addrspace(1) constant <{ ptr }> <{ ptr addrspacecast (ptr addrspace(1) @alloc_0 to ptr) }>, align 8 ``` As vtables are global variables as well, they are also created with an `addrspacecast`. In the SSA backend, after a vtable global is created, metadata is added to it. To add metadata, we need the uncast global variable. Therefore we strip away an addrspacecast if there is one, to get the underlying global.	2025-01-02 15:42:00 +01:00
Manuel Drehwald	d753cbf779	upstream rustc_codegen_llvm changes for enzyme/autodiff	2025-01-01 21:42:45 +01:00
Ralf Jung	912b7291d0	add ABI target features before -Ctarget-features	2024-12-31 12:41:20 +01:00
Ralf Jung	eb527424a5	x86-64 hardfloat actually requires sse2	2024-12-31 12:41:20 +01:00
Ralf Jung	cfae43d638	clean up target feature system; most of the toggleability is now handled by the ABI target feature check	2024-12-31 12:41:20 +01:00
Ralf Jung	2bf27e09be	explicitly model that certain ABIs require/forbid certain target features	2024-12-31 12:41:20 +01:00
Walnut	a1191e30b6	force enum `DISCR_*` to `const u64` to allow for inspection via LLDB's `SBTypeStaticField::GetConstantValue()`	2024-12-30 19:01:48 -06:00
Ralf Jung	a0dbb37ebd	add llvm_floatabi field to target spec that controls FloatABIType	2024-12-30 21:59:05 +01:00
Ralf Jung	fff026c8e5	rustc_llvm: expose FloatABIType target machine parameter	2024-12-30 18:10:59 +01:00
Ralf Jung	62bb35ab5d	make -Csoft-float have an effect on all ARM targets	2024-12-29 11:10:36 +01:00
Scott McMurray	4669c0d756	Override `carrying_mul_add` in cg_llvm	2024-12-27 08:17:40 -08:00
Walnut	bc4266ca96	add LLVMRustDIBuilderCreateQualifiedType to ffi	2024-12-23 19:12:32 -06:00
bjorn3	c02c311d84	Remove some dead code around import library generation This was missed when replacing the usage of LLVM for generating import libraries.	2024-12-20 15:20:15 +00:00
Matthias Krüger	57cbd078f2	Rollup merge of #134497 - Zalathar:spans, r=jieyouxu coverage: Store coverage source regions as `Span` until codegen (take 2) This is an attempt to re-land #133418: > Historically, coverage spans were converted into line/column coordinates during the MIR instrumentation pass. > This PR moves that conversion step into codegen, so that coverage spans spend most of their time stored as Span instead. > In addition to being conceptually nicer, this also reduces the size of coverage mappings in MIR, because Span is smaller than 4x u32. That PR was reverted by #133608, because in some circumstances not covered by our test suite we were emitting coverage metadata that was causing `llvm-cov` to exit with an error (#133606). --- The implementation here is mostly the same, but adapted for subsequent changes in the relevant code (e.g. #134163). I believe that the changes in #134163 should be sufficient to prevent the problem that required the original PR to be reverted. But I haven't been able to reproduce the original breakage in a regression test, and the `llvm-cov` error message is extremely unhelpful, so I can't completely rule out the possibility of this breaking again. r? jieyouxu (reviewer of the original PR)	2024-12-19 15:26:16 +01:00
Zalathar	aced4dcf10	coverage: Add a synthetic test for when all spans are discarded	2024-12-19 22:03:43 +11:00
Zalathar	837a25dd41	coverage: Identify source files by ID, not by interned filename	2024-12-19 18:09:09 +11:00
Zalathar	34ed51cb83	coverage: Store coverage source regions as `Span` until codegen	2024-12-19 18:09:09 +11:00
Zalathar	c3780e1d22	coverage: Quietly skip functions that end up having no mappings In codegen, a used function with `FunctionCoverageInfo` but no mappings has historically indicated a bug. However, that will no longer be the case after moving some fallible span-processing steps into codegen.	2024-12-19 18:09:07 +11:00
Zalathar	d416cead5a	coverage: Rename some FFI fields from `span` to `cov_span` This will avoid confusion with actual `Span` spans.	2024-12-19 17:26:01 +11:00
Ralf Jung	397ae3cdf6	fix outdated comment Co-authored-by: Camille Gillot <gillot.camille@gmail.com>	2024-12-18 11:01:54 +01:00
Ralf Jung	e023590de4	make no-variant types a dedicated Variants variant	2024-12-18 11:01:54 +01:00
Ralf Jung	21de42bf8d	Variants::Single: do not use invalid VariantIdx for uninhabited enums	2024-12-18 11:00:21 +01:00
bors	a89ca2c85e	Auto merge of #134243 - nnethercote:re-export-more-rustc_span, r=jieyouxu Re-export more `rustc_span::symbol` things from `rustc_span`. `rustc_span::symbol` defines some things that are re-exported from `rustc_span`, such as `Symbol` and `sym`. But it doesn't re-export some closely related things such as `Ident` and `kw`. So you can do `use rustc_span::{Symbol, sym}` but you have to do `use rustc_span::symbol::{Ident, kw}`, which is inconsistent for no good reason. This commit re-exports `Ident`, `kw`, and `MacroRulesNormalizedIdent`, and changes many `rustc_span::symbol::` qualifiers to `rustc_span::`. This is a 300+ net line of code reduction, mostly because many files with two `use rustc_span` items can be reduced to one. r? `@jieyouxu`	2024-12-18 02:56:38 +00:00
Nicholas Nethercote	2620eb42d7	Re-export more `rustc_span::symbol` things from `rustc_span`. `rustc_span::symbol` defines some things that are re-exported from `rustc_span`, such as `Symbol` and `sym`. But it doesn't re-export some closely related things such as `Ident` and `kw`. So you can do `use rustc_span::{Symbol, sym}` but you have to do `use rustc_span::symbol::{Ident, kw}`, which is inconsistent for no good reason. This commit re-exports `Ident`, `kw`, and `MacroRulesNormalizedIdent`, and changes many `rustc_span::symbol::` qualifiers in `compiler/` to `rustc_span::`. This is a 200+ net line of code reduction, mostly because many files with two `use rustc_span` items can be reduced to one.	2024-12-18 13:38:53 +11:00
Matthias Krüger	e696f5c180	Rollup merge of #134323 - Zalathar:dismantle-map-data, r=jieyouxu coverage: Dismantle `map_data.rs` by moving its responsibilities elsewhere This is a series of incremental changes that combine to let us get rid of `coverageinfo/map_data.rs`, by moving all of its responsibilities into more appropriate places. Some of the notable consequences are: - We once again build the per-CGU file table on the fly while preparing individual covfun records, instead of building the whole table up-front. The up-front approach was introduced by #117042 to work around various other problems in generating the covmap/covfun records, but subsequent cleanups have made that approach no longer necessary. - Expression conversion and mapping-region conversion are now performed directly in `mapgen::covfun`, which should make future changes easier. - We no longer insert unused function instances into the same map that is also used to track used function instances. This helps to decouple the handling of used vs unused functions. --- There should be no meaningful change to compiler output. The file table is no longer sorted, because reordering it would invalidate the file indices stored in individual covfun records, but the table order should still be deterministic (albeit arbitrary). There are some subsequent cleanups that I intend to investigate, but this is enough change for one PR.	2024-12-17 22:34:42 +01:00
Zalathar	541d4e85d9	coverage: Track used functions in a set instead of a map This patch dismantles what was left of `FunctionCoverage` in `map_data.rs`, replaces `function_coverage_map` with a set, and overhauls how we prepare covfun records for unused functions.	2024-12-17 14:14:19 +11:00
Zalathar	d34c365eb0	coverage: Pull function source hash out of `map_data.rs`	2024-12-17 13:55:20 +11:00
Zalathar	527f8127bb	coverage: Pull region conversion out of `map_data.rs`	2024-12-17 13:55:20 +11:00
Zalathar	252276a53d	coverage: Pull expression conversion out of `map_data.rs`	2024-12-17 13:55:20 +11:00
Zalathar	154fae1e8d	coverage: Build the global file table on the fly	2024-12-17 13:55:19 +11:00
Zalathar	fe412af4fc	coverage: Use `is_eligible_for_coverage` to filter unused functions The checks in `is_eligible_for_coverage` include `is_fn_like`, but will also exclude various function-like things that cannot possibly have coverage instrumentation.	2024-12-17 13:30:11 +11:00
Jonathan Dönszelmann	efb98b6552	rename rustc_attr to rustc_attr_parsing and create rustc_attr_data_structures	2024-12-16 19:08:19 +01:00
bors	d18506299b	Auto merge of #133417 - RalfJung:aarch64-float-abi, r=workingjubilee reject aarch64 target feature toggling that would change the float ABI ~~Stacked on top of https://github.com/rust-lang/rust/pull/133099. Only the last two commits are new.~~ The first new commit lays the groundwork for separately controlling whether a feature may be enabled or disabled. The second commit uses that to make it illegal to disable the `neon` feature (which is only possible via `-Ctarget-feature`, and so the new check just adds a warning). Enabling the `neon` feature remains allowed on targets that don't disable `neon` or `fp-armv8`, which is all our built-in targets. This way, the entire PR is not a breaking change. Fixes https://github.com/rust-lang/rust/issues/131058 for hardfloat targets (together with https://github.com/rust-lang/rust/pull/133102 which fixed it for softfloat targets). Part of https://github.com/rust-lang/rust/issues/116344.	2024-12-15 16:32:03 +00:00
Ralf Jung	eb2e928250	target_features: control separately whether enabling and disabling a target feature is allowed	2024-12-14 08:24:18 +01:00
Matthias Krüger	704102c0f0	Rollup merge of #134208 - Zalathar:covmap-covfun, r=compiler-errors coverage: Tidy up creation of covmap and covfun records This is a small follow-up to #134163 that mostly just inlines and renames some variables, and adds a few comments. It also slightly defers the creation of the LLVM value that holds the filename table, to just before the value is needed. --- try-job: x86_64-mingw-2 try-job: dist-x86_64-linux	2024-12-14 05:01:07 +01:00
bors	327c7ee436	Auto merge of #133099 - RalfJung:forbidden-hardfloat-features, r=workingjubilee forbid toggling x87 and fpregs on hard-float targets Part of https://github.com/rust-lang/rust/issues/116344, follow-up to https://github.com/rust-lang/rust/pull/129884: The `x87` target feature on x86 and the `fpregs` target feature on ARM must not be disabled on a hardfloat target, as that would change the float ABI. However, enabling `fpregs` on ARM is [explicitly requested](https://github.com/rust-lang/rust/issues/130988) as it seems to be useful. Therefore, we need to refine the distinction of "forbidden" target features and "allowed" target features: all (un)stable target features can determine on a per-target basis whether they should be allowed to be toggled or not. `fpregs` then checks whether the current target has the `soft-float` feature, and if yes, `fpregs` is permitted -- otherwise, it is not. (Same for `x87` on x86). Also fixes https://github.com/rust-lang/rust/issues/132351. Since `fpregs` and `x87` can be enabled on some builds and disabled on others, it would make sense that one can query it via `cfg`. Therefore, I made them behave in `cfg` like any other unstable target feature. The first commit prepares the infrastructure, but does not change behavior. The second commit then wires up `fpregs` and `x87` with that new infrastructure. r? `@workingjubilee`	2024-12-13 19:43:00 +00:00
Zalathar	5f5745beb0	coverage: Tidy up creation of covfun records	2024-12-12 22:13:07 +11:00
Zalathar	de53fe245d	coverage: Tidy up creation of covmap records	2024-12-12 22:10:42 +11:00
Zalathar	f7c6a2cf11	Fix our `llvm::Bool` typedef to be signed, to match `LLVMBool` In the LLVM-C API, boolean values are passed as `typedef int LLVMBool`, but our Rust-side typedef was using `c_uint` instead. Signed and unsigned integers have the same ABI on most platforms, but that isn't universally true, so we should prefer to be consistent with LLVM.	2024-12-12 20:54:33 +11:00
bors	903d2976fd	Auto merge of #129181 - beetrees:asm-spans, r=pnkfelix,compiler-errors Pass end position of span through inline ASM cookie Before this PR, only the start position of the span was passed through the inline ASM cookie to diagnostics. LLVM 19 has full support for 64-bit inline ASM cookies; this PR uses that to pass the end position of the span in the upper 32 bits, meaning inline ASM diagnostics now point at the entire line the error occurred on, not just the first character of it.	2024-12-12 02:34:06 +00:00
bors	1daec069fb	Auto merge of #128004 - folkertdev:naked-fn-asm, r=Amanieu codegen `#[naked]` functions using global asm tracking issue: https://github.com/rust-lang/rust/issues/90957 Fixes #124375 This implements the approach suggested in the tracking issue: use the existing global assembly infrastructure to emit the body of `#[naked]` functions. The main advantage is that we now have full control over what gets generated, and are no longer dependent on LLVM not sneakily messing with our output (inlining, adding extra instructions, etc). I discussed this approach with `@Amanieu` and while I think the general direction is correct, there is probably a bunch of stuff that needs to change or move around here. I'll leave some inline comments on things that I'm not sure about. Combined with https://github.com/rust-lang/rust/pull/127853, if both accepted, I think that resolves all steps from the tracking issue. r? `@Amanieu`	2024-12-11 21:51:07 +00:00
Ralf Jung	60eca2c575	apply review feedback	2024-12-11 22:18:51 +01:00
Ralf Jung	2d887a5c5c	generalize 'forbidden feature' concept so that even (un)stable feature can be invalid to toggle Also rename some things for extra clarity	2024-12-11 22:11:15 +01:00
Matthias Krüger	eefefbea2f	Rollup merge of #134165 - durin42:wasm-target-string, r=jieyouxu wasm(32\|64): update alignment string See llvm/llvm-project@c5ab70c508 `@rustbot` label: +llvm-main	2024-12-11 20:00:21 +01:00
Matthias Krüger	13c13ee4ec	Rollup merge of #134163 - Zalathar:covfun, r=SparrowLii,jieyouxu coverage: Rearrange the code for embedding per-function coverage metadata This is a series of refactorings to the code that prepares and embeds per-function coverage metadata records (“covfun records”) in the `__llvm_covfun` linker section of the final binary. The `llvm-cov` tool reads this metadata from the binary when preparing a coverage report. Beyond general cleanup, a big motivation behind these changes is to pave the way for re-landing an updated version of #133418. --- There should be no change in compiler output, as demonstrated by the absence of (meaningful) changes to coverage tests. The first patch is just moving code around, so I suggest looking at the other patches to see the actual changes. --- try-job: x86_64-gnu try-job: x86_64-msvc try-job: aarch64-apple	2024-12-11 20:00:18 +01:00
Augie Fackler	48b883287a	wasm(32\|64): update alignment string See llvm/llvm-project@c5ab70c508 @rustbot label: +llvm-main	2024-12-11 05:52:59 -05:00
Zalathar	3f3a9bf7f5	coverage: Store intermediate region tables in `CovfunRecord` This defers the call to `llvm_cov::write_function_mappings_to_buffer` until just before its enclosing global variable is created.	2024-12-11 21:35:45 +11:00
Zalathar	512f3fdebe	coverage: Only generate a CGU's covmap record if it has covfun records	2024-12-11 21:35:44 +11:00
Zalathar	6a8c016266	coverage: Reify `CovfunRecord` as an intermediate step	2024-12-11 18:25:10 +11:00
Zalathar	7c4ac71ad1	coverage: Extract function metadata handling to a `covfun` submodule	2024-12-11 17:49:44 +11:00
Folkert	bd8f8e0631	codegen `#[naked]` functions using `global_asm!`	2024-12-10 21:41:03 +01:00
León Orell Valerian Liehr	6d17cb833d	Rollup merge of #134115 - durin42:ppc64-target-string, r=jieyouxu rustc_target: ppc64 target string fixes for LLVM 20 LLVM continues to clean these up, and we continue to make this consistent. This is similar to `9caced7bad`, `e985396145`, and `a10e744faf`. ```@rustbot``` label: +llvm-main	2024-12-10 20:16:05 +01:00
León Orell Valerian Liehr	0064e731a6	Rollup merge of #134042 - sayantn:power8-crypto, r=jieyouxu Add the `power8-crypto` target feature Add the `power8-crypto` target feature. This will enable adding some new PPC intrinsics in stdarch (specifically AES, SHA and CLMUL intrinsics). The implied target feature is from [here](https://github.com/llvm/llvm-project/blob/main/llvm/lib/Target/PowerPC/PPC.td) ```@rustbot``` label A-target-feature O-PowerPC	2024-12-10 20:16:01 +01:00
Augie Fackler	0680155a17	rustc_target: ppc64 target string fixes for LLVM 20 LLVM continues to clean these up, and we continue to make this consistent. This is similar to `9caced7bad`, `e985396145`, and `a10e744faf`. `@rustbot` label: +llvm-main	2024-12-10 05:54:08 -05:00
León Orell Valerian Liehr	bb8a20678c	Rollup merge of #134029 - Zalathar:zero, r=oli-obk coverage: Use a query to find counters/expressions that must be zero As of #133446, this query (`coverage_ids_info`) determines which counter/expression IDs are unused. So with only a little extra work, we can take the code that was using that information to determine which coverage counters/expressions must be zero, and move that inside the query as well. There should be no change in compiler output.	2024-12-10 08:55:59 +01:00
bors	1b3fb31675	Auto merge of #134052 - matthiaskrgr:rollup-puxwqrk, r=matthiaskrgr Rollup of 7 pull requests Successful merges: - #133567 (A bunch of cleanups) - #133789 (Add doc alias 'then_with' for `then` method on `bool`) - #133880 (Expand home_dir docs) - #134036 (crash tests: use individual mir opts instead of mir-opt-level where easily possible) - #134045 (Fix some triagebot mentions paths) - #134046 (Remove ignored tests for hangs w/ new solver) - #134050 (Miri subtree update) r? `@ghost` `@rustbot` modify labels: rollup	2024-12-09 03:24:24 +00:00
Matthias Krüger	d2881e4eb5	Rollup merge of #133567 - bjorn3:various_cleanups, r=cjgillot A bunch of cleanups These are all extracted from a branch I have to get rid of driver queries. Most of the commits are not directly necessary for this, but were found in the process of implementing the removal of driver queries. Previous PR: https://github.com/rust-lang/rust/pull/132410	2024-12-09 01:56:32 +01:00
Sayantan Chakraborty	1220f393cc	Add the `power8-crypto` target feature	2024-12-09 00:41:35 +05:30
Zalathar	3a35fb6938	coverage: Unused functions don't need to store `CoverageIdsInfo`	2024-12-08 21:00:53 +11:00
Zalathar	4d2bfece41	coverage: Remove FunctionCoverageCollector The information that was being collected by this builder type is now collected by the `coverage_ids_info` query instead.	2024-12-08 20:53:57 +11:00
Zalathar	2022ef7f12	coverage: Use a query to find counters/expressions that must be zero This query (`coverage_ids_info`) already determines which counter/expression IDs are unused, so it only takes a little extra effort to also determine which counters/expressions must have a value of zero.	2024-12-08 20:53:39 +11:00
Zalathar	f3f7c20f7b	coverage: Move `CoverageIdsInfo` into `mir::coverage`	2024-12-08 17:50:42 +11:00
Scott McMurray	18d7b9a12f	Remove unnecessary `int_type_width_signed` function	2024-12-07 19:01:00 -08:00
Ben Kimock	711c8cc690	Remove polymorphization	2024-12-06 16:42:09 -05:00
bjorn3	401dd840ff	Remove all threading through of ErrorGuaranteed from the driver It was inconsistently done (sometimes even within a single function) and most of the rest of the compiler uses fatal errors instead, which need to be caught using catch_with_exit_code anyway. Using fatal errors instead of ErrorGuaranteed everywhere in the driver simplifies things a bit.	2024-12-06 18:42:31 +00:00
Jacob Pratt	b5a7f41a87	Rollup merge of #127565 - esp-rs:xtensa-vaargs, r=workingjubilee Teach rustc about the Xtensa VaListImpl Following on from the Xtensa target PRs (https://github.com/rust-lang/rust/pull/125141, https://github.com/rust-lang/rust/pull/126380), this PR teaches rustc about the structure of the VA list on the Xtensa arch, as well as adding the required lowering to be able to actually use it.	2024-12-05 05:50:50 -05:00
bors	8575f8f91b	Auto merge of #104342 - mweber15:add_file_location_to_more_types, r=wesleywiser Require `type_map::stub` callers to supply file information This change attaches file information (`DIFile` reference and line number) to struct debug info nodes. Before: ``` ; foo.ll ... !5 = !DIFile(filename: "<unknown>", directory: "") ... !16 = !DICompositeType(tag: DW_TAG_structure_type, name: "MyType", scope: !2, file: !5, size: 32, align: 32, elements: !17, templateParams: !19, identifier: "4cb373851db92e732c4cb5651b886dd0") ... ``` After: ``` ; foo.ll ... !3 = !DIFile(filename: "foo.rs", directory: "/home/matt/src/rust98678", checksumkind: CSK_SHA1, checksum: "bcb9f08512c8f3b8181ef4726012bc6807bc9be4") ... !16 = !DICompositeType(tag: DW_TAG_structure_type, name: "MyType", scope: !2, file: !3, line: 3, size: 32, align: 32, elements: !17, templateParams: !19, identifier: "9e5968c7af39c148acb253912b7f409f") ... ``` Fixes #98678 r? `@wesleywiser`	2024-12-03 12:49:57 +00:00
Brian J. Tarricone	059f6272c3	Teach rust core about Xtensa VaListImpl and add a custom lowering of vaarg for xtensa. LLVM does not include an implementation of the va_arg instruction for Xtensa. From what I understand, this is a conscious decision and instead language frontends are encouraged to implement it themselves. The rationale seems to be that loading values correctly requires language and ABI-specific knowledge that LLVM lacks. This is true of most architectures, and rustc already provides implementation for a number of them. This commit extends the support to include Xtensa. See https://lists.llvm.org/pipermail/llvm-dev/2017-August/116337.html for some discussion on the topic. Unfortunately there does not seem to be a reference document for the semantics of the va_list and va_arg on Xtensa. The most reliable source is the GCC implementation, which this commit tries to follow. Clang also provides its own compatible implementation. This was tested for all the types that rustc allows in variadics. Co-authored-by: Brian Tarricone <brian@tarricone.org> Co-authored-by: Jonathan Bastien-Filiatrault <joe@x2a.org> Co-authored-by: Paul Lietar <paul@lietar.net>	2024-12-03 10:54:08 +00:00
Matthias Krüger	9709334061	Rollup merge of #133395 - calebzulawski:simd_relaxed_fma, r=workingjubilee Add simd_relaxed_fma intrinsic Adds compiler support for https://github.com/rust-lang/portable-simd/issues/387#issuecomment-2337169786 r? `@workingjubilee` cc `@RalfJung` is this kind of nondeterminism a problem for miri/opsem?	2024-12-03 07:48:33 +01:00
Kornel	eadea7764e	Use c"lit" for CStrings without unwrap	2024-12-02 18:16:36 +00:00
klensy	694950d73c	replace copypasted ModuleLlvm::parse	2024-12-02 16:02:46 +03:00
Jacob Pratt	fa2edee758	Rollup merge of #133446 - Zalathar:querify, r=cjgillot coverage: Use a query to identify which counter/expression IDs are used Given that we already have a query to identify the highest-numbered counter ID in a MIR body, we can extend that query to also build bitsets of used counter/expression IDs. That lets us avoid some messy coverage bookkeeping during the main MIR traversal for codegen. This does mean that we fail to treat some IDs as used in certain MIR-inlining scenarios, but I think that's fine, because it means that the results will be consistent across all instantiations of a function. --- There's some more cleanup I want to do in the function coverage collector, since it isn't really collecting anything any more, but I'll leave that for future work.	2024-12-01 21:38:25 -05:00
bors	8ac313bdbe	Auto merge of #133499 - nikic:no-backend-verify, r=Mark-Simulacrum Respect verify-llvm-ir option in the backend We are currently unconditionally verifying the LLVM IR in the backend (twice), ignoring the value of the verify-llvm-ir option. This has substantial compile-time impact for debug builds.	2024-12-01 04:54:02 +00:00
许杰友 Jieyou Xu (Joe)	1aa01927d3	Rollup merge of #131551 - taiki-e:ppc-asm-vreg-inout, r=Amanieu Support input/output in vector registers of PowerPC inline assembly This extends currently clobber-only vector registers (`vreg`) support to allow passing `#[repr(simd)]` types as input/output. \| Architecture \| Register class \| Target feature \| Allowed types \| \| ------------ \| -------------- \| -------------- \| -------------- \| \| PowerPC \| `vreg` \| `altivec` \| `i8x16`, `i16x8`, `i32x4`, `f32x4` \| \| PowerPC \| `vreg` \| `vsx` \| `f32`, `f64`, `i64x2`, `f64x2` \| In addition to floats and `core::simd` types listed above, `core::arch` types and custom `#[repr(simd)]` types of the same size and type are also allowed. All allowed types and relevant target features are currently unstable. r? `@Amanieu` `@rustbot` label +O-PowerPC +A-inline-assembly	2024-11-30 12:57:32 +08:00
Zalathar	6fc0fe76e8	coverage: Use a query to identify which counter/expression IDs are used	2024-11-30 00:58:48 +11:00
Zalathar	121a17ccc3	coverage: All counter terms in an unused function are zero This is currently handled automatically by the fact that codegen doesn't visit coverage statements in unused functions, but that will no longer be the case when unused IDs are identified by a separate query instead.	2024-11-30 00:54:53 +11:00
Zalathar	58e122fef8	coverage: Hoist and explain the check for `coverage_cx`	2024-11-30 00:54:53 +11:00
Zalathar	3f65114ffc	coverage: Rename `CrateCoverageContext` to `CguCoverageContext` This context is stored in `CodegenCx`, which makes it per-CGU rather than per-crate. A single crate can have multiple CGUs.	2024-11-30 00:54:53 +11:00
Zalathar	9461f4296f	Revert "Rollup merge of #133418 - Zalathar:spans, r=jieyouxu" This reverts commit `adf9b5fcd1`, reversing changes made to `af1ca153d4`. Reverting due to <https://github.com/rust-lang/rust/issues/133606>.	2024-11-29 14:57:01 +11:00
bors	d53f0b1d8e	Auto merge of #123244 - Mark-Simulacrum:share-inline-never-generics, r=saethlin Enable -Zshare-generics for inline(never) functions This avoids inlining cross-crate generic items when possible that are already marked inline(never), implying that the author is not intending for the function to be inlined by callers. As such, having a local copy may make it easier for LLVM to optimize but mostly just adds to binary bloat and codegen time. In practice our benchmarks indicate this is indeed a win for larger compilations, where the extra cost in dynamic linking to these symbols is diminished compared to the advantages in fewer copies that need optimizing in each binary. It might also make sense to expand this with other heuristics (e.g., `#[cold]`) in the future, but this seems like a good starting point. FWIW, I expect that doing cleanup in where we make the decision what should/shouldn't be shared is also a good idea. Way too much code needed to be tweaked to check this. But I'm hoping to leave that for a follow-up PR rather than blocking this on it.	2024-11-28 21:44:34 +00:00
Mark Rousskov	4a216a25d1	Share inline(never) generics across crates This reduces code sizes and better respects programmer intent when marking inline(never). Previously such a marking was essentially ignored for generic functions, as we'd still inline them in remote crates.	2024-11-28 13:43:05 -05:00
Taiki Endo	df8feb5067	Support floats in input/output in vector registers of PowerPC inline assembly	2024-11-29 03:10:07 +09:00
Taiki Endo	0f8ebba54a	Support #[repr(simd)] types in input/output of PowerPC inline assembly	2024-11-29 00:24:36 +09:00
Guillaume Gomez	470c4f94e8	Rollup merge of #133452 - taiki-e:hexagon-asm-pred, r=Amanieu Support predicate registers (clobber-only) in Hexagon inline assembly The result of the Hexagon instructions such as comparison, store conditional, etc. is stored in predicate registers (`p[0-3]`), but currently there is no way to mark it as clobbered in `asm!`. This is also needed for `clobber_abi` (although implementing `clobber_abi` will require the addition of support for [several more register classes](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/Hexagon/HexagonRegisterInfo.cpp#L71-L90). see also https://github.com/rust-lang/rust/issues/93335#issuecomment-2395210055). Refs: - [Section 6 "Conditional Execution" in Qualcomm Hexagon V73 Programmer’s Reference Manual](https://docs.qualcomm.com/bundle/publicresource/80-N2040-53_REV_AB_Qualcomm_Hexagon_V73_Programmers_Reference_Manual.pdf#page=90) - [Register definition in LLVM](https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/Hexagon/HexagonRegisterInfo.td#L155) cc `@androm3da` (target maintainer of hexagon-unknown-{[none-elf](https://doc.rust-lang.org/nightly/rustc/platform-support/hexagon-unknown-none-elf.html#target-maintainers),[linux-musl](https://doc.rust-lang.org/nightly/rustc/platform-support/hexagon-unknown-linux-musl.html#target-maintainers)}) r? `@Amanieu` `@rustbot` label +A-inline-assembly (Currently there is no O-hexagon label...)	2024-11-28 12:06:02 +01:00
Matthias Krüger	adf9b5fcd1	Rollup merge of #133418 - Zalathar:spans, r=jieyouxu coverage: Store coverage source regions as `Span` until codegen Historically, coverage spans were converted into line/column coordinates during the MIR instrumentation pass. This PR moves that conversion step into codegen, so that coverage spans spend most of their time stored as `Span` instead. In addition to being conceptually nicer, this also reduces the size of coverage mappings in MIR, because `Span` is smaller than 4x u32. --- There should be no changes to coverage output.	2024-11-27 22:23:25 +01:00
Nikita Popov	d3ad000943	Respect verify-llvm-ir option in the backend We are currently unconditionally verifying the LLVM IR in the backend (twice), ignoring the value of the verify-llvm-ir option.	2024-11-26 15:26:03 +01:00
beetrees	68227a3777	Pass end position of span through inline ASM cookie	2024-11-26 13:00:08 +00:00
Taiki Endo	59f01cdbf4	Support predicate registers (clobber-only) in Hexagon inline assembly	2024-11-25 23:11:17 +09:00
Matthias Krüger	3f86eddf83	Rollup merge of #131664 - taiki-e:s390x-asm-vreg-inout, r=Amanieu Support input/output in vector registers of s390x inline assembly (under asm_experimental_reg feature) This extends currently clobber-only vector registers (`vreg`) support to allow passing `#[repr(simd)]` types, floats (f32/f64/f128), and integers (i32/i64/i128) as input/output. This is unstable and gated under new `#![feature(asm_experimental_reg)]` (tracking issue: https://github.com/rust-lang/rust/issues/133416). If the feature is not enabled, only clobber is supported as before. \| Architecture \| Register class \| Target feature \| Allowed types \| \| ------------ \| -------------- \| -------------- \| -------------- \| \| s390x \| `vreg` \| `vector` \| `i32`, `f32`, `i64`, `f64`, `i128`, `f128`, `i8x16`, `i16x8`, `i32x4`, `i64x2`, `f32x4`, `f64x2` \| This matches the list of types that are supported by the vector registers in LLVM: https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/SystemZ/SystemZRegisterInfo.td#L301-L313 In addition to `core::simd` types and floats listed above, custom `#[repr(simd)]` types of the same size and type are also allowed. All allowed types other than i32/f32/i64/f64/i128, and relevant target features are currently unstable. Currently there is no SIMD type for s390x in `core::arch`, but this is tracked in https://github.com/rust-lang/rust/issues/130869. cc https://github.com/rust-lang/rust/issues/130869 about vector facility support in s390x cc https://github.com/rust-lang/rust/issues/125398 & https://github.com/rust-lang/rust/issues/116909 about f128 support in asm `@rustbot` label +O-SystemZ +A-inline-assembly	2024-11-25 07:01:37 +01:00
Matthias Krüger	c5230d1148	Rollup merge of #131523 - nbdd0121:asm, r=compiler-errors Fix asm goto with outputs and move it to a separate feature gate Tracking issue: #119364 This PR addresses 3 aspects of asm goto with outputs: * Codegen is fixed. My initial implementation had an oversight which caused the output to be stored only in the fallthrough path, but not in label blocks. * Outputs can now be used with `options(noreturn)` if a label block is given. * All of this is moved to a new feature gate, because we likely want to stabilise `asm_goto` before asm goto with outputs. `@rustbot` labels: +A-inline-assembly +F-asm	2024-11-25 07:01:37 +01:00
许杰友 Jieyou Xu (Joe)	8d20d71256	Rollup merge of #133297 - DianQK:embed-bitcode-ios, r=nikic Remove legacy bitcode for iOS Follow #117364.	2024-11-25 00:39:05 +08:00
Gary Guo	73f8309300	Support use of asm goto with outputs and `options(noreturn)` When labels are present, the `noreturn` option really means that the asm block won't fall through -- if labels are present, then outputs can still be meaningfully used.	2024-11-24 14:18:10 +00:00
Gary Guo	b8df869ebb	Fix asm goto with outputs When outputs are used together with labels, they are considered to be written for all destinations, not only when falling through.	2024-11-24 14:18:10 +00:00
Zalathar	2748009aad	coverage: Identify source files by ID, not by interned filename	2024-11-24 23:46:41 +11:00
Zalathar	b9fb1a69d2	coverage: Store coverage source regions as `Span` until codegen	2024-11-24 23:46:39 +11:00
Taiki Endo	c024d8ccdf	Make s390x non-clobber-only vector register support unstable	2024-11-24 21:42:22 +09:00
Zalathar	87fe7def12	coverage: Rename some FFI fields from `span` to `cov_span` This will avoid confusion with actual `Span` spans.	2024-11-24 23:29:02 +11:00
Zalathar	619a272612	coverage: Ignore functions that end up having no mappings A used function with no mappings has historically indicated a bug, but that will no longer be the case after moving some fallible span-processing steps into codegen.	2024-11-24 23:28:02 +11:00
DianQK	3a23669787	embed-bitcode is no longer used in iOS	2024-11-24 15:51:47 +08:00
Caleb Zulawski	e73e9f9af2	Add simd_relaxed_fma intrinsic	2024-11-23 14:39:42 -05:00
许杰友 Jieyou Xu (Joe)	c6d36256a6	Rollup merge of #127483 - BertalanD:no_sanitize-global-var, r=rcvalle Allow disabling ASan instrumentation for globals AddressSanitizer adds instrumentation to global variables unless the [`no_sanitize_address`](https://llvm.org/docs/LangRef.html#global-attributes) attribute is set on them. This commit extends the existing `#[no_sanitize(address)]` attribute to set this; previously it only had the desired effect on functions. (cc https://github.com/rust-lang/rust/issues/39699)	2024-11-23 20:19:51 +08:00
Taiki Endo	2c8f6de1ba	Support input/output in vector registers of s390x inline assembly	2024-11-22 04:18:14 +09:00
Kyle Huey	f5b023bd9c	When the required discriminator value exceeds LLVM's limits, drop the debug info for the function instead of panicking. The maximum discriminator value LLVM can currently encode is 2^12. If macro use results in more than 2^12 calls to the same function attributed to the same callsite, and those calls are MIR-inlined, we will require more than the maximum discriminator value to completely represent the debug information. Once we reach that point drop the debug info instead.	2024-11-19 05:19:09 -08:00
Kyle Huey	1e4ebb0ccd	Honor collapse_debuginfo when dealing with MIR-inlined functions inside macros. The test relies on the fact that inlining more than 2^12 calls at the same callsite will trigger a panic (and after the following commit, a warning) due to LLVM limitations but with collapse_debuginfo the callsites should not be the same.	2024-11-19 05:18:56 -08:00
lcnr	9cba14b95b	use `TypingEnv` when no `infcx` is available the behavior of the type system not only depends on the current assumptions, but also the current phase of the compiler. This is mostly necessary as we need to decide whether and how to reveal opaque types. We track this via the `TypingMode`.	2024-11-18 10:38:56 +01:00
Jiri Bobek	777003ae9f	Likely unlikely fix	2024-11-17 21:49:10 +01:00
bors	3bc6916f4c	Auto merge of #132965 - mati865:cfguard-gnullvm, r=wesleywiser allow CFGuard on windows-gnullvm No unit tests because of https://github.com/rust-lang/rust/issues/132278	2024-11-15 00:21:07 +00:00
Matthias Krüger	bd79fe7a94	Rollup merge of #132702 - 1c3t3a:issue-132615, r=rcvalle CFI: Append debug location to CFI blocks Currently we're not appending debug locations to the inserted CFI blocks. This shows up in #132615 and #100783. This change fixes that by passing down the debug location to the CFI type-test generation and appending it to the blocks. Credits also belong to `@jakos-sec` who worked with me on this.	2024-11-12 23:26:41 +01:00
Mateusz Mikuła	811c1db715	allow CFGuard on windows-gnullvm	2024-11-12 01:18:53 +01:00
Matthias Krüger	35225d61f4	Rollup merge of #132820 - bjorn3:default_backend_link_impl, r=jieyouxu Add a default implementation for CodegenBackend::link As a side effect this should add raw-dylib support to cg_gcc as the default ArchiveBuilderBuilder that is used implements create_dll_import_lib. I haven't tested if the raw-dylib support actually works however.	2024-11-11 21:58:32 +01:00
Bastian Kersting	c2102259a0	CFI: Append debug location to CFI blocks	2024-11-11 09:17:43 +00:00
bors	71042b4b20	Auto merge of #132880 - RalfJung:implied-features, r=workingjubilee target_features: explain what exactly 'implied' means here	2024-11-11 09:12:03 +00:00
Ralf Jung	2c7f3badcf	target_features: explain what exactly 'implied' means here	2024-11-11 07:33:39 +01:00
Matthias Krüger	b95232dabb	Rollup merge of #132675 - Zalathar:empty-spans, r=jieyouxu coverage: Restrict empty-span expansion to only cover `{` and `}` Coverage instrumentation has some tricky code for converting a coverage-relevant `Span` into a set of start/end line/byte-column coordinates that will be embedded in the CGU's coverage metadata. A big part of this complexity is special code for handling empty spans, which are expanded into non-empty spans (if possible) because LLVM's coverage reporter does not handle empty spans well. This PR simplifies that code by restricting it to only apply in two specific situations: when the character after the empty span is `{`, or the character before the empty span is `}`. (As an added benefit, this means that the expanded spans no longer extend awkwardly beyond the end of a physical line, which was common under the previous implementation.) Along the way, this PR also removes some unhelpful code for dealing with function source code spread across multiple files. Functions currently can't have coverage spans in multiple files, and if that ever changes (e.g. to properly support expansion regions) then this code will need to be completely overhauled anyway.	2024-11-10 17:43:07 +01:00
Zalathar	925dfc8608	coverage: Pass a `LocalFileId` to `CoverageSpan::from_source_region`	2024-11-10 11:58:44 +11:00
bjorn3	0a619dbc5d	Pass owned CodegenResults to link_binary After link_binary the temporary files referenced by CodegenResults are deleted, so calling link_binary again with the same CodegenResults should not be allowed.	2024-11-09 21:22:00 +00:00
Kyle Huey	1dc106121b	Add discriminators to DILocations when multiple functions are inlined into a single point. LLVM does not expect to ever see multiple dbg_declares for the same variable at the same location with different values. proc-macros make it possible for arbitrary code, including multiple calls that get inlined, to happen at any given location in the source code. Add discriminators when that happens so these locations are different to LLVM. This may interfere with the AddDiscriminators pass in LLVM, which is added by the unstable flag -Zdebug-info-for-profiling. Fixes #131944	2024-11-09 08:01:31 -08:00
bors	80445576d0	Auto merge of #132800 - matthiaskrgr:rollup-c1kkj56, r=matthiaskrgr Rollup of 5 pull requests Successful merges: - #132552 (Add v9, v8plus, and leoncasa target feature to sparc and use v8plus in create_object_file) - #132745 (pointee_info_at: fix logic for recursing into enums) - #132777 (try_question_mark_nop: update test for LLVM 20) - #132785 (rustc_target: more target string fixes for LLVM 20) - #132794 (Use a separate dir for r-a builds consistently in helix config) r? `@ghost` `@rustbot` modify labels: rollup	2024-11-09 12:23:47 +00:00
Matthias Krüger	b9d4ef16c9	Rollup merge of #132552 - taiki-e:sparc-target-feature, r=workingjubilee Add v9, v8plus, and leoncasa target feature to sparc and use v8plus in create_object_file This adds the following three unstable target features: - `v9`: SPARC-V9 instructions ([LLVM definition][sparc-v9]) - Relevant to https://github.com/rust-lang/rust/pull/131222#issuecomment-2453310963 - Relevant to https://github.com/rust-lang/rust/pull/132472#discussion_r1832606081 - This is also needed to implement https://github.com/taiki-e/atomic-maybe-uninit/pull/31 (depends on inline assembly support) more robustly. - `v8plus`: SPARC-V8+ ABI ([LLVM definition][sparc-v8plus]) - This is added in LLVM 20. In LLVM 19 and older, it is emulated to work the same way as LLVM in each LLVM version. - See https://github.com/rust-lang/rust/issues/132585#issuecomment-2453926257 for more. - `leoncasa`: CASA instruction[^1] of LEON3 and LEON4 processors ([LLVM definition][sparc-leoncasa], LLVM feature name: `hasleoncasa`) - This is needed to implement https://github.com/taiki-e/atomic-maybe-uninit/pull/31 (depends on inline assembly support) more robustly. [^1]: Atomic CAS instruction [sparc-v9]: `f5e4ffaa49/llvm/lib/Target/Sparc/Sparc.td (L37-L39)` [sparc-v8plus]: `f5e4ffaa49/llvm/lib/Target/Sparc/Sparc.td (L37-L39)` [sparc-leoncasa]: https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/Sparc/LeonFeatures.td#L32-L37	2024-11-09 10:52:03 +01:00
bors	4b198d6871	Auto merge of #132584 - Zalathar:includes, r=cuviper Trim and tidy includes in `rustc_llvm` These includes tend to accumulate over time, and are usually only removed when something breaks in a new LLVM version, so it's nice to clean them up manually once in a while. General strategy used for this PR: - Remove all includes from `LLVMWrapper.h` that aren't needed by the header itself, transplanting them to individual source files as necessary. - For each source file, temporarily remove each include if doing so doesn't cause a compile error. - If a “required” include looks like it shouldn't be needed, try replacing it with its sub-includes, then trim that list. - After doing all of the above, go back and re-add any removed include if the file does actually use things defined in that header, even if the header happens to also be included by something else.	2024-11-09 09:46:08 +00:00
Zalathar	89d7efaf8f	Make `RustString` an extern type to avoid `improper_ctypes` warnings	2024-11-09 11:07:44 +11:00
Taiki Endo	c059eb7750	Add v8plus target feature to sparc and use it in create_object_file	2024-11-09 03:22:09 +09:00
Taiki Endo	400a690b5f	Add v9 and leoncasa target feature to sparc	2024-11-09 03:17:24 +09:00
Zalathar	996bdabc2a	coverage: Remove unhelpful code for handling multiple files per function Functions currently can't have mappings in multiple files, and if that ever changes (e.g. to properly support expansion regions), this code will need to be completely overhauled anyway.	2024-11-08 20:43:08 +11:00
Zalathar	3f9c54caf0	coverage: Add `GlobalFileId` for stricter type-checking of file IDs We already had a dedicated `LocalFileId` index type, but previously we used a raw `u32` for global file IDs, because index types were harder to pass through FFI.	2024-11-08 20:43:08 +11:00
Stuart Cook	3a48d80155	Rollup merge of #132590 - Zalathar:z-timings-stats, r=jieyouxu Simplify FFI calls for `-Ztime-llvm-passes` and `-Zprint-codegen-stats` The existing code for these unstable LLVM-infodump flags was jumping through hoops to pass an allocated C string across the FFI boundary, when it's much simpler to just write to a `&RustString` instead.	2024-11-08 18:51:30 +11:00
Stuart Cook	758a904764	Rollup merge of #132452 - Zalathar:llvm-cov-wrappers, r=jieyouxu coverage: Extract safe FFI wrapper functions to `llvm_cov` This PR takes all of the inline `unsafe` calls in coverage codegen, and all the safe wrapper functions in `coverageinfo/mod.rs`, and moves them to a new `llvm_cov` submodule that is dedicated to safe FFI wrapper functions. This reduces the mixing of abstraction levels in the rest of coverage codegen. As a follow-up, this PR also tidies up the names and signatures of several of the coverage FFI functions.	2024-11-08 18:51:29 +11:00
Jubilee	97dbab9124	Rollup merge of #132741 - zmodem:mips_data_layout, r=nikic Update mips64 data layout to match LLVM 20 change LLVM changed the data layout in https://github.com/llvm/llvm-project/pull/112084	2024-11-07 18:48:26 -08:00
Jubilee	60e8ab6ba8	Rollup merge of #130586 - dpaoliello:fixrawdylib, r=wesleywiser Set "symbol name" in raw-dylib import libraries to the decorated name `windows-rs` received a bug report that mixing raw-dylib generated and the Windows SDK import libraries was causing linker failures: <https://github.com/microsoft/windows-rs/issues/3285> The root cause turned out to be #124958, that is we are not including the decorated name in the import library and so the import name type is also not being correctly set. This change modifies the generation of import libraries to set the "symbol name" to the fully decorated name and correctly marks the import as being data vs function. Note that this also required some changes to how the symbol is named within Rust: for MSVC we now need to use the decorated name but for MinGW we still need to use partially decorated (or undecorated) name. Fixes #124958 Passing i686 MSVC and MinGW build: <https://github.com/rust-lang/rust/actions/runs/11000433888?pr=130586> r? `@ChrisDenton`	2024-11-07 18:48:20 -08:00
Hans Wennborg	eb7d95bafd	remove the extra specification for llvm versions < 20	2024-11-07 20:59:50 +01:00
Taiki Endo	241f82ad91	Basic inline assembly support for SPARC and SPARC64	2024-11-07 21:19:03 +09:00
Matt Weber	8286299742	Clean up use requirements after rebasing	2024-11-06 22:26:18 -05:00
Matt Weber	f9ac7aca5d	Add location info for f16	2024-11-06 22:26:18 -05:00
Matt Weber	21c58b1b2c	Rename option and add doc	2024-11-06 22:26:18 -05:00
Matt Weber	4692d46a46	Add additional option checks	2024-11-06 22:26:17 -05:00
Matt Weber	a4833a8089	Move additional source location info behind -Z option	2024-11-06 22:26:17 -05:00
Matt Weber	27b1b01daa	Refactor `type_stub` from `DefId` to tuple	2024-11-06 22:25:04 -05:00
Matt Weber	aa485fc2a1	Add file and line metadata for enum variant and fields	2024-11-06 22:25:04 -05:00
Matt Weber	af6b0deaf3	Add file and line metadata for struct/union members	2024-11-06 22:25:04 -05:00
Matt Weber	c07797a854	Add file and line metadata for coroutines	2024-11-06 22:24:58 -05:00
Matt Weber	f3da828185	Refactor `type_map::stub` parameters Push span lookup into `type_map::stub` and pass the `DefId` instead of doing the lookup outside and passing in the location metadata.	2024-11-06 22:12:03 -05:00
Matt Weber	94669d9d47	Add file and line metadata for closures	2024-11-06 22:12:03 -05:00
Matt Weber	2f00b6affd	Require `type_map::stub` callers to supply file information This change attaches file information (`DIFile` reference and line number) to struct debug info nodes. Before: ``` ; foo.ll ... !5 = !DIFile(filename: "<unknown>", directory: "") ... !16 = !DICompositeType(tag: DW_TAG_structure_type, name: "MyType", scope: !2, file: !5, size: 32, align: 32, elements: !17, templateParams: !19, identifier: "4cb373851db92e732c4cb5651b886dd0") ... ``` After: ``` ; foo.ll ... !3 = !DIFile(filename: "foo.rs", directory: "/home/matt/src/rust98678", checksumkind: CSK_SHA1, checksum: "bcb9f08512c8f3b8181ef4726012bc6807bc9be4") ... !16 = !DICompositeType(tag: DW_TAG_structure_type, name: "MyType", scope: !2, file: !3, line: 3, size: 32, align: 32, elements: !17, templateParams: !19, identifier: "9e5968c7af39c148acb253912b7f409f") ... ``` Fixes #98678	2024-11-06 22:12:02 -05:00
bors	a69df72bdc	Auto merge of #132664 - matthiaskrgr:rollup-i27nr7i, r=matthiaskrgr Rollup of 5 pull requests Successful merges: - #131261 (Stabilize `UnsafeCell::from_mut`) - #131405 (bootstrap/codegen_ssa: ship llvm-strip and use it for -Cstrip) - #132077 (Add a new `wide-arithmetic` feature for WebAssembly) - #132562 (Remove the `wasm32-wasi` target from rustc) - #132660 (Remove unused errs.rs file) Failed merges: - #131721 (Add new unstable feature `const_eq_ignore_ascii_case`) r? `@ghost` `@rustbot` modify labels: rollup	2024-11-06 01:21:42 +00:00
Matthias Krüger	088e698835	Rollup merge of #132077 - alexcrichton:wide-arithmetic, r=jieyouxu Add a new `wide-arithmetic` feature for WebAssembly This commit adds a new rustc target feature named `wide-arithmetic` for WebAssembly targets. This corresponds to the [wide-arithmetic] proposal for WebAssembly which adds new instructions catered towards accelerating integer arithmetic larger than 64-bits. This proposal to WebAssembly is not standard yet so this new feature is flagged as an unstable target feature. Additionally Rust's LLVM version doesn't support this new feature yet since support will first be added in LLVM 20, so the feature filtering logic for LLVM is updated to handle this. I'll also note that I'm not currently planning to add wasm-specific intrinsics to `std::arch::wasm32` at this time. The currently proposed instructions are all accessible through `i128` or `u128`-based operations which Rust already supports, so intrinsic shouldn't be necessary to get access to these new instructions. [wide-arithmetic]: https://github.com/WebAssembly/wide-arithmetic	2024-11-05 23:43:57 +01:00
Matthias Krüger	c8247c0a19	Rollup merge of #132259 - mrkajetanp:branch-protection-pauth-lr, r=davidtwco rustc_codegen_llvm: Add a new 'pc' option to branch-protection Add a new 'pc' option to -Z branch-protection for aarch64 that enables the use of PC as a diversifier in PAC branch protection code. When the pauth-lr target feature is enabled in combination with -Z branch-protection=pac-ret,pc, the new 9.5-a instructions (pacibsppc, retaasppc, etc) will be generated.	2024-11-05 20:10:49 +01:00
bors	e8c698bb3b	Auto merge of #129884 - RalfJung:forbidden-target-features, r=workingjubilee mark some target features as 'forbidden' so they cannot be (un)set with -Ctarget-feature The context for this is https://github.com/rust-lang/rust/issues/116344: some target features change the way floats are passed between functions. Changing those target features is unsound as code compiled for the same target may now use different ABIs. So this introduces a new concept of "forbidden" target features (on top of the existing "stable" and "unstable" categories), and makes it a hard error to (un)set such a target feature. For now, the x86 and ARM feature `soft-float` is on that list. We'll have to make some effort to collect more relevant features, and similar features from other targets, but that can happen after the basic infrastructure for this landed. (These features are being collected in https://github.com/rust-lang/rust/issues/131799.) I've made this a warning for now to give people some time to speak up if this would break something. MCP: https://github.com/rust-lang/compiler-team/issues/780	2024-11-05 16:25:45 +00:00
Zalathar	19d5dc0ed1	coverage: Tidy up coverage-specific FFI functions	2024-11-05 15:32:36 +11:00
Zalathar	b790e4473c	coverage: Extract safe FFI wrapper functions to `llvm_cov`	2024-11-05 15:32:34 +11:00
bors	96477c55bc	Auto merge of #131341 - taiki-e:ppc-clobber-abi, r=bzEq,workingjubilee Support clobber_abi and vector registers (clobber-only) in PowerPC inline assembly This supports `clobber_abi` which is one of the requirements of stabilization mentioned in #93335. This basically does a similar thing I did in https://github.com/rust-lang/rust/pull/130630 to implement `clobber_abi` for s390x, but for powerpc/powerpc64/powerpc64le. - This also supports vector registers (as `vreg`) as clobber-only, which need to support clobbering of them to implement `clobber_abi`. - `vreg` should be able to accept `#[repr(simd)]` types as input/output if the unstable `altivec` target feature is enabled, but `core::arch::{powerpc,powerpc64}` vector types, `#[repr(simd)]`, and `core::simd` are all unstable, so the fact that this is currently a clobber-only should not be considered a blocker of clobber_abi implementation or stabilization. So I have not implemented it in this PR. - See https://github.com/rust-lang/rust/pull/131551 (which is based on this PR) for a PR to implement this. - (I'm not sticking to whether that PR should be a separate PR or part of this PR, so I can merge that PR into this PR if needed.) 
Refs: - PPC32 SysV: Section "Function Calling Sequence" in [System V Application Binary Interface PowerPC Processor Supplement](https://refspecs.linuxfoundation.org/elf/elfspec_ppc.pdf) - PPC64 ELFv1: Section 3.2 "Function Calling Sequence" in [64-bit PowerPC ELF Application Binary Interface Supplement](https://refspecs.linuxfoundation.org/ELF/ppc64/PPC-elf64abi.html#FUNC-CALL) - PPC64 ELFv2: Section 2.2 "Function Calling Sequence" in [64-Bit ELF V2 ABI Specification](https://openpowerfoundation.org/specifications/64bitelfabi/) - AIX: [Register usage and conventions](https://www.ibm.com/docs/en/aix/7.3?topic=overview-register-usage-conventions), [Special registers in the PowerPC®](https://www.ibm.com/docs/en/aix/7.3?topic=overview-special-registers-in-powerpc), [AIX vector programming](https://www.ibm.com/docs/en/aix/7.3?topic=concepts-aix-vector-programming) - Register definition in LLVM: https://github.com/llvm/llvm-project/blob/llvmorg-19.1.0/llvm/lib/Target/PowerPC/PPCRegisterInfo.td#L189 If I understand the above four ABI documentations correctly, except for the PPC32 SysV's VR (Vector Registers) and 32-bit AIX (currently not supported by rustc)'s r13, there does not appear to be important differences in terms of implementing `clobber_abi`: - The above four ABIs are consistent about FPR (0-13: volatile, 14-31: nonvolatile), CR (0-1,5-7: volatile, 2-4: nonvolatile), XER (volatile), and CTR (volatile). 
- As for GPR, only the registers we are treating as reserved are slightly different - r0, r3-r12 are volatile - r1(sp, reserved), r14-31 are nonvolatile - r2(reserved) is TOC pointer in PPC64 ELF/AIX, system-reserved register in PPC32 SysV (AFAIK used as thread pointer in Linux/BSDs) - r13(reserved for non-32-bit-AIX) is thread pointer in PPC64 ELF, small data area pointer register in PPC32 SysV, "reserved under 64-bit environment; not restored across system calls[^r13]" in AIX) - As for FPSCR, volatile in PPC64 ELFv1/AIX, some fields are volatile only in certain situations (rest are volatile) in PPC32 SysV/PPC64 ELFv2. - As for VR (Vector Registers), it is not mentioned in PPC32 SysV, v0-v19 are volatile in both in PPC64 ELF/AIX, v20-v31 are nonvolatile in PPC64 ELF, reserved or nonvolatile depending on the ABI ([vec-extabi vs vec-default in LLVM](https://reviews.llvm.org/D89684), we are [using vec-extabi](https://github.com/rust-lang/rust/pull/131341#discussion_r1797693299)) in AIX: > When the default Vector enabled mode is used, these registers are reserved and must not be used. > In the extended ABI vector enabled mode, these registers are nonvolatile and their values are preserved across function calls I left [FIXME comment about PPC32 SysV](https://github.com/rust-lang/rust/pull/131341#discussion_r1790496095) and added ABI check for AIX. - As for VRSAVE, it is not mentioned in PPC32 SysV, nonvolatile in PPC64 ELFv1, reserved in PPC64 ELFv2/AIX - As for VSCR, it is not mentioned in PPC32 SysV/PPC64 ELFv1, some fields are volatile only in certain situations (rest are volatile) in PPC64 ELFv2, volatile in AIX We are currently treating r1-r2, r13 (non-32-bit-AIX), r29-r31, LR, CTR, and VRSAVE as reserved. We are currently not processing anything about FPSCR and VSCR, but I feel those are things that should be processed by `preserves_flags` rather than `clobber_abi` if we need to do something about them. 
(However, PPCRegisterInfo.td in LLVM does not seem to define anything about them.) Replaces #111335 and #124279 cc `@ecnelises` `@bzEq` `@lu-zero` r? `@Amanieu` `@rustbot` label +O-PowerPC +A-inline-assembly [^r13]: callee-saved, according to [LLVM](`6a6af0246b/llvm/lib/Target/PowerPC/PPCCallingConv.td (L322)`) and [GCC](`a9173a50e7/gcc/config/rs6000/rs6000.h (L859)`).	2024-11-05 03:13:47 +00:00
Ralf Jung	ffad9aac27	mark some target features as 'forbidden' so they cannot be (un)set For now, this is just a warning, but should become a hard error in the future	2024-11-04 22:56:47 +01:00
Zalathar	5bfa0b106e	Simplify FFI calls for `-Ztime-llvm-passes` and `-Zprint-codegen-stats`	2024-11-04 20:31:16 +11:00
Jubilee	7155c65d68	Rollup merge of #132565 - bjorn3:less_target_name_dependence, r=workingjubilee Reduce dependence on the target name The target name can be anything with custom target specs. Matching on fields inside the target spec is much more robust than matching on the target name. Also remove the unused is_builtin target spec field.	2024-11-03 20:08:14 -08:00
Zalathar	44a056a50b	Move `LLVMRustAttribute[Kind]` out of `LLVMWrapper.h`	2024-11-04 12:27:23 +11:00
Jubilee Young	b895bf4fdc	compiler: Directly use rustc_abi in codegen	2024-11-03 12:30:32 -08:00
bjorn3	9e6d2da83d	Reduce dependence on the target name The target name can be anything with custom target specs. Matching on fields inside the target spec is much more robust than matching on the target name.	2024-11-03 18:29:01 +00:00
bors	59ae5eba7e	Auto merge of #132514 - Zalathar:print-target-cpus, r=jieyouxu Port most of `--print=target-cpus` to Rust The logic and formatting needed by `--print=target-cpus` has historically been carried out in C++ code. Originally it used `printf` to write directly to the console, but later it switched over to writing to a `std::ostringstream` and then passing its buffer to a callback function pointer. This PR replaces that C++ code with a very simple function that writes a list of CPU names to a `&RustString`, with the rest of the logic and formatting being handled by ordinary safe Rust code.	2024-11-03 11:09:38 +00:00
Daniel Bertalan	204b2281fa	Allow disabling ASan instrumentation for globals AddressSanitizer adds instrumentation to global variables unless the [`no_sanitize_address`](https://llvm.org/docs/LangRef.html#global-attributes) attribute is set on them. This commit extends the existing `#[no_sanitize(address)]` attribute to set this; previously it only had the desired effect on functions.	2024-11-02 22:35:34 +01:00
Noratrieb	a26450cf81	Rename target triple to target tuple in many places in the compiler This changes the naming to the new naming, used by `--print target-tuple`. It does not change all locations, but many.	2024-11-02 21:29:59 +01:00
Zalathar	90f2075b66	Port most of `LLVMRustPrintTargetCPUs` to Rust	2024-11-02 23:39:29 +11:00
Zalathar	0fa86f9660	Use a dedicated safe wrapper for `LLVMRustGetHostCPUName`	2024-11-02 23:39:29 +11:00
Taiki Endo	d19517dcd0	Support clobber_abi and vector registers (clobber-only) in PowerPC inline assembly	2024-11-02 20:26:08 +09:00
Matthias Krüger	bb544f863f	Rollup merge of #131037 - madsmtm:move-llvm-target-versioning, r=petrochenkov Move versioned Apple LLVM targets from `rustc_target` to `rustc_codegen_ssa` Fully specified LLVM targets contain the OS version on macOS/iOS/tvOS/watchOS/visionOS, and this version depends on the deployment target environment variables like `MACOSX_DEPLOYMENT_TARGET`, `IPHONEOS_DEPLOYMENT_TARGET` etc. We would like to move this to later in the compilation pipeline, both because it feels impure to access environment variables when fetching target information, but mostly because we need access to more information from https://github.com/rust-lang/rust/pull/130883 to do https://github.com/rust-lang/rust/issues/118204. See also https://github.com/rust-lang/rust/pull/129342#issuecomment-2335156119 for some discussion. The first and second commit does the actual refactor, it should be a non-functional change, the third commit adds diagnostics for invalid deployment targets, which are now possible to do because we have access to the session. Tested with the same commands as in https://github.com/rust-lang/rust/pull/130435. r? `@petrochenkov`	2024-11-02 08:33:10 +01:00
Guillaume Gomez	526c67f37b	Rollup merge of #131829 - Zalathar:goodbye-zprofile, r=chenyukang Remove support for `-Zprofile` (gcov-style coverage instrumentation) Tracking issue: #42524 MCP: https://github.com/rust-lang/compiler-team/issues/798 --- This PR removes the unstable `-Zprofile` flag, which enables “gcov-style” coverage instrumentation, along with its associated `-Zprofile-emit` configuration flag. (The profile flag predates and is almost entirely separate from the stable `-Cinstrument-coverage` flag.) Notably, the `-Zprofile` flag: - Is largely untested in-tree, having only one run-make test that does not check whether its output is correct or useful. - Has no known maintainer. - Has seen no push towards stabilization. - Has at least one severe regression reported in 2022 that apparently remains unaddressed. - #100125 - Is confusingly named, since it appears to be more about coverage than performance profiling, and has nothing to do with PGO. - Is fundamentally limited by relying on counters auto-inserted by LLVM, with no knowledge of Rust beyond debuginfo.	2024-11-02 03:08:49 +08:00
Mads Marquart	e1233153ac	Move versioned LLVM target creation to rustc_codegen_ssa The OS version depends on the deployment target environment variables, the access of which we want to move to later in the compilation pipeline that has access to more information, for example `env_depinfo`.	2024-11-01 17:07:18 +01:00
Matthew Maurer	9caced7bad	llvm: Match new LLVM 128-bit integer alignment on sparc LLVM continues to align more 128-bit integers to 128-bits in the data layout rather than relying on the high level language to do it. Update SPARC target files to match and add a backcompat replacement for current LLVMs. See llvm/llvm-project#106951 for details	2024-10-31 20:37:54 +00:00
Kajetan Puchalski	10edeea4b4	rustc_codegen_llvm: Add a new 'pc' option to branch-protection Add a new 'pc' option to -Z branch-protection for aarch64 that enables the use of PC as a diversifier in PAC branch protection code. When the pauth-lr target feature is enabled in combination with -Z branch-protection=pac-ret,pc, the new 9.5-a instructions (pacibsppc, retaasppc, etc) will be generated.	2024-10-31 11:59:17 +00:00

2759 Commits