nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-04-27 18:56:24 +00:00

Author	SHA1	Message	Date
Matthias Krüger	4696a92183	Rollup merge of #111322 - mirkootter:master, r=davidtwco Support for native WASM exceptions ### Motivation Currently, rustc does not support native WASM exceptions. It does support JavaScript-based exceptions for the wasm32-emscripten target, but this requires back-and-forth with JavaScript for many calls, which is very slow. Native wasm support for exceptions is quite common: Clang+LLVM implemented them years ago, and all major browsers support them by now. They enable zero-cost exceptions, at least with regard to runtime performance cost. They may increase startup time and code size, though. ### Important: This PR does not change default behaviour Exceptions usually add a lot of code in the form of unwinding blocks, increasing the binary size. Most users probably do not want that, especially with regard to web development. Therefore, wasm exceptions play a similar role to WASM threads: rustc should support them, like clang does, but users who want to use them have to use some command-line magic like rustflags to opt in. ### What does this PR do? As stated above, the default behaviour is not changed. It is already possible to opt in to wasm exceptions using the command line. Unfortunately, the resulting LLVM IR is invalid and the LLVM backend crashes. ``` rustc <sourcefile> --target wasm32-unknown-unknown -C panic=unwind -C llvm-args=-wasm-enable-eh -C target-feature=+exception-handling ``` As it turns out, LLVM is quite picky when it comes to IR for exception handling. If the IR does not look exactly like it should, some LLVM assertions fail and the code generation crashes. This PR adds the necessary modifications to the code generator to make it work. It also adds `exception-handling` as a wasm target feature. ### What this PR does not / what is missing This PR is not a full-fledged solution. It is the first step. A few parts are still missing; however, it is already usable (see next section). Currently missing: * The std library has to be adapted. 
Currently, only [no_std] crates work * Usually, nested exceptions abort the program (i.e. a panic during the cleanup of another panic). This is currently not done yet. - Currently, code inside cleanup handlers does not unwind - To fix this requires a little more work: The code generator currently maintains a single terminate block per function for this. Unfortunately, WASM requires funclet based exception handling. Therefore, we need to create a terminate block per funclet. This is probably not a big problem, but I want to keep this PR simple. ### How to use the compiler given this PR? This PR does not add any command line flags or features. It uses those which are already there. To compile with exceptions enabled, you need * to set the panic strategy to unwind, i.e. `-C panic=unwind` * to enable the exception-handling target feature, i.e. `-C target-feature=+exception-handling` * to tell LLVM about the exception handling, i.e. `-C llvm-args=-wasm-enable-eh` Since the standard library has not been adapted, you can only use it in [no_std] crates as of now. The intrinsic `core::intrinsics::r#try` works. To throw exceptions, you need the ```@llvm.wasm.throw``` intrinsic. I created a sample application which works for me: https://github.com/mirkootter/rust-wasm-demos This example can be run at https://webassembly.sh	2023-06-29 16:36:30 +02:00
Takayuki Maeda	bad0688563	Rollup merge of #112946 - nnethercote:improve-cgu-naming-and-ordering, r=wesleywiser Improve cgu naming and ordering Some quality of life improvements when debugging and profiling CGU formation. r? `@wesleywiser`	2023-06-29 03:29:32 +09:00
bors	bb95b7dcd6	Auto merge of #112307 - lcnr:operand-ref, r=compiler-errors mir opt + codegen: handle subtyping fixes #107205 the same issue was caused in multiple places: - mir opts: both copy and destination propagation - codegen: assigning operands to locals (which also propagates values) I changed codegen to always update the type in the operands used for locals which should guard against any new occurrences of this bug going forward. I don't know how to make mir optimizations more resilient here. Hopefully the added tests will be enough to detect any trivially wrong optimizations going forward.	2023-06-28 00:41:37 +00:00
Matthias Krüger	1880e83ae3	Rollup merge of #112207 - qwandor:virt_feature, r=davidtwco Add trustzone and virtualization target features for aarch32. These are LLVM target features which allow the `smc` and `hvc` instructions respectively to be used in inline assembly.	2023-06-27 22:10:12 +02:00
Nicholas Nethercote	666b1b68a7	Tweak thread names for CGU processing. For non-incremental builds on Unix, currently all the thread names look like `opt regex.f10ba03eb5ec7975-cgu.0`. But they are truncated by `pthread_setname` to `opt regex.f10ba`, hiding the numeric suffix that distinguishes them. This is really annoying when using a profiler like Samply. This commit changes these thread names to a form like `opt cgu.0`, which is much better.	2023-06-26 09:14:45 +10:00
Nicholas Nethercote	fae4f45214	Remove unused fields from `CodegenContext`.	2023-06-22 09:07:19 +10:00
Nicholas Nethercote	3bbf9f0128	Introduce `CodegenState`. The codegen main loop has two bools, `codegen_done` and `codegen_aborted`. There are only three valid combinations: `(false, false)`, `(true, false)`, `(true, true)`. This commit replaces them with a single tri-state enum, which makes things clearer.	2023-06-22 09:07:15 +10:00
Nicholas Nethercote	a521ba400d	Add comments to `Message` and `WorkItem`. This is particularly useful for `Message`.	2023-06-22 08:07:59 +10:00
Nicholas Nethercote	88cd8f9324	Simplify `Message`. `Message` is an enum with multiple variants. Four of those variants map directly onto the four variants of `WorkItemResult`. This commit reduces those four `Message` variants to a single variant containing a `WorkItemResult`. This requires increasing `WorkItemResult`'s visibility to `pub(crate)`, but `WorkItem` and `Message` can also have their visibility reduced to `pub(crate)`. This change avoids some boilerplate enum translation code, and makes `Message` easier to understand.	2023-06-22 08:07:59 +10:00
Nicholas Nethercote	757c290fba	Move `Message::CodegenItem` to a separate type. `Message` is an enum with multiple variants, for messages sent to the coordinator thread. The exception is `Message::CodegenItem`, which is entirely disjoint, being for messages sent from the coordinator thread to the main thread. This commit moves `Message::CodegenItem` into a separate type, `CguMessage`, which makes the code much clearer.	2023-06-22 08:07:59 +10:00
Nilstrieb	904994e101	Rollup merge of #112830 - nnethercote:more-codegen-cleanups, r=oli-obk More codegen cleanups Some additional cleanups I found while looking closely at this code, following up from #112827. r= `@oli-obk`	2023-06-21 07:37:03 +02:00
Nicholas Nethercote	c696307a87	Inline and remove `WorkItem::start_profiling` and `execute_work_item`. They both match on a `WorkItem`. It's simpler to do it all in one place.	2023-06-21 10:56:19 +10:00
Guillaume Gomez	0688182f9b	Rollup merge of #112794 - bjorn3:fix_lib_global_alloc, r=oli-obk Fix linker failures when #[global_allocator] is used in a dependency Fixes https://github.com/rust-lang/rust/issues/112715	2023-06-20 14:23:41 +02:00
Michael Goulet	31d1fbf8d2	Rollup merge of #112232 - fee1-dead-contrib:match-eq-const-msg, r=b-naber Better error for non const `PartialEq` call generated by `match` Resolves #90237	2023-06-19 17:53:33 -07:00
bjorn3	206b951803	Fix linker failures when #[global_allocator] is used in a dependency	2023-06-19 17:31:54 +00:00
Scott McMurray	3fd8501823	Remove unchecked_add/sub/mul/shl/shr from CTFE/cg_ssa/cg_clif	2023-06-19 01:47:03 -07:00
Scott McMurray	39788e07ba	Promote unchecked_add/sub/mul/shl/shr to mir::BinOp	2023-06-19 01:47:03 -07:00
lcnr	46973c9c8a	add FIXMEs for a later refactoring	2023-06-19 09:16:26 +02:00
lcnr	46af169ec5	codegen: fix `OperandRef` subtype handling	2023-06-19 09:06:42 +02:00
Deadbeef	89c24af133	Better error for non const `PartialEq` call generated by `match`	2023-06-18 05:24:38 +00:00
Michael Goulet	3eb8c2ae10	Rollup merge of #112474 - ldm0:ldm_enum_debuginfo_128_support, r=compiler-errors Support 128-bit enum variant in debuginfo codegen fixes #111600	2023-06-16 12:53:22 -07:00
bors	99b334696f	Auto merge of #112597 - danakj:map-linker-paths, r=michaelwoerister Use the relative sysroot path in the linker command line to specify sysroot rlibs This addresses https://github.com/rust-lang/rust/issues/112586	2023-06-16 09:02:50 +00:00
Guillaume Gomez	05d5449522	Rollup merge of #112669 - Nilstrieb:typo, r=jyn514 Fix comment for ptr alignment checks in codegen	2023-06-15 22:04:58 +02:00
Nilstrieb	465e4d9c9c	Fix comment for ptr alignment checks in codegen	2023-06-15 19:27:31 +02:00
danakj	c340325ebf	Remap dylib paths into the sysroot to be relative to the sysroot Like for rlibs, the paths on the linker command line need to be relative paths if the sysroot was specified by the user to be a relative path. Dylibs put the path in /LIBPATH instead of into the file path of the library itself, so we rehome the libpath and adjust the rehoming function to be able to support both use cases, rlibs and dylibs.	2023-06-15 11:13:03 -04:00
danakj	4b83156a5c	Move the comment on fixing paths to where it belongs	2023-06-14 10:58:08 -04:00
danakj	7e07271eaf	Avoid absolute sysroot paths in the MSVC linker command line When the `--sysroot` is specified as relative to the current working directory, the sysroot's rlibs should also be specified as relative paths. Otherwise, the current working directory ends up in the absolute paths, and in the linker command line. And the entire linker command line appears in the PDB file generated by the MSVC linker. When adding an rlib to the linker command line, if the rlib's canonical path is in the sysroot's canonical path, then use the current sysroot path + filename instead of the full absolute path to the rlib. This means that when `--sysroot=foo` is specified, the linker command line will contain `foo/rustlib/target/lib/lib*.rlib` instead of the full absolute path to the same. This addresses https://github.com/rust-lang/rust/issues/112586	2023-06-14 10:44:00 -04:00
Nicholas Nethercote	7c3ce02a11	Introduce a minimum CGU size in non-incremental builds. Because tiny CGUs make compilation less efficient and result in worse generated code. We don't do this when the number of CGUs is explicitly given, because there are times when the requested number is very important, as described in some comments within the commit. So the commit also introduces a `CodegenUnits` type that distinguishes between default values and user-specified values. This change has a roughly neutral effect on walltimes across the rustc-perf benchmarks; there are some speedups and some slowdowns. But it has significant wins for most other metrics on numerous benchmarks, including instruction counts, cycles, binary size, and max-rss. It also reduces parallelism, which is good for reducing jobserver competition when multiple rustc processes are running at the same time. It's smaller benchmarks that benefit the most; larger benchmarks already have CGUs that are all larger than the minimum size. Here are some example before/after CGU sizes for opt builds. - html5ever - CGUs: 16, mean size: 1196.1, sizes: [3908, 2992, 1706, 1652, 1572, 1136, 1045, 948, 946, 938, 579, 471, 443, 327, 286, 189] - CGUs: 4, mean size: 4396.0, sizes: [6706, 3908, 3490, 3480] - libc - CGUs: 12, mean size: 35.3, sizes: [163, 93, 58, 53, 37, 8, 2 (x6)] - CGUs: 1, mean size: 424.0, sizes: [424] - tt-muncher - CGUs: 5, mean size: 1819.4, sizes: [8508, 350, 198, 34, 7] - CGUs: 1, mean size: 9075.0, sizes: [9075] Note that CGUs of size 100,000+ aren't unusual in larger programs.	2023-06-14 10:57:44 +10:00
DonoughLiu	204bfb6a8c	Support 128-bit enum variant in debuginfo codegen	2023-06-10 03:39:24 +08:00
bors	343ad6f059	Auto merge of #111626 - pjhades:output, r=b-naber Write to stdout if `-` is given as output file With this PR, if `-o -` or `--emit KIND=-` is provided, output will be written to stdout instead. Binary output (those of type `obj`, `llvm-bc`, `link` and `metadata`) being written this way will result in an error unless stdout is not a tty. Multiple output types going to stdout will trigger an error too, as they will all be mixed together. This implements https://github.com/rust-lang/compiler-team/issues/431 The idea behind the changes is to introduce an `OutFileName` enum that represents the output - be it a real path or stdout - and to use this enum along the code paths that handle different output types.	2023-06-09 09:45:40 +00:00
bors	e7409258db	Auto merge of #112415 - GuillaumeGomez:rollup-5pa9frd, r=GuillaumeGomez Rollup of 9 pull requests Successful merges: - #112034 (Migrate `item_opaque_ty` to Askama) - #112179 (Avoid passing --cpu-features when empty) - #112309 (bootstrap: remove dependency `is-terminal`) - #112388 (Migrate GUI colors test to original CSS color format) - #112389 (Add a test for #105709) - #112392 (Fix ICE for while loop with assignment condition with LHS place expr) - #112394 (Remove accidental comment) - #112396 (Track more diagnostics in `rustc_expand`) - #112401 (Don't `use compile_error as print`) r? `@ghost` `@rustbot` modify labels: rollup	2023-06-08 10:31:52 +00:00
Guillaume Gomez	ad9d7e3ed5	Rollup merge of #112179 - tamird:no-empty-cpu-features, r=petrochenkov Avoid passing --cpu-features when empty Added in `12ac719b99`, this logic always passed --cpu-features, even when the value was the empty string.	2023-06-08 10:15:09 +02:00
bors	a0df04c0f2	Auto merge of #110040 - ndrewxie:issue-84447-partial-1, r=lcnr,michaelwoerister Removed use of iteration through a HashMap/HashSet in rustc_incremental and replaced with IndexMap/IndexSet This allows for the `#[allow(rustc::potential_query_instability)]` in rustc_incremental to be removed, moving towards fixing #84447 (although a LOT more modules have to be changed to fully resolve it). Only HashMaps/HashSets that are being iterated through have been modified (although many structs and traits outside of rustc_incremental had to be modified as well, as they had fields/methods that involved a HashMap/HashSet that would be iterated through) I'm making a PR for just 1 module changed to test for performance regressions and such, for future changes I'll either edit this PR to reflect additional modules being converted, or batch multiple modules of changes together and make a PR for each group of modules.	2023-06-08 07:30:03 +00:00
bors	f383703e32	Auto merge of #111698 - Amanieu:force-static-lib, r=petrochenkov Force all native libraries to be statically linked when linking a static binary Previously, `#[link]` without an explicit `kind = "static"` would confuse the linker and end up producing a dynamically linked library because of the `-Bdynamic` flag. However this binary would not work correctly anyways since it was linked with startup code for a static binary. This PR solves this by forcing all native libraries to be statically linked when the output is a static binary that cannot link to dynamic libraries anyways. Fixes #108878 Fixes #102993	2023-06-07 22:02:24 +00:00
Amanieu d'Antras	0304e0a5b0	Force all native libraries to be statically linked when linking a static binary	2023-06-07 19:30:37 +01:00
Jan-Mirko Otter	82730b4521	wasm exception handling	2023-06-07 17:48:28 +02:00
Jan-Mirko Otter	35cdb28c84	add comment	2023-06-07 17:46:34 +02:00
Jan-Mirko Otter	82336c1311	wasm target feature: exception handling	2023-06-07 17:46:34 +02:00
Jing Peng	9b1a1e1d95	Write to stdout if `-` is given as output file If `-o -` or `--emit KIND=-` is provided, output will be written to stdout instead. Binary output (`obj`, `llvm-bc`, `link` and `metadata`) being written this way will result in an error unless stdout is not a tty. Multiple output types going to stdout will trigger an error too, as they will all be mixed together.	2023-06-06 17:53:29 -04:00
bors	fd9bf59436	Auto merge of #111999 - scottmcm:codegen-less-memcpy, r=compiler-errors Use `load`+`store` instead of `memcpy` for small integer arrays I was inspired by #98892 to see whether, rather than making `mem::swap` do something smart in the library, we could update MIR assignments like `_1 = _2` to do something smarter than `memcpy` for types small enough that doing it inline is going to be better than a `memcpy` call in assembly anyway. After all, special code may help `mem::swap`, but if the "obvious" MIR can just result in the correct thing, that helps everything -- other code like `mem::replace`, people doing it manually, and just passing around by value in general -- as well as making MIR inlining happier, since it doesn't need to deal with all the complicated library code if it just sees a couple of assignments. LLVM will turn the short, known-length `memcpy`s into direct instructions in the backend, but that's too late for it to be able to remove `alloca`s. In general, replacing `memcpy`s with typed instructions is hard in the middle-end -- even for `memcpy.inline` where it knows it won't be a function call -- [due to poison propagation issues](https://rust-lang.zulipchat.com/#narrow/stream/187780-t-compiler.2Fwg-llvm/topic/memcpy.20vs.20load-store.20for.20MIR.20assignments/near/360376712). So because we know more about the type invariants -- these are typed copies -- rustc can emit something more specific, allowing LLVM to `mem2reg` away the `alloca`s in some situations. #52051 previously did something like this in the library for `mem::swap`, but it ended up regressing when MIR inlining was enabled (`cbbf06b0cd`), so this has been suboptimal on stable for ≈5 releases now. 
The code in this PR is narrowly targeted at just integer arrays in LLVM, but works via a new method on the [`LayoutTypeMethods`](https://doc.rust-lang.org/nightly/nightly-rustc/rustc_codegen_ssa/traits/trait.LayoutTypeMethods.html) trait, so specific backends based on cg_ssa can enable this for more situations over time, as we find them. I don't want to try to bite off too much in this PR, though. (Transparent newtypes and simple things like the 3×usize `String` would be obvious candidates for a follow-up.) Codegen demonstrations: <https://llvm.godbolt.org/z/fK8hT9aqv> Before: ```llvm define void @swap_rgb48_old(ptr noalias nocapture noundef align 2 dereferenceable(6) %x, ptr noalias nocapture noundef align 2 dereferenceable(6) %y) unnamed_addr #1 { %a.i = alloca [3 x i16], align 2 call void @llvm.lifetime.start.p0(i64 6, ptr nonnull %a.i) call void @llvm.memcpy.p0.p0.i64(ptr noundef nonnull align 2 dereferenceable(6) %a.i, ptr noundef nonnull align 2 dereferenceable(6) %x, i64 6, i1 false) tail call void @llvm.memcpy.p0.p0.i64(ptr noundef nonnull align 2 dereferenceable(6) %x, ptr noundef nonnull align 2 dereferenceable(6) %y, i64 6, i1 false) call void @llvm.memcpy.p0.p0.i64(ptr noundef nonnull align 2 dereferenceable(6) %y, ptr noundef nonnull align 2 dereferenceable(6) %a.i, i64 6, i1 false) call void @llvm.lifetime.end.p0(i64 6, ptr nonnull %a.i) ret void } ``` Note it going to stack: ```nasm swap_rgb48_old: # @swap_rgb48_old movzx eax, word ptr [rdi + 4] mov word ptr [rsp - 4], ax mov eax, dword ptr [rdi] mov dword ptr [rsp - 8], eax movzx eax, word ptr [rsi + 4] mov word ptr [rdi + 4], ax mov eax, dword ptr [rsi] mov dword ptr [rdi], eax movzx eax, word ptr [rsp - 4] mov word ptr [rsi + 4], ax mov eax, dword ptr [rsp - 8] mov dword ptr [rsi], eax ret ``` Now: ```llvm define void @swap_rgb48(ptr noalias nocapture noundef align 2 dereferenceable(6) %x, ptr noalias nocapture noundef align 2 dereferenceable(6) %y) unnamed_addr #0 { start: %0 = 
load <3 x i16>, ptr %x, align 2 %1 = load <3 x i16>, ptr %y, align 2 store <3 x i16> %1, ptr %x, align 2 store <3 x i16> %0, ptr %y, align 2 ret void } ``` still lowers to `dword`+`word` operations, but has no stack traffic: ```nasm swap_rgb48: # @swap_rgb48 mov eax, dword ptr [rdi] movzx ecx, word ptr [rdi + 4] movzx edx, word ptr [rsi + 4] mov r8d, dword ptr [rsi] mov dword ptr [rdi], r8d mov word ptr [rdi + 4], dx mov word ptr [rsi + 4], cx mov dword ptr [rsi], eax ret ``` And as a demonstration that this isn't just `mem::swap`, a `mem::replace` on a small array (since replace doesn't use swap since #83022), which used to be `memcpy`s in LLVM changes in IR ```llvm define void @replace_short_array(ptr noalias nocapture noundef sret([3 x i32]) dereferenceable(12) %0, ptr noalias noundef align 4 dereferenceable(12) %r, ptr noalias nocapture noundef readonly dereferenceable(12) %v) unnamed_addr #0 { start: %1 = load <3 x i32>, ptr %r, align 4 store <3 x i32> %1, ptr %0, align 4 %2 = load <3 x i32>, ptr %v, align 4 store <3 x i32> %2, ptr %r, align 4 ret void } ``` but that lowers to reasonable `dword`+`qword` instructions still ```nasm replace_short_array: # @replace_short_array mov rax, rdi mov rcx, qword ptr [rsi] mov edi, dword ptr [rsi + 8] mov dword ptr [rax + 8], edi mov qword ptr [rax], rcx mov rcx, qword ptr [rdx] mov edx, dword ptr [rdx + 8] mov dword ptr [rsi + 8], edx mov qword ptr [rsi], rcx ret ```	2023-06-06 01:50:28 +00:00
bors	7452822843	Auto merge of #107583 - EsmeYi:aix_xcoff_metadata, r=bjorn3 Support the rustc metadata for AIX Support the rustc metadata for rlibs and dylibs on AIX. XCOFF is the object file format on AIX.	2023-06-05 10:42:05 +00:00
Esme Yi	e31661cb9e	Address @bjorn3's comments.	2023-06-05 15:15:09 +08:00
Andrew Xie	17412bae30	Removed use of iteration through a HashMap/HashSet in rustc_incremental and replaced with IndexMap/IndexSet	2023-06-04 21:54:28 -04:00
Scott McMurray	e1b020df9f	Use `load`-`store` instead of `memcpy` for short integer arrays	2023-06-04 00:51:49 -07:00
Michael Goulet	5460f92a0f	Rollup merge of #112168 - scottmcm:lower-div-rem-unchecked-to-mir, r=oli-obk Lower `unchecked_div`/`_rem` to MIR's `BinOp::Div`/`Rem` As described in <https://doc.rust-lang.org/nightly/nightly-rustc/rustc_middle/mir/enum.BinOp.html#variant.Div>, the ordinary `BinOp`s for these are already UB for division by zero ([or overflow](https://llvm.org/docs/LangRef.html#sdiv-instruction), [demo](https://rust.godbolt.org/z/71e7P7Exh)), as MIR building is responsible for inserting code to panic for those cases regardless of whether the overflow checks are enabled. So we can lower these in the same arm that lowers `wrapping_add` to MIR `BinOp::Add` and such, as all these cases turn into ordinary `Rvalue::BinaryOp`s.	2023-06-02 16:02:06 -07:00
Andrew Walbran	7269972f73	Add trustzone and virtualization target features for aarch32. These are LLVM target features which allow the `smc` and `hvc` instructions respectively to be used in inline assembly.	2023-06-02 13:50:18 +01:00
Tamir Duberstein	cd4354578a	Avoid passing --cpu-features when empty Added in `12ac719b99`, this logic always passed --cpu-features, even when the value was the empty string.	2023-06-01 13:07:51 -04:00
Deadbeef	4f83717cf7	Use translatable diagnostics in `rustc_const_eval`	2023-06-01 14:45:18 +00:00
Scott McMurray	dc353f13b8	remove unchecked_div/_rem from cg_ssa	2023-06-01 00:05:55 -07:00
Scott McMurray	bf36193ef6	Add a distinct `OperandValue::ZeroSized` variant for ZSTs These tend to have special handling in a bunch of places anyway, so the variant helps remember that. And I think it's easier to grok than non-Scalar Aggregates sometimes being `Immediates` (like I got wrong and caused 109992). As a minor bonus, it means we don't need to generate poison LLVM values for them to pass around in `OperandValue::Immediate`s.	2023-05-31 19:10:28 -07:00
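
The `CodegenState` commit in the log above (3bbf9f0128) replaces two bools, `codegen_done` and `codegen_aborted`, with a single tri-state enum because only three of the four flag combinations are valid. A minimal sketch of that idea follows. The variant names and the `from_flags` helper are assumptions made for illustration; the commit message names only the enum and the two bools, so this is not necessarily how rustc spells it.

```rust
/// Tri-state replacement for the `(codegen_done, codegen_aborted)` pair.
/// Variant names are hypothetical; the commit message does not list them.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum CodegenState {
    /// Formerly `(false, false)`: codegen is still running.
    Ongoing,
    /// Formerly `(true, false)`: codegen finished normally.
    Completed,
    /// Formerly `(true, true)`: codegen was aborted.
    Aborted,
}

impl CodegenState {
    /// Hypothetical helper mapping the old flag pair onto the enum.
    /// `(false, true)` was never a valid combination, so it maps to `None`.
    fn from_flags(codegen_done: bool, codegen_aborted: bool) -> Option<Self> {
        match (codegen_done, codegen_aborted) {
            (false, false) => Some(Self::Ongoing),
            (true, false) => Some(Self::Completed),
            (true, true) => Some(Self::Aborted),
            (false, true) => None,
        }
    }
}
```

With the enum, the invalid fourth combination cannot be represented at all, and a `match` on the state is checked for exhaustiveness by the compiler, which is presumably what the commit means by making things clearer.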


1614 Commits