The default diagnostic handler considers all remarks to be disabled by
default unless configured otherwise through LLVM internal flags:
`-pass-remarks`, `-pass-remarks-missed`, and `-pass-remarks-analysis`.
This behaviour makes `-Cremark` ineffective on its own.
Fix this by configuring a custom diagnostic handler that enables
optimization remarks based on the value of the `-Cremark` option, with
`-Cremark=all` enabling all remarks.
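For illustration, a typical invocation might look like this (with debuginfo enabled so remarks can point at source locations):
```
rustc -O -Cdebuginfo=1 -Cremark=all main.rs
```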
Record more artifact sizes during self-profiling.
This PR adds artifact size recording for
- "linked artifacts" (executables, RLIBs, dylibs, static libs)
- object files
- dwo files
- assembly files
- crate metadata
- LLVM bitcode files
- LLVM IR files
- codegen unit size estimates
Currently the identifiers emitted for these are hard-coded as string literals. Is it worth adding constants to https://github.com/rust-lang/measureme/blob/master/measureme/src/rustc.rs instead? We don't do that for query names and the like -- but artifact kinds might be more stable than query names.
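As a rough sketch of the idea (not the actual self-profiling API, and with a made-up path), recording an artifact size boils down to measuring the emitted file and attaching the number to one of the identifiers above:
```rust
use std::fs;
use std::path::Path;

// Minimal sketch only: measure an emitted artifact on disk so its size can
// be recorded under an identifier such as "object_file" or "crate_metadata".
fn artifact_size_bytes(path: &Path) -> Option<u64> {
    fs::metadata(path).ok().map(|m| m.len())
}

fn main() {
    if let Some(size) = artifact_size_bytes(Path::new("target/debug/example.o")) {
        println!("object_file size: {} bytes", size);
    }
}
```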
The only reason to use `abort_if_errors` is when the program is so broken that either:
1. later passes get confused and ICE
2. any diagnostics from later passes would be noise
This is never the case for lints, because the compiler has to be able to deal with `allow`-ed lints.
So it can continue to lint and compile even if there are lint errors.
Add -Z no-unique-section-names to reduce ELF header bloat.
This change adds a new compiler flag that can help reduce the size of ELF binaries that contain many functions.
By default, when enabling function sections (which is the default for most targets), the LLVM backend will generate different section names for each function. For example, a function `func` would generate a section called `.text.func`. Normally this is fine because the linker will merge all those sections into a single one in the binary. However, starting with [LLVM 12](https://github.com/llvm/llvm-project/commit/ee5d1a04), the backend will also generate unique section names for exception handling, resulting in thousands of `.gcc_except_table.*` sections ending up in the final binary because some linkers like LLD don't currently merge or strip these EH sections (see discussion [here](https://reviews.llvm.org/D83655)). This can bloat the ELF headers and string table significantly in binaries that contain many functions.
The new option is analogous to Clang's `-fno-unique-section-names`, and instructs LLVM to generate the same `.text` and `.gcc_except_table` section for each function, resulting in a smaller final binary.
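As a hedged example, an invocation on a nightly toolchain might look like this:
```
rustc -O -Zno-unique-section-names main.rs
```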
The motivation to add this new option was because we have a binary that ended up with so many ELF sections (over 65,000) that it broke some existing ELF tools, which couldn't handle so many sections.
Here's our old binary:
```
$ readelf --sections old.elf | head -1
There are 71746 section headers, starting at offset 0x2a246508:
$ readelf --sections old.elf | grep shstrtab
[71742] .shstrtab STRTAB 0000000000000000 2977204c ad44bb 00 0 0 1
```
That's an 11MB+ string table. Here's the new binary using this option:
```
$ readelf --sections new.elf | head -1
There are 43 section headers, starting at offset 0x29143ca8:
$ readelf --sections new.elf | grep shstrtab
[40] .shstrtab STRTAB 0000000000000000 29143acc 0001db 00 0 0 1
```
The whole binary size went down by over 20MB, which is quite significant.
Implement `#[link_ordinal(n)]`
Allows the use of `#[link_ordinal(n)]` with `#[link(kind = "raw-dylib")]`, allowing Rust to link against DLLs that export symbols by ordinal rather than by name. As long as the ordinal matches, the name of the function in Rust is not required to match the name of the corresponding function in the exporting DLL.
Part of #58713.
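A minimal sketch of the intended usage, assuming a nightly toolchain with the `raw_dylib` feature gate; the DLL name and ordinal below are made up for illustration:
```rust
#![feature(raw_dylib)]

// Import slot 13 from the DLL by ordinal; the Rust-side name does not need
// to match anything exported by the DLL.
#[link(name = "some_vendor", kind = "raw-dylib")]
extern "C" {
    #[link_ordinal(13)]
    fn do_something();
}

fn main() {
    unsafe { do_something() };
}
```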
This largely involves implementing the options debug-info-for-profiling
and profile-sample-use and forwarding them on to LLVM.
AutoFDO can be used on x86-64 Linux like this:
```
rustc -O -Cdebug-info-for-profiling main.rs -o main
perf record -b ./main
create_llvm_prof --binary=main --out=code.prof
rustc -O -Cprofile-sample-use=code.prof main.rs -o main2
```
Now `main2` will have feedback directed optimization applied to it.
The create_llvm_prof tool can be obtained from this GitHub repository:
https://github.com/google/autofdo
Fixes#64892.
Fix clippy lints
I'm currently working on allowing clippy to run on librustdoc after a discussion I had with `@Mark-Simulacrum`. So in the meantime, I fixed a few lints on the compiler crates.
Fix use after drop in self-profile with llvm events
Self-profiling with `-Z self-profile-events=llvm` has been failing with a segmentation fault due to this use after drop.
These events can be more useful now that the new pass manager is the default.
The new pass manager is enabled by default in clang since
Clang/LLVM 13. While the discussion about this is still ongoing
(https://lists.llvm.org/pipermail/llvm-dev/2021-August/152305.html)
it's expected that support for the legacy pass manager will be
dropped either in LLVM 14 or 15.
This switches us to use the new pass manager if LLVM >= 13 is used.
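For illustration, a build that wants to stay on the legacy pass manager while it is still supported could opt out explicitly (assuming the `-Znew-llvm-pass-manager` flag is available on the toolchain in use):
```
rustc -O -Znew-llvm-pass-manager=no main.rs
```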
This does not yet support `#[link_name]` attributes on functions, the `#[link_ordinal]`
attribute, `#[link(kind = "raw-dylib")]` on extern blocks in bin crates, or
stdcall functions on 32-bit x86.
This doesn't seem to be necessary anymore, although I don't know
at which point or why that changed.
Forcing -O1 makes some tests fail under NewPM, because NewPM also
performs inlining at -O1, so it ends up performing much more
optimization in practice than before.
This commit implements both the native linking modifiers infrastructure
as well as an initial attempt at the individual modifiers from the RFC.
It also introduces a feature flag for the general syntax along with
individual feature flags for each modifier.
Use FromStr trait for number option parsing
Replace `parse_uint` with generic `parse_number` based on `FromStr`.
Use it for parsing the inlining threshold to avoid casting later.
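A minimal sketch of the idea (not the exact rustc signature): one generic parser covers any numeric option whose type implements `FromStr`:
```rust
use std::str::FromStr;

// Parse an optional string into any FromStr number and store it in `slot`;
// return false if the value is missing or malformed.
fn parse_number<T: FromStr>(slot: &mut Option<T>, value: Option<&str>) -> bool {
    match value.and_then(|s| s.parse::<T>().ok()) {
        Some(n) => {
            *slot = Some(n);
            true
        }
        None => false,
    }
}

fn main() {
    let mut inlining_threshold: Option<u32> = None;
    assert!(parse_number(&mut inlining_threshold, Some("275")));
    assert_eq!(inlining_threshold, Some(275));
}
```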
Run LLVM coverage instrumentation passes before optimization passes
This matches the behavior of Clang and allows us to remove several
hacks which were needed to ensure functions weren't optimized away
before reaching the instrumentation pass.
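For example, a coverage build using the then-unstable `-Zinstrument-coverage` flag on a nightly toolchain might look like this:
```
rustc -O -Zinstrument-coverage main.rs
```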
Fixes#83429
cc `@richkadel`
r? `@tmandry`
Adjust `-Ctarget-cpu=native` handling in cg_llvm
When cg_llvm encounters `-Ctarget-cpu=native`, it computes an
explicit set of features that applies to the target in order to
correctly compile code for the host CPU (because e.g. `skylake` alone is
not sufficient to tell whether some of the instructions are available or
not).
However, there were a couple of issues with how we did this. Firstly, the
order in which features were overridden wasn't quite right – conceptually
you'd expect the `-Ctarget-cpu=native` option to override the features that
are implicitly set by the target definition. However, due to how other
`-Ctarget-cpu` values are handled we must adopt the following order
of priority:
* Features from `-Ctarget-cpu=*`; are overridden by
* Features implied by `--target`; are overridden by
* Features from `-Ctarget-feature`; are overridden by
* function-specific features.
Another problem was that the function-level `target-features`
attribute would overwrite the entire set of globally enabled
features, rather than just the features that
`#[target_feature(enable/disable)]` specified. With something like
`-Ctarget-cpu=native` we'd end up in a situation where a function
without a `#[target_feature(enable)]` annotation would have a broader
set of features than a function with one such attribute. This
turned out to be a cause of heavy run-time regressions in some code
using these function-level attributes in conjunction with
`-Ctarget-cpu=native`, for example.
With this PR rustc is more careful about specifying the entire set of
features for functions that use `#[target_feature(enable/disable)]` or
`#[instruction_set]` attributes.
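As an illustration (not the original reproducer), the kind of code affected is a function that opts into a single feature while the rest of the crate is built with `-Ctarget-cpu=native`:
```rust
// Illustrative only: with this fix, `sum_avx2` keeps the host CPU's feature
// set plus the explicitly enabled feature instead of a narrower set.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx2")]
unsafe fn sum_avx2(xs: &[f32]) -> f32 {
    xs.iter().sum()
}

#[cfg(target_arch = "x86_64")]
fn main() {
    if is_x86_feature_detected!("avx2") {
        let total = unsafe { sum_avx2(&[1.0, 2.0, 3.0]) };
        println!("{}", total);
    }
}

#[cfg(not(target_arch = "x86_64"))]
fn main() {}
```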
Sadly, testing the original reproducer for this behaviour is effectively
impossible – we cannot rely on `-Ctarget-cpu=native` meaning anything in
particular on developer or CI machines.
cc https://github.com/rust-lang/rust/issues/83027 `@BurntSushi`
Set path of the compile unit to the source directory
As part of the effort to implement split dwarf debug info, we ended up
setting the compile unit location to the output directory rather than
the source directory. Furthermore, it seems like we failed to remap the
prefixes for this as well!
The desired behaviour is to instead set the `DW_AT_GNU_dwo_name` to a
path relative to the compiler's working directory. This still allows
debuggers to find the split dwarf files, while not changing behaviour
when compiling with regular debug info, and not changing the compiler's
behaviour with regard to reproducibility.
Fixes#82074
cc `@alexcrichton` `@davidtwco`
In the backend we may want to remove certain temporary files, but in
some situations these files might not be produced in the first place.
We don't particularly care about that; the intent is really that
these files are gone after a certain point in the backend.
Here we unify the backend file removing calls to use `ensure_removed`
which will attempt to delete a file, but will not fail if it does not
exist (anymore).
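A minimal sketch of the helper's behaviour (not its exact signature in rustc):
```rust
use std::fs;
use std::io;
use std::path::Path;

// Delete a temporary file if it exists; treat "already gone" as success.
fn ensure_removed(path: &Path) -> io::Result<()> {
    match fs::remove_file(path) {
        Ok(()) => Ok(()),
        Err(e) if e.kind() == io::ErrorKind::NotFound => Ok(()),
        Err(e) => Err(e),
    }
}

fn main() -> io::Result<()> {
    // Removing the same path twice is fine; the second call is a no-op.
    ensure_removed(Path::new("example.tmp.o"))?;
    ensure_removed(Path::new("example.tmp.o"))?;
    Ok(())
}
```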
The tradeoff to this approach is, of course, that we may miss instances
where we are attempting to remove files at the wrong paths due to some bug –
compilation would silently succeed but the temporary files would be left
behind somewhere.
This commit adds a new stable codegen option to rustc,
`-Csplit-debuginfo`. The old `-Zrun-dsymutil` flag is deleted and now
subsumed by this stable flag. Additionally `-Zsplit-dwarf` is also
subsumed by this flag but still requires `-Zunstable-options` to
actually activate. The `-Csplit-debuginfo` flag takes one of
three values:
* `off` - This indicates that split-debuginfo from the final artifact is
not desired. This is not supported on Windows and is the default on
Unix platforms except macOS. On macOS this means that `dsymutil` is
not executed.
* `packed` - This means that debuginfo is desired in one location
separate from the main executable. This is the default on Windows
(`*.pdb`) and macOS (`*.dSYM`). On other Unix platforms this subsumes
`-Zsplit-dwarf=single` and produces a `*.dwp` file.
* `unpacked` - This means that debuginfo will be kept in a form roughly
equivalent to object files, meaning that it's scattered throughout the
build directory rather than collected in one location (often the fastest
for local development). This is not the default on any platform and is
not supported on Windows.
Each target can indicate its own default preference for how debuginfo is
handled. Almost all platforms default to `off` except for Windows and
macOS which default to `packed` for historical reasons.
Some equivalencies for previous unstable flags with the new flags are:
* `-Zrun-dsymutil=yes` -> `-Csplit-debuginfo=packed`
* `-Zrun-dsymutil=no` -> `-Csplit-debuginfo=unpacked`
* `-Zsplit-dwarf=single` -> `-Csplit-debuginfo=packed`
* `-Zsplit-dwarf=split` -> `-Csplit-debuginfo=unpacked`
Note that `-Csplit-debuginfo` still requires `-Zunstable-options` for
non-macOS platforms since split-dwarf support was *just* implemented in
rustc.
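For example, a Linux build that wants a single packed debuginfo file next to the executable would look something like this (hedged; `-Zunstable-options` is needed on non-macOS platforms as noted above):
```
rustc -g -Zunstable-options -Csplit-debuginfo=packed main.rs
```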
There's some more rationale listed on #79361, but the main gist of the
motivation for this commit is that `dsymutil` can take quite a long time
to execute in debug builds and provides little benefit. This means that
incremental compile times appear that much worse on macOS because the
compiler is constantly running `dsymutil` over every single binary it
produces during `cargo build` (even build scripts!). Ideally rustc would
switch to not running `dsymutil` by default, but that's a problem left
to get tackled another day.
Closes#79361