nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-04-29 03:27:44 +00:00

Author	SHA1	Message	Date
Matthias Krüger	2f67647606	Rollup merge of #89581 - jblazquez:master, r=Mark-Simulacrum Add -Z no-unique-section-names to reduce ELF header bloat. This change adds a new compiler flag that can help reduce the size of ELF binaries that contain many functions. By default, when enabling function sections (which is the default for most targets), the LLVM backend will generate different section names for each function. For example, a function `func` would generate a section called `.text.func`. Normally this is fine because the linker will merge all those sections into a single one in the binary. However, starting with [LLVM 12](https://github.com/llvm/llvm-project/commit/ee5d1a04), the backend will also generate unique section names for exception handling, resulting in thousands of `.gcc_except_table.*` sections ending up in the final binary because some linkers like LLD don't currently merge or strip these EH sections (see discussion [here](https://reviews.llvm.org/D83655)). This can bloat the ELF headers and string table significantly in binaries that contain many functions. The new option is analogous to Clang's `-fno-unique-section-names`, and instructs LLVM to generate the same `.text` and `.gcc_except_table` section for each function, resulting in a smaller final binary. The motivation to add this new option was because we have a binary that ended up with so many ELF sections (over 65,000) that it broke some existing ELF tools, which couldn't handle so many sections. Here's our old binary: ``` $ readelf --sections old.elf \| head -1 There are 71746 section headers, starting at offset 0x2a246508: $ readelf --sections old.elf \| grep shstrtab [71742] .shstrtab STRTAB 0000000000000000 2977204c ad44bb 00 0 0 1 ``` That's an 11MB+ string table. Here's the new binary using this option: ``` $ readelf --sections new.elf \| head -1 There are 43 section headers, starting at offset 0x29143ca8: $ readelf --sections new.elf \| grep shstrtab [40] .shstrtab STRTAB 0000000000000000 29143acc 0001db 00 0 0 1 ``` The whole binary size went down by over 20MB, which is quite significant.	2021-10-25 22:59:46 +02:00
Javier Blazquez	4ed846ad4d	Add -Z no-unique-section-names to reduce ELF header bloat. This change adds a new compiler flag that can help reduce the size of ELF binaries that contain many functions. By default, when enabling function sections (which is the default for most targets), the LLVM backend will generate different section names for each function. For example, a function "func" would generate a section called ".text.func". Normally this is fine because the linker will merge all those sections into a single one in the binary. However, starting with LLVM 12 (llvm/llvm-project@ee5d1a0), the backend will also generate unique section names for exception handling, resulting in thousands of ".gcc_except_table.*" sections ending up in the final binary because some linkers don't currently merge or strip these EH sections. This can bloat the ELF headers and string table significantly in binaries that contain many functions. The new option is analogous to Clang's -fno-unique-section-names, and instructs LLVM to generate the same ".text" and ".gcc_except_table" section for each function, resulting in smaller object files and potentially a smaller final binary.	2021-10-11 12:09:32 -07:00
Hans Kratz	4593d78e96	Default to disabling the new pass manager for the s390x targets.	2021-10-08 15:05:07 +02:00
Michael Benfield	a17193dbb9	Enable AutoFDO. This largely involves implementing the options debug-info-for-profiling and profile-sample-use and forwarding them on to LLVM. AutoFDO can be used on x86-64 Linux like this: rustc -O -Cdebug-info-for-profiling main.rs -o main perf record -b ./main create_llvm_prof --binary=main --out=code.prof rustc -O -Cprofile-sample-use=code.prof main.rs -o main2 Now `main2` will have feedback directed optimization applied to it. The create_llvm_prof tool can be obtained from this github repository: https://github.com/google/autofdo Fixes #64892.	2021-10-06 19:36:52 +00:00
bors	b27661eb33	Auto merge of #89405 - GuillaumeGomez:fix-clippy-lints, r=cjgillot Fix clippy lints I'm currently working on allowing clippy to run on librustdoc after a discussion I had with `@Mark-Simulacrum.` So in the meantime, I fixed a few lints on the compiler crates.	2021-10-02 10:52:09 +00:00
Manish Goregaokar	1781e4b81a	Rollup merge of #89376 - andjo403:selfProfileUseAfterDropFix, r=Mark-Simulacrum Fix use after drop in self-profile with llvm events self-profile with `-Z self-profile-events=llvm` have failed with a segmentation fault due to this use after drop. this type of events can be more useful now that the new passmanager is the default.	2021-10-01 14:46:49 -07:00
Guillaume Gomez	759eba0a08	Fix clippy lints	2021-10-01 23:17:19 +02:00
Manish Goregaokar	6f1e930581	Rollup merge of #88820 - hlopko:add_pie_relocation_model, r=petrochenkov Add `pie` as another `relocation-model` value MCP: https://github.com/rust-lang/compiler-team/issues/461	2021-10-01 09:18:16 -07:00
Marcel Hlopko	198d90786b	Add `pie` as another `relocation-model` value	2021-10-01 08:06:42 +02:00
Andreas Jonson	d90934ce87	Fix use after drop in self-profile with llvm events	2021-09-29 22:58:33 +02:00
Nikita Popov	be01f42f73	Enable new pass manager on LLVM 13 The new pass manager is enabled by default in clang since Clang/LLVM 13. While the discussion about this is still ongoing (https://lists.llvm.org/pipermail/llvm-dev/2021-August/152305.html) it's expected that support for the legacy pass manager will be dropped either in LLVM 14 or 15. This switches us to use the new pass manager if LLVM >= 13 is used.	2021-09-25 11:24:23 +02:00
Nikita Popov	621f5146c3	Handle SrcMgr diagnostics This is how InlineAsm diagnostics with source information are reported now. Previously a separate InlineAsm diagnostic handler was used.	2021-08-16 18:28:17 +02:00
Camille GILLOT	0bde3b1f80	Use () for codegen queries.	2021-05-12 13:58:46 +02:00
Nikita Popov	c2b15a6b64	Support -C passes in NewPM And report an error if parsing the additional pass pipeline fails. Threading through the error accounts for most of the changes here.	2021-05-08 10:58:08 +02:00
Nikita Popov	db140de8f2	Explicitly register GCOV profiling pass as well	2021-05-08 10:58:08 +02:00
Nikita Popov	5ecbe7fcf8	Explicitly register instrprof pass Don't use "passes" for this purpose, explicitly insert it into the correct place in the pipeline instead.	2021-05-08 10:58:08 +02:00
Nikita Popov	0318883cd6	Make -Z new-llvm-pass-manager an Option<bool> To allow it to have an LLVM version dependent default.	2021-05-08 10:58:08 +02:00
Dylan DPC	e64dbb1f46	Rollup merge of #82483 - tmiasko:option-from-str, r=matthewjasper Use FromStr trait for number option parsing Replace `parse_uint` with generic `parse_number` based on `FromStr`. Use it for parsing inlining threshold to avoid casting later.	2021-04-05 13:03:37 +02:00
Dylan DPC	0d12422f2d	Rollup merge of #80525 - devsnek:wasm64, r=nagisa wasm64 support There is still some upstream llvm work needed before this can land.	2021-04-05 00:24:23 +02:00
Gus Caplan	da66a31572	wasm64	2021-04-04 11:29:34 -05:00
Simonas Kazlauskas	64af7eae1e	Move SanitizerSet to rustc_target	2021-04-03 00:37:49 +03:00
Amanieu d'Antras	cad9b6b695	Apply review feedback	2021-03-30 07:03:41 +01:00
Amanieu d'Antras	26d260bfa4	Run LLVM coverage instrumentation passes before optimization passes This matches the behavior of Clang and allows us to remove several hacks which were needed to ensure functions weren't optimized away before reaching the instrumentation pass.	2021-03-30 02:10:28 +01:00
bors	0c341226ad	Auto merge of #83084 - nagisa:nagisa/features-native, r=petrochenkov Adjust `-Ctarget-cpu=native` handling in cg_llvm When cg_llvm encounters the `-Ctarget-cpu=native` it computes an explciit set of features that applies to the target in order to correctly compile code for the host CPU (because e.g. `skylake` alone is not sufficient to tell if some of the instructions are available or not). However there were a couple of issues with how we did this. Firstly, the order in which features were overriden wasn't quite right – conceptually you'd expect `-Ctarget-cpu=native` option to override the features that are implicitly set by the target definition. However due to how other `-Ctarget-cpu` values are handled we must adopt the following order of priority: * Features from -Ctarget-cpu=; are overriden by Features implied by --target; are overriden by * Features from -Ctarget-feature; are overriden by * function specific features. Another problem was in that the function level `target-features` attribute would overwrite the entire set of the globally enabled features, rather than just the features the `#[target_feature(enable/disable)]` specified. With something like `-Ctarget-cpu=native` we'd end up in a situation wherein a function without `#[target_feature(enable)]` annotation would have a broader set of features compared to a function with one such attribute. This turned out to be a cause of heavy run-time regressions in some code using these function-level attributes in conjunction with `-Ctarget-cpu=native`, for example. With this PR rustc is more careful about specifying the entire set of features for functions that use `#[target_feature(enable/disable)]` or `#[instruction_set]` attributes. Sadly testing the original reproducer for this behaviour is quite impossible – we cannot rely on `-Ctarget-cpu=native` to be anything in particular on developer or CI machines. cc https://github.com/rust-lang/rust/issues/83027 `@BurntSushi`	2021-03-17 05:46:08 +00:00
Simonas Kazlauskas	72fb4379d5	Adjust `-Ctarget-cpu=native` handling in cg_llvm When cg_llvm encounters the `-Ctarget-cpu=native` it computes an explciit set of features that applies to the target in order to correctly compile code for the host CPU (because e.g. `skylake` alone is not sufficient to tell if some of the instructions are available or not). However there were a couple of issues with how we did this. Firstly, the order in which features were overriden wasn't quite right – conceptually you'd expect `-Ctarget-cpu=native` option to override the features that are implicitly set by the target definition. However due to how other `-Ctarget-cpu` values are handled we must adopt the following order of priority: * Features from -Ctarget-cpu=; are overriden by Features implied by --target; are overriden by * Features from -Ctarget-feature; are overriden by * function specific features. Another problem was in that the function level `target-features` attribute would overwrite the entire set of the globally enabled features, rather than just the features the `#[target_feature(enable/disable)]` specified. With something like `-Ctarget-cpu=native` we'd end up in a situation wherein a function without `#[target_feature(enable)]` annotation would have a broader set of features compared to a function with one such attribute. This turned out to be a cause of heavy run-time regressions in some code using these function-level attributes in conjunction with `-Ctarget-cpu=native`, for example. With this PR rustc is more careful about specifying the entire set of features for functions that use `#[target_feature(enable/disable)]` or `#[instruction_set]` attributes. Sadly testing the original reproducer for this behaviour is quite impossible – we cannot rely on `-Ctarget-cpu=native` to be anything in particular on developer or CI machines.	2021-03-16 21:32:55 +02:00
Hiroki Noda	8357e57346	Add support for storing code model to LLVM module IR This patch avoids undefined behavior by linking different object files. Also this would it could be propagated properly to LTO. See https://reviews.llvm.org/D52322 and https://reviews.llvm.org/D52323. This patch is based on https://github.com/rust-lang/rust/pull/74002	2021-03-12 11:02:25 +09:00
Tomasz Miąsko	1ec905766d	Use FromStr trait for number option parsing Replace `parse_uint` with generic `parse_number` based on `FromStr`. Use it for parsing inlining threshold to avoid casting later.	2021-03-09 14:49:04 +01:00
bors	446d4533e8	Auto merge of #82102 - nagisa:nagisa/fix-dwo-name, r=davidtwco Set path of the compile unit to the source directory As part of the effort to implement split dwarf debug info, we ended up setting the compile unit location to the output directory rather than the source directory. Furthermore, it seems like we failed to remap the prefixes for this as well! The desired behaviour is to instead set the `DW_AT_GNU_dwo_name` to a path relative to compiler's working directory. This still allows debuggers to find the split dwarf files, while not changing the behaviour of the code that is compiling with regular debug info, and not changing the compiler's behaviour with regards to reproducibility. Fixes #82074 cc `@alexcrichton` `@davidtwco`	2021-02-23 10:02:16 +00:00
Simonas Kazlauskas	fa3621b468	Don't fail to remove files if they are missing In the backend we may want to remove certain temporary files, but in certain other situations these files might not be produced in the first place. We don't exactly care about that, and the intent is really that these files are gone after a certain point in the backend. Here we unify the backend file removing calls to use `ensure_removed` which will attempt to delete a file, but will not fail if it does not exist (anymore). The tradeoff to this approach is, of course, that we may miss instances were we are attempting to remove files at wrong paths due to some bug – compilation would silently succeed but the temporary files would remain there somewhere.	2021-02-14 18:31:57 +02:00
Simonas Kazlauskas	16c71886c9	Set path of the compile unit to the source directory As part of the effort to implement split dwarf debug info, we ended up setting the compile unit location to the output directory rather than the source directory. Furthermore, it seems like we failed to remap the prefixes for this as well! The desired behaviour is to instead set the `DW_AT_GNU_dwo_name` to a path relative to compiler's working directory. This still allows debuggers to find the split dwarf files, while not changing the behaviour of the code that is compiling with regular debug info, and not changing the compiler's behaviour with regards to reproducibility. Fixes #82074	2021-02-14 17:12:14 +02:00
Tri Vo	c7d9bffe76	HWASan support	2021-02-07 23:48:58 -08:00
Alex Crichton	a124043fb0	rustc: Stabilize `-Zrun-dsymutil` as `-Csplit-debuginfo` This commit adds a new stable codegen option to rustc, `-Csplit-debuginfo`. The old `-Zrun-dsymutil` flag is deleted and now subsumed by this stable flag. Additionally `-Zsplit-dwarf` is also subsumed by this flag but still requires `-Zunstable-options` to actually activate. The `-Csplit-debuginfo` flag takes one of three values: * `off` - This indicates that split-debuginfo from the final artifact is not desired. This is not supported on Windows and is the default on Unix platforms except macOS. On macOS this means that `dsymutil` is not executed. * `packed` - This means that debuginfo is desired in one location separate from the main executable. This is the default on Windows (`.pdb`) and macOS (`.dSYM`). On other Unix platforms this subsumes `-Zsplit-dwarf=single` and produces a `.dwp` file. `unpacked` - This means that debuginfo will be roughly equivalent to object files, meaning that it's throughout the build directory rather than in one location (often the fastest for local development). This is not the default on any platform and is not supported on Windows. Each target can indicate its own default preference for how debuginfo is handled. Almost all platforms default to `off` except for Windows and macOS which default to `packed` for historical reasons. Some equivalencies for previous unstable flags with the new flags are: * `-Zrun-dsymutil=yes` -> `-Csplit-debuginfo=packed` * `-Zrun-dsymutil=no` -> `-Csplit-debuginfo=unpacked` * `-Zsplit-dwarf=single` -> `-Csplit-debuginfo=packed` * `-Zsplit-dwarf=split` -> `-Csplit-debuginfo=unpacked` Note that `-Csplit-debuginfo` still requires `-Zunstable-options` for non-macOS platforms since split-dwarf support was just implemented in rustc. There's some more rationale listed on #79361, but the main gist of the motivation for this commit is that `dsymutil` can take quite a long time to execute in debug builds and provides little benefit. This means that incremental compile times appear that much worse on macOS because the compiler is constantly running `dsymutil` over every single binary it produces during `cargo build` (even build scripts!). Ideally rustc would switch to not running `dsymutil` by default, but that's a problem left to get tackled another day. Closes #79361	2021-01-28 08:51:11 -08:00
LingMan	a56bffb4f9	Use Option::map_or instead of `.map(..).unwrap_or(..)`	2021-01-14 19:23:59 +01:00
Andrew Sun	bf80159050	Make target-cpu=native detect individual features	2021-01-06 03:23:54 -05:00
Matthias Krüger	e5ead5fc58	remove unused return types such as empty Results or Options that would always be Some(..) remove unused return type of dropck::check_drop_obligations() don't wrap return type in Option in get_macro_by_def_id() since we would always return Some(..) remove redundant return type of back::write::optimize() don't Option-wrap return type of compute_type_parameters() since we always return Some(..) don't return empty Result in assemble_generator_candidates() don't return empty Result in assemble_closure_candidates() don't return empty result in assemble_fn_pointer_candidates() don't return empty result in assemble_candidates_from_impls() don't return empty result in assemble_candidates_from_auto_impls() don't return emtpy result in assemble_candidates_for_trait_alias() don't return empty result in assemble_builtin_bound_candidates() don't return empty results in assemble_extension_candidates_for_traits_in_scope() and assemble_extension_candidates_for_trait() remove redundant wrapping of return type of StripItem::strip() since it always returns Some(..) remove unused return type of assemble_extension_candidates_for_all_traits()	2020-12-30 13:15:40 +01:00
David Wood	ee073b5ec5	cg_llvm: split dwarf filename and comp dir llvm-dwp concatenates `DW_AT_comp_dir` with `DW_AT_GNU_dwo_name` (only when `DW_AT_comp_dir` exists), which can result in it failing to find the DWARF object files. In earlier testing, `DW_AT_comp_dir` wasn't present in the final object and the current directory was the output directory. When running tests through compiletest, the working directory of the compilation is different from output directory and that resulted in `DW_AT_comp_dir` being in the object file (and set to the current working directory, rather than the output directory), and `DW_AT_GNU_dwo_name` being set to the full path (rather than just the filename), so llvm-dwp was failing. This commit changes the compilation directory provided to LLVM to match the output directory, where DWARF objects are output; and ensures that only the filename is used for `DW_AT_GNU_dwo_name`. Signed-off-by: David Wood <david@davidtw.co>	2020-12-16 10:33:52 +00:00
David Wood	e3fdae9d81	cg_llvm: implement split dwarf support This commit implements Split DWARF support, wiring up the flag (added in earlier commits) to the modified FFI wrapper (also from earlier commits). Signed-off-by: David Wood <david@davidtw.co>	2020-12-16 10:33:47 +00:00
David Wood	6890312ea3	cg_ssa: introduce `TargetMachineFactoryFn` alias This commit removes the `TargetMachineFactory` struct and adds a `TargetMachineFactoryFn` type alias which is used everywhere that the previous, long type was used. Signed-off-by: David Wood <david@davidtw.co>	2020-12-16 10:33:43 +00:00
David Wood	341aa97adb	llvm: update ffi bindings for split dwarf This commit modifies the FFI bindings to LLVM required for Split DWARF support in rustc. In particular: - `addPassesToEmitFile`'s wrapper, `LLVMRustWriteOutputFile` now takes a `DwoPath` `const char*`. When disabled, `nullptr` should be provided which will preserve existing behaviour. When enabled, the path to the `.dwo` file should be provided. - `createCompileUnit`'s wrapper, `LLVMRustDIBuilderCreateCompileUnit` now has two additional arguments, for the `DWOId` and to enable `SplitDebugInlining`. `DWOId` should always be zero. - `createTargetMachine`'s wrapper, `LLVMRustCreateTargetMachine` has an additional argument which should be provided the path to the `.dwo` when enabled. Signed-off-by: David Wood <david@davidtw.co>	2020-12-16 10:31:42 +00:00
Dario Nieuwenhuis	7b62e09b03	Allow disabling TrapUnreachable via -Ztrap-unreachable=no This is useful for embedded targets where small code size is desired. For example, on my project (thumbv7em-none-eabi) this yields a 0.6% code size reduction.	2020-11-24 01:08:27 +01:00
Dylan DPC	ae7020fcb4	Rollup merge of #78848 - DevJPM:ci-llvm-9, r=nikic Bump minimal supported LLVM version to 9 This bumps the minimal tested llvm version to 9. This should enable supporting newer LLVM features (and CPU extensions). This was motived by #78361 having to drop features because of LLVM 8 not supporting certain CPU extensions yet. This was declared relatively uncontroversial on [Zulip](https://rust-lang.zulipchat.com/#narrow/stream/182449-t-compiler.2Fhelp/topic/Min.20Supported.20LLVM.20Upgrade.20Process.3F/near/215957859). Paging ````@eddyb```` because there was a comment in the [dockerfile](https://github.com/rust-lang/rust/blob/master/src/ci/docker/host-x86_64/x86_64-gnu-llvm-8/Dockerfile#L42) describing a hack (which I don't quite understand) which was also blocked by not having LLVM 9.	2020-11-15 03:02:39 +01:00
Vadim Petrochenkov	04d41e1f40	rustc_target: Mark UEFI targets as `is_like_windows`/`is_like_msvc` Document what `is_like_windows` and `is_like_msvc` mean in more detail.	2020-11-12 19:40:41 +03:00
DevJPM	b51bcc72d9	fully exploited the dropped support of LLVM 8 This commit grepped for LLVM_VERSION_GE, LLVM_VERSION_LT, get_major_version and min-llvm-version and statically evaluated every expression possible (and sensible) assuming that the LLVM version is >=9 now	2020-11-12 14:39:47 +01:00
Vadim Petrochenkov	bf66988aa1	Collapse all uses of `target.options.foo` into `target.foo` with an eye on merging `TargetOptions` into `Target`. `TargetOptions` as a separate structure is mostly an implementation detail of `Target` construction, all its fields logically belong to `Target` and available from `Target` through `Deref` impls.	2020-11-08 17:29:13 +03:00
Anthony Ramine	6febaf2419	Implement -Z relax-elf-relocations=yes\|no This lets rustc users tweak whether the linker should relax ELF relocations, namely whether it should emit R_X86_64_GOTPCRELX relocations instead of R_X86_64_GOTPCREL, as the former is allowed by the ABI to be further optimised. The default value is whatever the target defines.	2020-10-31 17:16:56 +01:00
Anthony Ramine	056942215c	Implement -Z function-sections=yes\|no This lets rustc users tweak whether all functions should be put in their own TEXT section, using whatever default value the target defines if the flag is missing.	2020-10-26 23:26:43 +01:00
Tyler Mandry	6640a62e0e	Revert "Set .llvmbc and .llvmcmd sections as allocatable"	2020-10-23 12:54:00 -07:00
Dylan DPC	55f9676c47	Rollup merge of #77961 - glandium:embed-bitcode, r=nagisa Set .llvmbc and .llvmcmd sections as allocatable This marks both sections as allocatable rather than excluded, which matches what clang does with the equivalent `-fembed-bitcode` flag.	2020-10-17 03:27:20 +02:00
est31	4fa5578774	Replace target.target with target and target.ptr_width with target.pointer_width Preparation for a subsequent change that replaces rustc_target::config::Config with its wrapped Target. On its own, this commit breaks the build. I don't like making build-breaking commits, but in this instance I believe that it makes review easier, as the "real" changes of this PR can be seen much more easily. Result of running: find compiler/ -type f -exec sed -i -e 's/target\.target$[)\.,; ]$/target\1/g' {} \; find compiler/ -type f -exec sed -i -e 's/target\.target$/target/g' {} \; find compiler/ -type f -exec sed -i -e 's/target.ptr_width/target.pointer_width/g' {} \; ./x.py fmt	2020-10-15 12:02:24 +02:00
Mike Hommey	684d142e70	Set .llvmbc and .llvmcmd sections as allocatable	2020-10-15 14:04:57 +09:00

1 2

53 Commits