nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-06-05 19:58:32 +00:00

Author	SHA1	Message	Date
Camille GILLOT	c49e2501bf	Make AbsoluteBytePos a u64.	2022-11-08 18:47:26 +00:00
Nilstrieb	36be251a35	Merge `QueryDescription` into `QueryConfig` `QueryDescription` has gone through a lot of refactoring and doesn't make sense anymore.	2022-11-05 16:24:13 +01:00
Michael Woerister	9117ea9758	Introduce UnordMap, UnordSet, and UnordBag (see MCP 533) MCP 533: https://github.com/rust-lang/compiler-team/issues/533 Also, as an example, substitute UnordMap for FxHashMap in used_trait_imports query result.	2022-10-27 13:23:26 +00:00
Patrick Walton	da630ac79d	Introduce deduced parameter attributes, and use them for deducing `readonly` on indirect immutable freeze by-value function parameters. Right now, `rustc` only examines function signatures and the platform ABI when determining the LLVM attributes to apply to parameters. This results in missed optimizations, because there are some attributes that can be determined via analysis of the MIR making up the function body. In particular, `readonly` could be applied to most indirectly-passed by-value function arguments (specifically, those that are freeze and are observed not to be mutated), but it currently is not. This patch introduces the machinery that allows `rustc` to determine those attributes. It consists of a query, `deduced_param_attrs`, that, when evaluated, analyzes the MIR of the function to determine supplementary attributes. The results of this query for each function are written into the crate metadata so that the deduced parameter attributes can be applied to cross-crate functions. In this patch, we simply check the parameter for mutations to determine whether the `readonly` attribute should be applied to parameters that are indirect immutable freeze by-value. More attributes could conceivably be deduced in the future: `nocapture` and `noalias` come to mind. Adding `readonly` to indirect function parameters where applicable enables some potential optimizations in LLVM that are discussed in [issue 103103] and [PR 103070] around avoiding stack-to-stack memory copies that appear in functions like `core::fmt::Write::write_fmt` and `core::panicking::assert_failed`. These functions pass a large structure unchanged by value to a subfunction that also doesn't mutate it. Since the structure in this case is passed as an indirect parameter, it's a pointer from LLVM's perspective. As a result, the intermediate copy of the structure that our codegen emits could be optimized away by LLVM's MemCpyOptimizer if it knew that the pointer is `readonly nocapture noalias` in both the caller and callee. We already pass `nocapture noalias`, but we're missing `readonly`, as we can't determine whether a by-value parameter is mutated by examining the signature in Rust. I didn't have much success with having LLVM infer the `readonly` attribute, even with fat LTO; it seems that deducing it at the MIR level is necessary. No large benefits should be expected from this optimization now; LLVM needs some changes (discussed in [PR 103070]) to more aggressively use the `noalias nocapture readonly` combination in its alias analysis. I have some LLVM patches for these optimizations and have had them looked over. With all the patches applied locally, I enabled LLVM to remove all the `memcpy`s from the following code: ```rust fn main() { println!("Hello {}", 3); } ``` which is a significant codegen improvement over the status quo. I expect that if this optimization kicks in in multiple places even for such a simple program, then it will apply to Rust code all over the place. [issue 103103]: https://github.com/rust-lang/rust/issues/103103 [PR 103070]: https://github.com/rust-lang/rust/pull/103070	2022-10-21 02:33:15 -07:00
Nicholas Nethercote	9110d925d0	Remove `-Ztime` option. The compiler currently has `-Ztime` and `-Ztime-passes`. I've used `-Ztime-passes` for years but only recently learned about `-Ztime`. What's the difference? Let's look at the `-Zhelp` output: ``` -Z time=val -- measure time of rustc processes (default: no) -Z time-passes=val -- measure time of each rustc pass (default: no) ``` The `-Ztime-passes` description is clear, but the `-Ztime` one is less so. Sounds like it measures the time for the entire process? No. The real difference is that `-Ztime-passes` prints out info about passes, and `-Ztime` does the same, but only for a subset of those passes. More specifically, there is a distinction in the profiling code between a "verbose generic activity" and an "extra verbose generic activity". `-Ztime-passes` prints both kinds, while `-Ztime` only prints the first one. (It took me a close reading of the source code to determine this difference.) In practice this distinction has low value. Perhaps in the past the "extra verbose" output was more voluminous, but now that we only print stats for a pass if it exceeds 5ms or alters the RSS, `-Ztime-passes` is less spammy. Also, a lot of the "extra verbose" cases are for individual lint passes, and you need to also use `-Zno-interleave-lints` to see those anyway. Therefore, this commit removes `-Ztime` and the associated machinery. One thing to note is that the existing "extra verbose" activities all have an extra string argument, so the commit adds the ability to accept an extra argument to the "verbose" activities.	2022-10-06 15:49:44 +11:00
Michael Goulet	4cdf264e6f	cache collect_trait_impl_trait_tys	2022-09-14 20:50:52 +00:00
klensy	f6329485a8	rmeta/query cache: don't write string values of preinterned symbols	2022-08-20 15:39:21 +03:00
klensy	adba4691f6	cache strings while encoding/decoding to compiler artifacts	2022-08-15 17:56:37 +03:00
kadmin	e612e2603c	Move abstract const to rustc_middle::ty	2022-07-12 02:21:31 +00:00
Camille GILLOT	43bb31b954	Allow to create definitions inside the query system.	2022-07-06 22:50:55 +02:00
bors	3a8b0144c8	Auto merge of #98106 - cjgillot:split-definitions, r=michaelwoerister Split up `Definitions` and `ResolverAstLowering`. Split off https://github.com/rust-lang/rust/pull/95573 r? `@michaelwoerister`	2022-06-17 10:00:11 +00:00
Nicholas Nethercote	bb02cc47c4	Move `finish` out of the `Encoder` trait. This simplifies things, but requires making `CacheEncoder` non-generic. (This was previously merged as commit 4 in #94732 and then was reverted in #97905 because it caused a perf regression.)	2022-06-16 16:20:32 +10:00
Camille GILLOT	34e4d72929	Separate `source_span` and `expn_that_defined` from `Definitions`.	2022-06-14 22:45:51 +02:00
Nicholas Nethercote	abe45a9ffa	Rename rustc_serialize::opaque::Encoder as MemEncoder. This avoids the name clash with `rustc_serialize::Encoder` (a trait), and allows lots qualifiers to be removed and imports to be simplified (e.g. fewer `as` imports). (This was previously merged as commit 5 in #94732 and then was reverted in #97905 because of a perf regression caused by commit 4 in #94732.)	2022-06-14 14:52:01 +10:00
Nicholas Nethercote	3186e311e5	Revert `dc08bc51f2`.	2022-06-10 11:58:29 +10:00
Nicholas Nethercote	7f51a1b976	Revert `b983e42936`.	2022-06-10 08:35:03 +10:00
Nicholas Nethercote	b983e42936	Rename `rustc_serialize::opaque::Encoder` as `MemEncoder`. This avoids the name clash with `rustc_serialize::Encoder` (a trait), and allows lots qualifiers to be removed and imports to be simplified (e.g. fewer `as` imports).	2022-06-08 09:50:44 +10:00
Nicholas Nethercote	dc08bc51f2	Move `finish` out of the `Encoder` trait. This simplifies things, but requires making `CacheEncoder` non-generic.	2022-06-08 09:21:05 +10:00
Nicholas Nethercote	1acbe7573d	Use delayed error handling for `Encodable` and `Encoder` infallible. There are two impls of the `Encoder` trait: `opaque::Encoder` and `opaque::FileEncoder`. The former encodes into memory and is infallible, the latter writes to file and is fallible. Currently, standard `Result`/`?`/`unwrap` error handling is used, but this is a bit verbose and has non-trivial cost, which is annoying given how rare failures are (especially in the infallible `opaque::Encoder` case). This commit changes how `Encoder` fallibility is handled. All the `emit_*` methods are now infallible. `opaque::Encoder` requires no great changes for this. `opaque::FileEncoder` now implements a delayed error handling strategy. If a failure occurs, it records this via the `res` field, and all subsequent encoding operations are skipped if `res` indicates an error has occurred. Once encoding is complete, the new `finish` method is called, which returns a `Result`. In other words, there is now a single `Result`-producing method instead of many of them. This has very little effect on how any file errors are reported if `opaque::FileEncoder` has any failures. Much of this commit is boring mechanical changes, removing `Result` return values and `?` or `unwrap` from expressions. The more interesting parts are as follows. - serialize.rs: The `Encoder` trait gains an `Ok` associated type. The `into_inner` method is changed into `finish`, which returns `Result<Vec<u8>, !>`. - opaque.rs: The `FileEncoder` adopts the delayed error handling strategy. Its `Ok` type is a `usize`, returning the number of bytes written, replacing previous uses of `FileEncoder::position`. - Various methods that take an encoder now consume it, rather than being passed a mutable reference, e.g. `serialize_query_result_cache`.	2022-06-08 07:01:26 +10:00
bjorn3	7381ea019c	Remove emit_unit It doesn't do anything for all encoders	2022-06-03 17:02:14 +00:00
Nicholas Nethercote	0b81d7cdc6	Lazify `SourceFile::lines`. `SourceFile::lines` is a big part of metadata. It's stored in a compressed form (a difference list) to save disk space. Decoding it is a big fraction of compile time for very small crates/programs. This commit introduces a new type `SourceFileLines` which has a `Lines` form and a `Diffs` form. The latter is used when the metadata is first read, and it is only decoded into the `Lines` form when line data is actually needed. This avoids the decoding cost for many files, especially in `std`. It's a performance win of up to 15% for tiny crates/programs where metadata decoding is a high part of compilation costs. A `Lock` is needed because the methods that access lines data (which can trigger decoding) take `&self` rather than `&mut self`. To allow for this, `SourceFile::lines` now takes a `FnMut` that operates on the lines slice rather than returning the lines slice.	2022-06-01 10:36:39 +10:00
Michael Goulet	4638915940	Make TyCtxt implement Interner, make HashStable generic and move to rustc_type_ir	2022-05-28 12:16:05 -07:00
Michael Goulet	a056a953f0	Initial fixes on top of type interner commit	2022-05-28 11:38:22 -07:00
Wilco Kusee	a7015fe816	Move things to rustc_type_ir	2022-05-28 11:38:22 -07:00
Camille GILLOT	9900ea352b	Cache more queries on disk.	2022-05-13 08:06:48 +02:00
Mark Rousskov	9deed6f74e	Move Sharded maps into each QueryCache impl	2022-02-20 12:10:46 -05:00
Nicholas Nethercote	416399dc10	Make `Decodable` and `Decoder` infallible. `Decoder` has two impls: - opaque: this impl is already partly infallible, i.e. in some places it currently panics on failure (e.g. if the input is too short, or on a bad `Result` discriminant), and in some places it returns an error (e.g. on a bad `Option` discriminant). The number of places where either happens is surprisingly small, just because the binary representation has very little redundancy and a lot of input reading can occur even on malformed data. - json: this impl is fully fallible, but it's only used (a) for the `.rlink` file production, and there's a `FIXME` comment suggesting it should change to a binary format, and (b) in a few tests in non-fundamental ways. Indeed #85993 is open to remove it entirely. And the top-level places in the compiler that call into decoding just abort on error anyway. So the fallibility is providing little value, and getting rid of it leads to some non-trivial performance improvements. Much of this commit is pretty boring and mechanical. Some notes about a few interesting parts: - The commit removes `Decoder::{Error,error}`. - `InternIteratorElement::intern_with`: the impl for `T` now has the same optimization for small counts that the impl for `Result<T, E>` has, because it's now much hotter. - Decodable impls for SmallVec, LinkedList, VecDeque now all use `collect`, which is nice; the one for `Vec` uses unsafe code, because that gave better perf on some benchmarks.	2022-01-22 10:38:31 +11:00
Aaron Hill	70d36a05bc	Show a more informative panic message when `DefPathHash` does not exist This should hopefully make it easier to debug incremental compilation bugs like #93096 without affecting performance.	2022-01-19 17:36:44 -05:00
Aaron Hill	d9220924dc	Import `SourceFile`s from crate before decoding foreign `Span` Fixes #92163 Fixes #92014 When writing to the incremental cache, we encode all `Span`s we encounter, regardless of whether or not their `SourceFile` comes from the local crate, or from a foreign crate. When we decode a `Span`, we use the `StableSourceFileId` we encoded to locate the matching `SourceFile` in the current session. If this id corresponds to a `SourceFile` from another crate, then we need to have already imported that `SourceFile` into our current session. This usually happens automatically during resolution / macro expansion, when we try to resolve definitions from other crates. In certain cases, however, we may try to load a `Span` from a transitive dependency without having ever imported the `SourceFile`s from that crate, leading to an ICE. This PR fixes the issue by calling `imported_source_files()` when we encounter a `SourceFile` with a foreign `CrateNum`. This ensures that all `SourceFile`s from that crate are imported into the current session.	2021-12-23 12:56:12 -05:00
LegionMammal978	77a0c65264	Remove `in_band_lifetimes` from `rustc_query_impl` See #91867 for more information.	2021-12-14 12:13:07 -05:00
est31	15de4cbc4b	Remove redundant [..]s	2021-12-09 00:01:29 +01:00
Camille GILLOT	138e96b719	Do not require QueryCtxt for cache_on_disk.	2021-10-23 18:12:43 +02:00
Camille GILLOT	7c0920f5fb	Build the query vtable directly.	2021-10-23 16:59:19 +02:00
Camille GILLOT	0a5666b838	Do not depend on the stored value when trying to cache on disk.	2021-10-21 20:00:45 +02:00
Camille GILLOT	602d3cbce3	Invoke callbacks from rustc_middle.	2021-10-20 18:29:33 +02:00
Camille GILLOT	e53404cca6	Move def_path_hash_to_def_id to rustc_middle.	2021-10-20 18:28:54 +02:00
Camille GILLOT	daf8903e8e	Do not re-hash foreign spans.	2021-10-06 19:10:07 +02:00
Camille GILLOT	ce21756ed3	Access Session while decoding expn_id.	2021-10-06 19:06:20 +02:00
Michael Woerister	66cf8ea1af	Replace cnum_map with tcx.stable_crate_id_to_crate_num() in OnDiskCache.	2021-09-14 13:56:33 +02:00
Michael Woerister	021c0520e3	Fix up comment about OnDiskCache::foreign_expn_data.	2021-09-14 13:56:33 +02:00
Michael Woerister	2b60338ee9	Make DefPathHash->DefId panic for if the mapping fails. We only use this mapping for cases where we know that it must succeed. Letting it panic otherwise makes it harder to use the API in unsupported ways.	2021-09-14 13:56:33 +02:00
Michael Woerister	5445715c20	Remove RawDefId tracking infrastructure from incr. comp. framework. This infrastructure is obsolete now with the new encoding scheme for the DefPathHash->DefIndex maps in crate metadata.	2021-09-14 13:56:33 +02:00
Michael Woerister	960893c50a	Store DefPathHash->DefIndex map in on-disk-hash-table format in crate metadata. This encoding allows for random access without an expensive upfront decoding state which in turn allows simplifying the DefPathIndex lookup logic without regressing performance.	2021-09-14 13:56:33 +02:00
Manish Goregaokar	f5ac5cadd3	Rollup merge of #88709 - BoxyUwU:thir-abstract-const, r=lcnr generic_const_exprs: use thir for abstract consts instead of mir Changes `AbstractConst` building to use `thir` instead of `mir` so that there's less chance of consts unifying when they shouldn't because lowering to mir dropped information (see `abstract-consts-as-cast-5.rs` test) r? `@lcnr`	2021-09-12 03:44:56 -07:00
Camille GILLOT	940fa9251e	Rename decode to data_untracked.	2021-09-10 20:18:22 +02:00
Camille GILLOT	b19ae20aad	Track span dependency using a callback.	2021-09-10 20:18:18 +02:00
Camille GILLOT	e85ddeb474	Encode spans relative to their parent.	2021-09-10 20:18:11 +02:00
Camille GILLOT	00485e0c0e	Keep a parent LocalDefId in SpanData.	2021-09-10 20:17:33 +02:00
Ellen	406d2ab95d	rename mir -> thir around abstract consts	2021-09-09 01:32:03 +01:00
Camille GILLOT	bcefd487c3	Comment drop_serialized_data.	2021-08-28 21:49:51 +02:00

1 2

54 Commits