nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-04-29 03:27:44 +00:00

Author	SHA1	Message	Date
Mark Rousskov	6f72f13436	Remove allocations from case-insensitive comparison to keywords	2025-01-11 12:39:44 -05:00
Nicholas Nethercote	0f7dccf784	Fix `Parser` size assertion on s390x. For some reason the memory layout is different on s390x.	2024-12-19 20:06:44 +11:00
Nicholas Nethercote	b9bf0b4b10	Speed up `Parser::expected_token_types`. The parser pushes a `TokenType` to `Parser::expected_token_types` on every call to the various `check`/`eat` methods, and clears it on every call to `bump`. Some of those `TokenType` values are full tokens that require cloning and dropping. This is a lot of work for something that is only used in error messages and it accounts for a significant fraction of parsing execution time. This commit overhauls `TokenType` so that `Parser::expected_token_types` can be implemented as a bitset. This requires changing `TokenType` to a C-style parameterless enum, and adding `TokenTypeSet` which uses a `u128` for the bits. (The new `TokenType` has 105 variants.) The new types `ExpTokenPair` and `ExpKeywordPair` are now arguments to the `check`/`eat` methods. This is for maximum speed. The elements in the pairs are always statically known; e.g. a `token::BinOp(token::Star)` is always paired with a `TokenType::Star`. So we now compute `TokenType`s in advance and pass them in to `check`/`eat` rather than the current approach of constructing them on insertion into `expected_token_types`. Values of these pair types can be produced by the new `exp!` macro, which is used at every `check`/`eat` call site. The macro is for convenience, allowing any pair to be generated from a single identifier. The ident/keyword filtering in `expected_one_of_not_found` is no longer necessary. It was there to account for some sloppiness in `TokenKind`/`TokenType` comparisons. The existing `TokenType` is moved to a new file `token_type.rs`, and all its new infrastructure is added to that file. There is more boilerplate code than I would like, but I can't see how to make it shorter.	2024-12-19 16:05:41 +11:00
Nicholas Nethercote	d5370d981f	Remove `bra`/`ket` naming. This is a naming convention used in a handful of spots in the parser for delimiters. It confused me when I first saw it a long time ago, and I've never liked it. A web search says "Bra-ket notation" exists in linear algebra but the terminology has zero prior use in a programming context, as far as I can tell. This commit changes it to `open`/`close`, which is consistent with the rest of the compiler.	2024-12-19 16:05:41 +11:00
Nicholas Nethercote	fb5ba8a6d4	Tweak some parser `check`/`eat` methods. The most significant is `check_keyword`: it now only pushes to `expected_token_types` if the keyword check fails, which matches how all the other `check` methods work. The remainder are just tweaks to make these methods more consistent with each other.	2024-12-19 16:05:41 +11:00
Nicholas Nethercote	48f7714819	Rename `Parser::expected_tokens` as `Parser::expected_token_types`. Because the `Token` type is similar to but different to the `TokenType` type, and the difference is important, so we want to avoid confusion.	2024-12-19 16:05:41 +11:00
许杰友 Jieyou Xu (Joe)	477f222b02	Rollup merge of #134161 - nnethercote:overhaul-token-cursors, r=spastorino Overhaul token cursors Some nice cleanups here. r? `````@davidtwco`````	2024-12-18 22:56:53 +08:00
Nicholas Nethercote	2620eb42d7	Re-export more `rustc_span::symbol` things from `rustc_span`. `rustc_span::symbol` defines some things that are re-exported from `rustc_span`, such as `Symbol` and `sym`. But it doesn't re-export some closely related things such as `Ident` and `kw`. So you can do `use rustc_span::{Symbol, sym}` but you have to do `use rustc_span::symbol::{Ident, kw}`, which is inconsistent for no good reason. This commit re-exports `Ident`, `kw`, and `MacroRulesNormalizedIdent`, and changes many `rustc_span::symbol::` qualifiers in `compiler/` to `rustc_span::`. This is a 200+ net line of code reduction, mostly because many files with two `use rustc_span` items can be reduced to one.	2024-12-18 13:38:53 +11:00
Nicholas Nethercote	2903356b2e	Overhaul `TokenTreeCursor`. - Move it to `rustc_parse`, which is the only crate that uses it. This lets us remove all the `pub` markers from it. - Change `next_ref` and `look_ahead` to `get` and `bump`, which work better for the `rustc_parse` uses. - This requires adding a `TokenStream::get` method, which is simple. - In `TokenCursor`, we currently duplicate the `DelimSpan`/`DelimSpacing`/`Delimiter` from the surrounding `TokenTree::Delimited` in the stack. This isn't necessary so long as we don't prematurely move past the `Delimited`, and is a small perf win on a very hot code path. - In `parse_token_tree`, we clone the relevant `TokenTree::Delimited` instead of constructing an identical one from pieces.	2024-12-18 12:50:22 +11:00
Jonathan Dönszelmann	d50c0a5480	Add hir::Attribute	2024-12-15 19:18:46 +01:00
Oli Scherer	53b2c7cc95	Rename `value` field to `expr` to simplify later commits' diffs	2024-12-15 18:47:45 +01:00
Oli Scherer	778321d155	Change `AttrArgs::Eq` into a struct variant	2024-12-02 10:28:58 +00:00
Nicholas Nethercote	cfafa9380b	Add metavariables to `TokenDescription`. Pasted metavariables are wrapped in invisible delimiters, which pretty-print as empty strings, and changing that can break some proc macros. But error messages saying "expected identifer, found ``" are bad. So this commit adds support for metavariables in `TokenDescription` so they print as "metavariable" in error messages, instead of "``". It's not used meaningfully yet, but will be needed to get rid of interpolated tokens.	2024-11-21 08:16:55 +11:00
Nicholas Nethercote	afe238f66f	Introduce `InvisibleOrigin` on invisible delimiters. It's not used meaningfully yet, but will be needed to get rid of interpolated tokens.	2024-11-21 08:16:54 +11:00
Nicholas Nethercote	99d02fb40f	Optimize `check_keyword_case`. `to_lowercase` allocates, but `eq_ignore_ascii_case` doesn't. This path is hot enough that this makes a small but noticeable difference in benchmarking.	2024-11-13 08:43:47 +11:00
Nicholas Nethercote	a201fab208	Tweak `expand_incomplete_parse` warning. By using `token_descr`, as is done for many other errors, we can get slightly better descriptions in error messages, e.g. "macro expansion ignores token `let` and any following" becomes "macro expansion ignores keyword `let` and any tokens following". This will be more important once invisible delimiters start being mentioned in error messages -- without this commit, that leads to error messages such as "error at ``" because invisible delimiters are pretty printed as an empty string.	2024-10-28 14:12:45 +11:00
Jubilee	515bdcda01	Rollup merge of #130551 - nnethercote:fix-break-last-token, r=petrochenkov Fix `break_last_token`. It currently doesn't handle the three-char tokens `>>=` and `<<=` correctly. These can be broken twice, resulting in three individual tokens. This is a latent bug that currently doesn't cause any problems, but does cause problems for #124141, because that PR increases the usage of lazy token streams. r? `@petrochenkov`	2024-09-23 07:54:44 -07:00
Nicholas Nethercote	73cc575177	Fix `break_last_token`. It currently doesn't handle the three-char tokens `>>=` and `<<=` correctly. These can be broken twice, resulting in three individual tokens. This is a latent bug that currently doesn't cause any problems, but does cause problems for #124141, because that PR increases the usage of lazy token streams.	2024-09-23 09:14:30 +10:00
Michael Goulet	c682aa162b	Reformat using the new identifier sorting from rustfmt	2024-09-22 19:11:29 -04:00
Pavel Grigorenko	e90e2593ea	Parser: recover from `:::` to `::`	2024-09-21 20:07:52 +03:00
Michael Goulet	af8d911d63	Also fix if in else	2024-09-11 17:24:01 -04:00
bors	6d05f12170	Auto merge of #129346 - nnethercote:fix-double-handling-in-collect_tokens, r=petrochenkov Fix double handling in `collect_tokens` Double handling of AST nodes can occur in `collect_tokens`. This is when an inner call to `collect_tokens` produces an AST node, and then an outer call to `collect_tokens` produces the same AST node. This can happen in a few places, e.g. expression statements where the statement delegates `HasTokens` and `HasAttrs` to the expression. It will also happen more after #124141. This PR fixes some double handling cases that cause problems, including #129166. r? `@petrochenkov`	2024-09-08 05:35:23 +00:00
Michael Goulet	97910580aa	Add initial support for raw lifetimes	2024-09-06 10:32:48 -04:00
Nicholas Nethercote	1fdabfbebb	Avoid double-handling of attributes in `collect_tokens`. By keeping track of attributes that have been previously processed. This fixes the `macro-rules-derive-cfg.stdout` test, and is necessary for #124141 which removes nonterminals. Also shrink the `SmallVec` inline size used in `IntervalSet`. 2 gives slightly better perf than 4 now that there's an `IntervalSet` in `Parser`, which is cloned reasonably often.	2024-08-24 06:57:47 +10:00
Nicholas Nethercote	39b38a94e3	Split the assertion in `NodeRange::new`.	2024-08-23 14:40:08 +10:00
Nicholas Nethercote	9d31f86f0d	Overhaul token collection. This commit does the following. - Renames `collect_tokens_trailing_token` as `collect_tokens`, because (a) it's annoying long, and (b) the `_trailing_token` bit is less accurate now that its types have changed. - In `collect_tokens`, adds a `Option<CollectPos>` argument and a `UsePreAttrPos` in the return type of `f`. These are used in `parse_expr_force_collect` (for vanilla expressions) and in `parse_stmt_without_recovery` (for two different cases of expression statements). Together these ensure are enough to fix all the problems with token collection and assoc expressions. The changes to the `stringify.rs` test demonstrate some of these. - Adds a new test. The code in this test was causing an assertion failure prior to this commit, due to an invalid `NodeRange`. The extra complexity is annoying, but necessary to fix the existing problems.	2024-08-16 09:07:55 +10:00
Nicholas Nethercote	5aaa2f92ee	Add an assertion to `NodeRange::new`.	2024-08-16 09:07:31 +10:00
Nicholas Nethercote	c8098be41f	Convert a bool to `Trailing`. This pre-existing type is suitable for use with the return value of the `f` parameter in `collect_tokens_trailing_token`. The more descriptive name will be useful because the next commit will add another boolean value to the return value of `f`.	2024-08-16 09:07:29 +10:00
Nicholas Nethercote	7923b20dd9	Use `impl PartialEq<TokenKind> for Token` more. This lets us compare a `Token` with a `TokenKind`. It's used a lot, but can be used even more, avoiding the need for some `.kind` uses.	2024-08-14 16:37:09 +10:00
Guillaume Gomez	99a785d62d	Rollup merge of #128994 - nnethercote:fix-Parser-look_ahead-more, r=compiler-errors Fix bug in `Parser::look_ahead`. The special case was failing to handle invisible delimiters on one path. Fixes (but doesn't close until beta backported) #128895. r? `@davidtwco`	2024-08-12 17:09:20 +02:00
Nicholas Nethercote	46b4c5adc5	Fix bug in `Parser::look_ahead`. The special case was failing to handle invisible delimiters on one path. Fixes #128895.	2024-08-12 13:00:12 +10:00
Michael Goulet	c361c924a0	Use assert_matches around the compiler	2024-08-11 12:25:39 -04:00
Matthias Krüger	7d9ed2a864	Rollup merge of #127921 - spastorino:stabilize-unsafe-extern-blocks, r=compiler-errors Stabilize unsafe extern blocks (RFC 3484) # Stabilization report ## Summary This is a tracking issue for the RFC 3484: Unsafe Extern Blocks We are stabilizing `#![feature(unsafe_extern_blocks)]`, as described in [Unsafe Extern Blocks RFC 3484](https://github.com/rust-lang/rfcs/pull/3484). This feature makes explicit that declaring an extern block is unsafe. Starting in Rust 2024, all extern blocks must be marked as unsafe. In all editions, items within unsafe extern blocks may be marked as safe to use. RFC: https://github.com/rust-lang/rfcs/pull/3484 Tracking issue: #123743 ## What is stabilized ### Summary of stabilization We now need extern blocks to be marked as unsafe and items inside can also have safety modifiers (unsafe or safe), by default items with no modifiers are unsafe to offer easy migration without surprising results. ```rust unsafe extern { // sqrt (from libm) may be called with any `f64` pub safe fn sqrt(x: f64) -> f64; // strlen (from libc) requires a valid pointer, // so we mark it as being an unsafe fn pub unsafe fn strlen(p: const c_char) -> usize; // this function doesn't say safe or unsafe, so it defaults to unsafe pub fn free(p: mut core::ffi::c_void); pub safe static IMPORTANT_BYTES: [u8; 256]; pub safe static LINES: SyncUnsafeCell<i32>; } ``` ## Tests The relevant tests are in `tests/ui/rust-2024/unsafe-extern-blocks`. ## History - https://github.com/rust-lang/rust/pull/124482 - https://github.com/rust-lang/rust/pull/124455 - https://github.com/rust-lang/rust/pull/125077 - https://github.com/rust-lang/rust/pull/125522 - https://github.com/rust-lang/rust/issues/126738 - https://github.com/rust-lang/rust/issues/126749 - https://github.com/rust-lang/rust/issues/126755 - https://github.com/rust-lang/rust/pull/126757 - https://github.com/rust-lang/rust/pull/126758 - https://github.com/rust-lang/rust/issues/126756 - https://github.com/rust-lang/rust/pull/126973 - https://github.com/rust-lang/rust/pull/127535 - https://github.com/rust-lang/rustfmt/pull/6204 ## Unresolved questions I am not aware of any unresolved questions.	2024-08-03 20:51:51 +02:00
Matthias Krüger	dee57ce043	Rollup merge of #128483 - nnethercote:still-more-cfg-cleanups, r=petrochenkov Still more `cfg` cleanups Found while looking closely at `cfg`/`cfg_attr` processing code. r? `````````@petrochenkov`````````	2024-08-03 11:17:44 +02:00
Nicholas Nethercote	d1f05fd184	Distinguish the two kinds of token range. When collecting tokens there are two kinds of range: - a range relative to the parser's full token stream (which we get when we are parsing); - a range relative to a single AST node's token stream (which we use within `LazyAttrTokenStreamImpl` when replacing tokens). These are currently both represented with `Range<u32>` and it's easy to mix them up -- until now I hadn't properly understood the difference. This commit introduces `ParserRange` and `NodeRange` to distinguish them. This also requires splitting `ReplaceRange` in two, giving the new types `ParserReplacement` and `NodeReplacement`. (These latter two names reduce the overloading of the word "range".) The commit also rewrites some comments to be clearer. The end result is a little more verbose, but much clearer.	2024-08-01 19:30:40 +10:00
Michael Goulet	e4076e34f8	Mark Parser::eat/check methods as must_use	2024-07-29 21:29:08 -04:00
Nicholas Nethercote	84ac80f192	Reformat `use` declarations. The previous commit updated `rustfmt.toml` appropriately. This commit is the outcome of running `x fmt --all` with the new formatting options.	2024-07-29 08:26:52 +10:00
Folkert	d3858f7465	improve error message when `global_asm!` uses `asm!` options	2024-07-25 22:33:52 +02:00
Santiago Pastorino	8366c7fe9c	Stabilize unsafe extern blocks (RFC 3484)	2024-07-23 00:29:39 -03:00
bors	3811f40d27	Auto merge of #127957 - matthiaskrgr:rollup-1u5ivck, r=matthiaskrgr Rollup of 6 pull requests Successful merges: - #127350 (Parser: Suggest Placing the Return Type After Function Parameters) - #127621 (Rewrite and rename `issue-22131` and `issue-26006` `run-make` tests to rmake) - #127662 (When finding item gated behind a `cfg` flag, point at it) - #127903 (`force_collect` improvements) - #127932 (rustdoc: fix `current` class on sidebar modnav) - #127943 (Don't allow unsafe statics outside of extern blocks) r? `@ghost` `@rustbot` modify labels: rollup	2024-07-19 13:39:12 +00:00
Matthias Krüger	9ada89d9a1	Rollup merge of #127903 - nnethercote:force_collect-improvements, r=petrochenkov `force_collect` improvements Yet more cleanups relating to `cfg_attr` processing. r? ````@petrochenkov````	2024-07-19 10:48:05 +02:00
Matthias Krüger	c86e13f330	Rollup merge of #127350 - veera-sivarajan:bugfix-126311, r=lcnr Parser: Suggest Placing the Return Type After Function Parameters Fixes #126311 This PR suggests placing the return type after the function parameters when it's misplaced after a `where` clause. This also tangentially improves diagnostics for cases like [this](`86d6f1312a/tests/ui/parser/issues/misplaced-return-type-without-where-issue-126311.rs (L1C1-L1C28)`) and adds doc comments for `parser::AllowPlus`.	2024-07-19 10:48:03 +02:00
Nicholas Nethercote	1dd566a6d0	Overhaul comments in `collect_tokens_trailing_token`. Adding details, clarifying lots of little things, etc. In particular, the commit adds details of an example. I find this very helpful, because it's taken me a long time to understand how this code works.	2024-07-19 15:25:55 +10:00
Nicholas Nethercote	ca6649516f	Make `Parser::num_bump_calls` 0-indexed. Currently in `collect_tokens_trailing_token`, `start_pos` and `end_pos` are 1-indexed by `replace_ranges` is 0-indexed, which is really confusing. Making them both 0-indexed makes debugging much easier.	2024-07-19 15:25:55 +10:00
Nicholas Nethercote	757f73f506	Simplify `CaptureState::inner_attr_ranges`. The `Option`s within the `ReplaceRange`s within the hashmap are always `None`. This PR omits them and inserts them when they are extracted from the hashmap.	2024-07-19 15:25:54 +10:00
Nicholas Nethercote	e69ff1c106	Remove an unnecessary `ForceCollect::Yes`. No need to collect tokens on this recovery path, because the parsed statement isn't even looked at.	2024-07-19 08:20:57 +10:00
Veera	4cad705017	Parser: Suggest Placing the Return Type After Function Parameters	2024-07-18 17:56:34 -04:00
Nicholas Nethercote	487802d6c8	Remove `TrailingToken`. It's used in `Parser::collect_tokens_trailing_token` to decide whether to capture a trailing token. But the callers actually know whether to capture a trailing token, so it's simpler for them to just pass in a bool. Also, the `TrailingToken::Gt` case was weird, because it didn't result in a trailing token being captured. It could have been subsumed by the `TrailingToken::MaybeComma` case, and it effectively is in the new code.	2024-07-18 17:28:49 +10:00
Nicholas Nethercote	9c4f3dbd06	Remove references to `maybe_whole_expr`. It was removed in #126571.	2024-07-16 16:40:35 +10:00
Matthias Krüger	febe4423c1	Rollup merge of #127273 - nnethercote:fix-DebugParser, r=workingjubilee Fix `DebugParser`. I tried using this and it didn't work at all. `prev_token` is never eof, so the accumulator is always false, which means the `then_some` always returns `None`, which means `scan` always returns `None`, and `tokens` always ends up an empty vec. I'm not sure how this code was supposed to work. (An aside: I find `Iterator::scan` to be a pretty wretched function, that produces code which is very hard to understand. Probably why this is just one of two uses of it in the entire compiler.) This commit changes it to a simpler imperative style that produces a valid `tokens` vec. r? `@workingjubilee`	2024-07-14 20:24:58 +02:00

1 2 3 4 5 ...

330 Commits