nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2024-11-22 23:04:33 +00:00

Author	SHA1	Message	Date
Veera	14e86eb7d9	Add Suggestions for Misspelled Keywords This PR detects misspelled keywords using two heuristics: 1. Lowercasing the unexpected identifier. 2. Using edit distance to find a keyword similar to the unexpected identifier. However, it does not detect each and every misspelled keyword to minimize false positives and ambiguities. More details about the implementation can be found in the comments.	2024-09-06 23:07:45 -04:00
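The two heuristics named in this commit can be sketched roughly as follows. This is not the code from the PR: the `KEYWORDS` subset, the `suggest_keyword` helper, the distance threshold of 1, and the rule of giving up when more than one keyword is equally close are all assumptions made for illustration.

```rust
/// A small, illustrative subset of Rust keywords (not the compiler's full list).
const KEYWORDS: &[&str] = &["fn", "let", "struct", "enum", "impl", "match", "while", "return"];

/// Classic Levenshtein edit distance, counted in `char`s.
fn edit_distance(a: &str, b: &str) -> usize {
    let a: Vec<char> = a.chars().collect();
    let b: Vec<char> = b.chars().collect();
    // `prev[j]` is the distance between the processed prefix of `a` and `b[..j]`.
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    for (i, &ca) in a.iter().enumerate() {
        let mut cur = vec![i + 1];
        for (j, &cb) in b.iter().enumerate() {
            let substitution = prev[j] + usize::from(ca != cb);
            let deletion = prev[j + 1] + 1;
            let insertion = cur[j] + 1;
            cur.push(substitution.min(deletion).min(insertion));
        }
        prev = cur;
    }
    prev[b.len()]
}

/// Hypothetical helper: suggest a keyword for an unexpected identifier.
fn suggest_keyword(ident: &str) -> Option<&'static str> {
    // Heuristic 1: the identifier is a keyword once lowercased (e.g. `Struct`).
    let lowered = ident.to_lowercase();
    if let Some(&kw) = KEYWORDS.iter().find(|&&kw| kw == lowered) {
        return Some(kw);
    }
    // Heuristic 2: exactly one keyword at edit distance 1.
    // Ambiguous matches yield no suggestion, to limit false positives.
    let mut close = KEYWORDS.iter().copied().filter(|kw| edit_distance(ident, kw) == 1);
    match (close.next(), close.next()) {
        (Some(kw), None) => Some(kw),
        _ => None,
    }
}
```

With these assumptions, `suggest_keyword("Struct")` and `suggest_keyword("stuct")` both suggest `struct`, while an unrelated identifier such as `foo` gets no suggestion.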
Michael Goulet	afa24f0180	Add some more tests	2024-09-06 10:32:48 -04:00
Michael Goulet	97910580aa	Add initial support for raw lifetimes	2024-09-06 10:32:48 -04:00
Michael Goulet	3b3e43a386	Format lexer	2024-09-06 10:32:48 -04:00
Michael Goulet	9aaf873396	Reserve prefix lifetimes too	2024-09-06 10:32:48 -04:00
Alexander Cyon	00de006f22	chore: Fix typos in 'compiler' (batch 2)	2024-09-02 07:50:22 +02:00
Matthias Krüger	1fd0c71818	Rollup merge of #120221 - compiler-errors:statements-are-not-patterns, r=nnethercote Don't make statement nonterminals match pattern nonterminals Right now, the heuristic we use to check if a token may begin a pattern nonterminal falls back to `may_be_ident`: `ef71f1047e/compiler/rustc_parse/src/parser/nonterminal.rs (L21-L37)` This has the unfortunate side effect that a `stmt` nonterminal eagerly matches against a `pat` nonterminal, leading to a parse error: ```rust macro_rules! m { ($pat:pat) => {}; ($stmt:stmt) => {}; } macro_rules! m2 { ($stmt:stmt) => { m! { $stmt } }; } m2! { let x = 1 } ``` This PR fixes it by more accurately reflecting the set of nonterminals that may begin a pattern nonterminal. As a side-effect, I modified `Token::can_begin_pattern` to work correctly and used that in `Parser::nonterminal_may_begin_with`.	2024-08-31 10:08:51 +02:00
Nicholas Nethercote	cac04a1cb9	Add `warn(unreachable_pub)` to `rustc_parser`.	2024-08-29 20:13:06 +10:00
Matthias Krüger	472c9645fb	Rollup merge of #129667 - dev-ardi:rustc_driver-cleanup, r=michaelwoerister Rustc driver cleanup This adds a few comments to the driver to clarify a bit what's happening and does some cleanup.	2024-08-28 17:12:19 +02:00
Orion Gonzalez	c35e01e48e	clarify what term can be	2024-08-28 13:11:02 +02:00
Michael Goulet	c61f85b6dd	Don't make pattern nonterminals match statement nonterminals	2024-08-26 18:30:15 -04:00
Trevor Gross	dfe7d5c31e	Rollup merge of #128524 - chenyukang:yukang-fix-127930-invalid-outer-style-sugg, r=cjgillot Don't suggest turning crate-level attributes into outer style Fixes #127930	2024-08-24 21:03:31 -05:00
Nicholas Nethercote	d4bf28c014	Optimize `collect_tokens` a little. Use `Cow` to avoid cloning `ret.attrs()` unless necessary. This requires moving some things around to satisfy the borrow checker.	2024-08-24 06:58:35 +10:00
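The general idea of using `Cow` to clone only when a change is needed can be sketched as below. The `strip_cfg_attr` function and the use of `String` as a stand-in for attributes are hypothetical and do not mirror the real `collect_tokens` code.

```rust
use std::borrow::Cow;

/// Hypothetical stand-in: drop `cfg_attr` entries from an attribute list,
/// borrowing the input unchanged when there is nothing to drop.
fn strip_cfg_attr<'a>(attrs: &'a [String]) -> Cow<'a, [String]> {
    if attrs.iter().any(|a| a.starts_with("cfg_attr")) {
        // Only this path allocates and clones.
        Cow::Owned(attrs.iter().filter(|a| !a.starts_with("cfg_attr")).cloned().collect())
    } else {
        Cow::Borrowed(attrs)
    }
}
```

Callers read the result through `Deref` in both cases, so the common no-change path costs no allocation.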
Nicholas Nethercote	1fdabfbebb	Avoid double-handling of attributes in `collect_tokens`. By keeping track of attributes that have been previously processed. This fixes the `macro-rules-derive-cfg.stdout` test, and is necessary for #124141 which removes nonterminals. Also shrink the `SmallVec` inline size used in `IntervalSet`. 2 gives slightly better perf than 4 now that there's an `IntervalSet` in `Parser`, which is cloned reasonably often.	2024-08-24 06:57:47 +10:00
Nicholas Nethercote	39b38a94e3	Split the assertion in `NodeRange::new`.	2024-08-23 14:40:08 +10:00
Nicholas Nethercote	0bae33fcd5	Avoid nested replacement ranges. In a case like this: ``` mod a { mod b { #[cfg_attr(unix, inline)] fn f() { #[cfg_attr(linux, inline)] fn g1() {} #[cfg_attr(linux, inline)] fn g2() {} } } } ``` We currently end up with the following replacement ranges. - The lazy tokens for `f` have replacement ranges for `g1` and `g2`. - The lazy tokens for `a` have replacement ranges for `f`, `g1`, and `g2`. I.e. the replacement ranges for `g1` and `g2` are duplicated. In general, replacement ranges for inner AST nodes are duplicated up the chain for each nested `collect_tokens` call. And the code that processes the replacements is careful about the ordering in which the replacements are applied, to ensure that inner replacements are applied before outer replacements. But all of this is unnecessary. If you apply an inner replacement and then an outer replacement, the outer replacement completely overwrites the inner replacement. This commit avoids the duplication by removing replacements from `self.capture_state.parser_replacements` when they are used. (The effect on the example above is that the lazy tokens for `a` no longer include replacement ranges for `g1` and `g2`.) This eliminates the possibility of nested replacements on individual AST nodes, which avoids the need for careful ordering of replacements.	2024-08-23 14:40:08 +10:00
Nicholas Nethercote	1ae521e9d5	Return earlier in some cases in `collect_tokens`. This example triggers an assertion failure: ``` fn f() -> u32 { #[cfg_eval] #[cfg(not(FALSE))] 0 } ``` The sequence of events: - `configure_annotatable` calls `parse_expr_force_collect`, which calls `collect_tokens`. - Within that, we end up in `parse_expr_dot_or_call`, which again calls `collect_tokens`. - The return value of the `f` call is the expression `0`. - This inner call collects tokens for `0` (parser range 10..11) and creates a replacement covering `#[cfg(not(FALSE))] 0` (parser range 0..11). - We return to the outer `collect_tokens` call. The return value of the `f` call is again the expression `0`, again with the range 10..11, but the replacement from earlier covers the range 0..11. The code mistakenly assumes that any attributes from an inner `collect_tokens` call fit entirely within the body of the result of an outer `collect_tokens` call. So it adjusts the replacement parser range 0..11 to a node range by subtracting 10, resulting in -10..1. This is an invalid range and triggers an assertion failure. It's tricky to follow, but basically things get complicated when an AST node is returned from an inner `collect_tokens` call and then returned again from an outer `collect_tokens` call without being wrapped in any kind of additional layer. This commit changes `collect_tokens` to return early in some extra cases, avoiding the construction of lazy tokens. In the example above, the outer `collect_tokens` returns earlier because the `0` token already has tokens and `self.capture_state.capturing` is `Capturing::No`. This early return avoids the creation of the invalid range and the assertion failure. Fixes #129166. Note: these invalid ranges have been happening for a long time. #128725 looks like it's at fault only because it introduced the assertion that catches the invalid ranges.	2024-08-23 14:40:08 +10:00
Nicholas Nethercote	312ecdb2ed	Avoid unnecessary `cloned`.	2024-08-23 14:40:08 +10:00
Nicholas Nethercote	deab741ab4	Clarify a comment.	2024-08-23 14:40:08 +10:00
Michael Goulet	25ff9b6bcb	Use bool in favor of Option<()> for diagnostics	2024-08-21 01:31:11 -04:00
bors	a971212545	Auto merge of #127672 - compiler-errors:precise-capturing, r=spastorino Stabilize opaque type precise capturing (RFC 3617) This PR partially stabilizes opaque type precise capturing, which was specified in [RFC 3617](https://github.com/rust-lang/rfcs/pull/3617), and whose syntax was amended by FCP in [#125836](https://github.com/rust-lang/rust/issues/125836). This feature, as stabilized here, gives us a way to explicitly specify the generic lifetime parameters that an RPIT-like opaque type captures. This solves the problem of overcapturing, for lifetime parameters in these opaque types, and will allow the Lifetime Capture Rules 2024 ([RFC 3498](https://github.com/rust-lang/rfcs/pull/3498)) to be fully stabilized for RPIT in Rust 2024. ### What are we stabilizing? This PR stabilizes the use of a `use<'a, T>` bound in return-position impl Trait opaque types. Such a bound fully specifies the set of generic parameters captured by the RPIT opaque type, entirely overriding the implicit default behavior. E.g.: ```rust fn does_not_capture<'a, 'b>() -> impl Sized + use<'a> {} // ~~~~~~~~~~~~~~~~~~~~ // This RPIT opaque type does not capture `'b`. ``` The way we would suggest thinking of `impl Trait` types without an explicit `use<..>` bound is that the `use<..>` bound has been elided, and that the bound is filled in automatically by the compiler according to the edition-specific capture rules. All non-`'static` lifetime parameters, named (i.e. non-APIT) type parameters, and const parameters in scope are valid to name, including an elided lifetime if such a lifetime would also be valid in an outlives bound, e.g.: ```rust fn elided(x: &u8) -> impl Sized + use<'_> { x } ``` Lifetimes must be listed before type and const parameters, but otherwise the ordering is not relevant to the `use<..>` bound. Captured parameters may not be duplicated. For now, only one `use<..>` bound may appear in a bounds list. It may appear anywhere within the bounds list. 
### How does this differ from the RFC? This stabilization differs from the RFC in one respect: the RFC originally specified `use<'a, T>` as syntactically part of the RPIT type itself, e.g.: ```rust fn capture<'a>() -> impl use<'a> Sized {} ``` However, settling on the final syntax was left as an open question. T-lang later decided via FCP in [#125836](https://github.com/rust-lang/rust/issues/125836) to treat `use<..>` as a syntactic bound instead, e.g.: ```rust fn capture<'a>() -> impl Sized + use<'a> {} ``` ### What aren't we stabilizing? The key goal of this PR is to stabilize the parts of precise capturing that are needed to enable the migration to Rust 2024. There are some capabilities of precise capturing that the RFC specifies but that we're not stabilizing here, as these require further work on the type system. We hope to lift these limitations later. The limitations that are part of this PR were specified in the [RFC's stabilization strategy](https://rust-lang.github.io/rfcs/3617-precise-capturing.html#stabilization-strategy). #### Not capturing type or const parameters The RFC addresses the overcapturing of type and const parameters; that is, it allows for them to not be captured in opaque types. We're not stabilizing that in this PR. Since all in scope generic type and const parameters are implicitly captured in all editions, this is not needed for the migration to Rust 2024. For now, when using `use<..>`, all in scope type and const parameters must be nameable (i.e., APIT cannot be used) and included as arguments. For example, this is an error because `T` is in scope and not included as an argument: ```rust fn test<T>() -> impl Sized + use<> {} //~^ ERROR `impl Trait` must mention all type parameters in scope in `use<...>` ``` This is due to certain current limitations in the type system related to how generic parameters are represented as captured (i.e. bivariance) and how inference operates. 
We hope to relax this in the future, and this stabilization is forward compatible with doing so. #### Precise capturing for return-position impl Trait in trait (RPITIT) The RFC specifies precise capturing for RPITIT. We're not stabilizing that in this PR. Since RPITIT already adheres to the Lifetime Capture Rules 2024, this isn't needed for the migration to Rust 2024. The effect of this is that the anonymous associated types created by RPITITs must continue to capture all of the lifetime parameters in scope, e.g.: ```rust trait Foo<'a> { fn test() -> impl Sized + use<Self>; //~^ ERROR `use<...>` precise capturing syntax is currently not allowed in return-position `impl Trait` in traits } ``` To allow this involves a meaningful amount of type system work related to adding variance to GATs or reworking how generics are represented in RPITITs. We plan to do this work separately from the stabilization. See: - https://github.com/rust-lang/rust/pull/124029 Supporting precise capturing for RPITIT will also require us to implement a new algorithm for detecting refining capture behavior. This may involve looking through type parameters to detect cases where the impl Trait type in an implementation captures fewer lifetimes than the corresponding RPITIT in the trait definition, e.g.: ```rust trait Foo { fn rpit() -> impl Sized + use<Self>; } impl<'a> Foo for &'a () { // This is "refining" due to not capturing `'a` which // is implied by the trait's `use<Self>`. fn rpit() -> impl Sized + use<>; // This is not "refining". fn rpit() -> impl Sized + use<'a>; } ``` This stabilization is forward compatible with adding support for this later. ### The technical details This bound is purely syntactical and does not lower to a [`Clause`](https://doc.rust-lang.org/1.79.0/nightly-rustc/rustc_middle/ty/type.ClauseKind.html) in the type system. 
For the purposes of the type system (and for the types team's curiosity regarding this stabilization), we have no current need to represent this as a `ClauseKind`. Since opaques already capture a variable set of lifetimes depending on edition and their syntactical position (e.g. RPIT vs RPITIT), a `use<..>` bound is just a way to explicitly rather than implicitly specify that set of lifetimes, and this only affects opaque type lowering from AST to HIR. ### FCP plan While there's much discussion of the type system here, the feature in this PR is implemented internally as a transformation that happens before lowering to the type system layer. We already support impl Trait types partially capturing the in scope lifetimes; we just currently only expose that implicitly. So, in my (errs's) view as a types team member, there's nothing for types to weigh in on here with respect to the implementation being stabilized, and I'd suggest a lang-only proposed FCP (though we'll of course CC the team below). ### Authorship and acknowledgments This stabilization report was coauthored by compiler-errors and TC. TC would like to acknowledge the outstanding and speedy work that compiler-errors has done to make this feature happen. compiler-errors thanks TC for authoring the RFC, for all of his involvement in this feature's development, and pushing the Rust 2024 edition forward. ### Open items We're doing some things in parallel here. In signaling the intention to stabilize, we want to uncover any latent issues so we can be sure they get addressed. We want to give the maximum time for discussion here to happen by starting it while other remaining miscellaneous work proceeds. That work includes: - [x] Look into `syn` support. - https://github.com/dtolnay/syn/issues/1677 - https://github.com/dtolnay/syn/pull/1707 - [x] Look into `rustfmt` support. - https://github.com/rust-lang/rust/pull/126754 - [x] Look into `rust-analyzer` support. 
- https://github.com/rust-lang/rust-analyzer/issues/17598 - https://github.com/rust-lang/rust-analyzer/pull/17676 - [x] Look into `rustdoc` support. - https://github.com/rust-lang/rust/issues/127228 - https://github.com/rust-lang/rust/pull/127632 - https://github.com/rust-lang/rust/pull/127658 - [x] Suggest this feature to RfL (a known nightly user). - [x] Add a chapter to the edition guide. - https://github.com/rust-lang/edition-guide/pull/316 - [x] Update the Reference. - https://github.com/rust-lang/reference/pull/1577 ### (Selected) implementation history * https://github.com/rust-lang/rfcs/pull/3498 * https://github.com/rust-lang/rfcs/pull/3617 * https://github.com/rust-lang/rust/pull/123468 * https://github.com/rust-lang/rust/issues/125836 * https://github.com/rust-lang/rust/pull/126049 * https://github.com/rust-lang/rust/pull/126753 Closes #123432. cc `@rust-lang/lang` `@rust-lang/types` `@rustbot` labels +T-lang +I-lang-nominated +A-impl-trait +F-precise_capturing Tracking: - https://github.com/rust-lang/rust/issues/123432 ---- For the compiler reviewer, I'll leave some inline comments about diagnostics fallout :^) r? compiler	2024-08-20 10:42:55 +00:00
Ralf Jung	79503dd742	stabilize raw_ref_op	2024-08-18 19:46:53 +02:00
bors	37d56daac6	Auto merge of #128771 - carbotaniuman:stabilize_unsafe_attr, r=nnethercote Stabilize `unsafe_attributes` # Stabilization report ## Summary This is a tracking issue for the RFC 3325: unsafe attributes We are stabilizing `#![feature(unsafe_attributes)]`, which makes certain attributes considered 'unsafe', meaning that they must be surrounded by an `unsafe(...)`, as in `#[unsafe(no_mangle)]`. RFC: rust-lang/rfcs#3325 Tracking issue: #123757 ## What is stabilized ### Summary of stabilization Certain attributes will now be designated as unsafe attributes, namely, `no_mangle`, `export_name`, and `link_section` (stable only), and these attributes will need to be called by surrounding them in `unsafe(...)` syntax. On editions prior to 2024, this is simply an edition lint, but it will become a hard error in 2024. This also works in `cfg_attr`, but `unsafe` is not allowed for any other attributes, including proc-macros ones. ```rust #[unsafe(no_mangle)] fn a() {} #[cfg_attr(any(), unsafe(export_name = "c"))] fn b() {} ``` For a table showing the attributes that were considered to be included in the list to require unsafe, and subsequent reasoning about why each such attribute was or was not included, see [this comment here](https://github.com/rust-lang/rust/pull/124214#issuecomment-2124753464) ## Tests The relevant tests are in `tests/ui/rust-2024/unsafe-attributes` and `tests/ui/attributes/unsafe`.	2024-08-17 22:48:42 +00:00
Michael Goulet	eae5b5c6e7	Stabilize opaque type precise capturing	2024-08-17 12:33:29 -04:00
Nicholas Nethercote	9d31f86f0d	Overhaul token collection. This commit does the following. - Renames `collect_tokens_trailing_token` as `collect_tokens`, because (a) it's annoyingly long, and (b) the `_trailing_token` bit is less accurate now that its types have changed. - In `collect_tokens`, adds an `Option<CollectPos>` argument and a `UsePreAttrPos` in the return type of `f`. These are used in `parse_expr_force_collect` (for vanilla expressions) and in `parse_stmt_without_recovery` (for two different cases of expression statements). Together these are enough to fix all the problems with token collection and assoc expressions. The changes to the `stringify.rs` test demonstrate some of these. - Adds a new test. The code in this test was causing an assertion failure prior to this commit, due to an invalid `NodeRange`. The extra complexity is annoying, but necessary to fix the existing problems.	2024-08-16 09:07:55 +10:00
Nicholas Nethercote	5aaa2f92ee	Add an assertion to `NodeRange::new`.	2024-08-16 09:07:31 +10:00
Nicholas Nethercote	c8098be41f	Convert a bool to `Trailing`. This pre-existing type is suitable for use with the return value of the `f` parameter in `collect_tokens_trailing_token`. The more descriptive name will be useful because the next commit will add another boolean value to the return value of `f`.	2024-08-16 09:07:29 +10:00
Nicholas Nethercote	55906aa240	Make visibilities minimal and consistent in `attr_wrapper.rs`.	2024-08-16 09:06:15 +10:00
Nicholas Nethercote	af0093a6b8	Remove size assertion on `AttrWrapper`. It's not an important type when it comes to memory use.	2024-08-16 09:06:15 +10:00
Nicholas Nethercote	7923b20dd9	Use `impl PartialEq<TokenKind> for Token` more. This lets us compare a `Token` with a `TokenKind`. It's used a lot, but can be used even more, avoiding the need for some `.kind` uses.	2024-08-14 16:37:09 +10:00
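A cross-type `PartialEq` impl of the kind this commit relies on can be sketched as below. The `Token` and `TokenKind` definitions here are heavily simplified stand-ins rather than the compiler's real types, and `is_terminator` is a hypothetical caller.

```rust
#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum TokenKind {
    Comma,
    Semi,
    Ident(String),
}

#[allow(dead_code)]
#[derive(Debug, Clone)]
struct Token {
    kind: TokenKind,
    // Simplified span: (start, end) byte offsets.
    span: (u32, u32),
}

/// Lets callers write `token == TokenKind::Comma` instead of `token.kind == TokenKind::Comma`.
impl PartialEq<TokenKind> for Token {
    fn eq(&self, rhs: &TokenKind) -> bool {
        self.kind == *rhs
    }
}

/// Hypothetical caller that no longer needs `.kind`.
fn is_terminator(tok: &Token) -> bool {
    *tok == TokenKind::Comma || *tok == TokenKind::Semi
}
```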
Nicholas Nethercote	bbcfd90cd1	Convert a `&mut self` to `&self`.	2024-08-14 13:06:57 +10:00
Guillaume Gomez	99a785d62d	Rollup merge of #128994 - nnethercote:fix-Parser-look_ahead-more, r=compiler-errors Fix bug in `Parser::look_ahead`. The special case was failing to handle invisible delimiters on one path. Fixes (but doesn't close until beta backported) #128895. r? `@davidtwco`	2024-08-12 17:09:20 +02:00
Guillaume Gomez	7c6dca9050	Rollup merge of #128978 - compiler-errors:assert-matches, r=jieyouxu Use `assert_matches` around the compiler more It's a useful assertion, especially since it actually prints out the LHS.	2024-08-12 17:09:19 +02:00
Nicholas Nethercote	46b4c5adc5	Fix bug in `Parser::look_ahead`. The special case was failing to handle invisible delimiters on one path. Fixes #128895.	2024-08-12 13:00:12 +10:00
Michael Goulet	c361c924a0	Use assert_matches around the compiler	2024-08-11 12:25:39 -04:00
Matthias Krüger	32e0fe129d	Rollup merge of #128762 - fmease:use-more-slice-pats, r=compiler-errors Use more slice patterns inside the compiler Nothing super noteworthy. Just replacing the common 'fragile' pattern of "length check followed by indexing or unwrap" with slice patterns for legibility and 'robustness'. r? ghost	2024-08-11 07:51:51 +02:00
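The pattern being replaced, and its slice-pattern equivalent, look roughly like this. The function names are made up for illustration and are not taken from the PR.

```rust
/// The 'fragile' style: a length check followed by indexing.
fn single_segment_indexed(path: &[String]) -> Option<&str> {
    if path.len() == 1 { Some(path[0].as_str()) } else { None }
}

/// The same logic with a slice pattern: the length check and the binding
/// happen together, so they cannot drift apart.
fn single_segment(path: &[String]) -> Option<&str> {
    match path {
        [only] => Some(only.as_str()),
        _ => None,
    }
}

/// Slice patterns also cover "first and last" access without index arithmetic.
fn first_and_last(items: &[u32]) -> Option<(u32, u32)> {
    match items {
        [first, .., last] => Some((*first, *last)),
        [only] => Some((*only, *only)),
        [] => None,
    }
}
```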
许杰友 Jieyou Xu (Joe)	d65f1316bb	parser: ensure let stmt compound assignment removal suggestion respects codepoint boundaries Previously we would try to issue a suggestion for `let x <op>= 1`, i.e. a compound assignment within a `let` binding, to remove the `<op>`. The suggestion code unfortunately incorrectly assumed that the `<op>` is an exactly-1-byte ASCII character, but this assumption is incorrect because we also recover Unicode-confusables like `➖=` as `-=`. In this example, the suggestion code used a `+ BytePos(1)` to calculate the span of the `<op>` codepoint that looks like `-` but the multi-byte Unicode look-alike would cause the suggested removal span to be inside a multi-byte codepoint boundary, triggering a codepoint boundary assertion. Issue: <https://github.com/rust-lang/rust/issues/128845>	2024-08-09 05:56:50 +00:00
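The underlying issue can be illustrated on plain strings: the end of the operator's span has to advance by the character's UTF-8 length, not by a fixed 1 byte. The `op_byte_range` helper below is hypothetical and works on `&str` offsets rather than the compiler's `BytePos` and `Span` types.

```rust
use std::ops::Range;

/// Byte range of the single operator character that starts at `op_start`.
/// Returns `None` if `op_start` is out of range or not on a char boundary.
fn op_byte_range(src: &str, op_start: usize) -> Option<Range<usize>> {
    let op = src.get(op_start..)?.chars().next()?;
    // `len_utf8` is 1 for ASCII `-` but 3 for the look-alike `➖` (U+2796).
    Some(op_start..op_start + op.len_utf8())
}
```

For `"let x ➖= 1"` the operator starts at byte 6 and the range is `6..9`. A fixed `+ 1` would end at byte 7, inside the codepoint.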
León Orell Valerian Liehr	c4c518d2d4	Use more slice patterns inside the compiler	2024-08-07 13:37:52 +02:00
carbotaniuman	de9b5c3ea2	Stabilize `unsafe_attributes`	2024-08-07 03:12:13 -05:00
Matthias Krüger	7d9ed2a864	Rollup merge of #127921 - spastorino:stabilize-unsafe-extern-blocks, r=compiler-errors Stabilize unsafe extern blocks (RFC 3484) # Stabilization report ## Summary This is a tracking issue for the RFC 3484: Unsafe Extern Blocks We are stabilizing `#![feature(unsafe_extern_blocks)]`, as described in [Unsafe Extern Blocks RFC 3484](https://github.com/rust-lang/rfcs/pull/3484). This feature makes explicit that declaring an extern block is unsafe. Starting in Rust 2024, all extern blocks must be marked as unsafe. In all editions, items within unsafe extern blocks may be marked as safe to use. RFC: https://github.com/rust-lang/rfcs/pull/3484 Tracking issue: #123743 ## What is stabilized ### Summary of stabilization We now need extern blocks to be marked as unsafe and items inside can also have safety modifiers (unsafe or safe), by default items with no modifiers are unsafe to offer easy migration without surprising results. ```rust unsafe extern { // sqrt (from libm) may be called with any `f64` pub safe fn sqrt(x: f64) -> f64; // strlen (from libc) requires a valid pointer, // so we mark it as being an unsafe fn pub unsafe fn strlen(p: *const c_char) -> usize; // this function doesn't say safe or unsafe, so it defaults to unsafe pub fn free(p: *mut core::ffi::c_void); pub safe static IMPORTANT_BYTES: [u8; 256]; pub safe static LINES: SyncUnsafeCell<i32>; } ``` ## Tests The relevant tests are in `tests/ui/rust-2024/unsafe-extern-blocks`. 
## History - https://github.com/rust-lang/rust/pull/124482 - https://github.com/rust-lang/rust/pull/124455 - https://github.com/rust-lang/rust/pull/125077 - https://github.com/rust-lang/rust/pull/125522 - https://github.com/rust-lang/rust/issues/126738 - https://github.com/rust-lang/rust/issues/126749 - https://github.com/rust-lang/rust/issues/126755 - https://github.com/rust-lang/rust/pull/126757 - https://github.com/rust-lang/rust/pull/126758 - https://github.com/rust-lang/rust/issues/126756 - https://github.com/rust-lang/rust/pull/126973 - https://github.com/rust-lang/rust/pull/127535 - https://github.com/rust-lang/rustfmt/pull/6204 ## Unresolved questions I am not aware of any unresolved questions.	2024-08-03 20:51:51 +02:00
yukang	22aa104bce	don't suggest turning crate-level attributes into outer style	2024-08-04 00:11:16 +08:00
Matthias Krüger	dee57ce043	Rollup merge of #128483 - nnethercote:still-more-cfg-cleanups, r=petrochenkov Still more `cfg` cleanups Found while looking closely at `cfg`/`cfg_attr` processing code. r? `@petrochenkov`	2024-08-03 11:17:44 +02:00
Matthias Krüger	29cd3103a1	Rollup merge of #128496 - clubby789:box-syntax-multipart, r=compiler-errors Fix removed `box_syntax` diagnostic if source isn't available Fix #128442	2024-08-01 18:43:41 +02:00
clubby789	e157954cce	Fix removed `box_syntax` diagnostic if source isn't available	2024-08-01 13:11:24 +00:00
bors	c0e32983f5	Auto merge of #127543 - carbotaniuman:more_unsafe_attr_verification, r=estebank,traviscross More unsafe attr verification This code denies unsafe on attributes such as `#[test]` and `#[ignore]`, while also changing the `MetaItem` parsing so `unsafe` in args like `#[allow(unsafe(dead_code))]` is not accidentally allowed. Tracking: - https://github.com/rust-lang/rust/issues/123757	2024-08-01 10:40:45 +00:00
Nicholas Nethercote	d1f05fd184	Distinguish the two kinds of token range. When collecting tokens there are two kinds of range: - a range relative to the parser's full token stream (which we get when we are parsing); - a range relative to a single AST node's token stream (which we use within `LazyAttrTokenStreamImpl` when replacing tokens). These are currently both represented with `Range<u32>` and it's easy to mix them up -- until now I hadn't properly understood the difference. This commit introduces `ParserRange` and `NodeRange` to distinguish them. This also requires splitting `ReplaceRange` in two, giving the new types `ParserReplacement` and `NodeReplacement`. (These latter two names reduce the overloading of the word "range".) The commit also rewrites some comments to be clearer. The end result is a little more verbose, but much clearer.	2024-08-01 19:30:40 +10:00
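The newtype idea in this commit can be sketched as below. Only the names `ParserRange` and `NodeRange` come from the commit message. The fields, the `NodeRange::new` signature, and the choice to return `Option` rather than assert are assumptions for illustration.

```rust
use std::ops::Range;

/// A range relative to the parser's full token stream.
#[derive(Debug, Clone, PartialEq)]
struct ParserRange(Range<u32>);

/// A range relative to a single AST node's token stream.
#[derive(Debug, Clone, PartialEq)]
struct NodeRange(Range<u32>);

impl NodeRange {
    /// Rebase a parser range onto a node that starts at `node_start`.
    /// Returns `None` when the parser range begins before the node,
    /// since that would produce a negative (invalid) node range.
    fn new(parser_range: &ParserRange, node_start: u32) -> Option<NodeRange> {
        let start = parser_range.0.start.checked_sub(node_start)?;
        let end = parser_range.0.end.checked_sub(node_start)?;
        Some(NodeRange(start..end))
    }
}
```

Because the two kinds are distinct types, passing a parser-relative range where a node-relative one is expected becomes a compile error instead of a silent mix-up.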
Nicholas Nethercote	9d77d17f71	Move a comment to a better spot.	2024-08-01 19:30:39 +10:00
Nicholas Nethercote	2eb2ef1684	Streamline attribute stitching on AST nodes. It can be done more concisely.	2024-08-01 19:30:32 +10:00
Matthias Krüger	3acd910036	Rollup merge of #126697 - vincenzopalazzo:macros/find_the_expression_tok, r=eholk,compiler-errors [RFC] mbe: consider the `_` in 2024 an expression This commit is adding the possibility to parse the `_` as an expression inside the edition 2024. Link: https://rust-lang.zulipchat.com/#narrow/stream/404510-wg-macros/topic/supporting.20.60_.60.20expressions Issue https://github.com/rust-lang/rust/issues/123742 r? `@eholk`	2024-07-31 23:20:09 +02:00
Michael Goulet	79ef91e879	tweak comment on `NonterminalKind::Expr` Co-authored-by: Eric Holk <eric@theincredibleholk.org>	2024-07-31 15:37:55 -04:00
Vincenzo Palazzo	276fa19c0a	rustc_parser: consider the `_` in 2024 an expression This commit is adding the possibility to parse the `_` as an expression inside the edition 2024. Link: https://rust-lang.zulipchat.com/#narrow/stream/404510-wg-macros/topic/supporting.20.60_.60.20expressions Co-authored-by: Eric Holk <eric@theincredibleholk.org> Signed-off-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com>	2024-07-31 18:34:22 +00:00
Nicholas Nethercote	fe647f0538	Remove `LhsExpr`. `parse_expr_assoc_with` has an awkward structure -- sometimes the lhs is already parsed. This commit splits the post-lhs part into a new method `parse_expr_assoc_rest_with`, which makes everything shorter and simpler.	2024-07-31 12:56:25 +10:00
Nicholas Nethercote	281c2fd5bf	Inline and remove `parse_local_mk`. It has a single use. This makes the `let` handling case in `parse_stmt_without_recovery` more similar to the statement path and statement expression cases.	2024-07-31 12:08:55 +10:00
carbotaniuman	49db8a5a99	Add toggle for `parse_meta_item` unsafe parsing This makes it possible for the `unsafe(...)` syntax to only be valid at the top level, and the `NestedMetaItem`s will automatically reject `unsafe(...)`.	2024-07-30 18:28:43 -05:00
Matthias Krüger	6f0b237c72	Rollup merge of #128376 - compiler-errors:finish-ur-vegetables, r=jieyouxu Mark `Parser::eat`/`check` methods as `#[must_use]` These methods return a `bool`, but we probably should either use these values or explicitly throw them away (e.g. when we just want to unconditionally eat a token if it exists). I changed a few places from `eat` to `expect`, but otherwise I tried to leave a comment explaining why the `eat` was okay. This also adds a test for the `pattern_type!` macro, which used to silently accept a missing `is` token.	2024-07-30 22:51:38 +02:00
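The effect of `#[must_use]` on `eat`/`check`-style methods can be sketched with a toy parser. The `Parser` below is a hypothetical miniature over string tokens, not the real `rustc_parse` type.

```rust
/// Toy parser over string tokens (hypothetical, for illustration only).
struct Parser {
    tokens: Vec<&'static str>,
    pos: usize,
}

impl Parser {
    /// Is the current token `tok`? Ignoring the answer is almost always a bug.
    #[must_use]
    fn check(&self, tok: &str) -> bool {
        self.tokens.get(self.pos).map_or(false, |t| *t == tok)
    }

    /// Consume `tok` if present and report whether it was consumed.
    #[must_use]
    fn eat(&mut self, tok: &str) -> bool {
        let present = self.check(tok);
        if present {
            self.pos += 1;
        }
        present
    }

    /// Like `eat`, but a missing token is an error rather than a silent `false`.
    fn expect(&mut self, tok: &str) -> Result<(), String> {
        if self.eat(tok) { Ok(()) } else { Err(format!("expected `{tok}`")) }
    }
}
```

With the attribute in place, a bare `p.eat("is");` produces an `unused_must_use` warning. A caller that really wants to skip an optional token writes `let _ = p.eat("is");`, and one that requires the token switches to `expect`.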
bors	f8060d282d	Auto merge of #128083 - Mark-Simulacrum:bump-bootstrap, r=albertlarsan68 Bump bootstrap compiler to new beta https://forge.rust-lang.org/release/process.html#master-bootstrap-update-t-2-day-tuesday	2024-07-30 17:49:08 +00:00
bors	595316b400	Auto merge of #127955 - chenyukang:yukang-fix-mismatched-delimiter-issue-12786, r=nnethercote Add limit for unclosed delimiters in lexer diagnostic Fixes #127868 The first commit shows the original diagnostic, and the second commit shows the changes.	2024-07-30 13:02:16 +00:00
carbotaniuman	d8bc8761a5	Deny unsafe on more builtin attributes	2024-07-29 21:00:09 -05:00
Michael Goulet	e4076e34f8	Mark Parser::eat/check methods as must_use	2024-07-29 21:29:08 -04:00
Nicholas Nethercote	84ac80f192	Reformat `use` declarations. The previous commit updated `rustfmt.toml` appropriately. This commit is the outcome of running `x fmt --all` with the new formatting options.	2024-07-29 08:26:52 +10:00
Mark Rousskov	5eca36d27a	step cfg(bootstrap)	2024-07-28 14:46:29 -04:00
Trevor Gross	9164dbd48c	Rollup merge of #128207 - folkertdev:asm-parser-generalize, r=Amanieu improve error message when `global_asm!` uses `asm!` options specifically, what was error: expected one of `)`, `att_syntax`, or `raw`, found `preserves_flags` --> $DIR/bad-options.rs:45:25 \| LL \| global_asm!("", options(preserves_flags)); \| ^^^^^^^^^^^^^^^ expected one of `)`, `att_syntax`, or `raw` is now error: the `preserves_flags` option cannot be used with `global_asm!` --> $DIR/bad-options.rs:45:25 \| LL \| global_asm!("", options(preserves_flags)); \| ^^^^^^^^^^^^^^^ the `preserves_flags` option is not meaningful for global-scoped inline assembly mirroring the phrasing of the [reference](https://doc.rust-lang.org/reference/inline-assembly.html#options). This is also a bit of a refactor for a future `naked_asm!` macro (for use in `#[naked]` functions). Currently this sort of error can come up when switching from inline to global asm, or when a user just isn't that experienced with assembly. With `naked_asm!` added to the mix hitting this error is more likely.	2024-07-27 13:32:56 -04:00
Trevor Gross	7eaf74743b	Rollup merge of #128229 - tdittr:unsafe-extern-abi-error, r=compiler-errors Improve `extern "<abi>" unsafe fn()` error message These errors were already reported in #87217, and fixed by #87235 but missed the case of an explicit ABI. This PR does not cover multiple keywords like `extern "C" pub const unsafe fn()`, but I don't know what a good way to cover this would be. It also seems rarer than `extern "C" unsafe` which I saw happen a few times in workshops.	2024-07-26 19:03:08 -04:00
Trevor Gross	af52be2cea	Rollup merge of #128224 - nnethercote:fewer-replace_ranges, r=petrochenkov Remove unnecessary range replacements This PR removes an unnecessary range replacement in `collect_tokens_trailing_token`, and does a couple of other small cleanups. r? `@petrochenkov`	2024-07-26 19:03:06 -04:00
Trevor Gross	553a64f412	Rollup merge of #128223 - nnethercote:refactor-collect_tokens, r=petrochenkov Refactor complex conditions in `collect_tokens_trailing_token` More readability improvements for this complicated function. r? `@petrochenkov`	2024-07-26 19:03:06 -04:00
Tamme Dittrich	3fdc99193e	Improve error message for `extern "C" unsafe fn()` This was handled correctly already for `extern unsafe fn()`. Co-authored-by: Folkert <folkert@folkertdev.nl>	2024-07-26 15:14:05 +02:00
Nicholas Nethercote	55d37ae711	Remove an unnecessary block.	2024-07-26 17:37:03 +10:00
Nicholas Nethercote	6ea2da5a28	Tweak a loop. A fully imperative style is easier to read than a half-iterator, half-imperative style. Also, rename `inner_attr` as `attr` because it might be an outer attribute.	2024-07-26 17:37:03 +10:00
Nicholas Nethercote	6e87858f26	Fix a comment. Imagine you have replace ranges (2..20,X) and (5..15,Y), and these tokens: ``` a,b,c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v,w,x ``` If we replace (5..15,Y) first, then (2..20,X) we get this sequence ``` a,b,c,d,e,Y,_,_,_,_,_,_,_,_,_,p,q,r,s,t,u,v,w,x a,b,X,_,_,_,_,_,_,_,_,_,_,_,_,_,_,_,_,_,u,v,w,x ``` which is what we want. If we do it in the other order, we get this: ``` a,b,X,_,_,_,_,_,_,_,_,_,_,_,_,_,_,_,_,_,u,v,w,x a,b,X,_,_,Y,_,_,_,_,_,_,_,_,_,_,_,_,_,_,u,v,w,x ``` which is wrong. So it's true that we need the `.rev()` but the comment is wrong about why.	2024-07-26 17:37:03 +10:00
Nicholas Nethercote	a560810a69	Don't include inner attribute ranges in `CaptureState`. The current code is this: ``` self.capture_state.replace_ranges.push((start_pos..end_pos, Some(target))); self.capture_state.replace_ranges.extend(inner_attr_replace_ranges); ``` What's not obvious is that every range in `inner_attr_replace_ranges` must be a strict sub-range of `start_pos..end_pos`. Which means, in `LazyAttrTokenStreamImpl::to_attr_token_stream`, they will be done first, and then the `start_pos..end_pos` replacement will just overwrite them. So they aren't needed.	2024-07-26 14:18:20 +10:00
Nicholas Nethercote	e631b1ebfa	Invert the sense of `is_complete` and rename it as `needs_tokens`. I have always found `is_complete` an unhelpful name. The new name (and inverted sense) fits in better with the conditions at its call sites.	2024-07-26 09:58:34 +10:00
Nicholas Nethercote	3d363c3d99	Move `is_complete` to the module that uses it. And make it non-`pub`.	2024-07-26 09:44:39 +10:00
Nicholas Nethercote	4288edb219	Inline and remove `AttrWrapper::is_complete`. It has a single call site. This change makes the two `needs_collect` conditions more similar to each other, and therefore easier to understand.	2024-07-26 09:44:07 +10:00
Nicholas Nethercote	caee195bdd	Invert early exit conditions in `collect_tokens_trailing_token`. This has been bugging me for a while. I find complex "if any of these are true" conditions easier to think about than complex "if all of these are true" conditions, because you can stop as soon as one is true.	2024-07-26 09:43:41 +10:00
Folkert	d3858f7465	improve error message when `global_asm!` uses `asm!` options	2024-07-25 22:33:52 +02:00
surechen	4ac60601d3	Fix a span error when parsing a wrong param of function. fixes #128042	2024-07-25 22:33:45 +08:00
yukang	94a3fd7678	add limit for unclosed delimiters in lexer diagnostic	2024-07-25 17:01:32 +08:00
Matthias Krüger	cfc5f25b3d	Rollup merge of #127054 - compiler-errors:bound-ordering, r=fmease Reorder trait bound modifiers after `for<...>` binder in trait bounds This PR suggests changing the grammar of trait bounds from: ``` [CONSTNESS] [ASYNCNESS] [?] [BINDER] [TRAIT_PATH] const async ? for<'a> Sized ``` to ``` ([BINDER] [CONSTNESS] [ASYNCNESS] \| [?]) [TRAIT_PATH] ``` i.e., either ``` ? Sized ``` or ``` for<'a> const async Sized ``` (but not both) ### Why? I think it's strange that the binder applies "more tightly" than the `?` trait polarity. This becomes even weirder when considering that we (or at least, I) want to have `async` trait bounds expressed like: ``` where T: for<'a> async Fn(&'a ()) -> i32, ``` and not: ``` where T: async for<'a> Fn(&'a ()) -> i32, ``` ### Fallout No crates on crater use this syntax, presumably because it's literally useless. This will require modifying the reference grammar, though. ### Alternatives If this is not desirable, then we can alternatively keep parsing `for<'a>` after the `?` but deprecate it with either an FCW (or an immediate hard error), and begin parsing `for<'a>` before the `?`.	2024-07-25 04:43:18 +02:00
León Orell Valerian Liehr	7da751a108	Apply suggestions from code review	2024-07-25 03:00:04 +02:00
Santiago Pastorino	8366c7fe9c	Stabilize unsafe extern blocks (RFC 3484)	2024-07-23 00:29:39 -03:00
Oli Scherer	8d290058c9	Always pass the visitor as the first argument to walk* functions	2024-07-22 14:01:24 +00:00
Oli Scherer	754bdef793	Sync `mut_visit` function names with immut `visit` ones (s/noop_visit/walk/)	2024-07-22 14:01:24 +00:00
bors	3811f40d27	Auto merge of #127957 - matthiaskrgr:rollup-1u5ivck, r=matthiaskrgr Rollup of 6 pull requests Successful merges: - #127350 (Parser: Suggest Placing the Return Type After Function Parameters) - #127621 (Rewrite and rename `issue-22131` and `issue-26006` `run-make` tests to rmake) - #127662 (When finding item gated behind a `cfg` flag, point at it) - #127903 (`force_collect` improvements) - #127932 (rustdoc: fix `current` class on sidebar modnav) - #127943 (Don't allow unsafe statics outside of extern blocks) r? `@ghost` `@rustbot` modify labels: rollup	2024-07-19 13:39:12 +00:00
Matthias Krüger	9ada89d9a1	Rollup merge of #127903 - nnethercote:force_collect-improvements, r=petrochenkov `force_collect` improvements Yet more cleanups relating to `cfg_attr` processing. r? `@petrochenkov`	2024-07-19 10:48:05 +02:00
Matthias Krüger	c86e13f330	Rollup merge of #127350 - veera-sivarajan:bugfix-126311, r=lcnr Parser: Suggest Placing the Return Type After Function Parameters Fixes #126311 This PR suggests placing the return type after the function parameters when it's misplaced after a `where` clause. This also tangentially improves diagnostics for cases like [this](`86d6f1312a/tests/ui/parser/issues/misplaced-return-type-without-where-issue-126311.rs (L1C1-L1C28)`) and adds doc comments for `parser::AllowPlus`.	2024-07-19 10:48:03 +02:00
Nicholas Nethercote	1dd566a6d0	Overhaul comments in `collect_tokens_trailing_token`. Adding details, clarifying lots of little things, etc. In particular, the commit adds details of an example. I find this very helpful, because it's taken me a long time to understand how this code works.	2024-07-19 15:25:55 +10:00
Nicholas Nethercote	ca6649516f	Make `Parser::num_bump_calls` 0-indexed. Currently in `collect_tokens_trailing_token`, `start_pos` and `end_pos` are 1-indexed but `replace_ranges` is 0-indexed, which is really confusing. Making them both 0-indexed makes debugging much easier.	2024-07-19 15:25:55 +10:00
Nicholas Nethercote	f9c7ca70cb	Move `inner_attr` code downwards. This puts it just before the `replace_ranges` initialization, which makes sense because the two variables are closely related.	2024-07-19 15:25:54 +10:00
Nicholas Nethercote	1f67cf9e63	Remove `final_attrs` local variable. It's no shorter than `ret.attrs()`, and `ret.attrs()` is used multiple times earlier in the function.	2024-07-19 15:25:54 +10:00
Nicholas Nethercote	757f73f506	Simplify `CaptureState::inner_attr_ranges`. The `Option`s within the `ReplaceRange`s within the hashmap are always `None`. This PR omits them and inserts them when they are extracted from the hashmap.	2024-07-19 15:25:54 +10:00
Nicholas Nethercote	4158a1c48f	Only check `force_collect` in `collect_tokens_trailing_token`. There are three places where we currently check `force_collect` and call `collect_tokens_no_attrs` for `ForceCollect::Yes` and a vanilla parsing function for `ForceCollect::No`. But we can instead just pass in `force_collect` and let `collect_tokens_trailing_token` do the appropriate thing.	2024-07-19 08:42:33 +10:00
Nicholas Nethercote	9d908a2877	Use `ForceCollect` in `parse_attr_item`. Instead of a `bool`. Because `ForceCollect` is used in this way everywhere else.	2024-07-19 08:24:54 +10:00
Nicholas Nethercote	7d7e2a173a	Don't always force collect tokens in `recover_stmt_local_after_let`. Use a parameter to decide whether to force collect, as is done for the closely related `parse_local_mk` method.	2024-07-19 08:24:53 +10:00
Nicholas Nethercote	e69ff1c106	Remove an unnecessary `ForceCollect::Yes`. No need to collect tokens on this recovery path, because the parsed statement isn't even looked at.	2024-07-19 08:20:57 +10:00
Veera	4cad705017	Parser: Suggest Placing the Return Type After Function Parameters	2024-07-18 17:56:34 -04:00
Matthias Krüger	50a90e394e	Rollup merge of #127835 - estebank:issue-127823, r=compiler-errors Fix ICE in suggestion caused by `⩵` being recovered as `==` The second suggestion shown here would previously incorrectly assume that the span corresponding to `⩵` was 2 bytes wide composed by 2 1 byte wide chars, so a span pointing at `==` could point only at one of the `=` to remove it. Instead, we now replace the whole thing (as we should have the whole time): ``` error: unknown start of token: \u{2a75} --> $DIR/unicode-double-equals-recovery.rs:1:16 \| LL \| const A: usize ⩵ 2; \| ^ \| help: Unicode character '⩵' (Two Consecutive Equals Signs) looks like '==' (Double Equals Sign), but it is not \| LL \| const A: usize == 2; \| ~~ error: unexpected `==` --> $DIR/unicode-double-equals-recovery.rs:1:16 \| LL \| const A: usize ⩵ 2; \| ^ \| help: try using `=` instead \| LL \| const A: usize = 2; \| ~ ``` Fix #127823.	2024-07-18 23:05:21 +02:00
Esteban Küber	67ec1326ee	Fix ICE in suggestion caused by `⩵` being recovered as `==` The second suggestion shown here would previously incorrectly assume that the span corresponding to `⩵` was 2 bytes wide composed by 2 1 byte wide chars, so a span pointing at `==` could point only at one of the `=` to remove it. Instead, we now replace the whole thing (as we should have the whole time): ``` error: unknown start of token: \u{2a75} --> $DIR/unicode-double-equals-recovery.rs:1:16 \| LL \| const A: usize ⩵ 2; \| ^ \| help: Unicode character '⩵' (Two Consecutive Equals Signs) looks like '==' (Double Equals Sign), but it is not \| LL \| const A: usize == 2; \| ~~ error: unexpected `==` --> $DIR/unicode-double-equals-recovery.rs:1:16 \| LL \| const A: usize ⩵ 2; \| ^ \| help: try using `=` instead \| LL \| const A: usize = 2; \| ~ ```	2024-07-18 17:47:31 +00:00
Trevor Gross	e2e0681e3a	Rollup merge of #127842 - nnethercote:rm-TrailingToken, r=petrochenkov Remove `TrailingToken`. It's used in `Parser::collect_tokens_trailing_token` to decide whether to capture a trailing token. But the callers actually know whether to capture a trailing token, so it's simpler for them to just pass in a bool. Also, the `TrailingToken::Gt` case was weird, because it didn't result in a trailing token being captured. It could have been subsumed by the `TrailingToken::MaybeComma` case, and it effectively is in the new code. r? `@petrochenkov`	2024-07-18 05:14:07 -05:00
Nicholas Nethercote	487802d6c8	Remove `TrailingToken`. It's used in `Parser::collect_tokens_trailing_token` to decide whether to capture a trailing token. But the callers actually know whether to capture a trailing token, so it's simpler for them to just pass in a bool. Also, the `TrailingToken::Gt` case was weird, because it didn't result in a trailing token being captured. It could have been subsumed by the `TrailingToken::MaybeComma` case, and it effectively is in the new code.	2024-07-18 17:28:49 +10:00
Matthias Krüger	77e5bbf341	Rollup merge of #127889 - estebank:anon-arg-sugg, r=compiler-errors More accurate span for anonymous argument suggestion Use smaller span for suggesting adding `_:` ahead of a type: ``` error: expected one of `(`, `...`, `..=`, `..`, `::`, `:`, `{`, or `\|`, found `)` --> $DIR/anon-params-denied-2018.rs:12:47 \| LL \| fn foo_with_qualified_path(<Bar as T>::Baz); \| ^ expected one of 8 possible tokens \| = note: anonymous parameters are removed in the 2018 edition (see RFC 1685) help: explicitly ignore the parameter name \| LL \| fn foo_with_qualified_path(_: <Bar as T>::Baz); \| ++ ```	2024-07-18 08:09:02 +02:00
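
The replace-range ordering described in commit 6e87858f26 can be reproduced with a small standalone sketch. This is not the rustc implementation: `apply` and `render` are hypothetical helpers, the tokens are plain strings, and `_` stands in for the filler tokens that keep the stream length unchanged. It only illustrates why a nested range has to be applied before the range that encloses it.

```rust
use std::ops::Range;

/// Hypothetical helper: overwrite `tokens[range]` with `new` followed by
/// `_` fillers, so the total number of tokens stays the same.
fn apply(tokens: &mut [String], range: Range<usize>, new: &str) {
    for (offset, slot) in tokens[range].iter_mut().enumerate() {
        *slot = if offset == 0 { new.to_string() } else { "_".to_string() };
    }
}

/// Hypothetical helper: start from the tokens `a` through `x`, apply the
/// replacements in the given order, and join the result with commas.
fn render(order: &[(Range<usize>, &str)]) -> String {
    let mut tokens: Vec<String> = ('a'..='x').map(|c| c.to_string()).collect();
    for (range, new) in order {
        apply(&mut tokens, range.clone(), new);
    }
    tokens.join(",")
}

fn main() {
    // Inner range first, then the enclosing range: the outer replacement
    // overwrites the inner one, which is the desired outcome.
    println!("{}", render(&[(5..15, "Y"), (2..20, "X")]));

    // Enclosing range first, then the inner range: `Y` reappears inside
    // the region that `X` was supposed to cover entirely.
    println!("{}", render(&[(2..20, "X"), (5..15, "Y")]));
}
```

In the first ordering no `Y` survives in the output; in the second it ends up at index 5, inside the span that `X` should own.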


2108 Commits