nordic-dev.net/rust - rust

mirror of https://github.com/rust-lang/rust.git synced 2025-05-01 12:37:37 +00:00

Author	SHA1	Message	Date
Nicholas Nethercote	9018d2c455	Detect `NulInCStr` error earlier. By making it an `EscapeError` instead of a `LitError`. This makes it like the other errors produced when checking string literals contents, e.g. for invalid escape sequences or bare CR chars. NOTE: this means these errors are issued earlier, before expansion, which changes behaviour. It will be possible to move the check back to the later point if desired. If that happens, it's likely that all the string literal contents checks will be delayed together. One nice thing about this: the old approach had some code in `report_lit_error` to calculate the span of the nul char from a range. This code used a hardwired `+2` to account for the `c"` at the start of a C string literal, but this should have changed to a `+3` for raw C string literals to account for the `cr"`, which meant that the caret in `cr"` nul error messages was one short of where it should have been. The new approach doesn't need any of this and avoids the off-by-one error.	2024-01-12 16:19:37 +11:00
Nicholas Nethercote	6656413a5c	Stop using `DiagnosticBuilder::buffer` in the parser. One consequence is that errors returned by `maybe_new_parser_from_source_str` now must be consumed, so a bunch of places that previously ignored those errors now cancel them. (Most of them explicitly dropped the errors before. I guess that was to indicate "we are explicitly ignoring these", though I'm not 100% sure.)	2024-01-11 18:37:56 +11:00
Nicholas Nethercote	d02150fd45	Fix lifetimes in `StringReader`. Two different lifetimes are conflated. This doesn't matter right now, but needs to be fixed for the next commit to work. And the more descriptive lifetime names make the code easier to read.	2024-01-11 16:55:10 +11:00
Nicholas Nethercote	ed76b0b882	Rename consuming chaining methods on `DiagnosticBuilder`. In #119606 I added them and used a `_mv` suffix, but that wasn't great. A `with_` prefix has three different existing uses. - Constructors, e.g. `Vec::with_capacity`. - Wrappers that provide an environment to execute some code, e.g. `with_session_globals`. - Consuming chaining methods, e.g. `Span::with_{lo,hi,ctxt}`. The third case is exactly what we want, so this commit changes `DiagnosticBuilder::foo_mv` to `DiagnosticBuilder::with_foo`. Thanks to @compiler-errors for the suggestion.	2024-01-10 07:40:00 +11:00
Nicholas Nethercote	3c4f1d85af	Rename `{create,emit}_warning` as `{create,emit}_warn`. For consistency with `warn`/`struct_warn`, and also `{create,emit}_err`, all of which use an abbreviated form.	2024-01-10 07:33:06 +11:00
Nicholas Nethercote	4752a923af	Remove `DiagnosticBuilder::delay_as_bug_without_consuming`. The existing uses are replaced in one of three ways. - In a function that also has calls to `emit`, just rearrange the code so that exactly one of `delay_as_bug` or `emit` is called on every path. - In a function returning a `DiagnosticBuilder`, use `downgrade_to_delayed_bug`. That's good enough because it will get emitted later anyway. - In `unclosed_delim_err`, one set of errors is being replaced with another set, so just cancel the original errors.	2024-01-08 16:07:14 +11:00
Nicholas Nethercote	6682f243dc	Remove all eight `DiagnosticBuilder::*_with_code` methods. These all have relatively low use, and can be perfectly emulated with a simpler construction method combined with `code` or `code_mv`.	2024-01-08 16:00:34 +11:00
Nicholas Nethercote	589591efde	Use chaining in `DiagnosticBuilder` construction. To avoid the use of a mutable local variable, and because it reads more nicely.	2024-01-08 15:43:07 +11:00
Nicholas Nethercote	b1b9278851	Make `DiagnosticBuilder::emit` consuming. This works for most of its call sites. This is nice, because `emit` very much makes sense as a consuming operation -- indeed, `DiagnosticBuilderState` exists to ensure no diagnostic is emitted twice, but it uses runtime checks. For the small number of call sites where a consuming emit doesn't work, the commit adds `DiagnosticBuilder::emit_without_consuming`. (This will be removed in subsequent commits.) Likewise, `emit_unless` becomes consuming. And `delay_as_bug` becomes consuming, while `delay_as_bug_without_consuming` is added (which will also be removed in subsequent commits.) All this requires significant changes to `DiagnosticBuilder`'s chaining methods. Currently `DiagnosticBuilder` method chaining uses a non-consuming `&mut self -> &mut Self` style, which allows chaining to be used when the chain ends in `emit()`, like so: ``` struct_err(msg).span(span).emit(); ``` But it doesn't work when producing a `DiagnosticBuilder` value, requiring this: ``` let mut err = self.struct_err(msg); err.span(span); err ``` This style of chaining won't work with consuming `emit` though. For that, we need to use to a `self -> Self` style. That also would allow `DiagnosticBuilder` production to be chained, e.g.: ``` self.struct_err(msg).span(span) ``` However, removing the `&mut self -> &mut Self` style would require that individual modifications of a `DiagnosticBuilder` go from this: ``` err.span(span); ``` to this: ``` err = err.span(span); ``` There are many such places. I have a high tolerance for tedious refactorings, but even I gave up after a long time trying to convert them all. Instead, this commit has it both ways: the existing `&mut self -> Self` chaining methods are kept, and new `self -> Self` chaining methods are added, all of which have a `_mv` suffix (short for "move"). Changes to the existing `forward!` macro lets this happen with very little additional boilerplate code. I chose to add the suffix to the new chaining methods rather than the existing ones, because the number of changes required is much smaller that way. This doubled chainging is a bit clumsy, but I think it is worthwhile because it allows a lot of good things to subsequently happen. In this commit, there are many `mut` qualifiers removed in places where diagnostics are emitted without being modified. In subsequent commits: - chaining can be used more, making the code more concise; - more use of chaining also permits the removal of redundant diagnostic APIs like `struct_err_with_code`, which can be replaced easily with `struct_err` + `code_mv`; - `emit_without_diagnostic` can be removed, which simplifies a lot of machinery, removing the need for `DiagnosticBuilderState`.	2024-01-08 15:24:49 +11:00
Nicholas Nethercote	8e6bca63f9	Inline and remove `StringReader::struct_fatal_span_char`. It has a single call site.	2024-01-04 11:44:16 +11:00
Nicholas Nethercote	505c1371d0	Rename some `Diagnostic` setters. `Diagnostic` has 40 methods that return `&mut Self` and could be considered setters. Four of them have a `set_` prefix. This doesn't seem necessary for a type that implements the builder pattern. This commit removes the `set_` prefixes on those four methods.	2024-01-03 19:40:20 +11:00
Nicholas Nethercote	99472c7049	Remove `Session` methods that duplicate `DiagCtxt` methods. Also add some `dcx` methods to types that wrap `TyCtxt`, for easier access.	2023-12-24 08:05:28 +11:00
Nicholas Nethercote	d51db05d7e	Remove `ParseSess` methods that duplicate `DiagCtxt` methods. Also add missing `#[track_caller]` attributes to `DiagCtxt` methods as necessary to keep tests working.	2023-12-24 07:59:21 +11:00
Nicholas Nethercote	ec9af0d6cb	Remove `Parser` methods that duplicate `DiagCtxt` methods.	2023-12-24 07:48:47 +11:00
Nicholas Nethercote	f5459201e0	Add `EmitResult` associated type to `EmissionGuarantee`. This lets different error levels share the same return type from `emit_*`. - A lot of inconsistencies in the `DiagCtxt` API are removed. - `Noted` is removed. - `FatalAbort` is introduced for fatal errors (abort via `raise`), replacing the `EmissionGuarantee` impl for `!`. - `Bug` is renamed `BugAbort` (to avoid clashing with `Level::Bug` and to mirror `FatalAbort`), and modified to work in the new way with bug errors (abort via panic). - Various diagnostic creators and emitters updated to the new, better signatures. Note that `DiagCtxt::bug` no longer needs to call `panic_any`, because `emit` handles that. Also shorten the obnoxiously long `diagnostic_builder_emit_producing_guarantee` name.	2023-12-19 09:52:02 +11:00
Nicholas Nethercote	f422dca3ae	Rename many `DiagCtxt` arguments.	2023-12-18 16:06:22 +11:00
Nicholas Nethercote	9df1576e1d	Rename `ParseSess::span_diagnostic` as `ParseSess::dcx`.	2023-12-18 16:06:21 +11:00
Nicholas Nethercote	cde19c016e	Rename `Handler` as `DiagCtxt`.	2023-12-18 16:06:19 +11:00
bors	3ad8e2d129	Auto merge of #118897 - nnethercote:more-unescaping-cleanups, r=fee1-dead More unescaping cleanups More minor improvements I found while working on #118699. r? `@fee1-dead`	2023-12-16 08:52:06 +00:00
Nicholas Nethercote	e3b7ecc1ef	Remove one use of `span_bug_no_panic`. It's unclear why this is used here. All entries in the third column of `UNICODE_ARRAY` are covered by `ASCII_ARRAY`, so if the lookup fails it's a genuine compiler bug. It was added way back in #29837, for no clear reason. This commit changes it to `span_bug`, which is more typical.	2023-12-14 15:53:55 +11:00
Nicholas Nethercote	423bf4233d	Rename the `span` args to `emit_unescape_error`. The `span` arg is described in a comment as "interior span of the literal, without quotes", which is incorrect. It's actually the span of the error part of the literal, corresponding to `range`. This commit renames `span` and `span_without_quotes` to make things clearer, and fixes the erroneous comment.	2023-12-13 10:05:57 +11:00
Nicholas Nethercote	4cfdbd328b	Add spacing information to delimiters. This is an extension of the previous commit. It means the output of something like this: ``` stringify!(let a: Vec<u32> = vec![];) ``` goes from this: ``` let a: Vec<u32> = vec![] ; ``` With this PR, it now produces this string: ``` let a: Vec<u32> = vec![]; ```	2023-12-11 09:36:40 +11:00
Nicholas Nethercote	925f7fad57	Improve `print_tts` by changing `tokenstream::Spacing`. `tokenstream::Spacing` appears on all `TokenTree::Token` instances, both punct and non-punct. Its current usage: - `Joint` means "can join with the next token and that token is a punct". - `Alone` means "cannot join with the next token or can join with the next token but that token is not a punct". The fact that `Alone` is used for two different cases is awkward. This commit augments `tokenstream::Spacing` with a new variant `JointHidden`, resulting in: - `Joint` means "can join with the next token and that token is a punct". - `JointHidden` means "can join with the next token and that token is a not a punct". - `Alone` means "cannot join with the next token". This drastically improves the output of `print_tts`. For example, this: ``` stringify!(let a: Vec<u32> = vec![];) ``` currently produces this string: ``` let a : Vec < u32 > = vec! [] ; ``` With this PR, it now produces this string: ``` let a: Vec<u32> = vec![] ; ``` (The space after the `]` is because `TokenTree::Delimited` currently doesn't have spacing information. The subsequent commit fixes this.) The new `print_tts` doesn't replicate original code perfectly. E.g. multiple space characters will be condensed into a single space character. But it's much improved. `print_tts` still produces the old, uglier output for code produced by proc macros. Because we have to translate the generated code from `proc_macro::Spacing` to the more expressive `token::Spacing`, which results in too much `proc_macro::Along` usage and no `proc_macro::JointHidden` usage. So `space_between` still exists and is used by `print_tts` in conjunction with the `Spacing` field. This change will also help with the removal of `Token::Interpolated`. Currently interpolated tokens are pretty-printed nicely via AST pretty printing. `Token::Interpolated` removal will mean they get printed with `print_tts`. Without this change, that would result in much uglier output for code produced by decl macro expansions. With this change, AST pretty printing and `print_tts` produce similar results. The commit also tweaks the comments on `proc_macro::Spacing`. In particular, it refers to "compound tokens" rather than "multi-char operators" because lifetimes aren't operators.	2023-12-11 09:19:09 +11:00
bors	63d16b5a98	Auto merge of #117472 - jmillikin:stable-c-str-literals, r=Nilstrieb Stabilize C string literals RFC: https://rust-lang.github.io/rfcs/3348-c-str-literal.html Tracking issue: https://github.com/rust-lang/rust/issues/105723 Documentation PR (reference manual): https://github.com/rust-lang/reference/pull/1423 # Stabilization report Stabilizes C string and raw C string literals (`c"..."` and `cr#"..."#`), which are expressions of type [`&CStr`](https://doc.rust-lang.org/stable/core/ffi/struct.CStr.html). Both new literals require Rust edition 2021 or later. ```rust const HELLO: &core::ffi::CStr = c"Hello, world!"; ``` C strings may contain any byte other than `NUL` (`b'\x00'`), and their in-memory representation is guaranteed to end with `NUL`. ## Implementation Originally implemented by PR https://github.com/rust-lang/rust/pull/108801, which was reverted due to unintentional changes to lexer behavior in Rust editions < 2021. The current implementation landed in PR https://github.com/rust-lang/rust/pull/113476, which restricts C string literals to Rust edition >= 2021. ## Resolutions to open questions from the RFC * Adding C character literals (`c'.'`) of type `c_char` is not part of this feature. * Support for `c"..."` literals does not prevent `c'.'` literals from being added in the future. * C string literals should not be blocked on making `&CStr` a thin pointer. * It's possible to declare constant expressions of type `&'static CStr` in stable Rust (as of v1.59), so C string literals are not adding additional coupling on the internal representation of `CStr`. * The unstable `concat_bytes!` macro should not accept `c"..."` literals. * C strings have two equally valid `&[u8]` representations (with or without terminal `NUL`), so allowing them to be used in `concat_bytes!` would be ambiguous. * Adding a type to represent C strings containing valid UTF-8 is not part of this feature. * Support for a hypothetical `&Utf8CStr` may be explored in the future, should such a type be added to Rust.	2023-12-01 13:33:55 +00:00
Nilstrieb	21a870515b	Fix `clippy::needless_borrow` in the compiler `x clippy compiler -Aclippy::all -Wclippy::needless_borrow --fix`. Then I had to remove a few unnecessary parens and muts that were exposed now.	2023-11-21 20:13:40 +01:00
sjwang05	f88cf0206f	Move unclosed delim errors to separate function	2023-11-11 13:39:08 -08:00
sjwang05	a49368f00b	Correctly handle while-let-chains	2023-11-10 12:13:53 -08:00
sjwang05	9455259450	Catch an edge case	2023-11-09 20:07:17 -08:00
sjwang05	0094238157	Catch stray { in let-chains	2023-11-09 18:47:49 -08:00
John Millikin	0f41bc21b9	Stabilize C string literals	2023-11-01 09:16:34 +09:00
Esteban Küber	50ca5ef07f	When encountering unclosed delimiters during parsing, check for diff markers Fix #116252.	2023-10-30 00:56:46 +00:00
Michael Goulet	b2d2184ede	Format all the let chains in compiler	2023-10-13 08:59:36 +00:00
Nicholas Nethercote	bb9c2f50c3	Reorder an expression to improve readability.	2023-10-12 08:46:15 +11:00
Nicholas Nethercote	becf4942a2	Rename `Token::is_op` as `Token::is_punct`. For consistency with `proc_macro::Punct`.	2023-10-12 08:46:15 +11:00
beetrees	072d8c8bbc	Fix suggestion for attempting to define a string with single quotes	2023-08-16 21:51:57 +01:00
bjorn3	ef2da4a49b	Remove reached_eof from ParseSess It was only ever set in a function which isn't called anywhere.	2023-08-13 13:33:37 +00:00
Matthias Krüger	23815467a2	inline format!() args up to and including rustc_middle	2023-07-30 13:18:33 +02:00
bors	23405bb123	Auto merge of #113476 - fee1-dead-contrib:c-str-lit, r=petrochenkov Reimplement C-str literals This reverts #113334, cc `@fmease.` While converting lexer tokens to ast Tokens in `rustc_parse`, we check the edition of the span of the token. If the edition < 2021, we split the token into two, one being the identifier and other being the str literal.	2023-07-25 12:04:34 +00:00
Deadbeef	a0376e9ec2	extract common code	2023-07-25 09:24:12 +00:00
Matthias Krüger	ed4c5fef72	fix some clippy::style findings comparison_to_empty iter_nth_zero for_kv_map manual_next_back redundant_pattern	2023-07-23 23:36:56 +02:00
Deadbeef	df9bd80d74	reimplement C string literals	2023-07-23 06:54:07 +00:00
Hankai Zhang	6336da9a75	Use a better link	2023-06-10 14:46:11 -04:00
Hankai Zhang	e5fccf927d	Update links to Rust Reference page on literals in diagnostic Instead of linking to the old Rust Reference site on static.rust-lang.org, link to the current website doc.rust-lang.org/stable/reference instead in diagnostic about incorrect literals.	2023-06-10 12:34:16 -04:00
Nicholas Nethercote	01e33a3600	Avoid `&format("...")` calls in error message code. Error message all end up passing into a function as an `impl Into<{D,Subd}iagnosticMessage>`. If an error message is creatd as `&format("...")` that means we allocate a string (in the `format!` call), then take a reference, and then clone (allocating again) the reference to produce the `{D,Subd}iagnosticMessage`, which is silly. This commit removes the leading `&` from a lot of these cases. This means the original `String` is moved into the `{D,Subd}iagnosticMessage`, avoiding the double allocations. This requires changing some function argument types from `&str` to `String` (when all arguments are `String`) or `impl Into<{D,Subd}iagnosticMessage>` (when some arguments are `String` and some are `&str`).	2023-05-16 17:59:56 +10:00
Dylan DPC	4891f02cff	Rollup merge of #108801 - fee1-dead-contrib:c-str, r=compiler-errors Implement RFC 3348, `c"foo"` literals RFC: https://github.com/rust-lang/rfcs/pull/3348 Tracking issue: #105723	2023-05-05 18:40:33 +05:30
Nicholas Nethercote	6b62f37402	Restrict `From<S>` for `{D,Subd}iagnosticMessage`. Currently a `{D,Subd}iagnosticMessage` can be created from any type that impls `Into<String>`. That includes `&str`, `String`, and `Cow<'static, str>`, which are reasonable. It also includes `&String`, which is pretty weird, and results in many places making unnecessary allocations for patterns like this: ``` self.fatal(&format!(...)) ``` This creates a string with `format!`, takes a reference, passes the reference to `fatal`, which does an `into()`, which clones the reference, doing a second allocation. Two allocations for a single string, bleh. This commit changes the `From` impls so that you can only create a `{D,Subd}iagnosticMessage` from `&str`, `String`, or `Cow<'static, str>`. This requires changing all the places that currently create one from a `&String`. Most of these are of the `&format!(...)` form described above; each one removes an unnecessary static `&`, plus an allocation when executed. There are also a few places where the existing use of `&String` was more reasonable; these now just use `clone()` at the call site. As well as making the code nicer and more efficient, this is a step towards possibly using `Cow<'static, str>` in `{D,Subd}iagnosticMessage::{Str,Eager}`. That would require changing the `From<&'a str>` impls to `From<&'static str>`, which is doable, but I'm not yet sure if it's worthwhile.	2023-05-03 08:44:39 +10:00
Deadbeef	d30c668175	make cook generic	2023-05-02 10:32:08 +00:00
Deadbeef	abb181dfd9	make it semantic error	2023-05-02 10:32:08 +00:00
Deadbeef	4c01d494b8	refactor unescape	2023-05-02 10:32:07 +00:00
Deadbeef	8ff3903643	initial step towards implementing C string literals	2023-05-02 10:30:09 +00:00
clubby789	0138513635	Fix static string lints	2023-04-25 18:59:55 +01:00
Josh Soref	e09d0d2a29	Spelling - compiler * account * achieved * advising * always * ambiguous * analysis * annotations * appropriate * build * candidates * cascading * category * character * clarification * compound * conceptually * constituent * consts * convenience * corresponds * debruijn * debug * debugable * debuggable * deterministic * discriminant * display * documentation * doesn't * ellipsis * erroneous * evaluability * evaluate * evaluation * explicitly * fallible * fulfill * getting * has * highlighting * illustrative * imported * incompatible * infringing * initialized * into * intrinsic * introduced * javascript * liveness * metadata * monomorphization * nonexistent * nontrivial * obligation * obligations * offset * opaque * opportunities * opt-in * outlive * overlapping * paragraph * parentheses * poisson * precisely * predecessors * predicates * preexisting * propagated * really * reentrant * referent * responsibility * rustonomicon * shortcircuit * simplifiable * simplifications * specify * stabilized * structurally * suggestibility * translatable * transmuting * two * unclosed * uninhabited * visibility * volatile * workaround Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2023-04-17 16:09:18 -04:00
bors	9693b178fc	Auto merge of #110252 - matthiaskrgr:rollup-ovaixra, r=matthiaskrgr Rollup of 8 pull requests Successful merges: - #109810 (Replace rustdoc-ui/{c,z}-help tests with a stable run-make test ) - #110035 (fix: ensure bad `#[test]` invocs retain correct AST) - #110089 (sync::mpsc: synchronize receiver disconnect with initialization) - #110103 (Report overflows gracefully with new solver) - #110122 (Fix x check --stage 1 when download-ci-llvm=false) - #110133 (Do not use ImplDerivedObligationCause for inherent impl method error reporting) - #110135 (Revert "Don't recover lifetimes/labels containing emojis as character literals") - #110235 (Fix `--extend-css` option) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup	2023-04-12 22:19:29 +00:00
Matthias Krüger	57393be6fb	Rollup merge of #110135 - compiler-errors:revert-108031, r=davidtwco Revert "Don't recover lifetimes/labels containing emojis as character literals" Reverts PR #108031 per https://github.com/rust-lang/rust/pull/109754#issuecomment-1490452045 Fixes (doesnt close until beta backported) #109746 This reverts commit `e3f9db5fc3`. This reverts commit `98b82aedba`. This reverts commit `380fa26413`.	2023-04-12 22:04:35 +02:00
DaniPopes	677357d32b	Fix typos in compiler	2023-04-10 22:02:52 +02:00
Michael Goulet	a047064d6b	Revert "Don't recover lifetimes/labels containing emojis as character literals" Reverts PR #108031 Fixes (doesnt close until beta backported) #109746 This reverts commit `e3f9db5fc3`. This reverts commit `98b82aedba`. This reverts commit `380fa26413`.	2023-04-10 06:52:41 +00:00
Nilstrieb	4b4948c2e3	Remove identity casts	2023-04-09 23:22:14 +02:00
Nilstrieb	81c320ea77	Fix some clippy::complexity	2023-04-09 23:22:14 +02:00
Oli Scherer	7edd1d8799	Replace another lock with an append-only vec	2023-04-04 09:01:44 +00:00
Maybe Waffle	775bacd1b8	Simplify `sort_by` calls	2023-03-07 18:13:41 +00:00
yukang	f808877bbf	refactor parse_token_trees to not return unmatched_delims	2023-02-28 07:57:17 +00:00
yukang	9ce7472db4	rename unmatched_braces to unmatched_delims	2023-02-28 07:57:17 +00:00
yukang	65ad5f8de7	remove duplicated diagnostic for unclosed delimiter	2023-02-28 07:57:17 +00:00
许杰友 Jieyou Xu (Joe)	380fa26413	Don't recover lifetimes/labels containing emojis as character literals Note that at the time of this commit, `unic-emoji-char` seems to have data tables only up to Unicode 5.0, but Unicode is already newer than this. A newer emoji such as `🥺` will not be recognized as an emoji but older emojis such as `🐱` will.	2023-02-14 17:31:58 +08:00
clubby789	521c5f36d6	Migrate `rustc_parse` to derive diagnostics	2023-02-06 14:40:35 +00:00
Matthias Krüger	e3048c7838	Rollup merge of #104012 - chenyukang:yukang/fix-103882-deli-indentation, r=petrochenkov Improve unexpected close and mismatch delimiter hint in TokenTreesReader Fixes #103882 Fixes #68987 Fixes #69259 The inner indentation mismatching will be covered by outer block, the new added function `report_error_prone_delim_block` will find out the error prone candidates for reporting.	2023-01-28 11:11:05 +01:00
yukang	cd233231aa	Improve unexpected close and mismatch delimiter hint in TokenTreesReader	2023-01-27 17:45:41 +08:00
clubby789	1487aa9f9d	Add double-equals homoglyph	2023-01-19 02:25:55 +00:00
clubby789	3520bba136	Use strings for homoglyph replacements	2023-01-19 02:24:51 +00:00
David Tolnay	dab06ccdab	Emit only one nbsp error per file	2023-01-14 11:06:22 -08:00
clubby789	a3d6bc3468	Emit a single error for contiguous sequences of Unicode homoglyphs	2023-01-12 00:15:32 +00:00
Maybe Waffle	1d42936b18	Prefer doc comments over `//`-comments in compiler	2022-11-27 11:19:04 +00:00
Nicholas Nethercote	358a603f11	Use `token::Lit` in `ast::ExprKind::Lit`. Instead of `ast::Lit`. Literal lowering now happens at two different times. Expression literals are lowered when HIR is crated. Attribute literals are lowered during parsing. This commit changes the language very slightly. Some programs that used to not compile now will compile. This is because some invalid literals that are removed by `cfg` or attribute macros will no longer trigger errors. See this comment for more details: https://github.com/rust-lang/rust/pull/102944#issuecomment-1277476773	2022-11-16 09:41:28 +11:00
Dylan DPC	4b50fb3745	Rollup merge of #103919 - nnethercote:unescaping-cleanups, r=matklad Unescaping cleanups Some code improvements, and some error message improvements. Best reviewed one commit at a time. r? ````@matklad````	2022-11-09 19:21:22 +05:30
Nicholas Nethercote	dba6fc3ef5	Make underscore_literal_suffix a hard error. It's been a warning for 5.5 years. Time to make it a hard error. Closes #42326.	2022-11-07 10:00:36 +11:00
Nicholas Nethercote	a203482d2a	Inline and remove `validate_int_literal`. It has a single callsite, and is fairly small. The `Float` match arm already has base-specific checking inline, so this makes things more consistent.	2022-11-04 14:24:41 +11:00
Nicholas Nethercote	d963686f5a	Refactor `cook_lexer_literal`. It deals with eight cases: ints, floats, and the six quoted types (char/byte/strings). For ints and floats we have an early return, and the other six types fall through to the code at the end, which makes the function hard to read. This commit rearranges things to avoid the early returns.	2022-11-04 14:24:41 +11:00
Nicholas Nethercote	7dbf2c0ed8	Make non-ASCII errors more consistent. There are three kinds of "byte" literals: byte literals, byte string literals, and raw byte string literals. None are allowed to have non-ASCII chars in them. Two `EscapeError` variants exist for when that constraint is violated. - `NonAsciiCharInByte`: used for byte literals and byte string literals. - `NonAsciiCharInByteString`: used for raw byte string literals. As a result, the messages for raw byte string literals use different wording, without good reason. Also, byte string literals are incorrectly described as "byte constants" in some error messages. This commit eliminates `NonAsciiCharInByteString` so the three cases are handled similarly, and described correctly. The `mode` is enough to distinguish them. Note: Some existing error messages mention "byte constants" and some mention "byte literals". I went with the latter here, because it's a more correct name, as used by the Reference.	2022-11-04 14:23:40 +11:00
Nicholas Nethercote	34b32b0dac	Use `Mode` less. It's passed to numerous places where we just need an `is_byte` bool. Passing the bool avoids the need for some assertions. Also rename `is_bytes()` as `is_byte()`, to better match `Mode::Byte`, `Mode::ByteStr`, and `Mode::RawByteStr`.	2022-11-03 15:58:19 +11:00
Dylan DPC	e029c1fd43	Rollup merge of #101293 - compiler-errors:lt-is-actually-char, r=estebank Recover when unclosed char literal is parsed as a lifetime in some positions Fixes #101278	2022-10-23 15:20:16 +05:30
Michael Goulet	0270b50eb0	Recover unclosed char literal being parsed as lifetime	2022-10-22 06:57:12 +00:00
clubby789	ed40d46159	Properly escape quotes when suggesting switching between char/string literals	2022-10-22 02:37:15 +01:00
Nicholas Nethercote	4e5ddf1adf	Invert `is_top_level` to avoid negation.	2022-10-03 11:42:29 +11:00
Nicholas Nethercote	a822d08bd1	Remove `TokenStreamBuilder`. It's now only used in one function. Also, the "should we glue the tokens?" check is only necessary when pushing a `TokenTree::Token`, not when pushing a `TokenTree::Delimited`. As part of this, we now do the "should we glue the tokens?" check immediately, which avoids having look back at the previous token. It also puts all the logic dealing with token gluing in a single place.	2022-10-03 11:42:29 +11:00
Nicholas Nethercote	8d0754d602	Inline and remove `parse_token_tree_non_delim_non_eof`. It has a single call site.	2022-10-03 11:42:29 +11:00
Nicholas Nethercote	ce7676829e	Merge `parse_token_trees_until_close_delim` and `parse_all_token_trees`. Because they're very similar, and this will allow some follow-up changes.	2022-10-03 11:42:29 +11:00
Nicholas Nethercote	d0a26acb2a	Address review comments.	2022-09-28 11:15:23 +10:00
Nicholas Nethercote	7f7e2165b1	Rename some variables. These make the delimiter processing clearer.	2022-09-27 12:04:03 +10:00
Nicholas Nethercote	880ebb657a	Minor improvements. Add some comments, and mark one path as unreachable.	2022-09-27 09:53:04 +10:00
Nicholas Nethercote	fb4dba0a17	Inline and remove `cook_lexer_token`. This is a small performance win, alas.	2022-09-26 13:50:13 +10:00
Nicholas Nethercote	da84f0f4c3	Add `rustc_lexer::TokenKind::Eof`. For alignment with `rust_ast::TokenKind::Eof`. Plus it's a bit faster, due to less `Option` manipulation in `StringReader::next_token`.	2022-09-26 13:48:08 +10:00
Nicholas Nethercote	ceb25d125f	Use less DRY in `cook_lexer_token`. This is a case where a small amount of repetition results in code that is faster and easier to read.	2022-09-26 13:41:58 +10:00
Nicholas Nethercote	aa6bfaf04b	Make `rustc_lexer::cursor::Cursor` public. `Cursor` is currently hidden, and the main tokenization path uses `rustc_lexer::first_token` which involves constructing a new `Cursor` for every single token, which is weird. Also, `first_token` also can't handle empty input, so callers have to check for that first. This commit makes `Cursor` public, so `StringReader` can contain a `Cursor`, which results in a simpler structure. The commit also changes `StringReader::advance_token` so it returns an `Option<Token>`, simplifying the the empty input case.	2022-09-26 13:36:35 +10:00
Nicholas Nethercote	33516ac09a	[ui] Rearrange `StringReader`/`TokenTreesReader` creation. `TokenTreesReader` wraps a `StringReader`, but the `into_token_trees` function obscures this. This commit moves to a more straightforward control flow.	2022-09-26 13:35:46 +10:00
Nicholas Nethercote	33ba2776c9	Remove `ast::Token::take`. Instead of replacing `TokenTreesReader::token` in two steps, we can just do it in one, which is both simpler and faster.	2022-09-26 13:35:43 +10:00
Nicholas Nethercote	5b2075e03d	Remove `TokenTreesReader::bump`. It's an unnecessary layer that obfuscates when I am looking for optimizations.	2022-09-26 13:34:04 +10:00
Nicholas Nethercote	d7928a92e5	Clarify spacing computation. The spacing computation is done in two parts. In the first part `next_token` and `bump` use `Spacing::Alone` to mean "preceded by whitespace" and `Spacing::Joint` to mean the opposite. In the second part `parse_token_tree_other` then adjusts the `spacing` value to mean the usual thing (i.e. "is the following token joinable punctuation?"). This shift in meaning is very confusing and it took me some time to understand what was going on. This commit changes the first part to use a bool, and adds some comments, which makes things much clearer.	2022-09-26 13:21:26 +10:00
Nicholas Nethercote	9640d1c023	Move `#!` checking. Currently does the "is this a `#!` at the start of the file?" check for every single token(!) This commit moves it so it only happens once.	2022-09-26 13:19:14 +10:00
Nicholas Nethercote	14281e6147	Remove unnecessary `spacing` assignment. It has no useful effect.	2022-09-26 08:28:45 +10:00
Nicholas Nethercote	66e9b1149c	Rearrange `TokenTreesReader::parse_token_tree`. `parse_token_tree` is basically a match with four arms: `Eof`, `OpenDelim`, `CloseDelim`, and "other". It has two call sites, and at each call site one of the arms is unreachable. It's also not inlined. This commit removes `parse_token_tree` by splitting it into four functions and inlining them. This avoids some repeated conditional tests and also some non-inlined function calls on the hot path.	2022-09-26 08:28:45 +10:00

1 2 3 4 5

227 Commits