Use serde for target spec json deserialize #144218

Noratrieb · 2025-07-20T13:18:40Z

The previous manual parsing of serde_json::Value was a lot of complicated code and extremely error-prone. It was full of janky behavior like sometimes ignoring type errors, sometimes erroring for type errors, sometimes warning for type errors, and sometimes just ICEing for type errors (the icing on the top).

Additionally, many of the error messages about allowed values were out of date because they were in a completely different place than the FromStr impls. Overall, the system caused confusion for users.

I also found the old deserialization code annoying to read. Whenever a key! invocation was found, one had to first look for the right macro arm, and no go to definition could help.

This PR replaces all this manual parsing with a 2-step process involving serde.
First, the string is parsed into a TargetSpecJson struct. This struct is a 1:1 representation of the spec JSON. It already parses all the enums and is very simple to read and write.
Then, the fields from this struct are copied into the actual Target. The reason for this two-step process instead of just serializing into a Target is because of a few reasons

There are a few transformations performed between the two formats
The default logic is implemented this way. Otherwise all the default field values would have to be spelled out again, which is suboptimal. With this logic, they fall out naturally, because everything in the json struct is an Option.

Overall, the mapping is pretty simple, with the vast majority of fields just doing a 1:1 mapping that is captured by two macros. I have deliberately avoided making the macros generic to keep them simple.

All the FromStr impls now have the error message right inside them, which increases the chance of it being up to date. Some "from_str" impls were turned into proper FromStr impls to support this.

The new code is much less involved, delegating all the JSON parsing logic to serde, without any manual type matching.

This change introduces a few breaking changes for consumers. While it is possible to use this format on stable, it is very much subject to change, so breaking changes are expected. The hope is also that because of the way stricter behavior, breaking changes are easier to deal with, as they come with clearer error messages.

Invalid types now always error, everywhere. Previously, they would sometimes error, and sometimes just be ignored (which meant the users JSON was still broken, just silently!)
This now makes use of deny_unknown_fields instead of just warning on unused fields, which was done previously. Serde doesn't make it easy to get such warning behavior, which was the primary reason that this now changed. But I think error behavior is very reasonable too. If someone has random stale fields in their JSON, it is likely because these fields did something at some point but no longer do, and the user likely wants to be informed of this so they can figure out what to do.

This is also relevant for the future. If we remove a field but someone has it set, it probably makes sense for them to take a look whether they need this and should look for alternatives, or whether they can just delete it. Overall, the JSON is made more explicit.

This is the only expected breakage, but there could also be small breakage from small mistakes. All targets roundtrip though, so it can't be anything too major.

fixes #144153

rustbot · 2025-07-20T13:18:45Z

r? @fee1-dead

rustbot has assigned @fee1-dead.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

rustbot · 2025-07-20T13:18:48Z

This PR modifies run-make tests.

cc @jieyouxu

These commits modify the Cargo.lock file. Unintentional changes to Cargo.lock can be introduced when switching branches and rebasing PRs.

If this was unintentional then you should revert the changes before this PR is merged.
Otherwise, you can ignore this comment.

These commits modify compiler targets.
(See the Target Tier Policy.)

The list of allowed third-party dependencies may have been modified! You must ensure that any new dependencies have compatible licenses before merging.

cc @davidtwco, @wesleywiser

There are changes to the tidy tool.

cc @jieyouxu

### What does this PR try to resolve? This is not necessary, as 32 is the default, and actually of the wrong type now since it's a number now. When planning to make these type mismatches error in rust-lang/rust#144218, cargo would fail here, so I just removed it. This custom target test very much shows how Cargo should be a subtree, in this case it was fine because there's a compatible fix that I can push now, otherwise it would have been very annoying. ### How to test and review this PR? If the test suite passes, it works

compiler/rustc_target/src/spec/mod.rs

Noratrieb · 2025-07-20T18:45:46Z

I am finding a lot of target json specs with values that no longer exist, even an entire test case based on the assumption that it exists while it doesn't. I am getting a lot more confidence in the error behavior thanks to this.

The previous manual parsing of `serde_json::Value` was a lot of complicated code and extremely error-prone. It was full of janky behavior like sometimes ignoring type errors, sometimes erroring for type errors, sometimes warning for type errors, and sometimes just ICEing for type errors (the icing on the top). Additionally, many of the error messages about allowed values were out of date because they were in a completely different place than the FromStr impls. Overall, the system caused confusion for users. I also found the old deserialization code annoying to read. Whenever a `key!` invocation was found, one had to first look for the right macro arm, and no go to definition could help. This PR replaces all this manual parsing with a 2-step process involving serde. First, the string is parsed into a `TargetSpecJson` struct. This struct is a 1:1 representation of the spec JSON. It already parses all the enums and is very simple to read and write. Then, the fields from this struct are copied into the actual `Target`. The reason for this two-step process instead of just serializing into a `Target` is because of a few reasons 1. There are a few transformations performed between the two formats 2. The default logic is implemented this way. Otherwise all the default field values would have to be spelled out again, which is suboptimal. With this logic, they fall out naturally, because everything in the json struct is an `Option`. Overall, the mapping is pretty simple, with the vast majority of fields just doing a 1:1 mapping that is captured by two macros. I have deliberately avoided making the macros generic to keep them simple. All the `FromStr` impls now have the error message right inside them, which increases the chance of it being up to date. Some "`from_str`" impls were turned into proper `FromStr` impls to support this. The new code is much less involved, delegating all the JSON parsing logic to serde, without any manual type matching. This change introduces a few breaking changes for consumers. While it is possible to use this format on stable, it is very much subject to change, so breaking changes are expected. The hope is also that because of the way stricter behavior, breaking changes are easier to deal with, as they come with clearer error messages. 1. Invalid types now always error, everywhere. Previously, they would sometimes error, and sometimes just be ignored (which meant the users JSON was still broken, just silently!) 2. This now makes use of `deny_unknown_fields` instead of just warning on unused fields, which was done previously. Serde doesn't make it easy to get such warning behavior, which was the primary reason that this now changed. But I think error behavior is very reasonable too. If someone has random stale fields in their JSON, it is likely because these fields did something at some point but no longer do, and the user likely wants to be informed of this so they can figure out what to do. This is also relevant for the future. If we remove a field but someone has it set, it probably makes sense for them to take a look whether they need this and should look for alternatives, or whether they can just delete it. Overall, the JSON is made more explicit. This is the only expected breakage, but there could also be small breakage from small mistakes. All targets roundtrip though, so it can't be anything too major.

aapoalas

LGTM :)

aapoalas · 2025-07-20T20:24:54Z

compiler/rustc_target/src/spec/mod.rs

            "darwin" => LldFlavor::Ld64,
            "gnu" => LldFlavor::Ld,
            "link" => LldFlavor::Link,
            "wasm" => LldFlavor::Wasm,
-            _ => return None,
+            _ => {
+                return Err(format!(


nitpick: Unnecessary use of a macro; did you perhaps mean to include '{s}' in the error output?

Also, would it be better if the lld flavor and linker flavor error messages matched one another? This one says "expected one of ...while the other saysallowed values: ...`.

aapoalas · 2025-07-20T20:33:20Z

compiler/rustc_target/src/spec/mod.rs

    fn from_str(s: &str) -> Result<Self, Self::Err> {
        match s {
            "coff" => Ok(Self::Coff),
            "elf" => Ok(Self::Elf),
            "mach-o" => Ok(Self::MachO),
            "wasm" => Ok(Self::Wasm),
            "xcoff" => Ok(Self::Xcoff),
-            _ => Err(()),
+            _ => Err(format!(


thought: The error formats are pretty repetitive; I wonder if a few helper methods wouldn't perhaps be an improvement? Something like

#[inline(never)] pub(crate) fn s_is_not_a_valid_value_for(s: &str, field: &str, variants: &[&str]) -> String { format!("'{s}' is not a valid value for {field}. Use {}", format_to_list(variants)) }

and then the same for the "use command to print allowed values" etc.

This'd probably also reduce the slight drift between the error messages; some say "for field. Use 'a', 'b', or 'c'." while others say `for field, expected 'a', 'b', or 'c'" or such.

ultrabear · 2025-07-20T21:05:57Z

compiler/rustc_target/Cargo.toml

@@ -12,7 +12,9 @@ rustc_fs_util = { path = "../rustc_fs_util" }
 rustc_macros = { path = "../rustc_macros" }
 rustc_serialize = { path = "../rustc_serialize" }
 rustc_span = { path = "../rustc_span" }
+serde = { version = "1.0.219", features = ["derive"] }


using serde with the derive feature causes a hard dependency on syn/procmacro2 that leads to everything that uses serde but doesn't use derive needing to wait on these expensive crates to build
The current suggested way to pull serde derive (to increase build parallelization) is to just pull the derive crate separately and use serde_derive::Deseralize etc in code

Suggested change

serde = { version = "1.0.219", features = ["derive"] }

serde = "1.0.219"

serde_derive = "1.0.219"

rustbot assigned fee1-dead Jul 20, 2025

Noratrieb mentioned this pull request Jul 20, 2025

Remove unnecessary target-c-int-width from target specs rust-lang/cargo#15759

Merged

This comment has been minimized.

Sign in to view

Noratrieb force-pushed the target-spec-json-de-jank branch from a829315 to 949f059 Compare July 20, 2025 13:37

This comment has been minimized.

Sign in to view

Noratrieb force-pushed the target-spec-json-de-jank branch from 949f059 to 47312aa Compare July 20, 2025 16:29

This comment has been minimized.

Sign in to view

klensy reviewed Jul 20, 2025

View reviewed changes

compiler/rustc_target/src/spec/mod.rs Outdated Show resolved Hide resolved

Noratrieb force-pushed the target-spec-json-de-jank branch from 47312aa to ae949f5 Compare July 20, 2025 18:41

This comment has been minimized.

Sign in to view

Noratrieb force-pushed the target-spec-json-de-jank branch from ae949f5 to babcbc3 Compare July 20, 2025 20:35

aapoalas approved these changes Jul 20, 2025

View reviewed changes

ultrabear reviewed Jul 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use serde for target spec json deserialize #144218

Use serde for target spec json deserialize #144218

Noratrieb commented Jul 20, 2025 •

edited by rustbot

Loading

Uh oh!

rustbot commented Jul 20, 2025

Uh oh!

rustbot commented Jul 20, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Uh oh!

Noratrieb commented Jul 20, 2025

Uh oh!

This comment has been minimized.

aapoalas left a comment

Uh oh!

aapoalas Jul 20, 2025

Uh oh!

aapoalas Jul 20, 2025

Uh oh!

ultrabear Jul 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

	serde = { version = "1.0.219", features = ["derive"] }
	serde = "1.0.219"
	serde_derive = "1.0.219"

Use serde for target spec json deserialize #144218

Are you sure you want to change the base?

Use serde for target spec json deserialize #144218

Conversation

Noratrieb commented Jul 20, 2025 • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rustbot commented Jul 20, 2025

Uh oh!

rustbot commented Jul 20, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Uh oh!

Noratrieb commented Jul 20, 2025

Uh oh!

This comment has been minimized.

aapoalas left a comment

Choose a reason for hiding this comment

Uh oh!

aapoalas Jul 20, 2025

Choose a reason for hiding this comment

Uh oh!

aapoalas Jul 20, 2025

Choose a reason for hiding this comment

Uh oh!

ultrabear Jul 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Noratrieb commented Jul 20, 2025 •

edited by rustbot

Loading

ultrabear Jul 20, 2025 •

edited

Loading