Make CodeGeneration a lot less string-based #1672

ahoppen · 2023-05-16T18:07:01Z

Because of its Python legacy the CodeGeneration tool has been heavily string-based. This PR fixes that at least to some degree.

The main motivating factors here were:

Instead of using strings to specify the node type of a child node, define a SyntaxNodeKind enum that contains a case for each syntax node. This way you can be sure that you’re not referring to a non-existent syntax node (like we did for e.g. InOutToken, which we forgot to change to .token("inout"))
Refactor Node so that it contains two initializers: One for collections and one for layout nodes instead of having one initializer that has a bunch of optional and defaulted arguments.
Change a bunch of properties from returning a String to returning a TypeSyntax or TokenSyntax so we don‘t need to use the raw: interpolation style for them.

In general, the API of Node and SyntaxNodeKind is what I’m happy with now. The other files can still do with some cleanup.

All the changes resulted in nearly no functionality changes of the generated code, so I think this PR doesn’t need a super thorough review, just skimming over it should be sufficient. It’s really repetitive after all.

kimdv

So nice! WUHU! 😍

Have been thinking a lot on doing this, but you were faster.

kimdv · 2023-05-16T18:11:00Z

CodeGeneration/Sources/SyntaxSupport/Child.swift

@@ -123,19 +118,26 @@ public class Child {

  /// Whether this child has syntax kind `UnexpectedNodes`.
  public var isUnexpectedNodes: Bool {
-    syntaxKind == "UnexpectedNodes"
+    switch kind {
+    case .collection(kind: .unexpectedNodes, collectionElementName: _):


Would it make sense to just omit collectionElementName

kimdv · 2023-05-16T18:12:12Z

CodeGeneration/Sources/SyntaxSupport/CommonNodes.swift

    nameForDiagnostics: nil,
-    description: "A CodeBlockItem is any Syntax node that appears on its own line inside a CodeBlock.",
-    kind: "Syntax",
+    documentation: "A CodeBlockItem is any Syntax node that appears on its own line inside a CodeBlock.",


Not part of this, but we could add ticks around CodeBlockItem and CodeBlock (and properly also many other places) to add doc linking

bnbarham · 2023-05-16T22:02:56Z

Sources/SwiftSyntaxBuilder/generated/BuildableCollectionNodes.swift

-/// `AccessorList` represents a collection of `AccessorDeclSyntax`
+/// `AccessorListSyntax` represents a collection of `AccessorDeclSyntax`


These docs don't seem particularly useful to me. I'd expect them on the type itself rather than the extension (and even there... fairly dubious about their value).

I won’t argue with you about that. I hope that we’ll write proper documentation soon and then these will just automatically go away.

Because of its Python legacy the CodeGeneration tool has been heavily string-based. This PR fixes that at least to some degree. The main motivating factors here were: - Instead of using strings to specify the node type of a child node, define a `SyntaxNodeKind` enum that contains a case for each syntax node. This way you can be sure that you’re not referring to a non-existent syntax node (like we did for e.g. `InOutToken`, which we forgot to change to `.token("inout")`) - Refactor `Node` so that it contains two initializers: One for collections and one for layout nodes instead of having one initializer that has a bunch of optional and defaulted arguments. - Change a bunch of properties from returning a `String` to returning a `TypeSyntax` or `TokenSyntax` so we don‘t need to use the `raw:` interpolation style for them. In general, the API of `Node` and `SyntaxNodeKind` is what I’m happy with now. The other files can still do with some cleanup. All the changes resulted in nearly no functionaly changes of the generated code.

ahoppen · 2023-05-25T03:23:16Z

@swift-ci Please test

ahoppen requested review from bnbarham and kimdv May 16, 2023 18:07

kimdv approved these changes May 16, 2023

View reviewed changes

bnbarham approved these changes May 16, 2023

View reviewed changes

ahoppen force-pushed the ahoppen/code-gen-not-string-based branch from e708871 to cf6ff40 Compare May 25, 2023 03:23

ahoppen enabled auto-merge May 25, 2023 03:23

ahoppen merged commit 4c0ad1b into swiftlang:main May 25, 2023

StevenWong12 mentioned this pull request May 26, 2023

Fail to run CodeGeneration #1707

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make CodeGeneration a lot less string-based #1672

Make CodeGeneration a lot less string-based #1672

ahoppen commented May 16, 2023

Uh oh!

kimdv left a comment

Uh oh!

kimdv May 16, 2023

Uh oh!

kimdv May 16, 2023

Uh oh!

bnbarham May 16, 2023

Uh oh!

ahoppen May 16, 2023

Uh oh!

ahoppen commented May 25, 2023

Uh oh!

Uh oh!

		/// `AccessorList` represents a collection of `AccessorDeclSyntax`
		/// `AccessorListSyntax` represents a collection of `AccessorDeclSyntax`

Make CodeGeneration a lot less string-based #1672

Make CodeGeneration a lot less string-based #1672

Conversation

ahoppen commented May 16, 2023

Uh oh!

kimdv left a comment

Choose a reason for hiding this comment

Uh oh!

kimdv May 16, 2023

Choose a reason for hiding this comment

Uh oh!

kimdv May 16, 2023

Choose a reason for hiding this comment

Uh oh!

bnbarham May 16, 2023

Choose a reason for hiding this comment

Uh oh!

ahoppen May 16, 2023

Choose a reason for hiding this comment

Uh oh!

ahoppen commented May 25, 2023

Uh oh!

Uh oh!