Implement "Statements" package #938

jeongsoolee09 · 2025-08-01T19:52:33Z

Description

This PR implements the Statements package.

Change request type

Release or process automation (GitHub workflows, internal scripts)
Internal documentation
External documentation
Query files (.ql, .qll, .qls or unit tests)
External scripts (analysis report or other code shipped as part of a release)

Rules with added or modified queries

No rules added
Queries have been added for the following rules:
- RULE-9-4-2
- RULE-9-5-1
- RULE-9-5-2
Queries have been modified for the following rules:
- rule number here

Release change checklist

A change note (development_handbook.md#change-notes) is required for any pull request which modifies:

The structure or layout of the release artifacts.
The evaluation performance (memory, execution time) of an existing query.
The results of an existing query in any circumstance.

If you are only adding new rule queries, a change note is not required.

Author: Is a change note required?

Yes
No

🚨🚨🚨
Reviewer: Confirm that format of shared queries (not the .qll file, the
.ql file that imports it) is valid by running them within VS Code.

Confirmed

Reviewer: Confirm that either a change note is not required or the change note is required and has been added.

Confirmed

Query development review checklist

For PRs that add new queries or modify existing queries, the following checklist should be completed by both the author and reviewer:

Author

Have all the relevant rule package description files been checked in?
Have you verified that the metadata properties of each new query are set appropriately?
Do all the unit tests contain both "COMPLIANT" and "NON_COMPLIANT" cases?
Are the alert messages properly formatted and consistent with the style guide?
Have you run the queries on OpenPilot and verified that the performance and results are acceptable?
As a rule of thumb, predicates specific to the query should take no more than 1 minute, and for simple queries be under 10 seconds. If this is not the case, this should be highlighted and agreed in the code review process.
Does the query have an appropriate level of in-query comments/documentation?
Have you considered/identified possible edge cases?
Does the query not reinvent features in the standard library?
Can the query be simplified further (not golfed!)?

Reviewer

Have all the relevant rule package description files been checked in?
Have you verified that the metadata properties of each new query are set appropriately?
Do all the unit tests contain both "COMPLIANT" and "NON_COMPLIANT" cases?
Are the alert messages properly formatted and consistent with the style guide?
Have you run the queries on OpenPilot and verified that the performance and results are acceptable?
As a rule of thumb, predicates specific to the query should take no more than 1 minute, and for simple queries be under 10 seconds. If this is not the case, this should be highlighted and agreed in the code review process.
Does the query have an appropriate level of in-query comments/documentation?
Have you considered/identified possible edge cases?
Does the query not reinvent features in the standard library?
Can the query be simplified further (not golfed!)?


MichaelRFairhurst

This is really coming along and looking really good!!

MichaelRFairhurst · 2025-09-24T04:03:32Z

cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql

+ * to a non-const reference variable (thus constituting a `T` -> `&T` conversion.), i.e.
+ * initialization and assignment.
+ */
+/*


Simple comment formatting, unnecessary split

Good call. The intention was to split the documentation and the meta-level comment (explaining how this predicate came to be). But like you said it can be disconnected easily, so I'll merge the meta-level comment into the docstring first.

Addressed in c8c0770.

Somehow this change didn't make it into c8c0770; it landed in a more recent commit.

MichaelRFairhurst · 2025-09-24T04:21:44Z

cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql

+predicate loopVariableAssignedToNonConstPointerOrReferenceType(
+  ForStmt forLoop, VariableAccess loopVariableAccessInCondition
+) {
+  exists(Expr assignmentRhs, DerivedType targetType |


Likely want to test that this works for an int * const x:

#include <iostream>

void f(int * const x) { (*x)++; }

int main() {
  for (int i = 0; i < 10; ++i) {
    f(&i);
    std::cout << i << std::endl;
  }
}

I believe what will happen is that int * const x will be a DerivedType of type SpecifiedType with a const specifier. A SpecifiedType is not instanceof PointerType or instanceof ReferenceType and so this predicate will not hold, even though the value of i is modifiable within f.

You may also have problems with typedefs, such as typedef int *int_ptr_t for the same reason.

The solution here I believe will be to call .getUnderlyingType(). Another option frequently used for this is .stripSpecifiers(). Each of these will remove the const and resolve the typedef. I think .stripSpecifiers() may remove the const in const int*, though, which would make it unsuitable here.
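To make the concern concrete, here is a minimal C++ sketch showing that a top-level `const` on a pointer parameter, with or without a typedef, still lets the callee write through to the loop counter. It is not taken from the PR's test suite, and the names `bump`, `bumpViaTypedef`, and `countIterations` are hypothetical.

```cpp
#include <cassert>
#include <type_traits>

using int_ptr_t = int *;  // the typedef case mentioned above

// Top-level const only: the pointer cannot be reseated, but *x is writable.
void bump(int *const x) { (*x)++; }

// `const int_ptr_t` is `int *const`, not `const int *`.
void bumpViaTypedef(const int_ptr_t x) { (*x)++; }

static_assert(std::is_same<const int_ptr_t, int *const>::value,
              "const applies to the pointer, not the pointee");
static_assert(!std::is_const<std::remove_pointer<int *const>::type>::value,
              "pointee of int *const is mutable");
static_assert(std::is_const<std::remove_pointer<const int *>::type>::value,
              "pointee of const int * is not mutable");

// Counts iterations of a loop whose counter is also bumped by the callee.
int countIterations(void (*mutate)(int *)) {
  int iterations = 0;
  for (int i = 0; i < 10; ++i) {
    mutate(&i);  // together with ++i, the counter advances by 2 per iteration
    ++iterations;
  }
  return iterations;
}
```

Both `countIterations(bump)` and `countIterations(bumpViaTypedef)` run the body five times instead of ten, which is the kind of hidden counter mutation the predicate should flag.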

You're right; the predicate does not catch this example. 🤔 I guess a clever use of isDeeplyConst, isDeeplyConstBelow, or both will do the trick.

Forgetting to handle typedefs or meaningless consts is a very common bug. But you'll (mostly) get in the habit soon enough of always calling one of these four member predicates on the Types you handle in your queries:

getUnderlyingType()

resolveTypedefs()

stripSpecifiers()

stripTopLevelSpecifiers()

Each one does subtly different things.

In this case, I believe the fix is to do:

exists(..., Type targetType, DerivedType strippedType |
  isAssignment(assignmentRhs, targetType, _) and
  strippedType = targetType.stripTopLevelSpecifiers() and
  not strippedType.getBaseType().isConst() and
  (
    strippedType instanceof PointerType or
    strippedType instanceof ReferenceType
  )

The documentation for stripTopLevelSpecifiers says:

Get this type after any top-level specifiers and typedefs have been stripped.

For example, starting with const i64* const, this predicate will return const i64*.

which is actually wrong, as it ignores the fact that i64 is a TypedefType, so it actually will result in const long long*. Which is what you want!

The TLDR of the other options:

getUnderlyingType() -- resolves TypedefTypes and DeclTypes, but won't drop the outer specifier in const i64* const. Stops at the first non-TypedefType/non-DeclType.

stripType() -- resolves all typedefs and decltypes and removes all const/volatile specifiers recursively all the way down the type chain -- not what you want.

resolveTypedefs -- resolves all typedefs and decltypes all the way down the type chain without removing const or volatile specifiers. That would handle typedefs but not int const *.

Note that these predicates can have no result. Only a limited set of types are in the database, and these operations just assume that the type you want is one of those types. resolveTypedefs is also bugged and doesn't recurse into ArrayType.
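As a rough analogy in standard C++ (not CodeQL), `std::remove_cv` also strips only the outermost qualifiers, so it can illustrate the `const i64* const` to `const long long*` example quoted above. The sketch assumes `i64` is a typedef for `long long`; the alias and function names are hypothetical.

```cpp
#include <cassert>
#include <type_traits>

using i64 = long long;  // assumption: i64 is a typedef for long long

using Original = const i64 *const;

// std::remove_cv removes only the outermost const/volatile.
using TopLevelStripped = std::remove_cv<Original>::type;

static_assert(std::is_same<TopLevelStripped, const long long *>::value,
              "the typedef is transparent and only the outer const is removed");

// After top-level stripping, the pointee is still const.
constexpr bool pointeeStillConst() {
  return std::is_const<std::remove_pointer<TopLevelStripped>::type>::value;
}

// After top-level stripping, the pointer itself is no longer const.
constexpr bool pointerStillConst() {
  return std::is_const<TopLevelStripped>::value;
}
```

This is the property the suggested fix relies on: the outer `const` disappears, while the const-ness of the base type remains available to inspect.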

Thank you for the detailed breakdown of the related predicates. What I want to express here is definitely "The type we get after we strip all the typedefs and the specifiers is const". I've come to believe stripTopLevelSpecifiers is the one I should use, and swapped the portion with your suggestion.

I also patched an equivalent part in loopVariablePassedAsArgumentToNonConstReferenceParameter, in 7d5f08b.

cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql

MichaelRFairhurst · 2025-09-24T04:54:48Z

cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql

+      loopCounterType = forLoopCondition.getLoopCounter().getType() and
+      loopBoundType = forLoopCondition.getLoopBound().getType()
+    |
+      loopCounterType.getSize() < loopBoundType.getSize()


Two missed cases here:

Mixing signed/unsigned types, they may have the same size but they'll hold different ranges.

The type and runtime value may lead to different conclusions.

I think you may be able to get away with upperBound(loopCounter) < upperBound(loopBound). That would handle signedness, constants (like x < 10ull), and dynamic ranges (like unsigned long long bound = 10; ... x < bound).

Also almost forgot

Another trap case is that when doing upperBound(e) / lowerBound(e) you usually want upperBound(e.getFullyConverted()). Because conversions on e will change the bound.
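The points about signedness, runtime values, and conversions can be sketched in plain C++. This is only an illustration of why comparing sizes is not enough; it does not use the range-analysis library, and all function names are hypothetical.

```cpp
#include <cassert>
#include <limits>

// int and unsigned int have the same size...
bool sameSize() { return sizeof(int) == sizeof(unsigned int); }

// ...but the signed maximum is below the unsigned maximum.
bool signedMaxBelowUnsignedMax() {
  return static_cast<unsigned int>(std::numeric_limits<int>::max()) <
         std::numeric_limits<unsigned int>::max();
}

// Mirrors the implicit conversion applied to `counter < bound` when the
// bound is unsigned: the counter is converted to unsigned before comparing.
bool comparesBelowAfterConversion(int counter, unsigned int bound) {
  return static_cast<unsigned int>(counter) < bound;
}

// The type of the bound may be wide while its value is small: what matters
// is whether the value fits in the counter type.
bool unsignedCharCanReach(unsigned long long bound) {
  return bound <= std::numeric_limits<unsigned char>::max();
}
```

For example, `comparesBelowAfterConversion(-1, 10u)` is false because `-1` converts to the largest `unsigned int`, and `unsignedCharCanReach(10ull)` is true even though the bound's type is wider than the counter's.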

This eliminated a lot of false positives where the counter variable is int and the loop bound is size_t. Thank you!

Changed the upperBound(loopCounter.getFullyConverted()) to typeUpperBound(loopCounter.getType()). As typeUpperBound resolves references, I didn't have to use getBaseType() on it.

cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql

We are interested if the underlying *data* can be mutated, not the pointer itself. Also, the surface type may be a typedef, so resolve that as well.


Both `TLoopBoundIsMutatedVariableAccess` and `TLoopStepIsMutatedVariableAccess` transitively rely on `valueToUpdate`, which overapproximates by looking at the types alone. Therefore we'd like to drop the confidence slightly in reporting the expression where the expression *might* have been changed.


Copilot

Pull Request Overview

This PR implements the "Statements" package for the MISRA C++-2023 coding standards, adding three new query rules for analyzing statement structures in C++ code.

Added rule implementations for RULE-9-4-2, RULE-9-5-1, and RULE-9-5-2
Added comprehensive test files with both compliant and non-compliant examples
Created supporting library code for analyzing increment operations and loop conditions

Reviewed Changes

Copilot reviewed 17 out of 17 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
rule_packages/cpp/Statements.json	Package configuration defining metadata and properties for the three new statement rules
cpp/misra/src/rules/RULE-9-4-2/AppropriateStructureOfSwitchStatement.ql	Query implementation to check proper switch statement structure
cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql	Query implementation to enforce simple legacy for-loop patterns
cpp/misra/src/rules/RULE-9-5-2/ForRangeInitializerAtMostOneFunctionCall.ql	Query implementation to limit function calls in range-based for initializers
cpp/misra/test/rules/RULE-9-/	Test files and expected results for all three rules
cpp/common/src/codingstandards/cpp/exclusions/cpp/Statements.qll	Auto-generated exclusion metadata for the new package
cpp/common/src/codingstandards/cpp/exclusions/cpp/RuleMetadata.qll	Updated metadata registry to include Statements package
cpp/common/src/codingstandards/cpp/ast/Increment.qll	New library for analyzing increment/decrement operations
cpp/common/src/codingstandards/cpp/Loops.qll	Extended loop analysis with LegacyForLoopCondition class

Comments suppressed due to low confidence (1)

rule_packages/cpp/Statements.json:1

Fixed typo 'that that' should be 'that'.


cpp/misra/test/rules/RULE-9-5-2/test.cpp

cpp/misra/src/rules/RULE-9-4-2/AppropriateStructureOfSwitchStatement.ql


MichaelRFairhurst

This is looking really good. Every time I go through the code again, I'm really impressed with the overall organization and clarity. Nicely done!

Let me know if these next couple suggestions are unclear, we're so close! :)

MichaelRFairhurst · 2025-10-09T22:36:52Z

cpp/misra/src/rules/RULE-9-5-1/LegacyForStatementsShouldBeSimple.ql

+  /* 3-1. The loop counter is mutated somewhere other than its update expression. */
+  TLoopCounterMutatedInLoopBody(ForStmt forLoop, Variable loopCounterVariable) {
+    loopCounterVariable = getDeclaredVariableInForLoop(forLoop) and
+    variableModifiedInExpression(forLoop.getStmt().getChildStmt().getAChild*(),


Do we need to worry about updates in the condition?

for(int i = 0; i++ < 10;) {...}

I changed the case body to better match the comment /* 3-1. ... */, it now finds all cases where the mutating expression is not in the update expression (25e29e4).
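A small C++ sketch of the loop from the question, showing what the body observes when the counter is incremented inside the condition. The function name `valuesSeenInBody` is hypothetical.

```cpp
#include <cassert>
#include <vector>

// Records the counter values visible inside the loop body.
std::vector<int> valuesSeenInBody() {
  std::vector<int> seen;
  for (int i = 0; i++ < 10;) {
    // The comparison uses the old value of i; the body sees the new one.
    seen.push_back(i);
  }
  return seen;
}
```

The body runs ten times but sees the values 1 through 10 rather than 0 through 9, even though the update expression is empty.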

MichaelRFairhurst · 2025-10-09T22:54:39Z

cpp/misra/test/rules/RULE-9-5-1/test.cpp

+                                // anywhere in the loop
+  }
+
+  for (int i = 0; i < j; i++) { // COMPLIANT: The loop bound `j` is not mutated


Case to test: for (int i = 0; i < j++; i++), and maybe for (int i = 0; i++ < j++; i++) while we're at it.

Sounds good. Two alerts should appear in the second example:

The loop bound is mutated (the rule in question here).

The loop counter is mutated outside of the update expression.

Added in 0f998ea and covered in 01216fa.
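For intuition on why both alerts matter in the second example, the following sketch traces `for (...; i++ < j++; i++)`. The counter is declared outside the loop only so that its final value can be inspected; `LoopTrace` and `traceDoubleMutation` are hypothetical names.

```cpp
#include <cassert>

struct LoopTrace {
  int iterations;
  int finalCounter;
  int finalBound;
};

// Runs the doubly-mutating loop and reports what happened.
LoopTrace traceDoubleMutation(int j) {
  int i = 0;
  int iterations = 0;
  for (i = 0; i++ < j++; i++) {
    // Per iteration, i advances by 2 (condition + update) and j by 1.
    ++iterations;
  }
  return LoopTrace{iterations, i, j};
}
```

Starting from `j = 5`, the loop terminates after five iterations only because the counter outpaces the moving bound, which is hard to see from the loop header alone.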

MichaelRFairhurst · 2025-10-09T23:11:08Z

cpp/common/src/codingstandards/cpp/Loops.qll

+    exists(Expr loopCounterExpr |
+      loopCounterExpr = this.getAnOperand() and
+      loopBound = this.getAnOperand() and
+      loopCounter = loopCounterExpr.getAChild*() and


getAChild*() is the right tool here, but it must be coupled with an allow-list or we'll have false negatives, because it still casts a very wide net.

We should ensure (either here, or reported as an error in the query) that loopCounterExpr is not any arbitrary type of expression.

The following should be non-compliant:

for (int i = 0; f(i) < 10; ++i) {}
for (int i = 0; i * i < 10; ++i) {}
for (int i = 0; i + f() < 10; ++i) {}
for (int i = 0; (i > other_var) < 1; ++i) {}
// etc

Basically, we probably just want an allow-list where every expr from loopCounterExpr to loopCounter is either loopCounter itself or an addition/subtraction with only constant values on one side and an allow-listed expression on the other.

for (int i = 0; i + 10 < 20; ++i) {} // OK, `i` is allowed and `ALLOWED + 10` is allowed
for (int i = 0; 10 - i < 20; ++i) {} // OK, `i` is allowed and `10 - ALLOWED` is allowed
for (int i = 0; -i < 20; ++i) {} // OK, `i` is allowed and `-ALLOWED` is allowed
for (int i = 0; -i + 10 < 20; ++i) {} // OK, `i` is allowed and `-ALLOWED` is allowed
for (int i = 0; (i + 5) + 3 < 20; ++i) {} // OK, `i` is allowed, `ALLOWED + 5`, and `ALLOWED + 3` is allowed
for (int i = 0; i + (5 + 3) < 20; ++i) {} // OK, `i` is allowed, `ALLOWED + (5 + 3)` is allowed
for (int i = 0; i + i < 20; ++i) {} // BAD, `i` is allowed but `ALLOWED + ALLOWED` is not allowed
for (int i = 0; i + j < 20; ++j) {} // BAD, `j` is not allowed
for (int i = 0; (i + 10) + (i + 10) < 20; ++i) {} // BAD, `i` and `ALLOWED + 10` is allowed, but `ALLOWED + ALLOWED` is not allowed

Hopefully that mostly makes sense.
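One way to read the allow-list idea is as a recursive check over the expression tree. The sketch below models it in C++ on a made-up miniature AST; it is not CodeQL and not the PR's implementation, and every type and function name here (`Expr`, `isConstantOnly`, `isAllowed`, and the builder helpers) is hypothetical.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical miniature expression tree, only imitating the problem's shape.
struct Expr;
using ExprPtr = std::shared_ptr<Expr>;

struct Expr {
  enum Kind { Var, Const, Neg, Add, Sub, Other } kind;
  std::string name;  // meaningful only for Var
  ExprPtr lhs, rhs;  // Neg uses lhs only
};

ExprPtr make(Expr::Kind k, std::string n = "", ExprPtr l = nullptr,
             ExprPtr r = nullptr) {
  return std::make_shared<Expr>(
      Expr{k, std::move(n), std::move(l), std::move(r)});
}
ExprPtr var(const std::string &n) { return make(Expr::Var, n); }
ExprPtr lit() { return make(Expr::Const); }
ExprPtr neg(ExprPtr e) { return make(Expr::Neg, "", std::move(e)); }
ExprPtr add(ExprPtr l, ExprPtr r) {
  return make(Expr::Add, "", std::move(l), std::move(r));
}
ExprPtr sub(ExprPtr l, ExprPtr r) {
  return make(Expr::Sub, "", std::move(l), std::move(r));
}
// Anything else: calls, multiplication, comparisons, and so on.
ExprPtr other(ExprPtr l, ExprPtr r = nullptr) {
  return make(Expr::Other, "", std::move(l), std::move(r));
}

// True if the expression is built only from constants, +, -, and unary minus.
bool isConstantOnly(const ExprPtr &e) {
  switch (e->kind) {
    case Expr::Const: return true;
    case Expr::Neg: return isConstantOnly(e->lhs);
    case Expr::Add:
    case Expr::Sub:
      return isConstantOnly(e->lhs) && isConstantOnly(e->rhs);
    default: return false;
  }
}

// True if e is the counter, a negation of an allowed expression, or an
// addition/subtraction with an allowed expression on one side and only
// constants on the other.
bool isAllowed(const ExprPtr &e, const std::string &counter) {
  switch (e->kind) {
    case Expr::Var: return e->name == counter;
    case Expr::Neg: return isAllowed(e->lhs, counter);
    case Expr::Add:
    case Expr::Sub:
      return (isAllowed(e->lhs, counter) && isConstantOnly(e->rhs)) ||
             (isConstantOnly(e->lhs) && isAllowed(e->rhs, counter));
    default: return false;
  }
}
```

With counter `i`, `isAllowed(add(var("i"), lit()), "i")` accepts `i + 10`, while `add(var("i"), var("i"))` and `other(var("i"), var("i"))` (standing in for `i * i`) are rejected, matching the OK/BAD labels in the examples above.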

MichaelRFairhurst · 2025-10-09T23:13:00Z

cpp/misra/test/rules/RULE-9-5-1/test.cpp

+
+  for (int i = j; i < k; i += l) { // COMPLIANT: The loop step is taken as
+                                   // a const pointer
+    const int *m = &l;


Add tests that int * const m is non compliant

Added in 2aed0f1.

This refined definition can handle more cases than the previous one that only looked into the loop body, and better matches the description in the comment above.


jeongsoolee09 added 15 commits July 30, 2025 18:43

Generate query files for "Statements" package

c8f559c

Add agent-generated first draft

ffcb432

Add some test cases

221b9b2

Finish first draft

fabb7e5

Add test case for Rule 9.5.2

6909528

Minor

0b5250f

Update test case for Rule 9.5.2

b239862

Add first draft of ForRangeInitializerAtMostOneFunctionCall

a013388

Add test cases

1c623f4

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Statements

b792897

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Statements

0d38cd7

Update unit test

c2cbed3

Add more cases and more vocabularies to reason about them

b7ae38e

Fix labeling of two cases

86aaa0e

Add cases of addresses taken as part of non-const declaration or expr…

49bdc07

…essions

jeongsoolee09 self-assigned this Sep 8, 2025

jeongsoolee09 requested a review from MichaelRFairhurst September 8, 2025 20:19

jeongsoolee09 added 11 commits September 9, 2025 14:55

Finish first draft

01e3276

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Statements

9cab1f5

Finish first draft

2f6fc3d

Tidy up, refine a bit more, add a series of test cases

999e870

Add two more cases

562c7be

Add QLDocs to two helper predicates

6855e6a

Introduce from variables and fix logical operator association

5e24d1b

Introduce newtype

18daff7

Split cases 5-1 and 5-2

55b8476

Debug 5-1-2 and 5-2-2 not being reported

662f51f

Add LegacyForLoopUpdateExpression and test cases

a653b66

MichaelRFairhurst reviewed Sep 24, 2025


Separate out helper classes into libraries

c8c0770

jeongsoolee09 added 2 commits September 24, 2025 15:28

Finish draft of LegacyForStatementsShouldBeSimple

2227255

Decouple ForStmt from Increment.qll and rewrite getLoopStepOfForStmt

38b5fbc

jeongsoolee09 requested a review from MichaelRFairhurst September 29, 2025 22:41

jeongsoolee09 added 9 commits September 29, 2025 18:46

Update expected result of RULE-9-5-1

23aa711

Update expected results of RULE-9-5-2

f0b53e2

Count in typedefs and cv-qualifiers

7d5f08b

We are interested if the underlying *data* can be mutated, not the pointer itself. Also, the surface type may be a typedef, so resolve that as well.

Fix cross-join issue in `TLoopCounterUpdatedNotByCrementOrAddSubAssig…

e135fe6

…nmentExpr`

Make the loop counter detection more relaxed

82f9adc

Refine LegacyForLoopCondition

2116400

Change phrasing of message from TNoRelationalOperatorInLoopCondition

1f8083a

Use upperbound/0 and getFullyConverted/0 to more precisely infer …

1355eff

…sizes

jeongsoolee09 marked this pull request as ready for review October 8, 2025 23:56

Copilot AI review requested due to automatic review settings October 8, 2025 23:56

Copilot AI reviewed Oct 8, 2025


cpp/misra/test/rules/RULE-9-5-2/test.cpp (outdated, resolved)

cpp/misra/src/rules/RULE-9-4-2/AppropriateStructureOfSwitchStatement.ql (outdated, resolved)

jeongsoolee09 and others added 7 commits October 9, 2025 14:41

Apply suggestion from @Copilot

3ad5ef2

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Apply suggestion from @Copilot

fb20065

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Reformat test cases of 9-4-2 and 9-5-2

bce38a0

Merge branch 'jeongsoolee09/MISRA-C++-2023-Statements' of github.com:…

32c36fe

…github/codeql-coding-standards into jeongsoolee09/MISRA-C++-2023-Statements

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Statements

c0a8253

Update expected results coming from change in message

59a4096

Use different range predicate and update tests

d37bc70

MichaelRFairhurst requested changes Oct 9, 2025


jeongsoolee09 added 5 commits October 10, 2025 18:01

Refine TLoopCounterMutatedInLoopBody

25e29e4

This refined definition can handle more cases than the previous one that only looked into the loop body, and better matches the description in the comment above.

Add more candidate exprs and remove duplicate reportings on compound …

01216fa

…mutating exprs

Add more test cases

0f998ea

Fix loop counter -> loop bound

87a5158

Add const-but-mutable pointer examples

2aed0f1
