C#: Restrict dataflow node creation to source and source-referenced entities #17482

smowton · 2024-09-16T15:17:28Z

This addresses a problem observed when libraries define a very large number of types with large class hierarchies, for example as part of generated code. Since user code is very unlikely to refer to all or even most of them, by excluding such code from dataflow node generation and therefore from the C# dataflow hooks' idea of a "relevant" type, we can make many of the type predicates defined in DataFlowPrivate.qll very much cheaper.

#17483 is a variation on this PR that additionally considers virtual dispatch targets and accessed (but not called) callables as being used in source, and therefore meriting dataflow node creation.

aschackmull · 2024-09-17T08:59:24Z

csharp/ql/lib/semmle/code/csharp/dataflow/internal/DataFlowPrivate.qll

+    TExplicitParameterNode(Parameter p, DataFlowCallable c) {
+      p = c.asCallable(_).(CallableUsedInSource).getAParameter()


Java simply uses the constraint exists(p.getCallable().getBody()). Would that be enough in this case?

Suggested change

TExplicitParameterNode(Parameter p, DataFlowCallable c) {

p = c.asCallable(_).(CallableUsedInSource).getAParameter()

TExplicitParameterNode(Parameter p, DataFlowCallable c) {

exists(Callable c0 | c0 = c.asCallable(_) and p = c0.getAParameter() and exists(c0.getBody()))

I've refined this further at #17483 -- my observations so far:

There are methods with no body but which are from source where we probably ought to include parameter nodes, e.g. abstract and interface methods

There are also methods that have a body but for which fromSource doesn't hold -- synthetic methods, so far as I can tell, such as default constructors.

Therefore I've included both criteria for now.

There are methods with no body but which are from source where we probably ought to include parameter nodes, e.g. abstract and interface methods

Why? What use are they? Those parameter nodes cannot flow to anything.

They indeed cannot; my concern was whether interface or abstract method parameter nodes might be relevant to models, either MaD-written or implemented in QL? The short answer is I don't know, but the cost of keeping them is probably small since they're bounded by the size of the source code, and giving nodes as-is to everything present in source is the least surprising route for any third-party code we don't get to exercise via DCA and QA.

smowton · 2024-09-17T15:21:07Z

I'm leaning towards variant #17483 since it plays things safer (creating more nodes than this variant) but still seems to have significantly beneficial performance consequences. I've now started QA, linked from that PR.

smowton · 2024-09-18T19:04:50Z

Variant #17483 has now passed DCA and QA as noted on that PR -- I therefore propose we should merge that variant and invite reviews there.

Restrict dataflow node creation to source and source-referenced entities

78fa7f6

smowton requested a review from a team as a code owner September 16, 2024 15:17

github-actions bot added the C# label Sep 16, 2024

smowton mentioned this pull request Sep 16, 2024

C#: Restrict dataflow node creation to source and source-referenced entities #17483

Merged

aschackmull reviewed Sep 17, 2024

View reviewed changes

smowton closed this Sep 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

C#: Restrict dataflow node creation to source and source-referenced entities #17482

C#: Restrict dataflow node creation to source and source-referenced entities #17482

smowton commented Sep 16, 2024 •

edited

Loading

aschackmull Sep 17, 2024

smowton Sep 17, 2024

aschackmull Sep 18, 2024

smowton Sep 18, 2024

smowton commented Sep 17, 2024

smowton commented Sep 18, 2024

		TExplicitParameterNode(Parameter p, DataFlowCallable c) {
		p = c.asCallable(_).(CallableUsedInSource).getAParameter()

C#: Restrict dataflow node creation to source and source-referenced entities #17482

C#: Restrict dataflow node creation to source and source-referenced entities #17482

Conversation

smowton commented Sep 16, 2024 • edited Loading

aschackmull Sep 17, 2024

Choose a reason for hiding this comment

smowton Sep 17, 2024

Choose a reason for hiding this comment

aschackmull Sep 18, 2024

Choose a reason for hiding this comment

smowton Sep 18, 2024

Choose a reason for hiding this comment

smowton commented Sep 17, 2024

smowton commented Sep 18, 2024

smowton commented Sep 16, 2024 •

edited

Loading