Add time range bucketing attribute to APM took time latency metrics #135549

javanna · 2025-09-26T16:07:19Z

This is similar to #135524, but adding the attribute to the took time latency metric.

That requires a bit of ceremony as the took time metric is recorded on the coordinating node, while the time range filter is parsed on each shard. We don't have mappings available on the coord node, which are needed to parse dates on the coord node. Thus we need to rely on date parsing done on the data nodes, which requires sending back the parsed value to the coord node, performing some simple reduction on it, and adding it back to the search response.

This is similar to elastic#135524, but adding the attribute to the took time latency metric. That requires a bit of ceremony as the took time metric is recorded on the coordinating node, while the time range filter is parsed on each shard. We don't have mappings available on the coord node, which are needed to parse dates on the coord node. Thus we need to rely on date parsing done on the data nodes, which requires sending back the parsed value to the coord node, performing some simple reduction on it, and adding it back to the search response.

elasticsearchmachine · 2025-09-26T16:07:46Z

Pinging @elastic/es-search-foundations (Team:Search Foundations)

elasticsearchmachine · 2025-09-26T16:07:46Z

Hi @javanna, I've created a changelog YAML for you.

javanna · 2025-09-26T18:25:01Z

server/src/main/java/org/elasticsearch/search/query/QueryPhase.java

                    )
                ) {
-                    QueryPhase.addCollectorsAndSearch(rankSearchContext);
+                    QueryPhase.addCollectorsAndSearch(rankSearchContext, null);


I will look into this as a follow-up, need to write a test that leverages this and see if it makes sense to track timestamp from for it too, it probably does.

javanna · 2025-09-26T18:26:18Z

...er/src/test/java/org/elasticsearch/search/TelemetryMetrics/SearchTookTimeTelemetryTests.java

+            assertEquals("hits_only", attributes.get("query_type"));
+            assertEquals("_score", attributes.get("sort"));
+            assertEquals("pit", attributes.get("pit_scroll"));
+        }


As a follow-up, I have more work to do around retrievers, as they optionally hold a query too. Need to walk the retrievers tree like I do for queries, and add more tests.

javanna · 2025-09-26T19:25:30Z

server/src/main/java/org/elasticsearch/index/query/SearchExecutionContext.java

-        } else {
-            this.rangeTimestampFrom = Math.min(rangeTimestampFrom, this.rangeTimestampFrom);
-        }
-    }


I realized that we need this in QueryRewriteContext, that this extends, to do the same tracking when we parse the time range filter as part of query rewrite. There's cases when the range gets rewritten to a range with no bounds (match all), for which we still want to bucket the request based on its original time range.

benchaplin

Change looks good.

andreidan

LGTM, thanks Luca

server/src/main/java/org/elasticsearch/action/search/SearchResponse.java

andreidan · 2025-09-29T12:35:54Z

server/src/main/java/org/elasticsearch/index/query/BoolQueryBuilder.java

-        addBooleanClauses(context, booleanQueryBuilder, mustNotClauses, BooleanClause.Occur.MUST_NOT);
-        addBooleanClauses(context, booleanQueryBuilder, shouldClauses, BooleanClause.Occur.SHOULD);
+        try {
+            context.setTrackRangeTimestampFrom(false);


should we document why we disable this tracking here?

maybe secondary, but if we have only one should clause, we want to track the range?

I can add a comment, yes we could have a special case for bool with a single should.

this reminds me of the current discrepancy between the boolean attribute that signals whether there is a filter on timestamp, and the range bucket. I meant to ask you what you think about this.

If a shard has all documents within the range, it is in fact not a range query when it comes to the lucene level, but rather a match_all. I thought that we'd want to track this distinction.

At the coord level, we will always extract the original range before query rewrite and set to flag to true.
At the shard level, only those shards that effectively execute a range query will have the boolean flag set to true, but all will have the range extracted. Would we want instead the two to be consistent? I even challenged at some point that we still want the query introspection if all we use it for is to figure out whether there was a filter on @timestamp: we now have another way to do that which does not require visiting the query tree, and perhaps we don't even need the boolean flag anymore.

Looking further, I don't think I will add a special case for bool with a single should clause. I already don't like how intrusive the metrics tracking is in the actual code that does stuff. I am way of adding more logic to control when to report on metrics, intermingled with the actual code that rewrites and executes queries. I think that having a time range as a single should clause it also quite an edge case. If needed we can always optimize it later.

Sounds good, thanks for looking into this

andreidan · 2025-09-29T12:40:15Z

server/src/main/java/org/elasticsearch/search/SearchService.java

+                // range queries may get rewritten to match_all or a range with open bounds. Rewriting in that case is the only place
+                // where we parse the date and set it to the context. We need to propagate it back from the clone into the original context


❤️ this is very useful and a true comment, documenting the why. Thank you!

server/src/main/java/org/elasticsearch/search/query/QueryPhase.java

server/src/main/java/org/elasticsearch/search/query/QuerySearchResult.java

….java Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

…hResult.java Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

javanna added 3 commits September 26, 2025 17:41

iter

e81fb6c

iter

6470960

javanna requested review from tteofili and andreidan September 26, 2025 16:07

javanna added >enhancement :Search Foundations/Search Catch all for Search Foundations v9.2.0 labels Sep 26, 2025

elasticsearchmachine added the Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch label Sep 26, 2025

Update docs/changelog/135549.yaml

1aed53d

elasticsearchmachine and others added 3 commits September 26, 2025 16:15

[CI] Auto commit changes from spotless

6eb53a3

iter

3967f72

iter

3eb28fa

javanna commented Sep 26, 2025

View reviewed changes

iter

d9fdc5b

javanna commented Sep 26, 2025

View reviewed changes

[CI] Auto commit changes from spotless

cc195d4

benchaplin approved these changes Sep 26, 2025

View reviewed changes

javanna added 4 commits September 27, 2025 00:03

iter

b5f144d

Merge branch 'main' into enhancement/took_time_time_range_bucketing

d81c985

iter

532eb96

Merge branch 'main' into enhancement/took_time_time_range_bucketing

0684e86

andreidan approved these changes Sep 29, 2025

View reviewed changes

javanna and others added 4 commits September 29, 2025 20:06

Update server/src/main/java/org/elasticsearch/search/query/QueryPhase…

ff41cb0

….java Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

Update server/src/main/java/org/elasticsearch/search/query/QuerySearc…

f985361

…hResult.java Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

iter

2ba5023

Merge branch 'main' into enhancement/took_time_time_range_bucketing

bcffd12

javanna and others added 11 commits September 29, 2025 20:56

iter

d663723

iter

5eaf6db

[CI] Auto commit changes from spotless

233db24

Merge branch 'main' into enhancement/took_time_time_range_bucketing

ba23911

iter

53a80d0

[CI] Auto commit changes from spotless

54a8a4f

[CI] Update transport version definitions

97cb335

iter

3f47f9a

Merge branch 'main' into enhancement/took_time_time_range_bucketing

6948a16

iter

e28898f

Merge branch 'main' into enhancement/took_time_time_range_bucketing

edffb03

elasticsearchmachine added v9.3.0 and removed v9.2.0 labels Oct 2, 2025

javanna mentioned this pull request Oct 2, 2025

Convert BytesTransportResponse when proxying response from/to local node #135873

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add time range bucketing attribute to APM took time latency metrics #135549

Add time range bucketing attribute to APM took time latency metrics #135549

javanna commented Sep 26, 2025

Uh oh!

elasticsearchmachine commented Sep 26, 2025

Uh oh!

elasticsearchmachine commented Sep 26, 2025

Uh oh!

javanna Sep 26, 2025

Uh oh!

javanna Sep 26, 2025

Uh oh!

javanna Sep 26, 2025

Uh oh!

benchaplin left a comment

Uh oh!

andreidan left a comment

Uh oh!

Uh oh!

andreidan Sep 29, 2025

Uh oh!

javanna Sep 29, 2025

Uh oh!

javanna Sep 29, 2025

Uh oh!

javanna Sep 29, 2025

Uh oh!

andreidan Sep 30, 2025

Uh oh!

andreidan Sep 29, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		// range queries may get rewritten to match_all or a range with open bounds. Rewriting in that case is the only place
		// where we parse the date and set it to the context. We need to propagate it back from the clone into the original context

Add time range bucketing attribute to APM took time latency metrics #135549

Are you sure you want to change the base?

Add time range bucketing attribute to APM took time latency metrics #135549

Conversation

javanna commented Sep 26, 2025

Uh oh!

elasticsearchmachine commented Sep 26, 2025

Uh oh!

elasticsearchmachine commented Sep 26, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benchaplin left a comment

Choose a reason for hiding this comment

Uh oh!

andreidan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!