ESQL: GROUP BY ALL with the dimensions output #138595
leontyevdv merged 96 commits into elastic:main
Conversation
Conflicts: x-pack/plugin/esql/qa/testFixtures/src/main/resources/k8s-timeseries.csv-spec
Conflicts:
x-pack/plugin/esql/qa/testFixtures/src/main/java/org/elasticsearch/xpack/esql/LoadMapping.java
x-pack/plugin/esql/qa/testFixtures/src/main/resources/k8s-timeseries.csv-spec
…-all-over-time' into esql-group-by-all-over-time
# Conflicts:
#	server/src/main/resources/transport/upper_bounds/8.18.csv
#	server/src/main/resources/transport/upper_bounds/8.19.csv
#	server/src/main/resources/transport/upper_bounds/9.0.csv
#	server/src/main/resources/transport/upper_bounds/9.1.csv
#	server/src/main/resources/transport/upper_bounds/9.2.csv
#	server/src/main/resources/transport/upper_bounds/9.3.csv
#	x-pack/plugin/esql/qa/testFixtures/src/main/resources/k8s-timeseries.csv-spec
#	x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/optimizer/rules/logical/TranslateTimeSeriesAggregate.java
#	x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/plan/logical/Drop.java
Add output of the dimension list into the _timeseries column. Part of elastic#136253
dnhatn
left a comment
One comment, but this looks good. Thanks Dima!
}

MappingLookup mappingLookup = ctx.getMappingLookup();
Set<String> dimensionFields = new HashSet<>();
Can we return an ordered collection to ensure consistent _timeseries values?
I changed it to use a LinkedHashMap 👍🏼
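For context on the ordered-collection suggestion, here is a standalone sketch (not the PR's actual code; the field names are illustrative): LinkedHashSet and LinkedHashMap iterate in insertion order, so the dimension names serialize in a stable order on every run, whereas plain HashSet/HashMap iteration order can vary.

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class DimensionOrder {
    // Illustration only: an insertion-ordered set of dimension field names.
    // A LinkedHashSet always iterates in the order elements were added,
    // which makes the serialized _timeseries values deterministic.
    static Set<String> orderedDimensions() {
        Set<String> dims = new LinkedHashSet<>();
        dims.add("cluster");
        dims.add("pod");
        dims.add("node");
        return dims;
    }

    public static void main(String[] args) {
        // Prints cluster,pod,node on every run.
        System.out.println(String.join(",", orderedDimensions()));
    }
}
```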
Add output of the dimension list into the _timeseries column. Part of elastic#136253
Fix a comment. Part of elastic#136253
…ions' into feature/esql-group-by-all-dimensions
kkrik-es
left a comment
Well done Dima. Will leave it to Nhat to stamp.
Pinging @elastic/es-storage-engine (Team:StorageEngine)
dnhatn
left a comment
One comment, but this looks great. Thanks Dima!
 * Returns a list of dimension field names from a MappingLookup.
 */
@Nullable
default Set<String> dimensionFields() {
I think we should remove this method and add MappingLookup to BlockLoaderContext and compute dimensionFields inside SourceFieldMapper#blockLoader or TimeSeriesMetadataFieldBlockLoader instead. This can be done in a follow-up.
Thanks Nhat! This is a great idea which makes the code cleaner. I implemented it in this PR 👍🏼
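A hedged sketch of the refactor agreed on above. All types here are simplified stand-ins for the real MappingLookup / BlockLoaderContext / TimeSeriesMetadataFieldBlockLoader plumbing: instead of exposing a dimensionFields() hook on the lookup interface, the block-loader context carries the MappingLookup and the loader derives the dimension fields itself.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class BlockLoaderSketch {
    // Stand-in for MappingLookup: knows the mapped fields and which of
    // them are time-series dimensions.
    record MappingLookup(List<String> fieldNames, Set<String> dimensionNames) {}

    // Stand-in for BlockLoaderContext, now carrying the MappingLookup so
    // individual loaders can compute what they need themselves.
    record BlockLoaderContext(MappingLookup mappingLookup) {}

    // Stand-in for the dimension computation moved into the block loader:
    // collect dimension fields in mapping order into an ordered set.
    static Set<String> dimensionFields(BlockLoaderContext ctx) {
        Set<String> dims = new LinkedHashSet<>();
        for (String field : ctx.mappingLookup().fieldNames()) {
            if (ctx.mappingLookup().dimensionNames().contains(field)) {
                dims.add(field);
            }
        }
        return dims;
    }

    public static void main(String[] args) {
        MappingLookup lookup = new MappingLookup(
            List.of("cluster", "pod", "cpu_usage"),
            Set.of("cluster", "pod")
        );
        // Prints [cluster, pod]: the metric field is not a dimension.
        System.out.println(dimensionFields(new BlockLoaderContext(lookup)));
    }
}
```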
Move the dimensions extraction into the TimeSeriesMetadataFieldBlockLoader. Part of elastic#136253
…ions' into feature/esql-group-by-all-dimensions
Add tests. Part of elastic#136253
private static class TimeSeries extends BlockStoredFieldsReader {
    @Override
    public void read(int docId, StoredFields storedFields, Builder builder) throws IOException {
        // TODO support appending BytesReference
Nit: maybe check for empty dimension set? What should we be doing in that case, @dnhatn?
Do we allow cases where all dimensions have no value? If so, I think we should append an empty JSON object {}.
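A minimal sketch of the behavior under discussion (hand-rolled serialization with illustrative names; the real loader would go through Elasticsearch's XContent machinery): a document whose dimension map is empty serializes to the empty JSON object {} instead of failing.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class TimeseriesJson {
    // Illustration only: render an ordered dimension map as the JSON value
    // for the _timeseries column. An empty map yields "{}", matching the
    // suggestion to append an empty JSON object when no dimension has a value.
    static String toJson(Map<String, String> dimensions) {
        StringBuilder sb = new StringBuilder("{");
        boolean first = true;
        for (Map.Entry<String, String> e : dimensions.entrySet()) {
            if (first == false) {
                sb.append(',');
            }
            sb.append('"').append(e.getKey()).append("\":\"").append(e.getValue()).append('"');
            first = false;
        }
        return sb.append('}').toString();
    }

    public static void main(String[] args) {
        System.out.println(toJson(new LinkedHashMap<>())); // prints {}
        Map<String, String> dims = new LinkedHashMap<>();
        dims.put("cluster", "prod");
        dims.put("pod", "p1");
        System.out.println(toJson(dims)); // prints {"cluster":"prod","pod":"p1"}
    }
}
```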
I'll add a test for this case
I cannot create such an index. What I've tried:
- Creating an index without a routing_path leads to
"type": "illegal_argument_exception",
"reason": "[index.mode=time_series] requires a non-empty [index.routing_path]"
- Creating an index with a routing_path ["cluster", "pod"] but without time_series_dimension in the mapping leads to
"type": "illegal_argument_exception",
"reason": "All fields that match routing_path must be configured with [time_series_dimension: true] or flattened fields with a list of dimensions in [time_series_dimensions] and without the [script] parameter. [cluster] was not a dimension."
Creating an index without dimensions is impossible (correct me if I'm wrong), so an empty dimension set is impossible as well.
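For reference, a minimal time-series index definition that satisfies both constraints above (index and field names are illustrative). Dropping index.routing_path, or dropping time_series_dimension: true from a field listed in it, produces exactly the two errors quoted:

```json
PUT /tsdb-test
{
  "settings": {
    "index": {
      "mode": "time_series",
      "routing_path": ["cluster", "pod"]
    }
  },
  "mappings": {
    "properties": {
      "@timestamp": { "type": "date" },
      "cluster":    { "type": "keyword", "time_series_dimension": true },
      "pod":        { "type": "keyword", "time_series_dimension": true },
      "cpu_usage":  { "type": "double",  "time_series_metric": "gauge" }
    }
  }
}
```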
Another thing I've tried is to index a document without dimensions. This led to the routing hash exception:
It failed while indexing the doc with the exception:
[1:49] failed to parse: Input byte[] should at least have 2 bytes for base64 bytes
org.elasticsearch.index.mapper.DocumentParsingException: [1:49] failed to parse: Input byte[] should at least have 2 bytes for base64 bytes
at __randomizedtesting.SeedInfo.seed([730F72C82C3D6654:ECFB9F4DA02BB689]:0)
at org.elasticsearch.index.mapper.DocumentParser.wrapInDocumentParsingException(DocumentParser.java:272)
You'd probably need to have a time-series index with hardcoded routing path that includes no dimension fields. This is not very interesting, admittedly, but can happen I guess. A graceful failure is fine, so long as we don't produce a cryptic error.
Add tests Part of elastic#136253
dnhatn
left a comment
Thanks Dima for another iteration!
    }
    return dimensionFields;
}
return null;
nit: I think we should throw an IllegalStateException here, as we should not reach this line.
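The nit in sketch form (illustrative names, not the PR's actual code): throwing IllegalStateException on the branch that callers should never reach fails fast with a clear message, instead of handing back a null that surfaces later as a NullPointerException.

```java
import java.util.Set;

public class DimensionLookup {
    // Illustration only: this method is only meaningful for time-series
    // indices, so reaching the non-time-series branch indicates a bug in
    // the caller rather than a valid "no result" case.
    static Set<String> dimensionFields(boolean isTimeSeriesIndex) {
        if (isTimeSeriesIndex) {
            return Set.of("cluster", "pod");
        }
        // Previously `return null;` on this line; the explicit exception
        // pinpoints the bug at the call site instead of an NPE downstream.
        throw new IllegalStateException("dimensionFields() called on a non-time-series index");
    }

    public static void main(String[] args) {
        System.out.println(dimensionFields(true));
    }
}
```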
Polish code. Part of elastic#136253
Pinging @elastic/es-analytical-engine (Team:Analytics)
The second part of #136253.
Load the list of dimensions and output it in the _timeseries column instead of _tsid (here is a doc about it).
The first part is here: #137367
Part of #136253