chore(engine): Use shard information from query frontend in the new query engine (backport k260) #18208

Merged
merged 1 commit into from
Jun 24, 2025

Conversation

loki-gh-app[bot]
Contributor

@loki-gh-app loki-gh-app bot commented Jun 24, 2025

Backport b316d52 from #17792


What this PR does / why we need it:

This PR introduces sharding to the new query engine so that the engine can be tested using the existing Loki architecture with query frontend, query scheduler, and queriers.

Note, this is only an interim solution used for validating and testing.

By design, the first phase of the query engine implementation performs only local execution of queries, without sharding or time-based splitting.

However, to test the results of this first phase, we can utilise dual-ingestion (writing both chunks/TSDB and data objects) so that sharding and splitting are performed in the query frontend using the existing middlewares.
On the queriers themselves, the sub-queries are parsed and planned using the new logical and physical planners. If a sub-query can be planned successfully, it is executed by the new engine; otherwise it falls back to the old engine.
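
As a rough illustration of this fallback, here is a minimal Go sketch. The types and function names (`QueryRequest`, `newEnginePlan`, and so on) are hypothetical stand-ins, not Loki's actual interfaces:

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

// Hypothetical stand-ins for Loki's internals.
type QueryRequest struct{ Query string }
type Result struct{ Lines int }
type Plan struct{ root string }

// newEnginePlan models the new logical/physical planner: it either
// produces a plan or reports that the query is not yet supported.
func newEnginePlan(q QueryRequest) (*Plan, error) {
	if q.Query == "" {
		return nil, errors.New("query not supported by new engine")
	}
	return &Plan{root: q.Query}, nil
}

func newEngineExecute(ctx context.Context, p *Plan) (Result, error) {
	return Result{Lines: 42}, nil // placeholder execution
}

func oldEngineExecute(ctx context.Context, q QueryRequest) (Result, error) {
	return Result{Lines: 42}, nil // placeholder execution
}

// executeSubQuery tries the new engine first and falls back to the
// old engine whenever planning fails, so every query still succeeds.
func executeSubQuery(ctx context.Context, q QueryRequest) (Result, error) {
	plan, err := newEnginePlan(q)
	if err != nil {
		return oldEngineExecute(ctx, q)
	}
	return newEngineExecute(ctx, plan)
}

func main() {
	res, _ := executeSubQuery(context.Background(), QueryRequest{Query: `{app="foo"}`})
	fmt.Println(res.Lines)
}
```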

Shard size considerations

While time-splitting in the frontend works for data objects as well, shard information from TSDB does not map directly to data objects. The default target shard size in TSDB is 600MB (decompressed), whereas the target size of data objects is 1GB compressed, or roughly 10-15GB uncompressed. However, individual logs sections of a data object have a target size of 128MB compressed, which is roughly 0.9-1.2GB uncompressed. That is 1.5-2x larger than the TSDB target shard size. Using the sharding calculation from TSDB therefore over-shards data object sections, which is likely acceptable for testing and good enough for proving that local execution with the new engine works.
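
To make the over-shard factor concrete, here is the arithmetic from the paragraph above as a small runnable snippet (the sizes are the stated targets, not measured values):

```go
package main

import "fmt"

func main() {
	const tsdbShardMB = 600.0 // TSDB target shard size, decompressed

	// A logs section targets 128MB compressed, roughly 0.9-1.2GB uncompressed.
	for _, sectionMB := range []float64{900, 1200} {
		fmt.Printf("section %.0fMB / TSDB shard %.0fMB = %.1fx\n",
			sectionMB, tsdbShardMB, sectionMB/tsdbShardMB)
	}
	// Prints 1.5x and 2.0x: TSDB-derived shard counts are 1.5-2x too
	// fine-grained for data object sections, i.e. over-sharding.
}
```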

How does sharding with data objects in this PR work?

The query frontend passes down the calculated shards as part of the query parameters of the serialised sub-request. The logical planner on the querier stores the `Shard` annotation on the `MAKETABLE` node alongside the stream selector. The physical planner then uses this annotation to filter the data objects resolved from the metastore lookup down to only the relevant sections. During execution, only readers for the relevant sections are initialised when performing the `DataObjScan`.
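
A minimal sketch of the section-filtering step, assuming a round-robin assignment of sections to shards; the `Section` type and the modulo rule are illustrative, not the metastore's actual API:

```go
package main

import "fmt"

// Section identifies one logs section within a resolved data object.
type Section struct {
	Object string // data object path
	Index  int    // section index within the object
}

// filterSections keeps only the sections assigned to shard `shard`
// out of `of` total shards, using a simple round-robin rule.
func filterSections(sections []Section, shard, of int) []Section {
	var out []Section
	for i, s := range sections {
		if i%of == shard {
			out = append(out, s)
		}
	}
	return out
}

func main() {
	resolved := []Section{
		{"obj-a", 0}, {"obj-a", 1},
		{"obj-b", 0}, {"obj-b", 1},
	}
	// Shard 1 of 2 keeps every other section; DataObjScan would then
	// initialise readers only for this subset.
	fmt.Println(filterSections(resolved, 1, 2))
}
```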

…uery engine (#17792)


Signed-off-by: Christian Haudum <christian.haudum@gmail.com>
Co-authored-by: Ashwanth <iamashwanth@gmail.com>
(cherry picked from commit b316d52)
@chaudum chaudum merged commit b5a54c2 into k260 Jun 24, 2025
71 checks passed
@chaudum chaudum deleted the backport-17792-to-k260 branch June 24, 2025 06:58