More efficient sort in `tryRelocateShard` by DaveCTurner · Pull Request #128063 · elastic/elasticsearch

DaveCTurner · 2025-05-14T12:42:16Z

No need to do this via an allocation-heavy Stream, we can just put the
objects straight into an array, sort them in-place, and keep hold of the
array to avoid having to allocate anything on the next iteration.

Also slims down BY_DESCENDING_SHARD_ID: it's always sorting the same
index so we don't need to look at ShardId#index in the comparison, nor
do we really need multiple layers of vtable lookups, we can just compare
the shard IDs directly.

Relates #128021

No need to do this via an allocation-heavy `Stream`, we can just put the objects straight into an array, sort them in-place, and keep hold of the array to avoid having to allocate anything on the next iteration. Also slims down `BY_DESCENDING_SHARD_ID`: it's always sorting the same index so we don't need to look at `ShardId#index` in the comparison, nor do we really need multiple layers of vtable lookups, we can just compare the shard IDs directly. Relates elastic#128021

elasticsearchmachine · 2025-05-14T12:42:40Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

elasticsearchmachine · 2025-05-14T12:42:41Z

Hi @DaveCTurner, I've created a changelog YAML for you.

DiannaHohensee

lgtm 👍

To confirm, this change essentially

removes the use of Stream, which isn't performant
shortens the comparison method
reuses allocated memory

Though this last one sacrifices memory in favor of performance -- if a user has one index with a stupid number of shards, we're keep that array memory allocation for the duration of a balancing round? Though even a large number of shards will take insignificant memory.

DiannaHohensee · 2025-05-19T18:21:56Z

...ain/java/org/elasticsearch/cluster/routing/allocation/allocator/BalancedShardsAllocator.java

+                for (final var shardRouting : index) {
+                    if (shardRouting.started()) { // cannot rebalance unassigned, initializing or relocating shards anyway
+                        shardRoutingsOnMaxWeightNode[startedShards] = shardRouting;
+                        startedShards += 1;


opt: ++startedShards or shardRoutingsOnMaxWeightNode[startedShards++] = shardRouting;

IMO unary increments are harder on the reader than spelling out a += 1 so I'd rather leave it like this.

DaveCTurner · 2025-05-19T18:37:08Z

if a user has one index with a stupid number of shards,

Yeah the maximum is 1024, if they all end up on one node for some reason then we allocate an array of length 2048 which is still tiny and probably not worth GC'ing.

We probably allocate more than 2048*8=16kiB today just by using streams.

No need to do this via an allocation-heavy `Stream`, we can just put the objects straight into an array, sort them in-place, and keep hold of the array to avoid having to allocate anything on the next iteration. Also slims down `BY_DESCENDING_SHARD_ID`: it's always sorting the same index so we don't need to look at `ShardId#index` in the comparison, nor do we really need multiple layers of vtable lookups, we can just compare the shard IDs directly. Relates elastic#128021

elasticsearchmachine · 2025-05-19T19:46:59Z

💚 Backport successful

Status	Branch	Result
✅	8.19

No need to do this via an allocation-heavy `Stream`, we can just put the objects straight into an array, sort them in-place, and keep hold of the array to avoid having to allocate anything on the next iteration. Also slims down `BY_DESCENDING_SHARD_ID`: it's always sorting the same index so we don't need to look at `ShardId#index` in the comparison, nor do we really need multiple layers of vtable lookups, we can just compare the shard IDs directly. Relates #128021

DaveCTurner added >enhancement :Distributed/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v8.19.0 v9.1.0 labels May 14, 2025

DaveCTurner requested a review from DiannaHohensee May 14, 2025 12:42

elasticsearchmachine added the Team:Distributed Coordination (obsolete) Meta label for Distributed Coordination team. Obsolete. Please do not use. label May 14, 2025

Update docs/changelog/128063.yaml

3060542

DaveCTurner added the auto-backport Automatically create backport pull requests when merged label May 14, 2025

DiannaHohensee approved these changes May 19, 2025

View reviewed changes

Merge branch 'main' into 2025/05/14/BalancedShardsAllocator-slimmer-sort

1324257

DaveCTurner added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label May 19, 2025

elasticsearchmachine merged commit a84dff8 into elastic:main May 19, 2025
17 checks passed

DaveCTurner deleted the 2025/05/14/BalancedShardsAllocator-slimmer-sort branch May 19, 2025 19:45

DaveCTurner mentioned this pull request May 19, 2025

[8.19] More efficient sort in tryRelocateShard (#128063) #128159

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More efficient sort in `tryRelocateShard`#128063

More efficient sort in `tryRelocateShard`#128063
elasticsearchmachine merged 3 commits intoelastic:mainfrom
DaveCTurner:2025/05/14/BalancedShardsAllocator-slimmer-sort

DaveCTurner commented May 14, 2025

elasticsearchmachine commented May 14, 2025

elasticsearchmachine commented May 14, 2025

DiannaHohensee left a comment

DiannaHohensee May 19, 2025

DaveCTurner May 19, 2025

DaveCTurner commented May 19, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented May 19, 2025

Labels

3 participants

Conversation

DaveCTurner commented May 14, 2025

elasticsearchmachine commented May 14, 2025

elasticsearchmachine commented May 14, 2025

DiannaHohensee left a comment

Choose a reason for hiding this comment

DiannaHohensee May 19, 2025

Choose a reason for hiding this comment

DaveCTurner May 19, 2025

Choose a reason for hiding this comment

DaveCTurner commented May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented May 19, 2025

💚 Backport successful

Labels

3 participants

DaveCTurner commented May 19, 2025 •

edited

Loading