
[8.19] Optimize sparse vector stats collection (#128740) #128771

Merged
elasticsearchmachine merged 1 commit into elastic:8.19 from jimczi:backport/8.19/pr-128740 on Jun 2, 2025

Conversation

@jimczi (Contributor) commented Jun 2, 2025

Backports the following commits to 8.19:

 - Optimize sparse vector stats collection (#128740)

This change speeds up sparse vector statistics gathering by reading the document count of each field's terms directly, rather than looking the field up in the `_field_names` field to compute stats.
This avoids per-term disk/network reads and instead uses statistics that are already loaded into the leaf readers when the index is opened, so we expect the overhead to drop significantly.
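
To illustrate the idea only, here is a minimal, self-contained sketch. It is not the code from this PR: `Segment` and `countSparseVectorValues` are hypothetical stand-ins, with `Segment` modeling a leaf reader whose per-field document counts are already in memory. In Lucene the analogous per-field statistic is, to my understanding, exposed through `Terms.getDocCount()`.

```java
import java.util.List;
import java.util.Map;

public class SparseVectorStatsSketch {

    /**
     * Hypothetical stand-in for a leaf reader (one segment). The per-field
     * document counts are assumed to be part of the metadata that is loaded
     * when the segment is opened, so reading them needs no term lookup.
     */
    static final class Segment {
        private final Map<String, Integer> docCountByField;

        Segment(Map<String, Integer> docCountByField) {
            this.docCountByField = docCountByField;
        }

        /** Number of documents with at least one term for the field, or 0 if the field is absent. */
        int docCount(String field) {
            return docCountByField.getOrDefault(field, 0);
        }
    }

    /**
     * Sums the already-loaded per-segment document counts for every sparse
     * vector field, instead of seeking a term per field in a separate index field.
     */
    static long countSparseVectorValues(List<Segment> segments, List<String> sparseVectorFields) {
        long total = 0;
        for (Segment segment : segments) {
            for (String field : sparseVectorFields) {
                total += segment.docCount(field);
            }
        }
        return total;
    }
}
```

The cost here is one in-memory lookup per field per segment, which is the property the description above relies on; the field names used by a caller would come from the index mapping.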

Relates to elastic#128583
@jimczi added the labels :Core/Infra/Stats, >enhancement, auto-merge-without-approval, backport, and Team:Data Management (obsolete) on Jun 2, 2025
@elasticsearchmachine elasticsearchmachine merged commit 839aa2b into elastic:8.19 Jun 2, 2025
15 checks passed
@jimczi jimczi deleted the backport/8.19/pr-128740 branch June 2, 2025 16:18

Labels

auto-merge-without-approval, backport, :Core/Infra/Stats, >enhancement, Team:Data Management (obsolete), v8.19.0

2 participants