Weights in simplified RRF retriever cleanup by Mikep86 · Pull Request #135040 · elastic/elasticsearch

Mikep86 · 2025-09-18T21:55:01Z

Cleans up tests added in #132680. The rewrite tests in LinearRetrieverBuilderTests and RRFRetrieverBuilderTests are now aligned, testing the same scenarios in the same order. This should make these tests easier to maintain. The YAML tests have been adjusted to remove redundant tests and score checks that did not add value.

elasticsearchmachine · 2025-09-18T21:55:25Z

Pinging @elastic/search-relevance (Team:Search - Relevance)

Mikep86 · 2025-09-18T22:05:09Z

.../rank-rrf/src/yamlRestTest/resources/rest-api-spec/test/rrf/310_rrf_retriever_simplified.yml

-"Basic per-field boosting using the simplified format":
-  - requires:
-      cluster_features: ["rrf_retriever.simplified_weighted_support"]
-      reason: "Simplified weighted fields syntax support"
-
-  - do:
-      search:
-        index: test-index
-        body:
-          retriever:
-            rrf:
-              fields: [ "text_1", "text_2^2" ]
-              query: "foo"
-
-  # With weighted fields, verify basic functionality
-  - gte: { hits.total.value: 1 }
-  - length: { hits.hits: 1 }
-  # Verify that text_2^2 affects ranking (basic smoke test)


This test is redundant, as we already have a test that applies per-field boosts to lexical matches and checks for changes in result order. Additionally, the assertions performed in this test do not meaningfully check that the boost actually was applied.
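To illustrate why count-only assertions cannot detect a boost, here is a minimal Python sketch. It assumes weighted RRF scores each document as the sum over retrievers of `weight / (rank_constant + rank)`. That formula, the `weighted_rrf` helper, the document IDs, and the per-field rankings are assumptions made up for illustration; none of it is taken from the Elasticsearch implementation.

```python
from collections import defaultdict


def weighted_rrf(rankings, weights, rank_constant=60):
    """Fuse ranked lists of doc IDs (hypothetical helper, not Elasticsearch code)."""
    scores = defaultdict(float)
    for ranking, weight in zip(rankings, weights):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += weight / (rank_constant + rank)
    # Highest fused score first; ties broken by doc ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Made-up per-field rankings for the same query.
text_1_hits = ["doc_a", "doc_b"]
text_2_hits = ["doc_c", "doc_b", "doc_a"]

unboosted = weighted_rrf([text_1_hits, text_2_hits], [1.0, 1.0])
boosted = weighted_rrf([text_1_hits, text_2_hits], [1.0, 2.0])  # like "text_2^2"

# A count-only check passes in both cases, so it says nothing about the boost.
assert len(unboosted) == len(boosted) == 3

# Only the order reveals that the boost was applied.
print(unboosted)  # ['doc_a', 'doc_b', 'doc_c']
print(boosted)    # ['doc_b', 'doc_a', 'doc_c']
```

In this sketch the hit count is identical with and without the boost, while the top two documents swap places, which is why an order-based assertion is the more meaningful check.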

Mikep86 · 2025-09-18T22:05:35Z

.../rank-rrf/src/yamlRestTest/resources/rest-api-spec/test/rrf/310_rrf_retriever_simplified.yml

-"Semantic field weighting":
-  - requires:
-      cluster_features: ["rrf_retriever.simplified_weighted_support"]
-      reason: "Simplified weighted fields syntax support"
-
-  - do:
-      search:
-        index: test-index
-        body:
-          retriever:
-            rrf:
-              fields: ["dense_inference^2", "sparse_inference^1.5"]
-              query: "elasticsearch"
-
-  - match: { hits.total.value: 3 }
-  - length: { hits.hits: 3 }


Same as https://github.com/elastic/elasticsearch/pull/135040/files#r2361285845, but for boosts on semantic matches.

Mikep86 · 2025-09-18T22:05:57Z

.../rank-rrf/src/yamlRestTest/resources/rest-api-spec/test/rrf/310_rrf_retriever_simplified.yml

-"Zero weight handling":
-  - requires:
-      cluster_features: ["rrf_retriever.simplified_weighted_support"]
-      reason: "Simplified weighted fields syntax support"
-
-  - do:
-      search:
-        index: test-index
-        body:
-          retriever:
-            rrf:
-              fields: ["text_1^0", "text_2^1"]
-              query: "foo"
-
-  - gte: { hits.total.value: 1 }


We already have coverage for this in the rewrite tests.

Mikep86 · 2025-09-18T22:07:33Z

.../rank-rrf/src/yamlRestTest/resources/rest-api-spec/test/rrf/310_rrf_retriever_simplified.yml

RRF scores are generally not meaningful, which is why the existing tests did not check them. Moreover, the way the scores were being checked did not add any real value.
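A small sketch of why the absolute values carry little information, assuming the standard reciprocal rank fusion formula in which each retriever contributes `1 / (rank_constant + rank)`. The `rrf_score` helper and the example ranks are hypothetical and only meant to show the shape of the argument.

```python
def rrf_score(ranks, rank_constant=60):
    """Sum of reciprocal-rank contributions for one document (1-based ranks)."""
    return sum(1.0 / (rank_constant + rank) for rank in ranks)


# The score depends only on rank positions, never on how strong the match was.
best = rrf_score([1, 1])        # ranked first by two retrievers
runner_up = rrf_score([2, 2])   # ranked second by two retrievers

# A different rank constant changes every score but not the relative order.
best_k10 = rrf_score([1, 1], rank_constant=10)
runner_up_k10 = rrf_score([2, 2], rank_constant=10)

assert best > runner_up
assert best_k10 > runner_up_k10
assert best != best_k10
```

Under these assumptions, asserting an exact score mostly re-tests the formula's constants, whereas asserting the result order tests the behavior that matters to users.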

Mikep86 · 2025-09-19T11:56:15Z

@elasticmachine update branch

kderusso

Test looks good to me. The one piece of feedback that I have for the unit tests is that it would be really helpful to add comments to the tests explaining why the scores are expected, just to make them a little more readable and parseable. Maybe that's something @mridula-s109 could help with in a followup if we agree it would be useful?

mridula-s109 · 2025-09-19T12:30:54Z

> Test looks good to me. The one piece of feedback that I have for the unit tests is that it would be really helpful to add comments to the tests explaining why the scores are expected, just to make them a little more readable and parseable. Maybe that's something @mridula-s109 could help with in a followup if we agree it would be useful?

Good point - adding comments to explain the expected scores would definitely make the tests more readable. I’m happy to take that up in a follow-up PR if everyone agrees.

Mikep86 · 2025-09-19T12:40:46Z

@kderusso

> The one piece of feedback that I have for the unit tests is that it would be really helpful to add comments to the tests explaining why the scores are expected

Do you mean the per-field boosts? We don't check document scores in the unit tests.

kderusso · 2025-09-19T12:47:30Z

> Do you mean the per-field boosts? We don't check document scores in the unit tests.

Sorry, yes, e.g. quickly explain why we would expect a boost of 3.75 on a combined field
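The kind of comment being requested could spell out the arithmetic behind an expected boost. The thread does not say where 3.75 comes from, so the following is a purely hypothetical Python sketch: it assumes a field's own `^` boost is multiplied by a second boost applied on top of it. The `parse_field` helper and the values 1.5 and 2.5 are invented for illustration and are not taken from the tests.

```python
def parse_field(spec):
    """Split 'name^boost' into (name, boost); boost defaults to 1.0 (hypothetical helper)."""
    name, sep, boost = spec.partition("^")
    return name, float(boost) if sep else 1.0


# The "field^boost" syntax as used in the YAML tests, e.g. fields: ["text_1", "text_2^2"].
assert parse_field("text_1") == ("text_1", 1.0)
assert parse_field("text_2^2") == ("text_2", 2.0)

# Hypothetical: a per-field boost of 1.5 combined with an outer boost of 2.5.
_, field_boost = parse_field("text_2^1.5")
outer_boost = 2.5  # made-up value, chosen only so the product is 3.75
combined = field_boost * outer_boost  # 1.5 * 2.5 = 3.75
```

A one-line comment of the form "1.5 (field) * 2.5 (outer) = 3.75" next to an assertion would give a reader the same information without having to trace the rewrite logic.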

Mikep86 added 2 commits September 18, 2025 16:28

Aligned RRF and linear retriever unit tests

df2c607

Adjusted YAML tests

a9b25c3

Mikep86 requested review from ioanatia, kderusso and mridula-s109 September 18, 2025 21:55

Mikep86 added the >test and :SearchOrg/Relevance labels Sep 18, 2025

elasticsearchmachine added the Team:Search - Relevance label Sep 18, 2025

elasticsearchmachine added the v9.2.0 label Sep 18, 2025

Merge branch 'main' into simplified-rrf-weights-cleanup

4370053

mridula-s109 approved these changes Sep 19, 2025

kderusso approved these changes Sep 19, 2025

ioanatia approved these changes Sep 19, 2025

Mikep86 merged commit a375c6e into elastic:main Sep 19, 2025
34 checks passed

Mikep86 merged 3 commits into elastic:main from Mikep86:simplified-rrf-weights-cleanup
