Adjust Dense Vector Unit Vector Epsilon#110240
Conversation
|
Pinging @elastic/es-search (Team:Search) |
|
What tests, if any, should we add for this change? |
Would the Cohere vectors you found be good candidates for a test? |
|
@elasticsearchmachine run elasticsearch-ci/part-3 |
|
@elasticsearchmachine run elasticsearch-ci/8.15.0 / bwc-snapshots |
|
@elasticsearchmachine run "elasticsearch-ci/8.15.0 / bwc-snapshots" |
|
@elasticsearchmachine run elasticsearch-ci/bwc-snapshots |
|
@elasticsearchmachine run elasticsearch-ci |
Not really. While they are an example of unit vectors that violate the current epsilon, there is no way to know whether they are representative of the global maximum magnitude variance that Cohere will return. We could add a test demonstrating that some set of hard-coded vectors are within the adjusted epsilon, but given that both the vectors and the epsilon are (or would be) hard-coded, I don't know what value that adds. I will merge without tests and we can add some in a follow-up PR if we want to revisit. |
Change dense vector unit vector epsilon to
1e-3. This is required because Cohere sometimes generates unit vectors with magnitude slightly outside our current epsilon. For example, we have observed Cohere unit vectors with magnitude 1.0001829.