Nested KNN vs inner hits scoring consistency by tteofili · Pull Request #144548 · elastic/elasticsearch

tteofili · 2026-03-19T11:37:19Z

This attempts at fixing child visibility issue at #144207 and approximate vs rescore score consistency issues (#138011 and #138496) for nested KNN.

For nested vectors, we retrieve vectors in child (nested) space but ultimately return parents, so in the diversifying kNN step we gather enough nearest nested hits and then fold them up under parents. When that step only considers too few children per parent, the parent score can be computed from an incomplete set of nested candidates. Meanwhile inner_hits may be driven by a different evaluation path (e.g. focusing on children with exact kNN during inner-hit rewrite).

This adds an internal flag useMaxNestedKnnCandidatesInNested such that when the query runs in a nested-parent context (parentBitSet != null), the implementation raises the effective k and num_candidates to at least the (new) MAX_NESTED_KNN_CANDIDATES_SETTING index setting (defaults to 100).
Higher index setting values will diversify over more nested docs per parent (more CPU/memory, usually stabler parent vs inner-hit agreement when many children compete). Lower values will mean cheaper execution, but easier to miss the nested child combination that dominates the true max score for that parent.

…nn_ihscore

…h into nested_knn_ihscore

…nn_ihscore

elasticsearchmachine · 2026-03-23T16:34:35Z

Hi @tteofili, I've created a changelog YAML for you.

github-actions · 2026-04-21T08:52:02Z

🔍 Preview links for changed docs

⏳ Building and deploying preview... View progress

This comment will be updated with preview links when the build is complete.

github-actions · 2026-04-21T08:53:29Z

ℹ️ Important: Docs version tagging

👋 Thanks for updating the docs! Just a friendly reminder that our docs are now cumulative. This means all 9.x versions are documented on the same page and published off of the main branch, instead of creating separate pages for each minor version.

We use applies_to tags to mark version-specific features and changes.

Expand for a quick overview

When to use applies_to tags:

✅ At the page level to indicate which products/deployments the content applies to (mandatory)
✅ When features change state (e.g. preview, ga) in a specific version
✅ When availability differs across deployments and environments

What NOT to do:

❌ Don't remove or replace information that applies to an older version
❌ Don't add new information that applies to a specific version without an applies_to tag
❌ Don't forget that applies_to tags can be used at the page, section, and inline level

🤔 Need help?

Check out the cumulative docs guidelines
Reach out in the #docs Slack channel

…nn_ihscore

elasticsearchmachine · 2026-05-05T14:46:36Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

benwtrent

I disagree with this approach. I don't think it fixes two of the main issues and its a dangerous performance hit.

If we want to force the index exploration to gather MORE children per parent node, we need to do something directly with the graph/index exploration (example: apache/lucene#16034)

However, that said, I am not sure that this approach actually fixes the issue. Its significantly dependent on the distribution of the number of children vs. parent. I don't think a new magic number is what we need.

This inconsistency seems to be caused by multiple places:

Quantized vs. raw scoring
Potentially seeing NEW children that weren't considered before

First one is scorer focused
Second one is approximation focused

Nested KNN vs inner hits scoring consistency

f612642

elasticsearchmachine added the v9.4.0 label Mar 19, 2026

tteofili and others added 13 commits March 19, 2026 12:38

spotless

d22a8d5

Merge branch 'main' of github.com:elastic/elasticsearch into nested_k…

da93548

…nn_ihscore

[CI] Auto commit changes from spotless

0bb6aff

more tests

2a71aee

Merge branch 'nested_knn_ihscore' of github.com:tteofili/elasticsearc…

3561b94

…h into nested_knn_ihscore

[CI] Auto commit changes from spotless

1ee6cd5

Merge branch 'main' into nested_knn_ihscore

7cb69b6

Merge branch 'main' into nested_knn_ihscore

781632b

Merge branch 'main' of github.com:elastic/elasticsearch into nested_k…

abbe7e7

…nn_ihscore

make sure parent/child scores are aligned

de2ffff

Merge branch 'main' of github.com:elastic/elasticsearch into nested_k…

39fb4b1

…nn_ihscore

Merge branch 'main' of github.com:elastic/elasticsearch into nested_k…

7efe0a4

…nn_ihscore

ccr support

43eb76f

tteofili added :Search Relevance/Vectors Vector search >bug labels Mar 23, 2026

Update docs/changelog/144548.yaml

53b903a

brianseeders added v9.5.0 and removed v9.4.0 labels Apr 10, 2026

Merge branch 'main' into nested_knn_ihscore

7a8e78b

github-actions Bot deployed to docs-preview April 21, 2026 08:52 View deployment

tteofili marked this pull request as ready for review April 21, 2026 13:11

tteofili requested a review from mayya-sharipova April 22, 2026 13:11

Merge branch 'main' of github.com:elastic/elasticsearch into nested_k…

f387c97

…nn_ihscore

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label May 5, 2026

github-actions Bot deployed to docs-preview May 5, 2026 14:48 View deployment

benwtrent requested changes May 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nested KNN vs inner hits scoring consistency#144548

Nested KNN vs inner hits scoring consistency#144548
tteofili wants to merge 17 commits into
elastic:mainfrom
tteofili:nested_knn_ihscore

tteofili commented Mar 19, 2026 •

edited

Loading

Uh oh!

elasticsearchmachine commented Mar 23, 2026

Uh oh!

github-actions Bot commented Apr 21, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 21, 2026

When to use applies_to tags:

What NOT to do:

Uh oh!

elasticsearchmachine commented May 5, 2026

Uh oh!

benwtrent left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

tteofili commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Mar 23, 2026

Uh oh!

github-actions Bot commented Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔍 Preview links for changed docs

Uh oh!

github-actions Bot commented Apr 21, 2026

ℹ️ Important: Docs version tagging

When to use applies_to tags:

What NOT to do:

🤔 Need help?

Uh oh!

elasticsearchmachine commented May 5, 2026

Uh oh!

benwtrent left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tteofili commented Mar 19, 2026 •

edited

Loading

github-actions Bot commented Apr 21, 2026 •

edited

Loading