Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Auto-merging Retrieval #7863

Open
davidsbatista opened this issue Jun 14, 2024 · 0 comments
Open

Auto-merging Retrieval #7863

davidsbatista opened this issue Jun 14, 2024 · 0 comments
Labels
2.x Related to Haystack v2.0 P2 Medium priority, add to the next sprint if no P1 available topic:retriever

Comments

@davidsbatista
Copy link
Contributor

Auto-merging retrieval aims to combine (or merge) information from multiple sources or segments of text to create a more comprehensive and contextually relevant response to a query.

This approach is particularly useful when no single document or segment fully answers the query but rather the answer lies in combining information from multiple sources.

It allows smaller chunks to be merged into bigger parent chunks. It does this via the following steps:

  1. Define a hierarchy of smaller chunks linked to parent chunks.
  2. If the set of smaller chunks linking to a parent chunk exceeds some threshold (say, cosine similarity), then “merge” smaller chunks into the bigger parent chunk.

The method will finally retrieve the parent chunk for better context.

@shadeMe shadeMe added topic:retriever 2.x Related to Haystack v2.0 P2 Medium priority, add to the next sprint if no P1 available labels Jun 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2.x Related to Haystack v2.0 P2 Medium priority, add to the next sprint if no P1 available topic:retriever
Projects
None yet
Development

No branches or pull requests

2 participants