
Handling Large-Topic Summarization in Amazon Bedrock Knowledge Base with OpenSearch and Nova Lite


Hi AWS Community,

I am working on a RAG system using Amazon Bedrock Knowledge Base (AKB) with OpenSearch as the vector store. Here is my current setup:

Data ingestion: I have ingested metadata and post content into the AKB. The data is embedded and stored in the OpenSearch vector store.

LLM: Currently using Nova Lite for generating summaries.

Retrieval & generation: I am using the RetrieveAndGenerate API. The maximum number of retrieved results per request is currently 100.

Scenario / Challenge:

  • A topic may have a very large number of posts (e.g., 10,000 large posts).
  • There is no existing summary, so all posts are relevant.
  • Retrieval is based on matching threadId.
  • The total content exceeds the LLM’s context window, making it impossible to summarize all posts in a single call.
  • The batch size should be dynamic based on the total size of posts, not just the number of posts.
  • Summarization must preserve the chronological order of posts.

Questions / Guidance Needed:

  1. Batching / hierarchical summarization:
  • How can I implement a hierarchical summarization flow to handle such large topics?
  • Can batching be done dynamically based on the content size to fit the context window?
  • Can I use NextToken or some form of pagination with the Retrieve or RetrieveAndGenerate API to retrieve large datasets in chunks?
  2. API usage:
  • Should I use Retrieve API + InvokeModel or RetrieveAndGenerate API for this scenario?
  • Which approach is best for generating partial summaries and then combining them into a final summary?
  3. Ensuring order:
  • How can I guarantee that posts are retrieved and summarized in chronological order when batching?
  4. Best practices:
  • Any recommendations for implementing this type of large-topic summarization in AKB with OpenSearch vector store and Nova Lite?

I would greatly appreciate any guidance, examples, or references for implementing this hierarchical, order-preserving summarization workflow with large topics.

Thank you in advance!

1 Answer

When handling large-topic summarization in Amazon Bedrock Knowledge Base with OpenSearch and Nova Lite, you'll need to implement a strategic approach to overcome the context window limitations. Here's guidance on your questions:

Batching/Hierarchical Summarization

For implementing hierarchical summarization with large topics:

  1. You can dynamically adjust the number of retrieved chunks in your RetrieveAndGenerate request by modifying the retrievalConfiguration.vectorSearchConfiguration.numberOfResults parameter. This allows you to optimize information density based on content size.

  2. For pagination, you can implement metadata filtering to add contextual layers to chunk retrieval. This would help you retrieve posts in batches while maintaining chronological order by using timestamp metadata.

  3. Consider implementing a multi-level summarization approach:

  • First level: Retrieve and summarize posts in smaller chronological batches
  • Second level: Combine these batch summaries into a final summary
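The first-level batching can be driven by content size rather than post count. A minimal sketch, using a character budget as a rough proxy for tokens (the 12,000-character figure is illustrative; tune it to Nova Lite's actual context window and your prompt overhead):

```python
def batch_posts_by_size(posts, max_batch_chars=12000):
    """Group chronologically ordered posts into batches whose combined
    text stays under a character budget (a rough proxy for tokens).

    `posts` is assumed to be a list of dicts with a "text" field,
    already sorted in chronological order.
    """
    batches, current, current_size = [], [], 0
    for post in posts:
        size = len(post["text"])
        # Start a new batch if adding this post would exceed the budget.
        # A single oversized post still gets its own batch (and would need
        # further splitting before summarization).
        if current and current_size + size > max_batch_chars:
            batches.append(current)
            current, current_size = [], 0
        current.append(post)
        current_size += size
    if current:
        batches.append(current)
    return batches
```

Because the input is processed in order and batches are appended sequentially, chronological order is preserved across batches for free.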

API Usage

For your specific scenario:

  1. The RetrieveAndGenerate API is generally more straightforward for implementing RAG workflows. The API call structure is simple:
response = bedrock_agent_runtime_client.retrieve_and_generate(
    input={
        'text': prompt,
    },
    retrieveAndGenerateConfiguration={
        'type': 'KNOWLEDGE_BASE',
        'knowledgeBaseConfiguration': {
            'knowledgeBaseId': knowledge_base_id,
            'modelArn': model_arn,
        }
    }
)
  2. However, for more control over the process, you might consider using the Retrieve API first to get chunks in batches, and then using InvokeModel to generate summaries. This approach gives you more flexibility in handling the summarization process.
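A sketch of that Retrieve-plus-direct-model-call flow, assuming a `threadId` metadata field (as in your setup) and using the Converse API for the generation step. The model ID shown is an assumption; check the exact Nova Lite inference profile available in your region:

```python
def retrieve_all_chunks(kb_id, thread_id, query="thread summary"):
    """Page through the Retrieve API with nextToken, filtering on threadId."""
    import boto3  # imported lazily so the sketch can be read without AWS set up

    client = boto3.client("bedrock-agent-runtime")
    chunks, next_token = [], None
    while True:
        kwargs = {
            "knowledgeBaseId": kb_id,
            "retrievalQuery": {"text": query},
            "retrievalConfiguration": {
                "vectorSearchConfiguration": {
                    "numberOfResults": 100,
                    "filter": {"equals": {"key": "threadId", "value": thread_id}},
                }
            },
        }
        if next_token:
            kwargs["nextToken"] = next_token
        resp = client.retrieve(**kwargs)
        chunks.extend(resp["retrievalResults"])
        next_token = resp.get("nextToken")
        if not next_token:
            return chunks


def summarize(text, model_id="us.amazon.nova-lite-v1:0"):
    """Summarize one batch with a direct model call via the Converse API."""
    import boto3

    client = boto3.client("bedrock-runtime")
    resp = client.converse(
        modelId=model_id,
        messages=[{"role": "user",
                   "content": [{"text": f"Summarize these posts concisely:\n\n{text}"}]}],
    )
    return resp["output"]["message"]["content"][0]["text"]


def build_reduce_prompt(partial_summaries):
    """Second level: merge ordered partial summaries into one reduce prompt."""
    numbered = "\n\n".join(f"[Part {i + 1}]\n{s}"
                           for i, s in enumerate(partial_summaries))
    return ("Merge these partial summaries of one thread into a single summary, "
            "preserving their order:\n\n" + numbered)
```

The first level calls `summarize` once per batch; the second level feeds `build_reduce_prompt(partial_summaries)` back into `summarize` for the final result.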

Ensuring Order

To guarantee chronological order:

  1. Store a timestamp (and, ideally, a sequence number) in each chunk's metadata at ingestion time so posts can be ordered after retrieval.
  2. Sort retrieved results client-side by that timestamp before batching; the Retrieve API ranks results by relevance score, not by time.
  3. When processing batches, carry explicit ordering information (such as a batch index) through each summarization level.
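Since the Retrieve API returns results ranked by relevance rather than time, a small client-side helper can restore chronological order from timestamp metadata. The `createdAt` key below is hypothetical; use whatever timestamp field your ingestion pipeline wrote:

```python
def sort_chunks_chronologically(retrieval_results, timestamp_key="createdAt"):
    """Order Retrieve results by a timestamp stored in chunk metadata.

    Assumes each result dict carries a "metadata" dict whose timestamp
    values sort correctly as strings (e.g. ISO 8601). Results missing
    the key sort first.
    """
    return sorted(
        retrieval_results,
        key=lambda r: r.get("metadata", {}).get(timestamp_key, ""),
    )
```

Run this once over the full set of retrieved chunks before batching, so every batch (and therefore every partial summary) is already in chronological order.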

Best Practices

For large-topic summarization:

  1. Optimize your chunking strategy to create meaningful chunks that balance size and semantic completeness.
  2. Implement hybrid search combining dense and sparse retrieval for better results.
  3. Consider applying reranking models to improve precision by reorganizing retrieved contexts by relevance.
  4. Experiment with different similarity metrics beyond standard cosine similarity.
  5. For very large knowledge bases, consider implementing a retry mechanism where failed jobs are placed in an Amazon SQS Dead Letter Queue for retry.
  6. Monitor the size of your knowledge base and track query performance to identify optimization opportunities.
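For the hybrid-search recommendation above, the Retrieve API exposes an `overrideSearchType` setting on the vector search configuration; a minimal sketch (parameter values are illustrative):

```python
# Hypothetical retrieval configuration enabling hybrid search.
retrieval_configuration = {
    "vectorSearchConfiguration": {
        "numberOfResults": 100,
        # HYBRID combines semantic (vector) and sparse (keyword) matching;
        # supported when the knowledge base uses an OpenSearch vector store.
        "overrideSearchType": "HYBRID",
    }
}
```

Pass this dict as `retrievalConfiguration` in your Retrieve or RetrieveAndGenerate calls.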

By implementing these strategies, you should be able to effectively handle large-topic summarization while maintaining chronological order and ensuring comprehensive coverage of your content.
Sources
Evaluate and improve performance of Amazon Bedrock Knowledge Bases | AWS Machine Learning Blog
Improve search results for AI using Amazon OpenSearch Service as a vector database with Amazon Bedrock | AWS Big Data Blog
Leverage generative AI for biocuration using Amazon Bedrock and Amazon Nova foundation models | AWS Public Sector Blog
Bedrock knowledge base retrieval with opensearch | AWS re:Post

answered 12 days ago