RAG with Amazon Bedrock

Introduction

RAG stands for Retrieval-Augmented Generative. It refers to a class of natural language processing models that combines generative capabilities with information retrieval mechanisms. RAG models aim to enhance the generation of natural language text by allowing the model to retrieve and incorporate relevant information from a pre-existing knowledge base.

In the previous part, we learnt about the Basics of RAG. The architecture of a typical RAG model includes a generative language model, such as a transformer-based model, and a retrieval component. The retrieval component enables the model to search and retrieve information from a specified knowledge source, often a large database or a collection of documents. The generative model then uses this retrieved information to produce more contextually relevant and informed responses.

Pioneers in Cloud Consulting & Migration Services

Reduced infrastructural costs
Accelerated application deployment

Get Started

Components of Amazon Bedrock

Text playground: Hands-on text generation application in the AWS Management Console.
Image playground: Hands-on image generation application in the console.
Chat playground: Hands-on conversation generation application in the console.
Examples library: Example use cases provided for loading.
Amazon Bedrock API: Explore using AWS CLI or access the API to interact with base models.
Embeddings: Utilize the API to generate embeddings from Titan text and image models.
Agents for Amazon Bedrock: Develop agents for orchestration and task execution for customers.
Knowledge base for Amazon Bedrock: Draw from data sources to help agents find information for customers.
Provisioned Throughput: Purchase throughput for discounted rates to run inference on models.
Fine-tuning and Continued Pre-training: Customize Amazon Bedrock base models for improved performance and customer experience.
Model invocation logging: Collect logs, input data, and output data for all invocations in your AWS account using Amazon Bedrock.
Model versioning: Benefit from continuous updates and improvements in foundation models to enhance application capabilities, accuracy, and safety.

Fully Managed RAG on Amazon Bedrock

The Knowledge Bases for Amazon Bedrock streamline the entire RAG workflow on your behalf. You indicate the data location and choose an embedding model for converting the data into vector embeddings. Amazon Bedrock then generates a vector store to house the vector data in your account. Opting for this choice, exclusively available in the console, results in Amazon Bedrock establishing a vector index in Amazon OpenSearch Serverless within your account, eliminating the necessity for manual management tasks.

rag

Vector embeddings are numerical representations of text data found in your documents. These embeddings are designed to encapsulate the semantic or contextual meaning of the data they represent. In the context of Amazon Bedrock, the platform handles the entire lifecycle of your embeddings, including their creation, storage, management, and updates within the vector store. Amazon Bedrock ensures that your data consistently remains synchronized with the corresponding vector store.

With the new RetrieveAndGenerate API, you can directly retrieve relevant information from your knowledge bases and have Amazon Bedrock generate a response from the results by specifying an FM in your API call.

rag2

In the background, Amazon Bedrock transforms queries into embeddings, interacts with the knowledge base, and enhances the FM prompt by incorporating search results as contextual information. Subsequently, it delivers the FM-generated response to address my inquiry. In the case of multi-turn conversations, Knowledge Bases effectively handle the short-term memory of the conversation, ensuring more contextualized results.

Python Code

def retrieveAndGenerate(input, kbId):
    return bedrock_agent_runtime.retrieve_and_generate(
        input={
            'text': input
        },
        retrieveAndGenerateConfiguration={
            'type': 'KNOWLEDGE_BASE',
            'knowledgeBaseConfiguration': {
                'knowledgeBaseId': kbId,
                'modelArn': 'arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-instant-v1'
                }
            }
        )

response = retrieveAndGenerate("What is Amazon Bedrock?", "AES9P3MT9T")["output"]["text"]

def retrieveAndGenerate(input, kbId):

return bedrock_agent_runtime.retrieve_and_generate(

input={

'text': input

retrieveAndGenerateConfiguration={

'type': 'KNOWLEDGE_BASE',

'knowledgeBaseConfiguration': {

'knowledgeBaseId': kbId,

'modelArn': 'arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-instant-v1'

}

)

response = retrieveAndGenerate("What is Amazon Bedrock?", "AES9P3MT9T")["output"]["text"]

The output of the RetrieveAndGenerate API includes the generated response, the source attribution, and the retrieved text chunks.

Response to the above code:

{ ... 
    'output': {'text': 'Amazon Bedrock is a managed service from AWS that ...'}, 
    'citations': 
        [{'generatedResponsePart': 
             {'textResponsePart': 
                 {'text': 'Amazon Bedrock is ...', 'span': {'start': 0, 'end': 241}}
             }, 
	      'retrievedReferences': 
			[{'content':
                 {'text': 'All AWS-managed service API activity...'}, 
				 'location': {'type': 'S3', 's3Location': {'uri': 's3://data-generative-ai-on-aws/gaia.pdf'}}}, 
		     {'content': 
			      {'text': 'Changing a portion of the image using ...'}, 
				  'location': {'type': 'S3', 's3Location': {'uri': 's3://data-generative-ai-on-aws/gaia.pdf'}}}, ...]
        ...}]
}

{ ...

'output': {'text': 'Amazon Bedrock is a managed service from AWS that ...'},

'citations':

[{'generatedResponsePart':

{'textResponsePart':

{'text': 'Amazon Bedrock is ...', 'span': {'start': 0, 'end': 241}}

'retrievedReferences':

[{'content':

{'text': 'All AWS-managed service API activity...'},

'location': {'type': 'S3', 's3Location': {'uri': 's3://data-generative-ai-on-aws/gaia.pdf'}}},

{'content':

{'text': 'Changing a portion of the image using ...'},

'location': {'type': 'S3', 's3Location': {'uri': 's3://data-generative-ai-on-aws/gaia.pdf'}}}, ...]

...}]

}

Conclusion

In conclusion, Amazon Bedrock’s Knowledge Base is a game-changer for developers seeking to harness the power of information. Whether integrating RAG for dynamic response generation or empowering agents with advanced reasoning capabilities, the possibilities are vast.

Developers can create intelligent applications that stand out in today’s competitive technological landscape by understanding and implementing the various ways to leverage the Knowledge Base. Unlock the true potential of your data with Amazon Bedrock’s Knowledge Base and revolutionize your application development journey.

Drop a query if you have any questions regarding Amazon Bedrock and we will get back to you quickly.

Empowering organizations to become ‘data driven’ enterprises with our Cloud experts.

Reduced infrastructure costs
Timely data-driven decisions

Get Started

About CloudThat

CloudThat is an award-winning company and the first in India to offer cloud training and consulting services worldwide. As a Microsoft Solutions Partner, AWS Advanced Tier Training Partner, and Google Cloud Platform Partner, CloudThat has empowered over 850,000 professionals through 600+ cloud certifications winning global recognition for its training excellence including 20 MCT Trainers in Microsoft’s Global Top 100 and an impressive 12 awards in the last 8 years. CloudThat specializes in Cloud Migration, Data Platforms, DevOps, IoT, and cutting-edge technologies like Gen AI & AI/ML. It has delivered over 500 consulting projects for 250+ organizations in 30+ countries as it continues to empower professionals and enterprises to thrive in the digital-first world.

FAQs

1. How does Amazon Bedrock manage vector embeddings in the context of text data?

ANS: – To explore the process involved in creating, storing, and synchronizing vector embeddings within the vector store by Amazon Bedrock.

2. Can you elaborate on the role of Knowledge Bases in the RAG workflow and short-term memory management for multi-turn conversations?

ANS: – To understand how Knowledge Bases for Amazon Bedrock streamline the end-to-end RAG workflow and manage the short-term memory of multi-turn conversations to provide contextual results.