.NET on AWS Blog
Implementing Semantic Search using Amazon Bedrock and RDS for PostgreSQL in .NET
Introduction
Large language models (LLMs) are driving the rapid growth of semantic search applications. Semantic search understands both user intent and content context, rather than just matching keywords. LLMs enhance this capability through their advanced language processing abilities. These AI models can process multiple content formats, including text, images, audio, and video. The users receive comprehensive search results across different media types that match their intent. For example, a natural language query about “how to make sushi” might return text recipes, instructional videos and step-by-step images.
Leading search engine companies integrate semantic search to improve result accuracy. This advancement relies primarily on Embeddings and Vector Database. This blog post walks you through the step-by-step process of implementing semantic search capabilities in a .NET application using Amazon Bedrock and Amazon RDS for PostgreSQL.
Solution Overview
The solution integrates a .NET application with Amazon Titan Text Embeddings models to convert plain text into numeric vectors (embeddings) and stores them in Amazon PostgreSQL database for semantic search operations. This solution uses Amazon Titan Text Embedding v2 model (amazon.titan-embed-text-v2:0) to convert unstructured content like documents, paragraphs, and sentences into vector representations. The embeddings are stored and queried using pgvector, an open source extension for PostgreSQL that adds support for vector operations. This extension provides specialized data types for vector storage and functions for similarity calculations. User can combine these vector capabilities with standard SQL queries to perform complex similarity operations.
The following diagram illustrates the high-level architecture of semantic search application.
data:image/s3,"s3://crabby-images/a0f8f/a0f8f848043c18b5b961c959687cd65a73ec4b88" alt="Architecture diagram showing text transformed to embeddings through Amazon Bedrock, stored in PostgreSQL for semantic search using an app interface."
Figure 1: Semantic search application architecture
The solution architecture involves the following steps:
- Send text input to the embedding model in Amazon Bedrock.
- Generate embeddings for the input text.
- Store embeddings in PostgreSQL database.
- Send search query to embedding model in Amazon Bedrock.
- Generate embeddings for search query.
- Execute semantic search queries against stored embeddings.
Prerequisites
Before proceeding, make sure you have:
- An AWS account with permission to use Amazon Bedrock and Amazon RDS.
- Get the latest version of AWS Command Line Interface (AWS CLI), and Configure the AWS CLI.
- Enable Amazon Titan Text Embeddings V2 model with Amazon Bedrock access.
- Install Visual Studio (or your preferred .NET IDE).
- Install .NET 8.0 SDK .
- Install PgAdmin.
Set up .NET application with pgvector
This step-by-step guide demonstrates how to store, query, and search vector embeddings using AWS services. You can leverage pgvector on Amazon RDS for PostgreSQL to set up, operate, and scale databases for machine learning (ML) enabled applications. The pgvector extension is available on Amazon RDS PostgreSQL 15.2 and higher versions.
- Create a .NET app
- Generate Embeddings from plain text using Amazon Bedrock
- Provision database by creating Amazon RDS for PostgreSQL
- Enable pgvector for Vector Operations
- Store Embeddings in the PostgreSQL Database
- Implement Semantic Search on .NET application
Walkthrough
Step 1: Create a .NET app
The solution uses a .NET console application, but can be used with all .NET project types, including Windows Forms and Web Application Programming Interfaces (APIs). It demonstrates semantic search using movie descriptions to create embeddings. This process works with any text data, but for better processing accuracy, divide larger documents into smaller chunks.
1. Create a .NET 8 console app using Visual Studio. Select Do not use top-level statements.
2. Open the solution in your IDE, and add the following code in Program.cs
file to define a list of movies.
Step 2: Generate Embeddings from plain text using Amazon Bedrock
Amazon Bedrock offers several embedding models from various providers. I will use Amazon Titan Text Embeddings V2 model to convert text into embeddings.
1. Install AWSSDK.BedrockRuntime NuGet package to the project. The package enables .NET application to interact with foundation models from Amazon Bedrock through a managed API client.
2. Add the following code in Program.cs
to generate embeddings.
The GenerateEmbeddingAsync
method creates vector embeddings from input text using the TitanEmbedTextV2Model through AmazonBedrockRuntimeClient
. The model accepts the following parameters:
inputText
– To convert text into embeddingsdimensions
– The following values are accepted as dimensions for output embeddings: 1024 (default), 512, 256.normalize
– Controls embedding normalization (defaults to true)
3. Add the following code to the Main
method in Program.cs
.
After running the .NET application, vector embeddings are generated for each movie.
data:image/s3,"s3://crabby-images/d3c32/d3c3212c8d38a4ce646b23844934d85a67076d62" alt="Visual Studio console showing vector embeddings output for multiple movie titles with numerical values."
Figure 2: Vector embedding in .NET application
Step 3: Provision database by creating Amazon RDS for PostgreSQL
Create an Amazon RDS for PostgreSQL instance to handle vector operations. Amazon RDS provides scalable storage and querying capabilities with automated maintenance, backups, and security updates.
1. Create Amazon RDS for PostgreSQL instance by following steps: Creating and connecting to a PostgreSQL DB instance .
2. Connect to PostgreSQL using pgAdmin by following steps: Connecting to a PostgreSQL DB instance using pgAdmin.
3. Install pgvector extension on the self-managed PostgreSQL as an alternate to Amazon RDS for PostgreSQL.
Step 4: Enable pgvector for Vector Operations
Amazon RDS and Amazon Aurora for PostgreSQL include pre-installed PostgreSQL extensions by default.
1. To enable pgvector on PostgreSQL database, execute the following command using the PSQL tool :
CREATE EXTENSION vector;
2. Use following command to create a new table with a vector type column.
This creates a movies table with vector_description
column storing 256-dimensional vectors. The dimension value (256) must match the embedding size specified when generating embeddings in Step 2.
Vector dimensions affect semantic search performance. Higher dimensions provide better semantic search accuracy, but consume more storage space and require longer processing time. Lower dimensions process faster but may reduce search accuracy. The choice of dimensions depends on balancing your performance requirements with search precision needs.
Step 5: Store Embeddings in the PostgreSQL Database
The Pgvector.EntityFrameworkCore NuGet package enables PostgreSQL vector data type support in .NET applications. With this package, developers can define vector properties in EF Core entity models that map to the corresponding vector data type column in PostgreSQL. The integration provides seamless storage and retrieval of vector data within .NET applications, eliminating the need to handle low-level PostgreSQL implementation details.
1. Install Pgvector.EntityFrameworkCore NuGet package.
2. Create a new file named Movie.cs
in your project to define the entity model:
3. Create a new file named MovieContext.cs
in your project and define a DbContext
class:
4. Configure the database connection string in DbContext
to establish database connectivity. It is recommended to store the connection string externally. To keep application simple, define the connection string in the Program.cs
file and replace placeholders <endpoint>
and <password>
with your Amazon RDS for PostgreSQL endpoint and password, respectively.
5. Add the following code in Program.cs
to store movies and their embeddings:
6. Update the Main
method with the following code in Program.cs
:
After running the .NET application, verify the data ingestion by executing following SELECT query in PgAdmin:
data:image/s3,"s3://crabby-images/7a90d/7a90d70dc692aa6b47e2977f4085dcd624a4419d" alt="PgAdmin query result showing movie table with columns for ID, name, description, and vector_description containing vector embeddings."
Figure 3: Vector embedding in PostgreSQL Database
Step 6: Implement Semantic Search on .NET application
Implementing semantic search in .NET applications involves two key steps: first, the embedding model converts search query (plain text) into a vector embedding, then use pgvector Entity Framework Core to calculate cosine distance and find the nearest neighbours.
1. Add the following code in Program.cs to find semantic similar movies with distance:
Cosine similarity is a popular technique for finding semantic similarity, particularly in natural language processing and information retrieval. It measures the cosine of the angle between two vectors in a multi-dimensional space.
2. Update the Main
method with the following code to test semantic search:
Launch the application from Visual Studio, or enter the following command in the terminal:
dotnet run -- "a movie suitable for the whole family, including children"
After running the .NET application, it displays the semantically similar movies, ordered by their cosine distance. Lower distance values indicate higher similarity to search query. Try your own queries.
data:image/s3,"s3://crabby-images/42a99/42a99aefa9692f84804c23524e97424538e5ea33" alt="Visual Studio console showing search result of semantically similar movies with cosine distance scores."
Figure 4: .NET application semantic search output
Step 7 (optional): Index Vector Data
In addition to the previous steps, users can create vector indexes to improve read performance from large datasets. Vector indexes are specialized data structures that optimize storage and retrieval of high-dimensional vector data, which is crucial for semantic search applications. pgvector offers two modes for nearest neighbor search: exact (default) and approximate. The exact search returns all relevant results with complete accuracy, while the approximate nearest neighbor search offers faster performance by using dedicated indexes like HNSW and IVF FLAT.
Create an index using either method:
Clean-up
Clean up the resources that were created for the sample application to avoid incurring charges. If the foundational model is no longer needed, refer to Add or remove access to Amazon Bedrock foundation models and follow the steps to remove the model access. Delete the Amazon RDS for PostgreSQL database to avoid incurring charges, follow the instructions in the Amazon RDS User Guide: Deleting a DB Instance.
Conclusion
In this post, you learned how to build a semantic search application using Amazon Bedrock and Amazon RDS for PostgreSQL in .NET. PostgreSQL was chosen as the database solution because the enterprise applications majorly use relational databases, and it extends traditional database capabilities with vector operations. This approach adds semantic search functionality to the existing applications while leveraging familiar PostgreSQL features. I encourage you to explore building your own semantic search applications. Refer to supported foundation models in Amazon Bedrock and AWS Vector Database solutions to find the best fit for your semantic search needs.