Overview
Deepgram powers end-to-end voice solutions on AWS, from real-time transcription to lifelike speech synthesis and interruptible, human-like voice agents. Deploy our STT/TTS and agent runtime where you need them: in SageMaker, in a Deepgram-managed Dedicated environment, or self-hosted inside your AWS VPC for maximum control and compliance, with native touchpoints to Amazon Bedrock and Amazon Connect to compose complete voice workflows.
Procure through AWS Marketplace to accelerate onboarding with usage-based pricing and consolidated billing on your AWS invoice, ideal for trials, POCs, and scaling to production while aligning with AWS commitments.
Deepgram's AWS alignment includes the AWS Generative AI Competency and a multi-year strategic collaboration, giving teams confidence that integrations, co-sell, and global scale on AWS are first class.
Use cases include real-time contact center transcription and automation with Amazon Connect and Lex, Bedrock-powered voice agents with Deepgram STT/TTS, and streaming/batch analytics via S3, API Gateway, Lambda, and EKS/EC2, all built on the AWS patterns your teams already trust.
Highlights
- Real-time STT for human-like conversations: Sub-300 ms streaming latency with industry-leading accuracy (Nova-3: 6.84% median WER streaming; 5.26% batch) to keep pace with fast, noisy speech.
- Natural, low-latency TTS: Sub-250 ms responses with lifelike speech and streaming delivery for natural turn-taking in real time.
- Production-ready voice agents on AWS: Combine Deepgram STT/TTS with Amazon Bedrock for reasoning, and Amazon Connect and Lex for contact center workflows, supporting interruptible, human-like dialogs at scale.
Details
Pricing
Free trial
| Dimension | Description | Cost/host/hour |
|---|---|---|
| ml.m5.xlarge Inference (Batch) (Recommended) | Model inference on the ml.m5.xlarge instance type, batch mode | $3.90 |
| ml.g6.2xlarge Inference (Real-Time) (Recommended) | Model inference on the ml.g6.2xlarge instance type, real-time mode | $16.90 |
| ml.g6e.2xlarge Inference (Real-Time) | Model inference on the ml.g6e.2xlarge instance type, real-time mode | $32.50 |
| ml.g5.2xlarge Inference (Real-Time) | Model inference on the ml.g5.2xlarge instance type, real-time mode | $14.95 |
| ml.g4dn.2xlarge Inference (Real-Time) | Model inference on the ml.g4dn.2xlarge instance type, real-time mode | $3.90 |
Vendor refund policy
No refunds.
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
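As a rough illustration, the sketch below uses the SageMaker Python SDK to create and deploy a model from a subscribed Marketplace model package to a real-time endpoint. The model package ARN, endpoint name, and region are placeholders; the instance type is one of those listed in the pricing table above.

```python
# Minimal sketch: deploy a subscribed SageMaker model package for real-time inference.
# The model package ARN below is a placeholder; use the ARN from your Marketplace subscription.
import sagemaker
from sagemaker import ModelPackage

session = sagemaker.Session()
role = sagemaker.get_execution_role()  # or an explicit IAM role ARN

model = ModelPackage(
    role=role,
    model_package_arn="arn:aws:sagemaker:us-east-1:123456789012:model-package/deepgram-example",  # placeholder
    sagemaker_session=session,
)

# Deploy to a real-time endpoint on one of the listed GPU instance types.
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g6.2xlarge",
    endpoint_name="deepgram-stt-endpoint",  # example name
)
```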
Version release notes
V1 Release
Additional details
Inputs
- Summary
Binary audio payload for transcription; see the Deepgram streaming speech-to-text reference at https://developers.deepgram.com/reference/speech-to-text/listen-streaming and the invocation sketch below.
- Input MIME type
- audio/*, *
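
To show what an invocation might look like, here is a minimal sketch that sends a binary audio payload to a deployed real-time endpoint with boto3. The endpoint name and audio file are hypothetical, and the exact request options and response schema are defined in the Deepgram documentation linked above.

```python
# Minimal sketch: send a binary audio payload to a deployed endpoint for transcription.
import boto3

runtime = boto3.client("sagemaker-runtime")

with open("sample.wav", "rb") as f:  # hypothetical local audio file
    audio_bytes = f.read()

response = runtime.invoke_endpoint(
    EndpointName="deepgram-stt-endpoint",  # example endpoint name
    ContentType="audio/wav",               # listing accepts audio/* MIME types
    Body=audio_bytes,
)

# The response body contains the transcription result returned by the model.
print(response["Body"].read().decode("utf-8"))
```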
Support
Vendor support
Basic support is provided through email (aws@deepgram.com). Premium and VIP support packages are also available for enterprise clients.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.