Overview
Deepgram Voice AI Flux Multilingual Speech-to-Text (STT) Streaming enables real-time, high-accuracy transcription across multiple languages using Amazon SageMaker's bidirectional streaming inference. Built on Deepgram's state-of-the-art Flux model, this product delivers low-latency speech recognition ideal for live captioning, voice assistants, call center analytics, and customer experience applications.
Deployment is straightforward through Amazon SageMaker endpoints. Customers run real-time inference using SageMaker bidirectional streaming APIs, enabling seamless integration into existing applications and workflows with minimal friction.
The Flux model supports multilingual transcription, allowing businesses to serve global audiences with a single unified solution. With Deepgram's proven AI infrastructure, customers benefit from enterprise-grade reliability, performance, and accuracy across a broad range of languages and acoustic environments.
Highlights
- Real-time multilingual speech-to-text transcription powered by Deepgram's state-of-the-art Flux model via Amazon SageMaker streaming inference
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Free trial
Dimension | Description | Cost/host/hour |
|---|---|---|
ml.m5.xlarge Inference (Batch) Recommended | Model inference on the ml.m5.xlarge instance type, batch mode | $37.00 |
ml.g6.2xlarge Inference (Real-Time) Recommended | Model inference on the ml.g6.2xlarge instance type, real-time mode | $51.80 |
ml.g6e.2xlarge Inference (Real-Time) | Model inference on the ml.g6e.2xlarge instance type, real-time mode | $74.00 |
ml.g5.2xlarge Inference (Real-Time) | Model inference on the ml.g5.2xlarge instance type, real-time mode | $37.00 |
ml.g4dn.2xlarge Inference (Real-Time) | Model inference on the ml.g4dn.2xlarge instance type, real-time mode | $51.80 |
Vendor refund policy
n/a
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
Added Flux Multilingual model support for streaming speech-to-text transcription across multiple languages.
Additional details
Inputs
- Summary
Audio for transcription
- Input MIME type
- audio/*, *
Support
Vendor support
For product support, visit https://deepgram.com/support or email support@deepgram.com . Documentation and sample notebooks are available at https://github.com/deepgram-devs/dg-sagemaker . Enterprise support options are available for SageMaker deployments.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.