Sign in
Categories
Your Saved List Partners Sell in AWS Marketplace Amazon Web Services Home Help

Speech to Text (25 results) showing 1 - 10


  • Version 22.04
  • Sold by NVIDIA

The Kaldi Speech Recognition Toolkit project began in 2009 at Johns Hopkins University with the intent of developing techniques to reduce both the cost and time required to build speech recognition systems. While originally focused on ASR support for new languages and domains, the Kaldi project has...

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker


This container makes it possible to quickly deploy a pretrained, english-language transcription model. You can send any sample rate of WAV files, but they will be converted to 32kHz. Metadata is outputted as JSON. This container uses Kaldi, an open-source speech recognition toolkit written in C++...

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker


Starting from $0.76 to $3.42/hr for software + AWS usage fees

An AMI product that provides a state-of-the-art ASR (Speech to Text) technology accessible via a REST API. It is a perfect technology for building voicebots, voice assistants, speech analytics systems, and many other use cases.

Linux/Unix, Ubuntu 20.04 - 64-bit Amazon Machine Image (AMI)


Starting from $0.76 to $3.42/hr for software + AWS usage fees

An AMI product that provides a state-of-the-art ASR (Speech to Text) technology accessible via a REST API. It is a perfect technology for building voicebots, voice assistants, speech analytics systems, and many other use cases.

Linux/Unix, Ubuntu 20.04 - 64-bit Amazon Machine Image (AMI)


Starting from $0.76 to $3.42/hr for software + AWS usage fees

An AMI product that provides a state-of-the-art ASR (Speech to Text) technology accessible via a REST API. It is a perfect technology for building voicebots, voice assistants, speech analytics systems, and many other use cases.

Linux/Unix, Ubuntu 20.04 - 64-bit Amazon Machine Image (AMI)