
Speech Recognition (39 results) showing 1 - 10



NVIDIA NeMo is an open-source toolkit with a PyTorch backend that pushes the abstractions one step further. NeMo lets you quickly compose and train complex, state-of-the-art neural network architectures with three lines of code. It is used to build models for real-time...


The Kaldi Speech Recognition Toolkit project began in 2009 at Johns Hopkins University with the intent of developing techniques to reduce both the cost and time required to build speech recognition systems. While originally focused on ASR support for new languages and domains, the Kaldi project has...


Through this NeMo application, we empower you to create your own ASR models built for your domain-specific data. Developers have complete control over their data, unlike when using a 'black box' ASR tool available in the cloud, giving you the ability to create better-performing ASR models for your...

Free Trial


Deepgram is the only true end-to-end deep learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of transcribing noisy, multi-speaker, hard-to-understand audio, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker


VoiceWorx.ai SmartOffice is a no-code enterprise skill management platform that allows users to easily create and publish secure Alexa Skills with integrations to enterprise data sources such as Salesforce, ServiceNow, Zendesk, and various other popular systems used at work. The...


Starting from $0.76 to $3.42/hr for software + AWS usage fees

An AMI product that provides state-of-the-art ASR (speech-to-text) technology accessible via a REST API. It is well suited to building voicebots, voice assistants, speech analytics systems, and many other use cases.

Linux/Unix, Ubuntu 20.04 - 64-bit Amazon Machine Image (AMI)


This model is trained to recognize if there is any sort of background noise (be it a dog barking, street sounds, static, airplane noise, or anything other than the main speaker speaking) when there is a single speaker in the audio snippet. Fundamentally, it classifies an audio recording as noisy or...

Model Package - Fulfilled on Amazon SageMaker

Free Trial


Welcome to LabVoice, the laboratory digital assistant. The LabVoice digital assistant optimizes your laboratory processes, transforming how scientists interact with their instruments and software. It is a modern scientific app to run those workflows, enabling guided process execution and hands...