Sign in
Categories
Your Saved List Partners Sell in AWS Marketplace Amazon Web Services Home Help

Speech Recognition (38 results) showing 1 - 10



NVIDIA NeMo is an open-source toolkit with a PyTorch backend that pushes the abstractions one step further. NeMo makes it possible for you to quickly compose and train complex, state-of-the-art, neural network architectures with three lines of code. It is used to build models for real-time...


The Kaldi Speech Recognition Toolkit project began in 2009 at Johns Hopkins University with the intent of developing techniques to reduce both the cost and time required to build speech recognition systems. While originally focused on ASR support for new languages and domains, the Kaldi project has...

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker


Through this NeMo application, we empower you to create your own ASR models built for your domain specific data. Developers have complete control over their data unlike when using a 'black box' ASR tool available in the cloud, giving you the ability to create better performing ASR models for your...


Starting from $0.76 to $3.42/hr for software + AWS usage fees

An AMI product that provides a state-of-the-art ASR (Speech to Text) technology accessible via a REST API. It is a perfect technology for building voicebots, voice assistants, speech analytics systems, and many other use cases.

Linux/Unix, Ubuntu 20.04 - 64-bit Amazon Machine Image (AMI)


Symbl contextual conversation intelligence platform provides scalable, secure speech recognition and contextual analytics to build differentiated experiences from voice, video or text data in realtime. Generate action items, questions, appointments, topic hierarchy, topics, sentiments, action...


VoiceWorx.ai SmartOffice is a No-Code enterprise skill management platform that allows users to easily create and publish secure Alexa Skills with connections/integrations to enterprise data sources like SalesForce, ServiceNow, Zendesk and various other popular systems used at work. The...


This model is trained to recognize if there is any sort of background noise (be it a dog barking, street sounds, static, airplane noise, or anything other than the main speaker speaking) when there is a single speaker in the audio snippet. Fundamentally, it classifies an audio recording as noisy or...

Model Package - Fulfilled on Amazon SageMaker

Free Trial


Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale for enterprise. We take the heavy lifting out of noisy, multi-speaker, hard to understand audio transcription, so you can focus on getting the insights you need from your voice data. Use it alone...

Model Package - Fulfilled on Amazon SageMaker


Lingvanex Translation On-premise Server is enterprise solution for a secure translation of text, documents and speech within your organization. Free demo server includes 5 languages: English, Spanish, Chinese Simplified, French, German. Note that the demo version includes neural models with average...

Linux/Unix, Ubuntu 20.04 - 64-bit Amazon Machine Image (AMI)