Sign in
Categories
Your Saved List Partners Sell in AWS Marketplace Amazon Web Services Home Help

Speech Recognition (16 results) showing 1 - 10


  • Version v1.0.0b2
  • Sold by NVIDIA

NVIDIA NeMo is an open-source toolkit with a PyTorch backend that pushes the abstractions one step further. NeMo makes it possible for you to quickly compose and train complex, state-of-the-art, neural network architectures with three lines of code. It is used to build models for real-time...


VoiceWorx.ai SmartOffice is a No-Code enterprise skill management platform that allows users to easily create and publish secure Alexa Skills with connections/integrations to enterprise data sources like SalesForce, ServiceNow, Zendesk and various other popular systems used at work. The...


Oration enables contact centres to deliver full natural language call routing solutions with no technical capability. The product is designed to enable Contact Centre managers to fully control, monitor and analyse the call routing experience to reduce AHT, deflect calls through targeted banner...

  • Version 20.11
  • Sold by NVIDIA

The Kaldi Speech Recognition Toolkit project began in 2009 at Johns Hopkins University with the intent of developing techniques to reduce both the cost and time required to build speech recognition systems. While originally focused on ASR support for new languages and domains, the Kaldi project has...

  • Version 20.07
  • Sold by NVIDIA

Through this NeMo application, we empower you to create your own ASR models built for your domain specific data. Developers have complete control over their data unlike when using a 'black box' ASR tool available in the cloud, giving you the ability to create better performing ASR models for your...


The Fonznik is a speech controlled service that places calls to members of your contact list, or to telephone numbers that you say. When you dial The Fonznik from your cell phone or land-line you will be asked to say the name of your contact, or say the number, to call. The Fonznik will place the...


This model is trained to recognize if there is any sort of background noise (be it a dog barking, street sounds, static, airplane noise, or anything other than the main speaker speaking) when there is a single speaker in the audio snippet. Fundamentally, it classifies an audio recording as noisy or...

Model Package - Fulfilled on Amazon SageMaker


Deepgram Brain Speech Recognition provides high accuracy, scalable speech recognition built to deliver actionable insights from voice data in realtime. With deep learning speech models optimally trained to understand your recorded audio, Deepgram run on Amazon’s cloud is the one-stop-shop for...

Model Package - Fulfilled on Amazon SageMaker