Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Speech to Text (20 results) showing 1 - 10



Starting from $1.00 to $8.00/hr for software + AWS usage fees

NVIDIA® Riva is a set of GPU-accelerated multilingual speech and translation microservices for building fully customizable, real-time conversational AI pipelines. Riva includes automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) and is deployable in all...

Linux/Unix, Ubuntu 22.04 - 64-bit Amazon Machine Image (AMI)


Meetrix Transcribe simplifies speech-to-text conversion, enabling developers to transcribe audio or video files and streams into accurate text. With easy integration via APIs, it supports various audio formats and numerous languages. Transcribe Step-by-step Installation Guide:...


AssemblyAI provides AI models to transcribe and analyze audio and speech data through our production-ready, scalable web API. Our latest AI model for speech recognition (Conformer-2) achieves state-of-the-art accuracy on a wide variety of academic and real-world datasets compared to other ASR...


Eden AI is a versatile platform that connects to a wide array of AI services from big names like Google, AWS, and OpenAI, as well as niche providers like Clarifai and NLPCloud. Its API lets you handle a variety of tasks: from processing documents (like invoices and IDs) to advanced text analysis...


Gladia's core product is an enterprise-grade audio intelligence API. The API is distinguished by exceptional accuracy and speed of transcription, available in both real-time and asynchronous versions. The company's latest hybrid ASR system, Whisper-Zero, is based on an enhanced and optimized...


PatternAI is a standalone generative AI assistant built for enterprise users. With an unmatched context window of 1 Million tokens, you can give Pattern any content you'd like -- any meetings, any documents or data, or even combination -- and you can ''chat'' with Pattern about that content. Or...

Free Trial


KanjuTech's Transcription and Diarization model ensures secure end-to-end recognition of multi-participant conversations. It converts dialogue records into precise transcripts with labeled speakers and lines, offering automatic detection for any number of participants. With low error rates (WER and...

Model Package - Fulfilled on Amazon SageMaker


This solution uses state-of-the-art transformer-based models, performing speech processing for audio transcription. Designed with enterprise scalability in mind, it caters to the needs of businesses of all sizes. This solution ensures fast and accurate transcriptions, while optimizing resource...

Model Package - Fulfilled on Amazon SageMaker

Free Trial


Meet SquadStack ASR, a groundbreaking speech-to-text solution designed for both technology enthusiasts and business users. This innovative AI model boasts top-notch Word Error Rate (WER) performance, ensuring highly accurate transcriptions. SquadStack ASR also offers breakthrough performance in...

Model Package - Fulfilled on Amazon SageMaker


Verus by LEO Technologies is the only search and analytics platform for public safety agencies and correctional institutions that proactively enhances public, personnel, and inmate safety. The Verus System is proprietary, cloud-based Software-as-a-Service (SaaS) on the Amazon Web Services (AWS)...