Speech to Text (20 results) showing 1 - 10

NVIDIA Riva

Version 24.05.1
By NVIDIA

Starting from $1.00 to $8.00/hr for software + AWS usage fees

NVIDIA® Riva is a set of GPU-accelerated multilingual speech and translation microservices for building fully customizable, real-time conversational AI pipelines. Riva includes automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) and is deployable in all...

Linux/Unix, Ubuntu 22.04 - 64-bit Amazon Machine Image (AMI)

Meetrix transcriber with API: Audio/Video-to-Text conversion

Version 1.0.6
By Meetrix.io

Meetrix Transcribe simplifies speech-to-text conversion, enabling developers to transcribe audio or video files and streams into accurate text. With easy integration via APIs, it supports various audio formats and numerous languages. Transcribe Step-by-step Installation Guide:...

AssemblyAI

By AssemblyAI

23 external reviews

AssemblyAI offers Speech AI models via an API that product teams and developers can use to build powerful AI solutions based on voice data for their users. Thousands of developers build on AssemblyAI's Speech AI models every day to run Speech-to-Text on multilingual speech, and harness the power of...

Eden AI - Buy credits

By Eden AI

Eden AI is a versatile platform that connects to a wide array of AI services from big names like Google, AWS, and OpenAI, as well as niche providers like Clarifai and NLPCloud. Its API lets you handle a variety of tasks: from processing documents (like invoices and IDs) to advanced text analysis...

Gladia Speech-To-Text

By Gladia

Gladia's core product is an enterprise-grade audio intelligence API. The API is distinguished by exceptional accuracy and speed of transcription, available in both real-time and asynchronous versions. The company's latest hybrid ASR system, Whisper-Zero, is based on an enhanced and optimized...

Generative AI Assistant for Enterprise Users

By Pattern AI

PatternAI is a standalone generative AI assistant built for enterprise users. With an unmatched context window of 1 million tokens, you can give Pattern any content you'd like -- any meetings, any documents or data, or even a combination -- and you can "chat" with Pattern about that content. Or...

Free Trial

KanjuTech Transcription and Diarization

Version 1.17
By KanjuTech

KanjuTech's Transcription and Diarization model ensures secure end-to-end recognition of multi-participant conversations. It converts dialogue records into precise transcripts with labeled speakers and lines, offering automatic detection for any number of participants. With low error rates (WER and...

Model Package - Fulfilled on Amazon SageMaker

Automatic Speech Recognition

Version 1.3
By HARMAN Digital Transformation Solutions

This solution uses state-of-the-art transformer-based models, performing speech processing for audio transcription. Designed with enterprise scalability in mind, it caters to the needs of businesses of all sizes. This solution ensures fast and accurate transcriptions, while optimizing resource...

Model Package - Fulfilled on Amazon SageMaker

Free Trial

SquadStack Speech Recognition (Hinglish)

Version v1.0
By SquadStack

Meet SquadStack ASR, a groundbreaking speech-to-text solution designed for both technology enthusiasts and business users. This innovative AI model boasts top-notch Word Error Rate (WER) performance, ensuring highly accurate transcriptions. SquadStack ASR also offers breakthrough performance in...

Model Package - Fulfilled on Amazon SageMaker

Verus

By LEO Technologies

Verus by LEO Technologies is the only search and analytics platform for public safety agencies and correctional institutions that proactively enhances public, personnel, and inmate safety. The Verus System is proprietary, cloud-based Software-as-a-Service (SaaS) on the Amazon Web Services (AWS)...
