Overview
AssemblyAI offers Speech AI models via an API that product teams and developers can use to build powerful AI solutions based on voice data. Thousands of developers build on AssemblyAI's Speech AI models every day to run Speech-to-Text on multilingual speech and use Large Language Models to extract the full value from that voice data, including answering questions, generating content, and extracting metadata in seconds. AssemblyAI offers two of the world's most powerful and accurate async transcription models, as well as real-time transcription with ultra-high accuracy, low latency, and built-in turn detection.
AssemblyAI gives you access to state-of-the-art Speech AI models and capabilities for real-world use cases with unlimited concurrency and no upfront contract commitment, so you can build smarter applications in a fraction of the time. Models and features include:
- Speech recognition
- Keyterms prompting for streaming
- Auto language detection
- Translation
- Speaker diarization and identification
- Auto punctuation and casing
- Custom formatting
- Custom spelling
- Custom vocabulary
- Guardrails, including Content Moderation, PII Redaction, and Profanity Filtering
- Filler word filtering
- Summarization
- Sentiment analysis
- Auto highlights
- Topic detection (IAB classification)
- Entity detection
- Auto chapters
- Dual channel transcription
- Export SRT or VTT caption files
In addition, LLM Gateway allows you to connect speech-to-text outputs directly to your preferred leading LLM provider through a single, unified API for tasks like output fine-tuning, summarization, question & answer, and AI coaching feedback.
Our Speech AI products support 33 different audio and video file types and 99+ languages. Our models are used by thousands of breakthrough startups and dozens of global enterprises for mission-critical workloads.
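As a rough illustration of how features from the list above might be enabled per request, the sketch below builds a JSON request body for an async transcription job. The option names (`audio_url`, `speaker_labels`, `language_detection`, `sentiment_analysis`, `entity_detection`) are assumptions drawn from AssemblyAI's public API documentation, not from this listing, and the helper `build_transcript_request` is hypothetical. Check the current API reference before relying on any of them.

```python
import json

# Hypothetical mapping from feature names in this listing to request options.
# The option names are assumptions based on AssemblyAI's public API docs;
# this listing itself does not name any parameters.
FEATURE_OPTIONS = {
    "speaker diarization": "speaker_labels",
    "auto language detection": "language_detection",
    "sentiment analysis": "sentiment_analysis",
    "entity detection": "entity_detection",
}


def build_transcript_request(audio_url, features):
    """Return a JSON-serializable request body with the given features enabled."""
    body = {"audio_url": audio_url}
    for feature in features:
        key = feature.lower()
        if key not in FEATURE_OPTIONS:
            raise ValueError(f"unknown feature: {feature!r}")
        body[FEATURE_OPTIONS[key]] = True
    return body


payload = build_transcript_request(
    "https://example.com/meeting.mp3",  # placeholder audio URL
    ["Speaker diarization", "Sentiment analysis"],
)
print(json.dumps(payload, indent=2))
```

The sketch only builds the request body and makes no network call. Sending it would additionally require an API key and the transcription endpoint from the vendor documentation.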
Highlights
- Unparalleled Human-Level Accuracy: Our multilingual speech recognition AI models deliver industry-leading performance with the lowest word error rates on the market, outperforming competitors by over 60% when recognizing challenging content like rare words and proper nouns. Trusted by more than 3,000 innovative companies, including Zoom, our platform provides the foundation for mission-critical speech applications at scale.
- Enterprise-Grade Performance and Security: Our APIs deliver unmatched scalability for high-concurrency applications. Security is embedded with SOC 2 Type 2, PCI DSS, and GDPR compliance. For healthcare applications, AssemblyAI offers Business Associate Agreements (BAAs). Choose flexible hosting options in both US and EU regions.
- Comprehensive Speech Understanding Suite and Guardrails: Our advanced models summarize conversations, identify speakers through diarization, analyze sentiment, moderate content, automatically redact PII, and much more, all in a single platform. Our LLM Gateway connects spoken data with your preferred large language models for voice-powered applications.
Pricing
| Dimension | Cost/unit |
|---|---|
| Fast, intelligent async transcription with exceptional accuracy and unlimited concurrency | $0.15 |
| Highest accuracy transcription powered by LLM intelligence | $0.27 |
| Fast, accurate real-time transcription. Built-in turn detection and unlimited concurrency | $0.15 |
| Improve recognition accuracy for specific words and phrases | $0.04 |
| Identify speakers by their actual names and roles | $0.02 |
| Automatically convert your transcribed audio content from one language to another | $0.06 |
| Ensure consistency through automatic, standardized formatting | $0.03 |
| Identify entities like person and company names, email addresses, dates, and locations | $0.08 |
| Detect the sentiment of each sentence of speech spoken in your audio files | $0.02 |
| Automatically generate a summary over time for audio and video files | $0.08 |
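To show how the per-unit rates above could combine into an estimate, here is a minimal sketch. It assumes that add-on dimensions are billed on top of a base transcription dimension and that every dimension is metered in the same unit; the listing does not state what a unit is, so that is left abstract. The dictionary keys and the `estimate_cost` helper are hypothetical names introduced for this example.

```python
from decimal import Decimal

# Per-unit rates copied from the pricing table; the short keys are
# hypothetical labels, not official dimension names.
RATES = {
    "async_transcription": Decimal("0.15"),
    "highest_accuracy_transcription": Decimal("0.27"),
    "realtime_transcription": Decimal("0.15"),
    "keyterms": Decimal("0.04"),
    "speaker_identification": Decimal("0.02"),
    "translation": Decimal("0.06"),
    "custom_formatting": Decimal("0.03"),
    "entity_detection": Decimal("0.08"),
    "sentiment_analysis": Decimal("0.02"),
    "summary_over_time": Decimal("0.08"),
}


def estimate_cost(units, dimensions):
    """Estimate cost in USD, assuming all dimensions share one metering unit."""
    per_unit = sum((RATES[d] for d in dimensions), Decimal("0"))
    return (per_unit * Decimal(str(units))).quantize(Decimal("0.01"))


# 100 units of async transcription with two add-ons:
# (0.15 + 0.02 + 0.02) * 100 = 19.00
total = estimate_cost(
    100,
    ["async_transcription", "speaker_identification", "sentiment_analysis"],
)
print(total)  # 19.00
```

`Decimal` is used rather than floats so that cent-level sums stay exact. Actual charges depend on how AWS Marketplace meters each dimension, so treat the result as an estimate only.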
Vendor refund policy
All fees are non-refundable and non-cancellable except as required by law.
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Support
Vendor support
Support is available 24/7 via chat on our website at www.assemblyai.com or by email at support@assemblyai.com.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.