AssemblyAI

AssemblyAI builds new AI systems that can understand human speech with superhuman abilities. AssemblyAI's multilingual Speech AI models provide speech-to-text with industry-leading accuracy and advanced capabilities like speaker detection, PII redaction, and sentiment analysis, which give organizations the ability to generate powerful, actionable insights from audio - all through a secure and scalable API.

0 AWS reviews

31 external reviews

View purchase options

Overview

AssemblyAI offers Speech AI models via an API that product teams and developers can use to build powerful AI solutions based on voice data for their users. Thousands of developers build on AssemblyAI's Speech AI models every day to run Speech-to-Text on multilingual speech, and harness the power of Large Language Models to extract the full value from that voice data - including answering questions from voice data, generating content, and extracting metadata in seconds. AssemblyAI offers async transcription, with most audio files completing in well under 45 seconds regardless of audio duration, as well as real-time transcription with high accuracy and <600 ms of latency.

AssemblyAI gives you access to state-of-the-art Speech AI models and capabilities for real-world use cases, so you can build smarter applications in a fraction of the time. Models and features include:

- Speech recognition
- Speaker diarization
- Auto punctuation and casing
- Auto language detection
- Summarization
- Content moderation
- Sentiment analysis
- Auto highlights
- PII redaction
- Topic detection (IAB classification)
- Entity detection
- Auto chapters
- Custom spelling
- Custom vocabulary
- Dual channel transcription
- Export SRT or VTT caption files
- Filler word filtering
- Profanity filtering

In addition, LeMUR, which allows users to leverage the capabilities of Large Language Models, can quickly process audio transcripts for single or multiple audio files for tasks like summarization, question & answer, and AI coaching feedback.

Our Speech AI products support 33 different audio and video file types and 99+ languages. Our models are used by thousands of breakthrough startups and dozens of global enterprises for mission-critical workloads.

In Pricing, one unit is equivalent to one hour and for Enterprise Pricing please contact sales: www.assemblyai.com/contact

Highlights

Human-level accuracy: Our latest multilingual AI model for speech recognition Universal-1 achieves state-of-the-art accuracy on a wide variety of academic and real-world datasets compared to other ASR models, and is 93% accurate.
More than just a model: Designed for real-world applications, our API includes critical features that help you understand human speech. Our API processes terabytes of audio data every day with over 99.9% uptime and success, and is compliant with SOC 2 Type 2, PCI DSS, and GDPR.
Build smarter apps: Summarize, diarize, detect sentiment, moderate content, redact PII, and more with our set of Audio Intelligence models. Or leverage LeMUR, our framework to build LLM-powered apps on spoken data.

Details

Sold by

AssemblyAI

Features and programs

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Pricing

AssemblyAI

Info

View purchase options

Pricing is based on contract duration. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for any usage exceeds the entitle amount or not covered in the contract. These charges will be applied on top of the contract price. If you choose not to renew or replace your contract before it ends, access to your entitlements will expire.

1-month contract (1)

Info

Dimension	Description	Cost/month
Pay As You Go	State-of-the-art production-ready AI models	$0.00

Additional usage costs (20)

Info

The following dimensions are not included in the contract terms, which will be charged based on your usage.

Dimension	Cost/unit
Async Transcription (core)	$0.37
Nano Speech-to-Text (core)	$0.12
Real-Time Transcription (core)	$0.47
Auto Chapters (Audio Intelligence)	$0.08
Content Moderation (Audio Intelligence)	$0.15
Entity Detection (Audio Intelligence)	$0.08
Key Phrases (Auto Highlights)	$0.01
PII Redaction (Audio Intelligence)	$0.08
PII Audio Redaction (Audio Intelligence)	$0.05
Sentiment Analysis (Audio Intelligence)	$0.02

Vendor refund policy

All fees are non-refundable and non-cancellable except as required by law.

How can we make this page better?

We'd like to hear your feedback and ideas on how to improve this page.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

Software as a Service (SaaS)

SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

Resources

Vendor resources

Why AssemblyAI

Case Studies

AssemblyAI Docs

Support

Vendor support

Support is available via chat and email 24/7. support@assemblyai.com

AWS infrastructure support

AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

Get support

Customer reviews

Write a review

Ratings and reviews

Info

0 ratings

5 star

4 star

3 star

2 star

1 star

0 AWS reviews

31 external reviews

External reviews are sourced from G2 and are not included in the star rating for this product.

Sandeep K.

Excellent speech to text service

Reviewed on Dec 19, 2024

Review provided by G2

What do you like best about the product?

It is easy to use. Moreover provided great documentation. I personally like the pay as you go feature instead of fixed monthly subscription.

What do you dislike about the product?

In some cases it is not much accurate with some accents.

What problems is the product solving and how is that benefiting you?

It helped us to generate transcripts of our documentation videos . So we can easily find keywords and understand concepts.

Leave a comment 0 comments

Financial Services

AssemblyAI does exactly what we need it to

Reviewed on Nov 25, 2024

Review provided by G2

What do you like best about the product?

Easy to use API, handles the volume we need, and provides great accuracy. Our system was rapid to get started and we have had great support.

What do you dislike about the product?

Any trouble or issues we had during configuration were quickly handled by support. Nothing to dislike.

What problems is the product solving and how is that benefiting you?

We use AssemblyAI to convert our contact center conversations to text for AI processing.

Leave a comment 0 comments

Danilo C.

Good quality and speed with a high price.

Reviewed on Nov 21, 2024

Review provided by G2

What do you like best about the product?

Most of the transcriptions performed achieve good results, but we face challenges with lower-quality audio (which is common in Brazil). The pricing charged in dollars is hindering progress (as we operate in Brazil).

What do you dislike about the product?

Challenges with low-quality audio (common in Brazil) and pricing in dollars.

What problems is the product solving and how is that benefiting you?

Identifying business opportunities in calls between companies and their customers.

Leave a comment 0 comments

Craig W.

Easy to Integrate, Phenomonal Results!

Reviewed on Nov 21, 2024

Review provided by G2

What do you like best about the product?

AssemblyAI has been a phenomenal partner to work with and their experienced team made implementation a smoothe process. Our clients give high praise to the accuracy and consistency offered by AssemblyAI's Speech to Text API.

What do you dislike about the product?

We've yet to experience any issues that couldn't be managed by adjusting our prompts. AI naturally likes to make assumptions when interpreting transcriptions, but we noticed AssemblyAI's API does a great job with it's initial response. A tweak here and there to our prompts was all it needed to improve accuracy.

What problems is the product solving and how is that benefiting you?

AssemblyAI - Speech to Text API is used to transcribe and summarize telehealth sessions.

Leave a comment 0 comments

Phil M.

A detailed and well-supported speech-to-text service for our company!

Reviewed on Nov 21, 2024

Review provided by G2

What do you like best about the product?

Since our company is light on the dev side of things, we needed to have a UI-based capability to use AssemblyAI, and they did not disappoint with their Make.com integration. This was very easy to implement, well-documented, and when we did have questions, the AssemblyAI customer support team was ready and able to help! We use this daily for call transcription and summarization.

What do you dislike about the product?

I don't have anything negative to say about it!

What problems is the product solving and how is that benefiting you?

We use it for Call Transcription & Call Summarization, and we extract key pieces of information in order to pass into our CRM so that our Sales & Customer Service teams can have much more data at their fingertips.

Leave a comment 0 comments

View all reviews