    AssemblyAI

    Sold by: AssemblyAI 
    AssemblyAI builds new AI systems that can understand human speech with superhuman abilities. AssemblyAI's multilingual Speech AI models provide speech-to-text with industry-leading accuracy and advanced capabilities like speaker detection, PII redaction, and sentiment analysis, which give organizations the ability to generate powerful, actionable insights from audio - all through a secure and scalable API.

    Overview

    AssemblyAI offers Speech AI models via an API that product teams and developers can use to build powerful AI solutions based on voice data for their users. Thousands of developers build on AssemblyAI's Speech AI models every day to run Speech-to-Text on multilingual speech, and harness the power of Large Language Models to extract the full value from that voice data - including answering questions from voice data, generating content, and extracting metadata in seconds. AssemblyAI offers async transcription, with most audio files completing in well under 45 seconds regardless of audio duration, as well as real-time transcription with high accuracy and <600 ms of latency.

    AssemblyAI gives you access to state-of-the-art Speech AI models and capabilities for real-world use cases, so you can build smarter applications in a fraction of the time. Models and features include:

    - Speech recognition
    - Speaker diarization
    - Auto punctuation and casing
    - Auto language detection
    - Summarization
    - Content moderation
    - Sentiment analysis
    - Auto highlights
    - PII redaction
    - Topic detection (IAB classification)
    - Entity detection
    - Auto chapters
    - Custom spelling
    - Custom vocabulary
    - Dual channel transcription
    - Export SRT or VTT caption files
    - Filler word filtering
    - Profanity filtering

    In addition, LeMUR, which allows users to leverage the capabilities of Large Language Models, can quickly process audio transcripts for single or multiple audio files for tasks like summarization, question & answer, and AI coaching feedback.

    Our Speech AI products support 33 different audio and video file types and 99+ languages. Our models are used by thousands of breakthrough startups and dozens of global enterprises for mission-critical workloads.

    In Pricing, one unit is equivalent to one hour. For Enterprise Pricing, please contact sales: www.assemblyai.com/contact

    Highlights

    • Human-level accuracy: Our latest multilingual AI model for speech recognition Universal-1 achieves state-of-the-art accuracy on a wide variety of academic and real-world datasets compared to other ASR models, and is 93% accurate.
    • More than just a model: Designed for real-world applications, our API includes critical features that help you understand human speech. Our API processes terabytes of audio data every day with over 99.9% uptime and success, and is compliant with SOC 2 Type 2, PCI DSS, and GDPR.
    • Build smarter apps: Summarize, diarize, detect sentiment, moderate content, redact PII, and more with our set of Audio Intelligence models. Or leverage LeMUR, our framework to build LLM-powered apps on spoken data.

    Details

    Delivery method

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

    Pricing

    Pricing is based on contract duration. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing applies to any usage that exceeds the entitled amount or is not covered by the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before it ends, access to your entitlements will expire.

    1-month contract (1)

    Dimension        Description                                    Cost/month
    Pay As You Go    State-of-the-art production-ready AI models    $0.00

    Additional usage costs (20)

    The following dimensions are not included in the contract terms and are charged based on your usage.

    Dimension                                   Cost/unit
    Async Transcription (core)                  $0.37
    Nano Speech-to-Text (core)                  $0.12
    Real-Time Transcription (core)              $0.47
    Auto Chapters (Audio Intelligence)          $0.08
    Content Moderation (Audio Intelligence)     $0.15
    Entity Detection (Audio Intelligence)       $0.08
    Key Phrases (Auto Highlights)               $0.01
    PII Redaction (Audio Intelligence)          $0.08
    PII Audio Redaction (Audio Intelligence)    $0.05
    Sentiment Analysis (Audio Intelligence)     $0.02
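
    As a rough illustration of how the per-unit rates above combine, the following Python sketch estimates a usage charge. The rates are copied from this listing and cover only the dimensions shown here. The function name `estimate_usage_cost`, the idea that each enabled feature is billed for the same number of hours as the audio it processes, and the rounding to the nearest cent are illustrative assumptions, not AWS or AssemblyAI billing logic.

```python
from decimal import Decimal

# Per-unit rates copied from the listing (one unit = one hour).
# Only the dimensions shown in the listing are included here.
PRICE_PER_UNIT = {
    "Async Transcription (core)": Decimal("0.37"),
    "Nano Speech-to-Text (core)": Decimal("0.12"),
    "Real-Time Transcription (core)": Decimal("0.47"),
    "Auto Chapters (Audio Intelligence)": Decimal("0.08"),
    "Content Moderation (Audio Intelligence)": Decimal("0.15"),
    "Entity Detection (Audio Intelligence)": Decimal("0.08"),
    "Key Phrases (Auto Highlights)": Decimal("0.01"),
    "PII Redaction (Audio Intelligence)": Decimal("0.08"),
    "PII Audio Redaction (Audio Intelligence)": Decimal("0.05"),
    "Sentiment Analysis (Audio Intelligence)": Decimal("0.02"),
}


def estimate_usage_cost(hours_by_dimension):
    """Estimate the usage charge in USD for a mapping of dimension -> hours.

    Hypothetical helper: assumes a simple hours * rate sum per dimension,
    rounded to the cent. Actual billing granularity may differ.
    """
    total = Decimal("0")
    for dimension, hours in hours_by_dimension.items():
        if dimension not in PRICE_PER_UNIT:
            raise KeyError(f"No listed rate for dimension: {dimension}")
        # Convert via str() so float inputs such as 1.5 stay exact.
        total += PRICE_PER_UNIT[dimension] * Decimal(str(hours))
    return total.quantize(Decimal("0.01"))


if __name__ == "__main__":
    usage = {
        "Async Transcription (core)": 100,
        "Sentiment Analysis (Audio Intelligence)": 100,
    }
    print(f"Estimated usage charge: ${estimate_usage_cost(usage)}")
```

    Under these assumptions, 100 hours of async transcription with sentiment analysis enabled would come to $37.00 + $2.00 = $39.00, on top of the $0.00 contract price.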

    Vendor refund policy

    All fees are non-refundable and non-cancellable except as required by law.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA).

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Support is available via chat and email 24/7: support@assemblyai.com

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

    0 ratings
    5 star: 0%
    4 star: 0%
    3 star: 0%
    2 star: 0%
    1 star: 0%
    0 AWS reviews | 24 external reviews
    External reviews are sourced from G2 and are not included in the star rating for this product.
    Andrew J.

    Accurate and consistent ASR

    Reviewed on Aug 13, 2024
    Review provided by G2
    What do you like best about the product?
    AssemblyAI produces reliable ASR results at a great price.
    What do you dislike about the product?
    Most of the time, AssemblyAI provides an API with excellent uptime and few errors. There are occasional bugs that crop up, and it can take a while to get a bug or edge case resolved.
    What problems is the product solving and how is that benefiting you?
    We record and analyze phone calls. We capture the raw audio and need that audio transcribed into text. AssemblyAI is able to do that ASR more accurately and at a better price than anyone else on the market.
    Jacob S.

    Overall, the Perfect Product!

    Reviewed on Jun 21, 2024
    Review provided by G2
    What do you like best about the product?
    Assembly has been the go-to for our business for all things Speech to Text related. We love the ease-of-use with integration, extremely clear documentation (implementation), phenomenal support, and accuracy of the speech to text recognition.

    Assembly has kept up with our startup's high volume of requests. We grew quickly from 0-180,000 users within 7 months and used Assembly from our very first MVP into our full scale production versions now. We have had practically no issues throughout this process - and if we did - support was quick to provide a solution.

    In addition to all of the above, the price for the product is perfect.
    What do you dislike about the product?
    They are limited in the number of languages they offer. While I haven't done extensive competitor research, I believe that others may offer more language variety which we would greatly benefit from in our app.

    The other downside is the number of customizable outputs from the API. We wanted our Speech to Text output to deliver us an SRT file that only displays one word at a time on the screen (since this is a popular format on social media). Assembly does not support these kinds of customizations - however, they did offer a solution for this that requires custom code post-processing of the API call which we appreciated.

    Therefore, while there are a few small drawbacks, we have been able to accurately deliver speech-to-text recognition to our 180,000+ users thanks to Assembly.
    What problems is the product solving and how is that benefiting you?
    They recently added speaker diarization which is great and will benefit our product. They also added new languages recently which we now can also provide to our users (which we appreciate). In addition to this, they told me they are also improving the languages they currently support as well. This is helpful because we have had some users experience inaccuracies with their STT.
    Ryan J.

    Best Speech to Text technology in the market!

    Reviewed on Jun 11, 2024
    Review provided by G2
    What do you like best about the product?
    AssemblyAI really is focused on Product Development as their core customer inside of an organization. Their APIs are well defined and are always making updates to them on the regular. The accuracy and error rate of their speech to text model is the best in the market! Our customers love the transcriptions we can provide them along with some of the other intelligence features. AssemblyAI makes their APIs easy to use and implement into our products.
    What do you dislike about the product?
    There are no downsides or things I dislike with AssemblyAI's speech to text API.
    What problems is the product solving and how is that benefiting you?
    The main problem that the Speech to Text API solves for CallRail is transcribing phone conversations into transcripts for our end customers. Our customers may receive hundreds and even thousands of phone calls a day. They don't have the time or resources to listen to every call recording. AssemblyAI's Speech to Text API allows us to easily transcribe these conversations inside of our Conversation Intelligence product.
    Avijit C.

    Good Features & Great Advancements

    Reviewed on May 31, 2023
    Review provided by G2
    What do you like best about the product?
    Assembly AI is leveraging generative AI and the current technological trends, allowing them to introduce and provide amazing and better features as well as good accuracy in their engines along with added services.
    What do you dislike about the product?
    I feel they can explore the generative AI specs more deeply and introduce more features rather than traditional Q&A, in order to provide better usability and product lookout of their offering.
    What problems is the product solving and how is that benefiting you?
    They are solving the speech to text and helping us convert calls and voice recordings into text allowing us to utilise NLP based models in order to gain and provide insights
    Computer Software

    Most accurate Speech to text model for telephony audio data transcription.

    Reviewed on Aug 11, 2022
    Review provided by G2
    What do you like best about the product?
    Service is super easy to integrate via APIs whose documentation is available in multiple languages on their website so setting up their service is super easy for developers. Apart from this you pay for what you transcribe i.e they charge based on no. of hours of transcription.
    What do you dislike about the product?
    Their model's accuracy on telephonic Audio data performs much better than competitors but transcribes poorly in the case of Audio having high background noise and disturbances.
    What problems is the product solving and how is that benefiting you?
    Extracting meaningful information from spoken language-based data is becoming essential for language-based startups and the ones who are trying to automate and improve their services. Training models require in-depth domain knowledge of machine learning and AI. assembly makes it easier to perform these tasks in a much simpler way without the need for in-depth domain knowledge in Language-based AI & ML.