Amazon Transcribe

Automatically convert speech to text

Amazon Transcribe makes it easy for developers to add speech-to-text capability to their applications. Raw audio is very difficult for computers to search and analyze, so recorded speech needs to be converted to text before applications can use it. Historically, this meant working with transcription providers that required expensive contracts and were hard to integrate into a technology stack. Many of these providers use outdated technology that adapts poorly to different scenarios, such as the low-fidelity phone audio common in contact centers, which results in poor accuracy.

Amazon Transcribe uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately. You can use Amazon Transcribe to transcribe customer service calls, to automate closed captioning and subtitling, and to generate metadata for media assets to create a fully searchable archive. You can use Amazon Transcribe Medical to add medical speech-to-text capabilities to clinical documentation applications.

Key Features

Easy-to-Read Transcriptions

Amazon Transcribe automatically adds punctuation and formatting so that the output closely matches the quality of manual transcription at a fraction of the time and expense.

Streaming Transcription

You can process audio in batch or in near real time. Over a secure connection, you can send a live audio stream to the service and receive a stream of text in response.

Timestamp Generation

Amazon Transcribe returns a timestamp for each word, so that you can easily find a word or phrase in the original recording or add subtitles to video.
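
As an illustration of how word-level timestamps can be used, the sketch below looks up every occurrence of a word in a transcript. It assumes the batch transcript JSON layout in which results.items holds one entry per word or punctuation mark, with start_time and end_time given as strings of seconds. The sample transcript and the find_word helper are hypothetical and written by hand for this example; check the field names against the output of your own jobs.

```python
from typing import Any, Dict, List, Tuple

# Hand-written sample in the assumed transcript layout (not real service output).
SAMPLE_TRANSCRIPT: Dict[str, Any] = {
    "results": {
        "items": [
            {"type": "pronunciation", "start_time": "0.04", "end_time": "0.45",
             "alternatives": [{"confidence": "0.99", "content": "Thanks"}]},
            {"type": "pronunciation", "start_time": "0.45", "end_time": "0.62",
             "alternatives": [{"confidence": "0.98", "content": "for"}]},
            {"type": "pronunciation", "start_time": "0.62", "end_time": "1.10",
             "alternatives": [{"confidence": "0.97", "content": "calling"}]},
            # Punctuation items carry no timestamps in this layout.
            {"type": "punctuation",
             "alternatives": [{"confidence": "0.0", "content": "."}]},
            {"type": "pronunciation", "start_time": "2.30", "end_time": "2.71",
             "alternatives": [{"confidence": "0.99", "content": "Thanks"}]},
        ]
    }
}


def find_word(transcript: Dict[str, Any], word: str) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds for each case-insensitive match."""
    hits: List[Tuple[float, float]] = []
    for item in transcript["results"]["items"]:
        if item.get("type") != "pronunciation":
            continue  # skip punctuation, which has no timing information
        text = item["alternatives"][0]["content"]
        if text.lower() == word.lower():
            hits.append((float(item["start_time"]), float(item["end_time"])))
    return hits


if __name__ == "__main__":
    for start, end in find_word(SAMPLE_TRANSCRIPT, "thanks"):
        print(f"'thanks' spoken from {start:.2f}s to {end:.2f}s")
```

The returned offsets can be passed to a media player to seek directly to the moment the word was spoken.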

Custom Vocabulary

You can add new words to the base vocabulary to generate more accurate transcriptions for domain-specific words and phrases like product names, technical terminology, or names of individuals.

Vocabulary Filtering

You can specify a list of words to remove from transcripts. For example, you can specify a list of profane or offensive words, and Amazon Transcribe removes them from transcripts automatically.

Recognize Multiple Speakers

Speaker changes are automatically recognized and attributed in the text to capture scenarios like telephone calls, meetings, and television shows accurately.

Channel Identification

Contact centers can submit a single audio file to Amazon Transcribe, and the service will automatically identify the channels and produce a single transcript annotated with channel labels.

Automatic Content Redaction

When instructed, Amazon Transcribe can identify and redact sensitive personally identifiable information (PII) from transcripts in supported languages. This allows contact centers to easily review and share the transcripts for customer experience insight and agent training.
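
The batch features described in this section are switched on through request parameters when a transcription job is started. The following is a minimal sketch, not a definitive implementation: build_job_request is a hypothetical helper, and the job name, bucket URI, vocabulary name, and filter name are placeholders. The parameter names follow the StartTranscriptionJob request shape as exposed by boto3. The sketch treats speaker labels and channel identification as alternatives, which is a conservative assumption; check the API reference before combining them.

```python
from typing import Any, Dict, Optional


def build_job_request(
    job_name: str,
    media_uri: str,
    language_code: str = "en-US",
    vocabulary_name: Optional[str] = None,
    vocabulary_filter_name: Optional[str] = None,
    max_speakers: Optional[int] = None,
    identify_channels: bool = False,
    redact_pii: bool = False,
) -> Dict[str, Any]:
    """Build keyword arguments for a batch transcription request."""
    if max_speakers is not None and identify_channels:
        # Assumption of this sketch: use one or the other, not both.
        raise ValueError("choose speaker labels or channel identification")

    request: Dict[str, Any] = {
        "TranscriptionJobName": job_name,
        "LanguageCode": language_code,
        "Media": {"MediaFileUri": media_uri},
    }

    settings: Dict[str, Any] = {}
    if vocabulary_name:
        settings["VocabularyName"] = vocabulary_name  # custom vocabulary
    if vocabulary_filter_name:
        settings["VocabularyFilterName"] = vocabulary_filter_name
        settings["VocabularyFilterMethod"] = "remove"  # drop filtered words
    if max_speakers is not None:
        settings["ShowSpeakerLabels"] = True
        settings["MaxSpeakerLabels"] = max_speakers
    if identify_channels:
        settings["ChannelIdentification"] = True
    if settings:
        request["Settings"] = settings

    if redact_pii:
        request["ContentRedaction"] = {
            "RedactionType": "PII",
            "RedactionOutput": "redacted",
        }
    return request


if __name__ == "__main__":
    # Placeholder names and URI, for illustration only.
    request = build_job_request(
        job_name="support-call-0001",
        media_uri="s3://example-bucket/calls/0001.wav",
        vocabulary_filter_name="profanity-filter",
        identify_channels=True,
        redact_pii=True,
    )
    print(request)
    # With AWS credentials configured, the request could be submitted with:
    #   import boto3
    #   boto3.client("transcribe").start_transcription_job(**request)
```

Building the request as a plain dictionary keeps the feature selection easy to inspect and test before any call is made to the service.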

Use Cases

Improving Customer Service

By converting audio input into text, Amazon Transcribe helps you build text analytics applications that can search and analyze voice input. Customer contact centers can use Amazon Transcribe to transcribe calls, and mine the data for insights using other AWS services like Amazon Comprehend to extract meaning and intent from conversations.

Captioning & Subtitling Workflows

Amazon Transcribe can help content producers and media distributors improve reach and accessibility by automatically generating time-stamped subtitles that can be displayed along with the video content. By combining this text with Amazon Translate, you can also easily localize videos.
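
To make the subtitling workflow concrete, here is a rough sketch that turns word-level timestamps into SubRip (SRT) cues. It assumes each transcript item has a type of "pronunciation" or "punctuation", string start_time and end_time values in seconds for words, and the text under alternatives[0].content. The helper names, the sample items, and the fixed words-per-cue rule are illustrative choices for this example, not part of the service.

```python
from typing import Any, Dict, List


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm form used by SRT files."""
    millis = int(round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(items: List[Dict[str, Any]], max_words: int = 8) -> str:
    """Group words into cues of at most max_words and render them as SRT."""
    cues: List[Dict[str, Any]] = []
    current = None
    for item in items:
        text = item["alternatives"][0]["content"]
        if item["type"] == "punctuation":
            if current is not None:
                current["text"] += text  # attach to the preceding word
            continue
        if current is None or current["words"] >= max_words:
            if current is not None:
                cues.append(current)
            current = {"start": float(item["start_time"]),
                       "end": float(item["end_time"]),
                       "text": text, "words": 1}
        else:
            current["text"] += " " + text
            current["end"] = float(item["end_time"])
            current["words"] += 1
    if current is not None:
        cues.append(current)

    blocks = [
        f"{i}\n{srt_timestamp(c['start'])} --> {srt_timestamp(c['end'])}\n{c['text']}"
        for i, c in enumerate(cues, start=1)
    ]
    return "\n\n".join(blocks) + "\n" if blocks else ""


def _word(text: str, start: str, end: str) -> Dict[str, Any]:
    return {"type": "pronunciation", "start_time": start, "end_time": end,
            "alternatives": [{"content": text}]}


# Hand-written sample items in the assumed layout (not real service output).
SAMPLE_ITEMS = [
    _word("Hello", "0.0", "0.5"),
    _word("world", "0.5", "1.0"),
    {"type": "punctuation", "alternatives": [{"content": "."}]},
    _word("Goodbye", "1.2", "1.9"),
]

if __name__ == "__main__":
    print(to_srt(SAMPLE_ITEMS, max_words=2))
```

A production workflow would more likely break cues at sentence boundaries or pauses instead of a fixed word count; the fixed count keeps the example short.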

Cataloging Audio Archives

You can use Amazon Transcribe to transcribe audio and video assets into fully searchable archives for compliance monitoring and risk management. Convert audio to text and use Amazon Elasticsearch Service to index and search across your audio and video library.

