Amazon Transcribe makes it easy for developers to add speech-to-text capability to their applications. Audio data is virtually impossible for computers to search and analyze. Therefore, recorded speech needs to be converted to text before it can be used in applications. Historically, customers had to work with transcription providers that required them to sign expensive contracts and were hard to integrate into their technology stacks to accomplish this task. Many of these providers use outdated technology that does not adapt well to different scenarios, like low-fidelity phone audio common in contact centers, which results in poor accuracy.
Amazon Transcribe uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately. Amazon Transcribe can be used to transcribe customer service calls, to automate closed captioning and subtitling, and to generate metadata for media assets to create a fully searchable archive. You can use Amazon Transcribe Medical to add medical speech to text capabilities to clinical documentation applications.
Key Features
Easy-to-Read Transcriptions
Amazon Transcribe automatically adds punctuation and formatting so that the output closely matches the quality of manual transcription at a fraction of the time and expense.
Streaming Transcription
You can process audio in batch or in near real-time. Using a secure connection, you can send a live audio stream to the service, and receive a stream of text in response.
Timestamp Generation
Amazon Transcribe returns a timestamp for each word, so that you can easily find a word or phrase in the original recording or add subtitles to video.
Custom Vocabulary
You can add new words to the base vocabulary to generate more accurate transcriptions for domain-specific words and phrases like product names, technical terminology, or names of individuals.
Vocabulary Filtering
You can specify a list of words to remove from transcripts. For example, you can specify a list of profane or offensive words and Amazon Transcribe removes them from transcripts automatically.
Recognize Multiple Speakers
Speaker changes are automatically recognized and attributed in the text to capture scenarios like telephone calls, meetings, and television shows accurately.
Channel Identification
Contact centers can submit a single audio file to Amazon Transcribe, and the service will identify produce a single transcript annotated by channel labels automatically.
Use Cases
Improving Customer Service
By converting audio input into text, Amazon Transcribe helps you build text analytics applications that can search and analyze voice input. Customer contact centers can use Amazon Transcribe to transcribe calls, and mine the data for insights using other AWS services like Amazon Comprehend to extract meaning and intent from conversations.
Captioning & Subtitling Workflows
Amazon Transcribe can help content producers and media distributors improve reach and accessibility by automatically generating time-stamped subtitles that can be displayed along with the video content. By combining this text with Amazon Translate, you can also easily localize videos.
Cataloging Audio Archives
You can use Amazon Transcribe to transcribe audio and video assets into fully searchable archives for compliance monitoring and risk management. Convert audio to text and use Amazon Elasticsearch to index and search across your audio/video library.
Amazon Transcribe Customers
Learn more about Amazon Transcribe pricing