Amazon Transcribe | Artificial Intelligence

Stream multi-channel audio to Amazon Transcribe using the Web Audio API

In this post, we explore the implementation details of a web application that uses the browser’s Web Audio API and Amazon Transcribe streaming to enable real-time dual-channel transcription. By using the combination of AudioContext, ChannelMergerNode, and AudioWorklet, we were able to seamlessly process and encode the audio data from two microphones before sending it to Amazon Transcribe for transcription.

Revolutionizing clinical trials with the power of voice and AI

As the healthcare industry continues to embrace digital transformation, solutions that combine advanced technologies like audio-to-text translation and LLMs will become increasingly valuable in addressing key challenges, such as patient education, engagement, and empowerment. In this post, we discuss possible use cases for combining speech recognition technology with LLMs, and how the solution can revolutionize clinical trials.

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

In this post, we explore the approach behind building an AWS AI-powered Chrome extension that aims to revolutionize the live streaming experience by providing real-time transcription, translation, and summarization capabilities directly within your browser.

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

In this post, we share how DPG Media is introducing AI-powered processes using Amazon Bedrock into its video publication pipelines. This solution is helping accelerate audio metadata extraction, create a more engaging user experience, and save time.

Enhance your media search experience using Amazon Q Business and Amazon Transcribe

In today’s digital landscape, the demand for audio and video content is skyrocketing. Organizations are increasingly using media to engage with their audiences in innovative ways. From product documentation in video format to podcasts replacing traditional blog posts, content creators are exploring diverse channels to reach a wider audience. The rise of virtual workplaces has […]

Introducing medical speech-to-text with Amazon Transcribe Medical

We are excited to announce Amazon Transcribe Medical, a new HIPAA-eligible, machine learning automatic speech recognition (ASR) service that allows developers to add medical speech-to-text capabilities to their applications. Transcribe Medical provides accurate and affordable medical transcription, enabling healthcare providers, IT vendors, insurers, and pharmaceutical companies to build services that help physicians, nurses, researchers, and […]

Subtitling videos accurately and easily with CaptionHub and AWS

This is a guest post from Graham Pengelly, CTO, and James Jameson, the Commercial Lead, at CaptionHub. CaptionHub is a London-based company that focuses on video captioning and subtitling production for enterprise organizations. While the act of captioning—that is, taking video files and making sure the text on the screen reflects what’s being said accurately […]

Transcribe speech to text in real time using Amazon Transcribe with WebSocket

October 2024: This post was reviewed and updated for accuracy. Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for developers to add speech-to-text capability to applications. In November 2018, we added streaming transcriptions over HTTP/2 to Amazon Transcribe. This enabled users to pass a live audio stream to the service […]

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

We’re excited to announce that Amazon Transcribe now supports automatic speech recognition in three new languages: French, Italian, and Brazilian Portuguese. These new languages expand upon the 5 languages already available in Amazon Transcribe: US English, US Spanish, Australian English, British English, and Canadian French. Using the Amazon Transcribe API, you can analyze audio files […]

Amazon Transcribe now lets you designate your own Amazon S3 buckets to store transcription outputs

Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for you to add a speech-to-text capability to your applications. You can use Amazon Transcribe to create text transcripts of audio and video files. Starting today, you can designate your own S3 buckets to store transcription outputs rather than S3 buckets maintained […]

Artificial Intelligence

Tag: Amazon Transcribe