AWS Machine Learning Blog

Tag: Amazon Transcribe

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

In this post, we explore the approach behind building an AWS AI-powered Chrome extension that aims to revolutionize the live streaming experience by providing real-time transcription, translation, and summarization capabilities directly within your browser.

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

In this post, we share how DPG Media is introducing AI-powered processes using Amazon Bedrock into its video publication pipelines. This solution is helping accelerate audio metadata extraction, create a more engaging user experience, and save time.

Enhance your media search experience using Amazon Q Business and Amazon Transcribe

In today’s digital landscape, the demand for audio and video content is skyrocketing. Organizations are increasingly using media to engage with their audiences in innovative ways. From product documentation in video format to podcasts replacing traditional blog posts, content creators are exploring diverse channels to reach a wider audience. The rise of virtual workplaces has […]

Introducing medical speech-to-text with Amazon Transcribe Medical

We are excited to announce Amazon Transcribe Medical, a new HIPAA-eligible, machine learning automatic speech recognition (ASR) service that allows developers to add medical speech-to-text capabilities to their applications. Transcribe Medical provides accurate and affordable medical transcription, enabling healthcare providers, IT vendors, insurers, and pharmaceutical companies to build services that help physicians, nurses, researchers, and […]

Subtitling videos accurately and easily with CaptionHub and AWS

This is a guest post from Graham Pengelly, CTO, and James Jameson, the Commercial Lead, at CaptionHub. CaptionHub is a London-based company that focuses on video captioning and subtitling production for enterprise organizations. While the act of captioning—that is, taking video files and making sure the text on the screen reflects what’s being said accurately […]

Transcribe speech to text in real time using Amazon Transcribe with WebSocket

October 2024: This post was reviewed and updated for accuracy. Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for developers to add speech-to-text capability to applications. In November 2018, we added streaming transcriptions over HTTP/2 to Amazon Transcribe. This enabled users to pass a live audio stream to the service […]

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

We’re excited to announce that Amazon Transcribe now supports automatic speech recognition in three new languages: French, Italian, and Brazilian Portuguese. These new languages expand upon the 5 languages already available in Amazon Transcribe: US English, US Spanish, Australian English, British English, and Canadian French. Using the Amazon Transcribe API, you can analyze audio files […]

Amazon Transcribe now lets you designate your own Amazon S3 buckets to store transcription outputs  

Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for you to add a speech-to-text capability to your applications. You can use Amazon Transcribe to create text transcripts of audio and video files. Starting today, you can designate your own S3 buckets to store transcription outputs rather than S3 buckets maintained […]

Monitor Amazon Transcribe applications with AWS CloudTrail and Amazon CloudWatch Events

Monitoring your AWS resources is critical for security, performance, compliance, and cost control purposes. Therefore, our customers always ask for features to enable monitoring. Today, we are pleased to announce that Amazon Transcribe is integrated with AWS CloudTrail and Amazon CloudWatch Events to give you more visibility and control of your Amazon Transcribe resources. Let’s […]

VidMob combines computer vision and language AI services for data-driven creative asset production

VidMob is a social video creation platform that marketers of all sizes can use to develop personalized advertising communications at scale. VidMob uses machine learning (ML) to power its SaaS application. This application uses metadata extraction and sentiment analysis to provide marketers with actionable insights into which creative assets resonate with their intended audience, and […]