AWS Machine Learning Blog

Category: Amazon Transcribe

Get started with automated metadata extraction using the AWS Media Analysis Solution

You can easily get started extracting meaningful metadata from your media files by using the Media Analysis Solution on AWS. The Media Analysis Solution provides AWS CloudFormation templates that you can use to start extracting meaningful metadata from your media files within minutes. With a web-based user interface, you can easily upload files and see the metadata that is automatically extracted. This solution uses Amazon Rekognition for facial recognition, Amazon Transcribe to create a transcript, and Amazon Comprehend to run sentiment analysis on the transcript. You can also upload your own images to an Amazon Rekognition collection and train the solution to recognize individuals. In this blog post, we’ll show you step-by step how to launch the solution and upload an image and video. You’ll be able to see firsthand how metadata is seamlessly extracted.

Amazon Transcribe now supports multi-channel transcriptions

Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for developers to add speech-to-text capability to applications. We’re excited to announce the availability of a new feature called channel identification, which allows users to process multi-channel audio files and retrieve a single transcript annotated with respective channel labels.

Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless

Amazon Web Services (AWS) brings image and video analysis, natural language processing, speech recognition, text-to-speech, and machine translation within the reach of every developer. With machine learning (ML) services by AWS, you can plug in prebuilt AI functionality into your apps without having to worry about ML models. Thousands of developers have used Amazon ML […]

Create video subtitles with translation using machine learning

Businesses from around the globe require fast and reliable ways to transcribe an audio or video file, and often in multiple languages.  This audio and video content can range from a news broadcast, call center phone interactions, a job interview, a product demonstration, or even court proceedings.  The traditional process for transcription is both expensive […]

Amazon Transcribe now lets you designate your own Amazon S3 buckets to store transcription outputs  

Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for you to add a speech-to-text capability to your applications. You can use Amazon Transcribe to create text transcripts of audio and video files. Starting today, you can designate your own S3 buckets to store transcription outputs rather than S3 buckets maintained […]

Monitor Amazon Transcribe applications with AWS CloudTrail and Amazon CloudWatch Events

Monitoring your AWS resources is critical for security, performance, compliance, and cost control purposes. Therefore, our customers always ask for features to enable monitoring. Today, we are pleased to announce that Amazon Transcribe is integrated with AWS CloudTrail and Amazon CloudWatch Events to give you more visibility and control of your Amazon Transcribe resources. Let’s […]

VidMob combines computer vision and language AI services for data-driven creative asset production

VidMob is a social video creation platform that marketers of all sizes can use to develop personalized advertising communications at scale. VidMob uses machine learning (ML) to power its SaaS application. This application uses metadata extraction and sentiment analysis to provide marketers with actionable insights into which creative assets resonate with their intended audience, and […]