Amazon Transcribe now supports automatic language identification for multi-lingual real-time audio streams

Posted On: Nov 16, 2023

Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for you to add speech-to-text capabilities to your applications. Today, we are excited to announce automatic language identification support for multi-lingual streams. If you operate in a country with multiple official languages or across multiple regions, your audio streams can contain several languages and switch between them. For such use cases, you can enable multi-language identification, which identifies all languages spoken in your stream and creates a transcript using each identified language. This means that if speakers change languages mid-conversation, or if each participant speaks a different language, your transcription output detects and transcribes each language correctly.
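As a rough illustration of enabling the feature, the sketch below assembles request parameters for a streaming session. It is a minimal sketch, not an official client: the helper `build_multi_language_request` is hypothetical, and the parameter names (`IdentifyMultipleLanguages`, `LanguageOptions`, `PreferredLanguage`, `MediaEncoding`, `MediaSampleRateHertz`) and the two-language minimum are assumptions based on the `StartStreamTranscription` API reference. Verify them against the current Amazon Transcribe documentation before use.

```python
from typing import Dict, List, Optional, Union


def build_multi_language_request(
    language_options: List[str],
    preferred_language: Optional[str] = None,
    sample_rate_hz: int = 16000,
    media_encoding: str = "pcm",
) -> Dict[str, Union[str, int, bool]]:
    """Hypothetical helper: assemble parameters for a streaming request
    with multi-language identification turned on.

    Parameter names are assumed to mirror the StartStreamTranscription
    API; this function does not call AWS.
    """
    # Remove duplicates while keeping the caller's ordering.
    options = list(dict.fromkeys(language_options))

    # Assumption: identification needs at least two candidate languages.
    if len(options) < 2:
        raise ValueError("provide at least two candidate language codes")

    # A preferred language only makes sense if it is one of the candidates.
    if preferred_language is not None and preferred_language not in options:
        raise ValueError("preferred_language must be one of language_options")

    params: Dict[str, Union[str, int, bool]] = {
        "IdentifyMultipleLanguages": True,
        # Assumed to be sent as a comma-separated string of language codes.
        "LanguageOptions": ",".join(options),
        "MediaSampleRateHertz": sample_rate_hz,
        "MediaEncoding": media_encoding,
    }
    if preferred_language is not None:
        params["PreferredLanguage"] = preferred_language
    return params


if __name__ == "__main__":
    # Example: a stream expected to switch between English and Hindi.
    print(build_multi_language_request(["en-US", "hi-IN"], preferred_language="en-US"))
```

Note that no fixed language code is set in this sketch, since the service is expected to pick the language per segment from the candidates supplied.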

Live streaming transcription is used across industries in contact center applications, broadcast events, meeting captions, and e-learning. With a minimum of 3 seconds of audio, Transcribe can efficiently generate transcripts in the spoken languages without requiring anyone to specify the language.

Automatic language identification for multilingual audio is supported, at no additional cost, for all 14 languages currently supported for streaming transcription. It is available in the following AWS Regions: US East (Ohio), US East (N. Virginia), US West (Oregon), Asia Pacific (Mumbai), Asia Pacific (Singapore), Asia Pacific (Seoul), Asia Pacific (Sydney), Asia Pacific (Tokyo), Africa (Cape Town), Canada (Central), Europe (Frankfurt), Europe (Ireland), Europe (London), South America (São Paulo), and AWS GovCloud (US-West). To learn more, see the Amazon Transcribe documentation page, or visit the AWS console to try it out.

11/21/23 - This post has been updated to reflect the correct AWS GovCloud (US) Region (US-West).
