Skip to main content

Generative AI

Amazon Nova Sonic

A speech-to-speech foundation model for conversational AI

What is Amazon Nova Sonic?

Amazon Nova Sonic delivers real-time, human-like voice conversations with leading price performance and low latency. Available in Amazon Bedrock via the bidirectional streaming API, the model understands streaming speech in various speaking styles and generates expressive speech responses that dynamically adapt to the prosody of input speech.

Amazon Nova Sonic supports expressive voices, including both masculine-sounding and feminine-sounding voices, in English, Spanish, French, Italian, and German. The model can be utilized across a wide range of applications, including customer support call automation, outbound marketing, voice-enabled personal assistants and agents, and interactive education and language learning.

Key capabilities

Learn more about Amazon Nova Sonic capabilities

Handles user interruptions and detects non-verbal cues (e.g., laughter, grunts, inter-sentential pauses, and hesitations) to enable human-like turn-taking in dialogues.

Nova Sonic’s unified architecture enables it to adapt speech responses to the user’s tone and sentiment.

Bidirectional streaming speech I/O with low user perceived latency.

Accurately recognizes streaming speech across accents with robustness to background noise.

Amazon Nova Sonic supports English (including American and British accents), Spanish, French, Italian, and German.

See Amazon Nova Sonic

Amazon Nova Sonic

Model comparison tables

Discover real-world use cases

Missing alt text value
ASAPP empowers enterprise customers’ contact centers to deliver unmatched customer service through GenerativeAgent, a fully conversational generative Al voice agent.
At ASAPP, we are focused on using generative AI to deliver reliable, secure, and high-performing solutions for improving customer service in contact centers. We’ve been particularly impressed by Amazon Nova Sonic’s highly accurate speech understanding capabilities which allow for more natural voice interactions and precise dialog handling over telephony. We’re excited to continue using Nova Sonic to deliver secure, high-quality, and precise conversations.

Nirmal Mukhi

Nirmal Mukhi, VP AI Engineering at ASAPP
Missing alt text value
Stats Perform is a sports data and AI technology provider, serving global media organizations and professional sports teams.
Our goal is to empower the world’s top sports broadcasters, media, federations and teams with magic in the detail of our vast live and historical Opta sports dataset. We’ve been testing Amazon Nova Sonic and have been particularly impressed by the system's low latency, which enables near- instantaneous responses even to complex queries of our model. The intuitive prompting capability and ease of setup have exceeded our expectations, making implementation simple. Overall, Nova Sonic has proven to be a fantastic solution.

Mike Perez

Mike Perez, Chief Operating Officer at Stats Perform
Missing alt text value
Education First is a leader in international education through its network of schools and offices in over 50 countrie
Amazon Nova Sonic enables EF students to practice new vocabulary and refine their pronunciation in a dynamic learning environment. The model is capable of accurately understanding non-native English speakers with a variety of accents. We were also impressed with the barge-in feature of Nova Sonic, where the model quickly reacts to interruptions. The scalability and reliability of the technology will allow us to expand our capacity to serve a larger student population.

Tim Hesse

Tim Hesse, VP AI & Data at Education First

Getting started with Amazon Nova Sonic

This video provides a step-by-step tutorial on how to use Amazon Nova Sonic in Amazon Bedrock to build your own voice-enabled bot.

Did you find what you were looking for today?

Let us know so we can improve the quality of the content on our pages