AWS News Blog

Category: Amazon Machine Learning

Introducing Amazon Nova 2 Sonic: Our new speech-to-speech model for conversational AI

Build more natural voice interactions with enhanced speech-to-speech AI, now featuring multilingual conversations, dynamic speech control, crossmodal inputs, and improved telephony integration, all while maintaining conversation context across tasks.

AWS Weekly Roundup

AWS Weekly Roundup: How to join AWS re:Invent 2025, plus Kiro GA, and lots of launches (Nov 24, 2025)

Next week, don’t miss AWS re:Invent, Dec. 1-5, 2025, for the latest AWS news, expert insights, and global cloud community connections! Our News Blog team is finalizing posts to introduce the most exciting launches from our service teams. If you’re joining us in person in Las Vegas, review the agenda, session catalog, and attendee guides […]

AWS Weekly Roundup

AWS Weekly Roundup: OpenAI partnership, Jane Goodall Institute research archive, and more (November 10, 2025)

AWS re:Invent 2025 is only 3 weeks away and I’m already looking forward to the new launches and announcements at the conference. Last year brought 60,000 attendees from across the globe to Las Vegas, Nevada, and the atmosphere was amazing. Registration is still open for AWS re:Invent 2025. We hope you’ll join us in Las Vegas […]

AWS Weekly Roundup

AWS Weekly Roundup: Project Rainier online, Amazon Nova Web Grounding, MCP Proxy for AWS, and more (November 3, 2025)

Last week I met Jeff Barr at the AWS Shenzhen Community Day. Jeff shared stories about how builders around the world are experimenting with generative AI and encouraged local developers to keep pushing ideas into real prototypes. Many attendees stayed after the sessions to discuss model grounding, evaluation, and how to bring generative AI into […]

Amazon Nova Multimodal Embeddings: State-of-the-art embedding model for agentic RAG and semantic search

Amazon Nova Multimodal Embeddings is a new state-of-the-art embedding model available in Amazon Bedrock. It is the industry’s first embedding model that supports text, documents, images, video, and audio through a single unified model, enabling cross-modal retrieval and semantic search applications with leading accuracy at industry-leading cost.