- Artificial Intelligence›
- Generative AI›
- Amazon Nova›
- Nova Forge
Amazon Nova Forge
Build your own frontier models using Nova
Amazon Nova Forge: Build your own frontier models using Nova
Nova Forge is a new service to build your own frontier models using Nova. Nova Forge customers can start their development from early model checkpoints, blend proprietary data with Amazon Nova-curated training data, and host their custom models securely on AWS.
Nova Forge is the easiest and most cost-effective way to build your own frontier model.
Benefits
Start your custom model development on SageMaker AI using early Nova checkpoints across pre-training, mid-training, or post-training phases. This lets you introduce your proprietary data at the optimal point in the model training, maximizing the model’s learning from your data.
Blend proprietary data with Amazon Nova-curated training data using Amazon provided SageMaker recipes. This approach lets you build a model that deeply understands your organization’s proprietary knowledge, while minimizing risks like catastrophic forgetting and preserving foundational capabilities like reasoning.
Integrate reward functions in your environment for Reinforcement Fine Tuning (RFT). This allows the model to learn from feedback generated in your environment from your applications.
Use the responsible AI toolkit available in Nova Forge to configure the safety and content moderation settings of your custom model. You can adjust settings to meet your specific business needs in areas like safety, security, and handling.
Stay at the forefront of AI technology with early access to new Nova models, including Nova 2 Pro and Nova 2 Omni.
Control and flexibility across all model training phases
Maximize learning in the earliest stage of training
Customers with large volumes of unstructured data can introduce their data through Continued Pre-Training (CPT). Starting with the pre-trained checkpoint ensures that the new datasets are introduced to the model when it is at its peak for learning new domains, while blending in Nova training data to minimize risks like catastrophic forgetting of fundamental capabilities.
Enhance model capabilities using specialized datasets
For customers with intermediate volumes of unstructured data, Nova Forge provides model checkpoints and recipes to introduce data in mid-training, where the propensity to learn from new training data is not set as high as in pre-training. As with pre-training, customers can blend their proprietary data with Amazon Nova-curated training data during the mid-training phase. This allows the model to absorb domain-specific knowledge while retaining general capabilities like language comprehension and reasoning, but with more conservative learning rate to avoid catastrophic forgetting.
Generate effective application-specific responses through SFT and data-mixing
Supervised Fine-Tuning uses labelled data, such as instruction-response datasets, to teach the model how to respond to patterns of queries. Customers can choose to run Supervised Fine-Tuning on a custom base checkpoint they have created, Nova’s pre-built base checkpoint, or Nova’s instruction-tuned model – depending on their needs and training data availability. As with CPT, customers can mix their proprietary data with Amazon Nova-curated labelled data sets during Supervised Fine-Tuning. This enables customers to train the model for their specialized applications while maintaining broad model capabilities such as instruction-following.
Maximize accuracy and align the model with real-world feedback and simulations
Reinforcement Learning (RL) further refines the model's behavior using reward signals and human feedback. In the RL phase, Nova Forge customers can align their model using feedback from remote reward functions. This allows them to use feedback from custom environments with proprietary tools and verifiers through an API during alignment. Unlike the industry standard of simple reward functions defined in small Python files, this API-based approach enables customers to integrate sophisticated custom environment harnesses and run them at scale. Examples of custom reward functions include physics simulators, complex code evaluation using internal systems with nested tool-calling, and robotics tasks against proprietary testing frameworks.
Nova customization and training capabilities
Working with Nova Forge is allowing us to improve content moderation on Reddit with a more unified system that's already delivering impressive results. We're replacing a number of different models with a single, more accurate solution that makes moderation more efficient. The ability to replace multiple specialized ML workflows with one cohesive approach marks a shift in how we implement and scale AI across Reddit. After seeing these early successes in our safety efforts, we're eager to explore how Nova Forge might help in other areas of our business.
Chris Slowe
CTO, Reddit
We’re using Nova Forge to build a unified drug discovery assistant that can predict molecular properties, reason through chemistry problems, and generate novel drug candidates. By exploring thousands of candidates computationally before testing in the lab, where each experiment costs in the thousands, we can bring better medicines to patients faster while reducing costs. Through supervised fine-tuning and reinforcement fine-tuning with Nova 2 Lite, we have already outperformed existing large language models such as Sonnet 4 by 20-50% on property prediction tasks; exceeded or matched the performance of several specialized GNN models on the same tasks, and we are now moving into molecular generation.
Leela Dodda
Director of Computational Chemistry, Nimbus Therapeutics
Nova Forge enables us to build industry-specific LLMs as a compelling alternative to open-weight models. Running on SageMaker AI with managed training infrastructure, we can efficiently develop specialized models like our Japanese financial services LLM by combining Amazon Nova-curated data with our proprietary datasets.
Takahiko Inaba
Head of AI and Managing Director, Nomura Research Institute, Ltd.
At Cosine AI, we are constantly pushing the boundaries of software development agents through reinforcement fine-tuning. We co-designed the API-based approach from Nova Forge, which allows us to use our internal tools and environment for the model to learn and optimize for the exact challenges your business faces, a crucial component of how we have achieved state-of-the-art.
Yang Li
Co-founder, Cosine AI - Yang Li, Co-founder, Cosine AIWe are utilizing the Nova Forge program to build state-of-the-art AI for our diverse business and operations. At Sony Group, we are challenging ourselves to increase the efficiency of the review and assessment process by 100x, using an AI Agent powered by a model developed through Nova Forge. Using reinforcement fine-tuning, early results show that we are exceeding the performance of larger models, while benefitting from Nova's latency and price performance.
Masahiro Oba
Senior General Manager of AI Acceleration Division, Digital & Technology Platform for Sony Group Corporation
Nova 2 Lite enables us to develop the next generation conversational experience, reinventing how users will interact with the Siemens website. Leveraging Nova's fine-tuning capabilities, we can optimize the contextual output of our Retrieval-Augmented Generation (RAG) system, refine the relevance of tool calls, and increase the overall accuracy of search results.
Fabian Fischer
Enterprise Architect, SiemensDid you find what you were looking for today?
Let us know so we can improve the quality of the content on our pages