Overview
Experience the fastest inference and fine-tuning platform with Fireworks AI. Utilize state-of-the-art open-source AI models at blazing speed, optimized for your use case, scaled globally with the Fireworks Inference Cloud
- Own Your AI: Control your models, data, and costs
- Customize Your AI: Tune model quality, speed, and cost to your use case
- Scale effortlessly: Run production workloads globally with 99.9% SLA
- Access 1000s of models: Day-0 support for models like DeepSeek, Kimi, gpt-oss, Qwen, etc.
Start in seconds and pay-per-token with our serverless deployment.
Or
Use our dedicated deployments, fully optimized to your use case.
Highlights
- Build: Prototype Instantly1000s of Day-Zero Optimized Open Models: Instantly access a vast, pre-optimized library of state-of-the-art open-source models (text, image, audio, multimodal).Launch with Zero Overhead: Go from idea to output in second-with just a prompt. Run the latest models on Fireworks serverless, with no GPU setup
- Tune: Perfect Your Usecase Your use case is unique. The most valuable AI is built by combining models with your product data. Fireworks AI empowers you to own the full lifecycle of your Generative AI applications, ensuring maximum performance and control. Leverage advanced reinforcement fine-tuning to custom-train models on your proprietary data without complexity. Fine-Tune with our LoRA-based service, twice as cost-efficient as other providers
- Scale: Deploy Anywhere, Effortlessly Managed Infrastructure: We abstract away the complexity of managing GPU infrastructure, offering auto-scaling dedicated or on-demand deployments. Deploy Globally: Scale production workloads seamlessly across AWS. Continuous Performance Optimization: Our infrastructure maximizes your model's performance at all times, ready to handle massive spikes and mission-critical traffic.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/unit |
|---|---|---|
Fireworks_PAYG | $ / 1M tokens | $10,000.00 |
Vendor refund policy
All fees are non-refundable and non-cancellable except as required by law.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
Email support services are available from Monday to Friday.
support@fireworks.ai
support@fireworks.ai
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products
