
    Fireworks AI (Pay-As-You-Go)

Experience the fastest inference and fine-tuning platform with Fireworks AI. Utilize state-of-the-art open-source models, fine-tune them, or deploy your own at no additional cost. Access a diverse library of models across modalities, including text, vision, embedding, audio, image, and multimodal, to build and scale your AI applications efficiently.

    - Blazing-fast inference for 100+ models
    - Fine-tune and deploy in minutes
    - Building blocks for compound AI systems

    Start in seconds and pay per token with our serverless deployment, or use our dedicated deployments, fully optimized to your use case.

    Overview

Experience the fastest inference and fine-tuning platform with Fireworks AI. Utilize state-of-the-art open-source AI models at blazing speed, optimized for your use case and scaled globally with the Fireworks Inference Cloud.
    - Own Your AI: Control your models, data, and costs
    - Customize Your AI: Tune model quality, speed, and cost to your use case
    - Scale effortlessly: Run production workloads globally with 99.9% SLA
    - Access 1000s of models: Day-0 support for models like DeepSeek, Kimi, gpt-oss, Qwen, etc.

Start in seconds and pay per token with our serverless deployment, or use our dedicated deployments, fully optimized to your use case.

    Highlights

• Build: Prototype Instantly. 1000s of Day-Zero Optimized Open Models: instantly access a vast, pre-optimized library of state-of-the-art open-source models (text, image, audio, multimodal). Launch with Zero Overhead: go from idea to output in seconds with just a prompt. Run the latest models on Fireworks serverless, with no GPU setup.
• Tune: Perfect Your Use Case. Your use case is unique, and the most valuable AI is built by combining models with your product data. Fireworks AI empowers you to own the full lifecycle of your generative AI applications, ensuring maximum performance and control. Leverage advanced reinforcement fine-tuning to custom-train models on your proprietary data without complexity. Fine-tune with our LoRA-based service, twice as cost-efficient as other providers.
• Scale: Deploy Anywhere, Effortlessly. Managed Infrastructure: we abstract away the complexity of managing GPU infrastructure, offering auto-scaling dedicated or on-demand deployments. Deploy Globally: scale production workloads seamlessly across AWS. Continuous Performance Optimization: our infrastructure maximizes your model's performance at all times, ready to handle massive spikes and mission-critical traffic.

    Details

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.


    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

    Pricing

    Fireworks AI (Pay-As-You-Go)

    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time.
Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

    Usage costs (1)

Dimension: Fireworks_PAYG
    Description: $ / 1M tokens
    Cost/unit: $10,000.00
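Pay-per-token billing reduces to simple arithmetic on the metered dimension above. A minimal sketch, using an illustrative rate since actual per-model prices vary and the listed dimension price may be a contract-level figure:

```python
# Estimate a pay-per-token bill from a $/1M-token rate.
# The rate used below is illustrative only, not a quoted price.

def token_cost(tokens: int, rate_per_million: float) -> float:
    """Dollars billed for `tokens` at `rate_per_million` $/1M tokens."""
    return tokens / 1_000_000 * rate_per_million

# e.g. 250k combined prompt + completion tokens at $0.90 per 1M tokens
print(f"${token_cost(250_000, 0.90):.4f}")  # → $0.2250
```

The same function applies to any usage dimension that bills per million units; only the rate changes.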

    Vendor refund policy

    All fees are non-refundable and non-cancellable except as required by law.


    Legal

    Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA).

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information


    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Email support services are available from Monday to Friday.
    support@fireworks.ai 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Similar products

    Customer reviews

    Ratings and reviews

3.9 (6 ratings)
    5 star: 33%
    4 star: 33%
    3 star: 33%
    2 star: 0%
    1 star: 0%
    2 AWS reviews | 4 external reviews
    External reviews are from G2 and PeerSpot.
    reviewer2817207

    Fast AI workflows have reduced latency and now support real-time chatbots and batch summaries

    Reviewed on Apr 25, 2026
    Review from a verified AWS customer

    What is our primary use case?

Fireworks AI's main use cases are LLM workloads and real-time applications like chatbots and content services. The platform offers a low-latency interface and scalable AI APIs. It also helps with fine-tuning and hosting models without managing GPUs directly, providing AI as a comprehensive package.

A specific real-world example of how I use Fireworks AI is for a customer support chatbot that I built for a client who needed real-time responses. Previously, the response time was around two to three seconds using other APIs. With Fireworks AI, I reduced the latency by nearly 40%. This made the conversation feel more natural and improved user satisfaction significantly. The platform also handled traffic spikes without throttling.

    In addition to chatbot use cases, I have used Fireworks AI for batch processing tasks such as summarizing large data sets. The API flexibility allowed me to integrate it easily with our batching services. I also experimented with fine-tuning models for domain-specific output, which helped improve response accuracy in niche use cases.

    What is most valuable?

    One of the best features in Fireworks AI is the high-performance inference engine. It supports optimized model serving, which significantly reduces latency. I also appreciate the easy API integration and the ability to scale automatically. The platform is clearly built with the developer in mind.

    The positive impact of Fireworks AI on my organization has been significant. I was able to launch AI-powered features faster than expected. It also improved product quality with faster, more accurate responses. This gave me a competitive edge in client projects.

    What needs improvement?

    In terms of metrics, I saw around a 30 to 40% reduction in inference latency. Infrastructure management effort dropped by nearly 50%. I also saved close to 20% in operational costs compared to the previous solution. This gain made a real difference in productivity.

    Regarding improvements for Fireworks AI, one area where it can improve is the documentation depth for advanced use cases. While the basic setup is easy, complex configurations need more clarity. Additionally, debugging certain API issues can take time. These are minor but noticeable gaps.

    Small improvements like better code examples and a step-by-step guide would help considerably. More detailed error messages would also make debugging easier. A few real-world implementation tutorials would speed up onboarding. These adjustments would enhance the developer experience.

    Improvements in documentation and monitoring would push it closer to a perfect score. Overall, it is a very strong platform.

    For how long have I used the solution?

    I have used Fireworks AI for half a year.

    What do I think about the stability of the solution?

    Fireworks AI's stability has been solid so far. I have not faced any major outages, and performance is consistent even under load. That reliability is important for a production system.

    What do I think about the scalability of the solution?

    Scalability is one of Fireworks AI's strongest aspects. It handles traffic spikes without manual intervention. I have scaled from small tests to production workloads smoothly. That flexibility is a significant advantage.

    How are customer service and support?

    Customer support has been responsive and helpful. Most of my queries were resolved within a reasonable time, and the technical guidance was also quite useful. Overall, it was a positive experience.

    Which solution did I use previously and why did I switch?

    Previously, I was using the standard OpenAI API and some self-hosted models. The main issues were the latency and cost at scale. I switched to Fireworks AI for better performance and control. That decision improved both performance and cost-efficiency.

    How was the initial setup?

    The pricing felt competitive, especially considering the performance benefits. The setup was quick, and I was able to get started within a day. There were no major complications during onboarding, and the overall experience was smooth.

    What was our ROI?

    The ROI has been quite positive. I am saving time on infrastructure setup and have reduced operational costs. Faster deployment also meant quicker project delivery. In terms of efficiency, it definitely paid off.

    Which other solutions did I evaluate?

I evaluated alternatives like Together AI and AWS Bedrock. While they are good, Fireworks AI offered better latency. Its pricing balance and developer-friendly approach also stood out, and that is why I chose it.

    What other advice do I have?

    My advice would be to clearly define your use cases before starting. Take advantage of its scalability and performance features. Also, test the different models to find the best fit. With the right setup, it can deliver great results.

    Overall, Fireworks AI is a powerful platform for deploying and scaling AI models. It simplifies a lot of complex infrastructure challenges. With a few improvements, it can become even stronger. It is definitely a great choice for AI-driven applications. I would rate this product 8.5 out of 10.

    Which deployment model are you using for this solution?

    Public Cloud


    Hussain Gagan

    Gaining faster, flexible AI workflows has made our team ship reliable features with confidence

    Reviewed on Apr 20, 2026
    Review from a verified AWS customer

    What is our primary use case?

Our main use case for Fireworks AI is running LLM-based APIs for things like summarization and internal search. We didn't want to rely fully on a closed model, so Fireworks AI helped us run an open-source model with decent performance. It fits well for production APIs where latency matters.

    We also experimented with embeddings and some lightweight fine-tuning in Fireworks AI. Not everything made it to production, but it was useful for testing different models quickly. It's good for teams that want flexibility rather than a fixed model.

    What is most valuable?

    The best features Fireworks AI offers are speed and control over models. You can pick different open-source models and switch fairly easily. Additionally, the API layer feels developer-friendly.

    The API layer in Fireworks AI is developer-friendly because its consistency is a major factor. It follows standard OpenAI-compatible endpoints, which meant we could swap out models or integrate new ones without rewriting our entire service layer. For example, when we wanted to test a new Llama 3 variant against our existing deployment, it was literally just a one-line change in our configuration.
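The "one-line change" the reviewer describes follows from the OpenAI-compatible request shape: swapping models only changes the `model` string in the request body. A minimal sketch; the base URL and model identifiers below are illustrative assumptions, not confirmed values from this listing:

```python
# Sketch of swapping models against an OpenAI-compatible endpoint.
# BASE_URL and the model identifiers are illustrative assumptions.

BASE_URL = "https://api.fireworks.ai/inference/v1"  # assumed OpenAI-compatible base

def chat_payload(model: str, prompt: str) -> dict:
    """Build a standard chat-completions request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

# Testing a different model variant is literally one changed string:
current = chat_payload("accounts/fireworks/models/llama-v3-8b-instruct", "Hi")
candidate = chat_payload("accounts/fireworks/models/llama-v3-70b-instruct", "Hi")
```

Because the rest of the payload is identical, the surrounding service layer never needs to change when comparing model variants.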

    The fine-tuning and customization options in Fireworks AI are useful, even though we didn't go very deep into them. The ability to experiment with multiple models in one setup is underrated. It saves time when comparing outputs. Fireworks AI has positively impacted our organization by making our AI features feel more production-ready instead of experimental. Teams became more confident shipping AI-based features, which also reduced dependency on a single vendor.

    Since we started using Fireworks AI, we've seen around a 20 to 30% improvement in latency for some endpoints. Cost-wise, we've achieved approximately 15 to 25% savings depending on the model we use. Nothing extraordinary, but definitely meaningful.

    What needs improvement?

Fireworks AI's documentation could be clearer in some areas, especially around advanced configurations. Additionally, debugging model behavior isn't always straightforward; sometimes we have to guess what's going wrong.

    Better real-world examples in the documentation would help, and debugging tools could be more visual instead of just logs. Some edge cases take longer to troubleshoot than expected.

    For how long have I used the solution?

    I've been using Fireworks AI for around six to eight months now, mainly in back-end services for AI-powered features. Overall, it's been pretty solid, especially for inference-heavy workloads. The setup was quicker than I expected.

    What do I think about the stability of the solution?

    Fireworks AI is pretty stable overall in my opinion. We didn't face any major outages, just occasional slowdowns. Nothing critical occurred.

    What do I think about the scalability of the solution?

In terms of scalability, Fireworks AI scales very well from what we have observed. We tested it with moderate traffic, and it handled the load very well. It's clearly built for production workloads.

    How are customer service and support?

    I didn't interact heavily with Fireworks AI's customer support, but when we did, responses were decent. Responses were not super fast, but helpful enough.

    Which solution did I use previously and why did I switch?

    We were mostly using hosted APIs from bigger providers before using Fireworks AI. We switched mainly for cost control and flexibility with models. I also wanted better performance for certain use cases.

    How was the initial setup?

    Setup was fairly quick, maybe a day or two to get something running. Fine-tuning took longer to understand.

    What was our ROI?

    The return on investment with Fireworks AI has been decent. We've experienced faster iteration and slightly lower costs, as well as reduced engineering time spent managing infrastructure ourselves. The savings are not huge, but definitely worth it.

    Which other solutions did I evaluate?

    Before choosing Fireworks AI, we looked at things such as Together AI and some direct cloud GPU setups. We also briefly considered sticking with OpenAI APIs. Fireworks AI felt like a good middle ground.

    What other advice do I have?

    My advice regarding using Fireworks AI would be to go in with a clear use case instead of just experimenting randomly. Additionally, spend time understanding model selection, as that makes a big difference. Don't expect everything to work perfectly out of the box.

Fireworks AI is a good option if you want more control over your AI stack without managing everything yourself. Fireworks AI is not perfect, but definitely practical for real-world use. I found Fireworks AI to be a valuable tool in streamlining our workflows. I would definitely recommend exploring its capabilities for businesses looking to enhance their operations. I rate this solution an eight out of ten overall.

    Which deployment model are you using for this solution?

    Public Cloud


    Amar-Kumar

    Chatbot exploration has enabled personalized product and offer recommendations for users

    Reviewed on Apr 07, 2026
    Review provided by PeerSpot

    What is our primary use case?

My main use case for Fireworks AI is to build a chatbot and recommendation engine to recommend products to users of my application. Since I work in a QSR-based domain, I want to give recommendations such as showing potato fries as an option if a burger is added to the cart, which is the type of automation I want to achieve with Fireworks AI.

    I envision the chatbot working for my users by handling common queries and focusing on product suggestions. As a core technical person, I explore everything about AI products, and I am currently using Fireworks AI to understand what we can achieve with our chatbot for queries such as 'Where is my order?' or 'Give me the list of products under happy hour offers.'

    I am focusing on the chatbot and recommendation engine, which are the major use cases I am exploring, including other AI options, not only Fireworks AI.
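The cart-pairing behavior described above (burger in cart, suggest fries) can be prototyped as a plain lookup rule before any model is involved. A minimal sketch; the item names and pairings are hypothetical:

```python
# Minimal rule-based add-on recommender for the QSR cart scenario.
# Item names and pairings below are hypothetical examples.

PAIRINGS = {
    "burger": ["potato fries", "soft drink"],
    "pizza": ["garlic bread"],
}

def recommend(cart: list[str]) -> list[str]:
    """Suggest add-ons for cart items, skipping anything already in the cart."""
    suggestions = []
    for item in cart:
        for addon in PAIRINGS.get(item, []):
            if addon not in cart and addon not in suggestions:
                suggestions.append(addon)
    return suggestions

print(recommend(["burger"]))  # → ['potato fries', 'soft drink']
```

A hosted model can then replace or augment the static `PAIRINGS` table once the rule-based baseline is validated.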

    What is most valuable?

    Based on my exploration so far, I find that Fireworks AI offers a platform where I can run and build my own AI models, which I consider to be the best feature. Fireworks AI has positively impacted my organization by fulfilling my use cases to some extent, and I definitely want to explore more as it is close to addressing my needs.

    What needs improvement?

    When exploring the flexibility or ease of use of Fireworks AI, I find that it is too early to say, but I can say that it is easy to understand and integrates easily by following the given steps.

    Based on my exploration so far, I find that it is too early to judge any improvements or negative aspects of Fireworks AI, as I am still in the exploration phase.

    For how long have I used the solution?

    I have been using Fireworks AI for a few days in the exploration phase only, and I have not implemented it yet.

    What do I think about the stability of the solution?

Fireworks AI has been stable in everything I have seen during my exploration so far.

    What do I think about the scalability of the solution?

Regarding scalability, Fireworks AI appears to be a scalable product based on my exploration.

    How are customer service and support?

    I have not had the chance to contact or connect with Fireworks AI customer support.

    What other advice do I have?

    My advice for others looking into using Fireworks AI is that if you have a use case where you need to build or run your pre-existing model or a model provided by Fireworks AI, then you should go with it. You can build your own chatbot and provide a personalized experience. For example, in the entertainment industry, similar to a Jio application, I can recommend videos as per user preferences, such as suggesting cartoon videos for children based on their age while ensuring the content is informative for both parents and children.

    I rate Fireworks AI an eight out of ten based on my exploration. I chose eight out of ten because I explored it for the chatbot and recommendation engine, which align with my use case, and this rating may change in the future.

    Liraz A.

    One Stop AI Model Shop

    Reviewed on Nov 14, 2024
    Review provided by G2
    What do you like best about the product?
So many AI models to choose from... Love the option of the playground.
    What do you dislike about the product?
Pretty hard to get started; they really need a quickstart guide. And because the site is so full of features, a tour would be nice.
    What problems is the product solving and how is that benefiting you?
Helping me choose the right model for my day-to-day use.
    reviewer2588646

    Enhanced text-to-image creation with solid API and fine-tuning support

    Reviewed on Nov 06, 2024
    Review provided by PeerSpot

    What is our primary use case?

We primarily use Fireworks AI for text-to-image generation. We are developing a platform for artists to sell their art styles, where the system helps them tune a model and then sell images generated from their signature.

    How has it helped my organization?

Fireworks AI has helped our organization by enabling us to create a platform for artists to sell their art styles. I am not the user of the solution. I'm the developer. It helps me do my job effectively.

    What is most valuable?

Fireworks AI has a solid API and is quite easy to interact with. It has better documentation and logs, which are important for me as a developer. Additionally, it has a bigger infrastructure and provides nice support for fine-tuning the Flux AI model.

    What needs improvement?

Returning the amount charged for each generation event would improve Fireworks AI. When using the API, it does not return information about the charges for image generation, which would be useful for our solution.
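Until the API reports charges itself, one workaround is estimating cost client-side from a locally maintained price table keyed by model. A minimal sketch; the model names and per-image prices are hypothetical and would need to be kept in sync with the vendor's published pricing:

```python
# Client-side charge estimate for image generation, since the API response
# does not include the amount billed. Model names and prices here are
# hypothetical placeholders, not published rates.

PRICE_PER_IMAGE = {"model-a": 0.014, "model-b": 0.003}  # $/image, illustrative

def estimate_charge(model: str, num_images: int) -> float:
    """Estimate dollars billed for one image-generation request."""
    return PRICE_PER_IMAGE[model] * num_images

print(f"${estimate_charge('model-a', 4):.3f}")  # 4 images at the model-a rate
```

The estimate drifts if the vendor changes prices, which is exactly why having charges in the API response would be preferable.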

    For how long have I used the solution?

    I have been using Fireworks AI for about four months.

    What do I think about the stability of the solution?

    Fireworks AI is pretty stable, and I have not encountered any problems.

    What do I think about the scalability of the solution?

    Fireworks AI offers a very complete API, and its scalability is impressive.

    Which solution did I use previously and why did I switch?

    I previously used Okta. It was discontinued, so we opted for Fireworks AI.

    How was the initial setup?

    The initial setup was fairly easy. It took about eight to ten days, including integrating it into our solution, testing, and moving from scratch to production.

    What's my experience with pricing, setup cost, and licensing?

    I cannot comment on pricing or setup cost since others handle that aspect. As a developer, I primarily use the API.

    Which other solutions did I evaluate?

    I have evaluated SAL as an alternative solution.

    What other advice do I have?

    I'd rate the solution ten out of ten.
