Overview
Firecrawl is a self-hosted web scraping API that turns websites into clean markdown and structured data for AI, search, and automation workflows.
This AMI runs Firecrawl with Docker Compose on Ubuntu 24.04 LTS. It includes the Firecrawl API, Playwright service, Redis, RabbitMQ, and PostgreSQL, with helper commands for status, logs, smoke testing, and updates.
Getting started:
- Launch the AMI. First boot can take a few minutes while Docker services start.
- SSH into the instance: ssh ubuntu@
- Check status: firecrawl-status
- Run a scrape smoke test: firecrawl-smoke-test
- Call the API: POST http://:3002/v2/scrape
Configuration is in /opt/firecrawl/.env. Add optional provider keys such as OPENAI_API_KEY, proxy settings, SearXNG, or Ollama settings if needed.
Security: open TCP 3002 only to trusted clients or place the service behind your own reverse proxy/VPN. SSH remains on port 22.
Minimum tested instance: t3.medium (2 vCPU, 4GB RAM). Recommended for production: t3.large or larger, especially for concurrent Playwright scraping.
Firecrawl is AGPL-3.0. Source code is included in /opt/firecrawl.
Highlights
- Self-hosted Firecrawl API with Docker Compose on Ubuntu 24.04
- Prebuilt API, Playwright, Redis, RabbitMQ, and PostgreSQL services
- Includes helper commands for status, logs, smoke testing, and updates
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
- ...
Dimension | Cost/hour |
|---|---|
t3.large Recommended | $0.03 |
m8i.48xlarge | $0.05 |
m7i.16xlarge | $0.05 |
m6i.16xlarge | $0.05 |
m8a.metal-48xl | $0.05 |
m7i.48xlarge | $0.05 |
m7a.metal-48xl | $0.05 |
c8a.4xlarge | $0.05 |
m8i.24xlarge | $0.05 |
m8a.metal-24xl | $0.05 |
Vendor refund policy
Refunds are handled according to AWS Marketplace policies.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Initial Firecrawl AMI release. Includes Firecrawl commit 2f407eb on Ubuntu 24.04 with Docker/Compose, API, Playwright service, Redis, RabbitMQ, PostgreSQL, Redis overcommit tuning, first-boot startup hardening, and helper commands: firecrawl-status, firecrawl-logs, firecrawl-smoke-test, and firecrawl-update.
Additional details
Usage instructions
After launch, SSH into the instance: ssh ubuntu@<public-ip>
First boot can take a few minutes while Docker and Firecrawl finish starting.
Useful commands:
- firecrawl-status
- firecrawl-logs
- firecrawl-smoke-test
- sudo systemctl status firecrawl
- sudo systemctl restart firecrawl
Configuration: sudoedit /opt/firecrawl/.env
API endpoint: http://<public-ip>:3002 Scrape endpoint: POST /v2/scrape
Open TCP 3002 in your instance security group only for trusted clients, or put Firecrawl behind your own reverse proxy/VPN.
Minimum tested instance: t3.medium (2 vCPU, 4GB RAM). Recommended: t3.large or larger for production/concurrent Playwright scraping.
Support
Vendor support
Community support via Firecrawl upstream resources. For AMI-specific issues, contact the publisher through AWS Marketplace.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.