Overview
This is a repackaged open source software product wherein additional charges apply for support. Scrapy is a powerful and flexible web scraping framework built in Python that allows developers to create spiders for crawling websites and extracting useful information. It works using a request and response cycle where the spider sends requests to web pages and processes the returned responses to extract data using selectors like XPath or CSS. Scrapy includes built in support for handling requests, managing sessions, following links, and exporting data into formats such as JSON, CSV, and XML. It is designed for high performance and can handle large scale scraping tasks efficiently by using asynchronous networking. Scrapy also supports middleware and pipelines which allow customization of data processing and request handling. Due to its structured architecture and speed, it is widely used in industries for data mining, price monitoring, research, and automation tasks.
Highlights
- Fast and scalable web scraping framework
- Uses spiders to crawl and extract data
- Supports multiple data export formats like JSON and CSV
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Cost/hour |
|---|---|
m4.large Recommended | $0.07 |
t3.micro | $0.07 |
t2.micro | $0.01 |
t2.large | $0.07 |
r4.large | $0.07 |
r3.large | $0.07 |
t3.large | $0.07 |
t3.nano | $0.07 |
t2.2xlarge | $0.07 |
t2.medium | $0.07 |
Vendor refund policy
No refund
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Packaged with latest updates as of April/2026
Additional details
Usage instructions
Connect your instance via SSH, the username is ubuntu. More info on SSH: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/AccessingInstancesLinux.html - Run the following commands: #sudo su #cd /opt #source scrapy-env/bin/activate #scrapy version
Support
Vendor support
Feel free to reach out anytime. Our support team is available 24x7 for assistance. Email: meha@kcloudhubs.com
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.