Overview
DuckDB 1.4.3 on Ubuntu 24.04 with 24/7 Support by bCloud
DuckDB 1.4.3 on Ubuntu 24.04 is a repackaged open source software offering with optional paid support, optimised for deployment on the AWS Marketplace. DuckDB is a lightweight, high performance, in process SQL OLAP database built specifically for analytical workloads. It delivers blazing fast query performance without requiring any outside services, complex configuration, or client-server setup.
Designed for efficiency and simplicity, DuckDB runs as a single binary inside your application, eliminating network overhead while delivering enterprise-grade analytical capabilities. It is production ready for modern data analytics, ETL pipelines, and cloud-native data science workflows on AWS.
- Blazing fast analytical queries with vectorized execution
- Zero external dependencies runs as a single binary
- In-process database eliminates client-server latency
- Columnar storage optimized for OLAP workloads
- Direct querying of Parquet, CSV, and JSON without data import
- Larger-than-memory processing with automatic disk spilling
- Full ACID transaction support
- Rich SQL dialect with window functions and CTEs
- MIT open-source license
- Runs everywhere from laptops to large AWS EC2 instances
Optimised for AWS Marketplace
This AMI is fully optimized for AWS Cloud environments and designed for fast, secure deployment
- Pre-installed DuckDB on Ubuntu 24.04 LTS
- Python 3 with DuckDB package preconfigured
- CloudWatch Agent for monitoring
- AWS Systems Manager SSM Agent enabled
- ENA drivers for enhanced networking performance
- NVMe drivers for high-speed I/O
- Security hardened OS configuration
- Pre-configured for S3 access and cloud storage
- Ready for data science, analytics, and ETL workloads
Supporting Software used python - 3.12
Key Features
- In-process SQL OLAP database
- Columnar storage with vectorized query execution
- Parallel query processing
- Direct Parquet, CSV, JSON, and NDJSON querying
- Larger than memory workloads with disk spilling
- Full ACID compliance
- Advanced SQL features including window functions, CTEs, and macros
- Extensions for spatial analytics, full-text search, and HTTP/HTTPS
- APIs for Python, R, Java, and Node.js
- Native S3 and cloud storage integration
- Partitioned and Hive-style dataset support
Performance and Analytics
- ranked in ClickBench and TPC benchmarks
- Vectorized OLAP execution engine
- Efficient compression and column pruning
- Filter and projection pushdown
- Join and aggregation optimizations
- Direct streaming of query results to Python and R
Data Source Support
- Native Parquet with metadata pushdown
- CSV with automatic schema detection
- JSON and NDJSON
- Excel support via extensions
- Direct S3 access
- Remote HTTP/HTTPS data sources
- Partitioned datasets with Hive partitioning
- Delta Lake and Apache Iceberg support
Python Data Science Ready
- Native Python API with zero-copy Arrow integration
- Pandas and Polars support without data duplication
- NumPy compatibility
- Jupyter Notebook ready
- SQL directly inside notebooks
- Query results as DataFrames
- User-defined Python functions
- SQL over Pandas DataFrames
Extensions Ecosystem
- Spatial extension PostGIS-compatible
- Full-text search
- JSON processing
- HTTP/HTTPS access
- AWS and S3 extensions
- Parquet and Arrow
- Delta Lake and Iceberg
- Active open-source community ecosystem
Common Use Cases
- Interactive analytics on large datasets
- ETL pipelines processing Parquet files on S3
- Data science workflows with Python and R
- Log analysis and aggregation
- Business intelligence and reporting
- Time-series analytics
- Geospatial analytics using the spatial extension
- Real-time dashboards querying S3 data
- CSV and JSON transformation
- Serverless and cloud-native analytics
Highlights
- DuckDB is designed to be lightweight and embeddable, making it easy to integrate into existing applications without a lot of overhead.
- DuckDB supports standard SQL, making it easy for developers and analysts to use familiar query languages.
- It uses efficient columnar storage and compression techniques to reduce the storage footprint and improve performance.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Cost/hour |
|---|---|
m4.large Recommended | $0.05 |
t3.micro | $0.05 |
t2.micro | $0.001 |
t2.small | $0.05 |
m5.large | $0.05 |
m3.large | $0.05 |
t2.xlarge | $0.05 |
r5.large | $0.05 |
c5.large | $0.05 |
c4.large | $0.05 |
Vendor refund policy
No refund
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Packaged with latest updates as of Jan/2026
Additional details
Usage instructions
Connect your instance via SSH, the username is ubuntu. More info on SSH: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/AccessingInstancesLinux.html - Run the following commands: #sudo su #sudo apt update #cd /opt/duckdb #source duckdb-venv/bin/activate #pip show duckdb
Support
Vendor support
Feel free to reach out anytime. Our support team is available 24x7 for assistance.
Phone: +1 (408) 646-8523
Email: cloud@bcloud.ai
Website:
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.