Overview

Tabsdata is a new foundation for enterprise data integration that replaces fragile pipelines with a declarative, publish/subscribe model for tables. Data producers can publish trusted datasets once, while consumers across analytics, research, and AI subscribe to them as governed, versioned tables. Every dataset carries built-in lineage and provenance, ensuring that data movement is transparent, explainable, and reproducible end to end.
By treating lineage as part of the architecture, Tabsdata unifies ingestion, transformation, and delivery into a single system of record. This eliminates redundant ETL jobs, lowers operational overhead, and reduces latency across platforms. Provenance metadata enables record-level traceability and rapid impact analysis, strengthening compliance and trust while accelerating delivery.
Tabsdata enables self-service at scale. Data engineers, analysts, and researchers can discover, subscribe, and enrich data through intuitive interfaces while governance and consistency remain automatic. The result is a leaner, faster, and more resilient data ecosystem ready for analytics, research, and AI innovation at enterprise scale.
Highlights
- Modern ETL without pipelines: Tabsdata replaces fragile ETL pipelines with a declarative, publish/subscribe model for tables, simplifying data integration and delivery across cloud and on-premises systems.
- Built-in lineage and provenance: Every dataset carries version history, lineage, and record-level provenance, ensuring transparent, auditable, and trusted data movement for analytics and AI.
- Unified ingestion, transformation, and delivery: A single system of record that eliminates redundant jobs, lowers operational overhead, and accelerates access to governed, AI-ready data.
Pricing
Vendor refund policy
Refunds are handled in accordance with AWS Marketplace policies. If you believe you were charged in error or experienced an issue preventing use of the product, contact us at support@tabsdata.com with your AWS account ID and purchase details. Requests will be reviewed and processed as permitted by AWS Marketplace guidelines.
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Tabsdata AMI Release Notes
What is included
- Preconfigured Tabsdata server on RHEL 9 (x86_64)
- Tabsdata CLI ready to use immediately after launch
- Automatic first-boot setup and Python virtual environment initialization
- Built-in HTTPS with generated self-signed certificates
- Automatic detection of public/private IPs and correct bind addresses
- Clean instance startup with port validation
- Interactive, guided setup flow for creating your first instance
- Support for running multiple Tabsdata instances on the same VM
Additional details
Usage instructions
Set Up Security Groups
Ensure the instance's security group has:
- An inbound rule for port 22 (SSH)
- Inbound rules for each port your Tabsdata server will run on
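The two inbound rules above could be added with the AWS CLI. The following is a minimal sketch, not an official procedure: the security group ID, the source CIDR, and the Tabsdata port are hypothetical placeholders, and the helper sg_ingress_cmd (a name made up for this example) only prints the commands so you can review them before running them yourself.

```shell
# Dry-run helper: prints the AWS CLI command for one inbound TCP rule
# instead of executing it. Arguments: group ID, port, source CIDR.
sg_ingress_cmd() {
  printf 'aws ec2 authorize-security-group-ingress --group-id %s --protocol tcp --port %s --cidr %s\n' "$1" "$2" "$3"
}

# Hypothetical placeholder values; replace them with your own.
SG_ID="sg-0123456789abcdef0"
MY_CIDR="203.0.113.10/32"
TD_PORT="8443"   # assumed example port, use the port you configure for Tabsdata

sg_ingress_cmd "$SG_ID" 22 "$MY_CIDR"          # SSH access
sg_ingress_cmd "$SG_ID" "$TD_PORT" "$MY_CIDR"  # Tabsdata server port
```

Restricting the source CIDR to your own address range, rather than 0.0.0.0/0, keeps SSH and the Tabsdata port from being exposed to the whole internet.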
Connect to the VM
Launch an EC2 instance from the AMI, then SSH into it as the default user:
ssh -i /path/to/your-key.pem ec2-user@<EC2_PUBLIC_DNS>
Switch to the tabsdata user
When you log in as ec2-user, you are prompted to switch to the preconfigured tabsdata user. Selecting Yes loads an environment that already includes the Tabsdata configuration, helper scripts, a ready-to-use Python virtual environment (tabsdata-env), and the tabsdata CLI on your PATH. To switch manually later:
sudo su - tabsdata
All steps below assume the tabsdata user.
Option A: Quickstart (recommended)
Run the interactive setup tool:
tabsdata init
This guides you through creating or starting an instance, selecting external and internal API sockets, and generating a self-signed HTTPS certificate. After setup, access the Tabsdata UI in your browser using the configured socket. If you are using HTTPS, run this once per new server socket:
td auth add-cert --server https://<PUBLIC_IP>:<PORT> --pem cert.pem
Option B: Manual setup
Activate a Python virtual environment
You may use the provided tabsdata-env or create your own.
Create or configure a Tabsdata instance
Example:
tdserver create --instance tabsdata
This creates ~/.tabsdata/instances/<INSTANCE_NAME>/, which contains the workspace and config files.
Configure ports
Edit:
~/.tabsdata/instances/<INSTANCE_NAME>/workspace/config/proc/regular/apiserver/config/config.yaml
Set your external and internal sockets:
addresses: 0.0.0.0:<external_port>
internal_addresses: 127.0.0.1:<internal_port>
The external IP address must be either 0.0.0.0 or the EC2 private IP address. Each Tabsdata instance must use unique internal and external ports.
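When several instances share one VM, it can help to check the planned ports for collisions before editing each config.yaml. The sketch below is only an illustration: find_duplicate_ports is a hypothetical helper, not part of the Tabsdata CLI, and the port numbers are made-up examples.

```shell
# Print every port that appears more than once in the argument list.
# Returns 1 if any duplicate is found, 0 if all ports are unique.
find_duplicate_ports() {
  dupes=$(printf '%s\n' "$@" | sort -n | uniq -d)
  if [ -n "$dupes" ]; then
    printf '%s\n' "$dupes"
    return 1
  fi
  return 0
}

# Example: external and internal ports planned for two instances
# (hypothetical values chosen for illustration).
find_duplicate_ports 8443 8444 9443 9444 && echo "all ports unique"
```

This only compares the numbers you pass in; it does not check whether another process on the VM is already listening on a port.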
Configure HTTPS
Follow Setting up Tabsdata over HTTPS (https://docs.tabsdata.com/latest/guide/security/main.html) to create a self-signed certificate. To use a CA-issued certificate, upload the CA certificate and key into config/ssl/ and point the config.yaml addresses to the certificate hostname.
Start the server
tdserver start
NOTE: Because of a known bug, tdserver stop fails to release the ports used by the apiserver, so you must kill the process manually. Find it with:
ps aux | grep apiserver
Then kill the process by its PID.
Alternatively, run tabsdata init, which sources a function called kill_legacy_process that automates this step.
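The manual cleanup can be narrowed to just the PIDs. This is a rough sketch under the assumption that the leftover process has "apiserver" in its command line, as the ps command above implies; apiserver_pids is a hypothetical helper name and is not the kill_legacy_process function shipped with the AMI.

```shell
# Read `ps aux`-style text on stdin and print the PID (column 2) of each
# line that mentions apiserver, skipping the grep/awk commands themselves.
apiserver_pids() {
  awk '/apiserver/ && !/grep|awk/ { print $2 }'
}

# Typical use on the VM (review the PIDs before killing anything):
#   ps aux | apiserver_pids
#   kill <PID>
```

Listing the PIDs first, instead of piping straight into kill, leaves room to confirm that each one belongs to the Tabsdata instance you intended to stop.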
Next steps
If using HTTPS, add the certificate for CLI access:
td auth add-cert --server https://<socket> --pem cert.pem
This must be done for every new socket your server runs on.
Log in through the CLI:
td login --server https://<socket> --user admin --password tabsdata --role sys_admin
After login you can create publishers, transformers, subscribers, and triggers. These can be configured through the Tabsdata UI or through the CLI. Once configured, workflows can be registered, executed, and monitored directly on the VM.
Support
Vendor support
Tabsdata provides community support for AWS Marketplace deployments. To get help, post your question on our community support channel on Slack: https://tabsdata-community.slack.com/
You can also reach our support line at support@tabsdata.com.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.