reference deployment

Apache Superset on AWS

Explore and visualize data, from line charts to detailed geospatial charts

This Partner Solution deploys the Apache Superset reference architecture on the AWS Cloud. Apache Superset is open-source software that can help you with data exploration and visualization. This deployment supports many database management systems to store data that can be visualized through Apache Superset. Examples of supported database systems include Amazon Athena, Amazon Redshift, ClickHouse, MySQL, and PostgreSQL.

AWS logo

This Partner Solution was developed by Apache in collaboration with AWS.

  •  What you'll build
  • This Partner Solution sets up the following:

    • A highly available architecture that spans two Availability Zones.*
    • A virtual private cloud (VPC) configured with public and private subnets, according to AWS best practices, to provide you with your own virtual network on AWS.*
    • A managed internet gateway to direct inbound traffic to an Application Load Balancer, which manages traffic to the AWS Fargate cluster.
    • In the public subnets, managed network address translation (NAT) gateways to provide outbound internet access for resources in the private subnets.*
    • In the private subnets:
      • An Amazon Elastic Container Service (Amazon ECS) cluster using Fargate to provide all Superset functions, including the core system, cache, database, message queue, and frontend.
      • Amazon Elastic File System (Amazon EFS) to provide metadata, query cached-data persistency, and share service modules.
      • AWS Cloud Map, a discovery service for application resources.
    • Supported database systems, such as Amazon Athena, Amazon Redshift, ClickHouse, MySQL, and PostgreSQL.

    *  The template that deploys the Partner Solution into an existing VPC skips the components marked by asterisks and prompts you for your existing VPC configuration.

  •  How to deploy
  • To deploy Apache Superset on AWS, follow the instructions in the deployment guide. The deployment process takes about 30 minutes and includes these steps:

    1. Sign in to your AWS account. If you don't have an account, sign up at https://aws.amazon.com.
    2. Launch the Partner Solution. Before you create the stack, choose the AWS Region from the top toolbar. You can choose from two options:
    3. Test your deployment.

    Amazon may share user-deployment information with the AWS Partner that collaborated with AWS on this solution.  

  •  Costs and licenses
  • You are responsible for the cost of the AWS services and any third-party licenses used while running this Partner Solution reference deployment. There is no additional cost for using the Partner Solution.

    The AWS CloudFormation templates for Partner Solutions include configuration parameters that you can customize. Some of the settings, such as the instance type, affect the cost of deployment. For cost estimates, refer to the pricing pages for each AWS service you use. Prices are subject to change.

    Tip: After you deploy the Partner Solution, create AWS Cost and Usage Reports to track costs associated with the Partner Solution. These reports deliver billing metrics to an Amazon S3 bucket in your account. They provide cost estimates based on usage throughout each month and aggregate the data at the end of the month. For more information about the report, refer to  What are AWS Cost and Usage Reports?