reference deployment

DataStax Enterprise on AWS

Always-on data platform for cloud applications, powered by Apache Cassandra

This Partner Solution deploys DataStax Enterprise (DSE) to the Amazon Web Services (AWS) Cloud configuration of your choice.

DSE is the always-on data platform for cloud applications that is powered by Apache Cassandra. DSE is designed to handle big data workloads across multiple nodes with no single point of failure, by employing a peer-to-peer distributed system for data across homogeneous nodes. Integrated within each node of DSE is powerful indexing, search, analytics, and graph functionality, provided by combining Cassandra with Apache Solr, Apache Spark, and DSE Graph.

The Partner Solution includes AWS CloudFormation templates and a guide that provides step-by-step instructions to help you get the most out of your deployment.

DataStax logo

This Partner Solution was developed by DataStax in collaboration with AWS. DataStax is an AWS Partner.

  •  What you'll build
  • Use this Partner Solution to set up the following configurable environment on AWS:

    • A highly available architecture that spans three Availability Zones.*
    • A virtual private cloud (VPC) configured with public and private subnets according to AWS best practices. This provides the network infrastructure for your DSE deployment.*
    • An internet gateway to provide access to the internet.*
    • Managed NAT gateways to allow outbound internet access for resources in the private subnets.*
    • One EC2 instance running DSE OpsCenter, which is the DSE cluster management web console.
    • Additional EC2 instances for DSE data centers and nodes. (A DSE data center is a logical grouping of nodes for workload separation.) By default, the Partner Solution creates a single data center with three nodes, but you can choose up to four data centers with any number of nodes.
    • One Amazon Elastic Block Store (Amazon EBS) data volume per node instance deployed.

    * The template that deploys the Partner Solution into an existing VPC skips the tasks marked by asterisks and prompts you for your existing VPC configuration.

  •  How to deploy
  • To deploy DataStax Enterprise, follow the instructions in the deployment guide. The deployment process includes these steps:

    1. If you don't already have an AWS account, sign up at https://aws.amazon.com.
    2. Sign up for a DataStax Academy account.
    3. Launch the Partner Solution. The stack takes about 30 minutes to launch. You can choose from two options:
    4. Test the deployment.
    5. Add a backup location. You can back up data on the instances themselves or on Amazon S3.

    To customize your deployment, you can choose to create up to four data centers, and specify the names, instance types, nodes, and volume sizes for each data center.

    Amazon may share user-deployment information with the AWS Partner that collaborated with AWS on this solution.  

  •  Costs and licenses
  • This Partner Solution includes a limited no-fee license from DataStax. As described in section 1.4 of DataStax Enterprise Terms, the limited license is for development or non-production use. To use the Partner Solution in a production environment, you must acquire a DataStax subscription license. For more information about licensing for production or DataStax Managed Cloud on AWS, contact sales@datastax.com.

    You are responsible for the cost of the AWS services and any third-party licenses used while running this solution. There is no additional cost for using the solution.

    This solution includes configuration parameters that you can customize. Some of these settings, such as instance type, affect the cost of deployment. For cost estimates, refer to the pricing pages for each AWS service you use. Prices are subject to change.

    Tip: After you deploy a solution, create AWS Cost and Usage Reports to track associated costs. These reports deliver billing metrics to an Amazon Simple Storage Service (Amazon S3) bucket in your account. They provide cost estimates based on usage throughout each month and aggregate the data at the end of the month. For more information, refer to What are AWS Cost and Usage Reports?