Alluxio Enterprise Edition - Caching for data analytics

By: Alluxio, Inc.
Latest Version: 2.1.0-1.0

This version has been removed and is no longer available to new customers.

Product Overview

Alluxio is a data orchestration layer for the cloud. It lets compute frameworks work with data wherever it is stored, whether in an S3 data lake or a remote Hadoop environment. By caching data, it speeds up frameworks such as Apache Spark, Presto, Hive, and TensorFlow, and it supports hybrid cloud deployments in which the data is remote. Alluxio moves data closer to compute from wherever it is stored, across zones, regions, or countries, improving data locality and accessibility. Data orchestration is to data what container orchestration is to containers.

This Alluxio AMI works best with AWS EMR, where it caches metadata and data to improve the performance of the Spark, Presto, and Hive services. It can also be used to create a standalone Alluxio cluster.

Learn more about Alluxio Data Orchestration here:
Find tutorials for Alluxio on AWS here:




Operating System

Linux/Unix, Amazon Linux 2018_03

Delivery Methods

  • CloudFormation Template
