Overview
What is Alluxio
Alluxio is a distributed data orchestration system that brings your data closer to your compute frameworks. It acts as a caching layer between your persistent storage (like Amazon S3, HDFS, or Azure Blob Storage) and your computation frameworks (like Spark, Presto, and PyTorch).
By caching frequently accessed data on the compute cluster, Alluxio dramatically speeds up data access, reduces network congestion, and eliminates I/O bottlenecks, which is especially critical for data-intensive applications like AI/ML training and large-scale data analytics.
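The caching idea described above can be illustrated with a toy read-through cache. This is only a conceptual sketch, not Alluxio's implementation or API: the `ReadThroughCache` class and the `fake_object_store` function are hypothetical names invented for the illustration, and an in-memory dictionary stands in for cache storage on the compute cluster.

```python
from typing import Callable, Dict


class ReadThroughCache:
    """Toy read-through cache: serve from local memory, fall back to remote storage."""

    def __init__(self, fetch_remote: Callable[[str], bytes]) -> None:
        self._fetch_remote = fetch_remote   # stands in for S3/HDFS/Blob reads
        self._local: Dict[str, bytes] = {}  # stands in for cache space near compute
        self.remote_reads = 0

    def read(self, path: str) -> bytes:
        if path not in self._local:
            # Cache miss: one trip to the slow persistent store.
            self._local[path] = self._fetch_remote(path)
            self.remote_reads += 1
        return self._local[path]


def fake_object_store(path: str) -> bytes:
    # Hypothetical backing store used only for this illustration.
    return f"contents of {path}".encode()


cache = ReadThroughCache(fake_object_store)
for _ in range(3):
    data = cache.read("s3://bucket/train/part-0000")
print(cache.remote_reads)  # prints 1
```

Three reads of the same path reach the backing store only once, which is the effect that reduces repeated remote fetches for frequently accessed data.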
Why Use Alluxio?
You should consider using Alluxio if you are experiencing any of the following challenges:
- Slow AI/ML Training: Your expensive GPUs are often idle, waiting for data to be fetched from slow object stores, leading to long training times and high costs.
- Slow Cold Starts When Deploying Models: When you deploy new models for inference, the initial requests are slow because the model must first be downloaded from a remote object store. This "cold start" problem leads to a poor user experience and can become a bottleneck for autoscaling.
- Data Silos: Your data is spread across multiple data centers or cloud providers, and you need a unified way to access it without complex data migration.
- High Egress Costs: You are paying high fees to your cloud provider for repeatedly reading the same data from object storage.
Overview
This offering deploys the AlluxioCluster Custom Resource (CR) to automatically provision, configure, and manage an Alluxio Enterprise data orchestration system on Azure Kubernetes Service (AKS). Designed for AI and analytics workloads, Alluxio sits between compute frameworks and the underlying storage to provide high-performance data access.
IMPORTANT PREREQUISITE: ALLUXIO OPERATOR REQUIRED
You MUST install the Alluxio Operator extension on your target AKS cluster before deploying this offer. This deployment creates an AlluxioCluster custom resource, which relies entirely on the Alluxio Operator (alluxio.alluxio-operator-extensions) to reconcile its state and spin up the underlying pods. If the Operator is not detected on your cluster, this deployment will fail.
If you haven't installed the Operator yet, please search for "Alluxio Operator" in the Azure Marketplace and deploy it first.
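One way to confirm the prerequisite before deploying is to check that the cluster already knows the AlluxioCluster resource type. The sketch below parses the output of `kubectl get crd -o name`. The CRD name in `ASSUMED_CRD_NAME` is an assumption made for illustration and is not stated in this listing, so confirm the real name on a cluster where the Operator is installed.

```python
import subprocess

# Assumption: this CRD name is illustrative only. Confirm the real name with
# `kubectl get crd` on a cluster where the Operator is installed.
ASSUMED_CRD_NAME = "alluxioclusters.k8s-operator.alluxio.com"


def crd_present(kubectl_output: str, crd_name: str = ASSUMED_CRD_NAME) -> bool:
    """Return True if `kubectl get crd -o name` output lists crd_name."""
    wanted = f"customresourcedefinition.apiextensions.k8s.io/{crd_name}"
    return any(line.strip() == wanted for line in kubectl_output.splitlines())


def operator_crd_installed() -> bool:
    """Query the current kubectl context; requires kubectl on PATH."""
    result = subprocess.run(
        ["kubectl", "get", "crd", "-o", "name"],
        capture_output=True, text=True, check=True,
    )
    return crd_present(result.stdout)
```

Calling `operator_crd_installed()` against the target cluster and stopping when it returns False avoids a deployment that would fail because the Operator is missing. A present CRD does not prove the Operator pods are healthy, so this is a first check rather than a full readiness test.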
Key Features & Components Managed by this CRD:
- Automated Provisioning: Instantiates the complete Alluxio topology, including the Coordinator (Master), Distributed Workers, and CSI/FUSE daemonsets for seamless POSIX-like data access.
- Built-in High Availability: Automatically configures and integrates an embedded etcd cluster for Coordinator high availability and state management.
- Flexible Sizing Profiles: Choose from pre-defined "T-Shirt Sizing" deployment profiles (e.g., Standard) for quick setup, or select "Custom" to unlock advanced configuration.
- License Management: Securely injects and manages your Alluxio Enterprise License string through Azure's protected configuration settings.
Highlights
- Accelerating Performance: By caching data, Alluxio can improve I/O performance by over 10x for both model training and deployment.
- Providing Seamless Data Access: Alluxio provides standard APIs like POSIX (FUSE), S3, and FSSpec, allowing your applications to connect to your data without any code changes.
- Enabling High Scalability: The distributed architecture can scale to handle billions of objects and thousands of clients.
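To make the "no code changes" point about POSIX (FUSE) access concrete, the sketch below reads a file with ordinary Python file I/O. Nothing in it is Alluxio-specific: the assumption is that data exposed through a FUSE mount appears as a regular directory, so the same function would work on a path under the mount. A temporary directory stands in for a hypothetical mount point such as /mnt/alluxio so the example runs anywhere.

```python
import os
import tempfile


def count_bytes(path: str, chunk_size: int = 1 << 20) -> int:
    """Read a file in chunks with ordinary file I/O and return its size."""
    total = 0
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            total += len(chunk)
    return total


# Stand-in for a FUSE mount point (hypothetical, e.g. /mnt/alluxio);
# a temporary directory is used here so the sketch runs anywhere.
mount_point = tempfile.mkdtemp()
sample = os.path.join(mount_point, "sample.bin")
with open(sample, "wb") as f:
    f.write(b"x" * 4096)

print(count_bytes(sample))  # prints 4096
```

Switching an application from local disk to a mounted path would then be a matter of changing the path string, not the reading code.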
Pricing
Vendor refund policy
We do not offer refunds for this product. Cancel at any time.
Delivery details
Alluxio Cluster delivery
- Amazon EKS
- Amazon EKS Anywhere
Helm chart
Helm charts are Kubernetes YAML manifests combined into a single package that can be installed on Kubernetes clusters. The containerized application is deployed on a cluster by running a single Helm install command to install the seller-provided Helm chart.
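The single install command mentioned above can be assembled programmatically, which helps when deployments are scripted. This is a minimal sketch: the release name, chart path, values file, and namespace are placeholders, not names taken from this listing. The flags used (`--values`, `--namespace`, `--create-namespace`) are standard `helm install` options.

```python
import shlex
from typing import List


def helm_install_command(release: str, chart: str, values_file: str,
                         namespace: str) -> List[str]:
    """Build the argument list for a single `helm install` invocation."""
    return [
        "helm", "install", release, chart,
        "--values", values_file,
        "--namespace", namespace,
        "--create-namespace",
    ]


# All four arguments are placeholders, not names taken from this listing.
cmd = helm_install_command("alluxio", "./alluxio-cluster-chart", "values.yaml", "alluxio")
print(shlex.join(cmd))
```

Passing the list to `subprocess.run(cmd, check=True)` would run the install without shell quoting issues. Building the command as a list rather than a string keeps each argument separate.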
Version release notes
Alluxio Enterprise Cluster CRD definition, wrapped by Helm.
Additional details
Usage instructions
This Helm chart can be deployed on top of EKS. See the Alluxio documentation for more details.
After you run the "helm install" command, Helm prints instructions for using the solution. You can run the "helm status" command to print those instructions again later.
Follow these steps:
- Contract with Alluxio to obtain the License and dedicated Docker images.
- Ensure the Alluxio Operator is successfully installed and running on your AKS cluster.
- Carefully configure values.yaml, including the image tag and license.
- Use the "helm install" command to deploy the cluster.
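Because a missing image tag or license is an easy mistake in the values.yaml step, a small pre-flight check can catch it before the chart is installed. The sketch below works on values already loaded into a Python dictionary. The key names `imageTag` and `license` are hypothetical and chosen only for illustration, so use the keys documented in the chart's own values.yaml.

```python
from typing import Any, Dict, List

# Assumption: these key names are hypothetical; use the keys documented in the
# chart's own values.yaml.
REQUIRED_KEYS = ["imageTag", "license"]


def missing_values(values: Dict[str, Any],
                   required: List[str] = REQUIRED_KEYS) -> List[str]:
    """Return the required keys that are absent or blank in the values mapping."""
    missing = []
    for key in required:
        value = values.get(key)
        if value is None or (isinstance(value, str) and not value.strip()):
            missing.append(key)
    return missing


draft = {"imageTag": "", "license": "PLACEHOLDER-LICENSE-STRING"}
print(missing_values(draft))  # prints ['imageTag']
```

An empty result means every required key has a non-blank value. It does not validate that the license or image tag is actually correct.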
Support
Vendor support
Email: customer-success@alluxio.com
Contacting Technical Support: The Designated Contact may create support tickets during business hours and reach out to technical support via the Alluxio support service system to request information regarding the use, configuration, or operation of Alluxio products running on any supported cluster. Technical support services are strictly limited to resolving issues related to Alluxio products.
Technical support services include reviewing the Alluxio support system and responding to inquiries related to the following matters:
(a) Best practices for setting up and configuring clusters to run Alluxio, including: hardware and operating system selection, Alluxio configuration within the cluster, and practices for the installation, deployment, and upgrading of Alluxio.
(b) Operational support for clusters running Alluxio products, including: best practices for loading data into the cluster, as well as tools and techniques for monitoring supported clusters.
(c) Issue diagnosis and resolution, including: failure prevention and remediation, troubleshooting, error diagnosis, and the provision of patches and workarounds to fix Alluxio product errors.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.