Overview
X-ScaleAI provides an optimized and integrated software stack for high-performance distributed pre-training, fine-tuning, and inference. We support any model defined in PyTorch or HuggingFace, including Large Language Models such as Llama-3, OLMo, Pythia, and BERT and Vision Models such as ResNet, U-Net, ViT and Stable Diffusion. You can also define your own model and your own dataset. Leave the headache of scaling up your AI workloads to us, and focus on your domain-specific strengths.
The end-to-end optimized software stack in X-ScaleAI provides out-of-the-box optimal performance for distributed AI workloads. It has a proprietary MVAPICH MPI implementation that has been tuned for the Elastic Fabric Adapter (EFA) instances on AWS. It comes with a very simple one-command launcher, xscale-ai-run, that significantly simplifies launching a distributed training workload on a AWS parallel cluster. No more complex commands and suboptimal performance.
X-ScaleAI also provides an easy-to-use API to easily scale up your AI workloads. The API automatically applies various optimizations pertaining to distributed data loading and training. It provides scalable model checkpoint and restart support for long-running training and fine-tuning applications. Use X-ScaleAI and save time and effort involved in getting a distributed AI set up and optimized. Reduce your carbon footprint and time to solution by using our optimized stacks.
Highlights
- Leave the headache of scaling up your AI workloads to us, and focus on your domain-specific strengths.
- Use X-ScaleAI and save time and effort involved in getting a distributed AI setup in place and scaling up your AI workloads.
- Reduce your carbon footprint and time to solution by using our optimized stacks.
Details
Typical total price
$1.202/hour
Pricing
Free trial
Instance type | Product cost/hour | EC2 cost/hour | Total/hour |
---|---|---|---|
g4dn.xlarge | $0.32 | $0.526 | $0.846 |
g4dn.2xlarge Recommended | $0.45 | $0.752 | $1.202 |
g4dn.4xlarge | $0.72 | $1.204 | $1.924 |
g4dn.8xlarge | $1.31 | $2.176 | $3.486 |
g4dn.12xlarge | $2.35 | $3.912 | $6.262 |
g4dn.16xlarge | $2.61 | $4.352 | $6.962 |
g4dn.metal | $4.69 | $7.824 | $12.514 |
Additional AWS infrastructure costs
Type | Cost |
---|---|
EBS General Purpose SSD (gp3) volumes | $0.08/per GB/month of provisioned storage |
Vendor refund policy
We do not currently support refunds, but you can cancel at any time.
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
- Optimized and integrated software stack for high-performance distributed inference.
- Several pre optimized models ready for inference (Llama2-7b, Llama3-8b, Mixtral)
- xscale-ai-inf tool to convert, build, and launch inference jobs
Additional details
Usage instructions
To use the X-ScaleAI AMI, you must first launch an instance. Our AMI supports standard instances and AWS ParallelCluster. After launching your instance, you can connect via SSH using ubuntu as the username.
Once you have connected to the instance, we provide a user guide in the /home/ubuntu/xscale_ai directory. This guide will provide the necessary instructions on how to run applications using xscale-ai-run and how to optimize your applications using the X-ScaleAI software stack and our custom API.
If you do have more questions, please reach out to us at contactus@x-scalesolutions.com .
Support
Vendor support
Our support email id is contactus@x-scalesolutions.com . Please let us know if you have any questions. There is also a user guide (~/x-scaleai/userguide.md) in the AMI.
We also offer custom support contracts to assist you in achieving the best performance and scaling of your distributed AI workloads on AWS and also on other cloud providers and even on-premise clusters. Please email us at contactus@x-scalesolutions.com , if you are interested. We would be happy to work with you.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.