
Overview
SDXL Beta 0.9 offers a preview to the largest open source image model from Stability AI. SDXL produces high resolution images at native 1024px resolution. With SDXL Beta 0.9, Stability introduces a brand new 2-stage architecture for image generative AI models, and integrates the largest CLIP model. The result: stunning quality in image generation with minimal need for prompt engineering.
Highlights
- The foundation model for images: All text to image, image to image, inpainting, and outpainting workflows are handled by the official SDXL model. SDXL introduces a new SOTA architecture for image generation, comprising a 3.5B parameter base model stage and a 6.6B parameter ensemble pipeline.
- Unprecedented quality and ease of use: Native 1024x1024 image generation with cinematic photorealism and fine detail. The most advanced text generation within images. Fine-tuned to create complex compositions with basic natural language prompting, thanks to the largest CLIP model in production.
- Beta: SDXL 0.9 is a beta version of SDXL 1.0. While it is a powerful image generation model, for increased professionalism in image output, consider using SDXL 1.0
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost |
|---|---|---|
ml.g4dn.12xlarge Inference (Batch) Recommended | Model inference on the ml.g4dn.12xlarge instance type, batch mode | $0.00/host/hour |
ml.g5.2xlarge Inference (Real-Time) Recommended | Model inference on the ml.g5.2xlarge instance type, real-time mode | $0.00/host/hour |
ml.p4de.24xlarge Inference (Real-Time) | Model inference on the ml.p4de.24xlarge instance type, real-time mode | $0.00/host/hour |
ml.p4d.24xlarge Inference (Real-Time) | Model inference on the ml.p4d.24xlarge instance type, real-time mode | $0.00/host/hour |
inference.count.m.i.c Inference Pricing | inference.count.m.i.c Inference Pricing | $0.00/request |
Vendor refund policy
n/a
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
Version release notes
This release resolves a bug with mask image inputs.
Additional details
Inputs
- Summary
This model accepts JSON input aligned with the Stability REST APIÂ as well as protocol buffers aligned with the Stability GRPC APIÂ .
- Limitations for input type
- Input images must be base64 encoded in JSON or supplied via protobuf.
- Input MIME type
- application/json, application/x-protobuf
Input data descriptions
The following table describes supported input data fields for real-time inference and batch transform.
Field name | Description | Constraints | Required |
|---|---|---|---|
height | The height of the image in pixels. Must be a multiple of 64.
On p4d/p4de instances, resolutions values are unlocked and only limited by GPU memory.
On g5 instances, optimized native resolutions are limited to:
1024x1024
1152x896 / 896x1152
1216x832 / 832x1216
1344x768 / 768x1344
1536x640 / 640x1536
Lower resolutions than native will be cropped. Higher resolutions will result in an error. | Default value: 1024
Type: Integer | No |
width | The width of the image in pixels. Must be a multiple of 64.
On p4d/p4de instances, resolutions values are unlocked and only limited by GPU memory.
On g5 instances, optimized native resolutions are limited to:
1024x1024
1152x896 / 896x1152
1216x832 / 832x1216
1344x768 / 768x1344
1536x640 / 640x1536
Lower resolutions than native will be cropped. Higher resolutions will result in an error. | Default value: 1024
Type: Integer | No |
text_prompts | An array of text prompts to use for generation.
Given a text prompt with the text A lighthouse on a cliff and a weight of 0.5, it would be represented as:
"text_prompts": [
{
"text": "A lighthouse on a cliff",
"weight": 0.5
}
] | Type: FreeText
Limitations: Structured JSON array of prompts. | Yes |
cfg_scale | How strictly the diffusion process adheres to the prompt text (higher values keep your image closer to your prompt) | Default value: 7
Type: Continuous
Minimum: 0
Maximum: 35 | No |
sampler | Which sampler to use for the diffusion process. | Default value: auto
Type: Categorical
Allowed values: DDIM,DDPM,K_DPMPP_SDE,K_DPMPP_2M,K_DPMPP_2S_ANCESTRAL,K_DPM_2,K_DPM_2_ANCESTRAL,K_EULER,K_EULER_ANCESTRAL,K_HEUN,K_LMS | No |
seed | Random noise seed (omit this option or use 0 for a random seed) | Default value: 0
Type: Integer | No |
style_preset | Pass in a style preset to guide the image model towards a particular style. This list of style presets is subject to change. | Default value: none
Type: Categorical
Allowed values: enhance,anime,photographic,digital-art,comic-book,fantasy-art,line-art,analog-film,neon-punk,isometric,low-poly,origami,modeling-compound,cinematic,3d-model,pixel-art,tile-texture | No |
steps | Number of diffusion steps to run. | Default value: 30
Type: Integer
Minimum: 10
Maximum: 150 | No |
Resources
Vendor resources
Support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products


