Amazon Aurora Documentation

Amazon Aurora is a relational database service that combines the speed and availability of high-end commercial databases with the simplicity and cost-effectiveness of open-source databases. Aurora is fully compatible with MySQL and PostgreSQL, allowing existing applications and tools to run without requiring modification.

Amazon Aurora Features: MySQL-Compatible Edition

Performance and Scalability

High Throughput

Amazon Aurora uses a variety of software and hardware techniques to confirm the database engine is able to fully use available compute, memory and networking. I/O operations use distributed systems techniques such as quorums to improve performance consistency.

Serverless Configuration

Amazon Aurora Serverless is an on-demand, auto-scaling configuration for Aurora that is designed to automatically start-up, shut down, and scale capacity up or down based on your application's needs. Aurora Serverless helps you to run your database in the cloud without managing any database instances. You can also use Aurora Serverless v2 instances along with provisioned instances in your existing or new database clusters.

Push-Button Compute Scaling

Using the Amazon Relational Database Service (Amazon RDS) APIs or the AWS Management Console to scale provisioned instances., powering your deployment up or down. Compute scaling operations typically complete in a few minutes.

Storage Auto-Scaling

Amazon Aurora is designed to automatically scale I/O to match the needs of your applications. It also increases the size of your database volume as your database storage needs grow. Your volume will grow in increments of 10 GB up to a maximum of 128 TiB. When using the Amazon Aurora I/O-Optimized configuration for your database clusters, Aurora is designed to provide cost savings.

Low-Latency Read Replicas

You can increase read throughput to support high-volume application requests by creating up to 15 database Amazon Aurora Replicas. Aurora Replicas share the same underlying storage as the source instance, which can help lower costs and help avoid the need to perform writes at the replica nodes. This helps free up more processing power to serve read requests and reduces the replica lag time. Aurora provides a reader endpoint so the application can connect without having to keep track of replicas as they are added and removed. It also supports auto-scaling, as it is designed to automatically add and remove replicas in response to changes in performance metrics that you specify.

Aurora supports cross-Region read replicas. Cross-Region replicas are designed to provide fast local reads to your users, and each region can have an additional 15 Aurora Replicas to further scale local reads.  

Custom Database Endpoints

Custom endpoints help you to distribute and load balance workloads across different sets of database instances. For example, you may provision a set of Aurora Replicas to use an instance type with higher memory capacity in order to run an analytics workload. A custom endpoint can then help you route the workload to these appropriately-configured instances, while keeping other instances isolated it.

Parallel Query

Amazon Aurora Parallel Query is designed to provide faster analytical queries compared to your current data. It can speed up queries, while maintaining high throughput for your core transaction workload. By pushing query processing down to the Aurora storage layer, it gains computing power while reducing network traffic. Use Parallel Query to run transactional and analytical workloads alongside each other in the same Aurora database.

Amazon DevOps Guru for RDS

Amazon DevOps Guru is a cloud operations service powered by machine learning (ML) designed to help improve application availability. With Amazon DevOps Guru for RDS, you can use ML-powered insights to help easily detect and diagnose performance-related relational database issues and is designed to resolve them in minutes rather than days. Developers and DevOps engineers can use DevOps Guru for RDS to automatically identify the root cause of performance issues and get intelligent recommendations to help address the issue, without needing help from database experts.

Availability and Durability

Instance Monitoring and Repair

Amazon RDS is designed to continuously monitor the health of your Amazon Aurora database and underlying Amazon Elastic Compute Cloud (Amazon EC2) instance. In the event of database failure, Amazon RDS will automatically restart the database and associated processes. Amazon Aurora is designed to not require crash recovery replay of database redo logs, which greatly reduce restart times. It also isolates the database buffer cache from database processes, which allows the cache to survive a database restart.

Multi-AZ deployments with Amazon Aurora Replicas

On instance failure, Amazon Aurora uses Amazon RDS Multi-AZ technology, which is designed to automate failover to one of up to 15 Amazon Aurora Replicas you have created in any three Availability Zones. If no Amazon Aurora Replicas have been provisioned, in the case of a failure, Amazon RDS is designed to automatically attempt to create a new Amazon Aurora DB instance for you automatically. You can also help minimize failover time by replacing community MySQL and PostgreSQL drivers with open source and drop-in compatible AWS JDBC Driver for MySQL and AWS JDBC Driver for PostgreSQL. You may also use RDS Proyx to reduce failover times and improve availability. If failovers occur, Amazon RDS Proxy is designed to route requests directly to the new database instance.

Global Database

For globally distributed applications you can use Aurora Global Database, where a single Aurora database can span multiple AWS Regions to enable fast local reads and disaster recovery. An Aurora Global Database uses storage-based replication to replicate a database across multiple Regions, with low latency. You can use a secondary Region as a backup option in case you need to recover quickly from a regional degradation or outage. A database in a secondary Region can be promoted to full read/write capabilities quickly.    

Fault-Tolerant and Self-Healing Storage

Amazon Aurora's database volume is segmented in 10 GiB chunks and replicated across three Availability Zones, with each Availability Zone persisting 2 copies of each write. Amazon Aurora storage is fault-tolerant, transparently handling the loss of up to two copies of data without affecting database write availability and up to three copies without affecting read availability. Amazon Aurora storage is also designed to be self-healing; data blocks and disks are continuously scanned for errors and replaced automatically.

Automatic, Continuous, Incremental Backups and Point-in-Time Restore

Amazon Aurora's backup capability is designed to enable point-in-time recovery for your instance. This allows you to restore your database to any second during your retention period, up to the last 5 minutes. Your automatic backup retention period can be configured up to 35 days. Automated backups are stored in Amazon Simple Storage Service (Amazon S3), which is designed for 99.999999999% durability. Amazon Aurora backups are designed to be automatic, incremental, and continuous and have no impact on database performance.

Database Snapshots

DB snapshots are user-initiated backups of your instance stored in Amazon S3 will be kept until you explicitly delete them. They leverage the automated incremental snapshots to reduce the time and storage required. You can create a new instance from a DB snapshot whenever you desire.

Backtrack

Backtrack is designed to let you quickly move a database to a prior point in time without needing to restore data from a backup. This lets you quickly recover from user errors, such as dropping the wrong table or deleting the wrong row. When you enable Backtrack, Aurora will retain data records for the specified Backtrack duration. For example, you could set up Backtrack to allow you to move your database up to 72 hours back. Backtrack is designed to complete in seconds, even for large databases, because no data records need to be copied. You can go backwards and forwards to find the point just before the error occurred.

Backtrack may also be useful for development and test, particularly in situations where your test deletes or otherwise invalidates the data. Simply backtrack to the original database state, and you're ready for another test run. You can create a script that calls Backtrack through an API and then runs the test, for simple integration into your test framework.

Security

Network Isolation

Amazon Aurora runs in Amazon Virtual Private Cloud (VPC), which helps you isolate your database in your own virtual network, and connect to your on-premises IT infrastructure using industry-standard encrypted IPsec VPNs. In addition, when using Amazon RDS, you can configure firewall settings and control network access to your DB instances.

Resource-Level Permissions

Aurora is integrated with AWS Identity and Access Management (IAM) and is designed to provide you the ability to control the actions that your AWS IAM users and groups can take on specific Aurora resources (for example., DB instances, DB snapshots, DB parameter Groups, DB event subscriptions, DB options groups). Also, you can tag your Aurora resources, and control the actions that your IAM users and groups can take on groups of resources that have the same tag (and tag value). 

Encryption

Amazon Aurora is designed to help you encrypt your databases using keys you create and control through AWS Key Management Service (KMS). On a database instance running with Amazon Aurora encryption, data stored at rest in the underlying storage is encrypted, as are the automated backups, snapshots, and replicas in the same cluster. Amazon Aurora uses SSL (AES-256) to secure data in transit.

Advanced Auditing

Amazon Aurora is designed to help you log database events with minimal impact on database performance. Logs can later be analyzed for database management, security, governance, regulatory compliance and other purposes. You can also monitor activity by sending audit logs to Amazon CloudWatch.

Threat Detection

Aurora is integrated with Amazon GuardDuty to help you identify potential threats to data stored in Aurora databases. GuardDuty RDS Protection is designed to profile and monitor login activity to existing and new databases in your account and uses tailored ML models to accurately detect suspicious logins to Aurora databases. If a potential threat is detected, GuardDuty is designed to generate a security finding that includes database details and contextual information on the suspicious activity. Aurora integration with GuardDuty is designed to give direct access to database event logs without requiring you to modify your databases and is designed not to have an impact on database performance.

Cost-effectiveness

Price predictability at scale

Aurora offers two configurations options based on your price-performance and price-predictability needs, regardless of the I/O consumption of your application. The two configuration options are Aurora I/O-Optimized and Aurora Standard. Both are designed to scale I/O to support your most demanding applications.

Aurora I/O-Optimized is a database cluster configuration. It delivers improved price performance for customers with I/O-intensive workloads with Aurora I/O optimized you pay for database instances and storage.

Aurora Standard is a database cluster configuration that offers cost-effective pricing for the vast majority of applications with low to moderate I/O usage. With Aurora Standard you pay for database instances, storage, and pay-per-request I/O.

Optimized I/O costs

Aurora is designed to eliminate unnecessary I/O operations to reduce costs and ensure resources are available for serving read/write traffic. Write I/O operations are only consumed when pushing transaction log records to the storage layer for the purpose of making write durable. Write I/O operations are counted in 4 KB units. Certain concurrent write operations can be batched together by the Aurora does not push modified database pages to the storage layer, resulting in further I/O consumption savings.

You are charged for read and write I/O operations when you configure your database clusters to the Aurora Standard configuration. You are not charged for read and write I/O operations when you configure you database clusters to Aurora I/O-Optimized.

Managed

Use

Just launch a new Amazon Aurora DB instance using the Amazon RDS Management Console or a single API call or CLI. Amazon Aurora DB instances are pre-configured with parameters and settings appropriate for the DB instance class you have selected. You can launch a DB instance and connect your application within minutes without additional configuration. DB parameter groups provide granular control and fine-tuning of your database.

Monitoring and Metrics

Amazon Aurora provides Amazon CloudWatch metrics for your DB instances at no additional charge. You can use the AWS Management Console to view over 20 key operational metrics for your database instances, including compute, memory, storage, query throughput, cache hit ratio, and active connections. In addition, you can use Enhanced Monitoring to gather metrics from the operating system instance that your database runs on. You can use Amazon RDS Performance Insights, a database monitoring tool that enables you to detect database performance problems and take corrective action, with a dashboard that visualizes database load. Finally, you also can use Amazon DevOps Guru for RDS to detect performance issues, automatically identify the root cause of performance issues, and get recommendations to help address the issue without needing help from database experts.

Amazon RDS Blue/Green Deployments

Amazon RDS Blue/Green Deployments is designed to allow you to make safer, simpler, and faster database updates with zero data loss on Amazon Aurora MySQL-Compatible Edition. Blue/Green Deployments can create a staging environment that mirrors the production environment and is designed to keep the two environments in sync using logical replication. Blue/Green Deployments is designed so you can make changes--such as major/minor version upgrades, schema modifications, and parameter setting changes--without impacting your production workload.

When promoting your staging environment, Blue/Green Deployments blocks writes to both the blue and green environments until switchover is complete. Blue/Green Deployments uses built-in switchover guardrails that time out promotion if it exceeds your maximum tolerable downtime, detects replication errors, checks instance health, and more.

Automatic Software Patching

Amazon Aurora is designed to keep your database up-to-date with the latest available patches. You can control if and when your instance is patched through DB Engine Version Management. Aurora uses zero-downtime patching when possible: if a suitable time window appears, the instance is updated in place, application sessions are preserved and the database engine restarts while the patch is in progress, leading to only a transient drop in throughput.

DB Event Notifications

Amazon Aurora can notify you by email or SMS of important database events such as an automated failover. You can use the AWS Management Console or the Amazon RDS APIs to subscribe to over 40 different DB events associated with your Amazon Aurora databases.

Fast Database Cloning

Amazon Aurora is designed to support cloning operations, where entire multi-terabyte database clusters can be cloned quickly. Cloning is useful for a number of purposes including application development, testing, database updates, and running analytical queries. Immediate availability of data can accelerate your software development and upgrade projects, and make analytics more accurate.

Database Start/Stop

You can manually stop and start an Amazon Aurora database. This makes it easier and more affordable to use Aurora for development and test purposes, where the database is not required to be running all of the time. Amazon Aurora is designed so that stopping your database doesn't delete your data.

Migration Support

MySQL Database Migrations

Standard MySQL import and export tools work with Amazon Aurora. You can also create a new Amazon Aurora database from an Amazon RDS for MySQL DB Snapshot. Migration operations based on DB snapshots typically complete in under an hour, but will vary based on the amount and format of data being migrated.

Alternatively, AWS Database Migration Service (AWS DMS) offers built-in native tooling from within the DMS Console for a seamless migration. With no replication instances to provision or scale, you can initiate a database migration with a few simple clicks.

You can also set up binlog-based replication between an Aurora MySQL database and an external MySQL database running inside or outside of AWS.

PostgreSQL Database Migrations

Standard PostgreSQL import and export tools work with Amazon Aurora, including pg_dump and pg_restore. Amazon Aurora also supports snapshot import from Amazon RDS for PostgreSQL, and replication with AWS Database Migration Service (AWS DMS).

Commercial Database Migrations

Amazon Aurora provides an ideal environment for moving database workloads off of commercial databases. Aurora is designed to provide functional capabilities which are a close match to those of commercial database engines, and delivers the enterprise-grade performance, durability, and high availability required by most enterprise database workloads. AWS Database Migration Service (AWS DMS) can help accelerate database migrations to Amazon Aurora, with managed features like DMS Schema Conversion and DMS Serverless. DMS Schema Conversion is designed to automatically assess and convert schemas and source objects to be compatible with the target Aurora cluster. Meanwhile, DMS Serverless is designed to automate provisioning, monitoring, and scaling of migration resources.

Babelfish for Aurora PostgreSQL

Babelfish for Aurora PostgreSQL is a capability for Amazon Aurora PostgreSQL-Compatible Edition that is designed to enable Aurora to understand commands from applications written for Microsoft SQL Server. With Babelfish, Aurora PostgreSQL is designed to understand T-SQL, Microsoft SQL Server's proprietary SQL dialect, and supports the same communications protocol, so your apps that were originally written for SQL Server can now work with Aurora with fewer code changes. As a result, the effort required to modify and move applications running on SQL Server 2005 or newer to Aurora is reduced, leading to faster, lower-risk, and more cost-effective migrations. Babelfish is a built-in capability of Amazon Aurora.

Developer Productivity

Trusted Language Extensions for PostgreSQL

Trusted Language Extensions (TLE) for PostgreSQL is a development kit and open-source project designed to allow you to quickly build high performance extensions and safely run them on Amazon Aurora without needing AWS to certify code. Developers can use popular trusted languages--like JavaScript, PL/pgSQL, Perl, and SQL--to safely write extensions. TLE is designed to prevent access to unsafe resources and limits extension defects to a single database connection. Amazon Aurora is designed so that DBAs have fine-grained, online control over who can install extensions and can create a permissions model for running them.

Machine Learning

Aurora offers machine learning capabilities directly from the database, enabling you to add ML-based predictions to your applications through the familiar SQL programming language. With integration between Aurora and AWS machine learning services, you have access to a selection of ML algorithms without having to build custom integrations or move data around. 

RDS Proxy Support

Aurora works in conjunction with Amazon RDS Proxy, a managed, highly available database proxy that is designed to make applications more scalable, more resilient to database failures, and more secure. RDS Proxy allows applications to pool and share connections established with the database, improving database efficiency and application scalability. It can reduce failover times by automatically connecting to a new database instance while preserving application connections. It can also enhance security through integrations with AWS IAM and AWS Secrets Manager.

Generative AI

Aurora offers capabilities to enable machine learning (ML) and generative artificial intelligence (AI) models to work with data stored in Aurora in real-time and without moving the data. With Amazon Aurora PostgreSQL-Compatible Edition, you can access vector database capabilities to store, search, index, and query ML embeddings.

A vector embedding is a numerical representation that represents the semantic meaning of content such as text, images, and video. Generative AI and other AI/ML systems use embeddings to capture the semantic meaning of this content input into a large language model (LLM). You can store embeddings from ML and AI models, such as those from Amazon Bedrock (limited preview), and Amazon SageMaker in your Aurora PostgreSQL database.

Aurora machine learning also simplifies adding generative AI models predictions and embeddings to your Aurora database. Aurora ML is designed to expose ML models as SQL functions, allowing you to use standard SQL to call ML models, pass data to them, and return predictions or embeddings to your database real-time via periodic calls to the SageMaker model which returns the latest, up-to-date embeddings.

Additional Information

For additional information about service controls, security features and functionalities, including, as applicable, information about storing, retrieving, modifying, restricting, and deleting data, please see https://docs.aws.amazon.com/index.html. This additional information does not form part of the Documentation for purposes of the AWS Customer Agreement available at http://aws.amazon.com/agreement, or other agreement between you and AWS governing your use of AWS’s services.