Amazon Neptune

Amazon Neptune FAQs

Get started with Neptune

Connect with an AWS specialist

General
7
Neptune Database
46
Neptune Analytics
13
Neptune ML
3

General

Open all

Amazon Neptune is a serverless graph database for connected data, improves the accuracy of AI applications, and lowers operational burden and costs. Neptune instantly scales graph workloads removing the need to manage capacity. By modeling data as a graph, Neptune captures context that improves accuracy and explainability of generative AI applications. To make AI application development easier, Neptune offers fully managed GraphRAG with Amazon Bedrock Knowledge Bases, and integrations with Strands AI Agents SDK and popular agentic memory tools. It also easily analyzes tens of billions of relationships across structured and unstructured data within seconds delivering strategic insights. Neptune is the only database that gives you the power of connected data with the enterprise capabilities and value of AWS.

Neptune Database is a fully managed graph database with a full set of enterprise features such as high availability (up to 99.99%), multi-Region for improved disaster recovery and increased resiliency, dynamic scaling with serverless, and native integrations with other AWS services. You can search and query billions of relationships in milliseconds across thousands of concurrent queries.

Neptune automatically scales storage, growing storage and rebalancing I/O operations to provide consistent performance without the need for overprovisioning. Its storage is fault-tolerant and self-healing, and disk failures are repaired in the background without loss of database availability. Neptune also automatically detects database crashes and restarts without the need for crash recovery or to rebuild the database cache. If the entire instance fails, Neptune will automatically fail over to one of up to 15 read replicas.

With Neptune, you can use popular graph query languages such as Apache TinkerPop Gremlin, SPARQL, and openCypher to run powerful queries that are easy to write and perform well on connected data. This significantly reduces code complexity, and allows you to more quickly create applications that process relationships. You can quickly launch a Neptune database instance with a few steps in the Neptune console.

Neptune Analytics is an analytics database engine that supports graph analytics, graph algorithms, and vector search of graph data stored in Amazon S3 or Neptune database. You can analyze tens of billions of relationships in seconds. With Neptune Analytics, you can load data from an existing Neptune database or S3 with a few simple API calls and satisfy the most demanding graph analytic workloads. When you select an existing Neptune database as the data source, the data will be automatically loaded into Neptune Analytics. Alternatively, you can also choose to have Neptune Analytics load graph data directly from S3 using CSV files in common graph export formats.

Neptune ML is an integration between Neptune Database and Amazon SageMaker. It trains graph neural networks (GNNs) to make fast and more accurate predictions using your graph data. Neptune ML supports real-time predictions on nodes, edges, and properties (entities) that were added to the graph after the ML model training process, giving you predictions on new data without needing to retrain your ML models each time.

Neptune Database, with Neptune developer tools, are the right choice for building mission-critical systems at large scale. Systems such as product recommendation engines, identity and access management systems, and compliance systems often require geographically distributed capabilities that are available in Neptune Global Database. Neptune Database stores tens of billions of relationships and can process hundreds of thousands of interactive graph queries per second.

Neptune Analytics, with Neptune notebooks, are the right choice for interacting with data to derive insights. These capabilities empower users to interact with data using familiar tools, such as Pandas, Jupyter, and Python, to discover and pinpoint interactions and patterns of behavior in the data that are indicative of fraud, illegal activities, optimization opportunities, and more.

Some common use cases for Neptune Analytics include ephemeral analytics, running low-latency analytic queries, running built-in graph algorithms, and performing vector similarity search. With vector similarity search, Neptune Analytics can be used for building Retrieval Augmented Generation (RAG) applications that search through dense data representations provided by embeddings. The vector search results can be combined with contextually aware data representations in graphs for providing rich contextual information related to relationships.

Neptune ML can be used for designing, building, optimizing, and predicting relationships and categorizations using state-of-the-art GNNs. For augmenting feature tables, Neptune Analytics can be used for deriving critical features from connected data using common algorithms such as clustering, centrality, and path finding.

Yes. Please see the Amazon Neptune SLA.

Yes, you can purchase a Database Savings Plans for your Amazon Neptune usage and reduce your costs by up to 30% when you commit to a consistent amount of usage over a 1-year term. Additional information on eligible usage can be found on Database Savings Plans pricing page.

Neptune Database

Open all

Yes. Please see the Amazon Neptune SLA.Neptune Database supports two query languages for the property graph data model, the open-source Apache TinkerPop Gremlin graph traversal language and the openCypher query language, and for the Resource Description Framework (RDF) data model, Neptune supports the W3C open standard SPARQL query language.

Yes, each Neptune Database cluster can store both property graph data and RDF data. Neptune provides a Gremlin endpoint (HTTPS and WebSocket), openCypher endpoint (HTTPS and Bolt), and a SPARQL 1.1 Protocol REST endpoint.

For property graphs, you can execute either a Gremlin or openCypher query over the same data regardless of which language was used to enter that data. You may find it more convenient to use Gremlin for some workloads and openCypher for others. You cannot execute a query for property graph data (Gremlin or openCypher) over RDF data or vice-versa.

Neptune Database provides an Apache TinkerPop Gremlin Server that supports both HTTPS and WebSocket connections. Once you provision an instance of Neptune, you can configure your existing TinkerPop application to use the endpoint provided by the service. See also accessing the Graph via Gremlin.

No, Neptune Gremlin Server supports clients that are compatible with Apache TinkerPop using both WebSockets and HTTPS REST connections. The latest version of Neptune Database supports TinkerPop 3.6.x. Please consult the documentation for more information.

With Neptune support for the openCypher query language, you can move most Cypher or Neo4j workloads that use the Bolt protocol or HTTPS to Neptune. For more detailed information on how to migrate an openCypher application, read the migration guide in the documentation.

Neptune provides an HTTPS REST endpoint that implements the SPARQL 1.1 Protocol. Once you provision a service instance, you can configure your application to point to the SPARQL endpoint. See also accessing the Graph via SPARQL.

No, the Neptune SPARQL endpoint will work with any client that supports the SPARQL 1.1 Protocol.

Yes, Neptune is ACID compliant with immediate consistency on the primary writer instance, and eventual consistency on the read replica instances.

Neptune Database is a purpose-built, high-performance graph database engine. For certain management features such as instance lifecycle management, encryption at rest with AWS Key Management Service (AWS KMS) keys, and security groups management, Neptune uses operational technology that is shared with Amazon Relational Database Service (Amazon RDS).

Neptune Database is designed to support graph applications that require high throughput and low-latency graph queries. With support for up to 15 read replicas, Neptune Database can support hundreds of thousands of queries per second.

Yes, Neptune uses query optimization for Gremlin, openCypher and SPARQL queries. To learn more, see the Amazon Neptune alternative query engine (DFE).

No, Neptune is a purpose-built, high-performance graph database engine. Neptune efficiently stores and navigates graph data, and uses a scale-up, in-memory optimized architecture to allow for fast query evaluation over large graphs.

See our pricing page for current pricing information.

For more information about the AWS Regions where Neptune Database is available, see the AWS Regions table.

No. Neptune Database replication is bundled into the price. You are charged based on the storage your database consumes at the database layer, not the storage consumed in the Neptune virtualized storage layer.

Neptune Database was designed to remove unnecessary I/O operations to reduce costs and ensure resources are available for serving read/write traffic. Write I/O operations are only consumed when pushing transaction log records to the storage layer for the purpose of making writes durable. Write I/O operations are counted in 4 KB units. For example, a transaction log record that is 1,024 bytes will count as one I/O operation.

However, concurrent write operations whose transaction log is less than 4 KB can be batched together by the Neptune database engine in order to optimize I/O consumption. Unlike traditional database engines, Neptune never pushes modified database pages to the storage layer, resulting in further I/O consumption savings.

The minimum storage is 10 GiB. Based on your database usage, your Neptune storage will automatically grow, up to 128 TiB, in 10 GiB increments with no impact to database performance. There is no need to provision storage in advance.

You can scale the compute resources allocated to your database instance in the AWS Management Console by selecting the desired database instance and choosing the Modify button. Memory and CPU resources are modified by changing your DB Instance class.

When you modify your DB instance class, your requested changes will be applied during your specified maintenance window. Alternatively, you can use the Apply Immediately flag to apply your scaling requests immediately. Both of these options will have an availability impact for a few minutes as the scaling operation is performed. Bear in mind that any other pending system changes will also be applied.

You can automatically scale your database capacity with Amazon Neptune Serverless. Neptune Serverless allows you to run and instantly scale graph workloads, without the need to manage and optimize capacity. Neptune Serverless automatically determines and provisions the compute and memory resources to run the graph database, and scales capacity based on the workload’s changing requirements to maintain consistent performance.

Yes, Neptune supports auto-scaling of read replicas of instances. You can configure auto-scaling to automatically add or remove read replicas in response to changes in your connectivity or workload requirements. For more information, see the documentation.

Automated backups are always enabled on Neptune Database instances. Backups do not impact database performance.

Yes, and there is no performance impact when taking snapshots. Note that restoring data from database snapshots requires creating a new database instance.

Neptune Database automatically maintains copies of your data across three Availability Zones and will automatically attempt to recover your database in a healthy Availability Zone with no data loss. In the unlikely event your data is unavailable within Neptune storage, you can restore from a database snapshot or perform a point-in-time restore operation to a new instance. Note that the latest restorable time for a point-in-time restore operation can be up to 5 minutes in the past.

You can choose to create a final database snapshot when deleting your database Instance. If you do, you can use this database Snapshot to restore the deleted database instance at a later date. Neptune retains this final user-created database Snapshot along with all other manually created database snapshots after the database instance is deleted. Only database snapshots are retained after the database instance is deleted (for example, automated backups created for point-in-time restore are not kept).

There is no charge for sharing snapshots between accounts. However, you may be charged for the snapshots themselves, as well as any databases you restore from shared snapshots. Learn more about Amazon Neptune pricing.

No, Neptune snapshots can only be used inside of the service.

A Neptune Database cluster can only be created in an Amazon VPC that has at least two subnets in at least two Availability Zones. By distributing your cluster instances across at least two Availability Zones, Neptune helps ensure that there are instances available in your database cluster in the unlikely event of an Availability Zone failure. The cluster volume for your Neptune Database cluster always spans three Availability Zones to provide durable storage with less possibility of data loss. Neptune is designed to transparently handle the loss of up to two copies of data without affecting database write availability and up to three copies without affecting read availability. Neptune storage is also self-healing. Data blocks and disks are continuously scanned for errors and repaired automatically.

Unlike other databases, after a database crash, Neptune does not need to replay the redo log from the last database checkpoint (typically 5 minutes) and confirm that all changes have been applied before making the database available for operations. This reduces database restart times to less than 60 seconds in most cases. Neptune moves the buffer cache out of the database process and makes it available immediately at restart time. This prevents you from having to throttle access until the cache is repopulated to avoid brownouts.

Neptune supports read replicas, which share the same underlying volume as the primary instance. Updates made by the primary are visible to all Amazon Neptune Replicas. One Neptune cluster can have one writer instance and up to 15 read replicas. In the event of a writer instance failure, a read replica will be automatically promoted to a writer instance.

Yes, Neptune Database supports cross-Region replication by configuring your Neptune cluster to use Neptune Global Database.

Yes. You can assign a promotion priority tier to each instance on your cluster. When the primary instance fails, Neptune Database will promote the replica with the highest priority to primary. If there is contention between two or more replicas in the same priority tier, then Neptune will promote the replica that is the largest size.

You can modify the priority tier for an instance at any time. Simply modifying priority tiers will not trigger a failover.

You can assign lower priority tiers to replicas that you don’t want promoted to the primary instance. However, if the higher priority replicas on the cluster are unhealthy or unavailable for some reason, then Neptune will promote the lower priority replica.

You can add Neptune Replicas, which share the same underlying storage as the primary instance. Any Neptune Replica can be promoted to become primary without any data loss and therefore can be used for enhancing fault tolerance in the event of a primary database Instance failure. To increase database availability, simply create 1 to 15 replicas, and Neptune will automatically include them in failover primary selection in the event of a database outage.

Failover is automatically handled by Neptune Database so that your applications can resume database operations as quickly as possible without manual administrative intervention. If you have a Neptune Replica, in the same or a different Availability Zone, when failing over, Neptune flips the canonical name record (CNAME) for your database primary endpoint to a healthy replica, which in turn is promoted to become the new primary. From start to finish, failover typically completes within 30 seconds.

Additionally, the read replicas endpoint doesn't require any CNAME updates during failover. If you do not have a Neptune Replica (such as a single instance), Neptune will first attempt to create a new database instance in the same Availability Zone as the original instance. If unable to do so, Neptune will attempt to create a new database instance in a different Availability Zone. From start to finish, failover typically completes in under 15 minutes. Your application should retry database requests in the event of connection loss.

Neptune Database will automatically detect a problem with your primary instance and begin routing your read/write traffic to a Neptune Database Replica. On average, this failover will complete within 30 seconds. In addition, the read traffic that your Neptune Database Replicas were serving will be briefly interrupted.

Since Neptune Database Replicas share the same data volume as the primary instance, there is virtually no replication lag. We typically observe lag times in the tens of milliseconds.

Yes, all Amazon Neptune Database instances must be created in a VPC. With Amazon VPC, you can define a virtual network topology that closely resembles a traditional network that you might operate in your own datacenter. This gives you complete control over who can access your Neptune databases.

Currently, encrypting an existing unencrypted Neptune instance is not supported. To use Neptune encryption for an existing unencrypted database, create a new database instance with encryption enabled and migrate your data into it.

Access to Neptune databases must be done through the HTTPS port entered on database creation within your VPC. This is done to provide an additional layer of security for your data. Step-by-step instructions on how to connect to your Neptune database are provided in the Amazon Neptune User Guide.

Neptune Analytics

Open all

You can use openCypher, an open-source project that makes it easy to use the Cypher language for graph processing, invoking the Neptune Analytics algorithms, and for vector similarity search.

Neptune Analytics is well suited for graph queries that access large parts of a graph or whole graphs. Neptune Analytics is an in-memory engine, and it can load these large graphs into memory to deliver a response in seconds. In addition, Neptune Analytics can serve thousands of analytic queries per second using a library of popular graph analytics algorithms for operations such as ranking social influencers, detecting groups for fraud, or finding patterns in network activity. For generative AI applications, Neptune Analytics can store vector embeddings and provide vector similarity searches.

You can select an existing Neptune cluster as the data source, which will be automatically loaded into Neptune Analytics.

Neptune Analytics supports 12 algorithms for path finding, detecting communities (clustering), identifying important data (centrality), and quantifying similarity. Path finding algorithms are used for use cases such as route planning for supply chain optimization, while centrality algorithms such as page rank identify the most influential sellers in a graph. Similarly, algorithms such as connected components, clustering, and centrality algorithms can be used for fraud-detection use cases to determine whether the connected network is a group of friends or a fraud ring formed by a set of coordinated fraudsters.

Yes, Neptune Analytics is ACID compliant with strong consistency.

Neptune Analytics supports a vector search index on embeddings (up to 65,000 dimensions) stored in your graph data.

Neptune Analytics supports one vector search index on embeddings stored in your graph data.

No, you do not need a separate vector database with Neptune Analytics. Neptune Analytics supports a vector search index on embeddings (up to 65,000 dimensions) stored in your graph data. Neptune Analytics provides efficient vector search that can be invoked directly from the openCypher query language that is used for writing your graph queries.

Neptune Analytics stores the vectors and supports Hierarchical Navigable Small Worlds (HNSW) for performing vector indexing and similarity search. You should use a separate vector database if you want to use different indexing and similarity search algorithms or if you want to use multiple indices built on different properties.

Yes, you can use Neptune Analytics in Amazon VPC. For private access, you can create a graph with ‘public-access’ disabled (default) and specify the subnets in a VPC. Neptune Analytics will create a requester-managed VPC interface endpoint per graph in your VPC. You will be able to attach security groups and endpoint policies to the endpoints, but you will not be able to delete the endpoints. Standard VPC interface endpoint charges will apply.

Yes. You can optionally enable a public graph-specific endpoint to connect to the graph over the internet. With Neptune Analytics, all clients must authenticate, all requests need to be SigV4-signed, and all connections must use the graph ID to interact with the graph.

Yes, Neptune Analytics offers Multi-AZ deployments with enhanced availability and durability. By default, it provisions a hot standby in a separate Availability Zone. With a hot standby, the failover time is in seconds. Without a standby, the service provisions new underlying compute capacity within minutes.

Visit the Neptune Pricing page for current pricing information.

For more information about the Regions where Neptune Analytics is available, see the AWS Regions table.

Neptune ML

Open all

Gremlin and SPARQL are supported with Neptune ML.

Neptune ML provides node classification, multi-class classification, node regression, edge classification, single-class classification, edge regression, and link (edge) prediction. For more details, visit the Neptune User Guide.

Pricing details can be found on the Neptune Pricing page.

Next steps

Resources

Learn with hands-on training

Get started with Neptune

Documentation

Dive deep on Neptune

Read the documentation

Free tier

Sign up for a free account

Try for free

Console

Amazon Neptune FAQs

General

Neptune Database

Neptune Analytics

Neptune ML

Next steps

Learn with hands-on training

Dive deep on Neptune

Sign up for a free account

Start building in the console

Learn

Resources

Developers

Help

Amazon Neptune FAQs

General

What is Amazon Neptune?

What is Amazon Neptune Database?

What is Amazon Neptune Analytics?

What is Amazon Neptune ML?

When would I use Neptune Database, Neptune Analytics, or Neptune ML?

Does Amazon Neptune have a service level agreement (SLA)?

Is Amazon Neptune supported on Database Savings Plans?

Neptune Database

What popular graph query languages does Neptune Database support?

Can I use Apache TinkerPop Gremlin, openCypher, and RDF/SPARQL on the same Neptune Database cluster?

How can I migrate from an existing Apache TinkerPop Gremlin application to Neptune Database?

Do I need to change client drivers to use Neptune Gremlin Server?

How can I migrate from an existing openCypher application to Neptune Database?

How can I migrate from a triple store with a SPARQL endpoint to Neptune Database?

Do I need to change client drivers to use Neptune SPARQL endpoint?

Is Neptune Database ACID (Atomicity, Consistency, Isolation, Durability) compliant?

Why are Amazon RDS permissions and resources required to use Neptune Database?

What types of graph query workloads are optimized to work with Neptune Database?

Does Neptune Database perform query optimization?

Is Neptune Database built on a relational database?

How much does Neptune Database cost?

In which AWS Regions is Neptune Database available?

Neptune Database replicates each chunk of my database volume across three Availability Zones. Does that mean that my effective storage price will be three times what is shown on the pricing page?

What are I/O operations in Neptune Database and how are they calculated?

What are the minimum and maximum storage limits of a Neptune database?

How do I scale the compute resources associated with my Neptune Database instance?

Can I scale my Neptune database up and down automatically based on usage?

Does Neptune Database support auto-scaling?

How do I enable backups for my Neptune Database instance?

Can I take database snapshots and keep them around as long as I want?

If my database fails, what is my recovery path?

What happens to my automated backups and database snapshots if I delete my database instance?

Can I share my snapshots with another AWS account?

Will I be billed for shared snapshots?

Can I automatically share snapshots?

How many accounts can I share snapshots with?

In which Regions can I share my Neptune Database snapshots?

Can I share my Neptune Database snapshots across different Regions?

Can I share an encrypted Neptune Database snapshot?

Can I use Neptune snapshots outside of the service?

How does Neptune Database improve my database’s fault tolerance to disk failures?

How does Neptune Database improve recovery time after a database crash?

What kind of replication does Neptune Database support?

Can I have cross-Region replicas with Neptune Database?

Can I prioritize certain replicas as failover targets over others?

Can I modify priority tiers for instances after they have been created?

Can I prevent certain replicas from being promoted to the primary instance?

How can I improve upon the availability of a single Neptune database?

What happens during failover and how long does it take?

If I have a primary database and an Amazon Neptune Database Replica actively taking read traffic and a failover occurs, what happens?

How far behind the primary will my replicas be?

Can I use Neptune Database in Amazon Virtual Private Cloud (Amazon VPC)?

Can I encrypt an existing unencrypted database?

How do I access my Neptune Database?

Neptune Analytics

What popular graph query languages does Amazon Neptune Analytics support?

What types of graph query workloads are optimized to work with Neptune Analytics?

How can I use Neptune Analytics with graphs in my Neptune Database?

Which graph algorithms are supported today?

Is Neptune Analytics ACID compliant?

What is the maximum dimensionality of vectors supported with Neptune Analytics?

How many indices can I add?

Do I need a separate vector database with Neptune Analytics?

Can I use Neptune Analytics in Amazon VPC?

Can I access Neptune Analytics over the public internet?

Does Neptune Analytics support high availability?

How much does Neptune Analytics cost?

In which Regions is Neptune Analytics available?

Neptune ML

What languages are supported with Neptune ML?

What kind of prediction tasks can I do with Neptune ML?

How much does Neptune ML cost?

Next steps

Learn with hands-on training

Dive deep on Neptune

Sign up for a free account

Start building in the console

Learn