How do I troubleshoot high JVM memory pressure on my OpenSearch Service cluster?

3 minute read

My Amazon OpenSearch Service cluster has high JVM memory pressure, and I don't know how to resolve this.

Short description

By default, OpenSearch Service uses 50% of an instance's RAM for JVM heaps up to 32 GiB in size. The JVM memory pressure specifies the percentage of the Java heap in a cluster node. The following guidelines indicate what the JVM memory pressure percentages mean:

If JVM memory pressure reaches 75%, then OpenSearch Service initiates the Concurrent Mark Sweep (CMS) garbage collector for x86 Amazon Elastic Compute Cloud (Amazon EC2) instance types. ARM-based Graviton Amazon EC2 instance types use the Garbage-First (G1) garbage collector that uses additional short pauses and heap defragmentation. The garbage collection is a CPU-intensive process. If memory usage continues to grow, then you might encounter ClusterBlockException, JVM OutOfMemoryError, or other cluster performance issues. For more information, see Recovering from a continuous heavy processing load.
If JVM memory pressure exceeds 92% for 30 minutes, then OpenSearch Service blocks all write operations.
If JVM memory pressure reaches 100%, then OpenSearch Service JVM is configured to exit and eventually restarts on OutOfMemory (OOM).

The following reasons can cause high JVM memory pressure:

Spikes in the numbers of requests to the cluster.
Aggregations, wildcards, and selecting wide time ranges in the queries.
Unbalanced shard allocations across nodes or too many shards in a cluster.
Field data or index mapping explosions.
Instance types that can't handle incoming loads.

Resolution

Reduce traffic to the cluster to resolve high JVM memory pressure issues. To reduce traffic to the cluster, follow these best practices:

Clear the field data cache with the POST /index_name/_cache/clear?fielddata=true API operation.
Note: Clearing the cache can disrupt queries that are in progress.
Don't aggregate text fields or change the mapping type to "keyword".
Scale the domain so that the maximum heap size per node is 32 GB.
Turn on slow logs (OpenSearch website) to determine faulty requests.
Note: Verify that the JVM memory pressure is below 90%. For more information about slow Elasticsearch queries, see Advanced tuning: finding and fixing slow Elasticsearch queries on the Elasticsearch website.
Choose the correct number of shards to optimize search or indexing. For more information about indexing and shard count, see How do I rebalance the uneven shard distribution in my Amazon OpenSearch Service cluster?
Delete old or unused indices to reduce the number of shards.
For advanced users, you can update the parent field data cache allocation, or request circuit breaker settings according to your use case. For more information about JVM circuit breakers, see JVM OutOfMemoryError.

For more information about how to troubleshoot high JVM memory pressure, see Why did my OpenSearch Service node crash?

Related information

Troubleshooting Amazon OpenSearch Service

How can I scale up or scale out an Amazon OpenSearch Service domain?

Get started with Amazon Elasticsearch Service: How many shards do I need?

Topics

Serverless Analytics

Relevant content

OpenSearch stuck at updating service software for months
rePost-User-0261550
asked 10 months ago
tomcat8 jvm memory usage metric
AWS-User-4628301
asked 2 years ago
Amazon Opensearch MasterJVM Memory Pressure usage gets high after upgrading Data nodes
chris2022
asked 2 years ago
AWS OpenSearch Service recommended JVM heap size is often set to 50% of system RAM. If JVM heap size is limited to 32GB, does that mean there is no point in using node types with more than 64GB RAM?
Accepted Answer
SUPPORT ENGINEER
jude-aws
asked 2 years ago
What does opensearch error type: cluster_block_exception reason: blocked by: [FORBIDDEN/8/index write (api)....... mean?
rePost-User-3074038
asked a year ago
Why is the SysMemoryUtilization so high on my Amazon OpenSearch Service cluster?
AWS OFFICIALUpdated 3 years ago
How do I troubleshoot search latency spikes in my Amazon OpenSearch Service cluster?
AWS OFFICIALUpdated a year ago
How do I troubleshoot high CPU utilization on my Amazon OpenSearch Service cluster?
AWS OFFICIALUpdated a year ago
How do I troubleshoot a circuit breaker exception in Amazon OpenSearch Service?
AWS OFFICIALUpdated 3 years ago
How to e-mail OpenSearch reports and dashboards via opensearch-reporting-cli
EXPERT
Sohaib Katariwala
published 2 months ago