Short Description
-----------------

When you produce to or consume from a Kinesis data stream, the following internal errors can occur:

*   **PutRecord** or **PutRecords** returns an **AmazonKinesisException 500** or **AmazonKinesisException 503** error with a rate above 1% for several minutes
*   **SubscribeToShard** or **GetRecords** returns an **AmazonKinesisException 500** or **AmazonKinesisException 503** error with a rate above 1% for several minutes

To troubleshoot these internal errors, complete the following steps:

1. Calculate your error rate.
2. Implement a retry mechanism.

Resolution
----------

### Calculate your error rate

On the **Monitoring** tab of your data stream, look for significant drops in **PutRecord.Success** or **GetRecords.Success** over the affected time window. If there are significant drops, then calculate the error rate to determine the severity of your Kinesis data stream issue.

To calculate your error rate, take the **Average** statistic of **PutRecord.Success** or **GetRecords.Success** for that time window. The average is the fraction of requests that succeeded, so the error rate is 1 minus that value. For example, an average of 0.98 corresponds to a 2% error rate.
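The calculation can also be scripted. The following is a minimal sketch, assuming that boto3 is installed and credentials are configured. The function names, the 15-minute window, and the 60-second period are illustrative choices, not values from this article. Averaging the per-minute averages weights each minute equally, so treat the result as an approximation.

```python
from datetime import datetime, timedelta, timezone


def error_rate_percent(success_averages):
    """Convert Average datapoints of a *.Success metric (0.0 to 1.0) to an error rate in percent."""
    if not success_averages:
        raise ValueError("no datapoints in the selected time window")
    mean_success = sum(success_averages) / len(success_averages)
    return (1.0 - mean_success) * 100.0


def fetch_success_averages(stream_name, metric_name="PutRecord.Success", minutes=15):
    """Read per-minute Average datapoints for a stream-level Success metric from CloudWatch."""
    import boto3  # imported lazily so error_rate_percent works without AWS access

    cloudwatch = boto3.client("cloudwatch")
    end = datetime.now(timezone.utc)
    response = cloudwatch.get_metric_statistics(
        Namespace="AWS/Kinesis",
        MetricName=metric_name,
        Dimensions=[{"Name": "StreamName", "Value": stream_name}],
        StartTime=end - timedelta(minutes=minutes),
        EndTime=end,
        Period=60,
        Statistics=["Average"],
    )
    # CloudWatch does not guarantee datapoint order, so sort by timestamp.
    points = sorted(response["Datapoints"], key=lambda p: p["Timestamp"])
    return [p["Average"] for p in points]
```

For example, `error_rate_percent(fetch_success_averages("my-stream"))` returns the approximate error rate for a hypothetical stream named `my-stream`.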

### Implement a retry mechanism

Confirm that the error rate falls below 0.1%. Kinesis Data Streams supports high-throughput writes with a low error rate, and average error rates are typically below 0.01%.

If you wrote your own consumer or producer, then implement a retry mechanism in your application code. For more information about retry mechanism implementations, see the **Retries** section in [Implementing efficient and reliable producers with the Amazon Kinesis Producer Library](https://aws.amazon.com/blogs/big-data/implementing-efficient-and-reliable-producers-with-the-amazon-kinesis-producer-library/).
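As an illustration of a retry mechanism, the following is a minimal sketch of retries with exponential backoff and full jitter. It is not the implementation from the linked blog post. The names `call_with_retries` and `is_retryable`, the attempt limit, and the delay values are assumptions for this example. It assumes exceptions shaped like botocore's `ClientError`, which carries the HTTP status code in `exc.response`.

```python
import random
import time

RETRYABLE_STATUS = {500, 503}  # internal failure and service unavailable


def is_retryable(exc):
    """Return True if the exception carries an HTTP 500 or 503 status."""
    response = getattr(exc, "response", None) or {}
    status = response.get("ResponseMetadata", {}).get("HTTPStatusCode")
    return status in RETRYABLE_STATUS


def call_with_retries(operation, max_attempts=5, base_delay=0.1,
                      max_delay=5.0, sleep=time.sleep):
    """Call operation(), retrying 500/503 failures with exponential backoff and full jitter."""
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except Exception as exc:
            # Give up on the last attempt or on errors that retries cannot fix.
            if attempt == max_attempts or not is_retryable(exc):
                raise
            # The upper bound doubles on each attempt and is capped at max_delay.
            upper = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(random.uniform(0, upper))
```

With a boto3 Kinesis client, a call might look like `call_with_retries(lambda: kinesis.put_record(StreamName="my-stream", Data=b"payload", PartitionKey="key-1"))`, where the stream name, payload, and partition key are placeholders. Retrying a write can produce duplicate records, so consumers should tolerate duplicates.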

If your error rate exceeds 1% for several minutes, then [contact AWS Support](https://console.aws.amazon.com/support/home#/case/create). Include the following information in your communications:

*   Applications used to read data from or write data to Kinesis Data Streams
*   Number of shards in your Kinesis data stream
*   Server-side encryption settings
*   Specific shard IDs that are impacted
*   Time frame when drops in success rates are observed
*   Request IDs that are reporting internal failures
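Request IDs and timestamps are easier to report if your application records them when a call fails. The following is a minimal sketch, assuming exceptions shaped like botocore's `ClientError`, which exposes `ResponseMetadata` and `Error` in `exc.response`. The helper name `failure_details` and the output field names are hypothetical.

```python
from datetime import datetime, timezone


def failure_details(exc, stream_name, shard_id=None):
    """Collect the fields worth logging when a Kinesis call fails."""
    response = getattr(exc, "response", None) or {}
    metadata = response.get("ResponseMetadata", {})
    error = response.get("Error", {})
    return {
        "timestamp_utc": datetime.now(timezone.utc).isoformat(),
        "stream": stream_name,
        "shard_id": shard_id,  # pass this in when the caller knows it
        "http_status": metadata.get("HTTPStatusCode"),
        "error_code": error.get("Code"),
        "request_id": metadata.get("RequestId"),
    }
```

Logging this dictionary in the `except` block around each Kinesis call gives you the request IDs, time frame, and affected shard to include in a support case.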

Related Information
-------------------

[Developing producers using the Amazon Kinesis Producer Library](https://docs.aws.amazon.com/streams/latest/dev/developing-producers-with-kpl.html)

[Developing KCL 2.x consumers](https://docs.aws.amazon.com/streams/latest/dev/developing-consumers-with-kcl-v2.html)

I want to troubleshoot Amazon Kinesis Data Streams 500 Internal Server Error or 503 Service Unavailable errors.

Troubleshoot 500 internal server errors in Kinesis Data Streams
