AWS Database Blog

Evaluate logical replication performance with synthetic load generation using Apache JMeter on Amazon Aurora PosgreSQL

Many customers face challenges when they have to choose the right instance size and assess the logical replication latency. When they’re using third-party replication tools like HVR, SharePlex, Oracle GoldenGate, or AWS Database Migration Service (AWS DMS ), it’s important to identify any networking glitches or compute resource allocation at the beginning. In this post, we explain how to generate synthetic workload with Apache JMeter, and evaluate logical replication performance. You can use Apache JMeter either with DMS or other replication tools in heterogeneous or in homogeneous database migration.

Apache JMeter overview

Apache JMeter is a Java-based open-source testing tool used for analyzing and measuring the performance of different software web applications and products. It supports a wide variety of applications, servers and protocols. Also, it is primarily used to run performance and load testing of web applications and databases. We focus on the replication performance, if you are looking for a post about JMeter with Amazon Aurora, refer to Configure a performance testing framework for Amazon Aurora PostgreSQL

Solution overview

In this post, we demonstrate how Apache JMeter helps to run data-driven performance and load tests against Amazon Aurora PostgreSQL-Compatible Edition. To mimic a production scenario, we created an active-active environment with AWS DMS as a logical replication tool to another Aurora PostgreSQL database instance. We monitor database performance with the help of Amazon RDS Performance Insights, and we can control database load with Apache JMeter and monitor performance.

We test replication throughput performance with different workload sizes; we don’t cover AWS DMS instances or Aurora PostgreSQL database performance.

The following diagram illustrates the solution architecture.

Prerequisites

  • An Amazon Elastic Compute Cloud (Amazon EC2) instance for installing and configuring the Apache JMeter software. The EC2 instance should have connectivity to the Aurora database instance and adequate computing power to run a performance test, at lest
  • 4GB memory allocated just for JMeter.
  • Two Aurora PostgreSQL database instances to test and deploy replication.
  • One AWS DMS instance, having connectivity to Primary and Secondary databases.

Implement the solution

We install Apache JMeter on an EC2 instance and connect to an Aurora PostgreSQL database as source and Aurora PostgreSQL as target with AWS DMS as the replication tool. The solution uses a synthetic workload testing schema and SQLs to create multiple load sizes. You can customize this based on your organization’s requirements to capture replication statistics.

With Apache JMeter, we can create multiple threads to generate synthetic load with database manipulation language with different INSERT, UPDATE, and DELETE statements. We can create various sample sizes based on SQL and generate different iterations to create stress and capture throughput and any bottlenecks within the database.

Apache JMeter test plan

The following screenshot shows JDBC Database Connection.

The following screenshots shows Thread Group configuration, where you can configure Number of threads, Ramp-up Period and Loop Counts.

For Apache JMeter installation, refer to Building high-quality benchmark tests for Amazon Redshift using Apache JMeter and complete 25 steps to create a test plan. This test plans will help you create synthetic load in the database and we can measure throughput with aggregate report.

  1. Connect to EC2 instance.
  2. Choose the Apache JMeter executable JAR file to open the console.
  3. Create a new test plan.
  4. Create the JDBC connection config element.
  5. Update the JDBC connection config—the variable name for the created pool needs to be the same across the test plan and the subsequent thread group.
  6. Choose (right-click) Test Plan, choose Add, Config Element, and User Defined Variable.
  7. Update the user-defined variable.
  8. Choose (right-click) Test Plan, choose Add, Config Element, and Random Variable.
  9. Update the random variable.
  10. Choose (right-click) Test Plan, choose Add, Threads (Users), and Thread Group.
  11. Update the thread group.
  12. Choose (right-click) Thread Group, choose Add, Logic Controller, and Loop Controller.
  13. Update the loop controller.
  14. Choose (right-click) Thread Group, choose Add, Logic Controller, and Throughput Controller.
  15. Copy the same controller three times for insert, update, and delete.
  16. Update the throughput controller.
  17. Choose (right-click) JDBC Request, under each throughput controller.
  18. Pass your preferred SQL statements for testing.You can use the sample SQL code from the template below included in this post.
  19. Choose (right-click) Test Plan, choose Add, Listener, and View Results Tree.This enables quick debugging and status checks on each throughput controller activity. The results tree should be turned off during test cycles to avoid excessive workloads on the test machines and overall JMeter performance.
  20. Choose (right-click) Test Plan, choose Add, Listener, and Aggregate Report.
  21. Label the report appropriately:
    1. Columns of interest are #Samples, which shows the number of operations from each throughput controller.
    2. Column Average shows the average time it takes to complete the throughput controller statement.
    3. Error% shows the number of failures on each statement.
    4. Throughput shows the TPS (transaction per second).
  22. Change the JVM parameters in the jmeter.bat (Windows) file located in the Apache Home /bin folder (for example, C:\apache-jmeter-5.4.1\binjmeter).
  23. Increase the heap size from 1G to 4G; set HEAP=-Xms4g -Xmx4g -XX:MaxMetaspaceSize=256m.
  24. Restart Apache JMeter.
  25. Run Apache JMeter from the command prompt or GUI mode. From the command prompt, use the following example code:
    jmeter -n -t ~\DMS_Perftest\apgpocw_2MB_custom_payload.jmx -Jusers=50 -Jduration=20 -Jloopcount=200 -l ..\DMS_Perftest\Jmeter100_04282021.jtl

The following objects are created on both the source and target databases for testing AWS DMS replication:

CREATE TABLE PERFTEST.POC_PERF_PARTITION_TEST (
   COL_DT_BI_ID bigserial NOT NULL,
   COL_DT_TMP TIMESTAMP NOT NULL,
   COL_DT_BI_ID2 BIGINT NOT NULL,
   COL_DT_TMP_LAST_DML TIMESTAMP,
   COL_DT_BL boolean,
   COL_DT_NMR NUMERIC (4,2),
   COL_DT_VCHR4000 VARCHAR(4000),
   COL_DT_VCHR80_LAST_UPDATER VARCHAR(80) DEFAULT current_user,
   COL_DT_VCHR01 VARCHAR(40),
   COL_DT_VCHR02 VARCHAR(40),
   COL_DT_VCHR03 VARCHAR(40),
   COL_DT_VCHR04 VARCHAR(40),
   COL_DT_VCHR05 VARCHAR(40),
   COL_DT_VCHR06 VARCHAR(40),
   COL_DT_VCHR07 VARCHAR(40),
   COL_DT_VCHR08 VARCHAR(40),
   COL_DT_VCHR09 VARCHAR(40),
   COL_DT_VCHR10 VARCHAR(40),
   COL_DT_VCHR11 VARCHAR(40),
   COL_DT_VCHR12 VARCHAR(40),
   COL_DT_VCHR13 VARCHAR(40),
   COL_DT_VCHR14 VARCHAR(40),
   COL_DT_VCHR15 VARCHAR(40),
   COL_DT_VCHR16 VARCHAR(40),
   COL_DT_VCHR17 VARCHAR(40),
   COL_DT_VCHR18 VARCHAR(40),
   COL_DT_VCHR19 VARCHAR(40),
   COL_DT_VCHR20 VARCHAR(40),
   COL_DT_VCHR21 VARCHAR(40),
   COL_DT_VCHR22 VARCHAR(40),
   COL_DT_VCHR23 VARCHAR(40),
   COL_DT_VCHR24 VARCHAR(40),
   COL_DT_VCHR25 VARCHAR(40),
   COL_DT_BGINT01 BIGINT,
   COL_DT_BGINT02 BIGINT,
   COL_DT_BGINT03 BIGINT,
   COL_DT_BGINT04 BIGINT,
   COL_DT_BGINT05 BIGINT,
   COL_DT_BGINT06 BIGINT,
   COL_DT_BGINT07 BIGINT,
   COL_DT_BGINT08 BIGINT,
   COL_DT_BGINT09 BIGINT,
   COL_DT_BGINT10 BIGINT,
   COL_DT_BGINT11 BIGINT,
   COL_DT_BGINT12 BIGINT,
   COL_DT_BGINT13 BIGINT,
   COL_DT_BGINT14 BIGINT,
   COL_DT_BGINT15 BIGINT,
   COL_DT_BGINT16 BIGINT,
   COL_DT_BGINT17 BIGINT,
   COL_DT_BGINT18 BIGINT,
   COL_DT_BGINT19 BIGINT,
   COL_DT_BGINT20 BIGINT,
   COL_DT_BGINT21 BIGINT,
   COL_DT_BGINT22 BIGINT,
   COL_DT_BGINT23 BIGINT,
   COL_DT_BGINT24 BIGINT,
   COL_DT_BGINT25 BIGINT,
   COL_DT_BOOL01 boolean,
   COL_DT_BOOL02 boolean,
   COL_DT_BOOL03 boolean,
   COL_DT_BOOL04 boolean,
   COL_DT_BOOL05 boolean,
   COL_DT_BOOL06 boolean,
   COL_DT_BOOL07 boolean,
   COL_DT_BOOL08 boolean,
   COL_DT_BOOL09 boolean,
   COL_DT_BOOL10 boolean,
   COL_DT_BOOL11 boolean,
   COL_DT_BOOL12 boolean,
   COL_DT_BOOL13 boolean,
   COL_DT_BOOL14 boolean,
   COL_DT_BOOL15 boolean,
   COL_DT_BOOL16 boolean,
   COL_DT_BOOL17 boolean,
   COL_DT_BOOL18 boolean,
   COL_DT_BOOL19 boolean,
   COL_DT_BOOL20 boolean,
   COL_DT_BOOL21 boolean,
   COL_DT_BOOL22 boolean,
   COL_DT_BOOL23 boolean,
   COL_DT_BOOL24 boolean,
   COL_DT_BOOL25 boolean,
   COL_DT_BOOL26 boolean,
   COL_DT_TMSTMP01 TIMESTAMP,
   COL_DT_TMSTMP02 TIMESTAMP,
   COL_DT_TMSTMP03 TIMESTAMP,
   COL_DT_TMSTMP04 TIMESTAMP,
   COL_DT_TMSTMP05 TIMESTAMP,
   COL_DT_TMSTMP06 TIMESTAMP,
   COL_DT_TMSTMP07 TIMESTAMP,
   COL_DT_TMSTMP08 TIMESTAMP,
   COL_DT_TMSTMP09 TIMESTAMP,
   COL_DT_TMSTMP10 TIMESTAMP,
   COL_DT_TMSTMP11 TIMESTAMP,
   COL_DT_TMSTMP12 TIMESTAMP,
   COL_DT_TMSTMP13 TIMESTAMP,
   COL_DT_TMSTMP14 TIMESTAMP,
   COL_DT_TMSTMP15 TIMESTAMP,
   COL_DT_TMSTMP16 TIMESTAMP,
   COL_DT_TMSTMP17 TIMESTAMP,
   COL_DT_TMSTMP18 TIMESTAMP,
   COL_DT_TMSTMP19 TIMESTAMP,
   COL_DT_TMSTMP20 TIMESTAMP,
   COL_DT_TMSTMP21 TIMESTAMP,
   COL_DT_TMSTMP22 TIMESTAMP,
   COL_DT_TMSTMP23 TIMESTAMP,
   COL_DT_TMSTMP24 TIMESTAMP,
   COL_DT_TMSTMP25 TIMESTAMP,
   COL_DT_JSONB JSONB,
   COL_DT_BYTEA BYTEA,
   CONSTRAINT PK_POC_PERF_PARTITION_TEST PRIMARY PRIMARY KEY (COL_DT_BI_ID)
) PARTITION BY HASH(COL_DT_BI_ID);

CREATE INDEX PERFTEST.IDX_POC_PERF_PART_TMP ON PERFTEST.POC_PERF_PARTITION_TEST (COL_DT_BI_ID2, COL_DT_TMP) ;
-- partitions 16
CREATE TABLE PERFTEST.POC_PERF_PART_P00 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 0) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P01 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 1) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P02 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 2) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P03 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 3) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P04 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 4) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P05 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 5) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P06 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 6) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P07 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 7) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P08 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 8) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P09 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 9) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P10 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 10) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P11 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 11) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P12 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 12) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P13 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 13) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P14 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 14) ;
CREATE TABLE PERFTEST.POC_PERF_PART_P15 PARTITION OF PERFTEST.POC_PERF_PARTITION_TEST FOR VALUES WITH (MODULUS 16, REMAINDER 15) ;

We use the following SQL commands to generate our synthetic load in:

INSERT INTO PERFTEST.POC_PERF_PARTITION_TEST(col_dt_bi_id, col_dt_tmp,col_dt_bi_id2,col_dt_tmp_last_dml,col_dt_bl,col_dt_nmr,col_dt_vchr4000,col_dt_vchr01,col_dt_vchr02,col_dt_vchr03,col_dt_vchr04,col_dt_vchr05,col_dt_vchr06,col_dt_vchr07,col_dt_vchr08,col_dt_vchr09,col_dt_vchr10,col_dt_vchr11,col_dt_vchr12,col_dt_vchr13,col_dt_vchr14,col_dt_vchr15,col_dt_vchr16,col_dt_vchr17,col_dt_vchr18,col_dt_vchr19,col_dt_vchr20,col_dt_vchr21,col_dt_vchr22,col_dt_vchr23,col_dt_vchr24,col_dt_vchr25,col_dt_bgint01,col_dt_bgint02, col_dt_bgint03, col_dt_bgint04, col_dt_bgint05, col_dt_bgint06,col_dt_bgint07,col_dt_bgint08,col_dt_bgint09,col_dt_bgint10,col_dt_bgint11,col_dt_bgint12,col_dt_bgint13,col_dt_bgint14,col_dt_bgint15,col_dt_bgint16,col_dt_bgint17,col_dt_bgint18,col_dt_bgint19,col_dt_bgint20,col_dt_bgint21,col_dt_bgint22,col_dt_bgint23, col_dt_bgint24, col_dt_bgint25, col_dt_bool01, col_dt_bool02, col_dt_bool03,col_dt_bool04,col_dt_bool05,col_dt_bool06,col_dt_bool07,col_dt_bool08,col_dt_bool09,col_dt_bool10,col_dt_bool11,col_dt_bool12,col_dt_bool13,col_dt_bool14,col_dt_bool15,col_dt_bool16,col_dt_bool17,col_dt_bool18,col_dt_bool19,col_dt_bool20,col_dt_bool21,col_dt_bool22,col_dt_bool23,col_dt_bool24,col_dt_bool25,col_dt_bool26,col_dt_tmstmp01,col_dt_tmstmp02,col_dt_tmstmp03,col_dt_tmstmp04,col_dt_tmstmp05,col_dt_tmstmp06,col_dt_tmstmp07,col_dt_tmstmp08,col_dt_tmstmp09,col_dt_tmstmp10,col_dt_tmstmp11,col_dt_tmstmp12,col_dt_tmstmp13,col_dt_tmstmp14,col_dt_tmstmp15,col_dt_tmstmp16,col_dt_tmstmp17,col_dt_tmstmp18,col_dt_tmstmp19,col_dt_tmstmp20,col_dt_tmstmp21,col_dt_tmstmp22,col_dt_tmstmp23,col_dt_tmstmp24,col_dt_tmstmp25,col_dt_jsonb,col_dt_bytea ) values (${VI_col_dt_bi_id}, now(),random(), now(),true,32.21,'testtest', 'text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text','text',random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),random(),false,true,true,true,true,true,true,true,true,true,true,true,true,false,false,false,false,false,false,false,false,false,false,false,false,false,now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),now(),'{"date":"2021/05/27"}','${__RandomString(${custom_payload},abcdefghijklmnopqrstuvwxyz,)}');

UPDATE PERFTEST.POC_PERF_PARTITION_TEST set COL_DT_TMP_LAST_DML = now() + INTERVAL '1 Hour', COL_DT_VCHR80_LAST_UPDATER='Jiwerqqqqqqadfffffffffffffll' ,COL_DT_TMSTMP01= NOW() - INTERVAL '5 days', COL_DT_BOOL25=false where COL_DT_BI_ID = ${VI_col_dt_bi_id};

You can also add the test statements to the Apache JMeter plan if you have an application that has a greater percentage of deletes:

DELETE FROM 

PERFTEST.POC_PERF_PARTITION_TEST where COL_DT_BI_ID = ${VI_col_dt_bi_id};

You can create additional INSERT or DELETE statements based on specific requirements.

For this post, we generate load in three different phases. During the ramp-up phase, we increase the database load and capture database performance and evaluate database performance statistics along with latency in the replicated database with AWS DMS replication tools.

During the ramp-up period, you can set users to 100 and period to 60 seconds, so that it creates 100 users in the span of 1 minute, which is approximately 0.6 seconds per user (100 users / 60 seconds). You can set the loop count to 100 or forever, which tells you how many times you want to repeat your test. For our testing purposes we use forever, so we can generate sufficient load and can stop it when we have sufficient data to understand key performance indicators (KPIs).

The Apache JMeter load generated 30 parallel threads with 2-minute increments. We can observe how change data capture (CDC) is capturing those incoming changes and replicating a similar pattern in the target database.

We can use the same SQLs in other databases like Oracle or MySQL; we just have to change the SQL syntax respective to the database engine, and we can repurpose the same database table structure to generate artificial load and evaluate database performance.

The following CloudWatch CDC graphs shows replication latency on source and target systems during the test.

On the target database, the CDC process is applying that SQL, so you see an exponential growth of DML statements and a steadily declining number of runs over a period. From this bell curve, we can capture and estimate latency of CDC and make changes to the instance size to increase throughput.

KPIs for any replication is transaction per second, and it varies based on the size of the transaction, network latency, and data integrity constraint on the target system. In our solution, we generate a real-time scenario where we have different sizes of workloads. For workload 1, transactions are 1 KB, which runs frequently, and 75 MB transactions occur every 15 minutes. For workload 2, transaction size is 2MB.

For database admins, it’s very important to maintain sufficient free space for transactional log generation so it can maintain the database’s optimal performance. In our solution, data is being replicated to the target system, we need to make sure that the source and target instance have sufficient compute resources to process the application data and replicated data, Cloud gives you flexibility to change compute resources. We can adjust the instance class based on performance evaluation. We can also change the instance class for our AWS DMS instance, and change the AWS DMS task settings to increase replication performance. On the target system, source tables without primary keys (PKs) would be a big problem for DMS CDC operations.

The following graphs illustrate how CDC captures metrics at the source and target system. Also you can observe throughput bandwidth for the source system.

The following graphs show how CDC applies metrics at the target.

In the following screenshot, we can observe the thread group with 2 MB inserts.

To evaluate replication performance, you can go through the following screenshot which shows the aggregate report, including your throughputs. In our example, it shows that 15 KB inserts were run at 10.2/sec. Similarly, updates ran at around 4.1/sec, and for 2 MB size, at 39.1/hour. We can analyze this data and identify any bottlenecks in our application to identify whether we have any issues with database constraints or issues with triggers or any errors. You can do multiple tests, save the results, and go through various scenarios to evaluate replication performance.

Clean up

The services involved in this solution incur costs. When you’re done using this solution, clean up the following resources:

Conclusion

In this post, we showed how to use Apache JMeter to create a synthetic load to mimic a production scenario where you have various DML statements and evaluate performance. Choosing right AWS DMS replication instance makes a significant difference in performance. We recommend doing thorough testing of your environment based on the test scenario described in this post to help you choose the right database instance for your database migration. Leave a comment with your questions or feedback.


About the authors

Harshad Gohil is a Cloud/Database Consultant with Professional Services team at Amazon Web Services. He helps customers to build scalable, highly available and secure solutions in AWS cloud. His focus area is homogenous and heterogeneous migrations of on-premise infrastructure to AWS cloud.

John Lonappan is a Senior Database Specialist Consultant / Solutions Architect at Amazon Web Services (AWS) with a focus on relational databases. Prior to AWS, John has worked as Database Architect for large Data center providers across the Globe. Outside of work, he is passionate about, Long drives, EV Conversion, playing chess and traveling.

Arjun Nakkala is a Database consultant with Professional services team. He is helping and enabling customers to build high-available, cost-effective database solutions and migrate their commercial engines to AWS cloud. Curious to learn and implement new technologies around the databases and orchestrate migrations through automation. Prior to joining AWS, he supported production and mission-critical database implementation across the financial, banking and healthcare industry segments.