PTF - Data Export Test Report (Sunflower) [NON-ECS]

PTF - Data Export Test Report (Sunflower) [NON-ECS]

Overview

  • This document contains the results of three times testing Data Export (MARC BIB) on the Sunflower[NON-ECS] release. After one month of the first test, two additional tests were conducted. In the second test, mod circulation storage was enabled. In the third test, mod circulation storage on module was disabled. All test results have been summarized correctly.

PERF-1115: [Sunflower] [non-ECS] [Data Export] MARC BIBIn QA

Summary

  • Data Export tests finished successfully(except some duplications) on using the profiles Default instances export job profile and SRS - holdings and items job profile.

  • Comparing with previous results of Ramsons and Sunflower releases

    • Data Export processed all files including file with 500k records with negligible duplication errors in Sunflower env.

    • Data export durations are nearly twice as slow compared to Ramsons releases when mod circulation storage is enabled.

    • When mod circulation storage is disabled, it faster than run #1 and #2 but slower compared to Ramsons/

Test Results

This table contains durations for Data Export with two job profiles. 

 

Profile

CSV  File

Tenant (fs09000000)

Result

Status

DE MARC Bib (Default instances export job profile)

1k.csv

0:00:09

COMPLETED

100k.csv

0:06:03

COMPLETED

500k.csv

0:06:46

COMPLETED

DE MARC Bib (SRS - holdings and items job profile)

1k.csv

0:00:17

COMPLETED

100k.csv

0:12:20

COMPLETED

500k.csv

0:15:13

COMPLETED

Comparison

This table contains durations comparison between Sunflower and Ramsons releases.

Profile

CSV  File

DE Duration
Sunflower

DE Duration
Ramsons

DE Duration, DELTA Sunflower/Ramsons

Result
hh:mm:ss

Result
hh:mm:ss

hh:mm:ss / percent

DE MARC Bib (Default instances export job profile)

1k.csv

0:00:09

0:00:03

+00:00:06

100k.csv

0:06:03

0:02:19

+00:03:44
+161%

500k.csv

0:07:49

0:04:33

+00:03:16
+71%

DE MARC Bib (SRS - holdings and items job profile)

1k.csv

0:00:17

0:00:06

+00:00:11

100k.csv

0:12:20

0:07:02

+00:05:18
+75%

500k.csv

0:17:45

0:08:57

+00:08:48
+98%

 

Resource utilization for Tests

 Resource utilization table

CPU 

RAM 

CPU 

RAM 

mod-data-export-b

88.2%

mod-inventory-b

78.3%

mod-remote-storage-b

13.7%

mod-data-export-b

63%

mod-inn-reach-b

13.7%

mod-users-keycloak-b

56.4%

mod-audit-b

11.3%

mod-audit-b

55.1%

mod-pubsub-b

8.2%

mod-pubsub-b

45.1%

mod-users-keycloak-b

7.8%

mod-quick-marc-b

37.6%

mod-inventory-b

7.7%

mod-inn-reach-b

31.8%

mod-quick-marc-b

7.4%

mod-invoice-storage-b

31.8%

mod-invoice-storage-b

7.4%

mod-remote-storage-b

 

22.2%

Instance CPU Utilization

image-20250502-055819.png

Service CPU Utilization

Here we can see that mod-data-export used 88% CPU in spike.

image-20250501-141747.png

Service Memory Utilization

Here we can see all services have stable trends but mod-data-export which used more memory when data volume increased.

image-20250502-122302.png

DB CPU Utilization

DB CPU spike was 77% when 1k data was exported and first 100k data started its job. However, for data 500k, DB cpu utilization fluctuated between 50% and 60%.

image-20250610-114108.png

DB Connections

DB connections was 1201.

image-20250501-193522.png

Kafka metrics

image-20250501-194344.png
image-20250501-194903.png

OpenSearch Data Nodes metrics

image-20250501-201005.png

DB load

image-20250501-212518.png

Top SQL-queries

image-20250502-163504.png

Top applications

image-20250501-212820.png

Appendix

Infrastructure

PTF - environment Sunflower NON-ECS (secp1)

  • secp1 12 r7g.2xlarge EC2 instances located in US East (N. Virginia)us-east-1

  • 1 instance of db.r7.xlarge database instance: Writer instance

  • MSK fse-test

    • 4 kafka.m7g.xlarge brokers in 2 zones (2 brokers per zone)

      • Apache Kafka version 3.7.x, metadata mode - KRaft

      • EBS storage volume per broker 300 GiB

      • auto.create.topics.enable=true

      • log.retention.minutes=480

      • default.replication.factor=3

      • revision - 26

  • OpenSearch 2.13 ptf-test cluster

    • r7g.2xlarge.search 4 data nodes

    • r6g.large.search 3 dedicated master nodes