PTF - Data Export Test Report (Sunflower CSP1) [NON-ECS]
1 Overview
2 Summary
6 Appendix
6.1 Infrastructure
Overview
This document contains the results of three rounds of Data Export (MARC BIB) testing on the Sunflower CSP1 [NON-ECS] release. The results are summarized in the sections below.
PERF-1200: [Sunflower CSP1] [non-ECS] [Data Export] MARC BIB (Closed)
Summary
Data Export tests finished successfully (except for some duplications) with two job profiles: "Default instances export job profile" and "SRS - holdings and items job profile".
Comparison with the Sunflower release
Data Export processed all files, including the file with 500k records, with negligible duplication errors in the Sunflower environment.
Before the vacuum, Data Export durations for the 100k and 500k files were 13-32% slower than in the Sunflower release (delta taken as a share of the Sunflower CSP1 duration); the 1k files finished 1-3 seconds faster.
Because the initial durations were noticeably slower, a database vacuum was run and the tests were repeated.
Test Results
This table contains durations for Data Export with two job profiles.
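| Profile | CSV File | Result (tenant fs09000000) | Status |
|---|---|---|---|
| DE MARC Bib (Default instances export job profile) | 1k.csv | 0:00:08 | COMPLETED |
| DE MARC Bib (Default instances export job profile) | 100k.csv | 0:07:00 | COMPLETED |
| DE MARC Bib (Default instances export job profile) | 500k.csv | 0:08:43 | COMPLETED |
| DE MARC Bib (SRS - holdings and items job profile) | 1k.csv | 0:00:14 | COMPLETED |
| DE MARC Bib (SRS - holdings and items job profile) | 100k.csv | 0:18:17 | COMPLETED |
| DE MARC Bib (SRS - holdings and items job profile) | 500k.csv | 0:22:16 | COMPLETED |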
Test Results after vacuum
This table contains durations for Data Export with two job profiles after vacuum. Several tests were performed to obtain more results for comparison.
All results are for tenant fs09000000.

| Profile | CSV File | Result №1 | Result №2 | Result №3 | Result №4 | Result №5 | Status |
|---|---|---|---|---|---|---|---|
| DE MARC Bib (Default instances export job profile) | 1k.csv | - | - | - | - | - | COMPLETED |
| DE MARC Bib (Default instances export job profile) | 100k.csv | 0:01:40 | 0:01:39 | 0:01:39 | 0:01:39 | 0:01:40 | COMPLETED |
| DE MARC Bib (Default instances export job profile) | 500k.csv | 0:05:09 | 0:03:11 | 0:03:01 | 0:03:00 | 0:02:59 | COMPLETED |
| DE MARC Bib (SRS - holdings and items job profile) | 1k.csv | - | - | - | - | - | COMPLETED |
| DE MARC Bib (SRS - holdings and items job profile) | 100k.csv | 0:12:52 | 0:07:24 | 0:05:29 | 0:05:32 | 0:05:22 | COMPLETED |
| DE MARC Bib (SRS - holdings and items job profile) | 500k.csv | 0:13:29 | 0:14:24 | 0:14:29 | 0:14:27 | 0:14:19 | COMPLETED |
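Since each file was exported five times after the vacuum, it can help to reduce every row to a minimum, median, and maximum. The sketch below is only an illustration and not part of the original test tooling; the helper names `to_seconds`, `fmt`, and `summarize` are hypothetical, and the durations are copied from the table above. The median is used because Result №1 is noticeably slower than the later runs in some rows.

```python
from statistics import median

def to_seconds(text):
    """Convert an h:mm:ss string such as '0:05:09' to seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds

def fmt(seconds):
    """Format a number of seconds back to h:mm:ss."""
    seconds = int(round(seconds))
    return f"{seconds // 3600}:{seconds % 3600 // 60:02d}:{seconds % 60:02d}"

def summarize(durations):
    """Return (min, median, max) of a list of h:mm:ss durations."""
    secs = [to_seconds(d) for d in durations]
    return fmt(min(secs)), fmt(median(secs)), fmt(max(secs))

# Results №1-№5 after the vacuum, copied from the table above
runs = {
    "Default 100k": ["0:01:40", "0:01:39", "0:01:39", "0:01:39", "0:01:40"],
    "Default 500k": ["0:05:09", "0:03:11", "0:03:01", "0:03:00", "0:02:59"],
    "SRS 100k": ["0:12:52", "0:07:24", "0:05:29", "0:05:32", "0:05:22"],
    "SRS 500k": ["0:13:29", "0:14:24", "0:14:29", "0:14:27", "0:14:19"],
}

for name, durations in runs.items():
    low, mid, high = summarize(durations)
    print(f"{name}: min {low}, median {mid}, max {high}")
```

For the Default profile with 500k.csv, for example, this gives a median of 0:03:01 against a slowest run of 0:05:09.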
Comparison
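This table compares Data Export durations between the Sunflower CSP1 and Sunflower releases. The Sunflower CSP1 values are the pre-vacuum results from the Test Results table.

| Profile | CSV File | DE Duration, Sunflower CSP1 | DE Duration, Sunflower | DELTA Sunflower CSP1/Sunflower (hh:mm:ss) |
|---|---|---|---|---|
| DE MARC Bib (Default instances export job profile) | 1k.csv | 0:00:08 | 0:00:09 | -00:00:01 |
| DE MARC Bib (Default instances export job profile) | 100k.csv | 0:07:00 | 0:06:03 | +00:00:57 |
| DE MARC Bib (Default instances export job profile) | 500k.csv | 0:08:43 | 0:06:46 | +00:01:57 |
| DE MARC Bib (SRS - holdings and items job profile) | 1k.csv | 0:00:14 | 0:00:17 | -00:00:03 |
| DE MARC Bib (SRS - holdings and items job profile) | 100k.csv | 0:18:17 | 0:12:20 | +00:05:57 |
| DE MARC Bib (SRS - holdings and items job profile) | 500k.csv | 0:22:16 | 0:15:13 | +00:07:03 |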
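The delta column can be reproduced, and expressed as a percentage, with a few lines of Python. This is a minimal sketch and not the script used for the report; `parse_hms` and `compare` are hypothetical helper names. It assumes the first duration column is Sunflower CSP1 (its values match the Test Results table) and the second is Sunflower. Two percentages are printed because the base matters: the 13-32% range quoted in the Summary appears to correspond to the delta as a share of the CSP1 duration, while the same deltas relative to the Sunflower baseline are larger.

```python
from datetime import timedelta

def parse_hms(text):
    """Parse an h:mm:ss duration such as '0:07:00' into a timedelta."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)

def compare(csp1, sunflower):
    """Return (delta in seconds, delta as % of Sunflower, delta as % of CSP1)."""
    new = parse_hms(csp1).total_seconds()
    old = parse_hms(sunflower).total_seconds()
    delta = new - old
    return int(delta), 100 * delta / old, 100 * delta / new

# (Sunflower CSP1, Sunflower) pairs copied from the comparison table
rows = {
    "Default 100k": ("0:07:00", "0:06:03"),
    "Default 500k": ("0:08:43", "0:06:46"),
    "SRS 100k": ("0:18:17", "0:12:20"),
    "SRS 500k": ("0:22:16", "0:15:13"),
}

for name, (csp1, sunflower) in rows.items():
    delta, pct_of_old, pct_of_new = compare(csp1, sunflower)
    print(f"{name}: {delta:+d}s, {pct_of_old:+.1f}% vs Sunflower, "
          f"{pct_of_new:+.1f}% of CSP1 duration")
```

Stating which base a percentage uses next to the figure avoids ambiguity when the report is compared with other releases.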
Resource utilization for Tests
Resource utilization table
| Service | CPU | Service | RAM |
|---|---|---|---|
| mod-users-keycloak-b | 8.15% | mgr-applications-b | 81.41% |
| mod-tlr-b | 7.65% | mod-oa-b | 76.63% |
| mod-login-keycloak-b | 6.87% | mod-finance-b | 69.99% |
| mod-roles-keycloak-b | 6.49% | mgr-tenant-entitlements-b | 68.91% |
| mod-data-export-b | 5.44% | mod-roles-keycloak-b | 68.61% |
| mod-requests-mediated-b | 5.43% | mod-scheduler-b | 67.57% |
| mod-consortia-keycloak-b | 5.03% | mod-audit-b | 66.29% |
| mod-inventory-storage-b | 4.55% | mod-dcb-b | 65.72% |
| mod-circulation-bff-b | 3.98% | mod-users-keycloak-b | 65.12% |
Instance CPU Utilization
Service CPU Utilization
CPU usage of mod-data-export spiked to 24%.
Service Memory Utilization
All services show stable memory trends except mod-data-export, which used more memory as the data volume increased.
DB CPU Utilization
DB CPU utilization during data export peaked at ~30% under the higher loads (100k/500k), while the smaller loads (1k/100k) caused moderate spikes (~15–20%).
DB Connections
The number of DB connections was 1406.
Kafka metrics
OpenSearch Data Nodes metrics
DB load
Top SQL-queries
Top applications
Appendix
Infrastructure
PTF - environment Sunflower NON-ECS (secp1)
secp1: 7 r7g.2xlarge EC2 instances located in US East (N. Virginia), us-east-1
1 db.r7.xlarge database instance: Writer instance
MSK fse-test
4 kafka.m7g.xlarge brokers in 2 zones (2 brokers per zone)
Apache Kafka version 3.7.x, metadata mode - KRaft
EBS storage volume per broker 300 GiB
auto.create.topics.enable=true
log.retention.minutes=480
default.replication.factor=3
revision - 26
OpenSearch 2.13 ptf-test cluster
r7g.2xlarge.search 4 data nodes
r6g.large.search 3 dedicated master nodes