Data Import BIB Sunflower CSP1 [ECS]
- 1 Overview
- 2 Summary
- 3 Results
- 3.1 Memory Utilization
- 3.2 CPU Utilization
- 3.3 RDS Metrics
- 3.4 MSK service
- 4 Appendix
Overview
This document contains the results of testing Data Import creates and updates of MARC Bibliographic records with different file sizes (5K, 10K, 25K, 50K, 100K) on the Sunflower CSP1 release [ECS].
Note: Starting from Sunflower, all releases are Eureka.
Ticket: https://folio-org.atlassian.net/browse/PERF-1178
Summary
All data import jobs finished successfully without errors.
Data import durations for creates and updates are mostly the same as in Sunflower GA.
DI duration growth correlates to the number of records imported.
No memory leak is suspected for DI modules.
Service CPU utilization, service memory utilization, and DB CPU utilization show the same trends and values as in the R release.
During testing, we discovered a slight dependency on mod-orders. With the module disabled, we see better performance. A ticket has been created for the Thunderjet team for a more detailed analysis of the issue: https://folio-org.atlassian.net/browse/MODORDSTOR-483
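The effect of disabling orders can be quantified from the two create runs in the Results table that were measured both ways on Sunflower CSP1 (50k and 100k). The sketch below is illustrative only: the helper names `to_seconds` and `reduction_pct` are hypothetical, and the durations are transcribed by hand from the table.

```python
import re

# Seconds per unit, for the duration format used in the Results table.
UNIT_SECONDS = {"hr": 3600, "min": 60, "s": 1}

def to_seconds(text):
    """Convert a duration such as '1 hr 22 min' or '7 min 16s' to seconds."""
    parts = re.findall(r"(\d+)\s*(hr|min|s)\b", text)
    return sum(int(value) * UNIT_SECONDS[unit] for value, unit in parts)

def reduction_pct(enabled, disabled):
    """Percentage by which the duration shrinks when orders is disabled."""
    on, off = to_seconds(enabled), to_seconds(disabled)
    return 100 * (on - off) / on

# (orders enabled, orders disabled) on Sunflower CSP1, MARC BIB Create.
runs = {
    "50k": ("26 min 20 s", "18 min 05 s"),
    "100k": ("54 min 8 s", "39 min 51 s"),
}

for size, (enabled, disabled) in runs.items():
    pct = reduction_pct(enabled, disabled)
    print(f"{size} create: {pct:.1f}% shorter with orders disabled")
```

For these two runs the reduction works out to roughly 31% (50k) and 26% (100k).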
Results
| Test # | Data-import test | Profile | Duration, Sunflower CSP1 (orders enabled) | Duration, Sunflower CSP1 (orders disabled) | Duration, Sunflower (orders enabled) | Duration, Sunflower (orders-storage disabled) | Results |
|---|---|---|---|---|---|---|---|
| 1 | 5k MARC BIB Create | PTF - Create 2 | 3 min 3 s | - | 2 min 32 s | 1 min 26 s | Completed |
| 2 | 10k MARC BIB Create | PTF - Create 2 | 5 min 32 s | - | 4 min 46 s | 2 min 50 s | Completed |
| 3 | 25k MARC BIB Create | PTF - Create 2 | 12 min 25 s | - | 11 min 3 s | 6 min 48 s | Completed |
| 4 | 50k MARC BIB Create | PTF - Create 2 | 26 min 20 s | 18 min 5 s | 20 min 44 s | 13 min 37 s | Completed |
| 5 | 100k MARC BIB Create | PTF - Create 2 | 54 min 8 s | 39 min 51 s | 46 min 24 s | 30 min 3 s | Completed |
| 6 | 5k MARC BIB Update | PTF - Updates Success - 6 | 2 min 44 s | - | - | 3 min 53 s | Completed |
| 7 | 10k MARC BIB Update | PTF - Updates Success - 6 | 5 min 33 s | - | - | 7 min 16 s | Completed |
| 8 | 25k MARC BIB Update | PTF - Updates Success - 6 | 12 min 51 s | - | - | 18 min 38 s | Completed |
| 9 | 50k MARC BIB Update | PTF - Updates Success - 6 | 25 min 32 s | - | - | 37 min 55 s | Completed |
| 10 | 100k MARC BIB Update | PTF - Updates Success - 6 | 54 min 41 s | - | - | 1 hr 22 min | Completed |
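One way to check that duration grows roughly in proportion to the number of records is to compare throughput across file sizes. The following is a minimal sketch, not part of the test tooling: the `create_runs` mapping and the `throughput` helper are hypothetical names, and the durations are transcribed by hand from the Sunflower CSP1 create runs with orders enabled.

```python
# Create durations in seconds for Sunflower CSP1 with orders enabled,
# keyed by the number of MARC BIB records in the imported file.
create_runs = {
    5_000: 3 * 60 + 3,
    10_000: 5 * 60 + 32,
    25_000: 12 * 60 + 25,
    50_000: 26 * 60 + 20,
    100_000: 54 * 60 + 8,
}

def throughput(records, seconds):
    """Average number of records imported per second."""
    return records / seconds

rates = {size: throughput(size, secs) for size, secs in create_runs.items()}

for size, rate in rates.items():
    print(f"{size:>7} records: {rate:.1f} records/s")
```

Across these runs the rate stays between about 27 and 34 records per second, which is consistent with near-linear growth of duration with file size.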
Memory Utilization
Memory utilization shows a stable trend, without spikes or drops. No signs of a memory leak.
CPU Utilization
RDS Metrics
MSK service
Appendix
Infrastructure
PTF - environment secon
8 r7g.2xlarge EC2 instances located in US East (N. Virginia), us-east-1
db.r7g.xlarge database instances, writer
MSK fse-test
4 kafka.m7g.xlarge brokers in 2 zones
Apache Kafka version 3.7.x (KRaft mode)
EBS storage volume per broker 300 GiB
auto.create.topics.enable=true
log.retention.minutes=480
default.replication.factor=3