Data Import MARC Authorities [Orchid]
Overview
This document contains the results of testing Data Import MARC Authorities in the Orchid release to detect performance trends.
Ticket: PERF-444
Previous test report: Data Import MARC Authorities (Nolana)
Summary
- All tests completed successfully, without errors or issues.
- There is no consistent trend in DI performance and resource utilization for the Orchid release compared to Nolana. Processing time degraded for smaller files (1K to 10K, by 8% to 51%) and improved for larger files (25K and 50K, by up to 19%). See Test Results.
- The maximum service CPU utilization among DI modules was observed for the mod-inventory module, which reached 43% during the 50K DI import. See Service CPU Utilization.
- The maximum DB CPU utilization reached 86% during the 50K DI import. No memory leaks were found. See DB CPU Utilization and Memory Utilization.
Note that the trigger process_marc_records_lb_insert_update_trigger was enabled on the mod_source_record_storage.marc_records_lb table to make the environment production-like. This could have influenced the results for the current release.
Test Results
Profile used for testing: "Default - Create SRS MARC Authority"
Test | File | Duration (Morning Glory) | Duration (Nolana) | Duration (Orchid) | Diff absolute | Diff percentage |
---|---|---|---|---|---|---|
1 | 1K | 24 s | 27 s | 41 s | +14 s | +51% |
2 | 5K | 1 min 21 s | 1 min 15 s | 1 min 21 s | +6 s | +8% |
3 | 10K | 2 min 32 s | 2 min 31 s | 2 min 53 s | +22 s | +14% |
4 | 25K | 11 min 14 s | 7 min 7 s | 5 min 42 s | -1 min 25 s | -19% |
5 | 50K | 22 min | 11 min 24 s | 11 min 11 s | -13 s | -1% |
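The diff columns compare Orchid against Nolana. The following is a minimal sketch of how they can be recomputed from the durations in the table, converted to seconds. The function name `diff` and the `DURATIONS` mapping are illustrative, not part of any test tooling. The percentages in the table appear to be truncated toward zero rather than rounded (for example, 14 s over 27 s is about 51.9%, reported as +51%), so the sketch uses integer truncation to reproduce them.

```python
# Durations from the table above as (Nolana, Orchid), in seconds.
DURATIONS = {
    "1K": (27, 41),
    "5K": (75, 81),
    "10K": (151, 173),
    "25K": (427, 342),
    "50K": (684, 671),
}


def diff(nolana_s, orchid_s):
    """Return (absolute diff in seconds, percentage truncated toward zero)."""
    delta = orchid_s - nolana_s
    sign = -1 if delta < 0 else 1
    # Integer arithmetic avoids floating-point surprises; truncation is an
    # assumption chosen because it matches the reported percentages.
    pct = sign * (abs(delta) * 100 // nolana_s)
    return delta, pct


for size, (nolana_s, orchid_s) in DURATIONS.items():
    delta, pct = diff(nolana_s, orchid_s)
    print(f"{size}: {delta:+d} s ({pct:+d}%)")
```

With conventional rounding instead of truncation, the 25K and 50K improvements would read as roughly 20% and 2%.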
Instance CPU Utilization
Service CPU Utilization
The maximum CPU utilization among DI modules was observed for the mod-inventory module, which reached 43% during the 50K DI import.
The maximum CPU utilization among non-DI modules was observed for the mod-quick-marc module, which reached 53% during the 50K DI import.
Memory Utilization
DB CPU Utilization
The maximum DB CPU utilization reached 86% during the 50K DI import.
DB Connections
Infrastructure
Records count:
- mod_source_record_storage.marc_records_lb = 25015778
- mod_source_record_storage.raw_records_lb = 25048647
- mod_source_record_storage.records_lb = 25048897
- mod_source_record_storage.marc_indexers = 283585109 (all records)
- mod_source_record_storage.marc_indexers with field_no 010 = 219260
- mod_source_record_storage.marc_indexers with field_no 035 = 10165466
- mod_inventory_storage.authority = 7493975
- mod_inventory_storage.holdings_record = 23900853
- mod_inventory_storage.instance = 22617024
- mod_inventory_storage.item = 23801604
PTF environment ncp5
- 11 m6i.2xlarge EC2 instances located in US East (N. Virginia), us-east-1
- 2 database instances, one reader and one writer
Name | API Name | Memory GiB | vCPUs | max_connections |
---|---|---|---|---|
R6G Extra Large | db.r6g.xlarge | 32 GiB | 4 vCPUs | 2731 |
- Number of connections for mod-source-record-manager and mod-source-record-storage: 30 connections.
- Profile used: Default - Create SRS MARC Authority
- MSK ptf-kakfa-3
- 4 m5.2xlarge brokers in 2 zones
- Apache Kafka version 2.8.0
- EBS storage volume per broker 300 GiB
- auto.create.topics.enable=true
- log.retention.minutes=480
- default.replication.factor=3
- Kafka topics partitioning: 2 partitions for DI topics
Modules memory and CPU parameters
Modules | Version | Task Definition | Running Tasks | CPU | Memory | MemoryReservation | MaxMetaspaceSize | Xmx |
---|---|---|---|---|---|---|---|---|
mod-inventory-storage | 26.0.0 | 12 | 2 | 2048 | 4096 | 3690 | 512 | 3076 |
mod-inventory | 20.0.6 | 12 | 2 | 1024 | 2880 | 2592 | 512 | 1814 |
mod-source-record-storage | 5.6.7 | 27 | 2 | 2048 | 5600 | 5000 | 512 | 3500 |
mod-quick-marc | 3.0.0 | 6 | 1 | 128 | 2288 | 2176 | 512 | 1664 |
mod-source-record-manager | 3.6.4 | 18 | 2 | 2048 | 5600 | 5000 | 512 | 3500 |
mod-di-converter-storage | 2.0.5 | 6 | 2 | 128 | 1024 | 896 | 128 | 768 |
mod-data-import | 2.7.1 | 10 | 1 | 256 | 2048 | 1844 | 512 | 1292 |
okapi | 5.0.1 | 8 | 3 | 1024 | 1684 | 1440 | 512 | 922 |
nginx-okapi | 2022.06.14 | 7 | 2 | 128 | 1024 | 896 | - | - |
pub-okapi | 2022.06.14 | 7 | 2 | 128 | 1024 | 896 | - | 768 |
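As a sanity check on the table above, the sketch below computes how much of each container's Memory limit remains after the configured Xmx and MaxMetaspaceSize ceilings. It assumes all values are in MiB and that Memory is the container hard limit; neither is stated in this report. The names `MODULES` and `jvm_headroom` are illustrative. nginx-okapi and pub-okapi are omitted because their JVM columns are incomplete.

```python
# (module, Memory, MaxMetaspaceSize, Xmx) copied from the table above.
# Units are assumed to be MiB; the report does not state them.
MODULES = [
    ("mod-inventory-storage", 4096, 512, 3076),
    ("mod-inventory", 2880, 512, 1814),
    ("mod-source-record-storage", 5600, 512, 3500),
    ("mod-quick-marc", 2288, 512, 1664),
    ("mod-source-record-manager", 5600, 512, 3500),
    ("mod-di-converter-storage", 1024, 128, 768),
    ("mod-data-import", 2048, 512, 1292),
    ("okapi", 1684, 512, 922),
]


def jvm_headroom(memory, max_metaspace, xmx):
    """Memory limit minus the heap and metaspace ceilings."""
    return memory - (xmx + max_metaspace)


# List modules from the smallest headroom to the largest.
for name, memory, metaspace, xmx in sorted(
    MODULES, key=lambda m: jvm_headroom(m[1], m[2], m[3])
):
    print(f"{name}: {jvm_headroom(memory, metaspace, xmx)} left of {memory}")
```

The remaining headroom has to cover JVM memory outside the heap and metaspace, such as thread stacks, the code cache, and direct buffers, so modules with a small value (mod-quick-marc and mod-di-converter-storage here) have the least margin under this reading.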