Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

...

Ticket: 

Jira Legacy
serverSystem Jira
serverId01505d01-b853-3c2e-90f1-ee9b165564fc
keyPERF-756


Summary

  • There is significant performance improvement for in data import performance in Poppy with Poppy using the file splitting feature compared with to Orchid (40% for DI Create, 25% for DI Update) but . However, there is a small degradation (up to 5%) compared to Poppy without the file splitting feature when running with CICO. CO response times are almost the same as identical to Poppy without the file splitting feature. The CI response time is 20% slower with and without Data Import.
  • Average CPU utilization did not exceed 150% for all the modules. The highest consumption was observed from mod-inventory. It was growing from 110% up to 250% at the end of the test (So as memory grows too, we can suspect the issue
    Jira Legacy
    serverSystem Jira
    serverId01505d01-b853-3c2e-90f1-ee9b165564fc
    keyMODINV-944
    . It is fixed in version 20.1.9 but this test was run on version 20.1.7 of mod-inventory). 
    Spikes of mod-data-import were observed in Data Import jobs with 50k files up to 130%. for jobs and a 250% spike for 100k. For Data Import jobs CPU utilization didn't exceed 110% for all other modules
  • Memory utilization increase is a result of previous modules restarting (everyday cluster shutdown process). Memory consumption before tests for mod-search was 45% and for mod-inventory - 55%. During the test with 100k files mod-search memory utilization increases to 90% and mod-inventory up to 100%.
  • Average DB CPU usage during data import is about 95% The same results compared to 95% which is consistent with the performance observed during the same tests in Orchid.
  • Average The average connection count during data import is about approximately 600 connections for Create jobs, which is two times higher than without twice as high as when the file splitting feature is disabled. For Update jobs - 560 connections, the connection count is 560.

Test Runs 

Comment

Test #

Scenario

Load level
1DI MARC Bib Create5K, 10K, 25K, 50K, 100K consequentially (with 5 min pause)
CICO 8 users
2DI MARC Bib Update5K, 10K, 25K, 50K, 100K consequentially (with 5 min pause)
CICO 8 users

...

DI MARC BIB Create + CICO

Image RemovedImage Added

DI Bib Update + CICO

Image RemovedImage Added

Service CPU Utilization

...

Spikes of mod-data-import were observed in Data Import jobs with 50k files up to 130%. for jobs and 250% spike for 100k. For Data Import jobs with 5k, 10k, 25k files CPU utilization didn't exceed 110%


Image RemovedImage Added

Service Memory Utilization

...

Memory consumption before tests for mod-search was 45% and for mod-inventory - 55%. During test with 100k file mod-search grew up to 90% and mod-inventory up to 100%.

Image RemovedImage Added



DB CPU Utilization

Average DB CPU usage during data import is about 95% The same results if to compare with the same tests in Orchid.

Image RemovedImage Added



DB Connections

Average connection count during data import is about 600 connections for create jobs that is two times higher than without file splitting feature. For update jobs - 560 connections

Image RemovedImage Added

DB load

Image RemovedImage Added

Appendix

Infrastructure

PTF -environment pcp1

  • 10 m6i.2xlarge EC2 instances located in US East (N. Virginia)us-east-1
  • 2 1 database  instancesinstance, writer/reader


    NameMemory GIBvCPUsmax_connections

    db.r6g.xlarge

    32 GiB4 vCPUs2731


  • MSK cluster - tenant
    • 4 m5.2xlarge brokers in 2 zones
    • Apache Kafka version 2.8.0

    • EBS storage volume per broker 300 GiB

    • auto.create.topics.enable=true
    • log.retention.minutes=480
    • default.replication.factor=3


ModuleTask Def. RevisionModule VersionTask CountMem Hard LimitMem Soft limitCPU unitsXmxMetaspaceSizeMaxMetaspaceSizeR/W split enabled
pcp1-pvt
mod-remote-storage10(11)*3.0.024920447210243960512512FALSE
mod-data-import18(20)*3.0.71204818442561292384512FALSE
mod-authtoken13(16)*2.14.121440115251292288128FALSE
mod-configuration9(10)*5.9.22102489612876888128FALSE
mod-users-bl9(10)*7.6.021440115251292288128FALSE
mod-inventory-storage12(15)*27.0.3(27.0.4)*24096369020483076384512FALSE
mod-circulation-storage12(14)*17.1.3(17.1.7)*22880259215361814384512FALSE
mod-source-record-storage15(18)*5.7.3(5.7.5)*25600500020483500384512FALSE
mod-inventory11(14)*20.1.3(20.1.7)*22880259210241814384512FALSE
mod-di-converter-storage15(18)*2.1.2(2.1.5)*2102489612876888128FALSE
mod-circulation12(14)*24.0.8(24.0.11)*22880259215361814384512FALSE
mod-pubsub11(13)*2.11.2(2.11.3)*2153614401024922384512FALSE
mod-patron-blocks9(10)*1.9.021024896102476888128FALSE
mod-source-record-manager14(17)*3.7.4(3.7.8)*25600500020483500384512FALSE
mod-quick-marc9(11)*5.0.0(5.0.1)*1228821761281664384512FALSE
nginx-okapi92023.06.1421024896128000FALSE
okapi-b115.1.23168414401024922384512FALSE
mod-feesfines10(11)*19.0.02102489612876888128FALSE
pub-okapi92023.06.142102489612876800FALSE

...

Grafana dashboard:

MARC Bib Create + CICO

MARC Bib Update + CICOhttp://carrier-io.int.folio.ebsco.com/grafana/d/elIt9zCnz/jmeter-performance-with-average-latency?orgId=1&from=1710766047451&to=1710775415290&var-percentile=95&var-test_type=fix_load&var-test=circulation_checkInCheckOut_orchid&var-env=int&var-grouping=1s&var-low_limit=250&var-high_limit=750&var-db_name=jmeter&var-sampler_type=All&var-Request=All