Overview

...

The slowest query for the failed job on the third tenant during the 10K DI Create job was SELECT jsonb,id FROM fs07000002_mod_inventory_storage.instance_holdings_item_view, with an average latency of 386455.99 ms/call (roughly 6.4 minutes per call).

Test Runs 

| Test # | Scenario | Load level |
|---|---|---|
| 1 - Concurrent Create imports | DI MARC Bib Create | 10K, 25K concurrently (with 5 min pause) on 2 and 3 tenants |
| 2 - Concurrent Update imports | DI MARC Bib Update | 10K, 25K concurrently (with 5 min pause) on 2 and 3 tenants |
| 3 - Concurrent Create imports ("smoke test") of 50K | DI MARC Bib Create | 50K concurrently on 3 tenants |

...

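| Job type | File size | # of test | Number of concurrent jobs | Main tenant (fs09000000) | Second tenant (fs07000001) | Third tenant (fs07000002) |
|---|---|---|---|---|---|---|
| DI Create | 10K | Baseline | 1 | 00:04:56 | | |
| | | 1 | 2 | 00:10:43 | 00:10:37 | |
| | | 2 | 3 | 00:21:12 | 00:21:06 | 00:20:57 * |
| | 25K | Baseline | 1 | 00:11:24 | | |
| | | 3 | 2 | 00:23:44 | 00:23:30 | |
| | | 4 | 3 | 00:37:11 | 00:37:05 | 00:36:58 |
| DI Update | 10K | Baseline | 1 | 00:06:32 | | |
| | | 5 | 2 | 00:09:47 | 00:11:26 | |
| | | 6 | 3 | 00:19:08 | 00:19:06 | 00:18:31 |
| | 25K | Baseline | 1 | 00:15:13 | | |
| | | 7 | 2 | 00:30:49 | 00:30:52 | |
| | | 8 | 3 | 00:47:47 | 00:48:17 | 00:47:54 |
| DI Create (Smoke test) | 50K | 9 | 1 | 00:22:31 | | |
| | | 10 | 3 | 01:12:54 | 01:12:44 | 01:12:35 |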
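
To make the effect of concurrency easier to read, the durations can be expressed as a multiple of the single-job baseline. The snippet below is a minimal sketch of that arithmetic, not part of the test tooling: the helper names `parse_duration` and `slowdown` are hypothetical, and the input values are the 10K DI Create durations reported in the results.

```python
from datetime import timedelta


def parse_duration(text: str) -> timedelta:
    """Parse an HH:MM:SS string into a timedelta."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def slowdown(baseline: str, concurrent: list) -> float:
    """Ratio of the slowest concurrent job to the single-job baseline."""
    base = parse_duration(baseline).total_seconds()
    worst = max(parse_duration(d).total_seconds() for d in concurrent)
    return worst / base


# Durations copied from the results (DI Create, 10K).
create_10k_baseline = "00:04:56"
create_10k_two_tenants = ["00:10:43", "00:10:37"]
create_10k_three_tenants = ["00:21:12", "00:21:06", "00:20:57"]

ratio_two = slowdown(create_10k_baseline, create_10k_two_tenants)
ratio_three = slowdown(create_10k_baseline, create_10k_three_tenants)

print(f"2 tenants: {ratio_two:.2f}x baseline")
print(f"3 tenants: {ratio_three:.2f}x baseline")
```

For the 10K DI Create runs this gives about 2.17x the baseline with two tenants and about 4.30x with three. The same helpers can be applied to the other rows by substituting their durations.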

...

Infrastructure

PTF - environment pcp1

  • 10 m6i.2xlarge EC2 instances located in US East (N. Virginia), us-east-1
  • 2 database instances, writer/reader


    | Name | Memory GiB | vCPUs | max_connections |
    |---|---|---|---|
    | db.r6g.xlarge | 32 GiB | 4 vCPUs | 2731 |


  • DB fs09000000
    • instances 
  • MSK tenant
    • 4 m5.2xlarge brokers in 2 zones
    • Apache Kafka version 2.8.0

    • EBS storage volume per broker 300 GiB

    • auto.create.topics.enable=true
    • log.retention.minutes=480
    • default.replication.factor=3

...

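pcp1-pvt

| Module | Task Def. Revision | Module Version | Task Count | Mem Hard Limit | Mem Soft Limit | CPU units | Xmx | MetaspaceSize | MaxMetaspaceSize | R/W split enabled |
|---|---|---|---|---|---|---|---|---|---|---|
| mod-remote-storage | 10(11)* | 3.0.0 | 2 | 4920 | 4472 | 1024 | 3960 | 512 | 512 | FALSE |
| mod-data-import | 18(20)* | 3.0.7 | 1 | 2048 | 1844 | 256 | 1292 | 384 | 512 | FALSE |
| mod-authtoken | 13(16)* | 2.14.1 | 2 | 1440 | 1152 | 512 | 922 | 88 | 128 | FALSE |
| mod-configuration | 9(10)* | 5.9.2 | 2 | 1024 | 896 | 128 | 768 | 88 | 128 | FALSE |
| mod-users-bl | 9(10)* | 7.6.0 | 2 | 1440 | 1152 | 512 | 922 | 88 | 128 | FALSE |
| mod-inventory-storage | 12(15)* | 27.0.3(27.0.4)* | 2 | 4096 | 3690 | 2048 | 3076 | 384 | 512 | FALSE |
| mod-circulation-storage | 12(14)* | 17.1.3(17.1.7)* | 2 | 2880 | 2592 | 1536 | 1814 | 384 | 512 | FALSE |
| mod-source-record-storage | 15(18)* | 5.7.3(5.7.5)* | 2 | 5600 | 5000 | 2048 | 3500 | 384 | 512 | FALSE |
| mod-inventory | 11(14)* | 20.1.3(20.1.7)* | 2 | 2880 | 2592 | 1024 | 1814 | 384 | 512 | FALSE |
| mod-di-converter-storage | 15(18)* | 2.1.2(2.1.5)* | 2 | 1024 | 896 | 128 | 768 | 88 | 128 | FALSE |
| mod-circulation | 12(14)* | 24.0.8(24.0.11)* | 2 | 2880 | 2592 | 1536 | 1814 | 384 | 512 | FALSE |
| mod-pubsub | 11(13)* | 2.11.2(2.11.3)* | 2 | 1536 | 1440 | 1024 | 922 | 384 | 512 | FALSE |
| mod-patron-blocks | 9(10)* | 1.9.0 | 2 | 1024 | 896 | 1024 | 768 | 88 | 128 | FALSE |
| mod-source-record-manager | 14(17)* | 3.7.4(3.7.8)* | 2 | 5600 | 5000 | 2048 | 3500 | 384 | 512 | FALSE |
| mod-quick-marc | 9(11)* | 5.0.0(5.0.1)* | 1 | 2288 | 2176 | 128 | 1664 | 384 | 512 | FALSE |
| nginx-okapi | 9 | 2023.06.14 | 2 | 1024 | 896 | 128 | 0 | 0 | 0 | FALSE |
| okapi-b | 11 | 5.1.2 | 3 | 1684 | 1440 | 1024 | 922 | 384 | 512 | FALSE |
| mod-feesfines | 10(11)* | 19.0.0 | 2 | 1024 | 896 | 128 | 768 | 88 | 128 | FALSE |
| pub-okapi | 9 | 2023.06.14 | 2 | 1024 | 896 | 128 | 768 | 0 | 0 | FALSE |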

...


Methodology/Approach

DI tests were started from the UI with 5-minute pauses between tests. First, jobs were run concurrently with 1 job on each of two tenants, starting on fs09000000 and then on fs07000001, for a total of two jobs on two tenants. Then 1 job was run on each of three tenants concurrently, started a few seconds apart: first on tenant fs09000000, then on the second tenant fs07000001, and finally on the third tenant fs07000002.
DI Create jobs were run first, with 10K and then 25K, followed by the DI Update jobs.