Skip to end of banner
Go to start of banner

[Poppy] List App with multiple tenants and R/W split enabled

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 16 Next »

Overview

This document contains the results of testing List App refreshing of 200k records on multiple tenants with R/W split enabled (Poppy release). The goal of testing is to assess the performance of mod-lists with load spread across multiple tenants.

Ticket:

PERF-665 - Getting issue details... STATUS

Summary

  • Load tests showed that ListApp refreshing duration is
    • 1.5 min for 3 users test (1 user on each of three tenants);
    • 2.3 min for 10 users test (3-4 user on each of three tenants).
  • Load test for 30 users (10 users per tenant) failed due to DB overload.
  • Load tests showed that ListApp refreshing duration increases 2-3 times when conducted with the same user number but on 1 tenant instead of 3 tenants. For example, during 3 users test duration increased from 1.5 min to 2.8 min, for 10 users - from 2.3 min to 6.1 min.
  • During the 10 users test CPU utilization reached 200% for mod-fqm-manager and 111% for mod-lists. Also, mod-permissions CPU utilization exceeded 100% during 30 users test.
  • Maximum DB CPU utilization reached 83% (writer instance) and 99% (reader instance) during 30 user test. 
  • Memory utilization for mod-permissions increased from 48% to 76% during the tests. No memory leak is suspected for all the modules.

Test runs

Query used in lists - "Item status != Available". List refresh result is about 200K records.

Scenario

Data quantity

List App 3 users refresh

multiple tenants

tenant 1 - 1 user

tenant 2 - 1 user

tenants 3 - 1 user

List App 3 users refresh

single tenant*

tenant 1 - 3 users

List App 10 users refresh

multiple tenants

tenant 1 - 3 users

tenant 2 - 3 users

tenants 3 - 4 users

List App 10 users refresh

single tenant*

tenant 1 - 10 users

List App 30 users refresh

multiple tenants

tenant 1 - 10 users

tenant 2 - 10 users

tenants 3 - 10 users

*Additional tests run to analyze dependency between tenant quantity and ListApp refresh duration

Results

TransactionDuration, avgReleaseTenantsNumber of usersR/W splitOther conditions

ListApp refresh

previous test results*

10 min 40 sec

[Orchid]1 tenant10 usersdisabled
8.5 min[Poppy]1 tenant10 usersdisabled
17.7 min[Poppy]1 tenant10 usersdisabledTesting in parallel with DI and CICO
ListApp refresh

current test results**
1.5 min[Poppy]3 tenants3 usersenabled
2.8 min[Poppy]1 tenant3 usersenabled
2.3 min[Poppy]3 tenants10 usersenabled
6.1 min[Poppy]1 tenant10 usersenabled
Server error[Poppy]3 tenants30 usersenabled

* Query used in lists - "Item status == Checked out". List refresh result is 200K records.

**Query used in lists - "Item status != Available". List refresh result is about 200K records.

Instance CPU Utilization

*results for test with 3-30 users, 3 tenants, R/W split enabled

Service CPU Utilization

During the 10 users test CPU utilization reached 200% for mod-fqm-manager and 111% for mod-lists. Also, mod-permissions CPU utilization exceeded 100% during 30 users test.

*results for test with 3-30 users, 3 tenants, R/W split enabled

Memory Utilization

Memory utilization for mod-permissions increased from 48% to 76% during the tests. No memory leak is suspected for all the modules.

*results for test with 3-30 users, 3 tenants, R/W split enabled

DB CPU Utilization

Maximum DB CPU utilization reached 83% (writer instance) and 99% (reader instance) during 30 user test. 

*results for test with 3-30 users, 3 tenants, R/W split enabled

DB Connections

*results for test with 3-30 users, 3 tenants, R/W split enabled

DB Load

Writer DB node

Reader DB node

*results for test with 3-30 users, 3 tenants, R/W split enabled

TOP SQL

Writer DB node

Reader DB node

Long-running queries: 

select id from [tenant]_mod_fqm_manager.drv_item_callnumber_location where lower(cast(item_status as varchar)) <> lower($1)
parameters: $1 = 'Available'


delete from list_contents where list_id=$1 and refresh_id=$2

Appendix

Infrastructure

PTF -environment pcp1

  • 10 m6i.2xlarge EC2 instances located in US East (N. Virginia)us-east-1
  • 1 database  instance, writer

    NameAPI NameMemory GIBvCPUsmax_connections
    R6G Extra Largedb.r6g.xlarge32 GiB4 vCPUs2731
  • MSK tenant
    • 4 m5.2xlarge brokers in 2 zones
    • Apache Kafka version 2.8.0

    • EBS storage volume per broker 300 GiB

    • auto.create.topics.enable=true
    • log.retention.minutes=480
    • default.replication.factor=3

Module
pcp1-pvt
Fri Oct 27 08:26:47 UTC 2023
Task Def. RevisionTask CountMem Hard LimitMem Soft limitCPU unitsXmxMetaspaceSizeMaxMetaspaceSizeR/W split enabled
mod-inventory-storage:27.0.01024096369020483076384512false
mod-users:19.2.0192102489612876888128false
nginx-okapi:2023.06.14821024896128000false
mod-circulation-storage:17.1.01022880259215361814384512false
okapi:5.1.193168414401024922384512false
mod-inventory:20.1.0922880259210241814384512false
mod-circulation:24.0.01022880259215361814384512false
pub-okapi:2023.06.1482102489612876800false
mod-fqm-manager:1.0.052102489612876888128true
mod-lists:1.0.052300026001282048384512false

Methodology

  1. Enable R/W split for mod-fqm-manager.
  2. Create 10 lists with the query "Item status != Available" on each of three tenants to be able to run a test for up to 30 concurrent users.
  3. Prepare 200K item records for the query to return.
  4. Conduct tests with JMeter script for multiple tenants.


  • No labels