Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents
outlinetrue

Overview

  • The purpose of the document is to to identify how ECS-related changes are affecting reindexing on multi tenant environment. Calculate reindex time and size of index.

Recommendations & Jiras

Test Summary

Reindexing on 3 tenants in parallel takes the same time as reindexing on main tenant only. If to start in sequential order from secondary tenants it may take more time for main tenant afterwards (25 minutes more). The reason may be less resource utilization from CPU side (mod-search-44% against 38% and mod-inventory-storage - 20% against 19%).

Comparing interim results of indexing size it was found that the major time during reindexing was taken by contributor and instance_subject indexing (at least 7 hours out of 9 hours 45 min).

Test Runs /Results

Test #

Instances number

Test Conditions

reindexing on Poppy release

Duration 

Notes


1. 2023_10_12 08:47 - 18:32 UTC


In parallel: 3 tenants

9 hours 45 min
  • mod-search task count =8
  • mod-inventory-storage task count = 2
  • mod-okapi task count = 3
  • open search instance scaled up to r6g.4xlarge.search
  • without configuration of number_of_replicas and refresh_interval values of ES/OpenSearch

2. 2023-10-13 09:04 - 09:07 UTC

100032

In sequential: fs09000002

3 min

3. 2023-10-13 09:15 - 09:18 UTC

100055

In sequential: fs09000003

3 min
4. 2023-10-13 09:29 - 19:37 UTC10,733,729In sequential: fs0900000010 hours 8 min

Indexing size

Test #1 Here I would like to share interim results during reindexing on 3 tenants. In 30 minutes after start we observe that all instances on secondary tenants have been already indexed except of ocp2_contributor_fs09000002 without 1 record. In 2 hours after the start we observe that docs.count column for instances on main tenant has all records but reindexing goes on with previously mentioned contributor_fs09000000 and instance_subject_fs09000000.

...

fs09000002







healthstatusindexuuidprirepdocs.countdocs.deletedstore.sizepri.store.size
greenopenocp2_instance_fs09000002Pd2I7_Q8Rj2bNgSP4vSzIg421000320579mb190.3mb
greenopenocp2_authority_fs09000002vFMI3x8bTuqp5HkAL3jk8A42002.4kb832b
greenopenocp2_instance_subject_fs090000023vfuyqMBRlGtmblChed_LA42945413.9kb136.5kb
greenopenocp2_contributor_fs09000002xgXcKSDWR3W2IYC6JiFqSA423691238696.8mb165.4mb
Test #3







fs09000003







healthstatusindexuuidprirepdocs.countdocs.deletedstore.sizepri.store.size
greenopenocp2_instance_fs09000003YjbGBqXpRG6ap03WlRFokw421000550579.8mb190.8mb
greenopenocp2_authority_fs09000003RGz9CTCoT7CoT3H3ge5gIA42002.4kb832b
greenopenocp2_contributor_fs09000003O4SdQSXqT_y2UapCuer6GQ4237261853mb259.1mb
greenopenocp2_instance_subject_fs090000032P6hQELJTWKYqvJ5l-5jTw42986413.6kb136.3kb
Test #4







fs09000000







healthstatusindexuuidprirepdocs.countdocs.deletedstore.sizepri.store.size
greenopenocp2_instance_fs09000000VIviDunkSkinkiZJuMjKUQ42107337290119.6gb39.9gb
greenopenocp2_authority_fs09000000YCS4y0GFTfy3EpbJm99X0g42002.4kb832b
greenopenocp2_contributor_fs090000002kdpF4fZTaGtFmG_ypXq6A42466665961250312.2gb4.1gb
greenopenocp2_instance_subject_fs09000000q3wktVUgTsmElTq6Iv-p2A42409701165883917.8gb6.1gb

Memory Usage

Test #1

mod-search max. 70% during an hour, 60% - during second hour

...

Test #Module

Initial spike

1

mod-search70%
mod-inventory-storage28%
2mod-search50%
mod-inventory-storage26%
3mod-search50%
mod-inventory-storage26%
4

mod-search70%
mod-inventory-storage28%

CPU Utilization 

Test #1

mod-search - 44%

mod-inventory-storage - 20%

...

Test #Module

Initial spike

1

mod-search44%
mod-inventory-storage20%
2mod-search10%
mod-inventory-storage10%
3mod-search10%
mod-inventory-storage10%
4

mod-search38%
mod-inventory-storage19%

RDS CPU Utilization 

Test #1

Utilization max. - 72%

...

Test #4 main tenant

Utilization max. - 65%

Open Search KPIs

CPU utilization

Test #1

Max. 89%

...

Test #2, 3 secondary tenants

Test #4 main tenant

Appendix

Infrastructure

PTF -environment ocp2

  • 10 m6i.2xlarge EC2 instances located in US East (N. Virginia)us-east-1 
  • 2 instances of db.r6g.xlarge database instances, one reader, and one writer 
  • MSK ptf-kakfa-3
    • 4 m5.2xlarge brokers in 2 zones
    • Apache Kafka version 2.8.0

    • EBS storage volume per broker 300 GiB

    • auto.create.topics.enable=true
    • log.retention.minutes=480
    • default.replication.factor=3

...

Modules

Version

Task Definition

Running Tasks 

CPU

Memory

MemoryReservation

MaxMetaspaceSize

Xmx

mod-search Poppy3.0.0-SNAPSHOT.156682048259224805121440
mod-inventory-storage Poppy26.1.0-SNAPSHOT.6851122048409636905123076
mod-search Orchid 
2.1.0-SNAPSHOT.108
384002592248010241440
mod-inventory-storage Orchid26.1.0-SNAPSHOT.644321024195222085121440


Methodology/Approach

  • Use PTF's "Bugfest" Poppy cluster, which has 10M records, to test with (2 times)
  • Configure the environment in accordance with Infrastructure parameters  to the one that FSE commonly uses
  • Reindex on the new Poppy environment, get the results for indexing time, index size
  • Create a comparison of results between sequential order and parallel start.

...