Skip to end of banner
Go to start of banner

MODAUD-155 - Performance Improvement in Searching Circulation Log by Barcode

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 8 Next »

Submitted

 

Approved
Status

DRAFT

Impact

LOW

Context

Tracking

TypeTicketModule
Module ticket

Error rendering macro 'jira' : Unable to locate Jira server for this macro. It may be due to Application Link configuration.

mod-audit
Arch ticket

Unable to locate Jira server for this macro. It may be due to Application Link configuration.


Technical details

The circulation log data is persisted in jsonb  format in FOLIO database and itemBarcode  field is part of the nested array items as each circulation log record can contain multiple items. No indexes are present for the table.

Problem Statement

Circulation log search by item barcode slow when table contains millions of rows.

Architecturally Significant Requirements

#RequirementDescriptionValidation ApproachRemarks
1PerformanceSearch on table with <X> records should be done in <Y> seconds 
  1. Prepopulate table with X records
  2. Do search by itemBarcode
  3. Validate timing
TBC

Stakeholders

Solution Options

#TitleDescriptionProsCons
1Define retention policy (use archiving or table partitioning)
  1. Archive the old records (maybe not delete them) in another table or use PostgreSQL partitioning - like 1+ year old record.
  2. Create and add an index for createTimeStamp column. This will restrict the search to most recent actions that are taken on the item and improves the search performance.
  1. Future-proof solution. Will have same response time in case of the same amount of records per time period
  1. In case of unpredicted growth of transactional volume search can be slow
  2. Partitioning is not used in other modules, hence breaks the unity or common approach
  3. Search in longer date period may still be slow
  4. Not quite clear what is the common period for retention
2Add gin index for nested items → itemBarcode field
  1. Create gin index for itemBarcode field (see link)
  1. Improves performance
  2. Requires minimal backend changes
  1. May negatively affect performance of writing due to calculation of index.
  2. Has limitations
3Limit search date range
  1. Make UI from / to fields in search view mandatory
  2. Force maximum date range to one year
  3. If further search is required add controls to go to next or previous period
  1. No backend changes required
  2. Can serve as a workaround 
  1. Limits user experience


Questions

#QuestionAnswerStatus
1

How many records per year clients produce?


OPEN

2

Do clients have any retention policy or regulations on circulation records?


OPEN

Rationale

Option 1 depends on client needs that might be hard to define for the whole community.

Option 2 should be implemented in any case but requires additional performance testing for writing into circulation log.

Option 3 requires minimal effort but has tradeoffs. Can be implemented as quick fix.

Proposed solution is option 2 due to minimal changes on backend and no changes at frontend

Estimation

#TitleEstimation
1Define retention policy (use archiving or table partitioning)
  1. POC - 1 sprint
  2. Implementation -2 sprints
  3. Performance testing - 1 sprint
2Add gin index for nested items → itemBarcode field
  1. Implementation - 1 sprint
  2. Performance testing - 1 spring
3Limit search date range
  1. Implementation - 1 sprint
  2. Performance testing - 1 spring

Decision

<TBD>

  • No labels