Skip to end of banner
Go to start of banner

SPIKE: MODKBEKBJ-255 - Harden process to load holdings

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

MODKBEKBJ-255 - Getting issue details... STATUS

The main goal of the issue is to define additional steps to harden the process of loading holdings entities from RM API.

Here are the points to think about:

1. Create a separate endpoint that gives us the status of the progress of the process.
2. Add retry mechanism in case something fails - at least 3 retries
3. What happens if someone tries to filter resources by tags while the table is being populated/has no entries - we end up making multiple requests to RM API - which can be avoided if we add an updated_at column to our holdings table. Based on when the entry was last updated, we decide whether we want to insert/update the entry or leave it as is - this eliminates the need to truncate the entire holdings table and re-populate increasing performance.
4. What happens if RM API is down? Approach outlined in 3. helps us still retain holdings data albeit stale. 


1. Create a separate endpoint that gives us the status of the progress of the process.

Here is the proposed definition of the endpoint:

"methods": ["GET"],
"pathPattern": "/loadHoldings/status",
"permissionsRequired": ["kb-ebsco.holdings.load.status.get"]

The status enum includes the following values :

  • Not Started - before the first start of loading holdings.
  • Started - the request POST /{custid}/holdings was sent to RM API.
  • In Progress - the loading is in progress. Both GET /{custid}/holdings/status and GET /{custid}/holdings are assumed as one action.
  • Completed - loading is finished and holdings saved in database.
  • Failed - some request failed.

and saved as a class variable. 

If the request returns an error from the RM API, it should be returned to the user???

...

2. Add retry mechanism in case something fails - at least 3 retries

3. What happens if someone tries to filter resources by tags while the table is being populated/has no entries - we end up making multiple requests to RM API - which can be avoided if we add an updated_at column to our holdings table. Based on when the entry was last updated, we decide whether we want to insert/update the entry or leave it as is - this eliminates the need to truncate the entire holdings table and re-populate increasing performance.

Update an existing holdings table to add an additional column updated_at to have a timestamp of the date, that indicates when the table was last modified like following or more optimized way

ADD COLUMN IF NOT EXISTS updated_at TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP;

and update the SQL statement INSERT_OR_UPDATE_HOLDINGS_STATEMENT, currently used for holdings table, to update entry instead of ignoring. 

4. What happens if RM API is down? Approach outlined in 3. helps us still retain holdings data albeit stale. 

Useful links:

  • No labels