Create script to standardize OCLC identifier values and remove duplicates

Description

Create script to identify OCLC identifiers by identifier type and normalize values as following:

  • Retain (OCoLC)

    • If prefix of 'ocn' or 'ocm' appear, they should be removed

    • If leading zeros appear, they should be removed

    • EXAMPLE: value of OCLC identifier - (OCoLC)ocm123456 should result in (OCoLC)123456

  • If duplicates exist after normalization, they should be de-duplicated so that only one of the normalized identifier remain.

  • If there's any prefix values that are NOT 'ocm' or 'ocn', the original prefix should remain.

    • Example: OCLC identifier - (OCoLC)tfe0000501056183 should result in (OCoLC)tfe501056183

See more examples at MODINV-949

Environment

None

Potential Workaround

None

Checklist

hide

TestRail: Results

Activity

Show:

Details

Assignee

Reporter

Priority

Development Team

Folijet

Release

Not Scheduled

TestRail: Cases

Open TestRail: Cases

TestRail: Runs

Open TestRail: Runs
Created January 23, 2024 at 1:22 PM
Updated June 27, 2024 at 10:40 AM
TestRail: Cases
TestRail: Runs