Batch Importer (Bib/Acq) (UXPROD-47)

[UXPROD-4590] Data import matching by normalized ISBN
Created: 08/Dec/23
Updated: 06/Feb/24

Status: Open
Project: UX Product
Components: None
Affects versions: None
Fix versions: TBD
Parent: Batch Importer (Bib/Acq)

Type: New Feature
Priority: P2
Reporter: Ann-Marie Breaux (Inactive)
Assignee: Ryan Taylor
Resolution: Unresolved
Votes: 0
Labels: data-import, match-details, needs-t-shirt-size-BE, needs-t-shirt-size-FE
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original estimate: Not Specified

Issue links:
Defines
defines UXPROD-47 Batch Importer (Bib/Acq) Analysis Complete
is defined by MODDICORE-95 MarcRecordReader should normalize isb... Open
is defined by MODSOURCE-254 SPIKE: Review MARC Query work Open
is defined by MODSOURCE-195 MARC-MARC matching for identifier fie... Draft
Relates
relates to UXPROD-2742 MARC-MARC matching enhancements: Narr... In Progress
relates to MODSOURMAN-269 SPIKE: Normalize ISBNs for matching p... Closed
Release: Not Scheduled
Epic Link: Batch Importer (Bib/Acq)
Development Team: Folijet

Description

Current situation or problem:

  • In the SRS MARC Bib 020 field, the ISBN may
    • be the 10-digit or 13-digit version (the 10-digit version is obsolete and can be translated to the 13-digit version; the 10-digit version may also end in the letter X, whereas the 13-digit version is all-numeric)
    • have a qualifier in the same subfield ($a or $z) as the number
    • not calculate properly into a valid ISBN, even if recorded in $a instead of $z
  • In the FOLIO Instance identifier field, the ISBN may
    • be the 10-digit or 13-digit version
    • have any qualifiers in the same field as the number, regardless of whether they are in separate subfields in the underlying MARC
    • not calculate properly into a valid ISBN, even if pulled from the underlying MARC 020 $a
  • All of the above makes ISBN matching in MARC-MARC and MARC-Instance matches less reliable than desired
  • This feature seeks to improve the reliability of these matches
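
For illustration, "calculating properly into a valid ISBN" can be read as passing the standard ISBN check-digit test. The sketch below assumes the number has already been separated from any qualifier and hyphens; the function names are hypothetical and are not taken from any FOLIO module.

```python
import re


def is_valid_isbn10(number: str) -> bool:
    """Return True if a 10-character ISBN passes the modulus-11 check."""
    if not re.fullmatch(r"[0-9]{9}[0-9Xx]", number):
        return False
    # Weights run 10, 9, ..., 1; a final X stands for the value 10.
    total = sum(
        (10 - i) * (10 if ch in "Xx" else int(ch))
        for i, ch in enumerate(number)
    )
    return total % 11 == 0


def is_valid_isbn13(number: str) -> bool:
    """Return True if a 13-digit ISBN passes the modulus-10 check."""
    if not re.fullmatch(r"[0-9]{13}", number):
        return False
    # Weights alternate 1, 3, 1, 3, ... across all 13 digits.
    total = sum(
        int(ch) * (3 if i % 2 else 1)
        for i, ch in enumerate(number)
    )
    return total % 10 == 0
```

A value such as 0306406153 has the right shape but fails the check, which is the kind of 020 $a content the bullets above describe.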

In scope:

  • When matching MARC-MARC or MARC-Instance
    • If the data is coming from MARC 020$a or Instance ISBN field, and is the 10-digit version, then calculate the 13-digit version and use that for matching
      • If the 13-digit version cannot be calculated, then use the existing data for matching
    • If the data is coming from MARC 020$a or Instance ISBN field, and has qualifier information after the number (in the same MARC subfield, or in the Instance ISBN field), then ignore the qualifier or additional information after the number when matching
    • If the data is coming from MARC 020$z or Instance Invalid ISBN field, then use the data as-is for matching purposes
      • Question: should qualifier info in the 020$z or in the Instance Invalid ISBN field be ignored?
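
The in-scope rules above could be sketched roughly as follows. This is a minimal illustration, not the Folijet implementation: the function names and the `from_invalid_field` flag are hypothetical, hyphen removal is an added assumption, a failed ISBN-10 check digit is treated as "cannot be calculated", and the fallback keeps the number without its qualifier. The 020$z / Invalid ISBN branch returns the data untouched, since the qualifier question is still open.

```python
import re
from typing import Optional

# Assumption: the ISBN is the leading run of digits and hyphens,
# optionally ending in X; anything after it is qualifier text.
LEADING_ISBN = re.compile(r"\s*([0-9][0-9-]*[0-9Xx])")


def isbn10_to_isbn13(isbn10: str) -> Optional[str]:
    """Convert a valid ISBN-10 to ISBN-13, or return None if not possible."""
    if not re.fullmatch(r"[0-9]{9}[0-9Xx]", isbn10):
        return None
    total = sum(
        (10 - i) * (10 if ch in "Xx" else int(ch))
        for i, ch in enumerate(isbn10)
    )
    if total % 11 != 0:
        return None  # assumption: a bad check digit means "cannot calculate"
    core = "978" + isbn10[:9]
    weighted = sum(int(ch) * (3 if i % 2 else 1) for i, ch in enumerate(core))
    return core + str((10 - weighted % 10) % 10)


def isbn_match_key(value: str, from_invalid_field: bool = False) -> str:
    """Build the string used for matching from an ISBN field value."""
    if from_invalid_field:
        return value  # 020$z / Invalid ISBN: use the data as-is
    found = LEADING_ISBN.match(value)
    if not found:
        return value  # no recognizable number: fall back to existing data
    number = found.group(1).replace("-", "").upper()
    if len(number) == 10:
        # Fall back to the 10-character number if conversion fails.
        return isbn10_to_isbn13(number) or number
    return number
```

Under these assumptions, "0306406152 (pbk.)" and "978-0-306-40615-7 (hardcover)" both produce the key 9780306406157, so an incoming record with the 10-digit form would match an existing record that carries the 13-digit form.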

Out of scope:

  • What else?

Use case(s):

  • SMEs: Please add examples

Proposed solution/stories:

Links to additional info:

Questions:


Generated at Fri Feb 09 00:41:04 UTC 2024 using Jira 1001.0.0-SNAPSHOT#100246-sha1:7a5c50119eb0633d306e14180817ddef5e80c75d.