2021-12-01 Data Import Subgroup meeting

 Recordings are posted Here

Slack channel for Q&A, discussion between meetings

Additional discussion topics in Subgroup parking lot

Attendees: Ann-Marie Breaux (Deactivated) leeda.adkins@duke.edu  Jennifer Eustis Timothy Watters Christie Thomas Jenn Colt Monica Arnold Nick Cappadona Raegan Wiechert Lloyd Chittenden 

Sequencing of MARC fields in View source:

  • Monica Arnold will send sample to A-M
  • Check what happens to 0XX fields when imported
  • And when updated in quickMARC
  • And during migration (are they being put in numerical order by a process outside of DI? If yes, hat would be concerning for 5xx, 6xx, 7xx fields also
  • Ann-Marie Breaux (Deactivated) will add a Jira and investigate

Kiwi Bugfix work: Done for DI

  • Location match regressions fixed
  • Importing large invoices
  • Data Import profiles with data from multiple EDIFACT fields added to a single Invoice or Invoice line field
  • Still updating PTF stats for size of files and timing of import; should have updated recommendations in the next couple of weeks
  • Fixed the 00X field protection bug

Juniper Hotfix 4 work: Done for DI

  • Importing large invoices

Lotus

  • Lotus Folijet planning: dashboard where you can see the current scope and status of Data Import work for Lotus
    • Matching Inventory records on POL and VRN: work has been finalized 
    • Per Jenn, in addition to 9xx, see if we can match on 035
    • Duke sometimes has two 901s, and they will have different vendor reference numbers; would want to check both 901s when matching
    • Christie: non-OCLC 035s are being matched to the Instance System Control Number
    • Need MARC-MARC matching to do field protection; can't do that if match MARC-Instance
    • Any sample files you would like the devs to use for testing? Will need the preliminary file and final file (since these are updating source = FOLIO Instances and/or SRS MARC Bibs created at point of order)
    • We need better documentation of the DI flow
      • the implicit/explicit
      • also making sure that we finish features that are partially done and displaying in the UI
      • make sure that the 001/999 ff matching is on the matching documentation wiki page
      • make sure there's a clear explanation of what each action does on the actions page
  • Current Data Import feature development dashboard and bugfix support

Agenda topics: 

  • UUIDs for Inventory and SRS
    • From Honeysuckle HF 3 through Kiwi, FOLIO has been creating Instances and MARC Bibs with the same UUID if the Instance and SRS MARC Bib are created at the same time, and putting that UUID in 999 ff $i and $s
    • UUID handling for Inventory Holdings and SRS MARC Holdings, and Inventory Authorities and SRS MARC Authorities is the same, but has not been happening as long
    • In Lotus, will change the UUID assignment so that different UUIDs are assigned for Inventory and SRS versions of records, and put those UUIDs in 999 ff $i and $s
    • Preference for the existing records with the same UUIDs is to NOT try to update them to have different UUIDs
      • Why? Seems risky, especially if libraries have exported  FOLIO records to other organizations  and are expecting the UUDs to stay the same
      • Seems difficult
      • Is that an OK decision? Any issues or questions?
      • Jenn: don't try to change the existing ones
      • A-M to talk to devs
        • Ask the devs: for records that have the same UUIDs in 999 $i and $s, especially in the future, when this time period will have been a small blip
        • Ask the devs: why are we changing it? Is it because something is broken because we have the same UUID for 2 different record types, or is it something more serious? If it's important to have different UUIDs, then will it be a problem if we leave some records (or even all, going forward) with the same UUIDs?
        • Make sure any architectural decisions like this are documented on a wiki page and in the code
        • Do the tech leads or TC have a place where architectural decisions are documented?
      • In the Import lab 2 Dec 2021
        • Are the 999 ff $i and $s really the same for all new Instance/SRS creates at the same time? DI, Inv Single Rec Import, quickMARC updates, quickMARC derives
        • Play with 999s in lab on 2 Dec 2021 and see what happens when they are created at the same time or different times, if the incoming record has 999s or not
        • Jenn: 999s should be trustworthy for matching, but not necessarily for reporting
        • Could libraries change on their own, instead of a centralized cleanup script?
        • Recommendation for users: Don't build a process that assumes $i and $s are always the same, or are always different
  • Data Import log improvements: brainstorming session
    • Is there additional info on the backend that is not surfacing in the UI?
    • Filtering by errors is helpful
    • Not all completed with errors jobs give enough information, especially if it's status Failed
    • DI will create the Instance, but no SRS created, will Complete with errors, but not have any error messages
    • Having error messages be more understandable - see Lisa's InstanceType example (seems to be related to the 008)
    • If SRS is discarded, need to have asterisk and explanation of errors
    • The summary page showing the overall results of the job has not yet been developed, and it's really hard to find the errors, especially for large jobs - counts of what succeeded and what failed
    • There are some records that cannot be loaded, and no idea why
    • Is it important to log the single record imports? Yes, everyone agrees
    • START HERE NEXT MEETING
    • Is it important to keep DI logs online forever? (Kiwi BF has 7k+, Cornell has 20K+) challenging for the View all screen, since it's trying to keep track of so much data
      • What if they were automatically purged after x days?
      • What if the log results could be exported?