2024-2-7 Data Import Subgroup meeting
Recordings are posted Here (2022+) and Here (pre-2022) Slack channel for Q&A, discussion between meetings
Requirements details Here Additional discussion topics in Subgroup parking lot
Attendees:
Notetaker: Jennifer Eustis, Corrie Hutchinson, Christie Thomas
Links:
Agenda:
Topic | Who | Meeting Notes | Related Jira | Decisions and Actions |
|---|---|---|---|---|
Announcements | Ryan | CSP #1 might have a timeline next week. Ryan will let us know next week. | N/A | Ryan to post on Slack when more information is available |
Missing Action Profiles in Job Profile after Poppy migration: As called out in Poppy Release Notes, there is a known issue that's been observed in which some links to reusable Action Profiles might be missing from Job Profiles after Poppy migration.
Release notes recommend the following:
Recommended script will provide list of Action profiles to help users manually recreate any affected Job profiles.
| All | There are 2 issues: experience of unlinking post migration and then experience of unlinking during migration. MODICONV-361 is a P1 with the hope to be released in a CSP #1. The MODICONV-365 is being investigated. It looks like FOLIO system job profiles are being affected in terms of actions being unlinked. 5C saw that the default ISRI overlay wasn't working correctly. When we checked the default system job profile there were no actions profiles. Ryan confirmed this issue only affects Action profiles. It is difficult to know how common this is. For 361, the behavior seems consistent. But for 365, this seems to be less common and different tenants have the issue occur on different jobs. This is the 3rd or 4th time that the issue in MODDICONV-361 has appeared during a flower release. The unlinking/linking issues date back several releases. To gather more information, it is worth keeping the corrupted jobs and create replacements. A job with no action profiles or an empty job can be run. There are no error messages when such a job is run. This is something we shouldn't be able to do. Perhaps a warning or an error message is needed.
Overview : Action profiles connected to multiple job profiles are 'unlinked' from job profiles after migration to Poppy.
Comments :
Sidebar discussion in chat on how job profiles are deleted spurred #42 in the Data Import Issue Tracker. Until MODDICONV-361 is fixed, any time a re-used action profile is unlinked in a job profile it will be unlinked in all other job profiles. Fixing it after migration doesn't stop it from happening again should a re-used action profile be unlinked. The development team will be adding new test cases to their workflow to test this type of scenario (re-used profiles) going forward. | https://folio-org.atlassian.net/browse/MODDICONV-361 is scheduled for Quesnelia release to fix root cause of unlinking issue behavior that's occurring post-migration. Set as P1, request for CSP, and in progress now. https://folio-org.atlassian.net/browse/MODDICONV-365 | Ryan will : Confirm this only impacts action profiles and not match profiles Get an idea for how commonly this occurs Test to ensure MODDICONV-361 only impacts re-used action profiles when 'unlinked' and not when added to a job profile. Talk to the dev team about bumping up the priority and included in CSP#1 Add error/warning to job with no actions running to Data Import Topic Tracker |
Partial Matching: | Subject raised by @Yael Hod | Previous notes from 1/31: Partial matching, e.g. begins with, ends with, is required but it does not function as it should regardless of how it is configured.
| Ryan will : Review Jira with Folijet leads to understand current design and identify requirement gaps. | |
Documentation: The group has identified a need for new, enhanced, or reorganized documentation around Data Import.
| All | Not discussed at the 1/31/ meeting
In lab session on 1/18/2024, we created a wiki page, Data Import Requesting a New Topic, with guidelines on how to contribute and a spreadsheet to track issues. This is based on the work done in the Acquisitions SIG. An archive area was also created where we could archive outdated pages such as the Archived Data Import Implementers and Feature Discussion Topics. The idea was to put down issues whether they were linked to a Jira issue or not. Some of the important information that we wanted to track was if there was a linked Jira and in particular when the issue was discussed in the working group and the decision(s) made in regard to that issue. The spreadsheet is still being developed. Before we add more issues, the group in lab wanted to know:
Discussion: A link to the new Data Import topic tracker is at the top of the page. Format was worked on at last week's data import session. Q: is this only to track Jira tickets? Or will there be other topics added to the agenda. R: In Acq /RM individuals add stories to the topic tracker and the Jira may only be added later to the spreadsheet. (many think this is a good idea.) Can reference the Acq/Resource Management implementers topic tracker. Perhaps add widgets that bring in Jiras automatically based on the tag. Q: How to add "Click here and expand" text. R: Put the cursor where you want the text block to begin and use Insert Macro function. Type "Expand" to locate the Expand Macro. Agreed to use the de-duplication discussion to work on building a useful functionality framework. | N/A | Get volunteers to create a spreadsheet and start brainstorming - DONE |
De-duplication: Continue conversation from previous session to clarify what we expect from de-duplication of field values when a record is loaded into FOLIO via Data Import. | All | Not discussed at the 1/31 meeting. Previous notes from 1/24 meeting:
@Jennifer Eustis and @Aaron Neslin found comments in the data-import-processing-core code that provides details about expected behavior for de-duplication. These comments align with the behavior we are seeing except for when there is duplicate data in the incoming record. Data is being removed from the incoming record on update as well. Consensus seems to be that FOLIO should not be de-duplicating within the incoming record unless it is explicitly defined in an import profile. Q: Is de-duplication something that should be able to be deactivated on a field by field basis? R: Sounds like a reasonable approach. There is also some concern that this would complicate an already complicated situation. Possible solution - a tool to deduplicate in another tool rather than within data import instead. Suggestion to start with the functionality audit. RT can connect with the developers as a part of this audit. Q: Are we starting with how we as users expect functionality work or with how the developers expect it to work. R: Really should have both for each feature. Start from perceived / desired functionality of the users and add to it with designed functionality. Suggestion to provide examples to the developers so that it is clear what we are expecting. Pilot functionality audit with de-duplication and start with our understanding and then get input from the developers.
| MODDATAIMP-879: Data Import removes duplicate 856s in SRS | Clarify current behavior of field value de-duplication. Define desired behavior of field value de-duplication (if different). @Christie Thomas will create some dummy data to illustrate deduping 856s. |
Upcoming meetings/agenda topics:
Chat: