2022-11-29 Discovery Integration Subgroup Meeting notes
Date
EDT 10:00am to 11:00am
Goals
Discussion items
Time | Item | Who | Notes |
---|---|---|---|
10:00 | Start of the meeting | ||
10:05 | requests currently not covered by an API for discovery systems (e.g. trending items) | See Brainstorm page for the diverse interface | |
10:15 | start of discussion about data export (OAI-PMH) | Update from Magda via Slack: Hi all, I would like to follow up on the OAI-PMH discussion from the last week meeting.Harvesting holdings and items data
Stabilization issues: You can also find more information about FOLIO's implementation of OAI-PMH in: https://folio-org.atlassian.net/wiki/display/FOLIOtips/OAI-PMH+Best+Practices Unfortunately, I won't be able to attend this week meeting either but I will listen to the recording and respond here. Also, I should be able to attend the meeting on December 6th if you think that would be helpful. Discussion: Villanova has used OAI-PMH to index a couple million records in their test system. Some records get dropped, e.g. due to bad leaders or illegal control characters that cannot occur in XML, so important to watch the export counts and errors. (can usually deal with by fixing the records) Noted the stabilization work mentioned above. Is there a real use case for multiple full harvests to different systems simultaneously? Currently we have Five Colleges, which are on a multi-tenant system, but all have EDS, so EBSCO can apply the full export to all. (Currently harvest about 5.2 million in 11 hours, down from 36 hours over the summer.) Possible use case where you do not control the schedule of all systems which harvest from you, the system doesn't handle that so well right now. [Magda Zacharska]: This should be addressed in UXPROD-3772 Implement Retry-after property for OAI-PMH response One solution might be to harvest to an intermediary system like VuFind and then do the multiple harvests from that. Q: does the FOLIO OAI-PMH support multiple formats, i.e. only MARCXML, or just Dublin Core? (We think just Dublin Core.) A: [Magda Zacharska] FOLIO supports both: MARCXML and Dublin Core Q: is there a need to support more formats or more verbs? Currently supports "get Record" but does not respect format (see above), which complicates troubleshooting. A: [Magda Zacharska] FOLIO supports:
ListSets – Used to retrieve the set structure of a repository will be implemented in UXPROD-2439 (currently planned for the Poppy release) The GetRecord issue not supporting marc21_withholdings metadataPrefix (MODOAIPMH-426) bug has been resolved in Morning Glory release. One issue is that edge module hits inventory every time it needs to build a list, that is the performance Achilles heel. | |
10:40 | How to move forward with these? |
Ideas
| |
11:00 | End of the meeting |