2024-03-20 Meeting notes

Meeting Time:    8 am ET /  1 pm CET / 12 pm UK

Meeting URLhttps://openlibraryfoundation.zoom.us/j/89145608254

Password needed: please see link.

 Date

Mar 20, 2024

Housekeeping

  • Convener: @Martina Schildt

  • Minute taker: for this first meeting @Martina Schildt

  • Recording March 20

  • Next meeting:  Apr 17, 2024

  • All questions and implementers' topics can be posted on: ERM implementers.

 Discussion topics

Item

Presenter

Notes

Actions

Item

Presenter

Notes

Actions

Organisational matters

@Martina Schildt

  • Quick intro: Why this subgroup and Wiki space intro

Questions to clarify:

  • Do we need an own Slack channel?

    • we start with staying in #erm-team

    • eholdings subgroup did the same

    • if need be we can create an own channel at a later point

  • Do we need an own mailing list to publish and distribute the agenda?

    • we start without

    • if need be we can create a mailing list for this subgroup

  • Do we need a separate implementers' page?

    • we add our implementers topic to the ERM implementers page

    • in addition: we will have a topics list as the eholdings subgroup does

  • Who would volunteer to take minutes?

    • Felix: If there were multiple people taking notes, I would volunteer.

    • Sabrina: In this case, I would volunteer too

    • Karina: I can also take notes next time

  • Who would volunteer to note attendance?

    • Is not needed for this group

    • Everyone is welcome to add their names to the members list (will be created within the next days)

 

 

 

 

 

 

 

 

@Martina Schildt create future topics page

https://folio-org.atlassian.net/wiki/x/MYFRC

 

 

 

@Martina Schildt create members page and circulate

https://folio-org.atlassian.net/wiki/x/gwDcAg

 

Update on changes to Local KB

@Owen Stephens

Link to slide deck:

  • Upcoming changes March 2024 on Local KB Synchonization

  • already increased speed of data ingest

  • when doing testing in GBV loading GOKB took ~ 24 hours

  • behaviour until now: titles are treated as universal -

  • same titles from different sources (e.g. GOKB or other)are treated as the same title

  • IDs used to match titles:

    • ISSN, ISBN, DOI, ZDB ID (unique ID used in Germany)

  • this mechanism has not always worked well even if there was only one external source

  • sometimes Local KB treated 2 titles as one where GOKB treated them as 2 > seemed like a title was missing when it was not

New approach

  • Works will be treated as source specific

  • incoming titles require a work source identifier

  • if there are 2 different work source IDs for a single title > 2 separate titles will be created in local KB

  • for GOKB: GOKB reference title UUID

  • for tenants only using GOKB for data this means we mirror GOKB data much more precisely

  • for tenants using multiple data sources (e.g. file imports as well as GOKB) titles will appear multiple times unless the same work source ID is used on import

Transition between old and new approach

  • because existring systems will not have work source ID for each work, this will need to be added

  • new approach does fall back matching on old IDs if incoming work source ID does not already exist > existing titles are used as correctly as possible

  • titles will only get a work source ID added as they are matched

  • for GOKB users this means re-harvesting the full GOKB

  • Bernd: what happens if titles were merged wrongly?

  • Owen: will work out and end up in the right state as soon as all the system has re-harvested all titles

  • on harvest should do the job, but maybe it needs 2 harvests

  • Bernd in chat: In the local KB, but what about the agreements? As a result the wrong title could be attached?

  • Owen: if the title is directly attached to an AGL this can happen - system cannot know whteher that is the title that was meant to be matched - TO DO work on reports to inform users what happened / which title was matched

  • title matching method is set by environment setting (system admin)

  • Poppy defaults to old approach but can be switched to new

  • Quesnelia defaults to new approach but can be switched to old

  • not recommended to switch multiple times

  • we have tested switching from old to new but not the other way around

  • on Snapshot GOKB is harvested every day without errors

GOKB harvesting

Currently using an approach called “OAI-PMH”

  • that is package based

  • even a single and minor change to a package results in a complete update of the whole package

  • if a package fails, we do not get any data from that package

Working on switching to a new approach called the “scrolling API”

  • separate data feeds for package metadata (no titles) and titles

  • this means we can partially ingest packages - failure of a single title does not affect rest of the package

  • changes can be taken from title level

New approach

  • moves work outside FOLIO

  • New API called “pushKB”

  • new external service also called “push KB”

  • can be used for any data

Development status

  • pushKB API has been added to FOLIO but is not active by default

  • cannot be active at the same time as current OAI-PMH approach

  • external service is work in progress

  • most likely expect external service to be tested against Quesnelia release

 

  • Felix in chat: Q for after Owen's presentation: Has logic changed for matching of GOKb package titles?

  • if new titles are added or old titles are deleted from packages

  • Owen: for package titles = PCIs is a combination of title record in FOLIO, package record in FOLIO and platform the title is on

  • if we see a different title, this new title will be added - new PCI

  • the old one would be treated as a deletion

 

  • Christin in chat: Thank you Martina and Owen. I have to go to the next meeting. It would be great if the presentation could be made available. Then I'll take them with me to the rest of the GOKb team. Thank you 🙂 bye

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

@Owen Stephens Create a ticket for working on reports to inform about AGL title matching

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

@Owen Stephens Share presentation

 

 

Chat

00:03:04 Zorian Sasyk | EBSCO FOLIO Implementation:
00:21:11 scolglaz: Visibility Settings are new-ish HLM, for example, which includes Exclude from MARC Export.
00:21:24 Becca Banach: Reacted to "Visibility Settings ..." with 👍🏻
00:22:30 Owen Stephens: That would be my understanding as well Sara
00:30:06 Owen Stephens: Just on the ‘admin’ vs ‘access’ aspects that Zorian mentioned, we originally envisioned Agreements helping to bridge this gap, and I’m very interested in having this discussion. For example - could we have Agreements help with ensuring the right resources are active/not active in eHoldings/HLM?
00:31:14 scolglaz: Reacted to "Just on the ‘admin’ ..." with 👍
00:32:13 Becca Banach: Reacted to "Just on the ‘admin’ ..." with 👍
00:44:25 scolglaz: The ID string like xx-xx-xx would helpful, because it could help me figure out what the vendor - title/entity - package was … to be able to figure out what to go back and turn on, etc … just to explain a bit more
00:44:40 Stephanie Kaceli, Cairn University: Reacted to "The ID string like x..." with 👍
00:54:24 scolglaz: Thanks for that Owen!
00:54:36 Owen Stephens:
00:56:32 scolglaz: Great! So good to know! Thanks, Owen
00:57:19 Liisa Mobley: I have to head to another meeting. Nice to meet everyone!

 Participants

Present

Name

Home Organization

Present

Name

Home Organization

 

Aaron Neslin

UMass

 

Alice Daugherty

University of Alabama

 

Alissa Hafele

Stanford University

 

Amelia Sutton

UMass

 

Andrea Meindl

UB Regensburg

 

Annika Schröer

UB Leipzig

 

Annika Zierhut

 

 

Anya Arnold

EBSCO

 

Beate Aretz

SBB

 

Becca Banach

Smith College

x

Benjamin Ahlborn

SuUB Bremen

 

Bernd Oberknapp

UB Freiburg / ReDI

 

Beverly Geckle

MTSU

 

Birgit Neumann

 

 

Carol Sterenberg

 

 

Carola Bauch-Schuster

UB Eichstätt-Ingolstadt

 

Carole Godfrey

EBSCO

 

Catherine Tuohy

Emmanuel College 

 

Charlotte Whitt

Index Data

x

Christin Seegert

hbz

x

Christoph Kohl

hbz

 

Clara Marino

 

 

Claudia Malzer

ULB Darmstadt, Developer

 

Constantin Lehenmeier

 

 

Dorothe Kugler

UB Mainz

 

Dwayne Swigert

Missouri State University

 

Emma Raub 

Cornell

 

Elizabeth Richens

 

 

Eric Hartnett

Texas A&M University

x

Felix Hemme

ZBW Kiel

 

Gail Murray

Five Colleges / Smith College

 

Gang Zhou

Shanghai Library

x

Gill Osguthorpe

UX/UI Designer - K-Int

x

Gisela Weinerth

SUB Hamburg

 

Gregory Edwards

Lehigh University

 

Heather MacFarlane

 

 

Heather Thoele

Texas A&M University

x

Helge Knüttel

UB Regensburg

 

Ian Ibbotson

Developer Lead - K-Int

 

Jag Goraya

K-Int

 

Janet Ewing

Five Colleges / Mount Holyoke College Library

 

Jenna Lantermann

Five Colleges / Smith College 

 

Jessica Harris

Chicago

 

Jir Shin Boey

Missouri State University

 

Joe Sikowitz

Fenway Library Organization

 

Johanna Radding

Five Colleges / Amherst College

 

Kathleen Berry

UMass Amherst

 

Kathleen Norton

Five Colleges / Mount Holyoke College Library

 

Katrin Brüggemann

UB Leipzig

 

Khalilah Gambrell

EBSCO

 

Kristin Martin

Chicago

 

Kyle Banerjee

EBSCO

 

Lars-Hakan Herbertsson

Chalmers

 

Laurenz Kern

Stabi Berlin

 

Leonie Bodinus

UB Erlangen-Nürnberg

 

Lilly Zirngibl

 

 

Lindsey Lowry

University of Alabama

 

Lisa Hartlmüller

UB Regensburg

 

Lisa Schneeberger

 

 

Lola Estelle

EBSCO

 

Luca Lanzillo

Sapienza Library System - Sapienza University of Rome

 

Lucas Moder

UB Regensburg

 

Lucia McBride

Lehigh University

 

Mara Egherman

EBSCO

 

Maria Taylor

 

 

Marie Widigson

Chalmers

 

Mariyam Thorhira

Johns Hopkins University Libraries

 

Marjorie Snyder

 

 

Mark Arnold

Missouri State University

 

Martin Scholz

 

 

Martina Karlsson

Chalmers

 

Martina Schildt

VZG, Göttingen

 

Mary O’Brien

 

 

Marya Kovalenko

 

 

Matthieu Bordet

DMCultura, Ravenna, Italy

 

Mohammad Alhamad

Missouri State University

 

Molly Driscoll

EBSCO

 

Nancy Finn

 

 

Nancy Pelis

Five Colleges / Mount Holyoke College Library

 

Nicole Trujillo

 

x

Nina Stellmann

VZG, Göttingen

 

Norma Flores

Texas A&M University

x

Owen Stephens

Product Owner -  Owen Stephens Consulting

 

Paul Trumble

Amherst

 

Paivi Rentz

Texas State University

 

Peter Böhm

HeBIS, Frankfurt

 

Peter McCracken

Cornell

 

Phil Adams

De Montfort University

 

Ray Murray

 

 

Robert Heaton

EBSCO

 

Robert Scheier

College of the Holy Cross

 

Rüdiger Stratmann

IAI SPK Berlin

 

Sabine Howahl

ULB Darmstadt

x

Sabrina Bayer

UB Regensburg

x

Sara Colglazier

Five Colleges / Mount Holyoke College Library

 

Sarah Dennis

Texas A&M University

 

Sarah Morgenstern-Einenkel

UB Leipzig

 

Scott Perry

 

 

Scott Stangroom

UMass

 

Sharon Beltaine

 

 

Siobhan McManamon

Five Colleges / Smith College

 

Sobha Duvvuri

EBSCO

 

Stephani Kaceli

Cairn University

 

Stephanie Larrison

Texas State University

 

Stew MacLehose

University of New England

 

Susanne Gill

BVB

 

Susanne Schuster

BSZ Konstanz

 

Susie Skowronek

 

 

Tara Barnett

 

 

Tatjana Clemens

UB Frankfurt

 

Tracy Patton

Missouri State University

 

Valerie Cervantes

 

 

Xiaoyan (Yan) Song

NCSU

x

Zorian Sasyk

EBSCO

x

Saskia Jorzenuk

HAB Wolfenbüttel

 

 

 

 Action items

 Decisions