2025-01-07 Better Sample Data Meeting notes
Date
Jan 7, 2025
Participants
@Yogesh Kumar (regrets), @Lee Braginsky, @Charlotte Whitt, @Kristin Martin, @Autumn Faulkner, @Tod Olson, @Shelley Doljack
Goals
Follow up on the status of discussion topics and tasks
Discussion topics
Time | Item | Notes |
---|---|---|
 | General; FOLIO Snapshot; Anonymization of data in Bugfest environments; Golden Copy | @Autumn Faulkner sent the finalized letter to the MSU dean in early December. Any updates? No news yet. Document written up by the TC: Draft report on adoption and timeline of the Eureka platform. Does this cause any changes to what this working group is working on? Tod mentions that no official decision on when to roll out Eureka has been taken yet. There is a meeting in the Tri-Council on 1/13/2025.
Does anonymizing matter for Inventory records?
Will come back to the following topics at next meeting:
Lee's group will be tied up with Ramsons and Sunflower work. Lee suggests focusing on one area, e.g. patron and user group data. A tool can be used to fake data (names, phone numbers, addresses: all PII data). Shelley is working on this in Python. Shelley has put together a wiki page to gather requirements for anonymization: https://folio-org.atlassian.net/wiki/x/BQA4K . Shelley has pulled out all reference data. Tod mentioned that Chicago might be able to contribute a Python developer too. Will come back to this in the new year. Shelley would need to have the technical requirements written up. Will start with the document provided by Lee and his project on the POC.
Golden copy has ~8-9 million instances. Contributing institutions will need to stand up a second test environment. Shelley asked about the tenant and the tenant IDs: is the reference environment to be a multi-tenant environment (Chicago, Stanford, MSU)? The current Bugfest environment is a single-tenant environment. Tod pointed out that data from all three institutions would cause inconsistency in the use of reference data. Tod is thinking specifically of locations, but also of item material types. Shelley asked whether a multi-tenant environment would mean that each institution has its own reference data. Would a solution be to have multiple standalone environments (A, B, and C)? Yogesh confirmed. Merging data would be phase 2. |
 | Review timeline document | |
 | Other topics | Lee will be out on vacation starting 1/9/2025, back on 1/17/2025. Autumn will be absent next Tuesday too. Next meeting will be 1/21/2025 at noon (EST) / 6:00 pm CET. |
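
For reference, the idea of faking PII (names, phone numbers, addresses) while leaving reference data untouched could be sketched roughly as below. This is only an illustration and not the tool Shelley is building: the record layout (`personal`, `firstName`, `addresses`, `patronGroup`, and so on), the name lists, the `salt` parameter, and the function names are all assumptions made for the example.

```python
import copy
import hashlib

# Hypothetical pools of replacement names (assumption, not from the notes).
FIRST_NAMES = ["Alex", "Sam", "Jordan", "Taylor", "Morgan"]
LAST_NAMES = ["Reader", "Patron", "Borrower", "Member", "Visitor"]


def _digest(value: str, salt: str) -> int:
    """Map a value to a stable integer so the same input always gets the same fake."""
    return int(hashlib.sha256((salt + value).encode("utf-8")).hexdigest(), 16)


def anonymize_user(user: dict, salt: str = "change-me") -> dict:
    """Return a copy of a user record with selected PII fields replaced.

    Only the fields listed here are touched; everything else, including
    reference data such as the patron group, is kept as-is.
    """
    result = copy.deepcopy(user)
    personal = result.get("personal", {})
    seed = _digest(str(user.get("id", "")), salt)

    if "firstName" in personal:
        personal["firstName"] = FIRST_NAMES[seed % len(FIRST_NAMES)]
    if "lastName" in personal:
        personal["lastName"] = LAST_NAMES[(seed // 7) % len(LAST_NAMES)]
    if "email" in personal:
        personal["email"] = f"user{seed % 10**8:08d}@example.org"
    if "phone" in personal:
        # 555-0100 to 555-0199 is the range conventionally used for fictional numbers.
        personal["phone"] = f"555-01{seed % 100:02d}"
    for address in personal.get("addresses", []):
        if "addressLine1" in address:
            address["addressLine1"] = f"{seed % 900 + 100} Example Street"
    return result
```

Deriving the fake values from a salted hash of the record ID keeps the output repeatable between runs, which may help when the same data set has to be refreshed; a real tool would need the full field list from the requirements page.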
Action items
Decisions
Lee, Yogesh, and Shelley will update the working group on discussions and progress in developing the anonymization tool. Lee, Yogesh, Shelley, and Noah meet every Monday.