Skip to end of banner
Go to start of banner

2019-08-26 Reporting SIG Meeting notes

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 14 Next »

Date

Attendees

Present?

Name

Organization

Present?

Name

Organization








Sharon BeltaineCornell University
Sara ColglazierMount Holyoke College/Five Colleges

Elizabeth BerneyDuke University
Erin NettifeeDuke University

Joyce ChapmanDuke University
Karen NewberyDuke University

Elizabeth EdwardsUniversity of Chicago
Tod OlsonUniversity of Chicago

Claudius Herkt-JanuschekSUB Hamburg
Scott PerryUniversity of Chicago


Doreen HeroldLehigh University
Stefan StadtherrMPIL Heidelberg

Anne L. HighsmithTexas A&M
Simona TabacaruTexas A&M

Harry KaplanianEBSCO
Kevin WalkerThe University of Alabama

Ingolf Kusshbz
Charlotte WhittIndex Data

Lina LakhiaSOAS

Andi Bihler

Munich Technical University Library

Joanne LearyCornell University
Uschi KluteGBV

Michael PatrickThe University of Alabama
Vandana ShahCornell University

Nassib NassarIndex Data
Angela Zoss

Duke University


Veit KöppenUniversity Magdeburg
Lisa DeCarolisSmith College/Five Colleges

Linda MillerCornell University
Elena O'Malley

Emerson


Matt HarringtonDuke University     Holly MistlebauerCornell University







Discussion Items

Item

Who

Notes

AttendanceSharon

Today's Attendance Taker: 

Today's notetaker: 

Last week's notetaker: Angela Zoss

Labor Day meeting cancelledSharon

The Reporting SIG meeting is cancelled on Monday, September 2, 2019 in observance of Labor Day in the U.S. 


Updates from Various Reporting Related Groups and EffortsVarious

The Reporting SIG is using small work groups to address priorities and complete our work. Each week, we will provide updates to the Reporting SIG from these various reporting-related groups and efforts:

  • Community and coordination
  • LDP Report Working Group
  • LDP Data Privacy Working Group
  • LDP SysOps Working Group
  • Software development
  • Others?


LDP Road MapNassib
  • Considerations in developing this first road map for LDP releases:
    • Key requirements:
      • Many reports have been ranked by institutions for "go live (MVP)", which means that they will have to be written and debugged within the next few months.
      • It is not recommended to do significant SQL reporting using FOLIO's internal operational database, as has often been done with many traditional ILS systems.  One reason is that there is no requirement that FOLIO modules use the same database, which means that cross-domain table joins on the operational database may break irreparably in the future.
      • Reporting analysts want an easy and familiar query model, and one that works with common reporting tools.
      • Reporting analysts will want queries to run efficiently.
      • Reporting SIG members have repeatedly communicated the need for access to all FOLIO data for reporting purposes.
    • Challenges and constraints:
      • The number of storage interfaces in FOLIO will soon exceed 100.  Although ETL for most of them is relatively simple, synchronizing such a large number of tables reliably on schema changes requires very active coordination with FOLIO, which so far has not proved to be possible.
      • LDP critical dependencies on FOLIO core development which have been requested and flagged as critical beginning in late 2018 will likely not be addressed until mid-2020 or beyond, based on discussions with the FOLIO capacity planning and project management groups.
      • FOLIO is requesting that "go live (MVP)" features be completed by January 2020, to be released in Summer 2020.
      • Future developer resources appear to continue to be roughly 1 FTE or less on average, consisting of several part-time developers.
  • LDP 1.0 proposed core features, for "go live (MVP)":
    • Support for ad hoc, cross-domain queries for all, or a very large proportion, of FOLIO data extracted from storage modules.  We would ask this working group and the Reporting SIG to help us determine, as soon as possible, the definition of "all FOLIO data" required for inclusion in the LDP.
    • Include MARC records extracted from FOLIO SRS and transformed for easier querying.
    • Historical data will be retained in the LDP but not transformed into a single schema.
    • LDP database recommended to be refreshed once per day from the FOLIO operational database.
    • Support for optional anonymization of personal data.  The Data Privacy WG will propose requirements for this feature, in particular which fields should be anonymized.
    • Implementation guidelines (documentation) for local tables.
    • Support for PostgreSQL and Redshift database systems.
    • Proposed data model design for LDP 1.0 based on the original LDP Architecture proposal.  See LDP documentation at: https://github.com/folio-org/ldp
  • Schedule:  LDP Beta (feature complete) in January 2020, LDP 1.0 in Summer 2020.
  • LDP beyond 1.0:  Historical queries using a single schema, full ETL, and relational or star schema could in theory be implemented for later releases, but this is highly dependent on the identified critical dependencies and availability of developer resources.
  • Current support for query development:
    • The test database is now using the proposed data model design for LDP 1.0.  Please take a look and send feedback as soon as possible.
    • Also in the test database are data needed for the Circ Item Detail query, in the following tables:
      • groups

      • holdings

      • instance_types

      • instances

      • institutions

      • items

      • loans

      • locations

      • material_types

      • service_points

      • temp_loans (workaround for missing effective location attribute in FOLIO)

      • users

    • Additional tables will soon be added to support writing report queries as prioritized by the Report Prototype working group.
    • Propose that the LDP Report Prototype Working Group be refocused, as originally envisioned, to implement prototype reports/queries and to bring them before this SIG for comment.
    • A few members of our development team will set aside time to assist this working group with SQL if needed.
    • We still urgently need test data that we can extract from running FOLIO installations.  Currently talking with Chicago about access to their data.
Additional Topics?All 
Topics for Future MeetingsAll

Review and update Topics for Future Reporting SIG Meetings 


Action items

  • Reporting Data Privacy to follow up with Jesse, Tod, and Cate, and Chair-Elect to determine approach to audit trails in LDP
  •  





  • No labels