2018-06-04 Reporting SIG notes

Date

Attendees

Present?NameOrganizationPresent?NameOrganization
XSharon BeltaineCornell University
Peter MurrayIndex Data

Elizabeth BerneyDuke University
Erin NettifeeDuke University

Joyce ChapmanDuke UniversityXKaren NewberyDuke University

Elizabeth EdwardsUniversity of ChicagoXTod OlsonUniversity of Chicago
XClaudius Herkt-JanuschekSUB HamburgXScott PerryUniversity of Chicago

Doreen HeroldLehigh University
Robert SassQulto
XAnne L. HighsmithTexas A&MXSimona TabacaruTexas A&M

Vince BareauEBSCO
Mark VekslerEBSCO

Harry KaplanianEBSCOXKevin WalkerThe University of Alabama
XIngolf Kusshbz
Charlotte WhittIndex Data

Lina LakhiaSOAS

Michael Winkler

OLE
XJoanne LearyCornell University
Uschi KluteGBV
XMichael PatrickThe University of Alabama
Holly MistlebauerCornell University






Discussion items

ItemWhoNotes
Assign Notetaker, Take Attendance, Review agendaSharon

Today's notetaker: ALL (attendees will comment on the Data Warehouse Proposal)

Last week's notetaker: Anne Highsmith

Interim Reporting POHolly MistlebauerHolly is stepping in as Interim Product Owner for Reporting until Product Council assigns a permanent Reporting PO. She will be joining us for Reporting SIG meetings and working with other Product Owners on reporting issues.
Data Warehouse Proposal reviewAll

Reporting SIG Proposal to FOLIO Product Council for a Test Data Warehouse

-Sharon to present a Data Warehouse Test Environment Proposal to Product Council

-proposal is to add reporting test environment to Folio tenant infrastructure e.g., Prod, Dev, Test, Reporting (Test)...

Notes by Ingolf:

End goal is a Data Warehouse, not a Data Lake.

In Postgres, JSON is not unstructured. Postgres lets you query JSON objects like you would request columns in a RDBMS. You can have a BIRT tool that uses Postgres extensions to SQL to query e.g. user documents. The Reporting Tool should be able to load a Postgres driver (BIRT does). We need tools that connect to Postgres and reach out to JSON objects.

The Data Warehouse Test will reveal: Can we read and grep the data tha we need using the Postgres extensions ? We haven't tried this yet, this is why we have to set it up and test it.

Critizism: Direct SQL-query construction requires a very high level of technical expertise. Contra-critizism: There will be a steep learning curve.

We need a Roadmap for Reporting. At least a lightweight roadmap. We will need support of developers, therefore we need to put forward this proposal.

Test Data Warehouse: a template for the data warehouse that each institution will set up for their own use.

Lessons learned from the Data Lake PoC : Need a structured data warehouse. Setting up an operational data warehouse takes significant design, developing and testing. Takes significant effort of FOLIO project: Core developers, report developers, members of SysOps SIG etc.

existing Jira-Tickets:

OKAPI-570 : Tapping transactional data in Okapi

OKAPI-591 Pre-Handlers and Post-Handlers

Roadmap:

  • June-Aug. 2018: Setting up the Test Data Warehouse
  • Sept-Dec. 2018: Data analysists will build test reports on the test data warehouse
  • Jan-June 2019: First Implementers use the test data warehouse to set up a fully-functionating data warehouse environment. Final versions of the reports are completed.
  • July 2019: Data Warehouse Reporting is ready to GoLive for First Implementers.
Other Topics?AllAny other topics to discuss today?
Topics for Future MeetingsAll

Next week's meeting focus: Bib Control, In-app Reporting

Review and update Topics for Future Reporting SIG Meetings

Action items

  •