2018-06-04 Reporting SIG notes

2018-06-04 Reporting SIG notes

Date

Jun 4, 2018

Attendees

Present?

Name

Organization

Present?

Name

Organization

Present?

Name

Organization

Present?

Name

Organization

X

Sharon Beltaine

Cornell University

 

Peter Murray

Index Data

 

Elizabeth Berney

Duke University

 

Erin Nettifee

Duke University

 

Joyce Chapman

Duke University

X

Karen Newbery

Duke University

 

Elizabeth Edwards

University of Chicago

X

Tod Olson

University of Chicago

X

Claudius Herkt-Januschek

SUB Hamburg

X

Scott Perry

University of Chicago

 

Doreen Herold

Lehigh University

 

Robert Sass

Qulto

X

Anne L. Highsmith

Texas A&M

X

Simona Tabacaru

Texas A&M

 

Vince Bareau

EBSCO

 

Mark Veksler

EBSCO

 

Harry Kaplanian

EBSCO

X

Kevin Walker

The University of Alabama

X

Ingolf Kuss

hbz

 

Charlotte Whitt

Index Data

 

Lina Lakhia

SOAS

 

Michael Winkler

OLE

X

Joanne Leary

Cornell University

 

Uschi Klute

GBV

X

Michael Patrick

The University of Alabama

 

Holly Mistlebauer

Cornell University

 

 

 

 

 

 

Discussion items

Item

Who

Notes

Item

Who

Notes

Assign Notetaker, Take Attendance, Review agenda

Sharon

Today's notetaker: ALL (attendees will comment on the Data Warehouse Proposal)

Last week's notetaker: Anne Highsmith

Interim Reporting PO

Holly Mistlebauer

Holly is stepping in as Interim Product Owner for Reporting until Product Council assigns a permanent Reporting PO. She will be joining us for Reporting SIG meetings and working with other Product Owners on reporting issues.

Data Warehouse Proposal review

All

Reporting SIG Proposal to FOLIO Product Council for a Test Data Warehouse

-Sharon to present a Data Warehouse Test Environment Proposal to Product Council

-proposal is to add reporting test environment to Folio tenant infrastructure e.g., Prod, Dev, Test, Reporting (Test)...

Notes by Ingolf:

End goal is a Data Warehouse, not a Data Lake.

In Postgres, JSON is not unstructured. Postgres lets you query JSON objects like you would request columns in a RDBMS. You can have a BIRT tool that uses Postgres extensions to SQL to query e.g. user documents. The Reporting Tool should be able to load a Postgres driver (BIRT does). We need tools that connect to Postgres and reach out to JSON objects.

The Data Warehouse Test will reveal: Can we read and grep the data tha we need using the Postgres extensions ? We haven't tried this yet, this is why we have to set it up and test it.

Critizism: Direct SQL-query construction requires a very high level of technical expertise. Contra-critizism: There will be a steep learning curve.

We need a Roadmap for Reporting. At least a lightweight roadmap. We will need support of developers, therefore we need to put forward this proposal.

Test Data Warehouse: a template for the data warehouse that each institution will set up for their own use.

Lessons learned from the Data Lake PoC : Need a structured data warehouse. Setting up an operational data warehouse takes significant design, developing and testing. Takes significant effort of FOLIO project: Core developers, report developers, members of SysOps SIG etc.

existing Jira-Tickets:

OKAPI-570 : Tapping transactional data in Okapi

OKAPI-591 Pre-Handlers and Post-Handlers

Roadmap:

  • June-Aug. 2018: Setting up the Test Data Warehouse

  • Sept-Dec. 2018: Data analysists will build test reports on the test data warehouse

  • Jan-June 2019: First Implementers use the test data warehouse to set up a fully-functionating data warehouse environment. Final versions of the reports are completed.

  • July 2019: Data Warehouse Reporting is ready to GoLive for First Implementers.

Other Topics?

All

Any other topics to discuss today?

Topics for Future Meetings

All

Next week's meeting focus: Bib Control, In-app Reporting

Review and update Topics for Future Reporting SIG Meetings

Action items