Date
Attendees
Present? | Name | Organization | Present? | Name | Organization |
---|---|---|---|---|---|
X | Sharon Beltaine | Cornell University | Peter Murray | Index Data | |
Elizabeth Berney | Duke University | Erin Nettifee | Duke University | ||
Joyce Chapman | Duke University | Karen Newbery | Duke University | ||
Elizabeth Edwards | University of Chicago | X | Tod Olson | University of Chicago | |
X | Claudius Herkt-Januschek | SUB Hamburg | X | Scott Perry | University of Chicago |
X | Doreen Herold | Lehigh University | Robert Sass | Qulto | |
X | Anne L. Highsmith | Texas A&M | X | Simona Tabacaru | Texas A&M |
Vince Bareau | EBSCO | Mark Veksler | EBSCO | ||
X | Harry Kaplanian | EBSCO | X | Kevin Walker | The University of Alabama |
X | Ingolf Kuss | hbz | Charlotte Whitt | Index Data | |
Lina Lakhia | SOAS | X | Michael Winkler | OLE | |
X | Joanne Leary | Cornell University | Uschi Klute | GBV | |
X | Michael Patrick | The University of Alabama | X | Holly Mistlebauer | Cornell University |
X | Nassib Nassar | Index Data | X | Angela Zoss | Duke University |
X | Veit Köppen | University Magdeburg |
Discussion items
Item | Who | Notes |
---|---|---|
Assign Notetaker, Take Attendance, Review agenda | Sharon | Today's notetaker: Ingolf Kuss Last week's notetaker: Simona Tabacaru |
Glint | Nassib Nassar | Nassib will discuss the https://glintcore.net project, which provides a foundation for the library data platform architecture for the FOLIO reference data warehouse. Glint is open source software for communicating, describing, and integrating data. Notes Glint and the LDP (slides by Nassib) (LDP = Library Data Platform) Glint : An open source software for sharing, curating and integrating data. Features:
Data curation = describing data. Data curation usually takes place at a late stage, after data collection, data pipelines and sharing with a research team. Glint takes care ot the data curation at an earlier stage. This helps to integrate and re-use the data. Glint is kept simple (lightweight) to be able to run in many different contexts. Glint has 2 interfaces:
Glint could be configured to store the data in a repository. The data is posted to glint (e.g. by the command line interface) and will be assigned an URL.The data is user-modifiable. There is a simple language to modifiy the data. Jason Skomorowski has written an UI-interface for integrating data in FOLIO (ui-datasets, a FOLIO client for Glint). LDP - Library Data Platform that is build for Reporting. It consists of (classical) Batch-ETL and a database (on the left hand side, for a classicalReporting Tool), but also streaming ETL, distributed databases and other databases (on the right hand side, for other data analysis tools). One might use Glint as a data platform. Questions that come up with LDP:
How Glint can help: Glint allows for a lot of flexibility how data are stored:
Glint can be used to offer an interface for accessing reports. With Glint, one can access data that are within as well as outside of a database. Integration with R and Python: Can use Python calls for Glint, a Glint library has been developed for Python. LDP can be made accessible to FOLIO modules in form of Glient data sets, when direct database access is not feasible. I.e. if that would put to much load on the database. What Glint doesn't do:
Discussion Q: Does Glint replace the database ? Will Glint be the primary source when writing reports ? A: Glint would not replace the direct database connection. Glint would share reports, share database views. Glint gives access to the data (to people who do not have direct access to the database). Glint is for data scientists, not so much for data analysts (the latter ones would prefer to use SQL). Glint can be used for data which do not fit in a database. It is also for sharing data to the public (can be views or generated reports). A Reporting SIG Data Warehouse might be redundant for institutions which already have a data warehouse (e.g. needed for legal reasons or for national reporting). But most of us are not in the situation where we already have a data warehouse. LDP is a hybrid solution for evolving needs for data analysis. The model (LDP and Glint) is superior in some ways but inferior in many ways. It would be a more modern system, but some things will have to be re-invented. |
Prioritizing Your Institution's Reports | Sharon Beltaine | In order to determine the order in which we will prototype our reports for the development of data models for the data warehouse, we need to prioritize them within each functional area (e.g., RM, RA, etc.). Please see the Prioritizing Reports page in the wiki for detailed instructions. -This week, we will review our group's progress on prioritizing reports in the "import and export," "external statistics," and "resource mgmt" worksheets in the Reporting SIG Master Spreadsheet -Monday November 12, 2018 is our target completion date |
Assigning Yourself to JIRA Reports | Holly Mistlebauer | Holly walked us through the process of assigning ourselves to our reports in the FOLIO JIRA System. Instructions are provided on this wiki page, which will also contain other JIRA info: Working on a JIRA Issue -review of JIRA report assignments -Monday November 12, 2018 is our target completion date |
Topics for Future Meetings | All | Review and update Topics for Future Reporting SIG Meetings |
Other Topics? | All | Any other topics to discuss today? |