Date
...
Time | Item | Who | Notes | |||
---|---|---|---|---|---|---|
1 min | Scribe | All | Marc Johnson is next, followed by Ingolf Kuss | |||
5 min | TCR Board Review | All | ||||
10 min | Technical Council Sub Groups Updates | All | 5-10 min | RFCs | All | RFC to review from Olamide: https://github.com/folio-org/rfcs/pull/4 Previous Notes:
Today: We have 1 weeks for remaining in the DRAFT REVIEW stage.Tod Olson advised that the Technical Council Goals and Objectives group intends to have a draft available to the TC next week Marc Johnson stated that the Translations group has been on hiatus since before the turn of the year. And suggested that the group might conclude, once it has summarised it's thoughts, in anticipation for product decisions in this area. Zak Burke can likely advise better the groups' current situation Craig McNally suggested that we need to find a way to wrap up some of these working groups, in particular the longer running ones Jeremy Huff advised that it is possible that the Breaking Changes group could have an RFC to present (on the definition of breaking changes) to the TC next week Jeremy Huff advised that the TCR improvement group spent last week putting together improvements to (try to) close the hello world loophole in the current process. He asked how the group should bring the suggested changes back to the TC Craig McNally suggested they be shared with the TC separately to review and will be discussed next week Jenn Colt advised that the Charter group produced another draft and asked all TC members to review this prior to discussing it further at next weeks meeting. She described there is already some good conversation happening on the draft and asked that folks raise concerns, especially anything that might block approval, prior to the next week's meeting |
5-10 min | RFCs | All | Olamide Kolawole advised that the draft review group met yesterday to review the Reducing Kafka Partitions RFC and will be updating the RFC following the discussion. He advised that the group is advocating that FOLIO provide the ability to support either single topic per tenant or mixed tenant topics. The group will meet again next week Craig McNally suggested that we have one week remaining for this stage of the process. He asked if the group can be finished within that timeframe? Olamide Kolawole suggested that the extra week will likely be needed, and the group was working on the clock starting when the group was formed Craig McNally acknowledged that reducing the ambiguity of when the clock starts for these phases and how long each phase needs to be should be addressed during the next review of this process | |||
30 min | DR-000032 - Splitting Database Read/Write Traffics in RMB | Review DR-000032 - Splitting Database Read/Write Traffics Martin Tran to give a short presentation, including performance results | 1 Martin Tran presented the proposed ADR for Splitting Database Read / Write Traffic along with an accompanying . He advised that the FSE team and Core Platform team implemented this work in RMB during the Morning Glory and Nolana releases, during which most RMB storage modules were upgraded to support the proposed mechanism (meaning that this design is already in use in production). These changes were performance tested using circulation and data import flows. He advised that CPU use is more evenly split between read and write nodes, reducing the pressure on the write node when under higher loads (25 concurrent use). This also led to a reduction in CPU usage in the ("back up the chain") modules making the database requests. During circulation only testing, whilst there was a slight deduction in response times for check in, responses times for check out increases in some cases. However, once data import and circulation work flows were running concurrently, the read / write split significantly reduced response times over a single node. There is remaining work to fully support this across FOLIO:
Ingolf Kuss asked if this approach only works if the database is clustered? Martin Tran advised that is the case, however there is no advice given for how to achieve that, however EBSCO use AWS RDS for this purpose. Ingolf Kuss asked if this was specific to a module and required configuration? Martin Tran advised that this involves specifying environment variables for the read and write node. Marc Johnson advised that this is specific to each module because currently only some modules support this mechanism. And that identifying which modules support this is an exercise folks need to do Craig McNally asked if this was an opt-in model (as the single node configuration is still supported)? Martin Tran confirmed that is the case. Only operators wanting to use the split have to identify which models support this. Tod Olson asked where do we direct folks for documentation on how to use / configure this mechanism? Martin Tran advised that the details of the current implementation are available in the RMB readme. And that configuration of the cluster is an exercise for system implementors / operators. Marc Johnson suggested that there are different kinds of documentation that needs to be provided: architectural; per-module configuration; operations documentation Jeremy Huff asked if this has only been implemented for AWS RDS instances? Martin Tran confirmed that this has only been tested on AWS RDS and that how database nodes synchronise data is outside of the scope of this work Jeremy Huff requested that the ADR provide more details of the solution (separate to the RMB implementation). Martin Tran agreed to add that into the ADR Maccabee Levine suggested that there is very little downside if other modules don't adopt this approach, and asked what are the downsides if other modules do adopt this approach? Martin Tran suggested that one downside is that RMB creates separate connection pools for read and write connections, which could use more memory in the database. Marc Johnson suggested some of the other trade offs involved in this could be: doubling the database connections (and port usage) for each module; this architectural pattern introduces the potential for stale reads; doubling the potential configurations (similarly to the approach mentioned in the discussion above about the Kafka RFC) of the system which may need testing. Tod Olson asked which RMB storage modules have been upgraded to support this mechanism? Martin Tran advised that over 40 of the modules have been upgraded to use the version of RMB that supports this mechanism, however only one has been updated to use unambiguous operations. Marc Johnson clarified that most RMB modules have been updated to (partially) support this mechanism. And shared his understanding that none of these modules having automated tests that exercise this mechanism during builds and that many of them have some use of ambiguous (RMB does not know if they are read or write) operations that need to be changed, meaning there is outstanding work in almost all of the modules that have already been upgraded. Jeremy Huff asked if a system configured for a single read / write node would also double the amount of connections needed? Martin Tran advised that in this case, there is only a single connection pool (and thus the number of connections remains as before) Jeremy Huff asked what are the implication for running in a hybrid state (where some modules support read / write traffic splitting and others do no not)? Martin Tran suggested that each module that is moved to using separate read and write nodes reduces the load on the write node. Marc Johnson advised that one of the primary trade offs for having different tooling support different configuration options in this area is increased operations complexity for developers and operations folks needing to understand that different parts of the system are using read and write nodes and others are only using the write node and cognitive load of using different architectural patterns Craig McNally asked what the next steps are? Maccabee Levine suggested that Martin Tran update the ADR to reflect some of the questions / concerns raised during the conversation. Craig McNally advised that after those changes are made, the TC will reach a conclusion in this ADR either via Slack or in a future meeting. | |||
1 min | Upcoming meetings |
| ||||
* | Decision Log Review | Craig McNally /All | Moved to discussion for next week.
| |||
Topic Backlog | ||||||
20 min | WOLFcon Hot Topics | All | An overview was provided of the "hot topics" at WOLFcon. It seems clear that the TC ought to be involved in these discussions/efforts; what is the best way to participate?
Notes: Deferred | |||
Cyber Resilience Act | Craig McNally /All | From Craig McNally in #tech-council: This was brought to my attention earlier today...
Today: Deferred | ||||
Ease of Installing FOLIO | All / Ian Walls | From last week:
Today:
| ||||
Revisiting FOLIO Governance | All / Ian Walls | Slack discussion: Revisiting FOLIO Governance | ||||
...