Steps
- Gather existing issues (Vladimir Shalaev, Kateryna Senchenko)
- Create new features (Vladimir Shalaev, Kateryna Senchenko) - review UXPROD-3135, add links to the corresponding issues in the table
- Provide feature dependencies (Vladimir Shalaev, Kateryna Senchenko)
- Estimate (priorities + complexity) (Vladimir Shalaev, Kateryna Senchenko)
- Remove duplicates (grooming with Ann-Marie)
- Final priorities
- Align to the timeline, assign to the appropriate Jira feature, and review Jira issue priorities (Taisiya Trunova)
Categories
See: Assessment ratings
- Performance: di-performance
- Stability/Reliability: di-data-integrity (more tags to be added)
- Scalability
- Architecture
- Code quality
Priorities
High, Mid, Low
Complexity
S, M, L, XL, XXL
Table
| # | Category | Problem definition | Business impact | Proposed solution | Priority DEV | Priority PO | Complexity | Existing Jira item(s) | Final feature(s) |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Performance | Kafka producer is closed after every send | Low performance of import | Create a pool of active producers: start the pool on module launch, close it on shutdown, reuse connections, add max/min pool sizes (see producer pool sketch below) | High | | L | | |
| 2 | | WARN message when no handler is found | None | Do not subscribe to messages the module is not going to process, or lower the log level for this type of message | Low | | S | | |
| 3 | Stability/Reliability | Race condition on start (Kafka consumers start working before the DB is configured) | Imports might get stuck on module restart | Needs investigation/verification | Low | | M | | |
| 4 | Performance, Stability/Reliability | High CPU/memory consumption in modules | Low performance of import; higher hosting costs | Significantly decrease the size of the payload | High | | XXL | | |
| 5 | Performance | Kafka cache resource consumption | Low performance of import; higher hosting costs | Remove the Kafka cache. Modules that do not make persistent changes will occasionally (on duplicate reads) make unnecessary calls; this can be optimized further by adding a distributed in-memory cache (e.g. Hazelcast). Blocked by #6 | Mid | | M | | |
| 6 | Stability/Reliability | Duplicates created upon import | Data inconsistency on import | Make consumers idempotent: add a pass-through identifier to de-duplicate messages (see de-duplication sketch below) | High | | XL | | |
| 7 | Stability/Reliability | Kafka consumers eventually stop reading messages, blocking job progress until the module is restarted | Imports eventually get stuck until module restart | Needs investigation | High | | ? | | |
| 8 | Stability/Reliability | Unit test coverage is not high enough | Higher number of bugs | Write more tests | Mid | | S | | |
| 9 | Stability/Reliability | Karate test coverage is not high enough | Higher number of bugs | Write more tests (define test cases) | Mid | | L | | |
| 10 | Stability/Reliability | mod-data-import stores the input file in memory, limiting the size of the uploaded file and risking OOM errors | Data import file size is limited | Split the file into chunks, store them in the database, and process from the database/temp storage. Partially done (to be investigated); see chunked upload sketch below | Mid | | L | | |
| 11 | Performance | Data import impacts other processes | Slower system response during data import | Needs investigation (possible solution: configure a rate limiter) | | | | | |
| 12 | Performance | High resource consumption to get job status/progress | Slow performance of import and the landing page | Add caching for progress tracking, either database-backed or in-memory (see progress cache sketch below) | Low | | S | | |
| 13 | Stability/Reliability | SRS can fail when processing a message during import | Import can end up creating some instances but not creating holdings/items for some MARC records | Generate "INSTANCE CREATED" from mod-inventory; consume it in SRS to update the HRID in the BIB record and in inventory to continue processing. Remove unnecessary topics (* ready for post processing, and hrid set) | Mid | | L | | |
| 14 | Stability/Reliability | Periodic DB shutdown after SRS restart; jobs get stuck if they cannot update status in the DB (messages are ACKed even if they could not be processed) | DI jobs sometimes get stuck if SRS was restarted during a job run | Investigate the DB issue. Do not ACK messages in Kafka when the failure is an infrastructure error/exception rather than a business-logic one (see offset-commit sketch below) | Mid | | | | |
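Row 1 (producer pool): a minimal sketch of reusing Kafka producers instead of closing one after every send, assuming the standard `org.apache.kafka:kafka-clients` API. The `ProducerPool` name, the fixed pool size, and the borrow/release methods are illustrative assumptions, not existing module code.

```java
// Minimal producer-pool sketch (illustrative, not existing module code).
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.Producer;

import java.util.Properties;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

public class ProducerPool implements AutoCloseable {

  private final BlockingQueue<Producer<String, String>> pool;

  // Create the producers up front (module launch) and reuse them for every send.
  public ProducerPool(Properties producerConfig, int poolSize) {
    this.pool = new LinkedBlockingQueue<>(poolSize);
    for (int i = 0; i < poolSize; i++) {
      pool.add(new KafkaProducer<>(producerConfig));
    }
  }

  // Borrow a producer; the caller returns it via release() instead of closing it.
  public Producer<String, String> borrow() throws InterruptedException {
    return pool.take();
  }

  public void release(Producer<String, String> producer) {
    pool.offer(producer);
  }

  // Close all producers once, on module shutdown.
  @Override
  public void close() {
    pool.forEach(Producer::close);
  }
}
```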
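Row 6 (de-duplication): one possible way to make consumers behave idempotently using a pass-through event identifier, assuming a PostgreSQL-backed module. The `processed_events` table and `DeduplicatingHandler` class are hypothetical names used only for this sketch.

```java
// Consumer-side de-duplication by pass-through event id (sketch).
// Assumes PostgreSQL (ON CONFLICT) and a configured JDBC DataSource.
import javax.sql.DataSource;
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;

public class DeduplicatingHandler {

  private final DataSource dataSource;

  public DeduplicatingHandler(DataSource dataSource) {
    this.dataSource = dataSource;
  }

  // Returns true only the first time the given eventId is seen, so the
  // business logic behind it runs once even if Kafka re-delivers the message.
  public boolean markProcessed(String eventId) throws SQLException {
    String sql = "INSERT INTO processed_events (event_id) VALUES (?) ON CONFLICT DO NOTHING";
    try (Connection conn = dataSource.getConnection();
         PreparedStatement ps = conn.prepareStatement(sql)) {
      ps.setString(1, eventId);
      return ps.executeUpdate() == 1; // 0 rows inserted => duplicate, skip processing
    }
  }
}
```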
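Row 10 (chunked upload): a storage-agnostic sketch of streaming an uploaded file to persistent storage in chunks instead of buffering the whole file in memory. The `ChunkStore` interface and the 1 MiB chunk size are assumptions for illustration.

```java
// Streaming upload in fixed-size chunks so memory use stays constant
// regardless of file size (illustrative sketch).
import java.io.IOException;
import java.io.InputStream;

public class ChunkedFileUploader {

  private static final int CHUNK_SIZE = 1024 * 1024; // 1 MiB per chunk (assumed)

  public interface ChunkStore {
    void saveChunk(String uploadId, int chunkNumber, byte[] data) throws IOException;
  }

  // Reads the request body as a stream and persists it chunk by chunk;
  // chunks may be shorter than CHUNK_SIZE, e.g. the final one.
  public int upload(String uploadId, InputStream body, ChunkStore store) throws IOException {
    byte[] buffer = new byte[CHUNK_SIZE];
    int chunkNumber = 0;
    int read;
    while ((read = body.read(buffer)) != -1) {
      byte[] chunk = new byte[read];
      System.arraycopy(buffer, 0, chunk, 0, read);
      store.saveChunk(uploadId, chunkNumber++, chunk);
    }
    return chunkNumber; // total number of chunks written
  }
}
```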
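Row 12 (progress cache): an illustrative in-memory, TTL-based cache for job progress lookups, so repeated status polls do not hit the database every time. The `JobProgress` shape and the loader function are assumptions for the sketch.

```java
// TTL-based in-memory cache for job progress (illustrative sketch).
import java.time.Duration;
import java.time.Instant;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

public class ProgressCache {

  public record JobProgress(int processed, int total) { }

  private record Entry(JobProgress progress, Instant loadedAt) { }

  private final Map<String, Entry> cache = new ConcurrentHashMap<>();
  private final Duration ttl;
  private final Function<String, JobProgress> loader; // falls back to the database

  public ProgressCache(Duration ttl, Function<String, JobProgress> loader) {
    this.ttl = ttl;
    this.loader = loader;
  }

  public JobProgress get(String jobExecutionId) {
    Entry entry = cache.get(jobExecutionId);
    if (entry == null || entry.loadedAt().plus(ttl).isBefore(Instant.now())) {
      // Cache miss or stale entry: reload from the database and remember it.
      entry = new Entry(loader.apply(jobExecutionId), Instant.now());
      cache.put(jobExecutionId, entry);
    }
    return entry.progress();
  }
}
```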
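Row 14 (offset commits): a rough sketch of the proposed commit policy with plain `kafka-clients`: offsets are committed only after a polled batch is fully processed, and nothing is committed on an infrastructure failure, so the records are delivered again after restart. `RecordHandler` and `InfrastructureException` are hypothetical types, not existing module classes.

```java
// Commit offsets only for successfully processed batches (illustrative sketch).
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

import java.time.Duration;

public class ReliableConsumerLoop {

  public void run(KafkaConsumer<String, String> consumer, RecordHandler handler) {
    while (true) {
      ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(500));
      try {
        for (ConsumerRecord<String, String> record : records) {
          handler.handle(record); // may throw InfrastructureException (e.g. DB down)
        }
        consumer.commitSync(); // ACK only after the whole batch succeeded
      } catch (InfrastructureException e) {
        // Do not commit and stop consuming: after the consumer (or module) restarts,
        // polling resumes from the last committed offset, so the failed records
        // are delivered again instead of being silently lost.
        break;
      }
    }
  }

  public interface RecordHandler {
    void handle(ConsumerRecord<String, String> record) throws InfrastructureException;
  }

  public static class InfrastructureException extends Exception {
    public InfrastructureException(String message, Throwable cause) {
      super(message, cause);
    }
  }
}
```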
Filters