Feat/remove-non-ispa #121

kassyray · 2025-11-18T20:29:01Z

This pull request introduces configuration-driven filtering of diseases during the preprocessing step of the pipeline. The main change is the ability to specify diseases to ignore via a config file, which then filters both the vaccine reference data and the records enriched during preprocessing. This improves flexibility and control over which diseases are included in downstream analysis.

Configuration and Filtering Enhancements:

Added support for loading an ignore_diseases list from the config file (parameters.yaml) in run_step_2_preprocess, and used it to filter out specified diseases from the vaccine reference before further processing.
Updated the preprocessing pipeline to pass the ignore_diseases parameter through to the build_preprocess_result and enrich_grouped_records functions, allowing ignored diseases to be excluded during enrichment. [1] [2] [3]

Disease Enrichment Logic:

Modified the enrich_grouped_records function to accept ignore_diseases and remove any diseases specified in this list from the enrichment results.
Adjusted the disease lookup and enrichment logic to ensure that records with vaccines not mapped in the filtered reference are removed, and improved normalization of vaccine names. [1] [2]

Preprocessing Pipeline Integration:

Ensured that the filtered vaccine reference and ignore list are consistently used throughout the preprocessing pipeline, including in artifact creation and enrichment.

These changes make the pipeline more configurable and robust by allowing easy exclusion of unwanted diseases via configuration.

…oll back point

…iour when ignore diseases that have multiple mappings.

…ole vaccine entries.

codecov · 2025-11-19T21:19:37Z

Codecov Report

❌ Patch coverage is 41.66667% with 14 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
pipeline/orchestrator.py	25.00%	8 Missing and 1 partial ⚠️
pipeline/preprocess.py	58.33%	3 Missing and 2 partials ⚠️

📢 Thoughts on this report? Let us know!

kassyray · 2025-11-21T16:13:41Z

Regarding this PR - the code pushed is specific to a feature request from a PHU. I think the logic will break the needs required by other PHUs.

Need to think of logic to handle various use cases and update the function accordingly.

… from the immunization histories of clients. This needs to be an optional request and should not yet be pushed to main

kassyray added 8 commits November 18, 2025 20:00

This almost works but the date is still showing up. Committing as a r…

438a3c1

…oll back point

Another checkpoint. This is working but there is some erroneous behav…

65362a7

…iour when ignore diseases that have multiple mappings.

Update filtering so ignored diseases are removed individually, not wh…

68eb4b8

…ole vaccine entries.

Adding tests

3bade2d

Adding tests

07b1de8

Adding tests

162c861

Adding tests

4cbb36b

Adding tests

2b5e68e

kassyray requested a review from jangevaare November 19, 2025 21:19

Adding a patch to have HPV and HepB vaxes map to the other col

f3d1bc6

Adding functionality to remove non-ispa diseases besides hpv and hepb…

73ef228

… from the immunization histories of clients. This needs to be an optional request and should not yet be pushed to main

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/remove-non-ispa #121

Feat/remove-non-ispa #121

Uh oh!

kassyray commented Nov 18, 2025 •

edited

Loading

Uh oh!

codecov bot commented Nov 19, 2025 •

edited

Loading

Uh oh!

kassyray commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feat/remove-non-ispa #121

Are you sure you want to change the base?

Feat/remove-non-ispa #121

Uh oh!

Conversation

kassyray commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

kassyray commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kassyray commented Nov 18, 2025 •

edited

Loading

codecov bot commented Nov 19, 2025 •

edited

Loading