Multimodal Federated Learning for Robust Lung Cancer Prognosis

YusufBrima 37 views 13 slides Feb 26, 2025
Slide 1
Slide 1 of 13
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13

About This Presentation

Lung cancer is a leading cause of cancer-related deaths, yet accurate prognosis remains a challenge due to data limitations, privacy concerns, and institutional data silos. This presentation explores how federated learning—a decentralized machine learning approach—combined with multi-modal data ...


Slide Content

Multimodal Federated Learning for Robust Lung Cancer
Prognosis

AI & Data Science Group,
Department of Bioinformatics,
Fraunhofer Institute for Algorithms and Scientific Computing (SCAI)

February 19
th
, 2025
Yusuf Brima

Outline
●Introduction
●Background and Context
●Deep Learning in Diagnosis, Prognosis, and Treatment Monitoring
●Federated Learning Overview
●Unimodal Learning
●Multimodal Learning
●Combining Federated Learning with Multimodal Data Integration
●Prominent Challenges and Proposed Solutions
●Key Takeaways
2

Introduction


3
Lung Cancer Facts

●Lung cancer accounts for approximately 1.8
million deaths annually
2
●Cancer is a leading cause of death
worldwide, accounting for nearly 10 million
deaths in 2020
2
●Approximately 5.7 percent of men and women
will be diagnosed with lung and bronchus
cancer at some point during their lifetime
1
Source
[2] World Health Organization (WHO), "Lung cancer," https://www.who.int/news-room/fact-sheets/detail/cancer

Introduction


4

●Significance: Lung cancer is one of the leading
causes of cancer-related deaths worldwide
1
.
●Opportunity: The use of Deep Learning (DL)
for early detection/diagnosis/prognosis.
●Challenge: Limited (annotated) data
availability, trust, and privacy concerns hinder
its applications.
●Objective: Explore how federated learning
combined with multimodal data (EHR and
imaging) can enhance prognosis accuracy
while preserving privacy.
Source
[1] National Cancer Institute (NCI), "Cancer Stat Facts: Lung and Bronchus Cancer," https://seer.cancer.gov/statfacts/html/lungb.html

Contextualizing Deep Learning in Diagnosis, Prognosis, and
Treatment Monitoring


●Diagnosis:
○Definition: The identification of a health condition or disease via examination.
○Goal: Detect the presence of a disease.
○Example: Identifying a lung tumor from imaging (e.g., chest X-ray or CT scans).
●Prognosis:
○Definition: Predicting the future course or outcome of a disease.
○Goal: Forecast the disease progression and long-term outlook.
○Example: Predicting 5-year survival rates for lung cancer patients.
●Treatment Monitoring:
○Definition: Assessing the effectiveness of a treatment over time.
○Goal: Track patient response and adjust treatment as needed.
○Example: Monitoring tumor shrinkage during chemotherapy.
5

Motivation for Lung Cancer Prognosis




●Supports personalized treatment strategies.
●Recent works, such as
3,4
, have shown that multi-modal representation
learning can significantly improve survival prediction in lung cancer
by combining structured gnomic data with imaging (CT and/or PET)
data.
●Current Limitations:
○Data silos across institutions impede large-scale analysis for robust and
generalizable predictive models.
6
[3] Farooq, A., Mishra, D. and Chaudhury, S., 2024. Survival Prediction in Lung Cancer through Multi-Modal Representation Learning. arXiv preprint arXiv:2409.20179.
[4] Z. Ning et al., "Multi-Constraint Latent Representation Learning for Prognosis Analysis Using Multi-Modal Data," in IEEE Transactions on Neural Networks and Learning Systems, vol. 34, no. 7, pp.
3737-3750, July 2023, doi: 10.1109/TNNLS.2021.3112194.

Federated Learning Overview


●Definition: A decentralized ML
approach where data remains
local, and only model updates
are shared.
●Benefits in Healthcare:
○Preserves patient privacy.
○Leads to more robust ML systems
○Enables inter-institutional
collaboration.
○Key Mechanism: Federated
averaging (e.g., FedAvg).
7

Unimodal Representation Learning




●EHR-Only Model: Predict outcomes
based on structured clinical data.
○Strength: Often low-dimensional, thus less
computationally intensive and models if
traditional are interpretable and explainable.
○Limitation: Misses spatial information from
imaging.
●Imaging-Only Model: Leverage CT
scans for feature extraction.
○Strength: Visual representation of tumors.
○Limitation: Lack of contextual patient
information, computational intensive and
often less interpretable/explainable.
8

Multimodal Representation Learning




●Definition: Combining data
from multiple modalities to
enhance performance.
●Methods:
○Feature-level fusion (e.g.,
concatenation).
○Decision-level fusion (e.g.,
ensemble models).
●Advantages:
○Richer representation (e.g., more
holistic view of a patient state).
○Improved generalization across
diverse cases.
9

Combining Federated Learning with Multimodal Data Integration






●Framework:
○Local multimodal training on structured and imaging data at
participating institutions.
○Federated aggregation of model weights.
●Key benefits:
○Preserves data privacy.
○Leverages complementary strengths of different modalities to enable
better robustness and generalization.
10

Key Challenges and Proposed Solutions

●Challenges:
○Limited and imbalanced datasets.
○Data heterogeneity across institutions.
○Communication overhead for federated learning and
computational cost for multimodal learning.
●Proposed Solutions:
○Data augmentation, class weighting, regularization and other
training recipes.
○Efficient model updates using sparsification (e.g., quantization,
pruning, distillation, etc).
11

Key Takeaways




●Federated learning enables privacy-preserving model development.
●Multimodal integration enhances the robustness and generalization of
lung cancer prognosis models.
●Addressing challenges such as data scarcity and heterogeneity is crucial
for success.
●Addressing the highlighted challenges and combining these two
techniques could enable more personalized prognosis of cancer to help
improve patient outcomes.
12

Thank you!
13