An explainable longitudinal multi-modal fusion...

An explainable longitudinal multi-modal fusion model for predicting neoadjuvant therapy response in women with breast cancer

Abstract

Multi-modal image analysis using deep learning (DL) lays the foundation for neoadjuvant treatment (NAT) response monitoring. However, existing methods prioritize extracting multi-modal features to enhance predictive performance, with limited consideration on real-world clinical applicability, particularly in longitudinal NAT scenarios with multi-modal data. Here, we propose the Multi-modal Response Prediction (MRP) system, designed to mimic real-world physician assessments of NAT responses in breast cancer. To enhance feasibility, MRP integrates cross-modal knowledge mining and temporal information embedding strategy to handle missing modalities and remain less affected by different NAT settings. We validated MRP through multi-center studies and multinational reader studies. MRP exhibited comparable robustness to breast radiologists, outperforming humans in predicting pathological complete response in the Pre-NAT phase (ΔAUROC 14% and 10% on in-house and external datasets, respectively). Furthermore, we assessed MRP’s clinical utility impact on treatment decision-making. MRP may have profound implications for enrolment into NAT trials and determining surgery extensiveness.

Authors

Gao Y; Ventura-Diaz S; Wang X; He M; Xu Z; Weir A; Zhou H-Y; Zhang T; van Duijnhoven FH; Han L

Journal

Nature Communications, Vol. 15, No. 1,

Publisher

Springer Nature

Publication Date

December 1, 2024

DOI

10.1038/s41467-024-53450-8

ISSN

2041-1723

Associated Experts

Sofia Ventura Diaz

Assistant Professor, Faculty of Health Sciences

Visit profile

Labels