Multimodal Artificial Intelligence in Medicine

Abstract

Traditional medical artificial intelligence models that are approved for clinical use restrict themselves to single-modal data ( e.g ., images only), limiting their applicability in the complex, multimodal environment of medical diagnosis and treatment. Multimodal transformer models in health care can effectively process and interpret diverse data forms, such as text, images, and structured data. They have demonstrated impressive performance on standard benchmarks, like United States Medical Licensing Examination question banks, and continue to improve with scale. However, the adoption of these advanced artificial intelligence models is not without challenges. While multimodal deep learning models like transformers offer promising advancements in health care, their integration requires careful consideration of the accompanying ethical and environmental challenges.

Authors

Judge CS; Krewer F; O'Donnell MJ; Kiely L; Sexton D; Taylor GW; Skorburg JA; Tripp B

Journal

Kidney360, Vol. 5, No. 11, pp. 1771–1779

Publisher

Wolters Kluwer

Publication Date

November 1, 2024

DOI

10.34067/kid.0000000000000556

ISSN

2641-7650

Associated Experts

Martin O'Donnell

Associate Clinical Professor, Faculty of Health Sciences

Visit profile

Labels

Multimodal Artificial Intelligence in Medicine

Abstract

Authors

Journal

Publisher

Publication Date

DOI

ISSN

Associated Experts

Labels

Fields of Research (FoR)

Sustainable Development Goals (SDG)

Medical Subject Headings (MeSH)

Contact the Experts team