Preprint

Large Language Models and Non-Negative Matrix Factorization for Bioacoustic Signal Decomposition

Abstract

Large language models have shown a remarkable ability to extract meaning from unstructured data, offering new ways to interpret biomedical signals beyond traditional numerical methods. In this study, we present a matrix factorization framework for bioacoustic signal analysis that is enhanced by large language models. The framework targets bioacoustic signals that commonly overlap in clinical recordings, using matrix factorization to decompose the mixture into interpretable components. A large language model is then applied to the separated signals to associate distinct acoustic patterns with potential medical conditions such as cardiac rhythm disturbances or respiratory abnormalities. Recordings were obtained from a digital stethoscope applied to a clinical manikin to ensure a controlled, high-fidelity acquisition environment. This hybrid approach requires neither labeled data nor prior knowledge of source types, and it provides a more interpretable and accessible framework for clinical decision support. The method shows promise for integration into future intelligent diagnostic tools.
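
To make the decomposition step described in the abstract concrete, the following is a minimal sketch of non-negative matrix factorization applied to a non-negative matrix standing in for a magnitude spectrogram. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say which NMF variant, rank, or time-frequency representation was used, so the Lee-Seung multiplicative updates, the rank of 2, the synthetic input, and the helper name `nmf` are all hypothetical choices. The language-model stage is not shown.

```python
import numpy as np


def nmf(V, rank, n_iter=500, eps=1e-9, seed=0):
    """Factor a non-negative matrix V (freq x time) into W (freq x rank)
    and H (rank x time) so that V is approximately W @ H.

    Uses Lee-Seung multiplicative updates for the Frobenius-norm objective.
    This variant is an assumption; the abstract does not name one.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_time = V.shape
    # Strictly positive random initialisation keeps the updates well defined.
    W = rng.random((n_freq, rank)) + eps
    H = rng.random((rank, n_time)) + eps
    for _ in range(n_iter):
        # Each update multiplies by a non-negative ratio, so W and H
        # remain non-negative throughout.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Synthetic stand-in for a magnitude spectrogram of two overlapping sources
# (for example heart and lung sounds). Real input would come from a
# short-time Fourier transform of the stethoscope recording.
rng = np.random.default_rng(1)
W_true = rng.random((64, 2))    # hypothetical spectral templates
H_true = rng.random((2, 200))   # hypothetical activations over time
V = W_true @ H_true

W, H = nmf(V, rank=2)
rel_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
```

In this sketch each column of `W` plays the role of a spectral template and each row of `H` its activation over time, which is what makes the components interpretable. A per-source spectrogram estimate can be formed as `np.outer(W[:, k], H[k])` for component `k`.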

Authors

Torabi Y; Shirani S; Reilly JP

Publication date

July 12, 2025

DOI

10.48550/arxiv.2507.09161

Preprint server

arXiv