Journal article
Evaluating the performance of five large language models in answering Delphi consensus questions relating to patellar instability and medial patellofemoral ligament reconstruction
Abstract
Purpose: Artificial intelligence (AI) has become incredibly popular over the past several years, with large language models (LLMs) offering the possibility of revolutionizing the way healthcare information is shared with patients. However, to prevent the spread of misinformation, analyzing the accuracy of answers from these LLMs is essential. This study will aim to assess the accuracy of five freely accessible chatbots by specifically evaluating …
Authors
Vivekanantha P; Cohen D; Slawaska-Eng D; Nagai K; Tarchala M; Matache B; Hiemstra L; Longstaffe R; Lesniak B; Meena A
Journal
BMC Musculoskeletal Disorders, Vol. 26, No. 1
Publisher
Springer Nature
DOI
10.1186/s12891-025-09227-1
ISSN
1471-2474