Journal article

Evaluating the performance of five large language models in answering Delphi consensus questions relating to patellar instability and medial patellofemoral ligament reconstruction

Abstract

Purpose: Artificial intelligence (AI) has become incredibly popular over the past several years, with large language models (LLMs) offering the possibility of revolutionizing the way healthcare information is shared with patients. However, to prevent the spread of misinformation, analyzing the accuracy of answers from these LLMs is essential. This study will aim to assess the accuracy of five freely accessible chatbots by specifically evaluating …

Authors

Vivekanantha P; Cohen D; Slawaska-Eng D; Nagai K; Tarchala M; Matache B; Hiemstra L; Longstaffe R; Lesniak B; Meena A

Journal

BMC Musculoskeletal Disorders, Vol. 26, No. 1

Publisher

Springer Nature

DOI

10.1186/s12891-025-09227-1

ISSN

1471-2474