Radiographic Arthrosis After Elbow Trauma: Interobserver Reliability Journal Articles uri icon

  • Overview
  • Research
  • Identity
  • Additional Document Info
  • View All


  • PURPOSE: This study measured observer variation in radiographic rating of elbow arthrosis. METHODS: Thirty-seven independent orthopedic surgeons graded the extent of elbow arthrosis in 20 consecutive sets of plain radiographs, according to the Broberg and Morrey rating system (grade 0, normal joint; grade 1, slight joint-space narrowing with minimum osteophyte formation; grade 2, moderate joint-space narrowing with moderate osteophyte formation; and grade 3, severe degenerative change with gross destruction of the joint). The kappa multirater measure (κ) was used to estimate reliability between observers, with 0 indicating no agreement above chance, and 1 indicating perfect agreement. RESULTS: There was fair agreement in arthrosis ratings between surgeons. Surgeons with more than 10 years of experience had greater agreement than did surgeons with less experience, and surgeons who treated more than 10 elbow fractures per year had better agreement than did those treating fewer fractures. In post hoc analyses, 2 simplified binary rating systems (eg, "none or mild" vs "moderate or severe" arthrosis) resulted in moderate agreement among observers. CONCLUSIONS: The 4 grades of the Broberg and Morrey classification system have only fair interobserver reliability that is influenced by subspecialty and experience. Binary rating systems might be more reliable. TYPE OF STUDY/LEVEL OF EVIDENCE: Diagnostic III.


  • Lindenhovius, Anneluuk
  • Karanicolas, Paul Jack
  • Bhandari, Mohit
  • Ring, David

publication date

  • April 2012