Although several classifications are used to assess radiographs following radial head arthroplasty (RHA), including the Popovic classification for radiolucency, the Chanlalit classification for stress shielding (SS), the Brooker classification for heterotopic ossification (HO), and the Broberg-Morrey classification for radiocapitellar arthritis, little is known about the reliability of these classification systems. The purpose of this study was to determine the interobserver and intraobserver reliability of these classifications. Six orthopedic surgeons at various levels of training reviewed elbow radiographs of 20 patients who underwent RHA and classified them according to the Popovic, Chanlalit, Brooker, and Broberg-Morrey classifications for radiolucency, SS, HO, and RHA, respectively. Four weeks after initial review, radiographic reviews were repeated. Reliability was measured using the Fleiss kappa and the intraclass correlation coefficient. Agreement was interpreted as none (<0), slight (0.01-0.2), fair (0.21-0.4), moderate (0.41-0.6), substantial (0.61-0.8), and almost perfect (0.81-1) based on agreement among attending surgeons. Among fellowship-trained attending surgeons, interobserver reliability was slight for SS (Chanlalit) and the categorical interpretation of radiolucency (Popovic), fair for radiocapitellar arthritis (Broberg-Morrey) and HO (Brooker), and substantial for the ordinal interpretation of radiolucency (Popovic). Residents had a higher interobserver reliability than attending physicians when using the Brooker classification. Mean intraobserver reliability was fair for SS (Chanlalit) and the categorical interpretation of radiolucency (Popovic), moderate for HO (Brooker) and radiocapitellar arthritis (Broberg-Morrey), and almost perfect for the ordinal interpretation of radiolucency (Popovic). Trainees had higher intraobserver reliability than attending surgeons using the SS (Chanlalit) classification. The number of Popovic zones is reliable for communication between physicians, but caution should be taken with the Brooker, Chanlalit, Broberg-Morrey, and categorical interpretation of the Popovic classifications. All the classifications had better intraobserver than interobserver reliability. Reliability of classification systems for radiographic complications after RHA is less than substantial except the number of zones of radiolucency; therefore, caution is required when drawing conclusions based on these classifications.
Read full abstract