Stephanie Wichuk1, Walter P. Maksymowych1, Joel Paschke2, Abhilash Hareendranathan3 and Jacob Jaremko1, 1University of Alberta, Edmonton, AB, Canada, 2CARE Arthritis, Edmonton, AB, Canada, 3Radiology and Diagnostic Imaging, Edmonton, AB, Canada
Background/Purpose: Bone marrow lesions (BML) are an important scoring feature on magnetic resonance imaging (MRI) of knee osteoarthritis (OA), and their presence in certain anatomical locations may be more closely related to various clinical measures than in others. At the same time, anatomical location may impact BML scoring reliability. The Knee Inflammation MRI Scoring System (KIMRISS) scores BML from fluid-sensitive sagittal MRI1. Readers place ready-made overlays onto scorable portions of the femur, tibia, and patella, demarcating medial, intercondylar, and lateral knee compartments. These overlays divide each scorable region into smaller grid squares which are evaluated for presence of BML on each slice. We aimed to evaluate 3-dimensional patterns of regional variation in inter-rater reliability for BML within these sagittally-placed KIMRISS grid elements when divided into subsections across the coronal plane. This granular analysis serves as a precursor to further study of the relationship of precise BML location to clinical measures and outcomes, as reliable data is required to ensure validity of such analyses.
Methods: MRI from 61 patients in the Osteoarthritis Initiative (OAI) dataset were scored by 8 trained readers using KIMRISS. Each grid overlay region in the femur and tibia (14 and 10 regions, respectively) was divided into 6 equal subregions medially to laterally, excluding the intercondylar region.We calculated an overall 8-reader interclass correlation coefficient (ICC) for each subregion and generated a 3D segmentation model of the knee to visualize variation in ICCs.
Results: Before any subdivision in the coronal plane, ICCs ≥ 0.70 were achieved in 61% of grid regions (Figure 1). Highest reliability was found trochlear region of the femur (ICCs 0.84-0.90), while lowest ICCs occurred in a small interior region of the femur [mean (SD) ICC=0.49 (0.26)] and the two posteriormost tibial regions [mean (SD) ICC=0.41 (0.26) and 0.31 (0.33)]. After further subdividing regions medially to laterally, highest overall reliability amongst all KIMRISS grid squares occurred in the central slices of the lateral compartment of the femur (median ICC=0.72) and the innermost slices of the medial compartment of the tibia (median ICC=0.42). (Figure 2, Table 1). Lowest overall reliability occurred in the central slices of the medial compartment of the femur (median ICC=0.15) and the medialmost slices of the medial compartment of the tibia (median ICC= 0.28)
Conclusion: While reduced ICCs are to be expected across the board in subdivided scoring regions compared to larger summed regions, this analysis highlights differences in inter-rater reliability over 3 dimensions of the knee. Any analysis of the relationship of clinical variables to precise BML location should consider the degree of scoring reliability at the scoring locations entered into the model. Certain regions may need to be excluded or enlarged to achieve inter-rater consistency.
Figure 1. KIMRISS grid region labels with 8-reader mean ICC from baseline knee MRIs from the Osteoarthritis Initiative dataset (n=61).
Figure 2. 3-D Heat Map of Femoral and Tibial BML scoring reliability in KIMRISS Grid squares when subdivided into 6 compartments laterally to medially, excluding intercondylar region.
Table 1. Median (IQR) 8-Reader ICC among all KIMRISS femoral and tibial subregions when divided into compartments in coronal plane.
S. Wichuk: None; W. Maksymowych: AbbVie, 2, 5, 6, BMS, 2, 6, Boehringer-Ingelheim, 2, CARE Arthritis Ltd, 4, CARE Arthritis Ltd., 4, Celgene, 2, 6, Eli Lilly, 2, 6, Galapagos, 2, 6, Gilead, 2, Janssen, 2, 6, Novartis, 2, 5, 6, Pfizer, 2, 5, 6, UCB, 2, 5, 6; J. Paschke: None; A. Hareendranathan: None; J. Jaremko: None.