Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma

Hu, Leland S.; Wang, Lujia; Hawkins-Daarud, Andrea; Eschbacher, Jennifer M.; Singleton, Kyle W.; Jackson, Pamela R.; Clark-Swanson, Kamala; Sereduk, Christopher P.; Peng, Sen; Wang, Panwen; Wang, Junwen; Baxter, Leslie C.; Smith, Kris A.; Mazza, Gina L.; Stokes, Ashley M.; Bendok, Bernard R.; Zimmerman, Richard S.; Krishna, Chandan; Porter, Alyx B.; Mrugala, Maciej M.; Hoxworth, Joseph M.; Wu, Teresa; Tran, Nhan L.; Swanson, Kristin R.; Li, Jing

doi:10.1038/s41598-021-83141-z

Abstract

Radiogenomics uses machine learning (ML) to directly connect the morphologic and physiological appearance of tumors on clinical imaging with underlying genomic features. Despite extensive growth in radiogenomics across many cancers, and its potential role in advancing clinical decision making, no published studies have directly addressed uncertainty in these model predictions. We developed a radiogenomics ML model that quantifies uncertainty using transductive Gaussian Processes (GP) and a unique dataset of 95 image-localized biopsies with spatially matched MRI from 25 untreated glioblastoma (GBM) patients. The model generated predictions for regional EGFR amplification status (a common and important target in GBM) to resolve the intratumoral genetic heterogeneity across each individual tumor, a key factor for future personalized therapeutic paradigms. The model used a probability distribution for each sample prediction to quantify uncertainty, and used transductive learning to reduce the overall uncertainty. We compared the predictive accuracy and uncertainty of the transductive learning GP model against a standard GP model using leave-one-patient-out cross validation. Additionally, we used a separate dataset containing 24 image-localized biopsies from 7 high-grade glioma patients to validate the model. Predictive uncertainty informed the likelihood of achieving an accurate sample prediction. When stratifying predictions by uncertainty, accuracy was 75% across the whole cohort (n = 95), rising to 83% amongst the sample predictions with the lowest uncertainty (n = 72) and falling to 48% amongst predictions with higher uncertainty (n = 23). The higher accuracy of the low-uncertainty predictions was due largely to data interpolation (rather than extrapolation). On the separate validation set, our model achieved 78% accuracy amongst the sample predictions with the lowest uncertainty.

We present a novel approach to quantifying radiogenomics uncertainty that enhances model performance and clinical interpretability. This should help integrate more reliable radiogenomics models for improved medical decision-making.
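
The stratified accuracies quoted in the abstract can be checked for internal consistency with a short calculation. The sketch below is illustrative only and is not the authors' code: the function names, the uncertainty threshold, and the toy records are hypothetical, and the abstract does not state how the uncertainty cutoff was chosen. Only the (accuracy, n) pairs 83%/72 and 48%/23 are taken from the text.

```python
def pooled_accuracy(groups):
    """Sample-weighted mean accuracy over (accuracy, n) pairs."""
    total = sum(n for _, n in groups)
    return sum(acc * n for acc, n in groups) / total


def stratify_by_uncertainty(records, threshold):
    """Split (uncertainty, is_correct) records at a threshold.

    Returns {"low": (accuracy, n), "high": (accuracy, n)}; accuracy is
    None for an empty group. The threshold is a hypothetical parameter.
    """
    low = [ok for u, ok in records if u <= threshold]
    high = [ok for u, ok in records if u > threshold]

    def summarize(group):
        return (sum(group) / len(group) if group else None, len(group))

    return {"low": summarize(low), "high": summarize(high)}


# Figures reported in the abstract: (accuracy, number of samples).
reported = [(0.83, 72), (0.48, 23)]
overall = pooled_accuracy(reported)
print(f"pooled accuracy: {overall:.3f}")  # close to the reported 75% for n = 95

# Toy records, invented purely to show the stratification step.
toy = [(0.1, True), (0.2, True), (0.9, False), (0.8, True)]
strata = stratify_by_uncertainty(toy, threshold=0.5)
print(strata)
```

Weighting each group by its sample count gives a pooled accuracy of about 74.5%, which agrees with the 75% reported for the full cohort of 95 samples after rounding.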
