Predictive overfitting in immunological applications: Pitfalls and solutions

Gygi, Jeremy P.; Kleinstein, Steven H.; Guan, Leying

doi:10.1080/21645515.2023.2251830

Citation Details

Predictive overfitting in immunological applications: Pitfalls and solutions

Overfitting describes the phenomenon where a highly predictive model on the training data generalizes poorly to future observations. It is a common concern when applying machine learning techniques to contemporary medical applications, such as predicting vaccination response and dis-ease status in infectious disease or cancer studies. This review examines the causes of overfitting and offers strategies to counteract it, focusing on model complexity reduction, reliable model evaluation, and harnessing data diversity. Through discussion of the underlying mathematical models and illustrative examples using both synthetic data and published real datasets, our objective is to equip analysts and bioinformaticians with the knowledge and tools necessary to detect and mitigate overfitting in their research. more »

Award ID(s):: 2310836

PAR ID:: 10470206

Author(s) / Creator(s):: Gygi, Jeremy P.; Kleinstein, Steven H.; Guan, Leying

Publisher / Repository:: Taylor & Francis

Date Published:: 2023-08-01

Journal Name:: Human Vaccines & Immunotherapeutics

Volume:: 19

Issue:: 2

ISSN:: 2164-5515

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1080/21645515.2023.2251830

More Like this