

Search for: All records

Creators/Authors contains: "Kumar, Vijay"


  1. Imbalanced data, a common challenge in statistical analyses of clinical trial datasets and disease modeling, refers to the scenario in which one class significantly outnumbers the other in a binary classification problem. This imbalance can bias model performance toward the majority class and distort the apparent relative importance of predictive variables. Despite its prevalence, the existing literature lacks comprehensive studies of methods for handling imbalanced data effectively. In this study, we discuss the binary logistic model and its limitations when applied to imbalanced data. We propose a novel undersampling approach and apply it to publicly available data from the VITAL trial, a large-scale clinical trial examining the effects of vitamin D and omega-3 fatty acids, to investigate the relationship between vitamin D and cancer incidence in sub-populations defined by race/ethnicity and by demographic factors such as body mass index (BMI), age, and sex. Our results demonstrate a significant improvement in cancer incidence prediction after our undersampling method is applied to the data set. Both epidemiological and laboratory studies have suggested that vitamin D may lower the occurrence and death rate of cancer, but conflicting findings have been reported because of the difficulty of conducting large-scale clinical trials. We also fit logistic regression models within each ethnic sub-population to determine the impact of demographic factors on cancer incidence, with a particular focus on the role of vitamin D. This study provides a framework for using classification models to understand relative variable importance when dealing with imbalanced data.

     
    Free, publicly-accessible full text available September 28, 2024
  2. Free, publicly-accessible full text available June 4, 2024
  3. Free, publicly-accessible full text available June 4, 2024
  4. Free, publicly-accessible full text available May 1, 2024
  5. Liquid–vapor phase change, including evaporation, boiling, and condensation, is a ubiquitous process found in power generation, desalination, thermal management, building heating and cooling, and additive manufacturing. The dynamics of droplets and bubbles during phase change, including nucleation, growth, and departure, critically influence thermal transport performance and system efficiency. This review will highlight recent advancements using static and dynamic strategies to manipulate droplets and bubbles for phase change applications and beyond.
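
As a rough illustration of the class-imbalance problem described in the first abstract above, the sketch below shows plain random undersampling of the majority class. This is a generic textbook technique, not the authors' method, which the abstract does not specify. The function name `random_undersample`, the synthetic data, and the 950/50 class split are all assumptions made up for the example.

```python
import numpy as np

def random_undersample(X, y, seed=0):
    """Randomly drop rows of the larger class until both classes are equal in size.

    Hypothetical helper for illustration only; assumes y contains labels 0 and 1.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X)
    y = np.asarray(y)
    idx_pos = np.flatnonzero(y == 1)
    idx_neg = np.flatnonzero(y == 0)
    n_keep = min(len(idx_pos), len(idx_neg))
    # Sample without replacement so no row is duplicated.
    keep_pos = rng.choice(idx_pos, size=n_keep, replace=False)
    keep_neg = rng.choice(idx_neg, size=n_keep, replace=False)
    keep = np.sort(np.concatenate([keep_pos, keep_neg]))
    return X[keep], y[keep]

# Synthetic, made-up data: 950 negatives and 50 positives, 3 features.
data_rng = np.random.default_rng(42)
y = np.array([0] * 950 + [1] * 50)
X = data_rng.normal(size=(1000, 3))

X_bal, y_bal = random_undersample(X, y)
print("before:", np.bincount(y).tolist())      # class counts in the original data
print("after: ", np.bincount(y_bal).tolist())  # class counts after undersampling
```

A classifier such as logistic regression would then be fit on `X_bal` and `y_bal` rather than on the original data. Random undersampling discards majority-class rows, so it trades information for balance; whether that trade is worthwhile depends on the data set.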