NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Generalizability of a Machine Learning Model for Improving Utilization of Parathyroid Hormone-Related Peptide Testing across Multiple Clinical Centers

https://doi.org/10.1093/clinchem/hvad141

Yang, He S; Pan, Weishen; Wang, Yingheng; Zaydman, Mark A; Spies, Nicholas C; Zhao, Zhen; Guise, Theresa A; Meng, Qing H; Wang, Fei (September 2023, Clinical Chemistry)

Abstract BackgroundMeasuring parathyroid hormone-related peptide (PTHrP) helps diagnose the humoral hypercalcemia of malignancy, but is often ordered for patients with low pretest probability, resulting in poor test utilization. Manual review of results to identify inappropriate PTHrP orders is a cumbersome process. MethodsUsing a dataset of 1330 patients from a single institute, we developed a machine learning (ML) model to predict abnormal PTHrP results. We then evaluated the performance of the model on two external datasets. Different strategies (model transporting, retraining, rebuilding, and fine-tuning) were investigated to improve model generalizability. Maximum mean discrepancy (MMD) was adopted to quantify the shift of data distributions across different datasets. ResultsThe model achieved an area under the receiver operating characteristic curve (AUROC) of 0.936, and a specificity of 0.842 at 0.900 sensitivity in the development cohort. Directly transporting this model to two external datasets resulted in a deterioration of AUROC to 0.838 and 0.737, with the latter having a larger MMD corresponding to a greater data shift compared to the original dataset. Model rebuilding using site-specific data improved AUROC to 0.891 and 0.837 on the two sites, respectively. When external data is insufficient for retraining, a fine-tuning strategy also improved model utility. ConclusionsML offers promise to improve PTHrP test utilization while relieving the burden of manual review. Transporting a ready-made model to external datasets may lead to performance deterioration due to data distribution shift. Model retraining or rebuilding could improve generalizability when there are enough data, and model fine-tuning may be favorable when site-specific data is limited.
more » « less
Full Text Available
Multicenter target trial emulation to evaluate corticosteroids for sepsis stratified by predicted organ dysfunction trajectory

https://doi.org/10.1038/s41467-025-59643-z

Rajendran, Suraj; Xu, Zhenxing; Pan, Weishen; Zang, Chengxi; Siempos, Ilias; Torres, Lisa; Xu, Jie; Bian, Jiang; Schenck, Edward J; Wang, Fei (December 2025, Nature Communications)

Free, publicly-accessible full text available December 1, 2026
Federated target trial emulation using distributed observational data for treatment effect estimation

https://doi.org/10.1038/s41746-025-01803-y

Li, Haoyang; Zang, Chengxi; Xu, Zhenxing; Pan, Weishen; Rajendran, Suraj; Chen, Yong; Wang, Fei (December 2025, npj Digital Medicine)

Free, publicly-accessible full text available December 1, 2026
Unified Insights: Harnessing Multi-modal Data for Phenotype Imputation via View Decoupling

Zhang, Qiannan; Pan, Weishen; Bai, Zilong; Su, Chang; Wang, Fei (June 2025, Advances in neural information processing systems)

Free, publicly-accessible full text available June 5, 2026
Identifying progression subphenotypes of Alzheimer’s disease from large-scale electronic health records with machine learning

https://doi.org/10.1016/j.jbi.2025.104820

Zhou, Manqi; Tang, Alice S; Zhang, Hao; Xu, Zhenxing; Ke, Alison MC; Su, Chang; Huang, Yu; Mantyh, William G; Jaffee, Michael S; Rankin, Katherine P; et al (May 2025, Journal of Biomedical Informatics)

Free, publicly-accessible full text available May 1, 2026
Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health

https://doi.org/10.1145/3706598.3713852

Wang, Xingbo; Griffith, Janessa; Adler, Daniel A; Castillo, Joey; Choudhury, Tanzeem; Wang, Fei (April 2025, ACM)

Free, publicly-accessible full text available April 25, 2026
Local Causal Discovery for Structural Evidence of Direct Discrimination

https://doi.org/10.1609/aaai.v39i18.34130

Maasch, Jacqueline; Gan, Kyra; Chen, Violet; Orfanoudaki, Agni; Akpinar, Nil-Jana; Wang, Fei (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Identifying the causal pathways of unfairness is a critical objective for improving policy design and algorithmic decision-making. Prior work in causal fairness analysis often requires knowledge of the causal graph, hindering practical applications in complex or low-knowledge domains. Moreover, global discovery methods that learn causal structure from data can display unstable performance on finite samples, preventing robust fairness conclusions. To mitigate these challenges, we introduce local discovery for direct discrimination (LD3): a method that uncovers structural evidence of direct unfairness by identifying the causal parents of an outcome variable. LD3 performs a linear number of conditional independence tests relative to variable set size, and allows for latent confounding under the sufficient condition that all parents of the outcome are observed. We show that LD3 returns a valid adjustment set (VAS) under a new graphical criterion for the weighted controlled direct effect, a qualitative indicator of direct discrimination. LD3 limits unnecessary adjustment, providing interpretable VAS for assessing unfairness. We use LD3 to analyze causal fairness in two complex decision systems: criminal recidivism prediction and liver transplant allocation. LD3 was more time-efficient and returned more plausible results on real-world data than baselines, which took 46× to 5870× longer to execute.
more » « less
Free, publicly-accessible full text available April 11, 2026
Automated multi-scale computational pathotyping (AMSCP) of inflamed synovial tissue

https://doi.org/10.1038/s41467-024-51012-6

Bell, Richard D; Brendel, Matthew; Konnaris, Maxwell A; Xiang, Justin; Otero, Miguel; Fontana, Mark A; Bai, Zilong; Albrecht, Jennifer; Apruzzese, William; Boyce, Brendan F; et al (December 2024, Nature Communications)

Free, publicly-accessible full text available December 1, 2025
Local Discovery by Partitioning: Polynomial-Time Causal Discovery Around Exposure-Outcome Pairs

Maasch, Jacqueline RMA; Pan, Weishen; Gupta, Shantanu; Kuleshov, Volodymyr; Gan, Kyra; Wang, Fei (July 2024, Proceedings of The 40th Conference on Uncertainty in Artificial Intelligence.)

Full Text Available
Implementing AI models in clinical workflows: a roadmap

https://doi.org/10.1136/bmjebm-2023-112727

Wang, Fei; Beecy, Ashley (June 2024, BMJ Evidence-Based Medicine)

Full Text Available

« Prev Next »

Search for: All records