skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Collaborative inference for treatment effect with distributed data‐sharing management in multicenter studies
Data sharing barriers present paramount challenges arising from multicenter clinical studies where multiple data sources are stored and managed in a distributed fashion at different local study sites. Merging such data sources into a common data storage for a centralized statistical analysis requires a data use agreement, which is often time‐consuming. Data merging may become more burdensome when propensity score modeling is involved in the analysis because combining many confounding variables, and systematic incorporation of this additional modeling in a meta‐analysis has not been thoroughly investigated in the literature. Motivated from a multicenter clinical trial of basal insulin treatment for reducing the risk of post‐transplantation diabetes mellitus, we propose a new inference framework that avoids the merging of subject‐level raw data from multiple sites at a centralized facility but needs only the sharing of summary statistics. Unlike the architecture of federated learning, the proposed collaborative inference does not need a center site to combine local results and thus enjoys maximal protection of data privacy and minimal sensitivity to unbalanced data distributions across data sources. We show theoretically and numerically that the new distributed inference approach has little loss of statistical power compared to the centralized method that requires merging the entire data. We present large‐sample properties and algorithms for the proposed method. We illustrate its performance by simulation experiments and the motivating example on the differential average treatment effect of basal insulin to lower risk of diabetes among kidney‐transplant patients compared to the standard‐of‐care.  more » « less
Award ID(s):
2113564
PAR ID:
10544001
Author(s) / Creator(s):
; ;
Publisher / Repository:
Wiley
Date Published:
Journal Name:
Statistics in Medicine
Volume:
43
Issue:
11
ISSN:
0277-6715
Page Range / eLocation ID:
2263 to 2279
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Loh, Po-Ling (Ed.)
    When data are distributed across multiple sites or machines rather than centralized in one location, researchers face the challenge of extracting meaningful information without directly sharing individual data points. While there are many distributed methods for point estimation using sparse regression, few options are available for estimating uncertainties or conducting hypothesis tests based on the estimated sparsity. In this paper, we introduce a procedure for performing selective inference with distributed data. We consider a scenario where each local machine solves a lasso problem and communicates the selected predictors to a central machine. The central machine then aggregates these selected predictors to form a generalized linear model (GLM). Our goal is to provide valid inference for the selected GLM while reusing data that have been used in the model selection process. Our proposed procedure only requires low-dimensional summary statistics from local machines, thus keeping communication costs low and preserving the privacy of individual data sets. Furthermore, this procedure can be applied in scenarios where model selection is repeatedly conducted on randomly subsampled data sets, addressing the p-value lottery problem linked with model selection. We demonstrate the effectiveness of our approach through simulations and an analysis of a medical data set on ICU admissions. 
    more » « less
  2. Abstract BackgroundEffective diabetes management requires precise glycemic control to prevent both hypoglycemia and hyperglycemia, yet existing machine learning (ML) and reinforcement learning (RL) approaches often fail to balance competing objectives. Traditional RL-based glucose regulation systems primarily focus on single-objective optimization, overlooking factors such as minimizing insulin overuse, reducing glycemic variability, and ensuring patient safety. Furthermore, these approaches typically rely on centralized data processing, which raises privacy concerns due to the sensitive nature of health care data. There is a critical need for a decentralized, privacy-preserving framework that can personalize blood glucose regulation while addressing the multiobjective nature of diabetes management. ObjectiveThis study aimed to develop and validate PRIMO-FRL (Privacy-Preserving Reinforcement Learning for Individualized Multi-Objective Glycemic Management Using Federated Reinforcement Learning), a novel framework that optimizes clinical objectives—maximizing time in range (TIR), reducing hypoglycemia and hyperglycemia, and minimizing glycemic risk—while preserving patient privacy. MethodsWe developed PRIMO-FRL, integrating multiobjective reward shaping to dynamically balance glucose stability, insulin efficiency, and risk reduction. The model was trained and tested using simulated data from 30 simulated patients (10 children, 10 adolescents, and 10 adults) generated with the Food and Drug Administration (FDA)–approved UVA/Padova simulator. A comparative analysis was conducted against state-of-the-art RL and ML models, evaluating performance using metrics such as TIR, hypoglycemia (<70 mg/dL), hyperglycemia (>180 mg/dL), and glycemic risk scores. ResultsThe PRIMO-FRL model achieved a robust overall TIR of 76.54%, with adults demonstrating the highest TIR at 81.48%, followed by children at 77.78% and adolescents at 70.37%. Importantly, the approach eliminated hypoglycemia, with 0.0% spent below 70 mg/dL across all cohorts, significantly outperforming existing methods. Mild hyperglycemia (180-250 mg/dL) was observed in adolescents (29.63%), children (22.22%), and adults (18.52%), with adults exhibiting the best control. Furthermore, the PRIMO-FRL approach consistently reduced glycemic risk scores, demonstrating improved safety and long-term stability in glucose regulation.. ConclusionsOur findings highlight the potential of PRIMO-FRL as a transformative, privacy-preserving approach to personalized glycemic management. By integrating federated RL, this framework eliminates hypoglycemia, improves TIR, and preserves data privacy by decentralizing model training. Unlike traditional centralized approaches that require sharing sensitive health data, PRIMO-FRL leverages federated learning to keep patient data local, significantly reducing privacy risks while enabling adaptive and personalized glucose control. This multiobjective optimization strategy offers a scalable, secure, and clinically viable solution for real-world diabetes care. The ability to train personalized models across diverse populations without exposing raw data makes PRIMO-FRL well-suited for deployment in privacy-sensitive health care environments. These results pave the way for future clinical adoption, demonstrating the potential of privacy-preserving artificial intelligence in optimizing glycemic regulation while maintaining security, adaptability, and personalization. 
    more » « less
  3. Beard, Daniel A. (Ed.)
    Environmentally induced or epigenetic-related beta-cell dysfunction and insulin resistance play a critical role in the progression to diabetes. We developed a mathematical modeling framework capable of studying the progression to diabetes incorporating various diabetogenic factors. Considering the heightened risk of beta-cell defects induced by obesity, we focused on the obesity-diabetes model to further investigate the influence of obesity on beta-cell function and glucose regulation. The model characterizes individualized glucose and insulin dynamics over the span of a lifetime. We then fit the model to the longitudinal data of the Pima Indian population, which captures both the fluctuations and long-term trends of glucose levels. As predicted, controlling or eradicating the obesity-related factor can alleviate, postpone, or even reverse diabetes. Furthermore, our results reveal that distinct abnormalities of beta-cell function and levels of insulin resistance among individuals contribute to different risks of diabetes. This study may encourage precise interventions to prevent diabetes and facilitate individualized patient treatment. 
    more » « less
  4. Diabetes poses a high risk for debilitating complications in neural tissues, regulating glucose uptake through insulin-dependent and predominantly insulin-independent pathways. Supramolecular nanostructures provide a flexible strategy for combinatorial regulation of glycemia. Here, we compare the effects of free insulin to insulin bound to positively charged nanofibers comprised of self-assembling amino acid compounds (AACs) with an antioxidant-modified side chain moiety (AAC2) in both in vitro and in vivo models of type 1 diabetes. Free AAC2, free human insulin (hINS) and AAC2-bound-human insulin (AAC2-hINS) were tested in streptozotocin (STZ)-induced mouse model of type 1 diabetes. AAC2-hINS acted as a complex and exhibited different properties compared to free AAC2 or hINS. Mice treated with the AAC2-hINS complex were devoid of hypoglycemic episodes, had improved levels of insulin in circulation and in the brain, and increased expression of neurotransmitter taurine transporter, Slc6a6. Consequently, treatment with AAC2-hINS markedly advanced both physical and cognitive performance in mice with STZ-induced and genetic type 1 diabetes compared to treatments with free AAC2 or hINS. This study demonstrates that the flexible nanofiber AAC2 can serve as a therapeutic platform for the combinatorial treatment of diabetes and its complications. 
    more » « less
  5. Alterations in ocular blood flow have been identified as important risk factors for the onset and progression of numerous diseases of the eye. In particular, several population-based and longitudinal-based studies have provided compelling evidence of hemodynamic biomarkers as independent risk factors for ocular disease throughout several different geographic regions. Despite this evidence, the relative contribution of blood flow to ocular physiology and pathology in synergy with other risk factors and comorbidities (e.g., age, gender, race, diabetes and hypertension) remains uncertain. There is currently no gold standard for assessing all relevant vascular beds in the eye, and the heterogeneous vascular biomarkers derived from multiple ocular imaging technologies are non-interchangeable and difficult to interpret as a whole. As a result of these disease complexities and imaging limitations, standard statistical methods often yield inconsistent results across studies and are unable to quantify or explain a patient's overall risk for ocular disease. Combining mathematical modeling with artificial intelligence holds great promise for advancing data analysis in ophthalmology and enabling individualized risk assessment from diverse, multi-input clinical and demographic biomarkers. Mechanism-driven mathematical modeling makes virtual laboratories available to investigate pathogenic mechanisms, advance diagnostic ability and improve disease management. Artificial intelligence provides a novel method for utilizing a vast amount of data from a wide range of patient types to diagnose and monitor ocular disease. This article reviews the state of the art and major unanswered questions related to ocular vascular anatomy and physiology, ocular imaging techniques, clinical findings in glaucoma and other eye diseases, and mechanistic modeling predictions, while laying a path for integrating clinical observations with mathematical models and artificial intelligence. Viable alternatives for integrated data analysis are proposed that aim to overcome the limitations of standard statistical approaches and enable individually tailored precision medicine in ophthalmology. 
    more » « less