NSF PAR Search | NSF Public Access Repository


FSLearning: An Efficient Federated Split Learning Framework for Privacy-Preserving Disease Prediction

https://doi.org/10.1007/978-3-031-95838-0_22

Li, Bin; Jiang, Xiaoqian; Hsu, Yu-Chun; Harmanci, Arif O; Gao, Hongchang; Shi, Xinghua (January 2025, Springer Nature Switzerland)

Full Text Available
Joint Participant and Learning Topology Selection for Federated Learning in Edge Clouds

https://doi.org/10.1109/TPDS.2024.3413751

Wei, Xinliang; Ye, Kejiang; Shi, Xinghua; Xu, Cheng-Zhong; Wang, Yu (August 2024, IEEE Transactions on Parallel and Distributed Systems)

Full Text Available
Impact of Missense Mutations on Spike Protein Stability and Binding Affinity in the Omicron Variant

https://doi.org/10.3390/v16071150

Mahase, Vidhyanand; Sobitan, Adebiyi; Yao, Qiaobin; Shi, Xinghua; Qin, Hong; Kidane, Dawit; Tang, Qiyi; Teng, Shaolei (July 2024, Viruses)

The global effort to combat the COVID-19 pandemic faces ongoing uncertainty with the emergence of Variants of Concern featuring numerous mutations on the Spike (S) protein. In particular, the Omicron Variant is distinguished by 32 mutations, including 10 within its receptor-binding domain (RBD). These mutations significantly impact viral infectivity and the efficacy of vaccines and antibodies currently in use for therapeutic purposes. In our study, we employed structure-based computational saturation mutagenesis approaches to predict the effects of Omicron missense mutations on RBD stability and binding affinity, comparing them to the original Wuhan-Hu-1 strain. Our results predict that mutations such as G431W and P507W induce the most substantial destabilizations in the Wuhan-Hu-1-S/Omicron-S RBD. Notably, we postulate that mutations in the Omicron-S exhibit a higher percentage of enhancing binding affinity compared to Wuhan-S. We found that the mutations at residue positions G447, Y449, F456, F486, and S496 led to significant changes in binding affinity. In summary, our findings may shed light on the widespread prevalence of Omicron mutations in human populations. The Omicron mutations that potentially enhance their affinity for human receptors may facilitate increased viral binding and internalization in infected cells, thereby enhancing infectivity. This informs the development of new neutralizing antibodies capable of targeting Omicron’s immune-evading mutations, potentially aiding in the ongoing battle against the COVID-19 pandemic.
Full Text Available
Participant Selection for Hierarchical Federated Learning in Edge Clouds

https://doi.org/10.1109/NAS55553.2022.9925313

Wei, Xinliang; Liu, Jiyao; Shi, Xinghua; Wang, Yu (October 2022, 2022 IEEE International Conference on Networking, Architecture and Storage (NAS))

Federated learning (FL) has recently emerged as a new distributed machine learning paradigm. Although FL can protect the data privacy of participants by keeping their training data on local devices, recent works raise new privacy concerns, especially when workers or the parameter server of FL are untrustworthy or malicious. One effective way to solve the problem is using hierarchical federated learning (HFL), where a few middle-layer aggregators (also called group leaders) are used to aggregate local model updates from workers and send group model updates to the parameter server. In this paper, we consider the participant selection problem of HFL in an edge cloud with multiple FL models, where each model needs to select one parameter server, a few group leaders and a certain number of workers from edge servers to jointly perform HFL. We first formulate this problem as a non-linear integer programming problem, aiming to minimize the total learning cost of all models while satisfying the constrained edge resources. We then design a three-stage algorithm by decoupling the original problem into three sub-problems and solving them iteratively. Simulations with real-world datasets and FL models confirm that our proposed algorithm can efficiently reduce the average total learning cost in an edge cloud compared with existing methods.
Offspring GAN augments biased human genomic data

https://doi.org/10.1145/3535508.3545537

Das, Supratim; Shi, Xinghua (January 2022, Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics)

Full Text Available
Artificial Intelligence for Biology

https://doi.org/10.1093/icb/icab188

Hassoun, Soha; Jefferson, Felicia; Shi, Xinghua; Stucky, Brian; Wang, Jin; Rosa, Epaminondas (August 2021, Integrative and Comparative Biology)
Despite efforts to integrate research across different subdisciplines of biology, the scale of integration remains limited. We hypothesize that future generations of Artificial Intelligence (AI) technologies specifically adapted for biological sciences will help enable the reintegration of biology. AI technologies will allow us not only to collect, connect and analyze data at unprecedented scales, but also to build comprehensive predictive models that span various subdisciplines. They will make possible both targeted (testing specific hypotheses) and untargeted discoveries. AI for biology will be the cross-cutting technology that will enhance our ability to do biological research at every scale. We expect AI to revolutionize biology in the 21st century much like statistics transformed biology in the 20th century. The difficulties, however, are many, including data curation and assembly, development of new science in the form of theories that connect the subdisciplines, and new predictive and interpretable AI models that are more suited to biology than existing machine learning and AI techniques. Development efforts will require strong collaborations between biological and computational scientists. This white paper provides a vision for AI for Biology and highlights some challenges.
Full Text Available
On the Convergence of Stochastic Compositional Gradient Descent Ascent Method

Gao, Hongchang; Wang, Xiaoqian; Luo, Lei; Shi, Xinghua (January 2021, Thirtieth International Joint Conference on Artificial Intelligence (IJCAI))
Full Text Available
Privacy-Preserving Participant Grouping for Mobile Social Sensing Over Edge Clouds

https://doi.org/10.1109/TNSE.2020.3020159

Li, Ting; Qiu, Zhijin; Cao, Lijuan; Cheng, Dazhao; Wang, Weichao; Shi, Xinghua; Wang, Yu (April 2021, IEEE Transactions on Network Science and Engineering)
Full Text Available
Complex genetic variation in nearly complete human genomes

https://doi.org/10.1101/2024.09.24.614721

Logsdon, Glennis A; Ebert, Peter; Audano, Peter A; Loftus, Mark; Porubsky, David; Ebler, Jana; Yilmaz, Feyza; Hallast, Pille; Prodanov, Timofey; Yoo, DongAhn; et al (September 2024, Nature)

Diverse sets of complete human genomes are required to construct a pangenome reference and to understand the extent of complex structural variation. Here, we sequence 65 diverse human genomes and build 130 haplotype-resolved assemblies (130 Mbp median continuity), closing 92% of all previous assembly gaps and reaching telomere-to-telomere (T2T) status for 39% of the chromosomes. We highlight complete sequence continuity of complex loci, including the major histocompatibility complex (MHC), SMN1/SMN2, NBPF8, and AMY1/AMY2, and fully resolve 1,852 complex structural variants (SVs). In addition, we completely assemble and validate 1,246 human centromeres. We find up to 30-fold variation in α-satellite high-order repeat (HOR) array length and characterize the pattern of mobile element insertions into α-satellite HOR arrays. While most centromeres predict a single site of kinetochore attachment, epigenetic analysis suggests the presence of two hypomethylated regions for 7% of centromeres. Combining our data with the draft pangenome reference significantly enhances genotyping accuracy from short-read data, enabling whole-genome inference to a median quality value (QV) of 45. Using this approach, 26,115 SVs per sample are detected, substantially increasing the number of SVs now amenable to downstream disease association studies.
Full Text Available
Population-scale Genomic Data Augmentation Based on Conditional Generative Adversarial Networks

https://doi.org/10.1145/3388440.3412475

Chen, Junjie; Mowlaei, Mohammad Erfan; Shi, Xinghua (January 2020, Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics)

Full Text Available
