Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available August 1, 2025
-
The global effort to combat the COVID-19 pandemic faces ongoing uncertainty with the emergence of Variants of Concern featuring numerous mutations on the Spike (S) protein. In particular, the Omicron Variant is distinguished by 32 mutations, including 10 within its receptor-binding domain (RBD). These mutations significantly impact viral infectivity and the efficacy of vaccines and antibodies currently in use for therapeutic purposes. In our study, we employed structure-based computational saturation mutagenesis approaches to predict the effects of Omicron missense mutations on RBD stability and binding affinity, comparing them to the original Wuhan-Hu-1 strain. Our results predict that mutations such as G431W and P507W induce the most substantial destabilizations in the Wuhan-Hu-1-S/Omicron-S RBD. Notably, we postulate that mutations in the Omicron-S exhibit a higher percentage of enhancing binding affinity compared to Wuhan-S. We found that the mutations at residue positions G447, Y449, F456, F486, and S496 led to significant changes in binding affinity. In summary, our findings may shed light on the widespread prevalence of Omicron mutations in human populations. The Omicron mutations that potentially enhance their affinity for human receptors may facilitate increased viral binding and internalization in infected cells, thereby enhancing infectivity. This informs the development of new neutralizing antibodies capable of targeting Omicron’s immune-evading mutations, potentially aiding in the ongoing battle against the COVID-19 pandemic.more » « lessFree, publicly-accessible full text available July 17, 2025
-
Federated learning (FL) has been emerging as a new distributed machine learning paradigm recently. Although FL can protect the data privacy of participants by keeping their training data on local devices, there are recent works raising new privacy concerns especially when workers or the parameter server of FL are untrustworthy or malicious. One effective way to solve the problem is using hierarchical federated learning (HFL) where a few middle-layer aggregators (or called group leaders) are used to aggregate local model updates from workers and send group model updates to the parameter server. In this paper, we consider the participant selection problem of HFL in an edge cloud with multiple FL models, where each model needs to select one parameter server, a few group leaders and a certain amount of workers from edge servers to jointly perform HFL. We first formulate this problem as a non-linear integer programming, aiming to minimize the total learning cost of all models while satisfying the constrained edge resources. We then design a three-stage algorithm by decoupling the original problem into three sub-problems and solving them iteratively. Simulations with real-world datasets and FL models confirm that our proposed algorithm can efficiently reduce the average total learning cost in edge cloud compared with existing methods.more » « less
-
null (Ed.)Abstract Despite efforts to integrate research across different subdisciplines of biology, the scale of integration remains limited. We hypothesize that future generations of Artificial Intelligence (AI) technologies specifically adapted for biological sciences will help enable the reintegration of biology. AI technologies will allow us not only to collect, connect and analyze data at unprecedented scales, but also to build comprehensive predictive models that span various subdisciplines. They will make possible both targeted (testing specific hypotheses) and untargeted discoveries. AI for biology will be the cross-cutting technology that will enhance our ability to do biological research at every scale. We expect AI to revolutionize biology in the 21st century much like statistics transformed biology in the 20th century. The difficulties, however, are many, including data curation and assembly, development of new science in the form of theories that connect the subdisciplines, and new predictive and interpretable AI models that are more suited to biology than existing machine learning and AI techniques. Development efforts will require strong collaborations between biological and computational scientists. This white paper provides a vision for AI for Biology and highlights some challenges.more » « less
-
Genotype imputation, where missing genotypes can be computationally imputed, is an essential tool in genomic analysis ranging from genome wide associations to phenotype prediction. Traditional genotype imputation methods are typically based on haplotype-clustering algorithms, hidden Markov models (HMMs), and statistical inference. Deep learning-based methods have been recently reported to suitably address the missing data problems in various fields. To explore the performance of deep learning for genotype imputation, in this study, we propose a deep model called a sparse convolutional denoising autoencoder (SCDA) to impute missing genotypes. We constructed the SCDA model using a convolutional layer that can extract various correlation or linkage patterns in the genotype data and applying a sparse weight matrix resulted from the L1 regularization to handle high dimensional data. We comprehensively evaluated the performance of the SCDA model in different scenarios for genotype imputation on the yeast and human genotype data, respectively. Our results showed that SCDA has strong robustness and significantly outperforms popular reference-free imputation methods. This study thus points to another novel application of deep learning models for missing data imputation in genomic studies.more » « less