skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Rai, Navneet"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract We present a machine learning framework to automate knowledge discovery through knowledge graph construction, inconsistency resolution, and iterative link prediction. By incorporating knowledge from 10 publicly available sources, we construct an Escherichia coli antibiotic resistance knowledge graph with 651,758 triples from 23 triple types after resolving 236 sets of inconsistencies. Iteratively applying link prediction to this graph and wet-lab validation of the generated hypotheses reveal 15 antibiotic resistant E. coli genes, with 6 of them never associated with antibiotic resistance for any microbe. Iterative link prediction leads to a performance improvement and more findings. The probability of positive findings highly correlates with experimentally validated findings ( R 2  = 0.94). We also identify 5 homologs in Salmonella enterica that are all validated to confer resistance to antibiotics. This work demonstrates how evidence-driven decisions are a step toward automating knowledge discovery with high confidence and accelerated pace, thereby substituting traditional time-consuming and expensive methods. 
    more » « less
  2. null (Ed.)
    Glutaraldehyde is a widely used biocide on the market for about 50 years. Despite its broad application, several reports on the emergence of bacterial resistance, and occasional outbreaks caused by poorly disinfection, there is a gap of knowledge on the bacterial adaptation, tolerance, and resistance mechanisms to glutaraldehyde. Here, we analyze the effects of the independent selection of mutations in the transcriptional regulator yqhC for biological replicates of Escherichia coli cells subjected to adaptive laboratory evolution (ALE) in the presence of glutaraldehyde. The evolved strains showed improved survival in the biocide (11–26% increase in fitness) as a result of mutations in the activator yqhC , which led to the overexpression of the yqhD aldehyde reductase gene by 8 to over 30-fold (3.1–5.2 log2FC range). The protective effect was exclusive to yqhD as other aldehyde reductase genes of E. coli , such as yahK , ybbO , yghA , and ahr did not offer protection against the biocide. We describe a novel mechanism of tolerance to glutaraldehyde based on the activation of the aldehyde reductase YqhD by YqhC and bring attention to the potential for the selection of such tolerance mechanism outside the laboratory, given the existence of YqhD homologs in various pathogenic and opportunistic bacterial species. 
    more » « less
  3. Abstract How to design experiments that accelerate knowledge discovery on complex biological landscapes remains a tantalizing question. We present an optimal experimental design method (coined OPEX) to identify informative omics experiments using machine learning models for both experimental space exploration and model training. OPEX-guided exploration ofEscherichia coli’s populations exposed to biocide and antibiotic combinations lead to more accurate predictive models of gene expression with 44% less data. Analysis of the proposed experiments shows that broad exploration of the experimental space followed by fine-tuning emerges as the optimal strategy. Additionally, analysis of the experimental data reveals 29 cases of cross-stress protection and 4 cases of cross-stress vulnerability. Further validation reveals the central role of chaperones, stress response proteins and transport pumps in cross-stress exposure. This work demonstrates how active learning can be used to guide omics data collection for training predictive models, making evidence-driven decisions and accelerating knowledge discovery in life sciences. 
    more » « less
  4. Abstract Microbial fermentation is an essential process for research and industrial applications, yet our understanding of cellular dynamics during long‐term fermentation is limited. Here, we report a reproducible phenomenon of abrupt population collapse followed by a rapid population rescue that was observed during long‐term chemostat cultivations, for various strains ofEscherichia coliin minimal media. Through genome resequencing and whole‐genome transcriptional profiling of replicate runs over time, we identified that changes in the tRNA and carbon catabolic genes are the genetic basis of this phenomenon. Since current fermentation models are unable to capture the observed dynamics, we present an extended model that takes into account critical biological processes during fermentation, and we further validated carbon source predictions through forward experimentation. This study extends the predictability of current models for microbial fermentation and adds to our system‐level knowledge of cellular adaptation during this crucial biotechnological process. 
    more » « less