skip to main content


Search for: All records

Award ID contains: 1840761

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Database and data structure research can improve machine learning performance in many ways. One way is to design better algorithms on data structures. This paper combines the use of incremental computation as well as sequential and probabilistic filtering to enable “forgetful” tree-based learning algorithms to cope with streaming data that suffers from concept drift. (Concept drift occurs when the functional mapping from input to classification changes over time). The forgetful algorithms described in this paper achieve high performance while maintaining high quality predictions on streaming data. Specifically, the algorithms are up to 24 times faster than state-of-the-art incremental algorithms with, at most, a 2% loss of accuracy, or are at least twice faster without any loss of accuracy. This makes such structures suitable for high volume streaming applications. 
    more » « less
    Free, publicly-accessible full text available June 1, 2024
  2. Nitrogen (N) and Water (W) - two resources critical for crop productivity – are becoming increasingly limited in soils globally. To address this issue, we aim to uncover the gene regulatory networks (GRNs) that regulate nitrogen use efficiency (NUE) - as a function of water availability - in Oryza sativa, a staple for 3.5 billion people. In this study, we infer and validate GRNs that correlate with rice NUE phenotypes affected by N-by-W availability in the field. We did this by exploiting RNA-seq and crop phenotype data from 19 rice varieties grown in a 2x2 N-by-W matrix in the field. First, to identify gene-to-NUE field phenotypes, we analyzed these datasets using weighted gene co-expression network analysis (WGCNA). This identified two network modules ("skyblue" & "grey60") highly correlated with NUE grain yield (NUEg). Next, we focused on 90 TFs contained in these two NUEg modules and predicted their genome-wide targets using the N-and/or-W response datasets using a random forest network inference approach (GENIE3). Next, to validate the GENIE3 TF→target gene predictions, we performed Precision/Recall Analysis (AUPR) using nine datasets for three TFs validated in planta . This analysis sets a precision threshold of 0.31, used to "prune" the GENIE3 network for high-confidence TF→target gene edges, comprising 88 TFs and 5,716 N-and/or-W response genes. Next, we ranked these 88 TFs based on their significant influence on NUEg target genes responsive to N and/or W signaling. This resulted in a list of 18 prioritized TFs that regulate 551 NUEg target genes responsive to N and/or W signals. We validated the direct regulated targets of two of these candidate NUEg TFs in a plant cell-based TF assay called TARGET, for which we also had in planta data for comparison. Gene ontology analysis revealed that 6/18 NUEg TFs - OsbZIP23 (LOC_Os02g52780), Oshox22 (LOC_Os04g45810), LOB39 (LOC_Os03g41330), Oshox13 (LOC_Os03g08960), LOC_Os11g38870, and LOC_Os06g14670 - regulate genes annotated for N and/or W signaling. Our results show that OsbZIP23 and Oshox22, known regulators of drought tolerance, also coordinate W-responses with NUEg. This validated network can aid in developing/breeding rice with improved yield on marginal, low N-input, drought-prone soils. 
    more » « less
  3. Nitrate is a nutrient and a potent signal that impacts global gene expression in plants. However, the regulatory factors controlling temporal and cell type–specific nitrate responses remain largely unknown. We assayed nitrate-responsive transcriptome changes in five major root cell types of the Arabidopsis thaliana root as a function of time. We found that gene-expression response to nitrate is dynamic and highly localized and predicted cell type–specific transcription factor (TF)–target interactions. Among cell types, the endodermis stands out as having the largest and most connected nitrate-regulatory gene network. ABF2 and ABF3 are major hubs for transcriptional responses in the endodermis cell layer. We experimentally validated TF–target interactions for ABF2 and ABF3 by chromatin immunoprecipitation followed by sequencing and a cell-based system to detect TF regulation genome-wide. Validated targets of ABF2 and ABF3 account for more than 50% of the nitrate-responsive transcriptome in the endodermis. Moreover, ABF2 and ABF3 are involved in nitrate-induced lateral root growth. Our approach offers an unprecedented spatiotemporal resolution of the root response to nitrate and identifies important components of cell-specific gene regulatory networks. 
    more » « less
  4. null (Ed.)
    All aspects of transcription and its regulation involve dynamic events. However, capturing these dynamic events in gene regulatory networks (GRNs) offers both a promise and a challenge. The promise is that capturing and modeling the dynamic changes in GRNs will allow us to understand how organisms adapt to a changing environment. The ability to mount a rapid transcriptional response to environmental changes is especially important in nonmotile organisms such as plants. The challenge is to capture these dynamic, genome-wide events and model them in GRNs. In this review, we cover recent progress in capturing dynamic interactions of transcription factors with their targets—at both the local and genome-wide levels—and using them to learn how GRNs operate as a function of time. We also discuss recent advances that employ time-based machine learning approaches to forecast gene expression at future time points, a key goal of systems biology. Expected final online publication date for the Annual Review of Plant Biology, Volume 72 is May 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates. 
    more » « less
  5. null (Ed.)
    Abstract Deciphering gene regulatory networks (GRNs) is both a promise and challenge of systems biology. The promise lies in identifying key transcription factors (TFs) that enable an organism to react to changes in its environment. The challenge lies in validating GRNs that involve hundreds of TFs with hundreds of thousands of interactions with their genome-wide targets experimentally determined by high-throughput sequencing. To address this challenge, we developed ConnecTF, a species-independent, web-based platform that integrates genome-wide studies of TF–target binding, TF–target regulation, and other TF-centric omic datasets and uses these to build and refine validated or inferred GRNs. We demonstrate the functionality of ConnecTF by showing how integration within and across TF–target datasets uncovers biological insights. Case study 1 uses integration of TF–target gene regulation and binding datasets to uncover TF mode-of-action and identify potential TF partners for 14 TFs in abscisic acid signaling. Case study 2 demonstrates how genome-wide TF–target data and automated functions in ConnecTF are used in precision/recall analysis and pruning of an inferred GRN for nitrogen signaling. Case study 3 uses ConnecTF to chart a network path from NLP7, a master TF in nitrogen signaling, to direct secondary TF2s and to its indirect targets in a Network Walking approach. The public version of ConnecTF (https://ConnecTF.org) contains 3,738,278 TF–target interactions for 423 TFs in Arabidopsis, 839,210 TF–target interactions for 139 TFs in maize (Zea mays), and 293,094 TF–target interactions for 26 TFs in rice (Oryza sativa). The database and tools in ConnecTF will advance the exploration of GRNs in plant systems biology applications for model and crop species. 
    more » « less
  6. null (Ed.)
  7. Xu, Guohua (Ed.)
    Abstract Nitrogen (N) and water (W) are crucial inputs for plant survival as well as costly resources for agriculture. Given their importance, the molecular mechanisms that plants rely on to signal changes in either N or W status have been under intense scrutiny. However, how plants sense and respond to the combination of N and W signals at the molecular level has received scant attention. The purpose of this review is to shed light on what is currently known about how plant responses to N are impacted by W status. We review classic studies which detail how N and W combinations have both synergistic and antagonistic effects on key plant traits, such as root architecture and stomatal aperture. Recent molecular studies of N and W interactions show that mutations in genes involved in N metabolism affect drought responses, and vice versa. Specifically, perturbing key N signaling genes may lead to changes in drought-responsive gene expression programs, which is supported by a meta-analysis we conduct on available transcriptomic data. Additionally, we cite studies that show how combinatorial transcriptional responses to N and W status might drive crop phenotypes. Through these insights, we suggest research strategies that could help to develop crops adapted to marginal soils depleted in both N and W, an important task in the face of climate change. 
    more » « less