Eukaryotic diversity is largely microbial, with macroscopic lineages (plant, animals and fungi) nesting among a plethora of diverse protists. Understanding the evolutionary relationships among eukaryotes is rapidly advancing through omics analyses, but phylogenomics are challenging for microeukaryotes, particularly uncultivable lineages, as single-cell sequencing approaches generate a mixture of sequences from hosts, associated microbiomes, and contaminants. Moreover, many analyses of eukaryotic gene families and phylogenies rely on boutique datasets and methods that are challenging for other research groups to replicate. To address these challenges, we present EukPhylo v1.0, a modular, user-friendly pipeline that enables effective data curation through phylogeny-informed contamination removal, estimation of homologous gene families (GFs), and generation of both multisequence alignments and gene trees. Analyses can use a hook database of ~15k ancient GFs or users can easily replace this hook with a set of gene families of interest. We demonstrate the power of EukPhylo, including a suite of stand-alone utilities, through analyses of 500 conserved GFs sampled from 1,000 diverse species of eukaryotes, bacteria and archaea. We show improvements in estimates of the eukaryotic tree of life, recovering clades that are well established in the literature, through successive rounds of curation using the EukPhylo contamination loop. The final trees corroborate numerous hypotheses in the literature (e.g. Opisthokonta, Rhizaria, Amoebozoa) while challenging others (e.g. CRuMs, Obazoa, Diaphoretickes). We believe that the flexibility and transparency of EukPhylo sets standards for curation of omics data for future studies.
more »
« less
Ten Quick Tips for Accurate Reconstruction of Prokaryotic and Eukaryotic Genome-Scale Metabolic Models
Constraint-based metabolic modeling approaches have enhanced our knowledge and understanding of the metabolism of prokaryotes and eukaryotes. This approach highly depends on the reconstruction process of genome-scale metabolic models (Mmodels). M-models can guide effective experimental design and yield new insights into the function and control of biological systems. Despite the recent advances in the automated generation of draft metabolic network reconstructions, the manual curation of these networks remains a labor-intensive and challenging task. Thus, these ten quick tips for the manual curation process are essential for optimizing high-quality metabolic model generation in less time. This collection of tips describes in great detail the resources and methods to ensure successful reconstruction. Furthermore, it increases the scope of other protocols of metabolic modeling by including resources to reconstruct eukaryotic organisms. Thus, all tips are applicable to a wide range of eukaryotic organisms. We believe this manuscript will interest a broad audience and researchers from different disciplines, spanning from microbiology and systems biology to biotechnology.
more »
« less
- Award ID(s):
- 2313313
- PAR ID:
- 10533354
- Editor(s):
- BiologyandLifeSciences, Other
- Publisher / Repository:
- Preprints.org
- Date Published:
- Edition / Version:
- 1
- Volume:
- 1
- Issue:
- 1
- Page Range / eLocation ID:
- 1-20
- Subject(s) / Keyword(s):
- Genome-scale metabolic model reconstruction manual curation quick tips systems biology
- Format(s):
- Medium: X Size: 2MB Other: xls
- Size(s):
- 2MB
- Institution:
- San Diego State University
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract The modeling of rates of biochemical reactions—fluxes—in metabolic networks is widely used for both basic biological research and biotechnological applications. A number of different modeling methods have been developed to estimate and predict fluxes, including kinetic and constraint‐based (Metabolic Flux Analysis and flux balance analysis) approaches. Although different resources exist for teaching these methods individually, to‐date no resources have been developed to teach these approaches in an integrative way that equips learners with an understanding of each modeling paradigm, how they relate to one another, and the information that can be gleaned from each. We have developed a series of modeling simulations in Python to teach kinetic modeling, metabolic control analysis, 13C‐metabolic flux analysis, and flux balance analysis. These simulations are presented in a series of interactive notebooks with guided lesson plans and associated lecture notes. Learners assimilate key principles using models of simple metabolic networks by running simulations, generating and using data, and making and validating predictions about the effects of modifying model parameters. We used these simulations as the hands‐on computer laboratory component of a four‐day metabolic modeling workshop and participant survey results showed improvements in learners' self‐assessed competence and confidence in understanding and applying metabolic modeling techniques after having attended the workshop. The resources provided can be incorporated in their entirety or individually into courses and workshops on bioengineering and metabolic modeling at the undergraduate, graduate, or postgraduate level.more » « less
-
Synopsis Pursuing cutting edge questions in organismal biology in the future will require novel approaches for training the next generation of organismal biologists, including knowledge and use of systems-type modeling combined with integrative organismal biology. We link agendas recommending changes in science education and practice across three levels: Broadening the concept of organismal biology to promote modeling organisms as systems interacting with higher and lower organizational levels; enhancing undergraduate science education to improve applications of quantitative reasoning and modeling in the scientific process; and K-12 curricula based on Next Generation Science Standards emphasizing development and use of models in the context of explanatory science, solution design, and evaluating and communicating information. Out of each of these initiatives emerges an emphasis on routine use of models as tools for hypothesis testing and prediction. The question remains, however, what is the best approach for training the next generation of organismal biology students to facilitate their understanding and use of models? We address this question by proposing new ways of teaching and learning, including the development of interactive web-based modeling modules that lower barriers for scientists approaching this new way of imagining and conducting integrative organismal biology.more » « less
-
null (Ed.)Abstract Cells can sense changes in their extracellular environment and subsequently adapt their biomass composition. Nutrient abundance defines the capability of the cell to produce biomass components. Under nutrient-limited conditions, resource allocation dramatically shifts to carbon-rich molecules. Here, we used dynamic biomass composition data to predict changes in growth and reaction flux distributions using the available genome-scale metabolic models of five eukaryotic organisms (three heterotrophs and two phototrophs). We identified temporal profiles of metabolic fluxes that indicate long-term trends in pathway and organelle function in response to nitrogen depletion. Surprisingly, our calculations of model sensitivity and biosynthetic cost showed that free energy of biomass metabolites is the main driver of biosynthetic cost and not molecular weight, thus explaining the high costs of arginine and histidine. We demonstrated how metabolic models can accurately predict the complexity of interwoven mechanisms in response to stress over the course of growth.more » « less
-
Abstract BackgroundCircadian (daily) timekeeping is essential to the survival of many organisms. An integral part of all circadian timekeeping systems is negative feedback between an activator and repressor. However, the role of this feedback varies widely between lower and higher organisms. ResultsHere, we study repression mechanisms in the cyanobacterial and eukaryotic clocks through mathematical modeling and systems analysis. We find a common mathematical model that describes the mechanism by which organisms generate rhythms; however, transcription’s role in this has diverged. In cyanobacteria, protein sequestration and phosphorylation generate and regulate rhythms while transcription regulation keeps proteins in proper stoichiometric balance. Based on recent experimental work, we propose a repressor phospholock mechanism that models the negative feedback through transcription in clocks of higher organisms. Interestingly, this model, when coupled with activator phosphorylation, allows for oscillations over a wide range of protein stoichiometries, thereby reconciling the negative feedback mechanism inNeurosporawith that in mammals and cyanobacteria. ConclusionsTaken together, these results paint a picture of how circadian timekeeping may have evolved.more » « less
An official website of the United States government

