Within the broad field of plant sciences, what are the most pressing challenges and opportunities to advance? Answers to this question usually include food and nutritional security, climate change mitigation, adaptation of plants to changing climates, preservation of biodiversity and ecosystem services, production of plant‐based proteins and products, and growth of the bioeconomy. Genes and the processes their products carry out create differences in how plants grow, develop, and behave, and thus, the key solutions to these challenges lie squarely in the space where plant genomics and physiology intersect. Advancements in genomics, phenomics, and analysis tools have generated massive datasets, but these data are complex and have not always generated scientific insights at the anticipated pace. Further, new tools may need to be created or adapted, and field‐relevant applications tested, to advance scientific discovery derived from such datasets. Meaningful, relevant conclusions and connections from genomics and plant physiological and biochemical data require both subject matter expertise and the collaborative skills needed to work together outside of specific disciplines. Bringing the best expertise to bear on complex problems in plant sciences requires enhanced, inclusive, and sustained collaboration across disciplines. However, despite significant efforts to enable and sustain collaborative research, a variety of challenges persist. Here, we present the outcomes and conclusions of two workshops convened to address the need for collaboration between scientists engaged in plant physiology, genetics, and genomics and to discuss the approaches that will create the necessary environments to support successful collaboration. We conclude with approaches to share and reward collaboration and the need to train inclusive scientists that will have the skills to thrive in interdisciplinary contexts.
The boom of massive parallel sequencing (
- Award ID(s):
- 1537959
- NSF-PAR ID:
- 10246686
- Publisher / Repository:
- Wiley-Blackwell
- Date Published:
- Journal Name:
- Molecular Ecology
- Volume:
- 25
- Issue:
- 13
- ISSN:
- 0962-1083
- Page Range / eLocation ID:
- p. 2967-2977
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
Abstract Comparative genomics of social insects has been intensely pursued in recent years with the goal of providing insights into the evolution of social behaviour and its underlying genomic and epigenomic basis. However, the comparative approach has been hampered by a paucity of data on some of the most informative social forms (e.g. incipiently and primitively social) and taxa (especially members of the wasp family Vespidae) for studying social evolution. Here, we provide a draft genome of the primitively eusocial model insect
Polistes dominula , accompanied by analysis of caste‐related transcriptome and methylome sequence data for adult queens and workers.Polistes dominula possesses a fairly typical hymenopteran genome, but shows very low genomewideGC content and some evidence of reduced genome size. We found numerous caste‐related differences in gene expression, with evidence that both conserved and novel genes are related to caste differences. Most strikingly, these –omics data reveal a major reduction in one of the major epigenetic mechanisms that has been previously suggested to be important for caste differences in social insects:DNA methylation. Along with a conspicuous loss of a key gene associated with environmentally responsiveDNA methylation (the de novoDNA methyltransferaseDnmt3 ), these wasps have greatly reduced genomewide methylation to almost zero. In addition to providing a valuable resource for comparative analysis of social insect evolution, our integrative –omics data for this important behavioural and evolutionary model system call into question the general importance ofDNA methylation in caste differences and evolution in social insects. -
Abstract The genomic revolution has fundamentally changed how we survey biodiversity on earth. High‐throughput sequencing (“
HTS ”) platforms now enable the rapid sequencing ofDNA from diverse kinds of environmental samples (termed “environmentalDNA ” or “eDNA ”). CouplingHTS with our ability to associate sequences fromeDNA with a taxonomic name is called “eDNA metabarcoding” and offers a powerful molecular tool capable of noninvasively surveying species richness from many ecosystems. Here, we review the use ofeDNA metabarcoding for surveying animal and plant richness, and the challenges in usingeDNA approaches to estimate relative abundance. We highlighteDNA applications in freshwater, marine and terrestrial environments, and in this broad context, we distill what is known about the ability of differenteDNA sample types to approximate richness in space and across time. We provide guiding questions for study design and discuss theeDNA metabarcoding workflow with a focus on primers and library preparation methods. We additionally discuss important criteria for consideration of bioinformatic filtering of data sets, with recommendations for increasing transparency. Finally, looking to the future, we discuss emerging applications ofeDNA metabarcoding in ecology, conservation, invasion biology, biomonitoring, and howeDNA metabarcoding can empower citizen science and biodiversity education. -
Abstract New computational methods and next‐generation sequencing (NGS) approaches have enabled the use of thousands or hundreds of thousands of genetic markers to address previously intractable questions. The methods and massive marker sets present both new data analysis challenges and opportunities to visualize, understand, and apply population and conservation genomic data in novel ways. The large scale and complexity of NGS data also increases the expertise and effort required to thoroughly and thoughtfully analyze and interpret data. To aid in this endeavor, a recent workshop entitled “Population Genomic Data Analysis,” also known as “ConGen 2017,” was held at the University of Montana. The ConGen workshop brought 15 instructors together with knowledge in a wide range of topics including NGS data filtering, genome assembly, genomic monitoring of effective population size, migration modeling, detecting adaptive genomic variation, genomewide association analysis, inbreeding depression, and landscape genomics. Here, we summarize the major themes of the workshop and the important take‐home points that were offered to students throughout. We emphasize increasing participation by women in population and conservation genomics as a vital step for the advancement of science. Some important themes that emerged during the workshop included the need for data visualization and its importance in finding problematic data, the effects of data filtering choices on downstream population genomic analyses, the increasing availability of whole‐genome sequencing, and the new challenges it presents. Our goal here is to help motivate and educate a worldwide audience to improve population genomic data analysis and interpretation, and thereby advance the contribution of genomics to molecular ecology, evolutionary biology, and especially to the conservation of biodiversity.
-
Abstract Alignments of multiple genomes are a cornerstone of comparative genomics, but generating these alignments remains technically challenging and often impractical. We developed the
msa_pipeline workflow (https://bitbucket.org/bucklerlab/msa_pipeline) to allow practical and sensitive multiple alignment of diverged plant genomes and calculation of conservation scores with minimal user inputs. As high repeat content and genomic divergence are substantial challenges in plant genome alignment, we also explored the effect of different masking approaches and parameters of the LAST aligner using genome assemblies of 33 grass species. Compared with conventional masking with RepeatMasker, a masking approach based onk ‐mers (nucleotide sequences ofk length) increased the alignment rate of coding sequence and noncoding functional regions by 25 and 14%, respectively. We further found that default alignment parameters generally perform well, but parameter tuning can increase the alignment rate for noncoding functional regions by over 52% compared with default LAST settings. Finally, by increasing alignment sensitivity from the default baseline, parameter tuning can increase the number of noncoding sites that can be scored for conservation by over 76%. Overall, tuning of masking and alignment parameters can generate optimized multiple alignments to drive biological discovery in plants.