Haplotype phasing maize genetic variants is important for genome interpretation, population genetic analysis and functional analysis of allelic activity. We performed an isoform-level phasing study using two maize inbred lines and their reciprocal crosses, based on single-molecule, full-length cDNA sequencing. To phase and analyze transcripts between hybrids and parents, we developed IsoPhase. Using this tool, we validated the majority of SNPs called against matching short-read data from embryo, endosperm and root tissues, and identified allele-specific, gene-level and isoform-level differential expression between the inbred parental lines and hybrid offspring. After phasing 6907 genes in the reciprocal hybrids, we annotated the SNPs and identified large-effect genes. In addition, we identified parent-of-origin isoforms, distinct novel isoforms in maize parent and hybrid lines, and imprinted genes from different tissues. Finally, we characterized variation in cis- and trans-regulatory effects. Our study provides measures of haplotypic expression that could increase accuracy in studies of allelic expression.
Single‐parent expression (SPE) is defined as gene expression in only one of the two parents. SPE can arise from differential expression between parental alleles, termed non‐presence/absence (non‐PAV) SPE, or from the physical absence of a gene in one parent, termed PAV SPE. We used transcriptome data of diverse
- Award ID(s):
- 1934384
- PAR ID:
- 10454334
- Publisher / Repository:
- Wiley-Blackwell
- Date Published:
- Journal Name:
- The Plant Journal
- Volume:
- 105
- Issue:
- 1
- ISSN:
- 0960-7412
- Page Range / eLocation ID:
- p. 93-107
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
Summary Plants respond to abiotic stress through a variety of physiological, biochemical, and transcriptional mechanisms. Many genes exhibit altered levels of expression in response to abiotic stress, which requires concerted action of both
cis‐ andtrans‐ regulatory features. In order to study the variability in transcriptome response to abiotic stress,RNA sequencing was performed using 14‐day‐old maize seedlings of inbreds B73, Mo17, Oh43,PH 207 and B37 under control, cold and heat conditions. Large numbers of genes that responded differentially to stress between parental inbred lines were identified.RNA sequencing was also performed on similar tissues of theF 1hybrids produced by crossing B73 and each of the three other inbred lines. By evaluating allele‐specific transcript abundance in theF 1hybrids, we were able to measure the abundance ofcis‐ andtrans‐ regulatory variation between genotypes for both steady‐state and stress‐responsive expression differences. Although examples oftrans‐ regulatory variation were observed,cis‐ regulatory variation was more common for both steady‐state and stress‐responsive expression differences. The genes withcis‐ allelic variation for response to cold or heat stress provided an opportunity to study the basis for regulatory diversity. -
Bomblies, K (Ed.)
Abstract DNA methylation in plants is depleted from cis-regulatory elements in and near genes but is present in some gene bodies, including exons. Methylation in exons solely in the CG context is called gene body methylation (gbM). Methylation in exons in both CG and non-CG contexts is called TE-like methylation (teM). Assigning functions to both forms of methylation in genes has proven to be challenging. Toward that end, we utilized recent genome assemblies, gene annotations, transcription data, and methylome data to quantify common patterns of gene methylation and their relations to gene expression in maize. We found that gbM genes exist in a continuum of CG methylation levels without a clear demarcation between unmethylated genes and gbM genes. Analysis of expression levels across diverse maize stocks and tissues revealed a weak but highly significant positive correlation between gbM and gene expression except in endosperm. gbM epialleles were associated with an approximately 3% increase in steady-state expression level relative to unmethylated epialleles. In contrast to gbM genes, which were conserved and were broadly expressed across tissues, we found that teM genes, which make up about 12% of genes, are mainly silent, are poorly conserved, and exhibit evidence of annotation errors. We used these data to flag teM genes in the 26 NAM founder genome assemblies. While some teM genes are likely functional, these data suggest that the majority are not, and their inclusion can confound the interpretation of whole-genome studies.
-
Abstract Polyploidy complicates transcriptional regulation and increases phenotypic diversity in organisms. The dynamics of genetic regulation of gene expression between coresident subgenomes in polyploids remains to be understood. Here we document the genetic regulation of fiber development in allotetraploid cotton
Gossypium hirsutum by sequencing 376 genomes and 2,215 time-series transcriptomes. We characterize 1,258 genes comprising 36 genetic modules that control staged fiber development and uncover genetic components governing their partitioned expression relative to subgenomic duplicated genes (homoeologs). Only about 30% of fiber quality-related homoeologs show phenotypically favorable allele aggregation in cultivars, highlighting the potential for subgenome additivity in fiber improvement. We envision a genome-enabled breeding strategy, with particular attention to 48 favorable alleles related to fiber phenotypes that have been subjected to purifying selection during domestication. Our work delineates the dynamics of gene regulation during fiber development and highlights the potential of subgenomic coordination underpinning phenotypes in polyploid plants. -
Summary Relative to homozygous diploids, the presence of multiple homologs or homeologs in polyploids affords greater tolerance to mutations that can impact genome evolution. In this study, we describe sequence and structural variation in the genomes of six accessions of cultivated potato (
Solanum tuberosum L.), a vegetatively propagated autotetraploid and their impact on the transcriptome. Sequence diversity was high with a mean single nucleotide polymorphisms (SNP ) rate of approximately 1 per 50 bases suggestive of high levels of allelic diversity. Additive gene expression was observed in leaves (3605 genes) and tubers (6156 genes) that contrasted the preferential allele expression of between 2180 and 3502 and 3367 and 5270 genes in the leaf and tuber transcriptome, respectively. Preferential allele expression was significantly associated with evolutionarily conserved genes suggesting selection of specific alleles of genes responsible for biological processes common to angiosperms during the breeding selection process. Copy number variation was rampant with between 16 098 and 18 921 genes in each cultivar exhibiting duplication or deletion. Copy number variable genes tended to be evolutionarily recent, lowly expressed, and enriched in genes that show increased expression in response to biotic and abiotic stress treatments suggestive of a role in adaptation. Gene copy number impacts on gene expression were detected with 528 genes having correlations between copy number and gene expression. Collectively, these data suggest that in addition to allelic variation of coding sequence, the heterogenous nature of the tetraploid potato genome contributes to a highly dynamic transcriptome impacted by allele preferential and copy number‐dependent expression effects.