Abstract Hop production utilizes exclusively female plants, whereas male plants only serve to generate novel variation within breeding programs through crossing. Currently, hop lacks a rapid and accurate diagnostic marker to determine whether plants are male or female. Without a diagnostic marker, breeding programs may take 1–2 years to determine the sex of new seedlings. Previous research on sex-linked markers was restricted to specific populations or breeding programs and therefore had limited transferability or suffered from low scalability. A large collection of 765 hop genotypes with known sex phenotypes, genotyping-by-sequencing, and genome-wide association mapping revealed a highly significant marker on the sex chromosome (LOD score = 208.7) that predicted sex within our population with 96.2% accuracy. In this study, we developed a PCR allele competitive extension (PACE) assay for the diagnostic SNP and tested three quick DNA extraction methodologies for rapid, high-throughput genotyping. Additionally, the marker was validated in a separate population of 94 individuals from 15 families from the USDA-ARS hop breeding program in Prosser, WA with 96% accuracy. This diagnostic marker is located in a gene predicted to encode the basic helix-loop-helix transcription factor protein, a family of proteins that have been previously implicated in male sterility in a variety of plant species, which may indicate a role in determining hop sex. The marker is diagnostic, accurate, affordable, and highly scalable and has the potential to improve efficiency in hop breeding.
more »
« less
Exploring impact of recombination landscapes on breeding outcomes
Plant breeding relies on crossing-over to create novel combinations of alleles needed to confer increased productivity and other desired traits in new varieties. However, crossover (CO) events are rare, as usually only one or two of them occur per chromosome in each generation. In addition, COs are not distributed evenly along chromosomes. In plants with large genomes, which includes most crops, COs are predominantly formed close to chromosome ends, and there are few COs in the large chromosome swaths around centromeres. This situation has created interest in engineering CO landscape to improve breeding efficiency. Methods have been developed to boost COs globally by altering expression of anti-recombination genes and increase CO rates in certain chromosome parts by changing DNA methylation patterns. In addition, progress is being made to devise methods to target COs to specific chromosome sites. We review these approaches and examine using simulations whether they indeed have the capacity to improve efficiency of breeding programs. We found that the current methods to alter CO landscape can produce enough benefits for breeding programs to be attractive. They can increase genetic gain in recurrent selection and significantly decrease linkage drag around donor loci in schemes to introgress a trait from unimproved germplasm to an elite line. Methods to target COs to specific genome sites were also found to provide advantage when introgressing a chromosome segment harboring a desirable quantitative trait loci. We recommend avenues for future research to facilitate implementation of these methods in breeding programs.
more »
« less
- Award ID(s):
- 1822162
- PAR ID:
- 10426748
- Date Published:
- Journal Name:
- Proceedings of the National Academy of Sciences
- Volume:
- 120
- Issue:
- 14
- ISSN:
- 0027-8424
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Improving drought resistance in crops is imperative under the prevailing erratic rainfall patterns. Drought affects the growth and yield of most modern rice varieties. Recent breeding efforts aim to incorporate drought resistance traits in rice varieties that can be suitable under alternative irrigation schemes, such as in a (semi)aerobic system, as row (furrow-irrigated) rice. The identification of quantitative trait loci (QTLs) controlling grain yield, the most important trait with high selection efficiency, can lead to the identification of markers to facilitate marker-assisted breeding of drought-resistant rice. Here, we report grain yield QTLs under greenhouse drought using an F2:3 population derived from Cocodrie (drought sensitive) × Nagina 22 (N22) (drought tolerant). Eight QTLs were identified for yield traits under drought. Grain yield QTL under drought on chromosome 1 (phenotypic variance explained (PVE) = 11.15%) co-localized with the only QTL for panicle number (PVE = 37.7%). The drought-tolerant parent N22 contributed the favorable alleles for all QTLs except qGN3.2 and qGN5.1 for grain number per panicle. Stress-responsive transcription factors, such as ethylene response factor, WD40 domain protein, zinc finger protein, and genes involved in lipid/sugar metabolism were linked to the QTLs, suggesting their possible role in drought tolerance mechanism of N22 in the background of Cocodrie, contributing to higher yield under drought.more » « less
-
Abstract Transcriptome-wide association studies (TWASs) integrate expression quantitative trait loci (eQTLs) studies with genome-wide association studies (GWASs) to prioritize candidate target genes for complex traits. Several statistical methods have been recently proposed to improve the performance of TWASs in gene prioritization by integrating the expression regulatory information imputed from multiple tissues, and made significant achievements in improving the ability to detect gene-trait associations. Unfortunately, most existing multi-tissue methods focus on prioritization of candidate genes, and cannot directly infer the specific functional effects of candidate genes across different tissues. Here, we propose a tissue-specific collaborative mixed model (TisCoMM) for TWASs, leveraging the co-regulation of genetic variations across different tissues explicitly via a unified probabilistic model. TisCoMM not only performs hypothesis testing to prioritize gene-trait associations, but also detects the tissue-specific role of candidate target genes in complex traits. To make full use of widely available GWASs summary statistics, we extend TisCoMM to use summary-level data, namely, TisCoMM-S2. Using extensive simulation studies, we show that type I error is controlled at the nominal level, the statistical power of identifying associated genes is greatly improved, and the false-positive rate (FPR) for non-causal tissues is well controlled at decent levels. We further illustrate the benefits of our methods in applications to summary-level GWASs data of 33 complex traits. Notably, apart from better identifying potential trait-associated genes, we can elucidate the tissue-specific role of candidate target genes. The follow-up pathway analysis from tissue-specific genes for asthma shows that the immune system plays an essential function for asthma development in both thyroid and lung tissues.more » « less
-
Abstract Cultivated grapevines are commonly grafted on closely related species to cope with specific biotic and abiotic stress conditions. The three North American Vitis species V. riparia , V. rupestris , and V. berlandieri , are the main species used for breeding grape rootstocks. Here, we report the diploid chromosome-scale assembly of three widely used rootstocks derived from these species: Richter 110 (110R), Kober 5BB, and 101–14 Millardet et de Grasset (Mgt). Draft genomes of the three hybrids were assembled using PacBio HiFi sequences at an average coverage of 53.1 X-fold. Using the tool suite HaploSync, we reconstructed the two sets of nineteen chromosome-scale pseudomolecules for each genome with an average haploid genome size of 494.5 Mbp. Residual haplotype switches were resolved using shared-haplotype information. These three reference genomes represent a valuable resource for studying the genetic basis of grape adaption to biotic and abiotic stresses, and designing trait-associated markers for rootstock breeding programs.more » « less
-
ABSTRACT Landscape genomic approaches for detecting genotype‐environment associations (GEA), isolation by distance (IBD) and isolation by environment (IBE) have seen a dramatic increase in use, but there have been few thorough analyses of the influence of sampling strategy on their performance under realistic genomic and environmental conditions. We simulated 24,000 datasets across a range of scenarios with complex population dynamics and realistic landscape structure to evaluate the effects of the spatial distribution and number of samples on common landscape genomics methods. Our results show that common analyses are relatively robust to sampling scheme as long as sampling covers enough environmental and geographic space. We found that for detecting adaptive loci and estimatingIBE, sampling schemes that were explicitly designed to increase coverage of available environmental space matched or outperformed sampling schemes that only considered geographic space. When sampling does not cover adequate geographic and environmental space, such as with transect‐based sampling, we detected fewer adaptive loci and had higher error when estimatingIBDandIBE. We found thatIBDcould be detected with as few as nine sampling sites, while large sample sizes (e.g., greater than 100 individuals) were crucial for detecting adaptive loci andIBE. We also demonstrate that, even with optimal sampling strategies, landscape genomic analyses are highly sensitive to landscape structure and migration—when spatial autocorrelation and migration are weak, commonGEAmethods fail to detect adaptive loci.more » « less
An official website of the United States government

