Abstract The adoption of agriculture triggered a rapid shift towards starch-rich diets in human populations1. Amylase genes facilitate starch digestion, and increased amylase copy number has been observed in some modern human populations with high-starch intake2, although evidence of recent selection is lacking3,4. Here, using 94 long-read haplotype-resolved assemblies and short-read data from approximately 5,600 contemporary and ancient humans, we resolve the diversity and evolutionary history of structural variation at the amylase locus. We find that amylase genes have higher copy numbers in agricultural populations than in fishing, hunting and pastoral populations. We identify 28 distinct amylase structural architectures and demonstrate that nearly identical structures have arisen recurrently on different haplotype backgrounds throughout recent human history.AMY1andAMY2Agenes each underwent multiple duplication/deletion events with mutation rates up to more than 10,000-fold the single-nucleotide polymorphism mutation rate, whereasAMY2Bgene duplications share a single origin. Using a pangenome-based approach, we infer structural haplotypes across thousands of humans identifying extensively duplicated haplotypes at higher frequency in modern agricultural populations. Leveraging 533 ancient human genomes, we find that duplication-containing haplotypes (with more gene copies than the ancestral haplotype) have rapidly increased in frequency over the past 12,000 years in West Eurasians, suggestive of positive selection. Together, our study highlights the potential effects of the agricultural revolution on human genomes and the importance of structural variation in human adaptation.
more »
« less
Independent amylase gene copy number bursts correlate with dietary preferences in mammals
Many mammals can digest starch by using an enzyme called amylase, but different species eat different amounts of starchy foods. Amylase is released by the pancreas, and in certain species such as humans, it is also created by the glands that produce saliva, allowing the enzyme to be present in the mouth. There, amylase can start to break down starch, releasing a sweet taste that helps the animal to detect starchy foods. Curiously, humans have multiple copies of the gene that codes for the enzyme, but the exact number varies between people. Previous research has found that populations with more copies also eat more starch; if this correlation also existed in other species, it could help to understand how diets influence and shape genetic information. In addition, it is unclear how amylase came to be present in saliva, as the ancestors of mammals only produced the protein in the pancreas. Pajic et al. analyzed the genomes of a range of mammals and found that the more starch a species had in its diet, the more amylase gene copies it harbored in its genome. In fact, unrelated mammals living in different habitats and eating different types of food have similar numbers of amylase gene copies if they have the same level of starch in their diet. In addition, Pajic et al. discovered that animals such as mice, rats, pigs and dogs, which have lived in close contact with people for thousands of years, quickly adapted to the large amount of starch present in human food. In each of these species, a mechanism called gene duplication independently created new copies of the amylase gene. This could represent the first step towards some of these copies becoming active in the glands that release saliva. In people, having fewer copies of the amylase gene could mean they have a higher risk for diabetes; this number is also tied to the composition of the collection of bacteria that live in the mouth and the gut. Understanding how the copy number of the amylase gene affects biology will help to grasp how it also affects health and wellbeing, in humans and in our four-legged companions.
more »
« less
- Award ID(s):
- 1714867
- PAR ID:
- 10095259
- Date Published:
- Journal Name:
- eLife
- Volume:
- 8
- ISSN:
- 2050-084X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
null (Ed.)Microbes (bacteria, yeasts, molds), in addition to plants and animals, were domesticated for their roles in food preservation, nutrition and flavor. Aspergillus oryzae is a domesticated filamentous fungal species traditionally used during fermentation of Asian foods and beverage, such as sake, soy sauce, and miso. To date, little is known about the extent of genome and phenotypic variation of A. oryzae isolates from different clades. Here, we used long-read Oxford Nanopore and short-read Illumina sequencing to produce a highly accurate and contiguous genome assemble of A. oryzae 14160, an industrial strain from China. To understand the relationship of this isolate, we performed phylogenetic analysis with 90 A. oryzae isolates and 1 isolate of the A. oryzae progenitor, Aspergillus flavus . This analysis showed that A. oryzae 14160 is a member of clade A, in comparison to the RIB 40 type strain, which is a member of clade F. To explore genome variation between isolates from distinct A. oryzae clades, we compared the A. oryzae 14160 genome with the complete RIB 40 genome. Our results provide evidence of independent evolution of the alpha-amylase gene duplication, which is one of the major adaptive mutations resulting from domestication. Synteny analysis revealed that both genomes have three copies of the alpha-amylase gene, but only one copy on chromosome 2 was conserved. While the RIB 40 genome had additional copies of the alpha-amylase gene on chromosomes III, and V, 14160 had a second copy on chromosome II and an third copy on chromosome VI. Additionally, we identified hundreds of lineage specific genes, and putative high impact mutations in genes involved in secondary metabolism, including several of the core biosynthetic genes. Finally, to examine the functional effects of genome variation between strains, we measured amylase activity, proteolytic activity, and growth rate on several different substrates. RIB 40 produced significantly higher levels of amylase compared to 14160 when grown on rice and starch. Accordingly, RIB 40 grew faster on rice, while 14160 grew faster on soy. Taken together, our analyses reveal substantial genome and phenotypic variation within A. oryzae .more » « less
-
Previous studies suggested that the copy number of the human salivary amylase gene,AMY1, correlates with starch-rich diets. However, evolutionary analyses are hampered by the absence of accurate, sequence-resolved haplotype variation maps. We identified 30 structurally distinct haplotypes at nucleotide resolution among 98 present-day humans, revealing that the coding sequences ofAMY1copies are evolving under negative selection. Genomic analyses of these haplotypes in archaic hominins and ancient human genomes suggest that a common three-copy haplotype, dating as far back as 800,000 years ago, has seeded rapidly evolving rearrangements through recurrent nonallelic homologous recombination. Additionally, haplotypes with more than threeAMY1copies have significantly increased in frequency among European farmers over the past 4000 years, potentially as an adaptive response to increased starch digestion.more » « less
-
Sohail, Mashaal (Ed.)Abstract Genes within the secretory calcium-binding phosphoprotein locus diversified along with the formation of a calcified skeleton in vertebrates, the emergence of tooth enamel in fish, and the introduction of lactation in mammals, at each stage marking major transitions in life history. The secretory calcium-binding phosphoprotein (SCPP) locus also harbors genes expressed primarily and abundantly in the saliva of humans. Here, we explored the phylogeny and evolution of the saliva-related SCPP genes by harnessing available genomic and transcriptomic resources. We observe extensive diversification of SCPP genes within mammals, driven by gene duplications and losses, with the most pronounced changes occurring in the SCPP genes that are expressed in salivary glands. When comparing rodent and human SCPP genes, we concluded that regulatory shifts and gene turnover events likely facilitated the accelerated gain of salivary gland expression. In primate genomes, we found more recent duplication events that affected genes coding for proteins secreted in saliva. Several saliva-related SCPP genes in the primate lineage show signatures of positive selection, while the other genes in the SCPP locus remain conserved. Our results position saliva-related SCPP genes as highly malleable to evolutionary innovation. Variations shaped by dietary and pathogenic pressures likely influenced the functional properties of saliva proteins, impacting metabolic and immune-related traits in oral health among primates, including humans.more » « less
-
Starch accumulates in the plastids of green plant tissues during the day to provide carbon for metabolism at night. Starch hydrolysis is catalyzed by members of the β-amylase (BAM) family, which in Arabidopsis thaliana (At) includes nine structurally and functionally diverse members. One of these enzymes, AtBAM2, is a plastid-localized enzyme that is unique among characterized β-amylases since it is tetrameric and exhibits sigmoidal kinetics. Sequence alignments show that the BAM domains of AtBAM7, a catalytically inactive, nuclear-localized transcription factor with an N-terminal DNA-binding domain, and AtBAM2 are more closely related to each other than they are to any other AtBAM. Since the BAM2 gene is found in more ancient lineages, it was hypothesized that the BAM7 gene evolved from BAM2 . However, analysis of the genomes of 48 flowering plants revealed 12 species that appear to possess a BAM7 gene but lack a BAM2 gene. Upon closer inspection, these BAM7 proteins have a greater percent identity to AtBAM2 than to AtBAM7, and they share all of the AtBAM2 functional residues that BAM7 proteins normally lack. It is hypothesized that these genes may encode BAM2-like proteins although they are currently annotated as BAM7-like genes. To test this hypothesis, a cDNA for the short form of corn BAM7 (ZmBAM7-S) was designed for expression in Escherichia coli . Small-angle X-ray scattering data indicate that ZmBAM7-S has a tetrameric solution structure that is more similar to that of AtBAM2 than to that of AtBAM1. In addition, partially purified ZmBAM7-S is catalytically active and exhibits sigmoidal kinetics. Together, these data suggest that some BAM7 genes may encode a functional BAM2. Exploring and understanding the β-amylase gene structure could have an impact on the current annotation of genes.more » « less
An official website of the United States government

