Knowledge of locations and activities ofcis-regulatory elements (CREs) is needed to decipher basic mechanisms of gene regulation and to understand the impact of genetic variants on complex traits. Previous studies identified candidate CREs (cCREs) using epigenetic features in one species, making comparisons difficult between species. In contrast, we conducted an interspecies study defining epigenetic states and identifying cCREs in blood cell types to generate regulatory maps that are comparable between species, using integrative modeling of eight epigenetic features jointly in human and mouse in our Validated Systematic Integration (VISION) Project. The resulting catalogs of cCREs are useful resources for further studies of gene regulation in blood cells, indicated by high overlap with known functional elements and strong enrichment for human genetic variants associated with blood cell phenotypes. The contribution of each epigenetic state in cCREs to gene regulation, inferred from a multivariate regression, was used to estimate epigenetic state regulatory potential (esRP) scores for each cCRE in each cell type, which were used to categorize dynamic changes in cCREs. Groups of cCREs displaying similar patterns of regulatory activity in human and mouse cell types, obtained by joint clustering on esRP scores, harbor distinctive transcription factor binding motifs that are similar between species. An interspecies comparison of cCREs revealed both conserved and species-specific patterns of epigenetic evolution. Finally, we show that comparisons of the epigenetic landscape between species can reveal elements with similar roles in regulation, even in the absence of genomic sequence alignment.
more »
« less
An in vitro neurogenetics platform for precision disease modeling in the mouse
The power and scope of disease modeling can be markedly enhanced through the incorporation of broad genetic diversity. The introduction of pathogenic mutations into a single inbred mouse strain sometimes fails to mimic human disease. We describe a cross-species precision disease modeling platform that exploits mouse genetic diversity to bridge cell-based modeling with whole organism analysis. We developed a universal protocol that permitted robust and reproducible neural differentiation of genetically diverse human and mouse pluripotent stem cell lines and then carried out a proof-of-concept study of the neurodevelopmental geneDYRK1A. Results in vitro reliably predicted the effects of genetic background onDyrk1aloss-of-function phenotypes in vivo. Transcriptomic comparison of responsive and unresponsive strains identified molecular pathways conferring sensitivity or resilience toDyrk1a1Aloss and highlighted differential messenger RNA isoform usage as an important determinant of response. This cross-species strategy provides a powerful tool in the functional analysis of candidate disease variants identified through human genetic studies.
more »
« less
- Award ID(s):
- 1851697
- PAR ID:
- 10518022
- Publisher / Repository:
- AAAS
- Date Published:
- Journal Name:
- Science Advances
- Volume:
- 10
- Issue:
- 14
- ISSN:
- 2375-2548
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
To determine the error rate of transcription in human cells, we analyzed the transcriptome of H1 human embryonic stem cells with a circle-sequencing approach that allows for high-fidelity sequencing of the transcriptome. These experiments identified approximately 100,000 errors distributed over every major RNA species in human cells. Our results indicate that different RNA species display different error rates, suggesting that human cells prioritize the fidelity of some RNAs over others. Cross-referencing the errors that we detected with various genetic and epigenetic features of the human genome revealed that the in vivo error rate in human cells changes along the length of a transcript and is further modified by genetic context, repetitive elements, epigenetic markers, and the speed of transcription. Our experiments further suggest that BRCA1, a DNA repair protein implicated in breast cancer, has a previously unknown role in the suppression of transcription errors. Finally, we analyzed the distribution of transcription errors in multiple tissues of a new mouse model and found that they occur preferentially in neurons, compared to other cell types. These observations lend additional weight to the idea that transcription errors play a key role in the progression of various neurological disorders, including Alzheimer’s disease.more » « less
-
SUMMARY A major challenge in global crop production is mitigating yield loss due to plant diseases. One of the best strategies to control these losses is through breeding for disease resistance. One barrier to the identification of resistance genes is the quantification of disease severity, which is typically based on the determination of a subjective score by a human observer. We hypothesized that image‐based, non‐destructive measurements of plant morphology over an extended period after pathogen infection would capture subtle quantitative differences between genotypes, and thus enable identification of new disease resistance loci. To test this, we inoculated a genetically diverse biparental mapping population of tomato (Solanum lycopersicum) withRalstonia solanacearum, a soilborne pathogen that causes bacterial wilt disease. We acquired over 40 000 time‐series images of disease progression in this population, and developed an image analysis pipeline providing a suite of 10 traits to quantify bacterial wilt disease based on plant shape and size. Quantitative trait locus (QTL) analyses using image‐based phenotyping for single and multi‐traits identified QTLs that were both unique and shared compared with those identified by human assessment of wilting, and could detect QTLs earlier than human assessment. Expanding the phenotypic space of disease with image‐based, non‐destructive phenotyping both allowed earlier detection and identified new genetic components of resistance.more » « less
-
‘CandidatusLiberibacter’ is a group of bacterial species that are obligate intracellular plant pathogens and cause Huanglongbing disease of citrus trees and Zebra Chip in potatoes. Here, we examined the extent of intra- and interspecific genetic diversity across the genus using comparative genomics. Our approach examined a wide set ofLiberibactergenome sequences including five pathogenic species and one species not known to cause disease. By performing comparative genomics analyses, we sought to understand the evolutionary history of this genus and to identify genes or genome regions that may affect pathogenicity. With a set of 52 genomes, we performed comparative genomics, measured genome rearrangement, and completed statistical tests of positive selection. We explored markers of genetic diversity across the genus, such as average nucleotide identity across the whole genome. These analyses revealed the highest intraspecific diversity amongst the ‘Ca.Liberibacter solanacearum’ species, which also has the largest plant host range. We identified sets of core and accessory genes across the genus and within each species and measured the ratio of nonsynonymous to synonymous mutations (dN/dS) across genes. We identified ten genes with evidence of a history of positive selection in theLiberibactergenus, including genes in the Tad complex, which have been previously implicated as being highly divergent in the ‘Ca.L. capsica’ species based on high values of dN.more » « less
-
Microsatellites have been a workhorse of evolutionary genetic studies for decades and are still commonly in use for estimating signatures of genetic diversity at the population and species level across a multitude of taxa. Yet, the very high mutation rate of these loci is a double-edged sword, conferring great sensitivity at shallow levels of analysis (e.g. paternity analysis) but yielding considerable uncertainty for deeper evolutionary comparisons. For the present study, we used reduced representation genome-wide data (restriction site-associated DNA sequencing (RADseq)) to test for patterns of interspecific hybridization previously characterized using microsatellite data in a contact zone between two closely related mouse lemur species in Madagascar (Microcebus murinusandMicrocebus griseorufus). We revisit this system by examining populations in, near, and far from the contact zone, including many of the same individuals that had previously been identified as hybrids with microsatellite data. Surprisingly, we find no evidence for admixed nuclear ancestry. Instead, re-analyses of microsatellite data and simulations suggest that previously inferred hybrids were false positives and that the program NewHybridscan be particularly sensitive to erroneously inferring hybrid ancestry. Combined with results from coalescent-based analyses and evidence for local syntopic co-occurrence, we conclude that the two mouse lemur species are in fact completely reproductively isolated, thus providing a new understanding of the evolutionary rate whereby reproductive isolation can be achieved in a primate.more » « less
An official website of the United States government

