skip to main content


Title: Interactions of HMGB Proteins with the Genome and the Impact on Disease
High Mobility Group Box (HMGB) proteins are small architectural DNA binding proteins that regulate multiple genomic processes such as DNA damage repair, nucleosome sliding, telomere homeostasis, and transcription. In doing so they control both normal cellular functions and impact a myriad of disease states, including cancers and autoimmune diseases. HMGB proteins bind to DNA and nucleosomes to modulate the local chromatin environment, which facilitates the binding of regulatory protein factors to the genome and modulates higher order chromosomal organization. Numerous studies over the years have characterized the structure and function of interactions between HMGB proteins and DNA, both biochemically and inside cells, providing valuable mechanistic insight as well as evidence these interactions influence pathological processes. This review highlights recent studies supporting the roles of HMGB1 and HMGB2 in global organization of the genome, as well as roles in transcriptional regulation and telomere maintenance via interactions with G-quadruplex structures. Moreover, emerging models for how HMGB proteins function as RNA binding proteins are presented. Nuclear HMGB proteins have broad regulatory potential to impact numerous aspects of cellular metabolism in normal and disease states.  more » « less
Award ID(s):
1817442
NSF-PAR ID:
10334190
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Biomolecules
Volume:
11
Issue:
10
ISSN:
2218-273X
Page Range / eLocation ID:
1451
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Telomeres form the ends of linear chromosomes and usually comprise protein complexes that bind to simple repeated sequence motifs that are added to the 3′ ends of DNA by the telomerase reverse transcriptase (TERT). One of the primary functions attributed to telomeres is to solve the “end-replication problem” which, if left unaddressed, would cause gradual, inexorable attrition of sequences from the chromosome ends and, eventually, loss of viability. Telomere-binding proteins also protect the chromosome from 5′ to 3′ exonuclease action, and disguise the chromosome ends from the double-strand break repair machinery whose illegitimate action potentially generates catastrophic chromosome aberrations. Telomeres are of special interest in the blast fungus, Pyricularia , because the adjacent regions are enriched in genes controlling interactions with host plants, and the chromosome ends show enhanced polymorphism and genetic instability. Previously, we showed that telomere instability in some P. oryzae strains is caused by novel retrotransposons (MoTeRs) that insert in telomere repeats, generating interstitial telomere sequences that drive frequent, break-induced rearrangements. Here, we sought to gain further insight on telomeric involvement in shaping Pyricularia genome architecture by characterizing sequence polymorphisms at chromosome ends, and surrounding internalized MoTeR loci (relics) and interstitial telomere repeats. This provided evidence that telomere dynamics have played historical, and likely ongoing, roles in shaping the Pyricularia genome. We further demonstrate that even telomeres lacking MoTeR insertions are poorly preserved, such that the telomere-adjacent sequences exhibit frequent presence/absence polymorphism, as well as exchanges with the genome interior. Using TERT knockout experiments, we characterized chromosomal responses to failed telomere maintenance which suggested that much of the MoTeR relic-/interstitial telomere-associated polymorphism could be driven by compromised telomere function. Finally, we describe three possible examples of a phenomenon known as “Adaptive Telomere Failure,” where spontaneous losses of telomere maintenance drive rapid accumulation of sequence polymorphism with possible adaptive advantages. Together, our data suggest that telomere maintenance is frequently compromised in Pyricularia but the chromosome alterations resulting from telomere failure are not as catastrophic as prior research would predict, and may, in fact, be potent drivers of adaptive polymorphism. 
    more » « less
  2. INTRODUCTION To faithfully distribute genetic material to daughter cells during cell division, spindle fibers must couple to DNA by means of a structure called the kinetochore, which assembles at each chromosome’s centromere. Human centromeres are located within large arrays of tandemly repeated DNA sequences known as alpha satellite (αSat), which often span millions of base pairs on each chromosome. Arrays of αSat are frequently surrounded by other types of tandem satellite repeats, which have poorly understood functions, along with nonrepetitive sequences, including transcribed genes. Previous genome sequencing efforts have been unable to generate complete assemblies of satellite-rich regions because of their scale and repetitive nature, limiting the ability to study their organization, variation, and function. RATIONALE Pericentromeric and centromeric (peri/centromeric) satellite DNA sequences have remained almost entirely missing from the assembled human reference genome for the past 20 years. Using a complete, telomere-to-telomere (T2T) assembly of a human genome, we developed and deployed tailored computational approaches to reveal the organization and evolutionary patterns of these satellite arrays at both large and small length scales. We also performed experiments to map precisely which αSat repeats interact with kinetochore proteins. Last, we compared peri/centromeric regions among multiple individuals to understand how these sequences vary across diverse genetic backgrounds. RESULTS Satellite repeats constitute 6.2% of the T2T-CHM13 genome assembly, with αSat representing the single largest component (2.8% of the genome). By studying the sequence relationships of αSat repeats in detail across each centromere, we found genome-wide evidence that human centromeres evolve through “layered expansions.” Specifically, distinct repetitive variants arise within each centromeric region and expand through mechanisms that resemble successive tandem duplications, whereas older flanking sequences shrink and diverge over time. We also revealed that the most recently expanded repeats within each αSat array are more likely to interact with the inner kinetochore protein Centromere Protein A (CENP-A), which coincides with regions of reduced CpG methylation. This suggests a strong relationship between local satellite repeat expansion, kinetochore positioning, and DNA hypomethylation. Furthermore, we uncovered large and unexpected structural rearrangements that affect multiple satellite repeat types, including active centromeric αSat arrays. Last, by comparing sequence information from nearly 1600 individuals’ X chromosomes, we observed that individuals with recent African ancestry possess the greatest genetic diversity in the region surrounding the centromere, which sometimes contains a predominantly African αSat sequence variant. CONCLUSION The genetic and epigenetic properties of centromeres are closely interwoven through evolution. These findings raise important questions about the specific molecular mechanisms responsible for the relationship between inner kinetochore proteins, DNA hypomethylation, and layered αSat expansions. Even more questions remain about the function and evolution of non-αSat repeats. To begin answering these questions, we have produced a comprehensive encyclopedia of peri/centromeric sequences in a human genome, and we demonstrated how these regions can be studied with modern genomic tools. Our work also illuminates the rich genetic variation hidden within these formerly missing regions of the genome, which may contribute to health and disease. This unexplored variation underlines the need for more T2T human genome assemblies from genetically diverse individuals. Gapless assemblies illuminate centromere evolution. ( Top ) The organization of peri/centromeric satellite repeats. ( Bottom left ) A schematic portraying (i) evidence for centromere evolution through layered expansions and (ii) the localization of inner-kinetochore proteins in the youngest, most recently expanded repeats, which coincide with a region of DNA hypomethylation. ( Bottom right ) An illustration of the global distribution of chrX centromere haplotypes, showing increased diversity in populations with recent African ancestry. 
    more » « less
  3. Komeili, Arash (Ed.)
    ABSTRACT Histone proteins are found across diverse lineages of Archaea , many of which package DNA and form chromatin. However, previous research has led to the hypothesis that the histone-like proteins of high-salt-adapted archaea, or halophiles, function differently. The sole histone protein encoded by the model halophilic species Halobacterium salinarum , HpyA, is nonessential and expressed at levels too low to enable genome-wide DNA packaging. Instead, HpyA mediates the transcriptional response to salt stress. Here we compare the features of genome-wide binding of HpyA to those of HstA, the sole histone of another model halophile, Haloferax volcanii . hstA , like hpyA , is a nonessential gene. To better understand HpyA and HstA functions, protein-DNA binding data (chromatin immunoprecipitation sequencing [ChIP-seq]) of these halophilic histones are compared to publicly available ChIP-seq data from DNA binding proteins across all domains of life, including transcription factors (TFs), nucleoid-associated proteins (NAPs), and histones. These analyses demonstrate that HpyA and HstA bind the genome infrequently in discrete regions, which is similar to TFs but unlike NAPs, which bind a much larger genomic fraction. However, unlike TFs that typically bind in intergenic regions, HpyA and HstA binding sites are located in both coding and intergenic regions. The genome-wide dinucleotide periodicity known to facilitate histone binding was undetectable in the genomes of both species. Instead, TF-like and histone-like binding sequence preferences were detected for HstA and HpyA, respectively. Taken together, these data suggest that halophilic archaeal histones are unlikely to facilitate genome-wide chromatin formation and that their function defies categorization as a TF, NAP, or histone. IMPORTANCE Most cells in eukaryotic species—from yeast to humans—possess histone proteins that pack and unpack DNA in response to environmental cues. These essential proteins regulate genes necessary for important cellular processes, including development and stress protection. Although the histone fold domain originated in the domain of life Archaea , the function of archaeal histone-like proteins is not well understood relative to those of eukaryotes. We recently discovered that, unlike histones of eukaryotes, histones in hypersaline-adapted archaeal species do not package DNA and can act as transcription factors (TFs) to regulate stress response gene expression. However, the function of histones across species of hypersaline-adapted archaea still remains unclear. Here, we compare hypersaline histone function to a variety of DNA binding proteins across the tree of life, revealing histone-like behavior in some respects and specific transcriptional regulatory function in others. 
    more » « less
  4. Zhou, Jin-Qiu (Ed.)
    The telomere G-strand binding protein Pot1 plays multifaceted roles in telomere maintenance and protection. We examined the structure and activities of Pot1 in Ustilago maydis , a fungal model that recapitulates key features of mammalian telomere regulation. Compared to the well-characterized primate and fission yeast Pot1 orthologs, Um Pot1 harbors an extra N-terminal OB-fold domain (OB-N), which was recently shown to be present in most metazoans. Um Pot1 binds directly to Rad51 and regulates the latter’s strand exchange activity. Deleting the OB-N domain, which is implicated in Rad51-binding, caused telomere shortening, suggesting that Pot1-Rad51 interaction facilitates telomere maintenance. Depleting Pot1 through transcriptional repression triggered growth arrest as well as rampant recombination, leading to multiple telomere aberrations. In addition, telomere repeat RNAs transcribed from both the G- and C-strand were dramatically up-regulated, and this was accompanied by elevated levels of telomere RNA-DNA hybrids. Telomere abnormalities of pot1 -deficient cells were suppressed, and cell viability was restored by the deletion of genes encoding Rad51 or Brh2 (the BRCA2 ortholog), indicating that homology-directed repair (HDR) proteins are key mediators of telomere aberrations and cellular toxicity. Together, these observations underscore the complex physical and functional interactions between Pot1 and DNA repair factors, leading to context-dependent and dichotomous effects of HDR proteins on telomere maintenance and protection. 
    more » « less
  5. Abstract

    Heme b (iron protoporphyrin IX) plays important roles in biology as a metallocofactor and signaling molecule. However, the targets of heme signaling and the network of proteins that mediate the exchange of heme from sites of synthesis or uptake to heme dependent or regulated proteins are poorly understood. Herein, we describe a quantitative mass spectrometry (MS)-based chemoproteomics strategy to identify exchange labile hemoproteins in human embryonic kidney HEK293 cells that may be relevant to heme signaling and trafficking. The strategy involves depleting endogenous heme with the heme biosynthetic inhibitor succinylacetone (SA), leaving putative heme-binding proteins in their apo-state, followed by the capture of those proteins using hemin–agarose resin, and finally elution and identification by MS. By identifying only those proteins that interact with high specificity to hemin–agarose relative to control beaded agarose in an SA-dependent manner, we have expanded the number of proteins and ontologies that may be involved in binding and buffering labile heme or are targets of heme signaling. Notably, these include proteins involved in chromatin remodeling, DNA damage response, RNA splicing, cytoskeletal organization, and vesicular trafficking, many of which have been associated with heme through complementary studies published recently. Taken together, these results provide support for the emerging role of heme in an expanded set of cellular processes from genome integrity to protein trafficking and beyond.

     
    more » « less