Collagen, a vital protein that provides strength to various body tissues, has a triple helix structure containing three polypeptide chains. The chains are composed mostly of a tripeptide of glycine (G), proline (P), and hydroxyproline (O). Using molecular dynamics simulations and theoretical analysis, the study examines the mechanical response of collagen triple helix structures, made up of three different tripeptide units, when subjected to different fracture loading modes. The results show that collagen with GPO tripeptide units at their C-terminal are mechanically stronger than the POG and OGP units with a single amino-acid frame shift. Our work shows that the N-terminal has less effect on collagen fracture than the C-terminal. The differences in mechanical response are explained by the heterogenous rigidity of the amino acid backbone and the resulting shear lag effect near the terminal. The findings have potential applications in developing tough synthetic collagen for building materials and may stimulate further studies on the connection between terminal repeats and the mechanical-thermal behavior of other structural proteins such as silk, elastin, fibrin, and keratin.
more »
« less
Elucidating the Structural Impacts of Protein InDels
The effects of amino acid insertions and deletions (InDels) remain a rather under-explored area of structural biology. These variations oftentimes are the cause of numerous disease phenotypes. In spite of this, research to study InDels and their structural significance remains limited, primarily due to a lack of experimental information and computational methods. In this work, we fill this gap by modeling InDels computationally; we investigate the rigidity differences between the wildtype and a mutant variant with one or more InDels. Further, we compare how structural effects due to InDels differ from the effects of amino acid substitutions, which are another type of amino acid mutation. We finish by performing a correlation analysis between our rigidity-based metrics and wet lab data for their ability to infer the effects of InDels on protein fitness.
more »
« less
- Award ID(s):
- 2031260
- PAR ID:
- 10379035
- Date Published:
- Journal Name:
- Biomolecules
- Volume:
- 12
- Issue:
- 10
- ISSN:
- 2218-273X
- Page Range / eLocation ID:
- 1435
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Sequence alignment is an essential method in bioinformatics and the basis of many analyses, including phylogenetic inference, ancestral sequence reconstruction, and gene annotation. Sequencing artifacts and errors made during genome assembly, such as abiological frameshifts and incorrect early stop codons, can impact downstream analyses leading to erroneous conclusions in comparative and functional genomic studies. More significantly, while indels can occur both within and between codons in natural sequences, most amino-acid- and codon-based aligners assume that indels only occur between codons. This mismatch between biology and alignment algorithms produces suboptimal alignments and errors in downstream analyses. To address these issues, we present COATi, a statistical, codon-aware pairwise aligner that supports complex insertion–deletion models and can handle artifacts present in genomic data. COATi allows users to reduce the amount of discarded data while generating more accurate sequence alignments. COATi can infer indels both within and between codons, leading to improved sequence alignments. We applied COATi to a dataset containing orthologous protein-coding sequences from humans and gorillas and conclude that 41% of indels occurred between codons, agreeing with previous work in other species. We also applied COATi to semiempirical benchmark alignments and find that it outperforms several popular alignment programs on several measures of alignment quality and accuracy.more » « less
-
Students often struggle with visualizing protein structures when working with two-dimensional textbook and lecture materials, so introducing them to 3D visualization software developed by and for structural biologists offers them a unique opportunity to work with authentic data while furthering their spatial reasoning skills and understanding of molecular structure and function. This article presents an active learning virtual laboratory in which students use authentic structural biology data to investigate the effects of both hypothetical and real-world SARS-CoV-2 mutations on the virus’s ability to bind to human ACE2 receptors and infect a host, causing COVID-19. Through this activity, introductory-level college students or advanced high school students gain a better understanding of applied biology, such as how vaccines and treatments are designed, as well as strengthening their understanding of core disciplinary concepts, such as the relationship between protein structure and function and the central dogma of molecular biology. While there were challenges during the pilot phase of activity development due to COVID-19 restrictions, students in the pilot groups came away from the activity with deeper understanding of the relationship between proteins and amino acid sequences and a new appreciation for the ways researchers design treatments for and study viruses.more » « less
-
SARS-CoV-2, the cause of COVID-19, is a new, highly pathogenic coronavirus, which is the third coronavirus to emerge in the past 2 decades and the first to become a global pandemic. The virus has demonstrated itself to be extremely transmissible and deadly. Recent data suggest that a targeted approach is key to mitigating infectivity. Due to the proliferation of cataloged protein and nucleic acid sequences in databases, the function of the nucleic acid, and genetic encoded proteins, we make predictions by simply aligning sequences and exploring their homology. Thus, similar amino acid sequences in a protein usually confer similar biochemical function, even from distal or unrelated organisms. To understand viral transmission and adhesion, it is key to elucidate the structural, surface, and functional properties of each viral protein. This is typically first modeled in highly pathogenic species by exploring folding, hydrophobicity, and isoelectric point (IEP). Recent evidence from viral RNA sequence modeling and protein crystals have been inadequate, which prevent full understanding of the IEP and other viral properties of SARS-CoV-2. We have thus experimentally determined the IEP of SARS-CoV-2. Our findings suggest that for enveloped viruses, such as SARS-CoV-2, estimates of IEP by the amino acid sequence alone may be unreliable. We compared the experimental IEP of SARS-CoV-2 to variants of interest (VOIs) using their amino acid sequence, thus providing a qualitative comparison of the IEP of VOIs.more » « less
-
Abstract Honey bees have suffered dramatic losses in recent years, largely due to multiple stressors underpinned by poor nutrition [1]. Nutritional stress especially harms larvae, who mature into workers unable to meet the needs of their colony [2]. In this study, we characterize the metabolic capabilities of a honey bee larvae-associated bacterium, Bombella apis (formerly Parasaccharibacter apium), and its effects on the nutritional resilience of larvae. We found that B. apis is the only bacterium associated with larvae that can withstand the antimicrobial larval diet. Further, we found that B. apis can synthesize all essential amino acids and significantly alters the amino acid content of synthetic larval diet, largely by supplying the essential amino acid lysine. Analyses of gene gain/loss across the phylogeny suggest that four amino acid transporters were gained in recent B. apis ancestors. In addition, the transporter LysE is conserved across all sequenced strains of B. apis. Finally, we tested the impact of B. apis on developing honey bee larvae subjected to nutritional stress and found that larvae supplemented with B. apis are bolstered against mass reduction despite limited nutrition. Together, these data suggest a novel role of B. apis as a nutritional mutualist of honey bee larvae.more » « less
An official website of the United States government

