skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Thirty years of molecular dynamics simulations on posttranslational modifications of proteins
Posttranslational modifications (PTMs) are an integral component to how cells respond to perturbation. While experimental advances have enabled improved PTM identification capabilities, the same throughput for characterizing how structural changes caused by PTMs equate to altered physiological function has not been maintained. In this Perspective, we cover the history of computational modeling and molecular dynamics simulations which have characterized the structural implications of PTMs. We distinguish results from different molecular dynamics studies based upon the timescales simulated and analysis approaches used for PTM characterization. Lastly, we offer insights into how opportunities for modern research efforts on in silico PTM characterization may proceed given current state-of-the-art computing capabilities and methodological advancements.  more » « less
Award ID(s):
1845606
PAR ID:
10407586
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Physical Chemistry Chemical Physics
Volume:
24
Issue:
43
ISSN:
1463-9076
Page Range / eLocation ID:
26371 to 26397
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The development and training of deep learning models have become increasingly costly and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for their downstream applications. The dynamics of the PTM supply chain remain largely unexplored, signaling a clear need for structured datasets that document not only the metadata but also the subsequent applications of these models. Without such data, the MSR community cannot comprehensively understand the impact of PTM adoption and reuse.This paper presents the PeaTMOSS dataset, which comprises metadata for 281,638 PTMs and detailed snapshots for all PTMs with over 50 monthly downloads (14,296 PTMs), along with 28,575 open-source software repositories from GitHub that utilize these models. Additionally, the dataset includes 44,337 mappings from 15,129 downstream GitHub repositories to the 2,530 PTMs they use. To enhance the dataset’s comprehensiveness, we developed prompts for a large language model to automatically extract model metadata, including the model’s training datasets, parameters, and evaluation metrics. Our analysis of this dataset provides the first summary statistics for the PTM supply chain, showing the trend of PTM development and common shortcomings of PTM package documentation. Our example application reveals inconsistencies in software licenses across PTMs and their dependent projects. PeaTMOSS lays the foundation for future research, offering rich opportunities to investigate the PTM supply chain. We outline mining opportunities on PTMs, their downstream usage, and cross-cutting questions.Our artifact is available at https://github.com/PurdueDualityLab/PeaTMOSS-Artifact. Our dataset is available at https://transfer.rcac.purdue.edu/file-manager?origin_id=ff978999-16c2-4b50-ac7a-947ffdc3eb1d&origin_path=%2F. 
    more » « less
  2. The development and training of deep learning models have become increasingly costly and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for their downstream applications. The dynamics of the PTM supply chain remain largely unexplored, signaling a clear need for structured datasets that document not only the metadata but also the subsequent applications of these models. Without such data, the MSR community cannot comprehensively understand the impact of PTM adoption and reuse. This paper presents the PeaTMOSS dataset, which comprises metadata for 281,638 PTMs and detailed snapshots for all PTMs with over 50 monthly downloads (14,296 PTMs), along with 28,575 open-source software repositories from GitHub that utilize these models. Additionally, the dataset includes 44,337 mappings from 15,129 downstream GitHub repositories to the 2,530 PTMs they use. To enhance the dataset’s comprehensiveness, we developed prompts for a large language model to automatically extract model metadata, including the model’s training datasets, parameters, and evaluation metrics. Our analysis of this dataset provides the first summary statistics for the PTM supply chain, showing the trend of PTM development and common shortcomings of PTM package documentation. Our example application reveals inconsistencies in software licenses across PTMs and their dependent projects. PeaTMOSS lays the foundation for future research, offering rich opportunities to investigate the PTM supply chain. We outline mining opportunities on PTMs, their downstream usage, and cross-cutting questions. Our artifact is available at https://github.com/PurdueDualityLab/PeaTMOSS-Artifact. Our dataset is available at https://transfer.rcac.purdue.edu/file-manager?origin_id=ff978999-16c2-4b50-ac7a-947ffdc3eb1d&origin_path=%2F. 
    more » « less
  3. null (Ed.)
    Histone post-translational modifications (PTMs) are epigenetic marks that modify the state of chromatin and lead to alterations in gene expression. Advances in mass spectrometry have enabled the high-throughput analysis of histone PTMs without the need for prior knowledge of individual PTMs of interest. In this study, the global histone PTM landscape was analyzed in the gills, kidney, and testes of Mozambique tilapia (Oreochromis mossambicus) through tandem mass spectrometry using data dependent acquisition (DDA-LCMS2) and PTM mapping approaches. PTM assignment to a specific amino acid was validated using A-score and localization probability scores that are based on the detection of diagnostic MSMS ions. These values signify the robustness of PTM assignment to a specific residue within the protein sequence. For PTMs that were represented by both modified and unmodified versions of the corresponding peptide, the stoichiometry was calculated and compared between tissues. We have identified multiple types of histone PTMs and assigned them to specific residues in each tissue. These PTMs include acetylation, methylation, demethylation, trimethylation, phosphorylation/ dehydration, and ubiquitination. Our results indicate that the gills, kidney, and testes each display a unique profile of histone PTMs. These data provide a strong basis for the generation of spectral libraries that enable high-throughput quantitative analyses of histone PTM stoichiometry on a global scale in tilapia exposed to diverse environmental and developmental contexts. 
    more » « less
  4. Reguera, Gemma (Ed.)
    ABSTRACT Polycyclic tetramate macrolactams (PTMs) are bioactive natural products commonly associated with certain actinobacterial and proteobacterial lineages. These molecules have been the subject of numerous structure-activity investigations since the 1970s. New members continue to be pursued in wild and engineered bacterial strains, and advances in PTM biosynthesis suggest their outwardly simplistic biosynthetic gene clusters (BGCs) belie unexpected product complexity. To address the origins of this complexity and understand its influence on PTM discovery, we engaged in a combination of bioinformatics to systematically classify PTM BGCs and PTM-targeted metabolomics to compare the products of select BGC types. By comparing groups of producers and BGC mutants, we exposed knowledge gaps that complicate bioinformatics-driven product predictions. In sum, we provide new insights into the evolution of PTM BGCs while systematically accounting for the PTMs discovered thus far. The combined computational and metabologenomic findings presented here should prove useful for guiding future discovery.<bold>IMPORTANCE</bold>Polycyclic tetramate macrolactam (PTM) pathways are frequently found within the genomes of biotechnologically important bacteria, includingStreptomycesandLysobacterspp.Their molecular products are typically bioactive, having substantial agricultural and therapeutic interest. Leveraging bacterial genomics for the discovery of new related molecules is thus desirable, but drawing accurate structural predictions from bioinformatics alone remains challenging. This difficulty stems from a combination of previously underappreciated biosynthetic complexity and remaining knowledge gaps, compounded by a stream of yet-uncharacterized PTM biosynthetic loci gleaned from recently sequenced bacterial genomes. We engaged in the following study to create a useful framework for cataloging historic PTM clusters, identifying new cluster variations, and tracing evolutionary paths for these molecules. Our data suggest new PTM chemistry remains discoverable in nature. However, our metabolomic and mutational analyses emphasize the practical limitations of genomics-based discovery by exposing hidden complexity. 
    more » « less
  5. Histone post-translational modifications (PTMs) are epigenetic marks that play a critical role in the expression and maintenance of DNA, but they remain largely uninvestigated in non-model organisms due to technical challenges. To begin alleviating this issue, we developed a workflow for histone PTM analysis in the Mozambique tilapia (Oreochromis mossambicus), being a widespread and environmentally hardy fish, using mass spectrometry methods. By incorporating multiple protein digestion methods into the preparation of each sample, we reliably quantified 503 biologically relevant histone PTMs. All of these histone PTMs, collectively referred to as the global histone PTM landscape, were characterized in the gills, kidney, and testes of this fish. By comparing the global histone PTM landscape between the three tissues, we found that 90.46% of histone PTMs were tissue-dependent. The workflow and tools for histone PTM analysis described in this study are now publicly available and enable comprehensive investigation into the influence of environmental stress on histone PTMs in non-model organisms. Given the functionality and flexibility of histone PTMs, we anticipate that the study of histone PTMs in ecologically relevant contexts will provide ground-breaking insights into comparative physiology and evolution. 
    more » « less