This content will become publicly available on June 13, 2025

Title: Top-down proteomics
Proteoforms, which arise from post-translational modifications, genetic polymorphisms and RNA splice variants, play a pivotal role as drivers in biology. Understanding proteoforms is essential to unravel the intricacies of biological systems and bridge the gap between genotypes and phenotypes. By analysing whole proteins without digestion, top-down proteomics (TDP) provides a holistic view of the proteome and can decipher protein function, uncover disease mechanisms and advance precision medicine. This Primer explores TDP, including the underlying principles, recent advances and an outlook on the future. The experimental section discusses instrumentation, sample preparation, intact protein separation, tandem mass spectrometry techniques and data collection. The results section looks at how to decipher raw data, visualize intact protein spectra and unravel data analysis. Additionally, proteoform identification, characterization and quantification are summarized, alongside approaches for statistical analysis. Various applications are described, including the human proteoform project and biomedical, biopharmaceutical and clinical sciences. These are complemented by discussions on measurement reproducibility, limitations and a forward-looking perspective that outlines areas where the field can advance, including potential future applications.
Award ID(s):
2307573
PAR ID:
10515024
Publisher / Repository:
Springer Nature
Journal Name:
Nature Reviews Methods Primers
Volume:
4
Issue:
1
ISSN:
2662-8449
Subject(s) / Keyword(s):
top-down proteomics
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Mass spectrometry (MS)-based spatially resolved top-down proteomics (TDP) of tissues is crucial for understanding the roles played by microenvironmental heterogeneity in the biological functions of organs and for discovering new proteoform biomarkers of diseases. There are few published spatially resolved TDP studies. One of the challenges relates to the limited performance of TDP for the analysis of spatially isolated samples using, for example, laser capture microdissection (LCM) because those samples are usually mass-limited. We present the first pilot study of LCM-capillary zone electrophoresis (CZE)-MS/MS for spatially resolved TDP, using zebrafish brain as the sample. The LCM-CZE-MS/MS platform employed a non-ionic detergent and a freeze–thaw method for efficient proteoform extraction from LCM isolated brain sections followed by CZE-MS/MS without any sample cleanup step, ensuring high sensitivity. Over 400 proteoforms were identified in a CZE-MS/MS analysis of one LCM brain section by consuming the protein content of roughly 250 cells. We observed drastic differences in proteoform profiles between two LCM brain sections isolated from the optic tectum (Teo) and telencephalon (Tel) regions. Proteoforms of three proteins (npy, penkb, and pyya) having neuropeptide hormone activity were exclusively identified in the isolated Tel section. Proteoforms of reticulon, myosin, and troponin were almost exclusively identified in the isolated Teo section, and those proteins play essential roles in visual and motor activities. The proteoform profiles accurately reflected the main biological functions of the Teo and Tel regions of the brain. Additionally, hundreds of post-translationally modified proteoforms were identified.
  2. We present a large‐scale top‐down proteomics (TDP) study of plant leaf and chloroplast proteins, achieving the identification of over 4700 unique proteoforms. Using capillary zone electrophoresis coupled with tandem mass spectrometry analysis of offline size‐exclusion chromatography fractions, we identify 3198 proteoforms for total leaf and 1836 proteoforms for chloroplast, with 1024 and 363 proteoforms having post‐translational modifications, respectively. The electrophoretic mobility prediction of capillary zone electrophoresis allowed us to validate post‐translational modifications that impact the charge state such as acetylation and phosphorylation. Identified modifications included Trp (di)oxidation events on six chloroplast proteins that may represent novel targets of singlet oxygen sensing. Furthermore, our TDP data provides direct experimental evidence of the N‐ and C‐terminal residues of numerous mature proteoforms from chloroplast, mitochondria, endoplasmic reticulum, and other sub‐cellular localizations. With this information, we suggest true transit peptide cleavage sites and correct sub‐cellular localization signal predictions. This large‐scale analysis illustrates the power of top‐down proteoform identification of post‐translational modifications and intact sequences that can benefit our understanding of both the structure and function of hundreds of plant proteins.
  3. Proteoforms, the different forms of a protein with sequence variations including post-translational modifications (PTMs), execute vital functions in biological systems, such as cell signaling and epigenetic regulation. Advances in top-down mass spectrometry (MS) technology have permitted the direct characterization of intact proteoforms and their exact number of modification sites, allowing for the relative quantification of positional isomers (PI). Protein positional isomers refer to a set of proteoforms with identical total mass and set of modifications, but varying PTM site combinations. The relative abundance of PI can be estimated by matching proteoform-specific fragment ions to top-down tandem MS (MS2) data to localize and quantify modifications. However, the current approaches heavily rely on manual annotation. Here, we present IsoForma, an open-source R package for the relative quantification of PI within a single tool. Benchmarking IsoForma's performance against two existing workflows produced comparable results and improvements in speed. Overall, IsoForma provides a streamlined process for quantifying PI, reduces the analysis time, and offers an essential framework for developing customized proteoform analysis workflows. The software is open source and available at https://github.com/EMSL-Computing/isoforma-lib. 
  4. Kelso, Janet (Ed.)
    Motivation: Native top-down proteomics (nTDP) integrates native mass spectrometry (nMS) with top-down proteomics (TDP) to provide comprehensive analysis of protein complexes together with proteoform identification and characterization. Despite significant advances in nMS and TDP software developments, a unified and user-friendly software package for analysis of nTDP data remains lacking. Results: We have developed MASH Native to provide a unified solution for nTDP to process complex datasets with database searching capabilities in a user-friendly interface. MASH Native supports various data formats and incorporates multiple options for deconvolution, database searching, and spectral summing to provide a “one-stop shop” for characterizing both native protein complexes and proteoforms. Availability and implementation: The MASH Native app, video tutorials, written tutorials, and additional documentation are freely available for download at https://labs.wisc.edu/gelab/MASH_Explorer/MASHSoftware.php. All data files shown in user tutorials are included with the MASH Native software in the download .zip file.
  5. Mass spectrometry (MS)-based top-down proteomics (TDP) requires high-resolution separation of proteoforms before electrospray ionization (ESI)-MS and tandem mass spectrometry (MS/MS). Capillary isoelectric focusing (cIEF)-ESI-MS and MS/MS could be an ideal method for TDP because cIEF can separate proteoforms based on their isoelectric points (pIs) with ultra-high resolution. cIEF-ESI-MS has been well recognized for protein characterization since the 1990s. However, the widespread adoption of cIEF-MS for the characterization of proteoforms has been impeded by several technical challenges, including the lack of a highly sensitive and robust ESI interface for coupling cIEF to MS, ESI suppression of analytes by ampholytes, and the requirement of manual operations. In this mini review, we summarize the technical improvements of cIEF-ESI-MS for characterizing proteoforms and highlight some recent applications to hydrophobic proteins, urinary albumin variants, charge variants of monoclonal antibodies, and large-scale TDP of complex proteomes.