Title: Large‐scale top‐down proteomics of the Arabidopsis thaliana leaf and chloroplast proteomes
We present a large-scale top-down proteomics (TDP) study of plant leaf and chloroplast proteins, achieving the identification of over 4700 unique proteoforms. Using capillary zone electrophoresis coupled with tandem mass spectrometry analysis of offline size-exclusion chromatography fractions, we identify 3198 proteoforms for total leaf and 1836 proteoforms for chloroplast, with 1024 and 363 proteoforms having post-translational modifications, respectively. The electrophoretic mobility prediction of capillary zone electrophoresis allowed us to validate post-translational modifications that impact the charge state, such as acetylation and phosphorylation. Identified modifications included Trp (di)oxidation events on six chloroplast proteins that may represent novel targets of singlet oxygen sensing. Furthermore, our TDP data provide direct experimental evidence of the N- and C-terminal residues of numerous mature proteoforms from chloroplast, mitochondria, endoplasmic reticulum, and other sub-cellular localizations. With this information, we suggest true transit peptide cleavage sites and correct sub-cellular localization signal predictions. This large-scale analysis illustrates the power of top-down proteoform identification of post-translational modifications and intact sequences that can benefit our understanding of both the structure and function of hundreds of plant proteins.
Award ID(s):
2034631 1846913
PAR ID:
10396153
Journal Name:
PROTEOMICS
ISSN:
1615-9853
Page Range / eLocation ID:
2100377
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Mass spectrometry (MS)‐based top‐down proteomics (TDP) analysis of histone proteoforms provides critical information about combinatorial post‐translational modifications (PTMs), which is vital for pursuing a better understanding of epigenetic regulation of gene expression. It requires high‐resolution separations of histone proteoforms before MS and tandem MS (MS/MS) analysis. In this work, for the first time, we combined SDS‐PAGE‐based protein fractionation (passively eluting proteins from polyacrylamide gels as intact species for mass spectrometry, PEPPI‐MS) with capillary zone electrophoresis (CZE)‐MS/MS for high‐resolution characterization of histone proteoforms. We systematically studied the histone proteoform extraction from SDS‐PAGE gel and follow‐up cleanup as well as CZE‐MS/MS, to determine an optimal procedure. The optimal procedure showed reproducible and high‐resolution separation and characterization of histone proteoforms. SDS‐PAGE separated histone proteins (H1, H2, H3, and H4) based on their molecular weight and CZE provided additional separations of proteoforms of each histone protein based on their electrophoretic mobility, which was affected by PTMs, for example, acetylation and phosphorylation. Using the technique, we identified over 200 histone proteoforms from a commercial calf thymus histone sample with good reproducibility. The orthogonal and high‐resolution separations of SDS‐PAGE and CZE made our technique attractive for the delineation of histone proteoforms extracted from complex biological systems.
  2. Mass spectrometry (MS)-based top-down characterization of integral membrane proteins (IMPs) is crucial for understanding their functions in biological processes. However, it is technically challenging due to their low solubility in typical MS-compatible buffers. In this work, for the first time, we developed an efficient capillary zone electrophoresis (CZE)-tandem MS (MS/MS) method for the top-down proteomics (TDP) of IMPs enriched from mouse brains. Our technique employs a sample buffer containing 30% (v/v) formic acid and 60% (v/v) methanol for solubilizing IMPs and utilizes a separation buffer of 30% (v/v) acetic acid and 30% (v/v) methanol for maintaining the solubility of IMPs during CZE separation. Single-shot CZE-MS/MS identified 51 IMP proteoforms from the mouse brain sample. Coupling size exclusion chromatography (SEC) to CZE-MS/MS enabled the identification of 276 IMP proteoforms from the mouse brain sample containing 1-4 transmembrane domains. This proof-of-concept work demonstrates the high potential of CZE-MS/MS for the large-scale TDP of IMPs. 
  3. Mass spectrometry (MS)-based top-down proteomics (TDP) has revolutionized biological research by measuring intact proteoforms in cells, tissues, and biofluids. Capillary zone electrophoresis-tandem MS (CZE-MS/MS) is a valuable technique for TDP, offering a high peak capacity and sensitivity for proteoform separation and detection. However, the long-term reproducibility of CZE-MS/MS in TDP remains unstudied, which is a crucial aspect for large-scale studies. This work investigated the long-term qualitative and quantitative reproducibility of CZE-MS/MS for TDP for the first time, focusing on a yeast cell lysate. Over 1000 proteoforms were identified per run across 62 runs using one linear polyacrylamide (LPA)-coated separation capillary, highlighting the robustness of the CZE-MS/MS technique. However, substantial decreases in proteoform intensity and identification were observed after some initial runs due to proteoform adsorption onto the capillary inner wall. To address this issue, we developed an efficient capillary cleanup procedure using diluted ammonium hydroxide, achieving high qualitative and quantitative reproducibility for the yeast sample across at least 23 runs. The data underscore the capability of CZE-MS/MS for large-scale quantitative TDP of complex samples, signaling its readiness for deployment in broad biological applications. The MS RAW files were deposited in ProteomeXchange Consortium with the data set identifier of PXD046651. 
  4. Mass spectrometry (MS)-based spatially resolved top-down proteomics (TDP) of tissues is crucial for understanding the roles played by microenvironmental heterogeneity in the biological functions of organs and for discovering new proteoform biomarkers of diseases. There are few published spatially resolved TDP studies. One of the challenges relates to the limited performance of TDP for the analysis of spatially isolated samples using, for example, laser capture microdissection (LCM) because those samples are usually mass-limited. We present the first pilot study of LCM-capillary zone electrophoresis (CZE)-MS/MS for spatially resolved TDP and used zebrafish brain as the sample. The LCM-CZE-MS/MS platform employed a non-ionic detergent and a freeze–thaw method for efficient proteoform extraction from LCM isolated brain sections followed by CZE-MS/MS without any sample cleanup step, ensuring high sensitivity. Over 400 proteoforms were identified in a CZE-MS/MS analysis of one LCM brain section via consuming the protein content of roughly 250 cells. We observed drastic differences in proteoform profiles between two LCM brain sections isolated from the optic tectum (Teo) and telencephalon (Tel) regions. Proteoforms of three proteins (npy, penkb, and pyya) having neuropeptide hormone activity were exclusively identified in the isolated Tel section. Proteoforms of reticulon, myosin, and troponin were almost exclusively identified in the isolated Teo section, and those proteins play essential roles in visual and motor activities. The proteoform profiles accurately reflected the main biological functions of the Teo and Tel regions of the brain. Additionally, hundreds of post-translationally modified proteoforms were identified. 
  5. Proteoforms, which arise from post-translational modifications, genetic polymorphisms and RNA splice variants, play a pivotal role as drivers in biology. Understanding proteoforms is essential to unravel the intricacies of biological systems and bridge the gap between genotypes and phenotypes. By analysing whole proteins without digestion, top-down proteomics (TDP) provides a holistic view of the proteome and can decipher protein function, uncover disease mechanisms and advance precision medicine. This Primer explores TDP, including the underlying principles, recent advances and an outlook on the future. The experimental section discusses instrumentation, sample preparation, intact protein separation, tandem mass spectrometry techniques and data collection. The results section looks at how to decipher raw data, visualize intact protein spectra and unravel data analysis. Additionally, proteoform identification, characterization and quantification are summarized, alongside approaches for statistical analysis. Various applications are described, including the human proteoform project and biomedical, biopharmaceutical and clinical sciences. These are complemented by discussions on measurement reproducibility, limitations and a forward-looking perspective that outlines areas where the field can advance, including potential future applications. 