<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Top-down Proteomics of Myosin Light Chain Isoforms Define Chamber-Specific Expression in the Human Heart</title></titleStmt>
			<publicationStmt>
				<publisher>Elsevier -Science Direct</publisher>
				<date>01/27/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10522575</idno>
					<idno type="doi">10.1101/2023.01.26.525767</idno>
					
					<author>Elizabeth F Bayne</author><author>Kalina J Rossler</author><author>Zachery R Gregorich</author><author>Timothy J Aballo</author><author>David S Roberts</author><author>Emily A Chapman</author><author>Wei Guo</author><author>J Carter Ralphe</author><author>Timothy J Kamp</author><author>Ying Ge</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>Abstract</title> <p>Myosin functions as the “molecular motor” of the sarcomere and generates the contractile force necessary for cardiac muscle contraction. Myosin light chains 1 and 2 (MLC-1 and -2) play important functional roles in regulating the structure of the hexameric myosin molecule. Each of these light chains has an ‘atrial’ and ‘ventricular’ isoform, so called because they are believed to exhibit chamber-restricted expression in the heart. However, recently the chamber-specific expression of MLC isoforms in the human heart has been questioned. Herein, we analyzed the expression of MLC-1 and -2 atrial and ventricular isoforms in each of the four cardiac chambers in adult non-failing donor hearts using top-down mass spectrometry (MS)-based proteomics. Strikingly, we detected an isoform thought to be ventricular, MLC-2v, in the atria and confirmed the protein sequence using tandem MS (MS/MS). For the first time, a putative deamidation post-translation modification (PTM) located on MLC-2v in atrial tissue was localized to amino acid N13. MLC-1v and MLC-2a were the only MLC isoforms exhibiting chamber-restricted expression patterns across all donor hearts. Importantly, our results unambiguously show that MLC-1v, not MLC-2v, is ventricle-specific in adult human hearts. Overall, top-down proteomics allowed us an unbiased analysis of MLC isoform expression throughout the human heart, uncovering previously unexpected isoform expression patterns and PTMs.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The hexameric protein complex myosin functions as the "molecular motor" of the sarcomere and generates contractile force through cross bridge formation with actin <ref type="bibr">[1,</ref><ref type="bibr">2]</ref>. In cardiac tissue, myosin is composed of two heavy chains (MHC, either the &#945; or &#946; isoform) and four light chains (MLCs), which include two essential (ELC, or MLC-1) and two regulatory (RLC, or MLC-2) light chains <ref type="bibr">[3]</ref><ref type="bibr">[4]</ref><ref type="bibr">[5]</ref>. MLC-1 provides structural stability whereas MLC-2 regulates the function of the myosin motor through calcium binding and phosphorylation <ref type="bibr">[3]</ref>. MLC-1 and -2 each have two isoforms encoded by separate genes: MYL4 and MYL3 encode MLC-1a and MLC-1v, respectively, and MYL7 and MYL2 encode MLC-2a and MLC-2v <ref type="bibr">[6,</ref><ref type="bibr">7]</ref>.</p><p>Each respective light chain has an 'atrial' and a 'ventricular' isoform so called because early studies suggested the expression of these isoforms in the heart is chamber specific <ref type="bibr">[8,</ref><ref type="bibr">9]</ref>. It is noteworthy that MLC-2v has become a commonly-used marker of ventricular specification in stem cell-derived cardiomyocyte (hPSC-CM) cultures <ref type="bibr">[10]</ref>. However, the initial analysis of MLC expression in the heart was performed in small animal models and thus, the findings may not directly translate to humans due to significant physiological differences between rodent and human hearts <ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref>. Larger animal models are preferred for modeling cardiovascular disease because they have been shown to better replicate human cardiac physiology compared to small animal models <ref type="bibr">[15]</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref>. In particular, the myofilament composition and post-translational modifications (PTMs) of swine heart show greater similarity to humans than rodents <ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref>. Our previous study of the sarcomeric proteome in Yorkshire domestic pig hearts demonstrated the chamber-specific expression of MLC-1 and -2 ventricular and atrial isoforms <ref type="bibr">[24]</ref>. Nevertheless, despite the high similarity, there are still differences between large animal and human hearts so that a direct interrogation of human samples is essential <ref type="bibr">[25]</ref>. Thus, we performed a direct analysis of a sex-balanced cohort of non-failing human donor hearts to decipher basal levels of MLC isoform expression and PTMs across the four chambers of the human heart in both sexes.</p><p>Top-down proteomics analyzes intact proteins, enabling unbiased analysis of proteoforms, a term describing all protein products produced from a single gene as a result of post-translational modifications (PTMs), sequence variants, and alternative splicing <ref type="bibr">[26]</ref>, as well as related proteins arising from different genes (e.g., isoforms not resulting from alternative splicing of a single gene) <ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref><ref type="bibr">[30]</ref><ref type="bibr">[31]</ref>. In contrast to the conventional bottom-up proteomics which analyzes peptides digested from proteins thus suffers from the peptide-to-protein inference problem <ref type="bibr">[32,</ref><ref type="bibr">33]</ref>, top-down proteomics analyzes proteins directly without digestions, providing accurate identification and quantification of protein isoforms and PTMs <ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref><ref type="bibr">34]</ref>. Thus, a direct analysis of MLC isoforms and proteoforms by top-down proteomics allows accurate quantification of their levels without a priori knowledge of sample contents. Previously, top-down proteomics was employed to characterize the amino acid sequences and N-terminal modifications of atrial and ventricular MLC isoforms from human and swine myocardial tissue <ref type="bibr">[35]</ref>.</p><p>In this study, we employed top-down mass spectrometry (MS)-based proteomics to systematically evaluate differences in MLC proteoform and isoform expression in the four chambers of non-failing human donor hearts (n = 17). Surprisingly, our analysis revealed MLC-1v, but not MLC-2v, exhibited ventricle-specific expression, while MLC-2a exhibits atria-restricted expression. The exact amino acid sequence of MLC-2v in atrial tissue was then confirmed by online liquid chromatographytandem MS (LC-MS/MS). Interestingly, we found a mixture of deamidated and non-deamidated MLC-2v species in the atria that was not observed in ventricular tissue. Overall, top-down proteomics provided us with an unbiased analysis of the MLC proteoform distribution in the four chambers of the non-failing adult human heart.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Materials and methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Human donor heart collection</head><p>Myocardial tissue from adult donor hearts with no history of cardiovascular disease were obtained from the University of Wisconsin Organ Procurement Organization (Table <ref type="table">S1</ref> details clinical characteristics). Tissue collection procedures were approved by the Institutional Review Board of the University of Wisconsin-Madison (Study # 2013-1264). Cardiac tissue was dissected into left ventricle (LV), right ventricle (RV), left atrium (LA), and right atrium (RA) while fresh and immediately snap frozen in liquid nitrogen to preserve PTMs <ref type="bibr">[36]</ref> (Fig. <ref type="figure">S1</ref>). The basal ventricular mid-wall was used as a representative region for LV and RV.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Sarcomeric protein extraction</head><p>Sarcomeric proteins were extracted from human cardiac tissue using differential pH extraction as reported previously (Fig. <ref type="figure">S2</ref>) <ref type="bibr">[36,</ref><ref type="bibr">37]</ref>. Cardiac tissue was homogenized in 10 volumes of a pH-neutral buffer (25 mM HEPES pH 7.4, 60 mM NaF, 1 mM Na3VO4, 1 mM PMSF, 2.5 mM EDTA, and 1&#215; Halt Protease and Phosphatase inhibitor) and centrifuged for 30 min at 18,000 &#215;g at 4 &#8226; C. The supernatant was removed, and the remaining pellet was homogenized in a 1% TFA solution (1% TFA, 1 mM TCEP, 10 mM L-Methionine) and centrifuged at 21,000 &#215;g at 4 &#8226; C for 15 min. The resulting myofilament-enriched supernatant ("TFA extract") was prepared for MS analysis. The total protein content of the TFA extract was normalized by biochitonic assay and prepared in a diluent of 0.1% formic acid at 2 mM TCEP. Proteins were separated by online reversed phase liquid chromatography (LC) prior to MS analysis (LC-MS).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Online LC-MS analysis</head><p>LC-MS profiling was performed using a NanoAcquity ultra-high pressure LC system (Waters, Milford, MA) coupled to a high-resolution impact II quadrupole time-of-flight (Q-TOF) mass spectrometer (Bruker, Bremen, Germany) <ref type="bibr">[24,</ref><ref type="bibr">38]</ref>. 400 ng total protein per injection were loaded on a home-packed PLRP column (250 &#956;m i.d. x 200 mm length, PLRP-S, 10 &#956;m particle size, 1000 &#197; pore size). Sarcomeric proteins were eluted using a flow rate of 4 &#956;L/min over a gradient of 5% to 95% mobile phase B (0.1% formic acid in 50:50 acetonitrile and ethanol) over 40 min (mobile phase A: 0.1% formic acid in water). Electrospray ionization was used to introduce eluted proteins to the impact II Q-TOF mass spectrometer. Mass spectra were acquired at a scan rate of 0.5 Hz over 500-2000 m/z range. Injection replicates (n = 3) were performed at the beginning of each run to ensure instrument reproducibility and extraction replicates (n = 3) were analyzed to evaluate the reproducibility of sample preparation. Isotopic resolution was achieved for each analyte, and all proteins were identified with high mass accuracy (&lt;3 ppm mass error).</p><p>Online tandem mass spectra were collected by data-dependent acquisition settings. The top 5 highest intensity precursors were selected and fragmented using collision-induced dissociation (CID). MS1 scan rate was 1 Hz and MS2 scan rate varied from 4 to 8 Hz depending on precursor ion intensity. Collisional energies were set to 18-45 eV. An active exclusion period of 1 min was used to maximize number of precursors selected. Electron Transfer Dissociation (ETD) was performed on the maXis II Q-TOF MS. The top 3 most abundant precursors in each MS1 spectrum were selected for fragmentation. The ETD settings were as follows: ion charge control ranging from 0 to 1000 ms, a reagent accumulation time of 25 ms, and an extended reaction time of 50 ms.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Data analysis</head><p>Mass spectra were analyzed using DataAnalysis v4.3 (Bruker Daltonics) and MASH Explorer <ref type="bibr">[39]</ref>. Mass spectra were averaged across retention time windows within the BPC to create MS1 spectra. Spectral deconvolution using the Maximum Entropy algorithm with a resolving power of 60,000 was performed to reveal the monoisotopic mass of the protein, and putative proteoform identifications are made based on accurate mass. Peaks whose molecular masses matched closely to predicted masses from the UniProtKB database (&#8804; 10 ppm mass error) were assigned the corresponding proteoform identity (Table <ref type="table">S2</ref>). Phosphorylated proteoform quantitation is performed using peak intensities of unmodified and phosphorylated peaks, yielding a ratio of relative phosphorylation, P total (mol P i /mol protein), reported as average &#177; Standard Error of the Mean (SEM) <ref type="bibr">[22]</ref>. <ref type="bibr">[40]</ref> Theoretical isotopic distributions of precursor and fragment ions were generated using IsoPro v3.0.</p><p>To better visualize isoforms over a wide retention time window, extracted ion chromatograms (EICs) were generated for quantitation (Fig. <ref type="figure">S3</ref>). To quantitate relative intensities of isoforms in each sample, peak intensities from deconvoluted spectra were used if the isoforms eluted within the selected retention time window. To quantitate the relative intensity of isoforms in each sample, EICs were created from the top 5 most abundant ions (&#177; 0.2 m/z) for each proteoform as described previously <ref type="bibr">[41]</ref>. Isoform intensity was normalized by dividing the peak areas under the curve (AUC) by the sum of peak areas for all isoforms pertaining to the protein family (e.g. MLC-1v/total MLC-1 content) <ref type="bibr">[38]</ref>. For tandem MS sequence characterization, the monoisotopic mass of each fragment ion was analyzed against each putative protein sequence using IsoPro v3.0 using a mass error tolerance of 10 ppm. Isotopic distributions and theoretical fit were generated using IsoPro.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5.">Statistics</head><p>P total values were analyzed for statistical significance using RStudio v1.2.5033. Differences between four chambers were evaluated using a linear model. One-way analysis of variance (ANOVA) was used to determine statistical significance between the four chambers. Tukey's HSD post hoc test was used for a pairwise evaluation between regions and adjusted p values are reported. A 95% confidence interval was used for Tukey's HSD. Differences between means were considered statistically significant if p &lt; 0.05. Levels of statistical significance are notated with an asterisk (*): *p &#8804; 0.05, **p &lt; 0.01, and ***p &lt; 0.001; no statistical significance (ns) if p &gt; 0.05.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head><p>In this study, we employed top-down MS-based proteomics to comprehensively analyze the expression patterns of MLC-1 and -2 isoforms across the four chambers of non-failing human donor hearts (n = 17). Clinical characteristics of donors used in this study are presented in Table <ref type="table">S1</ref>. Our platform efficiently captures the diversity of a complex protein mixture, enabling quantification of the relative abundance of proteoforms and isoforms in a single LC-MS experiment. This approach includes myocardial tissue homogenization, sarcomeric protein extraction, LC-MS/MS, and data analysis (Fig. <ref type="figure">S2</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Myofilament protein extraction and LC-MS/MS platform is highly reproducible</head><p>Technical reproducibility of the online LC-MS/MS platform was demonstrated by alignment of retention times and intensity of base peak chromatograms (BPC) between injections (Fig. <ref type="figure">S4</ref>). Method reproducibility was established by replicate extraction of proteins from the same cardiac chamber (n = 3) (Fig. <ref type="figure">S5</ref>). Proteoform quantitation was demonstrated by comparable intensity of the phosphorylated MLC-2v proteoform between extraction replicates (Fig. <ref type="figure">S5B</ref>). Reproducible isoform quantitation was demonstrated by comparing the integrated area of MLC-2v EICs between technical replicates, ranging from 0.1 to 6% relative standard deviation across each cardiac chamber (Fig. <ref type="figure">S5C</ref>). Finally, we demonstrated the highly reproducible detection of MLC-2v proteoform in atrial tissue despite the fact that the extraction and LC-MS sample runs were performed by different personnel (Fig. <ref type="figure">S6</ref>). MSbased quantification of MLC expression was performed confidently after demonstration of high technical and extraction reproducibility.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Top-down MS and MS/MS analysis of MLC-1 shows MLC-1v is chamber-specific</head><p>MLC-1a and MLC-1v were detected with high mass accuracy, matching their respective theoretical monoisotopic masses with high mass accuracy (Fig. <ref type="figure">1A</ref>, Table <ref type="table">S2</ref>). MS analysis revealed single peaks corresponding to the ventricular and atrial MLC-1 isoforms in their respective chambers. The peak with a molecular mass of 21,828.91 Da detected in ventricular tissue matched well with the MLC-1v sequence in the UniProt/SwissProt database (Accession No. P08590-1, MYL3_HU-MAN) when considering N-terminal methionine removal (-131.04 Da) and N &#945; -trimethylation (+42.05 Da), consistent with previous reported N-terminal modifications (theoretical mass: 21,828.93 Da, 0.9 ppm mass error) <ref type="bibr">[35]</ref>. The peak detected in atrial tissue with a molecular mass of 21,461.79 Da matched with the reported sequence of MLC-1a (Accession No. P12829-1, MYL4_HUMAN) when accounting for N-terminal methionine removal and N &#945; -trimethylation as reported previously (theoretical mass: 21,461.87 Da, 3 ppm mass error) <ref type="bibr">[35]</ref>.</p><p>LC-MS/MS generated N-and C-terminal protein fragment ions using CID. Fragment ions confirmed the MLC-1 isoform sequences with Nterminal modifications as previously reported <ref type="bibr">[35]</ref>. Ions corresponding to MLC-1v, including b 23 and y 80 , were observed consistently across LV and RV preparations from different hearts (Fig. <ref type="figure">S7A</ref>), matching closely to theoretical isotopic distributions with high mass accuracy. In total, 22 b and 16 y ions were identified within &#8804;10 ppm mass error, corresponding to 87% sequence coverage of MLC-1v from a single LC-MS/MS run (Fig. <ref type="figure">S8</ref>). Similarly, key N-and C-terminal fragments matching MLC-1a were identified with consistency across LA and RA preparations from different hearts, matching theoretical isotopic distributions with high mass accuracy (Fig. <ref type="figure">S7B</ref>). MLC-1a fragmentation by CID yielded 31 total b and 15 y ions within &#8804;10 ppm mass error (Fig. <ref type="figure">S9</ref>). Sequence alignment of MLC-1v and MLC-1v revealed 80.5% sequence similarity between isoforms (Fig. <ref type="figure">S10A</ref>).</p><p>Quantification of MLC-1 isoforms revealed MLC-1v was ventriclespecific. MLC-1v was the most abundant isoform in LV and RV, averaging 99 &#177; 0.6% MLC-1v in LV and 97 &#177; 1.1% in RV (Fig. <ref type="figure">1C</ref>, Fig. <ref type="figure">S11</ref>). In-depth analysis of the raw mass spectra showed MLC-1v was not detected above baseline noise in LA and RA (Fig. <ref type="figure">S12</ref>). One-way ANOVA was used to evaluate MLC-1v expression across the four chambers (p = 2.2e-16), and pairwise comparisons revealed statistically significant contrasts between LA and RA when compared to LV and RV (p &lt; 0.0001 for each comparison, for details refer to Table <ref type="table">S3</ref>). No lateral distinctions were noted in the ventricles and atria (LV-RV p = 0.27; LA-RA p = 1.00).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Ventricular isoform of MLC-2 detected in atrial tissues using LC-MS/ MS</head><p>Top-down MS revealed 2 proteoforms related to MLC-2v and 3 proteoforms related to MLC-2a. Two proteoforms with molecular masses of 18,699.40 and 18,768.36 Da were detected in each cardiac chamber (Fig. <ref type="figure">1B</ref>, Table <ref type="table">S2</ref>). The peak with a mass of 18,699.40 Da matched the calculated molecular mass of MLC-2v (Accession No. P10916-1, MLRV_HUMAN) with 3 ppm mass error after considering N-terminal methionine cleavage and N &#945; -trimethylation as previously reported. The peak of 18,768.36 Da was an additional +79.97 Da from MLC-2v, corresponding to the addition of meta-phosphoric acid HPO 3 and, thus, was assigned as mono-phosphorylated pMLC-2v (2 ppm mass error). The remaining three proteoforms were exclusively detected in atria and had molecular masses of <ref type="bibr">19,346.54, 19,426.50, and 19,506.47</ref> Da. The 19,346.54 Da peak matches with predicted molecular mass of MLC-2a (Accession No. Q01449-1, MLRA_HUMAN) with 3 ppm mass error after considering N-terminal methionine cleavage and N &#945; -acetylation (+42.01 Da). Mono-phosphorylated p MLC-2a was assigned to the peak of 19,426.50 Da due to an addition of +79.97 Da from MLC-2a (5 ppm mass error) and bis-phosphorylated p MLC-2a was assigned to peak of 19,506.47 Da due to a mass difference of +159.91 Da (2 ppm mass error). Each isoform showed good fitting of theoretical isotopic abundance and distribution of the isotopomer peaks (Fig. <ref type="figure">S14</ref>), including phosphorylated proteoforms (Fig. <ref type="figure">S15</ref>).</p><p>MS/MS enabled by CID was used to confirm the identity of MLC-2v and pMLC-2v across the four cardiac chambers. A total of 47 MLC-2v fragment ions were identified across LV, RV, LA, and RA, including 22 b and 25 y ions (Fig. <ref type="figure">S16</ref>). For example, the y 24 ion corresponding to Cterminal MLC-2v matched closely to the theoretical isotopic distribution for each cardiac chamber, and the experimentally determined mass matches the theoretical mass with sub-ppm accuracy, indicating agreement with previously reported MLC-2v sequence (Fig. <ref type="figure">S17A</ref>). Phosphorylation of MLC-2v was also confirmed by tandem MS (Fig. <ref type="figure">S18</ref>). 33 fragment ions were identified correlating to pMLC-2v, including 13 b and 20 y ions (Fig. <ref type="figure">S18B</ref>). Fragment ion spectra enabled the mapping of phosphorylation site to Ser14 by detecting N-terminal b ions before and after the addition of HPO 3 (Fig. <ref type="figure">S18C</ref>, <ref type="figure">D</ref>). Due to considerable sequence heterogeneity between MLC-2v and MLC-2a <ref type="bibr">[42]</ref>, the observed fragment ions overwhelmingly indicate the presence of MLC-2v in the atrial chambers and do not match with a truncated or alternatively-spliced MLC-2a isoform. LC-MS/MS revealed key fragment ions matching to the reported sequence of MLC-2a, such as the y 25 ion shown across left and right atrial tissues (Fig. <ref type="figure">S17B</ref>). In sum, 37 b and 12 y ions were matched to the MLC-2a sequence within &#8804;10 ppm mass error, resulting in 61% sequence coverage (Fig. <ref type="figure">S196</ref>). Sequence alignment of MLC-2v and MLC-2a revealed 62.1% sequence similarity between isoforms (Fig. <ref type="figure">S10B</ref>).</p><p>Quantification of MLC-2 isoforms revealed significant differences of MLC-2v expression across the four cardiac chambers. In LV and RV, MLC-2v remained at 100% of total signal. MLC-2v content ranged widely in atrial tissues, from 6 to 68% in LA and 0-39% in RA, averaging 34 &#177; 5.1% MLC-2v in LA and 17 &#177; 2.7% in RA. MLC-2a was not detected above baseline noise in LV and RV, as shown by close examination of each raw mass spectrum (Fig. <ref type="figure">S14</ref>). MLC-2v expression across the four chambers was significant by one-way ANOVA (p = 2.0e-16) (Fig. <ref type="figure">1D</ref>). Pairwise comparisons between LV, RV, LA, and RA revealed significant differences between ventricles and atria (all pairwise comparisons p &lt; 0.0001 by Tukey HSD). The % MLC-2v was also statistically significant laterally, between LA and RA (p = 0.0003).</p><p>No chamber-related differences in relative phosphorylation (mol P i / mol protein, or P total ) of MLC-2 isoforms were detected. The relative phosphorylation (P total ) of MLC-2v is 0.15 &#177; 0.02 in LV and 0.13 &#177; 0.02 in RV, 0.19 &#177; 0.03 in LA, and 0.16 &#177; 0.03 in RA. These values did not yield a significant difference across the four chambers (p = 0.48) when compared by one-way ANOVA (Fig. <ref type="figure">S15C</ref>). Similarly, a comparison of the relative phosphorylation (P total ) of MLC-2a between the left and right atria (0.25 &#177; 0.03 in LA and 0.22 &#177; 0.03 in RA) was not significantly different (p = 0.52 by Student t-test).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">Atrial-specific deamidation of MLC-2v localized to N13 by MS/MS</head><p>Top-down MS-based proteomics identified and localized an atrialspecific deamidation of human MLC-2v. A difference of +0.98 Da was found in the measured monoisotopic masses of MLC-2v in atrial tissues compared to ventricular tissue (Fig. <ref type="figure">2A</ref>). N-terminal fragment ions produced by CID MS/MS revealed a + 0.98 Da mass shift near the Nterminus of the protein. This phenomenon was observed in all N-terminal fragments produced by CID MS/MS in atrial tissue, including the shortest fragment b 15 (Fig. <ref type="figure">2B</ref>). Putative PTMs matching the observed mass shift are deamidation of asparagine <ref type="bibr">[43]</ref> or citrullination of arginine <ref type="bibr">[44]</ref>. Since both asparagine and arginine residues are in the first 15 aa residues of MLC-2v sequence, fragmentation closer to the N-terminus was needed to localize and identify the PTM. However, the CID spectra contained no N-terminal fragments shorter than b 15 , possibly due to the Fig. <ref type="figure">1</ref>. Top-down MS of MLCs Reveals MLC-1v is Ventricle-specific. A) Deconvoluted spectra showing expression of ventricular and atrial isoforms of myosin light chain 1 (MLC-1). MLC-1v is the dominant isoform in LV and RV tissues and was exclusively detected in LV and RV. MLC-1a was the only isoform detected in LA and RA tissues. B) Deconvoluted spectra showing ventricular and atrial isoforms of myosin light chain 2 (MLC-2). MLC-2v was consistently detected throughout all four chambers of the human heart. MLC-2v was the dominant isoform in LV and RV and detected at varying levels in LA and RA. The atrial isoform MLC-2a was detected exclusively in LA and RA. Isotopic resolution was achieved for each of the isoforms and all proteins were identified with high mass accuracy (3 ppm mass error). C) Quantitation of the expression of MLC-1v relative to total MLC-1a + MLC-1v in LV, RV, LA, and RA (n = 17 donors). Statistically-significant differences across the four chambers were determined by one-way ANOVA followed by Tukey's HSD post hoc test for pairwise comparisons (p &#8804; 0.05). One-way ANOVA resulted in overall p value of 2.2e-16 (Table <ref type="table">S3</ref>). Results of pairwise comparisons are reported on the plot and are notated with an asterisk (*): *p &#8804; 0.05, **p &lt; 0.01, and ***p &lt; 0.001; no statistical significance if p &gt; 0.05. D) Quantitation of the expression of MLC-2v relative to total MLC-2a + MLC-2 V in LV, RV, LA, and RA (n = 17 donors). Statistically-significant differences across the four chambers were determined by one-way ANOVA followed by Tukey's HSD post hoc test for pairwise comparisons (p &#8804; 0.05). One-way ANOVA resulted in overall p value of 2.0e-16 (Table <ref type="table">S3</ref>). Results of pairwise comparisons are reported on the plot and are notated with an asterisk (*): *p &#8804; 0.05, **p &lt; 0.01, and ***p &lt; 0.001; no statistical significance if p &gt; 0.05. basic nature of amino acid residues, which may have inhibited fragmentation by CID by conferring resonance-induced stability to the peptide backbone <ref type="bibr">[45]</ref>.</p><p>To further localize the putative modification, we utilized ETD to generate shorter N-terminal fragments. ETD revealed shorter N-terminal fragments including c 7 , c 8 , c 9 , and c 12 that matched well to calculated isotopic distribution of MLC-2v sequence and displayed comparable intensities to the same fragment ions observed in LV and RV (Fig. <ref type="figure">S20A</ref>). This result eliminates the possibility of citrullination at the Arg8 residue, indicating the PTM is located between residues aa <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref><ref type="bibr">[15]</ref>. A c 14 ion showed a reduction of monoisotopic intensity and aligned to the MLC-2v aa <ref type="bibr">[1]</ref><ref type="bibr">[2]</ref><ref type="bibr">[3]</ref><ref type="bibr">[4]</ref><ref type="bibr">[5]</ref><ref type="bibr">[6]</ref><ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref> when accounting for deamidation [+O-NH] (Fig. <ref type="figure">S20B</ref>), indicating the PTM was located between aa <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref>. We assigned the putative PTM corresponding to +0.98 Da as deamidation located at Asp13 based on the identity of isolated residues and mass shift. We conclude there is a mixture of deamidated and unmodified MLC-2v proteoform species in the atrial tissues due to the observed overlapping isotopic envelopes of N-terminal fragment ions. Since this PTM was only detected in LA and RA (LV or RV had normal isotopomer distribution matching the predicted N-terminal sequence), the observed atrial deamidation is not an artefactual or preparation-based modification, but one with potential biological relevance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5.">Sex differences in MLC proteoforms</head><p>Next, we sought to investigate whether sex differences could be correlated to the variability in MLC proteoform patterns given the sexbalanced cohort of non-failing donor hearts (Fig. <ref type="figure">3</ref>, Table <ref type="table">S4</ref>). Strikingly, the relative phosphorylation of both MLC-2v and MLC-2a was consistently lower in the female donor hearts as compared to male donor hearts. In LV, the relative phosphorylation (P total ) of MLC-2v was 0.20 &#177; 0.03 in male donor hearts vs. 0.10 &#177; 0.01 in female donor hearts, corresponding to a two-fold change (Fig. <ref type="figure">3A</ref>) with statistical significance (p = 8.8E-03, two-tailed Student t-test; Table <ref type="table">S4</ref>. Similarly, MLC-2v relative phosphorylation (P total ) measured in RV was 0.17 &#177; 0.03 in male donor hearts vs 0.10 &#177; 0.01 in female donors, corresponding to 1.7-fold change (p = 0.046, Fig. <ref type="figure">3B</ref>). In LA, MLC-2v relative phosphorylation (P total ) increased by 2.2-fold in males (0.26 &#177; 0.05) compared to females (0.12 &#177; 0.02 , p = 0.036, Fig. <ref type="figure">3C</ref>). In RA tissue, MLC-2v relative phosphorylation (P total ) was 0.21 &#177; 0.05 in males vs 0.12 &#177; 0.05 in females, with 1.7-fold change but without statistical significance (p = 0.16, Fig. <ref type="figure">3D</ref>). Representative deconvoluted mass spectra between agematched males and females demonstrate higher relative abundance of pMLC-2v in male LV tissues (Fig. <ref type="figure">3E</ref>) and male LA tissues (Fig. <ref type="figure">3F</ref>).</p><p>The relative phosphorylation (P total ) of MLC-2a in LA is 0.34 &#177; 0.05 in males and 0.16 &#177; 0.02 in females, corresponding to a 2.1-fold change in the male LA tissues (p = 0.0098, Fig. <ref type="figure">3G</ref>). MLC-2a relative phosphorylation (P total ) in RA increased by 1.8-fold in male donor hearts (0.28 &#177; 0.05) compared to female donor hearts (0.16 &#177; 0.02, p = 0.047, Fig. <ref type="figure">3H</ref>). Deconvoluted mass spectra between age-matched males and females reflects higher relative abundance of pMLC-2a and ppMLC-2a in LA tissues from male donor hearts compared to female donor hearts (Fig. <ref type="figure">3I</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion</head><p>In this study, we characterized the spatial distribution of MLC isoforms across ventricular and atrial tissue of adult non-failing human hearts. To identify and quantify the MLC isoforms in different chambers from human cardiac tissue, we used top-down MS-based proteomics to enable a "bird's eye view" of the intact MLCs, revealing unique patterns of isoform expression and PTMs across the four chambers of the human heart.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">MLC-1v, but not MLC-2v, is ventricle-specific in human hearts</head><p>In the mature heart, MLC atrial and ventricular isoforms are widely considered to be chamber-specific <ref type="bibr">[6,</ref><ref type="bibr">7]</ref>, but in this study, top-down MSbased proteomics uncovers unambiguous evidence of expression of MLC-2v in human atrial tissues. Fig. <ref type="figure">1</ref> shows the detection and  <ref type="table">S2</ref>.</p><p>quantification of MLC isoforms across LV, RV, LA, and RA. We found MLC-1v but not MLC-2v exhibits ventricular-restricted expression. Peaks matching molecular masses of MLC-2v and pMLC-2v were identified in atrial tissue. Strikingly, MLC-2v was detected in the atria of every heart from all 17 donors with no cardiac diseases. The %MLC-2v of total MLC-2 content was quantitated and deemed statistically significant by oneway ANOVA, with significant differences between ventricular and atrial samples and a lateral difference between LA and RA.</p><p>An analysis of myofilament isoform and proteoform expression across the four cardiac chambers of 3 month old Yorkshire domestic pigs was previously performed by our lab <ref type="bibr">[24]</ref>. This previous study revealed chamber-specific expression of the atrial and ventricular isoforms of MLC-1 and -2. Importantly, we did not observe expression of MLC-2v in atrial tissue from pig as we did in the human tissue analyzed in this study. This underscores the critical importance of direct interrogation of human samples despite the high similarity between large animals and human hearts <ref type="bibr">[25]</ref>.</p><p>MLC-1a and -1v have well-established functional differences in both actin-myosin tethering and cross-bridge cycling kinetics <ref type="bibr">[3]</ref>. Ventricular expression of MLC-1a has been found under states of stress such as cardiomyopathy <ref type="bibr">[46]</ref>. It is possible MLC-2v functions as a similar adaptive isoform switch during a state of cardiac stress similarly to MLC-1. MLC-2v has been detected in atrial tissue in previous studies of pressure overload in both humans <ref type="bibr">[47]</ref> and rats <ref type="bibr">[48]</ref>. Interestingly, MLC-2v was also found in atria of adult spontaneously hypertensive rats without induced pressure overload, suggesting the expression of the ventricular isoform in atrial tissue was the result of a predetermined phenotype rather than a direct response to pressure overload <ref type="bibr">[48]</ref>. Functionally, ventricular and atrial isoforms of MLC-2 have been shown to affect the morphogenesis and cross bridge cycling of the myocardium <ref type="bibr">[6]</ref>. In transgenic mice, replacement of MLC-2a by -2v in atria has shown enhanced contractile generation and calcium sensitivity similar to the ventricular cardiomyocyte controls <ref type="bibr">[49,</ref><ref type="bibr">50]</ref>. A nuanced assessment of specific isoform influence on either atrial or ventricular function, and the regulation of their differential expression in healthy and diseased states, remains to be explored.</p><p>Previously, bottom-up shotgun proteomics was employed to provide a region and cell-type resolved quantitative landscape of the human heart proteome by Doll et al. <ref type="bibr">[51]</ref> MLC-2v (MLY2) and MLC-1v (MLY3) were recovered in this bottom-up study and still considered as a known markers for ventricles <ref type="bibr">[51]</ref>. Note that missing values were imputed on the basis of normal distribution in this report which may affect the quantitation results. In a more recent bottom-up paper by Linscheid et al, cardiac tissues from each chamber of human, pig, and other model organisms were analyzed <ref type="bibr">[25]</ref>. Similarly in this study, values were imputed with standard settings in Perseus from a left-shifted normal distribution. Overall, the stochastic nature of data collection in shotgun proteomics experiments leads to significant missingness in the data, leading to implementation of post-processing procedures such as imputation <ref type="bibr">[52,</ref><ref type="bibr">53]</ref>. Since "bottom-up" proteomics analyzes peptides resulted from digestion, inferring measured peptides back to protein content in the original sample <ref type="bibr">[34]</ref>. As a result, there are intrinsic limitations in the protein quantitation in the bottom-up proteomics due to this protein inference problem <ref type="bibr">[27,</ref><ref type="bibr">54]</ref>. Important biological discoveries may be missed by assuming that all peptides from a protein-coding sequence are representative of the same protein <ref type="bibr">[33]</ref>. In contrast, topdown proteomics analyzes proteins directly without digestion thus could provide more accurate identification and quantitation of isoforms together with PTMs <ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref><ref type="bibr">[30]</ref><ref type="bibr">[31]</ref><ref type="bibr">34]</ref>. Here this study provides unambiguous identification and quantitation of all MLC isoforms across the four chambers of non-failing human donor hearts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Identification of atrial-specific MLC-2v deamidation on Asn13</head><p>We detected a small mass difference of &lt;1 Da in MLC-2v in the atrial tissues compared to the ventricles in each human heart analyzed. Fig. <ref type="figure">2</ref> details the localization and identification of this PTM using a combination of fragmentation methods (CID and ETD). Deamidation of MLC-2v from non-failing human cardiac tissue has been previously reported by analysis of enriched tryptic peptides by tandem MS. <ref type="bibr">[55]</ref> For the first time, we have localized deamidation to the Asn13 residue in human MLC-2v using a combination of ETD and CID. This site is consistent with a previous assignment of Asn13 as the site of deamidation of MLC-2v in rabbit myocardium <ref type="bibr">[56]</ref> and in non-human primate skeletal muscle (MLC-2 s isoform) <ref type="bibr">[57]</ref>.</p><p>The site of deamidation is adjacent to the regulatory phosphorylation site Ser14. The additional negative charge imparted by deamidation <ref type="bibr">[58]</ref> adjacent to the site of phosphorylation mimics the pattern of bisphosphorylation of MLC-2a at sites Ser22/23 (Fig. <ref type="figure">2C</ref>). Atrial-specific deamidation of MLC-2v may have functional implications relating to cardiac force generation in the atrial chambers. Phosphorylation induces a conformational change in MLC-2 <ref type="bibr">[55]</ref>, and the addition of a second negative charge may serve to heighten calcium sensitivity and induce further conformational change to the protein. Further investigation to determine the functional consequences of atrial-specific deamidation in human MLC-2v may help elucidate the role of partial replacement from MLC-2a to MLC-2v in the atria.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Sex-differences in the phosphorylation of MLC-2</head><p>Given our sex-balanced donor cohort (9 females and 8 males), we investigated whether sex differences could be correlated to changes in MLC expression (Fig. <ref type="figure">3</ref>). Notably, we found the relative phosphorylation (P total ) of both MLC-2 atrial and ventricular isoforms was consistently higher in male donor hearts compared to female donor hearts. MLC-2 is primarily phosphorylated by a cardiac-specific Ca 2+ /calmodulin-activated myosin light chain kinase (MLCK) and de-phosphorylated by myosin light chain phosphatases <ref type="bibr">[58]</ref><ref type="bibr">[59]</ref><ref type="bibr">[60]</ref>. Functionally, phosphorylation of MLC-2 increases the calcium sensitivity of the sarcomere through modulating cross-bridge cycling kinetics and steric interaction with myosin binding protein C (MyBP-C) <ref type="bibr">[58,</ref><ref type="bibr">61]</ref>. A previous study in female C57BL/6 J mice suggested female sex hormones may influence myofilament calcium sensitivity and phosphorylation of myofilament proteins including MyBP-C, resulting in overall decreased calcium sensitivity at certain times in the estrous cycle <ref type="bibr">[62]</ref>. Additionally, a study in adult Sprague-Dawley rats reported a 32% decrease (p = 0.011) in MLC-2v phosphorylation in females by 2D gel electrophoresis <ref type="bibr">[63]</ref>. Overall, the findings from this study provide the direct evidence of sex-based difference in cardiac myofilament PTMs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.">Implications for 'chamber-specific' in vitro hPSC-CM models</head><p>As maturation strategies of cultured human pluripotent stem cells (hPSC-CMs) advance <ref type="bibr">[64,</ref><ref type="bibr">65]</ref>, an understanding of sarcomeric isoform and proteoform distribution in healthy adult human cardiac tissue is a useful tool to benchmark against hPSC-CM matured phenotypes in vitro. While hPSC-CMs remain phenotypically immature, the present results detailing chamber-specific MLC isoform expression can inform efforts to faithfully recapitulate endogenous adult CMs, including atrial-and ventricular-specific populations. For example, atrial-exclusive populations of hPSC-CMs have been used to model the pathophysiology of atrial fibrillation and its effects on contractile function <ref type="bibr">[66,</ref><ref type="bibr">67]</ref>. While the absence of MLC-2v expression is often used to verify atrial-like hPSC-CMs <ref type="bibr">[66]</ref>, the present results indicate MLC-2v can be found alongside MLC-2a in human adult atrial tissue in the majority our human cohort (35 of 36 atrial tissues). For applications seeking to closely resemble adult atrial cardiomyocyte physiology, the presence, or absence, of MLC-2v may be an important consideration since there are known isoformspecific effects on contractile performance form both animal and human disease models <ref type="bibr">[49,</ref><ref type="bibr">68]</ref>. MLC-2v is also a commonly-used marker of ventricular specification in hPSC-CM cultures, with the assumption that its presence is indicative of successful ventricular cardiomyocyte differentiation <ref type="bibr">[10,</ref><ref type="bibr">69]</ref>, but the present study identifies MLC-2v in adult atrial cardiomyocytes suggesting caution in using this protein marker alone to identify ventricular cardiomyocytes. In contrast, MLC-1a and MLC1-v have been largely overlooked in the chamber-specific characterization of hPSC-CMs, but the data presented here suggest that in adult heart, these proteins provide a highly specific chamber expression pattern. Nevertheless, robust characterization of hPSC-CM chamber specificity based on MLC expression requires consideration of the changes in isoform expression during development <ref type="bibr">[11]</ref>, and the present study provides only a snapshot of adult heart expression patterns.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion</head><p>In this study, we characterized the chamber-specific distribution of MLC isoforms in adult non-failing human hearts (n = 17) using top-down MS-based proteomics. MLC atrial and ventricular isoforms are widely considered to be chamber-specific <ref type="bibr">[6,</ref><ref type="bibr">7]</ref>, but top-down MS-based proteomics has uncovered evidence of expression of MLC-2v in human atrial tissues. Top-down proteomics enables an unbiased analysis of isoform distribution in the human heart by directly analyzing intact proteins to reveal PTMs. Here, we found MLC-1v but not MLC-2v exhibits ventricular-restricted expression. We confirmed the protein sequence of MLC-2v in the atria using high resolution top-down MS/MS and detected a deamidated species of MLC-2v exclusive to atrial tissue. The modification was localized to the N13 residue in MLC-2v. Such "bird's eye view" of the proteome by top-down proteomics can unveil</p></div></body>
		</text>
</TEI>
