<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>In Situ Spatiotemporal SERS Measurements and Multivariate Analysis of Virally Infected Bacterial Biofilms Using Nanolaminated Plasmonic Crystals</title></titleStmt>
			<publicationStmt>
				<publisher>ACS</publisher>
				<date>03/24/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10475703</idno>
					<idno type="doi">10.1021/acssensors.2c02412</idno>
					<title level='j'>ACS Sensors</title>
<idno>2379-3694</idno>
<biblScope unit="volume">8</biblScope>
<biblScope unit="issue">3</biblScope>					

					<author>Aditya Garg</author><author>Wonil Nam</author><author>Wei Wang</author><author>Peter Vikesland</author><author>Wei Zhou</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[In situ spatiotemporal biochemical characterization of the activity of living multicellular biofilms under external stimuli remains a significant challenge. Surface-enhanced Raman spectroscopy (SERS), combining the molecular fingerprint specificity of vibrational spectroscopy with the hotspot sensitivity of plasmonic nanostructures, has emerged as a promising noninvasive bioanalysis technique for living systems. However, most SERS devices do not allow reliable long-term spatiotemporal SERS measurements of multicellular systems because of challenges in producing spatially uniform and mechanically stable SERS hotspot arrays to interface with large cellular networks. Furthermore, very few studies have been conducted for multivariable analysis of spatiotemporal SERS datasets to extract spatially and temporally correlated biological information from multicellular systems. Here, we demonstrate in situ label-free spatiotemporal SERS measurements and multivariate analysis of Pseudomonas syringae biofilms during development and upon infection by bacteriophage virus Phi6 by employing nanolaminate plasmonic crystal SERS devices to interface mechanically stable, uniform, and spatially dense hotspot arrays with the P. syringae biofilms. We exploited unsupervised multivariate machine learning methods, including principal component analysis (PCA) and hierarchical cluster analysis (HCA), to resolve the spatiotemporal evolution and Phi6 dose-dependent changes of major Raman peaks originating from biochemical components in P. syringae biofilms, including cellular components, extracellular polymeric substances (EPS), metabolite molecules, and cell lysate-enriched extracellular media. We then employed supervised multivariate analysis using linear discriminant analysis (LDA) for the multiclass classification of Phi6 dose-dependent biofilm responses, demonstrating the potential for viral infection diagnosis. We envision extending the in situ spatiotemporal SERS method to monitor dynamic, heterogeneous interactions between viruses and bacterial networks for applications such as phage-based anti-biofilm therapy development and continuous pathogenic virus detection.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#9632; INTRODUCTION</head><p>Biofilms consist of microbial communities embedded in an extracellular polymeric substance (EPS) matrix <ref type="bibr">1,</ref><ref type="bibr">2</ref> and feature spatially and temporally heterogeneous structures and compositions. <ref type="bibr">2</ref> The introduction of external stimuli (e.g., antibiotics, viruses, and environmental changes in temperature, pH, or nutrients) can affect biofilm evolution by altering bacterial growth dynamics, <ref type="bibr">3</ref> metabolic activity, <ref type="bibr">4</ref> and cell-to-cell quorum sensing. <ref type="bibr">5</ref> In situ spatiotemporal biochemical monitoring of biofilm activity remains an unmet challenge, causing the lack of a holistic system-level understanding of biofilm activity in response to external stimuli and impeding the development of anti-biofilm treatment methods (e.g., antibiotic and phage therapy). <ref type="bibr">6</ref> Furthermore, biofilms can potentially serve as living biosensor elements for detecting bioenvironmental stressors (e.g., viruses and metals) by reporting changes in spatiotemporal biofilm activity following stressor introduction. For example, since bacteriophage viruses can alter host bacterial metabolism in favor of viral replication, it is possible to develop continuous virus detection methods by continuously monitoring bacterial metabolism changes in response to viral infection. <ref type="bibr">4,</ref><ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref> Standard bioanalysis methods, including label-free mass spectrometry (MS), <ref type="bibr">4,</ref><ref type="bibr">8</ref> nuclear magnetic resonance (NMR) spectroscopy, <ref type="bibr">9</ref> and label-based fluorescence microscopy, <ref type="bibr">10</ref> have been exploited to study bacterial responses to phage infection or antibiotic treatment. However, these standard bioanalysis approaches cannot provide spatiotemporal bioanalysis of living biosystems due to their invasive nature. Recently, label-free surface-enhanced Raman spectroscopy (SERS), combining the molecular fingerprint specificity of vibrational spectroscopy with the hotspot sensitivity of plasmonic nanostructures, has emerged as an ultrasensitive, noninvasive molecular detection approach for in situ biochemical analysis of living systems. <ref type="bibr">11,</ref><ref type="bibr">12</ref> Compared to standard Raman spectroscopy, SERS can minimize invasiveness and improve the throughput and reliability of in situ spatiotemporal measurements of living biosystems using a low laser power with a short acquisition time. <ref type="bibr">13</ref> However, prior in situ SERS microbial studies have primarily focused on pathogen identification and antibiotic susceptibility testing using isolated individual bacterial cells. <ref type="bibr">14,</ref><ref type="bibr">15</ref> Despite the utility of monitoring average population responses, SERS measurements of isolated individual bacterial cells cannot resolve dynamic heterogeneous bacterial activities in complex biofilm networks. Among the limited number of in situ SERS biofilm studies published to date, the majority have relied upon the incubation of biofilms with discrete colloidal plasmonic nanoparticle suspensions to detect molecular signatures (e.g., DNA, proteins, lipids, and carbohydrates) at the biofilm surface. <ref type="bibr">13,</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref> Unfortunately, in situ SERS measurements with discrete colloidal plasmonic nanoparticles face inherent challenges in allowing spatiotemporal monitoring of biofilms due to uncontrolled nanoparticle distribution, aggregation, and diffusion. In contrast, substrate-bound SERS devices can support dense and mechanically stable plasmonic hotspot arrays, <ref type="bibr">12,</ref><ref type="bibr">19,</ref><ref type="bibr">20</ref> allowing reliable in situ spatiotemporal SERS measurements of multicellular systems. Recent studies have interfaced bacterial biofilms with substrate-bound SERS devices covered with a nanoporous layer (e.g., mesoporous silica) to conduct 2D SERS measurements of the spatial distribution of quorum-sensing molecules (e.g., pyocyanin) secreted by bacterial cells in biofilms. <ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">[24]</ref> Notably, the nanoporous layer on the SERS substrates can serve as a filter to block large biomolecules (e.g., proteins) from accessing plasmonic hotspots and thus allow higher signal-to-noise detection of small quorum-sensing molecules. However, such nanoporous layer-covered SERS devices inhibit holistic SERS profiling of various molecules associated with biofilm activity. Finally, substrate-bound SERS devices can integrate with microfluidic systems to monitor biofilm effluent media <ref type="bibr">25</ref> or precursor materials 26 associated with biofilm responses, but such methods still have limitations in resolving spatiotemporal biofilm activities.</p><p>Due to challenges in low-cost nanofabrication of large-area, high-performance SERS devices with good biocompatibility for interfacing with living cellular networks, there has been little research on in situ spatiotemporal label-free SERS measurements and multivariate analysis of holistic molecular profiles associated with living multicellular systems. Since label-free SERS spectra measured from living biological systems consist of highly overlapped vibrational features from various biomolecules in plasmonic hotspots, we need to employ unsupervised or supervised machine learning methods to conduct multivariate analysis of high-dimensional SERS datasets for extracting biologically meaningful information. <ref type="bibr">27</ref> Among unsupervised learning methods, principal component analysis (PCA) and hierarchical cluster analysis (HCA) are popular tools to extract interclass and intraclass relationships from intrinsic spectral features of sample groups (e.g., growth stages, cell types, disease/drug states) and reduce data dimensionality. <ref type="bibr">28</ref> Supervised learning methods, such as linear discriminant analysis (LDA) and support vector machines (SVM), can exploit the label information to train models for multivariate mapping to classify complex spectral features associated with different sample groups. <ref type="bibr">29</ref> Despite advances in SERS bioanalysis of living cells, implementation of machine learning methods for multivariate analysis of spatiotemporal SERS datasets from living multicellular networks remains an underexplored topic.</p><p>Here, we employ novel Au-SiO 2 -Au nanolaminated plasmonic crystals (NLPCs) to conduct in situ spatiotemporal label-free SERS measurements and multivariate analysis of Pseudomonas syringae biofilms during biofilm development and upon infection by bacteriophage virus Phi6 at different dosages. Significantly, we have combined multiple unsupervised and supervised machine learning methods to extract spatially and temporally correlated biological information from multivariable spatiotemporal SERS datasets measured from living biofilms. The lytic bacteriophage Phi6 is a recognized surrogate for enveloped infectious viruses such as SARS-CoV-2 and influenza virus, <ref type="bibr">30,</ref><ref type="bibr">31</ref> and its host bacterium P. syringae is a representative human-safe biofilm-forming bacterium. <ref type="bibr">32,</ref><ref type="bibr">33</ref> Accordingly, the P. syringae-Phi6 pair is a suitable humansafe model biosystem for our proof-of-concept studies. Our results are readily translatable to other biofilm-virus interactions for applications, including anti-biofilm phage therapy development and biofilm-enabled pathogenic virus detection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#9632; RESULTS AND DISCUSSION</head><p>Overall Experimental Workflow of Sample Preparation, Spatiotemporal SERS Measurements, and Multivariate Analysis. Figure <ref type="figure">1</ref> summarizes our approach in spatiotemporal SERS measurements and multivariate analysis of the P. syringae biofilm growth processes and the biofilm viral infection responses: (A) P. syringae biofilm development and Phi6 infection on the NLPCs, (B) in situ spatiotemporal SERS measurements of Phi6 infected/uninfected P. syringae biofilms, and (C) multivariate analysis by unsupervised and supervised machine learning methods. Our NLPC-based SERS samples consist of chemically inert and biocompatible Au-SiO 2 -Au multi-nanogap nanocavities to support mechanically stable, optically dense, highly uniform plasmonic hotspot arrays with SERS enhancement factors (EFs) &gt; 10 7 [Figure <ref type="figure">1A(i)</ref>]. 11 P. syringae bacterial culture suspended in fresh lysogeny broth (LB) medium was pipetted onto the NLPCs to allow cell adherence to the SERS substrate. Following the removal of planktonic bacterial cells and the addition of fresh LB broth, bacterial biofilms developed on the NLPCs under external stimuli by different dosages of Phi6 bacteriophage (0, 10 6 , and 10 8 PFU/mL) [Figure <ref type="figure">1A</ref>(ii-iv)]. Biofilm evolution consists of numerous spatiotemporally coupled biochemical processes (e.g., metabolic activity, EPS formation, quorum sensing). Bacteriophages can infect and replicate within the host bacteria, thus altering the local spatiotemporal biochemistry. <ref type="bibr">4,</ref><ref type="bibr">8</ref> Spatiotemporally resolved label-free SERS measurements were acquired to monitor dynamic heterogeneous biofilm activities with and without viral infection (Figure <ref type="figure">1B</ref>). The spatiotemporal SERS dataset consists of 400 data points measured over the same 20 &#215; 20 &#956;m region at three time-points (t i = 5, 10, and 20 h) for three different Phi6 dosages (c j = 0, 10 6 , and 10 8 PFU/mL). To analyze and classify the complex multivariable SERS data consisting of 3600 data points from three different times and three Phi6 dosages, we employed unsupervised (HCA and PCA) and supervised (PCA-LDA) machine learning methods (Figure <ref type="figure">1C</ref>). Unsupervised HCA was employed to cluster the intra-class spatial data with Raman mode analysis [Figure <ref type="figure">1C(i)</ref>]. Unsupervised PCA was used to analyze the inter-class data (t i and c j ) by identifying key spectral features responsible for inter-class variations. Furthermore, we exploited PCA as a data reduction tool for subsequent supervised machine learning analysis [Figure <ref type="figure">1C</ref>(ii)]. For the supervised machine learning analysis, PCA-LDA was used to classify SERS spectra associated with the growth and phage dosage-dependent response of the P. syringae biofilms [Figure <ref type="figure">1C(iii)]</ref>.</p><p>Spatiotemporal SERS and Multivariate Spatial Data Analysis of the Biofilm Development Process. P. syringae biofilms were developed on the NLPCs, and a Raman microscope with a backscattering configuration was used to capture spatiotemporally correlated label-free SERS spectra of the developing biofilm. Compared to colloidal SERS probes, our NLPCs 11 can support mechanically stable, uniform, and dense plasmonic hotspot arrays for reliable spatiotemporal SERS measurements of living cellular networks. The NLPC SERS devices support healthy biofilm development, as indicated by the increased number of surface-adhered bacteria over time (Figure <ref type="figure">S1</ref>). Figure <ref type="figure">2A</ref> shows average Raman spectra measured from a 400 &#956;m 2 region under 785 nm excitation following 5, 10, or 20 h of biofilm development, along with the corresponding bright field images. We calibrated the SERS signal intensities for each measurement using the electronic Raman scattering (ERS) internal standard. <ref type="bibr">34</ref> This calibration can eliminate the effects of spatial and temporal variations of the local field intensity at the SERS hotspots caused by the spatiotemporally heterogeneous growth of biofilms on top of the SERS substrates, enabling accurate quantitative analysis. <ref type="bibr">12,</ref><ref type="bibr">34</ref> To ensure a high signal-to-noise ratio, spectra whose maximum peak values were smaller than three times the noise level were discarded. <ref type="bibr">35</ref> Figure <ref type="figure">S2</ref> shows the noise level in the averaged SERS spectra of P. syringae biofilms at 20 h, indicating that the major SERS peaks from the P. syringae biofilm components have a higher signal intensity than the noise level. Table <ref type="table">S1</ref> summarizes the assigned origins of the observed SERS peaks. As shown in Figure <ref type="figure">2A</ref>, several peaks attributed to proteins, carbohydrates, lipids, and nucleic acids emerge between 5 and 20 h of biofilm development (Text S1 in the Supporting Information). The development of these peaks reflects biofilm evolution-induced changes originating from the production of bacterial extracellular metabolites, <ref type="bibr">24,</ref><ref type="bibr">36</ref> cell wall components, <ref type="bibr">37</ref> and the EPS matrix. <ref type="bibr">38</ref> However, the average label-free SERS spectra of P. syringae at different growth stages consist of highly overlapped and complex spectroscopic features. The highly overlapped and complex nature of the spectroscopic features can be attributed to the rich biomolecular composition of the living biological system, causing each data point to contain several spectroscopic features that arise from various biomolecules in SERS hotspots within the area illuminated by the Raman laser. Furthermore, the spatially heterogeneous distribution of biochemical components within the biofilm <ref type="bibr">39</ref> causes significant point-topoint variations in the spectral profile over the measured region, generating highly overlapped features in the averaged spectra. Therefore, it is critical to employ multivariate machine learning methods to predict the origins of the observed SERS peaks and better understand the spatiotemporal biochemical evolution of the P. syringae biofilm.</p><p>We employed HCA to resolve the time-dependent spatial SERS information hidden within the average SERS spectra (Figure <ref type="figure">2B</ref>). By quantifying similarity distances between observations, HCA groups data points with similar spectral profiles, which allows visualization of the spatial distributions of various components in the biological matrix (e.g., cell wall, EPS, extracellular metabolites, and background culture medium). At t = 5 h, the clusters obtained via HCA demonstrate similar spectral profiles, indicating the uniform distribution of several significant biochemical components (Figure <ref type="figure">S3</ref> in the Supporting Information). At t = 10 h, the HCA dendrogram reveals three clusters with a significant difference between cluster A (green) and the other two. Cluster A exhibits strong SERS peaks from proteins (526, 1240, and 1550 cm -1 ), carbohydrates (858 cm -1 ), and nucleic acids (1327 cm -1 ) originating from cell wall components intimately interfaced with the SERS hotspots on the NLPCs (Figure <ref type="figure">2B</ref>). For validating the origins of these peaks, we conducted control experiments to obtain the SERS spectra of washed P. syringae cells suspended in phosphate-buffered saline (PBS) (Figure <ref type="figure">S4</ref> in the Supporting Information). The spectrum of P. syringae cells measured immediately after washing with PBS solution also exhibited strong peaks at 526, 858, 1240, 1327, and 1550 cm -1 (Figure <ref type="figure">S4</ref> in the Supporting Information), indicating that these peaks originate from bacterial cell wall components. Additionally, we observed a peak at 930 cm -1 in cluser A (C-O stretch/C-O-C vibration) associated with alginate, <ref type="bibr">40</ref> which is a well-studied EPS matrix polysaccharide in P. syringae biofilms, <ref type="bibr">41</ref> indicating the onset of EPS polysaccharide generation near the P. syringae cells at 10 h. The HCA-resolved spectrum from cluster B (orange) shows weak SERS signals, while the spectrum from cluster C (red) shows a strong SERS peak at 732 cm -1 originating from the LB medium (Figure <ref type="figure">S5</ref> in the Supporting Information). This result indicates the local consumption of LB medium components close to the SERS hotspots by the P. syringae cells in cluster B. However, the absence of SERS signals from cell wall components indicates a less intimate nano-bio interface between the cells and the plasmonic hotspots in cluster B at 10 h.</p><p>At t = 20 h, the HCA dendrogram reveals two clusters with considerable dissimilarity. The HCA-resolved spectrum for cluster A (orange) exhibits weak SERS signals that reflect the consumption of background LB medium components. The HCA-resolved SERS spectra and spatial maps reveal that cluster B (green) exhibits a rich spectral profile, covering a large surface area, indicating expansion of the bacterial biofilm network interfaced with the SERS hotspots. Apart from the peaks at 526, 858, 1240, 1327, and 1550 cm -1 attributed to the cell wall components, we observed additional peaks originating from polysaccharides (885 and 1125 cm -1 ), <ref type="bibr">40,</ref><ref type="bibr">42</ref> proteins (620 and 822 cm -1 ), and nucleic acids (804 cm -1 ) in the cluster B dataset, which can be attributed to the EPS of P. syringae biofilms (Figure <ref type="figure">2B</ref>). <ref type="bibr">43</ref> Lastly, a strong peak at 662 cm -1 from guanine is observed in the cluster B dataset. Since the peak at 662 cm -1 significantly increases in intensity between 0 and 12 h in the control SERS spectra of washed P. syringae cells under nutrient-deprived conditions (Figure <ref type="figure">S4</ref>), the feature at 662 cm -1 likely originates from purine degradation metabolites at the SERS hotspots due to the starvation response of bacterial cells, as reported in prior studies. <ref type="bibr">44</ref> Therefore, HCA analysis of the multidimensional SERS dataset enabled us to visualize the spatiotemporally correlated evolution of biochemical compo- nents in living P. syringae biofilms, including cellular components, EPS matrix components, and metabolite molecules. To visualize the spatiotemporal biochemical changes of individual biomarkers during the biofilm growth process, we plotted 2D maps of the ERS-calibrated Raman intensities of major peaks from proteins, carbohydrates, and nucleic acids as a function of time (Figure <ref type="figure">2C</ref>). Although the HCA analysis can extract the overall trend of the spatiotemporal biochemical evolution of the P. syringae biofilm, the non-uniform and dynamically changing spatial distributions of individual Raman peaks in Figure <ref type="figure">2C</ref> reflect the heterogeneous structure and complex spatiotemporal evolution of the P. syringae biofilm.</p><p>Multivariate Analysis and Classification of the Growth Stages of P. syringae Biofilms. For the interclass temporal analysis of the multivariable SERS data from the P. syringae biofilm, we implemented PCA. As an unsupervised multivariate analysis method capable of extracting features with the highest variance, PCA is useful for extracting the inter-class spectral variations associated with different data classes (e.g., biofilm growth times and phage infection dosages) (Text S2 in Supporting Information). Figure <ref type="figure">3A</ref> illustrates the loading spectra for the first two principal components of the SERS dataset of P. syringae biofilms at different times, revealing that different loadings consist of various spectral features with different molecular origins (Text S3 in the Supporting Information).</p><p>To investigate the temporal evolution of the individual spectral features identified by the principal component (PC) loading scores, we plotted the ERS-calibrated Raman intensities of the identified Raman features as a function of time (Figure <ref type="figure">3B</ref>). First, we observe that the peak at 662 cm -1 from purine metabolites significantly increases in intensity between 10 and 20 h. Concurrently, the peak from the LB medium (732 cm -1 ) significantly decreased between 10 and 20 h, reflecting the synergy between nutrient consumption and the release of purine degradation extracellular metabolites. <ref type="bibr">36,</ref><ref type="bibr">44</ref> Concurrently, rapid bacterial growth was also observed between 10 and 20 h (Figure <ref type="figure">S6</ref> in the Supporting Information). <ref type="bibr">36</ref> Second, the peak intensity at 526 cm -1 , attributed to the cell wall components, first increases from 5 to 10 h and subsequently decreases from 10 to 20 h. Concurrently, the intensities of peaks attributed to EPS components at 822, 885, and 1125 cm -1 rapidly increase between 10 and 20 h (Figures 3B and S7 in the Supporting Information). These observations indicate that the EPS components surround the cells as the P. syringae biofilm develops between 10 and 20 h with limited access of the cell wall components to the SERS hotspots. Although the normalized intensity of individual spectral features can allow us to understand the temporal trends of various biomolecules, they are insufficient to classify SERS spectra measured from different biofilm growth stages due to the intrinsically heterogeneous nature of biochemical environments in biofilms mapped by SERS hotspot arrays.</p><p>We employed supervised PCA-LDA methods to classify the measured SERS spectra acquired during different biofilm growth stages (Text S4 and Figure <ref type="figure">S8</ref> in the Supporting Information). The first 75 PCs (95% spectral variance) were used as input variables for LDA to avoid overfitting. To use LDA, the number of input variables (75 PCs) should be at least 5 times lower than the number of spectra (1200 spectra). <ref type="bibr">45</ref> The classification results demonstrate that the PCA-LDA model can achieve an overall classification accuracy of 92.3% (Figures <ref type="figure">3C</ref> and<ref type="figure">S8B</ref>,C in the Supporting Information). We would like to point out that the LDA classification accuracy was 87.1% with the first five PCs as input variables. The next 70 PCs account for an additional 5.2% cumulative accuracy, which might originate from convoluted noise and real spectroscopic features due to the weak signal intensity of some SERS features. In sum, we can exploit unsupervised HCA and PCA methods to extract the spatial and temporal SERS information related to different biochemical components in biofilms and employ supervised PCA-LDA methods to classify SERS spectra associated with different biofilm growth stages. Compared to previous studies focused on label-free monitoring of either the spatial distribution or temporal evolution of biofilms, <ref type="bibr">17,</ref><ref type="bibr">38</ref> this work is among the first to demonstrate spatiotemporal SERS measurements and multivariate analysis of the dynamic heterogeneous evolution of various molecules in living biofilms, including cellular components, EPS matrix components (e.g., polysaccharide alginate, proteins, and extracellular DNA), and metabolite molecules. Spatiotemporal SERS analysis of living biofilms can potentially aid the development of antibiofilm therapy methods by improving our under- standing of the system-level response of virulent biofilms to therapeutic agents (e.g., bacteriophage viruses) such as different survival mechanisms (e.g., cell to cell quorum sensing, <ref type="bibr">46</ref> polymicrobial synergy, 47 metabolic aspects <ref type="bibr">48</ref> ) and their interactions with each other. <ref type="bibr">49</ref> Spatiotemporal SERS Profiling and Multivariate Spatial Data Analysis of Virally Infected Biofilms. As shown in Figure <ref type="figure">4A</ref>, Phi6 can infect and replicate within P. syringae cells and thereby alter the spatiotemporal biochemical information at the SERS hotspots due to a range of mechanisms such as the disruption of biofilm integrity by the lytic action of Phi6, <ref type="bibr">50</ref> the buildup of molecular components from lysed cells in the extracellular environment, <ref type="bibr">8</ref> and the alteration of host bacterial metabolism in favor of viral replication. <ref type="bibr">4,</ref><ref type="bibr">8</ref> We performed in situ spatiotemporal SERS measurements of biofilm development under Phi6 viral infection at dosages of 0, 10 6 , and 10 8 PFU/mL to study dose effects on P. syringae biofilm activity. As expected, Phi6 infection can inhibit P. syringae growth due to the lytic action of Phi6 (Figure <ref type="figure">S6</ref> in the Supporting Information). Figure <ref type="figure">4B</ref> shows the ERS-normalized average SERS spectra measured at 5, 10, and 20 h over a 400 &#956;m 2 region of the virally infected (10 6 and 10 8 PFU/mL Phi6) and uninfected living biofilms on the NLPCs. At t = 5 h, the observed SERS spectra of virally infected samples exhibit an additional peak at 1085 cm -1 , attributed to membrane phospholipids. <ref type="bibr">51</ref> At t = 10 h, peaks at 960 and 1187 cm -1 , attributed to nucleic acids, are observed in the SERS spectra of samples infected with 10 8 PFU/mL Phi6. At t = 20 h, several peaks originating from proteins at 750, 1360, and 1621 cm -1 (tryptophan) and 1652 cm -1 (amide I) emerge in the SERS spectra of samples infected with 10 8 PFU/ mL Phi6, while the peaks from phenylalanine (1001 and 1595 cm -1 ) significantly increase in intensity. Since a weak SERS signature was observed during the direct label-free detection of Phi6 (Figure <ref type="figure">S9</ref> in the Supporting Information), these peaks can originate from cellular components in the extracellular environment released from lysed P. syringae cells due to the lytic action of Phi6. Previous studies employing MS have similar observations that accumulated cellular materials from lysed bacterial cells in phage-infected supernatants can increase the concentrations of specific amino acids, lipids, and nucleic acids. <ref type="bibr">8,</ref><ref type="bibr">52</ref> The improved SERS signal intensity of lysed cell components compared to intact bacterial cells can be attributed to the improved accessibility of the smaller lysed cell components in the extracellular medium to the nanoscale SERS hotspots.</p><p>Since intact bacterial cells in virally infected samples can remain active and consume molecular components from lysed bacterial cells for promoting viral replication, the detected molecules in the SERS spectra of the virally infected samples at different time points reflect the virus-specific alteration of the bacterial metabolism. <ref type="bibr">4,</ref><ref type="bibr">8</ref> Since Phi6 consists of 62% protein, <ref type="bibr">53</ref> we believe that the absence of the protein peaks after 5 and 10 h of infection can be attributed to the active P. syringae cells' preferential consumption of amino acids from lysed cells to promote Phi6 propagation, thus limiting their presence in the SERS hotspot-probed extracellular environment. As the ratio of the active P. syringae cells to Phi6 particles decreases with time due to rapid Phi6 replication and the lytic action of Phi6, amino acids can likely buildup in the extracellular environment at later times, as indicated by the emergence of amino acidrelated SERS peaks at 20 h. In comparison, the phospholipid and nucleic acid components from lysed cells begin to build up in the SERS hotspot-probed extracellular environment within 5 and 10 h, respectively. Previous studies using MS <ref type="bibr">8</ref> and NMR <ref type="bibr">9</ref> have also demonstrated the preferential consumption of amino acids over lipid components by phage-infected bacteria from the cell lysate-enriched extracellular environment at early stages of infection for promoting the propagation of proteinrich bacteriophage viruses.</p><p>To visualize the embedded spatial information, we plotted the spatial maps of the HCA-resolved spectra and the ERScalibrated Raman intensities of major peaks from infected and uninfected biofilms at 20 h (Text S5, Figures <ref type="figure">S10</ref> and<ref type="figure">S11</ref>). The uninfected biofilms demonstrate spatially correlated spectroscopic signatures corresponding to biofilm components, indicating the development of P. syringae biofilms intimately interfaced with the SERS hotspots of the NLPCs (Figures <ref type="figure">2B,</ref><ref type="figure">C</ref> and S11A, Text S5 in the Supporting Information). In contrast, the infected biofilms demonstrate the non-uniform and spatially uncorrelated distribution of biomolecules from lysed cell components indicating the random adsorptiondesorption at plasmonic nanogap cavities for different molecules from lysed cell components in the extracellular environments (Figure <ref type="figure">S11</ref> and Text S5 in the Supporting Information). To validate our assumptions about the spatiotemporal evolution of the phage-infected and uninfected P. syringae biofilms on the NLPCs (Figure <ref type="figure">S12A</ref> in SI) and planktonic P. syringae cells in the supernatant (Figure <ref type="figure">S12B</ref> in the Supporting Information), we performed fluorescence imaging using SYBR Green 1 nucleic acid stain (Text S6 in Supporting Information). These results further support that the SERS signals in uninfected samples predominantly originate from biofilm components, while the signals in phage-infected samples primarily originate from lysed cell components in the extracellular environment. We want to point out that compared to the invasive fluorescence imaging method, which provides the single-time point spatial distribution information using label-based biomarkers, our spatiotemporal SERS measurements with NLPCs can achieve To analyze the phage-dosage-dependent responses of the P. syringae biofilm from the multivariate SERS dataset, we first implemented PCA to identify critical spectral features. Figure <ref type="figure">5A</ref> illustrates the loading spectra for the first two principal components from the SERS dataset of the uninfected and infected samples (10 6 and 10 8 PFU/mL of Phi6) at 20 h. The PC1-loading spectrum primarily consists of the peaks observed in the uninfected samples at 822, 885, and 1125 cm -1 attributed to EPS components, while the PC2-loading spectrum primarily consists of peaks observed in the infected samples at 750, 1001, 1085, 1187, and 1595 cm -1 attributed to lysed cell components.</p><p>To quantitatively analyze the temporal biochemical changes of individual biomarkers identified by the PC loading scores in infected and uninfected samples, the ERS-calibrated Raman intensities of the identified features were plotted as a function of time (Figure <ref type="figure">5B</ref>). First, we observe that the intensities for peaks attributed to the biofilm EPS polysaccharides at 885 cm -1 and EPS proteins at 822 cm -1 increase with time from 5 to 20 h in the uninfected biofilms, while these peaks do not increase in intensity with time in the infected biofilms, suggesting disruption of biofilm formation due to bacterial cell lysis. <ref type="bibr">54</ref> Second, we observe that the peaks attributed to cellular components released from lysed cells exhibit varying temporal trends in the infected and uninfected samples. The peak intensity of the lipid peak (1085 cm -1 ) is consistently higher in the infected samples compared to the uninfected samples between 5 and 20 h. The peak intensity of the nucleic acid peak (1187 cm -1 ) gradually increases between 5 and 20 h in samples infected with 10 8 PFU/mL, and the peak intensity of the protein peaks (750 and 1001 cm -1 ) rapidly increases between 10 and 20 h in samples infected with 10 8 PFU/mL. As mentioned earlier, the unique temporal trends of different macromolecules in the biological matrix of virally infected samples can reflect the virus-specific alteration of bacterial metabolism. <ref type="bibr">4,</ref><ref type="bibr">8</ref> By extending our model to other virus-cell interactions, our methodology can potentially enable the continuous diagnosis of pathogenic viruses.</p><p>Although the average intensity of individual spectral features can allow us to understand the temporal trend of various biomolecules, they are insufficient to classify SERS spectra measured from virally infected and uninfected biofilm samples for viral diagnosis due to the intrinsically heterogeneous nature of the biochemical environments in biofilms mapped by SERS hotspot arrays. Therefore, we employed PCA-LDA to statistically classify the virus-infected and uninfected biofilms at different growth stages. In Figure <ref type="figure">5C</ref>, the PC score plot with scatters from uninfected, low-dose infected, and high-dose infected biofilms after 5 h of development shows the extensive overlap between the low-dose infected and high-dose infected biofilms. A similar overlap pattern can be observed in the PC score plot after 10 h of biofilm development (Figure <ref type="figure">5D</ref>). In contrast, at 20 h, the PC score plot shows a clear separation between the three cases (Figure <ref type="figure">5E</ref>), manifesting significant differences in the spectra at later stages of biofilm development. Such behaviors are likely due to the viral-dosedependent accumulation of lysed cell components in the extracellular environment of the infected biofilm and the development of a robust biofilm EPS in the uninfected biofilm.</p><p>PCA was used as a data reduction tool to extract significant variables from the data set, and subsequently, LDA was implemented to classify the multivariable dataset at t = 5, 10, and 20 h using the first 75 PC components as input variables. As shown in Figure <ref type="figure">5F</ref>-H, the PCA-LDA classification model can effectively segregate the uninfected, low-dose infected, and high-dose infected biofilms at t = 5, 10, and 20 h. The PCA-LDA classification results obtained using the leave-one-outcross-validation method indicate that overall accuracies of 91.7, 93.5, and 99.8% can be achieved at t = 5, 10, and 20 h, respectively, validating the potential for viral infection diagnosis (Figure <ref type="figure">S13</ref> in the Supporting Information).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#9632; CONCLUSIONS</head><p>In conclusion, we have demonstrated that NLPCs, consisting of dense and uniformly distributed SERS hotspots, can enable reliable in situ spatiotemporal SERS measurements of living P. syringae biofilms to monitor the spatiotemporal evolution of the network-level microbial activity during the biofilm growth process. Using multivariate machine learning methods such as HCA and PCA, we have extracted spatially and temporally correlated biological information embedded in the multivariable spatiotemporal SERS datasets, demonstrating the spatial distribution and temporal evolution of important biomolecules related to cell wall components, EPS matrixes, and extracellular metabolites in the P. syringae biofilm matrix. Moreover, when infected by bacteriophage Phi6, the spatiotemporal biochemical changes in the biological matrix due to the phage-bacteria interaction were monitored, allowing reliable multiclass-classification analyses of the phage dosage-dependent responses of living P. syringae biofilms using supervised PCA-LDA machine learning methods. Our results indicate that the temporal dynamics of different biomolecules observed in the SERS spectra from lysed bacterial cell components, such as amino acids, nucleic acids, and lipids, can reflect the virus-specific alteration of the metabolism of host bacteria. Therefore, our developed methods can potentially extend to measure and analyze the spatiotemporal interactions between pathogenic viruses and various living multicellular systems consisting of mammalian cells or genetically engineered bacteria. <ref type="bibr">36</ref> Moreover, we envision that the ability to capture, analyze, and differentiate the dynamically evolving spatiotemporal biochemical activities of living biofilms in response to external stimuli (e.g., temperature, antibiotics, and bacteriophages) can potentially assist in the development of effective anti-biofilm treatment methods.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#9632; EXPERIMENTAL SECTION</head><p>Spatiotemporal SERS Analysis of Uninfected and Phi6-Infected P. syringae. NLPCs were attached to the bottom of sterile 60 mm Petri dishes using PU (UV curing for 3 min), and the system was sterilized with ethanol for 3 h. The prepared P. syringae suspension was washed with 10 mL of LB medium (3 times) and resuspended to a final concentration of 10 7 CFU/mL. 5 mL of the bacterial suspension was added to the system and incubated at 25 &#176;C for 12 h to allow the P. syringae cells to adhere to the surface of the NLPCs. Then, we aspirated 5 mL of the solution in the Petri dish to remove the growth media and unadhered planktonic cells. We gently rinsed the adhered bacterial cells by adding 5 mL of fresh LB medium and aspirating the rinsing medium (2 times). Then, we added 4.9 mL of fresh LB medium to the Petri dishes. Lastly, 100 &#956;L of LB medium (uninfected case) and 100 &#956;L of Phi6 suspension with appropriate concentrations (infected cases) were added to the individual systems.</p><p>The three experiments (0, 10 6 , and 10 8 PFU/mL Phi6) were performed sequentially. SERS measurements were performed via a 20&#215; objective lens with 2 mW power and 1s integration time over 20 &#956;m &#215; 20 &#956;m regions containing 400 pixels at 5, 10, and 20 h for each case. Between SERS measurements, the Petri dishes remained in the same position to obtain spatially correlated data, and the Petri dishes were covered to prevent evaporation of the solution.</p><p>Multivariate Analysis. Baseline correction, Savitzky-Golay smoothing, and cosmic ray removal were performed using Project v4.1 software. Spectra whose maximum peak values were smaller than three times the noise level were discarded. The background noise intensity was determined using recorded background signals in the spectral region at 2000 cm -1 without molecular Raman peaks. <ref type="bibr">35</ref> MATLAB was used for performing ERS calibration and data truncation. Lastly, R was used for performing HCA, PCA, and LDA. For PCA and HCA, Raman wavenumbers from 500 to 1600 cm -1 were chosen as variables. HCA was performed using Euclidian distance and Ward's minimum variance method. In the HCA dendrograms, a normalized distance threshold of 0.5 was used for assigning the clusters. Student's t-test was conducted using MATLAB, and the p-value describes the result of the t-test. p &lt; 0.01 represents a significant difference.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#9632; ASSOCIATED CONTENT</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>* s&#305; Supporting Information</head><p>The Supporting Information is available free of charge at <ref type="url">https://pubs.acs.org/doi/10.1021/acssensors.2c02412</ref>. Additional experimental details; average SERS spectra of the P. syringae biofilm with the noise level; HCA analysis at 5 h; SERS spectra of washed P. syringae cells; SERS spectra of LB medium; P. syringae growth curves; PCA-LDA analysis of biofilm growth stages; SERS spectra of Phi6; HCA dendrograms of infected and uninfected samples at 20 h; spatial analysis of Phi6 infected biofilms; fluorescence microscopy images; confusion matrix for PCA-LDA of the viral-dose dependent responses of P. syringae biofilms; and assignments and origins for the observed SERS peaks (PDF)</p><p>&#9632; AUTHOR <ref type="bibr">INFORMATION</ref> </p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>https://doi.org/10.1021/acssensors.2c02412 ACS Sens. 2023, 8, 1132-1142 Downloaded via VIRGINIA POLYTECH INST STATE UNIV on November 26, 2023 at 22:12:57 (UTC).See https://pubs.acs.org/sharingguidelines for options on how to legitimately share published articles.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1"><p>https://doi.org/10.1021/acssensors.2c02412 ACS Sens. 2023, 8, 1132-1142</p></note>
		</body>
		</text>
</TEI>
