<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Exploring the Relationship of Microstructure and Conductivity in Metal Halide Perovskites via Active Learning-Driven Automated Scanning Probe Microscopy</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>03/30/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10405072</idno>
					<idno type="doi">10.1021/acs.jpclett.3c00223</idno>
					<title level='j'>The Journal of Physical Chemistry Letters</title>
<idno>1948-7185</idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Yongtao Liu</author><author>Jonghee Yang</author><author>Rama K. Vasudevan</author><author>Kyle P. Kelley</author><author>Maxim Ziatdinov</author><author>Sergei V. Kalinin</author><author>Mahshid Ahmadi</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Electronic transport and hysteresis in metal halide perovskites (MHPs)  are key to the applications in photovoltaics, light emitting devices, and light and chemical sensors. These phenomena are strongly affected by the materials microstructure including grain boundaries, ferroic domain walls, and secondary phase inclusions. Here, we demonstrate an active machine learning framework for "driving" an automated scanning probe microscope (SPM) to discover the microstructures responsible for specific aspects of transport behavior in MHPs. In our setup, the microscope can discover the microstructural elements that maximize the onset of conduction, hysteresis, or any other characteristic that can be derived from a set of current-voltage spectra. This approach opens new opportunities for exploring the origins of materials functionality in complex materials by SPM and can be integrated with other characterization techniques either before (prior knowledge) or after (identification of locations of interest for detail studies) functional probing.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>M etal halide perovskites (MHPs) have emerged as the most promising class of materials for the next generation of photovoltaics due to their high photovoltaic efficiency and potential for low-cost large-scale manufacturing. <ref type="bibr">1</ref> In addition to single junction MHP solar cells, MHP allows for bandgap engineering via mixing halides, making MHPs suitable for tandem solar cells when stacking with Si or other thin film photovoltaics. However, the current issue for commercializing MHP solar cells is their stability and scalability. <ref type="bibr">2,</ref><ref type="bibr">3</ref> Both stability and scalability are ultimately related to the microstructure of the film and its evolution during deposition and annealing. The formation of the microstructure starts at the deposition stages and is controlled by nucleation and subsequent microstructure evolution during annealing and processing. <ref type="bibr">4</ref> The resultant microstructures are often inhomogeneous and contain grain boundaries, topological defects such as domain walls, and secondary phase inclusions. <ref type="bibr">5</ref> All these elements can affect the carrier accumulation and electron and hole transport. <ref type="bibr">3</ref> Additionally, these elements can strongly affect the ionic dynamics during operation and can be susceptible to environmental stresses, giving rise to light-, bias-and ambient gas-induced instabilities in these materials. <ref type="bibr">5,</ref><ref type="bibr">6</ref> These considerations have stimulated studies of the chemical and physical microstructure of the MHP films using a variety of microscopic imaging and chemical characterization techniques. <ref type="bibr">7,</ref><ref type="bibr">8</ref> Multimodal microscopy measurements revealed the coupled chemical, strain, and ferroic behavior, <ref type="bibr">9</ref> which affects the optoelectronic response, <ref type="bibr">10</ref> implying that it can play a role in charge carrier transport and generation and ultimately affect photovoltaic performance. It was also reported that local chemical disorder results in electronic disorder and charge carrier transport. <ref type="bibr">11</ref> Local phase impurities, depending on the thin film composition and processing method, act as trap sites, which induces structural change and photochemical degradation under illumination. <ref type="bibr">5</ref> Using time-resolved secondary ion mass spectrometry to observe real-time chemical distribution in MHP film, researchers revealed chemical redistribution and associated degradation induced by external fields including light and electric fields. <ref type="bibr">12,</ref><ref type="bibr">13</ref> Ferroic twin domains in MHPs have been found to correlate with both crystallographic difference and chemical heterogeneity. <ref type="bibr">9,</ref><ref type="bibr">14</ref> These nanoscale domains lead to photoluminescence intensity differences, <ref type="bibr">10,</ref><ref type="bibr">15</ref> which was attributed to the difference in local charge carrier concentration. On one hand, the domain walls are found to be benign to charger carrier activity; <ref type="bibr">15</ref> on the other hand, the charge carrier mobility is shown to be different along the domain wall and perpendicular to the domain wall direction. <ref type="bibr">8</ref> In addition, subgrain boundaries can act as energetic barriers to block charge carrier transports. <ref type="bibr">16</ref> Moreover, significant work has revealed that grain boundaries can serve as trap sites, recombination centers, ion migration highways, etc. and ultimately affect the efficiency and stability of MHP photovoltaics. <ref type="bibr">17</ref> This extensive variability of microstructural elements on length scales spanning nanometer scale phase separated regions and potentially larger scale defects that can emerge during industry-scale manufacturing necessitates investigation of local functionalities to build structure-property relationships. Direct information on the local transport properties can be obtained by conductive atomic force microscopy (cAFM). In this technique, the atomic force microscopy (AFM) tip plays a role of nano electrode that establishes the contact with individual regions of the surface. The measured currentvoltage (IV) curve contains information on local electronic and ionic conductivity. The applications of more complex waveforms such as up-down sweeps and first-order reversal curve (FORC) IVs additionally allow us to identify the hysteresis (typically related to ionic motion or electrochemical processes). <ref type="bibr">13,</ref><ref type="bibr">18</ref> While cAFM is affected by factors such as topography, etc., the collected information revealing local conductivity with a high spatial resolution is directly relevant to device operation.</p><p>However, measuring current-voltage characteristics everywhere across the sample surface is highly nonoptimal. Measurements over a dense grid can be time-consuming, are associated with accumulated damage to the AFM tip and surface, and are not guaranteed to sample the object of interest such as grain boundary (GB) if the spatial density of the latter is low. Previously, we have introduced a direct computer vision-based method where we use a pretrained supervised network <ref type="bibr">19,</ref><ref type="bibr">20</ref> to identify objects of interest and explore their transport properties. <ref type="bibr">21</ref> Here, we implement the solution of the opposite problem&#65533;namely, discover what region of the surface and microstructural element are responsible for a specific aspect of transport behavior.</p><p>Recently, we developed an active deep kernel learning (DKL) <ref type="bibr">22</ref> approach, which can actively learn the correlation between structure image data and functional (e.g., spectroscopy) data during an operating experiment and subsequently makes decisions on the next measurement locations based on (continuously updated) learned relationship. Here, the image includes structural features such as grain or GB, and the spectroscopy data (e.g., IV curve) encode the physical properties. We designed a workflow to implement the DKL on an operating AFM, allowing us to explore the materials in an automated manner without human intervention. In this work, we implement this approach in cAFM to explore the conductive properties in MHP thin films. We use the AFM topography that shows the grain and GB as the structure image data for DKL and perform IV measurements as the spectroscopy data. Then, the DKL actively learns the correlation between topography and IV curves. The DKL-cAFM results indicate that GB junction points show interesting phenomena, while previous work only focused on the difference between grain and GB. As a model system, a mixed formamidinium (FA) and cesium (Cs) cation (FACs) MHP (FACs-MHP) is selected for this study. This FACs-MHP is one of the state-of-the-art MHPs, which reports show excellent stability and a PCE of 22.7% for FACs-MHPs solar cells. <ref type="bibr">23,</ref><ref type="bibr">24</ref> Nevertheless, realizing a completely phase-pure FACs-MHP is challenging due to the relatively high Cs concentration (&lt;20%), causing local The Journal of Physical Chemistry Letters microstructural inhomogeneities. <ref type="bibr">23,</ref><ref type="bibr">25</ref> Thus, FACs-MHPs are the ideal platform for the microstructural explorations using DKL-cAFM.</p><p>Traditionally, cAFM is operated in an image or spectroscopy mode to study MHP thin films, as shown in Figure <ref type="figure">1</ref>. Here, the image measurement indicates the conductivity of the film under a static condition, while the spectroscopy measurement shows IV results from certain locations determined by human operators. However, for traditional cAFM, spectroscopy locations must be manually selected and triggered by human operators, the selection of measurement locations is often related to the human's interest and/or prior knowledge of the material system under study. Thus, only a relatively limited number of points can be selected, and the selected locations are based on operators' interest.</p><p>Next, we employ the recently developed deep kernel learning (DKL) method to explore the relationship between IV spectroscopy and surface morphology in an unbiased manner. In DKL analysis, the image (e.g., topography) data showing microstructure is the input and the spectroscopy (e.g., IV) showing the physical property is the output, as shown in Figure <ref type="figure">2a</ref>, the top part of Figure <ref type="figure">2a</ref> is a microscopy image data showing microstructures and the bottom part of Figure <ref type="figure">2a</ref> is a spectroscopy data showing physical properties (e.g., turn-on voltage, hysteresis factor, and maximum current). The DKL includes a neural network (NN) and a Gaussian process (GP)  Then, the workflow transfers this location to the microscope and drives the microscope to perform next IV measurement.</p><p>The above process will be repeated to actively explore the morphology-IV relationship. It should be noted that the DKL model (including both the neural net and GP) is updated as new spectroscopy data are acquired; i.e., it is in an active learning setting. We define several physical descriptors representing the property of the IV curve as target properties in the DKL-cAFM exploration. As shown in the spectroscopy IV example in Figure <ref type="figure">2a</ref>, we define several descriptors including (a) the voltage where the current starts increasing as the turn-on voltage, (b) the difference between forward and reverse IVs as the hysteresis factor, and (c) the maximum current. These descriptors are extracted from real-time IV curves and used as the target property for DKL exploration. It is well-known that light illumination changes strain disorder, ion migration activation energy, phase segregation, etc. in MHPs and consequently alters MHP properties. As such, we performed DKL-cAFM exploration in both dark and light conditions. In this work, we used a built-in LED light in the microscope to illuminate the MHP film, the light intensity is roughly equivalent to 0.1 sun.</p><p>Shown in Figure <ref type="figure">3</ref> are the results of DKL-cAFM exploration over the whole topography region. Here, topography is used as the structure image, and three physical descriptors&#65533;turn-on voltage, hysteresis factor, and maximum current&#65533;are used as target properties to guide DKL-cAFM exploration. Also, each exploration is performed under dark and light conditions. These three physical descriptors are shown in the bottom part of Figure <ref type="figure">2a</ref>, in details, turn-on voltage is the voltage where the current starts increasing, hysteresis factor is the difference between forward scan and reverse scan (the green area in Figure <ref type="figure">2a</ref>), and the maximum current is the maximum current in the IV curve. Overall, six DKL-cAFM explorations are performed. The topography image size is 128 &#215; 128, totally 16384 pixels (locations); in the DKL-cAFM measurement, we performed 200 exploration steps obtained IV from 200 locations, which is &#8764;1.2% of the total locations. In the DKL-cAFM measurement, the DKL analysis is performed on a GPU server (Nvidia DGX-2). A real-time communication between the AFM workstation and remote GPU server is built via socket programming, so that the experiment data stream is sent to the GPU server and the DKL analysis result is sent back to the AFM workstation for the subsequent measurement. After 200 measurement steps, DKL is able to offer a prediction map of the target property of the whole region and an associated uncertainty map showing the variance in the prediction at each location.</p><p>In Figure <ref type="figure">3</ref>, the filled circles in the topography image are the IV measurement locations decided by DKL, where the blue spots indicate the measurement locations at the beginning and the red spots indicate the measurement locations at the end of the measurement; i.e., the color indicates the measurement time step. A universal phenomenon we can observe from all six measurements (note that six measurements are independent) is that the measurement locations tend to concentrate around GBs regardless of target property or light condition. To the MHP community, this is not a surprising result as GBs are known to exhibit multiple functional properties, such as acting </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>The Journal of Physical Chemistry Letters</head><p>as defect centers, traps, ion migration highway, etc. This result is a good indicator that DKL exploration is based on physics. A close look at the results in Figure <ref type="figure">3</ref> leads to a discovery that the measurement locations concentrated near not only GBs but also the GB junction points (the cross point of several GBs).</p><p>To our knowledge, though previous work has largely focused on investigating the difference between grain and GBs, the GB junction points were rarely investigated. This result indicates that GB junction points potentially exhibit interesting functionality, which is a discovery by DKL active learning with human intervention. In this regard, we direct readers to our recent study 21 with a supervised machine learning method to systematically investigate GBs, in which we clarified that GB junction points are insulating under dark and conductive under light. Such a phenomenon is related to spatial chemical composition variance. In addition, in the DKL prediction, spatial variation of each target property is observed (Figure <ref type="figure">3</ref>). This variation is mostly associated with grain structure, which may relate to intrinsic defects distribution, ion migration, or local phase impurities.</p><p>Since the DKL exploration over the whole region indicates GBs to be interesting, a further exploration focusing on GBs is necessitated. A prerequirement to explore over GBs is identifying GB locations, which can be done by the ResHedNet model developed recently, which is a modified version of holistically nested edge detector <ref type="bibr">26</ref> augmented with residual connections. <ref type="bibr">20</ref> To improve the robustness of ResHedNet predictions on real-time data streams, an ensemble of ResHedNet models&#65533;each model is trained independ-ently&#65533;is used in the real-time experiment. Compared to traditional edge detectors as well as a single ResHedNet, this ensembled-ResHedNet overcomes the out-of-distribution effect in real-time experiments and hence is more robust. We trained the ensemble-ResHedNet to identify the locations of the GBs; then we extracted topography patches from the GBs locations and made these patches as exploration space (i.e., now the exploration space is only GBs). In this instance, in the DKL-cAFM experiment, the DKL only explores the morphology-IV relationship at GBs.</p><p>In the same manner as the previous measurements, the topography is used as the structural image and three physical descriptors are used as the target property to guide DKL-cAFM measurement at GBs. The results are shown in Figure <ref type="figure">4</ref>. Again, the exploration locations decided by DKL show a tendency to concentrate around GB junction points regardless of whichever target property is used. In addition, an obvious observation in Figure <ref type="figure">4</ref> is that the DKL procedure tends to explore some GBs but not others&#65533;that is, exploration locations mostly distributed in certain GBs. This can be understood as the internal variation of GBs, i.e., different GBs may exhibit different functionalities.</p><p>The DKL prediction of turn-on voltage draw a map showing complex features around the GBs under light that show higher values under dark. The predicted maximum current map show the higher levels under light across the surface, attributed to the carrier photogeneration in the MHP matrix. The predicted hysteresis factor map also exhibits local complexities, whereas smaller hysteresis factors are predicted around the GBs under light and dark conditions. Together, the resulting DKL predicted maps reveal local complexities in the MHP matrix directly visualizing the morphology-charge transport relation- ship, which has been undermined based on classical operator's interests.</p><p>To obtain further insight into the DKL results, we introduce the approach for the postexperiment data analytics that we refer to as postacquisition forensics. In this case, after the DKL experiment, we have access to (1) the series of local image patches acquired by the DKL algorithm during the experiment, P i , where i = 1, 2, ..., n is the image patch at step i (during the DKL experiment, the DKL algorithm sequentially explored images P i trying to optimize the chosen characteristic of the IV curve in its center); (2) the full set of image patches that can be formed from the original AFM image; (3) the IV curves IV i measured at the center of image patch i.</p><p>As a first step of analysis, we explore the spatial variability of the sampled image patches compared to the full set of possible image patches. To accomplish this, we use a rotationally invariant variational autoencoder (rVAE) approach to discover the intrinsic factors of variability within the data, as described in detail in our previous work. <ref type="bibr">20,</ref><ref type="bibr">27</ref> Briefly, rVAE encodes the information in the image patch P i into the latent vector L. Here for the ease of representation we choose L to be 2D, L = (L 1 , L 2 ). The important aspect of the rVAE approach is that it allows us to disentangle the representation of the data, meaning, to discover the factors of variability in the data set and align them with specific latent variables. With rVAE, the collection of patches can be represented as a distribution in the 2D space, as shown in Figure <ref type="figure">5</ref>. Here, we illustrate the latent distribution of the full data set via the corresponding kernel density estimate (KDE) and illustrate the distribution of the points corresponding to the sampled patches P i via color-coded points. It is clearly visible from this analysis that the experimentally probed density (Figure <ref type="figure">5b</ref>) is very different from the full density of the system (Figure <ref type="figure">5a</ref>), so the active learning workflow probes a specific manifold in the full space of the system.</p><p>The meaning of the latent variables, i.e., relationship between the latent vector and real space image patch can be illustrated using the latent representation as shown in Figure <ref type="figure">5c</ref>,d. To construct the latent representation, the rectangular grid of points in the latent space is constructed and each latent vector is decoded to yield the real space image. The latent representation for the full data set is shown in Figure <ref type="figure">5c</ref>. This shows the key elements of morphology ordered along the latent directions: along the vertical direction, we observe that GB gradually disappear from top to bottom; along the horizontal direction, the GB location changes gradually; in both directions, we observe the change of GB shape, e.g., curvature, as well as a contrast change. In comparison, the latent representation for the image set sampled during the DKL experiment is shown in Figure <ref type="figure">5d</ref>. Here, the predominant difference among grids is the GB location and contrast. We note that from the material perspective, we also studied the GB behavior in details in a separated work. <ref type="bibr">21</ref> With this information in hand, we proceed to analyze the experimental trajectory. Shown in Figure <ref type="figure">5b</ref>, the color represents experiment steps showing the trajectory of the DKL experiment in the latent space.</p><p>In summary, we implemented a DKL algorithm in cAFM to study the surface morphology-conductivity relationship of a polycrystalline MHP thin film. The approach combines the power of machine learning and AFM, allowing an automated exploration without human bias. The DKL exploration over the whole region including both grain and GBs implies that GBs are of interest, which matches physics that GBs are known to exhibit rich functionality. In addition, the results also show that GB junction points can be more interesting sites as DKL tends to learn the GB junction points during experiment. We further combined our ResHedNet with DKL that enables DKL focusing on studying GBs instead of the whole region. Interestingly, the results further confirm the GB junction points can exhibit interesting functionality which requires more in-depth study. The exploration of GBs also indicates a variation between GBs. Then, we explored the DKL sampling mechanism via VAE analysis.</p><p>In spite of intensive efforts investigating microscale properties of MHP, most work focused on the difference between grain and GBs, the variation between different GBs, as well as the GB junction points, is rarely investigated. This work, by active learning driven cAFM, highlights the role of GB junction points and the variation of GBs in MHP functionality in an automated manner without human bias.</p><p>We further note that the DKL approach developed here can be incorporated in more complex machine learning driven exploration pathways. Here, the DKL was realized to establish the relationship between the transport properties and microstructure parametrized via surface topography. However, in the upstream direction, this workflow can be based on the more complex characterization methods including cathodoluminescence, photoluminescence, time-of-flight secondary ion mass spectrometry (ToF-SIMS), electron backscatter diffraction (EBSD), etc. The key requirement here will be either the multimodal imaging when SPM is incorporated in these tools or creating of the fiducial marks that allow alignment of images acquired from different modalities. Similarly, downstream the region of interest identified via DKL SPM can be explored via destructive methods such as scanning transmission electron microscopy or atom probe tomography to establish the origins of the observed phenomena on the atomic level.</p><p>&#9632; METHODS MHP Synthesis. 1.0 M Cs 0.17 FA 0.83 PbI 3 precursor solution was prepared by mixing respective volumetric amounts of FAPbI 3 and CsPbI 3 precursors. A mixture of DMF and DMSO [DMF (v):DMSO (v) = 5:1] is used as the solvent. The Cs 0.17 FA 0.83 PbI 3 precursor was spin-coated on the ITO substrates with 500 rpm for 10 s (ramp 1000 rpm/s) and 4000 rpm for 35 s (ramp 2000 rpm/s) to make the thin film. Antisolvent chlorobenzene was added 10 s before the end of spin coating. The films were then annealed at 150 &#176;C for 10 min. The whole process was performed in a N 2 glovebox.</p><p>cAFM. Conductive AFM measurement were performed with Budget Sensor Multi75E-G Cr/Pt coated AFM probes (&#8764;3 N/ m) in an Oxford Instrument Asylum Research Cypher microscope. A National Instrument DAQ card is equipped in the Cypher microscope and LabView script is used to apply voltage and acquiring IV data. The automated experiment workflow is designed in a Python Jupyter Notebook with deep learning methods from AtomAI. <ref type="bibr">28</ref> &#9632; ASSOCIATED <ref type="bibr">CONTENT</ref> </p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>https://doi.org/10.1021/acs.jpclett.3c00223 J. Phys. Chem. Lett. 2023, 14, 3352-3359 Downloaded via UNIV OF TENNESSEE KNOXVILLE on March 30, 2023 at 17:22:45 (UTC).See https://pubs.acs.org/sharingguidelines for options on how to legitimately share published articles.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1"><p>https://doi.org/10.1021/acs.jpclett.3c00223 J. Phys. Chem. Lett. 2023, 14, 3352-3359</p></note>
		</body>
		</text>
</TEI>
