<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Machine Learning Chemical Guidelines for Engineering Electronic Structures in Half-Heusler Thermoelectric Materials</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>04/22/2020</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10195249</idno>
					<idno type="doi">10.34133/2020/6375171</idno>
					<title level='j'>Research</title>
<idno>2639-5274</idno>
<biblScope unit="volume">2020</biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Maxwell T. Dylla</author><author>Alexander Dunn</author><author>Shashwat Anand</author><author>Anubhav Jain</author><author>G. Jeffrey Snyder</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Half-Heusler materials are strong candidates for thermoelectric applications due to their high weighted mobilities and power factors, which is known to be correlated to valley degeneracy in the electronic band structure. However, there are over 50 known semiconducting half-Heusler phases, and it is not clear how the chemical composition affects the electronic structure. While all the n-type electronic structures have their conduction band minimum at either the                              Γ                            - or                              X                            -point, there is more diversity in the p-type electronic structures, and the valence band maximum can be at either the                              Γ                            -,                              L                            -, or                              W                            -point. Here, we use high throughput computation and machine learning to compare the valence bands of known half-Heusler compounds and discover new chemical guidelines for promoting the highly degenerate                              W                            -point to the valence band maximum. We do this by constructing an “orbital phase diagram” to cluster the variety of electronic structures expressed by these phases into groups, based on the atomic orbitals that contribute most to their valence bands. Then, with the aid of machine learning, we develop new chemical rules that predict the location of the valence band maximum in each of the phases. These rules can be used to engineer band structures with band convergence and high valley degeneracy.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>High thermoelectric performance requires a high thermoelectric quality factor which is proportional to the weighted mobility, &#956; W , divided by the lattice thermal conductivity, &#954; L <ref type="bibr">[1]</ref>. High weighted mobility, which is correlated to high peak power factor, makes p-type half-Heusler materials strong candidates for thermoelectric applications. These materials owe their high weighted mobilities and high power factors to weak electron-phonon coupling and high valley degeneracy imposed by the symmetry of the Brillouin zone <ref type="bibr">[2]</ref><ref type="bibr">[3]</ref><ref type="bibr">[4]</ref><ref type="bibr">[5]</ref><ref type="bibr">[6]</ref>. However, there are over 50 known semiconducting half-Heusler compounds <ref type="bibr">[7]</ref>, and it is not clear how the chemical composition affects the electronic structure. In recent work, machine learning has become a powerful tool for engineering complex properties in cases where the known physical trends are exhausted, but there are many features left to understand <ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref>. Simple models, driven by domain knowledge, are especially useful for discovering ways to engineer these properties, even when there are small amounts of available data <ref type="bibr">[14,</ref><ref type="bibr">15]</ref>. In this work, we use machine learning to develop simple models that explain the electronic structures of half-Heusler phases.</p><p>To begin to understand electronic structure in the half-Heusler family, we calculated the electronic structures' semiconducting (18 valence electrons) phases using density functional theory (DFT). We chose stable phases reported in the Inorganic Crystal Structure Database (ICSD) alongside 10 phases predicted stable (see Methods) in previous studies from DFT calculations <ref type="bibr">[3,</ref><ref type="bibr">16,</ref><ref type="bibr">17]</ref>. To quantitatively compare the calculated phases, we decomposed their near band-edge electronic structures into their chemical components-atomic orbitals. For domain experts, atomic orbitals are a powerful basis for interpreting electronic structure <ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref>. For example, small variations in orbital character (s/p/d) explain whether diamond-like semiconductors have direct or indirect band gaps <ref type="bibr">[22]</ref>. Based on a chemical map of each phase's atomic orbitals, we find that there are three distinct classes of electronic structures in the half-Heusler family. While all have conduction band minimum at either the X-point or the &#915;-point-there is more variance in the valence bands-the valence band maximum can be at one of three k-points in the Brillouin zone. Phases that are intermediates of the extreme cases even have increased valley degeneracy from the energy convergence of multiple k-points at the valence band edge. We use machine learning to elucidate how composition affects the relative energies of these k-points, which can direct efforts to engineer band structures with high degeneracy and weighted mobility. Similar to the valence balanced rule that predicts the stability of half-Heusler phases <ref type="bibr">[7]</ref>, we find that a new valence difference rule predicts the relative energies of the k-points. Instead of considering the total valence electron count (rule for stability), these rules consider the relative valence electron configurations of the elements on each site of the crystal structure (Figure <ref type="figure">1</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Classifying Valence-Band-Edge Electronic Structures</head><p>When discussing electronic structure in crystalline materials, there are dual aspects to consider. On one hand is the reciprocal space representation-that of electronic band diagrams-where electronic states are indexed by their wave vector, k, and band index, n. Reciprocal space holds predictive information for many transport properties. For example, materials with low effective mass (m * ) and high valley degeneracy have favorable electronic properties for thermoelectric applications <ref type="bibr">[23]</ref>. However, in this four-dimensional space, it is difficult to study systematic changes in electronic structure with varying chemical composition. The complementary perspective of the electronic structure is represented in real space, where the electronic states correspond to combinations of atomic orbitals <ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref>. Atomic orbitals are the components of electronic structures, analogous to how elements are the components of crystal structures. To further the analogy, relevant portions of the electronic structure are described by atomic orbital compositions. In this work, we consider the atomic orbital composition of the valence band edge using the projected density of states <ref type="bibr">[24,</ref><ref type="bibr">25]</ref>. The electronic structures are computed using density functional theory with the PBE functional without accounting for spinorbit coupling effects. We evaluate the fractions of states that would be occupied by holes in the valence bands (see Methods). This composition depends on the electron chemical potential (Fermi level) and temperature, but for consistency across multiple p-type phases, standard conditions were chosen. In this work, the Fermi level is placed at the valence band edge and the temperature is 700 K, which is near the temperature at the experimental peak power factor for half-Heusler materials <ref type="bibr">[3,</ref><ref type="bibr">4]</ref>. Between the three crystallographic sites (X/Y/Z) and three orbital characters (s/p/d), there are nine components to consider. However, only several of the components contribute meaningfully to the valence states, and 97% of the variation in an orbital character is accounted for by the X-d, Y-d, and Z-p components alone (Figure <ref type="figure">S2</ref> and Table <ref type="table">S1</ref>). Therefore, the phases can be represented in a Gibbs phase triangle (Figure <ref type="figure">2</ref>(a)). In contrast to a conventional phase diagram, which represents the stable phases within a composition region, the "orbital phase diagram" represents the diverse electronic structures expressed by phases within a structure family.</p><p>There are three emergent classes of valence band electronic structures (indicated by blue, red, and green). The first class of electronic structure (blue) has the valence band maximum at &#915;, which has a degeneracy of one in the first Brillouin zone (N v k ). To clarify, we are considering the degeneracy imposed by the symmetry of the Brillouin zone, which does not include the number of degenerate bands (N v o , orbital degeneracy) at that k-point</p><p>TiNiSn is an example compound from this class, where the valence band edge is dominated by Ti-d states (Figure <ref type="figure">2(c)</ref>). The second electronic structure class (red) has its valence band maximum at the L-point-a degeneracy of four. TaFeSb exemplifies this class, where the band-edge states are dominated by Fe-d (Figure <ref type="figure">2(d)</ref>). In the last class of electronic structure (green), the valence band maximum is at the W-point (degeneracy of six). These electronic structures (e.g., NbRhSn in Figure <ref type="figure">2(b)</ref>) have relatively higher band-edge contributions from Z-p orbitals, which originates from the states along the X-W path (green-orange hue). Each of the other electronic structures are hybrids of the three classes. For example, NbCoSn is a hybrid between the W-point (green) and L-point (red) extremes, with both carrier pockets within 100 meV of the band edge. Irrespective of the electronic structure class, the type of atomic orbitals contributing to each k-point (within the first valence band) is similar among all of the half-Heusler materials-the &#915;-point is dominated by X-d states, the L-point is dominated by Y-d states, and Z-p states are mixed into the X-W path. Therefore, the chemical bonding is similar among all the materials. The primary source of variance among their electronic structures is the relative energies of the &#915;-, L-, and W-points, which are linked to the relative energies of their constituent atomic orbitals. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Valence Difference Rules for Engineering &#915;-L Carrier Pockets</head><p>Engineering the &#915;-L energy offset tunes the valley degeneracy and the thermoelectric performance of half-Heusler materials <ref type="bibr">[4]</ref>. The relative energies of the &#915;and L-points are described by simple, chemical differences between the Xand Y-species. The dominant, first-order effect is the difference in valence between the Xand Y-species, which is encoded in their group (column) number on the periodic table. In a linear model, differences in valence account for over 85% of the variation in the &#915;-L energy offset (Figure <ref type="figure">3</ref>). Compounds with larger differences in valence have valence band maxima at &#915; (e.g., TiNiSn, where Ni has six more valence electrons than Ti), while compounds with smaller differences in valence have valence band maxima at L (e.g., NbFeSb, where Fe has only three more valence electrons than Nb). A second-order descriptor is the difference </p><p>TiNiSn TaFeSb</p><p>NbRhSn in Pauling's electronegativity between the Xand Y-species, which can account for differences in the &#915;-L energy offset between compounds with isovalent species (e.g., NbCoSn and NbRhSn). Furthermore, elemental characteristics of the Z-species do not improve the prediction of the &#915;-L energy offset, likely because the energies are properties of the Xand Y-species orbitals. Recall that the &#915;and L-point states are formed from the X-d and Y-d orbitals.</p><p>The valence difference rule extends beyond the semiconducting phases to metastable phases with 17 and 19 valence electrons <ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[28]</ref>, which are p-and n-type metals (Figure <ref type="figure">S7</ref>). For example, while the energy difference between the &#915;and L-points is nearly zero for TiCoSb, the &#915;-pocket dominates the valence band maximum in the Nisubstituted analog; TiNiSb has a larger valence difference and 19 valence electrons. Conversely, in the Fe-substituted analog, the L-pocket dominates; TiFeSb has 17 valence electrons and a smaller difference in valence. While TiNiSb and TiFeSb are not stable themselves, there are implications for forming solid solutions between TiCoSb and either of the metallic end-members (electronic doping) <ref type="bibr">[29]</ref>-the relative energies of the &#915;and L-points may change.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Engineering Highly Degenerate W-Pocket Materials</head><p>Materials that contain both group IV (e.g., Sn) and group IX (e.g., Co) elements adopt a distinct class of electronic structure, where the W-point is at or near the valence band edge (Figure <ref type="figure">4</ref>). In six of these seven phases, the W-point and L-point are both within 100 meV of the valence band edge, effectively converged at 1200 K. The exception to the converged cases is NbRhSn, which is the most extreme example of the W-pocket class. While only Sn-and Ge-containing end-member phases are reported stable in the literature, the calculation of metastable NbCoPb confirms that this valence rule extends beyond Sn-and Ge-containing compounds (Figure <ref type="figure">S8</ref>). Entropy-stabilized solid solutions between NbCoSn and NbCoPb could benefit from reduced lattice thermal conductivity from alloy scattering and retain valleyhigh degeneracy throughout the solid solution <ref type="bibr">[30]</ref><ref type="bibr">[31]</ref><ref type="bibr">[32]</ref>. However, the carrier density must be tuned to optimize the thermoelectric transport properties. There are three sites where aliovalent substitution can introduce additional holes in the system and tune the carrier density. We have computed several site-substituted end-members to investigate the potential changes in band structure induced by candidate dopant elements (Figure <ref type="figure">S8</ref>). Substituting on the Xand Y-sites has the expected behavior of tuning the &#915;-L energy offset, based on the valence difference rules developed in Section 3. Substituting Ti on the Nb-site (X-site) raises the relative energy of the &#915; point, since the valence difference between Ti and Co is larger than between Nb and Co. Introducing Fe on the Co-site (Y-site) has the opposite effect, and pushes the L-point above the W-point, unconverging the bands. However, substituting In on the Sn-site (Z-site) has an entirely new effect. In NbCoIn, the X-point is at the valence band edge. This compound has an entirely different class of electronic structure, distinct from the three archetypal band structures identified in Figure <ref type="figure">2</ref>. The p states from elemental In appear to promote the X-point to the valence band edge.</p><p>The band structure has a flat and dispersive character between the Xand W-points, which is similar to the band character found in SrTiO 3 and some full-Heusler phases <ref type="bibr">[33,</ref><ref type="bibr">34]</ref>. There may be differences in transport properties between materials doped on each of the three sites.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>We have mapped the electronic structures of semiconducting half-Heusler phases according to the atomic orbital composition of their valence bands. This mapping is termed an orbital phase diagram, and it reveals that there are three welldistinguished classes of electronic structures. The k-points forming the valence band maximum are different for each electronic structure class. The relative energies of these k -points can be controlled using simple rules based on the valence electron configurations of the elemental species.</p><p>The difference in valence between the Xand Y-species controls the relative energies of the &#915;and L-point energies, while controlling the valence of the Yand Z-species can lead to the emergence of highly degenerate carrier pockets at the W-point. These rules extend beyond the semiconducting phases, as demonstrated by calculations of metastable 17 and 19 valence electron phases. These results form a foundation for exploring the space of possible solid solutions in this structure family. Forming solid solutions is incredibly important in the half-Heusler family for suppressing their high lattice thermal conductivities <ref type="bibr">[4,</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref><ref type="bibr">[37]</ref><ref type="bibr">[38]</ref><ref type="bibr">[39]</ref><ref type="bibr">[40]</ref><ref type="bibr">[41]</ref>. While lattice thermal conductivities in solid solutions are quantitatively described by empirical models <ref type="bibr">[30]</ref><ref type="bibr">[31]</ref><ref type="bibr">[32]</ref>, changes in electronic properties are understood more qualitatively. To the first order, the apparent band structure in a solid solution is a linear interpolation between the end-member electronic structures <ref type="bibr">[42]</ref><ref type="bibr">[43]</ref><ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref><ref type="bibr">[47]</ref>. For example in the Zintl structure family, the band gap and effective mass in n-type Mg 3 Sb 2 Mg 3 Bi 2 change linearly with composition between Mg 3 Sb 2 and Mg 3 Bi 2 <ref type="bibr">[48]</ref>. In the III-V semiconductors, the band gap of InAs-GaAs changes linearly as well <ref type="bibr">[49]</ref>. In future work on half-Heuslers, the effects of forming solid solutions on the electronic structures could be studied by calculating the backfolded band structures <ref type="bibr">[50,</ref><ref type="bibr">51]</ref> or analyzing their transport properties. Furthermore, the orbital phase diagram technique will be useful for tracking the changes in electronic structure throughout the solid solutions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Methods</head><p>6.1. Calculation Details. Electronic structure calculations were carried out using a plane-wave basis (cutoff energy of 520 eV) in the VASP package with PAW pseudopotentials and the PBE functional <ref type="bibr">[25,</ref><ref type="bibr">[52]</ref><ref type="bibr">[53]</ref><ref type="bibr">[54]</ref>. Spin-orbit coupling corrections were not applied to these calculations. The structural degrees of freedom were relaxed using 12 &#215; 12 &#215; 12 Monkhorst-Pack k-point meshes <ref type="bibr">[55]</ref>, followed by relaxation of the electronic degrees of freedom using 15 &#215; 15 &#215; 15 meshes. Finally, a non-self-consistent field calculation with 20 &#215; 20 &#215; 20 gamma-centered meshes was used to calculate quantitatively accurate density of states with tetrahedron smearing <ref type="bibr">[56]</ref>. In addition, inertial (conductivity) effective masses were calculated using the BoltzTraP package <ref type="bibr">[57]</ref>. This set of calculations were performed with the atomate workflow software <ref type="bibr">[58]</ref>. The projected density of states and chemical composition were featurized in the matminer package using the SiteDOS and ElementProperty (with pymatgen data) featurizers <ref type="bibr">[59]</ref>. The Fermi surfaces of the electronic structures were visualized using the pymatgen package <ref type="bibr">[60]</ref>. The most important atomic features for modeling the &#915;-L energy offset were determined by ridge regression <ref type="bibr">[61]</ref>. The calculations were performed on stable phases reported in the Inorganic Crystal Structure Database (ICSD) alongside 10 phases (HfAsIr, HfBiRh, HfNiPb, HfPdPb, NbSbOs, TaS-bOs, TaSnRh, TiAsIr, TiSnPd, and ZrAsIr) predicted stable in previous studies from DFT calculations <ref type="bibr">[3,</ref><ref type="bibr">16,</ref><ref type="bibr">17]</ref>.</p><p>6.2. Measuring Electronic Structure Compositions. In p-type semiconductors, charge-transporting holes occupy states in the valence bands according to the distribution function for holes (h = 1f , where f is the Fermi-Dirac distribution function) <ref type="bibr">[62]</ref>. These valence states are ascribed to particular atomic orbitals in the projected density of states (g i ). The number of occupied holes from a particular atomic orbital (p i ) is accumulated from the valence band states (Figure <ref type="figure">S1</ref>).</p><p>The fractions of atomic-like holes (x i = p i /&#931; i p i ) describe the composition of the system of holes in a particular phase. The composition depends on the Fermi level and the temperature. In this work, the Fermi level is placed at the band edge and the temperature is 700 K. When analyzing conduction bands, the Fermi-Dirac distribution function can replace the hole distribution function.</p><p>6.3. Modeling the &#915;-L Energy Offset. Regression was used to identify design principles for engineering the &#915;-L energy offset. Fivefold crossvalidation was used to score the trained models according to the coefficient of determination (r 2 ). The model pipeline consisted of standard scaling of the input features (generated from the ElementProperty featurizer with pymatgen data, which was applied to each crystal site) to zero mean and unit variance, followed by ridge regression trained by gradient descent with early stopping. The model scoring was optimized over a grid of tolerance values for early stopping. The optimized model scores and regression weights were collected for a series of regularization strengths (Figure <ref type="figure">S4/5</ref>). As the regularization penalty was decreased, the Xand Y-site group number became the most dominant feature as measured by the regression weights. Ordinary least squares reveals that over 85% of the variation in the energy offset is explained by the difference in group number between the Xand Y-sites alone (Figure <ref type="figure">S6</ref>). 6.4. Modeling the W-Pocket Class. It was observed that compounds with both group IV (e.g., Sn) and group IX (e.g., Co) elements adopt the W-pocket type electronic structure. To confirm that this rule describes the distinct class of electronic structure, we compared the distributions of energy offsets between compounds that follow this chemical rule and those that do not (Figure <ref type="figure">4</ref>). The distributions were estimated using a Gaussian kernel. It can be seen that the two distributions are distinct.</p></div></body>
		</text>
</TEI>
