<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Adaptive Kriging Method for Uncertainty Quantification of the Photoelectron Sheath and Dust Levitation on the Lunar Surface</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>03/01/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10247932</idno>
					<idno type="doi">10.1115/1.4050073</idno>
					<title level='j'>Journal of Verification, Validation and Uncertainty Quantification</title>
<idno>2377-2158</idno>
<biblScope unit="volume">6</biblScope>
<biblScope unit="issue">1</biblScope>					

					<author>Xinpeng Wei</author><author>Jianxun Zhao</author><author>Xiaoming He</author><author>Zhen Hu</author><author>Xiaoping Du</author><author>Daoru Han</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Abstract            This paper presents an adaptive Kriging based method to perform uncertainty quantification (UQ) of the photoelectron sheath and dust levitation on the lunar surface. The objective of this study is to identify the upper and lower bounds of the electric potential and that of dust levitation height, given the intervals of model parameters in the one-dimensional (1D) photoelectron sheath model. To improve the calculation efficiency, we employ the widely used adaptive Kriging method (AKM). A task-oriented learning function and a stopping criterion are developed to train the Kriging model and customize the AKM. Experiment analysis shows that the proposed AKM is both accurate and efficient.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>The Moon is directly exposed to solar radiation and solar wind plasma (drifting protons and electrons) lacking an atmosphere and a global magnetic field. Consequently, the lunar surface is electrically charged by the bombardment of solar wind plasma and emission/collection of photoelectrons. Near the illuminated lunar surface, the plasma sheath is dominated by photoelectrons, thus usually referred to as "photoelectron sheath". Additionally, dust grains on the lunar surface may get charged and levitated from the surface under the influence of the electric field within the plasma sheath as well as gravity. This work is motivated by the high computational cost associated with uncertainty quantification (UQ) analysis of plasma simulations using high-fidelity kinetic models such as particle-in-cell (PIC). The main quantities of interest (QoI) of this study is the vertical structure of the photoelectron sheath and its effects on levitation of dust grains with different sizes and electric charges.</p><p>Both the electric potential (&#120601;&#120601;) and the electric field (&#119864;&#119864;) on lunar surface are determined by many parameters, such as solar wind drifting velocity (&#119907;&#119907; d ), electron temperature (&#119879;&#119879; e ), photoelectron temperature (&#119879;&#119879; p ), density of ions at infinity (&#119899;&#119899; i,&#8734; ), and density of photoelectrons (&#119899;&#119899; p ), etc. Due to uncertain factors in lunar environment, the electric potential, electric field, and the dust levitation height, etc., are also uncertain. While many sources uncertainty may exist, they are generally categorized as either aleatory or epistemic. Uncertainties are characterized as epistemic, if the modeler sees a possibility to reduce them by gathering more data or by refining models. Uncertainties are categorized as aleatory if the modeler does not foresee the possibility of reducing them <ref type="bibr">[1]</ref>. An example of the aleatory uncertainty in lunar environment is the solar wind parameters, and an example of the epistemic uncertainty is the photoelectron temperature which is obtained by limited measurement data from Apollo missions. For lunar landing missions, one needs to take into consideration the uncertainties of the electrostatic and dust environment near the lunar surface. For example, the upper and lower bounds of the electric field and dust grain levitation heights in the photoelectron sheath should be considered when determining whether it is safe for a certain area to land a spacecraft.</p><p>Determining the bounds of the electric potential, electric field, and dust levitation height, however, is computationally expensive, because the particle-based kinetic models such as particle-in-cell simulations are time-consuming to evaluate. To address this issue, we develop an adaptive Kriging method (AKM) which can determine those bounds with a small number of calculations of the model. It is straightforward to train and obtain an accurate Kriging model <ref type="bibr">[2]</ref> to replace the actual model and then calculate the bounds with the model. However, it is not necessary for the Kriging model to be accurate everywhere in its input space, because it will need more training samples and hence decrease the efficiency. Since the objective is to determine those bounds, we only need the Kriging model to be partially accurate near the regions of interest, as long as it can help find those bounds accurately. This way, we can save more computational efforts. To this end, we develop a task-oriented learning function and a stopping criterion to adaptively train the Kriging model. We start with an analytic model for the 1-D photoelectron sheath near the lunar surface <ref type="bibr">[3,</ref><ref type="bibr">4]</ref>. This model is computationally cheap and hence the accurate results can be obtained by brute force. With the accurate results, we can test the accuracy of the proposed method. It is noted here that the ultimate application of this method is not the simple 1-D problem presented in this work, but more complicated or computationally expensive models such as 3-D fully kinetic particle-in-cell plasma simulations.</p><p>The rest of this paper is organized as follows. Section 2 presents the 1-D photoelectron sheath and dust levitation problem on lunar surface, as well as the 1-D analytic model. Section 3 briefly introduces the Kriging method and general AKM. Section 4 presents the proposed AKM. Section 5 presents the results. Conclusions are given in Section 6.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">PROBLEM STATEMENT</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">1-D Photoelectron Sheath Model on the Lunar Surface</head><p>We employ the recently derived 1-D photoelectron sheath model for the lunar surface <ref type="bibr">[3,</ref><ref type="bibr">4]</ref>. As given in detail in <ref type="bibr">[3,</ref><ref type="bibr">4]</ref>, there are three types of electric potential profiles <ref type="bibr">[3]</ref><ref type="bibr">[4]</ref><ref type="bibr">[5]</ref><ref type="bibr">[6]</ref> in the photoelectron sheath: Type A, Type B, and Type C, as shown in Fig. <ref type="figure">1</ref>, where &#120601;&#120601; is the electric potential and &#119885;&#119885; is the vertical coordinate. In this study, we focus on Type C sheath profile as it is expected at the polar regions of the Moon, where the next lunar landing mission will likely occur.</p><p>Both the electrical potential &#120601;&#120601; and corresponding electric field &#119864;&#119864; are functions of &#119885;&#119885; with a series of parameters &#119823;&#119823; = &#65533;&#119907;&#119907; d , &#119879;&#119879; e , &#119879;&#119879; p , &#119899;&#119899; i,&#8734; , &#119899;&#119899; p &#65533;. To obtain &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) and &#119864;&#119864;(&#119885;&#119885;; &#119823;&#119823;), we need to solve an ordinary differential equation (ODE) <ref type="bibr">[3]</ref>. Once the potential profile &#120601;&#120601; is obtained, it is straightforward to calculate electric field &#119864;&#119864; by</p><p>A typical Type C sample curve of &#119864;&#119864;(&#119885;&#119885;; &#119823;&#119823;) is shown in Fig. <ref type="figure">2</ref>. Note that both &#120601;&#120601; and &#119864;&#119864; converge to zero at large values of &#119885;&#119885; where it is used as the electric potential reference (zero potential and zero field).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Dust Levitation</head><p>Subjected to the electric field force, a charged dust on lunar surface may be levitated <ref type="bibr">[7,</ref><ref type="bibr">8]</ref>. Above the lunar surface, there is a position where the upward electric field force balances the downward gravity <ref type="bibr">[4]</ref>. This position is referred to as equilibrium levitation height, denoted as &#119885;&#119885; * . &#119885;&#119885; * can be solved through the following equation of static equilibrium of a charged dust in an electric field:</p><p>where &#119902;&#119902; is the dust charge, &#119898;&#119898; is the mass of the dust, and &#119898;&#119898; = 1.62 m/s 2 is the gravity acceleration on lunar surface <ref type="bibr">[9]</ref>. With the assumption of spherical dust grains, &#119898;&#119898; is given by</p><p>where &#119903;&#119903; is the radius of the lunar dust grain, and &#120588;&#120588; = 1.8 g/cm 3 is the mass density of dust grains <ref type="bibr">[10]</ref>.</p><p>For simplicity, Eq. ( <ref type="formula">2</ref>) is rewritten as</p><p>where &#119908;&#119908; = &#119898;&#119898;&#119898;&#119898;/&#119902;&#119902;. Once both &#119864;&#119864;(&#119885;&#119885;; &#119823;&#119823;) and &#119908;&#119908; have been given or determined, a root-finding scheme is employed to solve Eq. ( <ref type="formula">4</ref>) for &#119885;&#119885; * . Fig. <ref type="figure">3</ref> shows an example of how to obtain &#119885;&#119885; * graphically.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Objective</head><p>Due to the lack of information, it is almost impossible to obtain the distribution functions of &#119823;&#119823;. </p><p>However, this method is too expensive or even unaffordable. One reason is that solving the ODE a large number &#119873;&#119873; MCS of times is time-consuming, even when the analytic solution to the ODE is available for the 1-D problem. Another reason is that there is no analytic solution to complex 2-D or 3-D problems where kinetic particle-in-cell simulations are usually employed to solve the electrostatic field through Poisson's equation.</p><p>The objective of this study is to develop a method to determine &#120601;&#120601;(&#119885;&#119885;), &#120601;&#120601;(&#119885;&#119885;), &#119864;&#119864;(&#119885;&#119885;) and &#119864;&#119864;(&#119885;&#119885;) accurately and then calculate &#119885;&#119885; * of dust grains. It is noted here that the ultimate application of this method is not the relatively simple 1-D problem presented in this work, but more complicated or computationally expensive models such as 3-D fully kinetic particle-in-cell plasma simulations. For computationally expensive models, evaluating the model consumes the majority of computational resource, so we will use the number &#119873;&#119873; ODE of ODEs that we need to solve as a measure of the computational cost.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">INTRODUCTION TO KRIGING MODEL AND AKM</head><p>Before presenting the proposed method, we briefly introduce the Kriging model <ref type="bibr">[12,</ref><ref type="bibr">13]</ref> and AKM <ref type="bibr">[13]</ref><ref type="bibr">[14]</ref><ref type="bibr">[15]</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">[24]</ref><ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[28]</ref>, on which the proposed method is based.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Overview of Kriging Model</head><p>Kriging model makes regression to a black-box function (BBF) using a training sample set, or a design of experiment (DoE). The main idea of Kriging is to treat the BBF as a realization of a Gaussian random field indexed by the input variables of the BBF. The theoretical foundation of Kriging model is exactly the Bayesian inference <ref type="bibr">[29]</ref> . From the perspective of Bayesian interface, a prior Gaussian random field is trained by the DoE and hence a posterior Gaussian random field is generated. Then the mean value function, also indexed by the input variables of the BBF, of the posterior random field is the Kriging prediction to the BBF. Besides, the variance function, also indexed by the input variables of the BBF, of the posterior random field quantifies the local prediction uncertainty or prediction error.</p><p>The randomness, or uncertainty, of the posterior random field mainly comes from the fact that only a limit number of samples of the BBF are used to train the prior random field. In other words, only part of the information of the BBF is available, and the missing part of information leads to the epistemic uncertainty in the random field. Generally, the more training samples we use, the less epistemic uncertainty will result and with stronger confidence will we predict the BBF.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Formulation for Kriging Model</head><p>A simple yet widely used prior random field is the stationary Gaussian random field given by</p><p>where &#120583;&#120583; is an unknown parameter representing the mean value of the random field &#119870;&#119870;(&#119831;&#119831;) and &#120578;&#120578;(&#119831;&#119831;; &#120585;&#120585; 2 , &#120579;&#120579;) is a zero-mean stationary Gaussian random field indexed by &#119831;&#119831;, the input variables of a BBF &#119896;&#119896;(&#119831;&#119831;). Both the variance parameter &#120585;&#120585; 2 and correlation parameters &#120521;&#120521; of &#120578;&#120578;(&#119831;&#119831;; &#120585;&#120585; 2 , &#120521;&#120521;) are unknown. The parameters &#120583;&#120583;, &#120585;&#120585; 2 and &#120521;&#120521; fully define the prior random field &#119870;&#119870;(&#119831;&#119831;). A DoE, or a training sample set, of &#119896;&#119896;(&#119831;&#119831;) is used to train &#119870;&#119870;(&#119831;&#119831;) and then determine those parameters.</p><p>The correlation function &#119862;&#119862;&#65533;&#119857;&#119857; (&#119894;&#119894;) , &#119857;&#119857; (&#119895;&#119895;) &#65533; of &#120578;&#120578;(&#119831;&#119831;; &#120585;&#120585; 2 , &#120521;&#120521;) is given by</p><p>where &#119877;&#119877;&#65533;&#119857;&#119857; (&#119894;&#119894;) , &#119857;&#119857; (&#119895;&#119895;) ; &#120521;&#120521;&#65533; is the correlation coefficient function of &#120578;&#120578;(&#119831;&#119831;; &#120585;&#120585; 2 , &#120521;&#120521;) at two points &#119857;&#119857; (&#119894;&#119894;) and &#119857;&#119857; (&#119895;&#119895;) of &#119831;&#119831;. There are many models for &#119877;&#119877;&#65533;&#119857;&#119857; (&#119894;&#119894;) , &#119857;&#119857; (&#119895;&#119895;) ; &#120521;&#120521;&#65533;. A widely used model is known as the Gaussian model, or squared exponential model, given by</p><p>where &#119863;&#119863; is the dimension of &#119831;&#119831;, &#119909;&#119909; &#119889;&#119889; (&#119894;&#119894;) is the &#119889;&#119889; th component of &#119857;&#119857; (&#119894;&#119894;) , and &#120579;&#120579; &#119889;&#119889; is the &#119889;&#119889; th component of &#120521;&#120521;.</p><p>For a BBF &#119896;&#119896;(&#119831;&#119831;), the Kriging model predicts &#119896;&#119896;(&#119857;&#119857;) as &#119896;&#119896; &#65533; (&#119857;&#119857;), which is a normal variable whose mean value and variance are &#119896;&#119896; &#65533; (&#119857;&#119857;) and &#120590;&#120590; 2 (&#119857;&#119857;), respectively. Note that &#120590;&#120590; 2 (&#119857;&#119857;) is also termed as mean squared error (MSE). Generally, &#119896;&#119896; &#65533; (&#119857;&#119857;) is regarded as the deterministic prediction to &#119896;&#119896;(&#119857;&#119857;), since a deterministic prediction is usually needed. &#120590;&#120590; 2 (&#119857;&#119857;) measures the prediction uncertainty, or prediction error, and therefore we can validate a Kriging model simply using &#119896;&#119896; &#65533; (&#119857;&#119857;) and &#120590;&#120590; 2 (&#119857;&#119857;) without employing traditional validation methods, such as the cross validation <ref type="bibr">[30]</ref>. Because of this advantage, many algorithms have been proposed to adaptively train a Kriging model for expensive BBFs <ref type="bibr">[14]</ref><ref type="bibr">[15]</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">[24]</ref><ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[31]</ref><ref type="bibr">[32]</ref><ref type="bibr">[33]</ref><ref type="bibr">[34]</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref>. When sufficient training samples have been used for training, &#120590;&#120590; 2 (&#119857;&#119857;) converges to 0 and the normal variable &#119896;&#119896; &#65533; (&#119857;&#119857;)</p><p>degenerates to a deterministic value, i.e., the exact value of &#119896;&#119896;(&#119857;&#119857;). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">An Example of Kriging Model</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">AKM</head><p>The main idea of AKM is to adaptively add training samples to update the Kriging model iteratively until an expected accuracy is achieved. Fig. <ref type="figure">6</ref>  Given a specific engineering problem, the key of employing an AKM is to make good use of all available information, such as the features of the BBF and QoI, and then design a customized or task-oriented error metric, stopping criterion and learning function.</p><p>In the UQ community, a great number of AKMs have been developed to solve varies kinds of problems, such as reliability analysis <ref type="bibr">[15, 17-24, 26, 31-33, 36]</ref>, robustness analysis <ref type="bibr">[14]</ref>,</p><p>sensitivity analysis <ref type="bibr">[34]</ref>, robust design <ref type="bibr">[25,</ref><ref type="bibr">35]</ref>, and reliability-based design <ref type="bibr">[16,</ref><ref type="bibr">27]</ref>, etc.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">THE PROPOSED METHOD</head><p>In this section, we present detailed procedures of calculating &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;). Similar procedures can also apply to calculate &#119864;&#119864;(&#119885;&#119885;) and &#119864;&#119864;(&#119885;&#119885;).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Overview of the Proposed Method</head><p>The main idea of the proposed method is to employ the framework of AKM and customize it to calculate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) (as well as &#119864;&#119864;(&#119885;&#119885;) and &#119864;&#119864;(&#119885;&#119885;)). Fig. <ref type="figure">7</ref> shows the brief flowchart of the proposed method. In Step 1, we evenly generate &#119873;&#119873; in initial samples of &#119823;&#119823;.</p><p>Generally, &#119873;&#119873; in is much smaller than &#119873;&#119873; MCS . Details of this step will be given in Subsection 4.2. In</p><p>Step </p><p>)</p><p>In Step 5, an error metric is developed to measure the error of &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) estimated by Eq. ( <ref type="formula">16</ref>) and Eq. ( <ref type="formula">17</ref>).</p><p>Step 6 is about a stopping criterion. Details about Steps 5 and 6 will be given in Subsection 4.4. The learning function involved in Step 7 will be given in Subsection 4.3. The implementation of the proposed method will be given in Subsection 4.5.</p><p>There are two significant differences between most existing AKMs and the proposed method. First, the former aims at estimating a constant value, such as the structural reliability and robustness, while the latter aims at estimating two functions, i.e., &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;). Second, when given a specific value of input, the output of the BBFs involved in the former methods is a single value. However, in this work, with a given realization &#119849;&#119849; of &#119823;&#119823;, the output of solving the ODE is a function &#120601;&#120601;(&#119885;&#119885;; &#119849;&#119849;). With those differences, we cannot use the existing error metrics, stopping criteria or learning functions. Instead, we take into consideration those differences and design a new error metric, stopping criterion and learning function to fit the problem. This is the main contribution of the proposed algorithm.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Candidate Samples and Initial Training Samples</head><p>For numerical computation, we need to evenly discretize &#937; into a few points. Suppose &#119875;&#119875; &#119894;&#119894; , the &#119894;&#119894; th component of &#119823;&#119823;, is discretized into &#119873;&#119873; &#119894;&#119894; points, then &#937; will be discretized into in total</p><p>points. For convenience, we denote the set of those points as &#119849;&#119849; MCS  The &#119873;&#119873; in initial training samples &#119849;&#119849; in of &#119823;&#119823; are selected such that they are distributed in &#937; as even as possible. Commonly used sampling methods include random sampling, Latin hypercube sampling and Hammersley sampling <ref type="bibr">[37]</ref>. In this study, we employ the Hammersley sampling method because it has better uniformity properties over a multidimensional space <ref type="bibr">[38]</ref>. The Hammersley sampling method firstly generates initial training samples in a 5dimensional hypercube [0,1] 5 and then they are mapped into &#937; to get the initial training samples of &#119823;&#119823;. Note that the five dimensions of the hypercube are assumed to be independent, with the assumption that all variables in &#119823;&#119823; are independent. Those initial training samples, however, are not necessarily among &#119849;&#119849; MCS , so we need to round them to the nearest ones in &#119849;&#119849; MCS . Since the components of &#119823;&#119823; do not necessarily share the same dimension unit, the distances which we use to find the nearest samples should be normalized. For example, the distance &#119889;&#119889; between a sample &#119849;&#119849; (h) generated by Hammersley and a candidate sample &#119849;&#119849; (c) in &#119849;&#119849; MCS is given by</p><p>&#119875;&#119875; &#119894;&#119894;,max -&#119875;&#119875; &#119894;&#119894;,min &#65533; 2 5 &#119894;&#119894;=1 <ref type="bibr">(18)</ref> where &#119901;&#119901; &#119894;&#119894; (&#8462;) is the &#119894;&#119894; th component of &#119849;&#119849; (h) , &#119901;&#119901; &#119894;&#119894; (&#119888;&#119888;) is the &#119894;&#119894; th component of &#119849;&#119849; (c)  Since for any &#119849;&#119849; &#8712; &#119849;&#119849; MCS , it is known that &#120601;&#120601;(&#119885;&#119885; max ; &#119849;&#119849;) &#8801; 0 (Fig. <ref type="figure">1</ref>), theoretically we also need to add all the &#119873;&#119873; &#119875;&#119875; points &#119885;&#119885; max &#215; &#119849;&#119849; MCS as input training samples so that we make good use of all known information. However, it is not practical to do so. For example, if &#119873;&#119873; &#119894;&#119894; = 10, &#119894;&#119894; = 1,2, &#8230; ,5, we need to add &#119873;&#119873; &#119875;&#119875; = 10 5 points as input training samples. So many training samples will make &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;) complex, expensive and not accurate, losing its expected properties. To balance the need to add them and the drawback of adding all of them, we add part of them.</p><p>Specifically, we evenly generate &#119873;&#119873; &#119875;&#119875; &#8242; samples &#119849;&#119849; &#8242; of &#119823;&#119823; using procedures similar to what we used to generate &#119849;&#119849; in . Then &#119857;&#119857; inp2 is given by</p><p>The input training sample set &#119857;&#119857; inp = &#119857;&#119857; inp1 &#8899; &#119857;&#119857; inp2 . Denote the corresponding electric potential &#120601;&#120601; at &#119857;&#119857; inp as &#120543;&#120543; out . The input-output training sample pairs &#65533;&#119857;&#119857; inp , &#120543;&#120543; out &#65533; are used to build the initial &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;). More training samples will be added to update &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;) later.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Learning Function</head><p>Generally, the initial Kriging model is not accurate enough to get &#120601;&#120601;(&#119885;&#119885;) or &#120601;&#120601;(&#119885;&#119885;) accurately through Eq. ( <ref type="formula">5</ref>) and Eq. ( <ref type="formula">6</ref>). To improve the accuracy of &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;) and hence of &#120601;&#120601;(&#119885;&#119885;)</p><p>and &#120601;&#120601;(&#119885;&#119885;), we need to add training samples of &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) to refine &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;). A learning function is used to determine which sample of &#119823;&#119823;, and hence of &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;), should be added.</p><p>In our previous work <ref type="bibr">[3]</ref>, we used the learning function given by</p><p>where &#119849;&#119849; (next) is the next to-be-added sample of </p><p>To estimate &#120601;&#120601;(&#119885;&#119885;), we also propose a learning function given by</p><p>In order to estimate both &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) simultaneously, we combine Eq. ( <ref type="formula">23</ref>) and Eq. ( <ref type="formula">25</ref>) to propose a learning function given by</p><p>Once &#119849;&#119849; (next) has been determined, we solve the ODE to numerically get a function &#120601;&#120601;&#65533;&#119885;&#119885;; &#119849;&#119849; (next) &#65533;, from which we get (&#119873;&#119873; &#119885;&#119885; -1) points (the remaining one at &#119885;&#119885; max , where &#120601;&#120601; &#8801; 0, is excluded) and add them into &#65533;&#119857;&#119857; inp , &#120543;&#120543; out &#65533; to enrich the training samples.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.">Error Metric and Stopping Criterion</head><p>Since Eq. ( <ref type="formula">16</ref>) and Eq. ( <ref type="formula">17</ref>) cannot obtain absolutely accurate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) due to the prediction error of &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;), we need an error metric to measure the error of currently estimated &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;). Since &#65533; &#120585;&#120585;(&#119911;&#119911;,&#119849;&#119849;)</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&#120601;&#120601;(&#119911;&#119911;)</head><p>&#65533; measures the absolute expected improvement rate of</p><p>&#65533; is small for any &#119911;&#119911; &#8712; &#119859;&#119859; MCS and &#119849;&#119849; &#8712; &#119849;&#119849; MCS , &#120601;&#120601;(&#119885;&#119885;) is expected to sufficiently accurate. Therefore, we propose to use max &#119911;&#119911;&#8712;&#119859;&#119859; MCS ,&#119849;&#119849;&#8712;&#119849;&#119849; MCS &#65533; &#120585;&#120585;(&#119911;&#119911;,&#119849;&#119849;) &#120601;&#120601;(&#119911;&#119911;) &#65533; to quantify the error of &#120601;&#120601;(&#119885;&#119885;).</p><p>&#65533; is used to quantify the error of &#120601;&#120601;(&#119885;&#119885;). Combining both, we have the error metric &#120548;&#120548;, which measures the error of both &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;), given by</p><p>Once &#120548;&#120548; is small enough, the estimated &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) are expected to be sufficiently accurate. Therefore, the stopping criterion shown in Fig. <ref type="figure">7</ref> is defined as</p><p>where &#120574;&#120574; is a threshold that controls the efficiency and accuracy of the proposed AKM. Generally speaking, a smaller &#120574;&#120574; will lead to higher accuracy but lower efficiency.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.5.">Implementation</head><p>As shown in Fig. <ref type="figure">1</ref>, &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) approaches zero when &#119885;&#119885; takes large value. As a result, &#120601;&#120601;(&#119911;&#119911;)</p><p>and &#120601;&#120601;(&#119911;&#119911;) in Eq. ( <ref type="formula">26</ref>) and Eq. ( <ref type="formula">27</ref>) are likely to take very small values close to zero. It leads to the singularity of the calculation of Eq. ( <ref type="formula">26</ref>) and Eq. ( <ref type="formula">27</ref>), doing harm to the robustness of the proposed algorithm. To solve this issue, we translate all training samples of &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) simply by adding a negative constant &#1013;. This way, the translated &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) will never approach zero and the singularity issue is evitable. Trained by the translated samples of &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;), the Kriging model &#120601;&#120601; &#65533; (&#119885;&#119885;; &#119823;&#119823;) will also lead to the translation of &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;). We can translate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) back simply by subtracting &#1013; from them. Note that there is no rigorous theory to quantify how &#1013; affect the properties of the proposed AKM. We suggest determining &#1013; using</p><p>where mean(&#8226;) represents mean value.</p><p>Based on all the procedures given above, we generate the pseudo codes of the proposed AKM given in Algorithm 1.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.6.">Validation discussion</head><p>Theoretically, it is vital to validate the Kriging model to make sure that it has been trained accurately. An explicit validation, however, is not involved in the proposed AKM. There are two main reasons. First, the adaptive training focuses on the accuracy of QoI instead of the accuracy of the Kriging model. Once there is an indication that the accuracy of QoI in current training iteration is sufficient, i.e., the stopping criterion in Eq. ( <ref type="formula">27</ref>) is satisfied, the algorithm stops adding more training samples, no matter the Kriging model is globally accurate or not. As a result, when the algorithm has converged, it is very likely that the Kriging model is accurate only on some subdomains but not accurate on other domains. Therefore, it is not suitable to do explicit cross validation. Second, the error metric &#120548;&#120548; can measure the accuracy of QoI, and therefore we in fact do validation implicitly. As long as the accuracy of QoI is sufficient, it does not matter if the Kriging model is or not accurate on the entire domain.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">RESULTS</head><p>In this section, we illustrate the proposed AKM. MCS is used to solve the same problems with brute force. Results by MCS are treated as standard to verify the proposed AKM. We build the Kriging model and calculate the Kriging predictions using the DACE toolbox <ref type="bibr">[39]</ref>. The anisotropic Gaussian kernel is used.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Sheath Profile</head><p>We consider the 1-D photoelectron sheath problem discussed in Section 2. The sun elevation angle is given as 9 degrees. The maximal and minimal values of &#119823;&#119823; = &#65533;&#119907;&#119907; d , &#119879;&#119879; e , &#119879;&#119879; p , &#119899;&#119899; i,&#8734; , &#119899;&#119899; p &#65533; are given in Table <ref type="table">1</ref>. We use both MCS and the proposed AKM to estimate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;). The values of all involved parameters are given in Table <ref type="table">2</ref>.</p><p>The domain &#937; of &#119823;&#119823; is discretized into &#119873;&#119873; &#119875;&#119875; = 5 5 points, which are assembled into &#119849;&#119849; MCS .</p><p>The &#119873;&#119873; in = 5 samples in hypercube space [0,1] 5 , generated by Hammersley sampling method, are given in Table <ref type="table">3</ref>. Then the 5 samples are mapped into &#937;, as given in Table <ref type="table">4</ref>. Rounding the 5 samples in &#937; to the nearest ones in &#119849;&#119849; MCS , we get the initial samples &#119849;&#119849; in of &#119823;&#119823;, as given in Table <ref type="table">5</ref>. Solving the ODE five times, each with a sample in &#119849;&#119849; in , we get five samples of &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) as shown in Fig. <ref type="figure">8</ref>. Note that all points in &#65533;&#119857;&#119857; inp2 , &#120543;&#120543; out2 &#65533; have &#119885;&#119885; = &#119885;&#119885; max and &#120601;&#120601; = 0. Combining &#65533; &#119857;&#119857; inp1 , &#120543;&#120543; out1 &#65533; and &#65533;&#119857;&#119857; inp2 , &#120543;&#120543; out2 &#65533;, we have 345 training points in &#65533;&#119857;&#119857; inp , &#120543;&#120543; out &#65533;. To do the translation mentioned in Subsection 4.5, we update &#120543;&#120543; out simply by &#120543;&#120543; out = &#120543;&#120543; out + &#1013; , where &#1013; = -6.97 V is obtained with Eq. ( <ref type="formula">29</ref>). With the updated &#65533;&#119857;&#119857; inp , &#120543;&#120543; out &#65533;, we build an initial Kriging model and then estimate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;) through Eq. ( <ref type="formula">16</ref>) and Eq. ( <ref type="formula">17</ref>). Finally, we translate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;)</p><p>back by &#120601;&#120601;(&#119885;&#119885;) = &#120601;&#120601;(&#119885;&#119885;) -&#1013; and &#120601;&#120601;(&#119885;&#119885;) = &#120601;&#120601;(&#119885;&#119885;) -&#1013;. Fig. <ref type="figure">9</ref>  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Dust Levitation</head><p>In this example, we still consider the same 1-D photoelectron sheath problem in Subsection 5.1, but the objective is to estimate &#119864;&#119864;(&#119885;&#119885;) and &#119864;&#119864;(&#119885;&#119885;) and then calculate the dust levitation height. The values of all involved parameters are given in Table <ref type="table">6</ref>.</p><p>The procedures used to estimate &#119864;&#119864;(&#119885;&#119885;) and &#119864;&#119864;(&#119885;&#119885;) are almost the same as that used to estimate &#120601;&#120601;(&#119885;&#119885;) and &#120601;&#120601;(&#119885;&#119885;). The only difference is that the samples of &#119864;&#119864;(&#119885;&#119885;; &#119823;&#119823;) instead of &#120601;&#120601;(&#119885;&#119885;; &#119823;&#119823;) are used. The final estimation of &#119864;&#119864;(&#119885;&#119885;) and &#119864;&#119864;(&#119885;&#119885;) is shown in Fig. <ref type="figure">11</ref>. It shows that the proposed AKM method is very accurate. As for the efficiency, the proposed method needs only &#119873;&#119873; ODE = &#119873;&#119873; in + 18 = 23 ODE solutions. Compared to &#119873;&#119873; &#119875;&#119875; = 3,125 ODE solutions needed in MCS, the proposed method is very efficient.</p><p>When the upper and lower bounds of the electric field have been determined, we can use the them to determine the levitation heights of the dust grains. Assuming there are two types of dust grains, A and B, in the electric field. The relevant parameters of the grains are given in Table <ref type="table">7</ref>, where e = 1.062 &#215; 10 -19 C is the electric charge of an electron. The dust levitation heights are shown in Fig. <ref type="figure">12</ref> and given in Table <ref type="table">8</ref>. Given any dust grain with known &#119908;&#119908; value, we can easily determine its levitation height interval using the method shown in Fig. <ref type="figure">12</ref>. This will help to evaluate the risk or damage caused by the levitated dust grains for lunar exploration missions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">CONCLUSIONS</head><p>We presented an adaptive Kriging based method to perform UQ analysis of the 1-D photoelectron sheath and dust levitation on the lunar surface. A recently derived 1-D photoelectron sheath model was used as the high-fidelity physics-based model and the blackbox function. Adaptive Kriging method, with a task-oriented learning function and stopping criterion, was utilized to improve the efficiency in calculating the upper and lower bounds of electric potential as well as dust levitation height, given the intervals of model parameters.</p><p>Experiment analysis shows that the proposed AKM method is both accurate and efficient.</p><p>Current and ongoing efforts are focused on building adaptive Kriging model for 2-D and 3-D kinetic particle simulations of lunar plasma/dust environment and perform UQ analysis.    </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure Captions List</head></div></body>
		</text>
</TEI>
