<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Emulating Ocean Dynamic Sea Level by Two‐Layer Pattern Scaling</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>03/01/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10263943</idno>
					<idno type="doi">10.1029/2020MS002323</idno>
					<title level='j'>Journal of Advances in Modeling Earth Systems</title>
<idno>1942-2466</idno>
<biblScope unit="volume">13</biblScope>
<biblScope unit="issue">3</biblScope>					

					<author>Jiacan Yuan</author><author>Robert E. Kopp</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Sea-level rise impacts coastal communities and ecosystems through permanent inundation, increasingly common tidal flooding, and increasingly frequent and severe storm-driven flooding. Global-mean sea level (GMSL) is rising at an accelerating rate, and under most scenarios is projected to continue accelerating over the 21st century (Oppenheimer et al., 2019). Regional relative sea level (RSL) change differs from global-mean sea-level change due to a variety of processes operating on diverse timescales, including the gravitational, rotational, and deformational effects associated with mass redistribution and ocean dynamic effects associated with changes in surface winds, ocean currents, and heat and freshwater fluxes (Gregory]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Atmosphere-ocean general circulation models (GCMs) are the primary tool used to project ocean dynamic sea level (DSL) change, but the computational demands of these models limit the utility of GCM ensembles for estimating the likelihood of different levels of future sea-level change. Ensembles such as the Coupled Model Intercomparison Project Phase 5 (CMIP5, <ref type="bibr">Landerer et al., 2014;</ref><ref type="bibr">Taylor et al., 2012)</ref> are composed of models contributed based on voluntary effort, not the product of systematic experimental design; as such, they are an "ensemble of opportunity" rather than a probabilistic ensemble <ref type="bibr">(Tebaldi &amp; Knutti, 2007)</ref>. The CMIP future projection experiments are driven by a small number of forcing scenarios-Representative Concentration Pathways (RCPs) in the case of CMIP5-and model simulations are of different lengths; some simulations run the RCPs to the year 2100, while others extend these to 2300.</p><p>The computationally intensive nature of GCMs makes it challenging to produce large perturbed-physics ensembles that represent uncertainties in key feedback parameters, as well as to simulate forcing conditions intermediate between the RCPs. Simple climate models (SCMs) provide an alternative tool for estimating the uncertainties of future projections at the global scale, as they can capture the overall physics of climate evolution and can be run very fast even on a personal computer <ref type="bibr">(Held et al., 2010;</ref><ref type="bibr">Meinshausen et al., 2011;</ref><ref type="bibr">Millar et al., 2017;</ref><ref type="bibr">Perrette et al., 2013)</ref>. However, SCMs represent the climate at a highly aggregated (e.g., global or hemispheric) scale, and thus cannot produce spatial patterns of climate change at each time step.</p><p>Pattern scaling approaches are often used to translate the global mean surface air temperature (GSAT) change into regional-scale changes for impact analysis <ref type="bibr">(Mitchell, 2003;</ref><ref type="bibr">Rasmussen et al., 2016;</ref><ref type="bibr">Santer, 1990;</ref><ref type="bibr">Tebaldi &amp; Arblaster, 2014;</ref><ref type="bibr">Tebaldi et al., 2011)</ref>. Generally speaking, pattern scaling uses a simple statistical model (often, linear regression) to relate local climatic changes to a variable such as GSAT change, assuming the patterns of local response to external forcing remain constant under increased forcing <ref type="bibr">(Tebaldi &amp; Arblaster, 2014)</ref>. Some previous studies use the pattern scaling approach to estimate the uncertainty in DSL projections <ref type="bibr">(Bilbao et al., 2015;</ref><ref type="bibr">M. D. Palmer et al., 2020;</ref><ref type="bibr">Perrette et al., 2013)</ref>. For example, <ref type="bibr">Perrette et al. (2013)</ref> regressed DSL change on GSAT. At New York City, they found that r 2 values across models vary between 0.02 and 0.85, and also that the linear relationship between DSL and GSAT becomes weaker after the 21st century. <ref type="bibr">Bilbao et al. (2015)</ref> examined the relationship between DSL and several variables, including GSAT, global-mean sea-surface temperature, ocean volume mean temperature, and global-mean thermosteric sea-level rise (GMTSLR). They found that GSAT performed best in predicting 21st-century DSL change in a high emissions scenario (RCP 8.5), while ocean-volume mean temperature and GMTSLR performed better in lower emissions scenarios <ref type="bibr">(RCP 2.6 and 4.5)</ref>. They speculated that this difference reflects a more important role for surface warming relative to deep warming in a more strongly forced scenario. They found that, across models and scenarios, area-weighted average root mean square error in pattern-scaled 2081-2100 DSL change ranged from &#8764;1 to 3 cm. Building upon <ref type="bibr">Bilbao et al. (2015)</ref>'s speculation about the relative importance of shallow and deep warming under different scenarios, we developed a bivariate pattern scaling, which uses a multiple linear regression with two predictors: GSAT and global-mean deep ocean temperature change. The two temperature changes can be generated by a two-layer energy-balance model (2LM) <ref type="bibr">(Held et al., 2010;</ref><ref type="bibr">Winton et al., 2010)</ref>, which has proved to be a useful tool for understanding the responses of climate system to climate forcing <ref type="bibr">(Geoffroy et al, 2013a</ref><ref type="bibr">(Geoffroy et al, , 2013b))</ref>. Shallow and deep temperatures from a 2LM have previously been employed in an emulator to extend 21st century CMIP5 projections of GMTSLR to 2300 (M. D. <ref type="bibr">Palmer et al., 2018)</ref>, and M. D. <ref type="bibr">Palmer et al. (2020)</ref> used GSAT from the two-layer model and univariate pattern scaling (based on GSAT) to emulate CMIP5 projections of DSL change.</p><p>In this study, we develop an emulator for DSL changes using both GSAT and deep-ocean temperature change projected by a 2LM. Here we drive the 2LM with radiative forcings from the Finite Amplitude Impulse Response model (FaIR), a SCM which includes a reduced-complexity carbon cycle and calculates atmospheric CO 2 concentrations, radiative forcing and temperature changes based on emissions <ref type="bibr">(Millar et al., 2017;</ref><ref type="bibr">Smith et al., 2017</ref><ref type="bibr">Smith et al., , 2018))</ref>. FaIR was designed to more accurately reflect the temporal evolution of GSAT in response to a pulse emission, and it has been used in previous studies to produce observation-constrained future projections <ref type="bibr">(Millar et al., 2017;</ref><ref type="bibr">Smith et al., 2017</ref><ref type="bibr">Smith et al., , 2018))</ref>. In this study, we develop an emulator for GMTSLR and DSL change using surface and deep-ocean temperature changes generated by the FaIR-2LM (Section 2.2). As the univariate pattern scaling fails to capture the delayed response of the deep ocean to warming, we employ FaIR-2LM and two-layer pattern scaling to project future DSL changes, taking into account uncertainty in climate sensitivity, and demonstrate their ability to interpolate between climate scenarios run by GCMs. Compared to M. D. <ref type="bibr">Palmer et al. (2018</ref><ref type="bibr">Palmer et al. ( , 2020))</ref>, which also use a 2LM to emulate GMTSLR or DSL projections, our approach differs in: (1) employing radiative forcings calculated based on emissions; (2) applying a format of 2LM considering efficacy factor of deep ocean heat uptake; (3) using both surface and deep-ocean temperature for pattern scaling (more details are described in supporting information).</p><p>Section 2 describes data and methodology, including the details of FaIR-2LM, the calibration of the FaIR-2LM based on selected CMIP5 GCMs, the two-layer pattern scaling methodology, and the application of this system to emulate DSL projections. Section 3 evaluates the performance of the two-layer pattern scaling. Section 4 shows the resulting ensemble of DSL projections. Finally, Section 5 discusses and summarizes the results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data and Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Data</head><p>We use the zos variable from five CMIP5 general circulation models (GCMs) in RCP 2.6, 4.5, and 8.5 scenarios: MPI-ESM-LR, bcc-csm1-1, HadGEM2-ES, GISS-E2-R, IPSL-CM5A-LR. These five GCMs are used because they were used to calibrate the parameters of the 2LM by <ref type="bibr">Geoffroy et al. (2013)</ref> and provide multi-century data (to 2300) for zos in all three scenarios. DSL is taken as zos with its global mean removed, consistent with the definition of <ref type="bibr">Gregory et al. (2019)</ref>. The drift is removed from DSL by subtracting a linear function of time fitted to the pre-industrial control simulation from each scenario experiments, at each grid point. In addition, we remove the climatology in a baseline period <ref type="bibr">(1986)</ref><ref type="bibr">(1987)</ref><ref type="bibr">(1988)</ref><ref type="bibr">(1989)</ref><ref type="bibr">(1990)</ref><ref type="bibr">(1991)</ref><ref type="bibr">(1992)</ref><ref type="bibr">(1993)</ref><ref type="bibr">(1994)</ref><ref type="bibr">(1995)</ref><ref type="bibr">(1996)</ref><ref type="bibr">(1997)</ref><ref type="bibr">(1998)</ref><ref type="bibr">(1999)</ref><ref type="bibr">(2000)</ref><ref type="bibr">(2001)</ref><ref type="bibr">(2002)</ref><ref type="bibr">(2003)</ref><ref type="bibr">(2004)</ref><ref type="bibr">(2005)</ref> from DSL. The global mean surface air temperature (GSAT) and GMTSLR from the five models in the three scenarios are also used to evaluate the performance of FaIR-2LM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">FaIR-Two Layer Model (FaIR-2LM) and Calibration</head><p>This study develops a hybrid SCM model by replacing the temperature module in FaIR 1.3 <ref type="bibr">(Smith et al., 2018)</ref> with a 2LM. In FaIR 1.3, GSAT changes are the sum of two components, representing fast and slow responses to effective radiative forcing (ERF) (Equation <ref type="formula">22</ref>in <ref type="bibr">Smith et al., 2018)</ref>. The fast and slow components of temperature changes in FaIR 1.3 mathematically depend on multiple coefficients (e.g., thermal response timescales) that are obtained from the ensemble mean of multiple CMIP5 models <ref type="bibr">(Geoffroy, et al., 2013b)</ref>. Since these components do not have an unambiguous physical meaning, it is challenging to link them to sea-level change. Therefore, we replace the temperature module in FaIR 1.3 by the 2LM to construct FaIR-2LM. In each step of FaIR-2LM, the 2LM is driven by radiative forcing from FaIR 1.3, and produces the GSAT anomaly, which feeds back to the FaIR carbon cycle (Figure <ref type="figure">S1</ref>).</p><p>We employ a 2LM that includes an efficacy term for deep ocean heat uptake <ref type="bibr">(Geoffroy, et al., 2013a;</ref><ref type="bibr">Held et al., 2010;</ref><ref type="bibr">Winton et al., 2010)</ref>:</p><p>(1)</p><p>where denotes the adjusted radiative forcing, C and C 0 are the heat capacity of the well-mixed upper layer and the deep ocean layer, respectively, and T and T 0 represent the global mean temperature anomalies of the upper and lower layer, respectively. Following Equation 22 in <ref type="bibr">Geoffroy, et al. (2013b)</ref> and using C = 8.2 W yr m -2 K -1 and 0 C 109 W yr m -2 K -1 based on an average across multiple CMIP5 GCMs <ref type="bibr">(Geoffroy, et al., 2013a)</ref>, we estimate the average depths of the upper layer and lower layer are 86 m and 1141 m, respectively. T is equivalent to GSAT perturbation <ref type="bibr">(Held et al., 2010)</ref>. is the parameter for cli-mate feedback, is the coefficient of deep ocean heat uptake, and is the efficacy factor of deep ocean heat uptake, which represents the uneven spatial distribution of heat exchanges between the two layers.</p><p>To calibrate FaIR-2LM, we adjust parameter settings (listed in Table <ref type="table">1</ref>) based on previous studies <ref type="bibr">(Forster et al., 2013;</ref><ref type="bibr">Geoffroy, Saint-Martin, et al., 2013a;</ref><ref type="bibr">Zelinka et al., 2014)</ref>. The radiative forcing in FaIR-2LM is driven by the default emission trajectory for each scenario in FaIR 1.3, but scaled by two parameters determined for each GCM: (1) the radiative forcing of CO 2 doubling ( 2 2</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CO</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>F</head><p>) reported by <ref type="bibr">Forster et al. (2013)</ref>, and (2) the present-day aerosol forcing (af pd ) estimated in previous studies <ref type="bibr">(Forster et al., 2013;</ref><ref type="bibr">Zelinka et al., 2014)</ref>, or -0.9 W m -2 -the median of range estimated by the Fifth Assessment Report of Intergovernmental Panel on Climate Change (IPCC AR5) <ref type="bibr">(Stocker et al., 2013)</ref>-for models not reported in previous studies. The five parameters in Equations 1 and 2 (i.e., , , , C, C 0 ) are the same as those in <ref type="bibr">Geoffroy et al. (2013)</ref> for the corresponding GCMs.</p><p>GSAT produced by the calibrated FaIR-2LM is compared with that from the corresponding GCMs in the three scenarios (Fig. <ref type="figure">S2</ref>). For the five GCMs, the GSAT simulated by FaIR-2LM is close to the GSAT from the corresponding GCM, with the root mean square error (RMSE) determined over the entire simulation period in a range of 0.15-0.23 K for RCP2.6, 0.14-0.32 K for RCP4.5, and 0.20-0.43 K for RCP8.5.</p><p>GMTSLR is driven by the thermal expansion of sea water volume due to the increase in ocean heat uptake. To calibrate GMTSLR in FaIR-2LM to match a specific GCM, we first correct the drift in the GCM's GMT-SLR field by removing the linear trend in the pre-industrial control simulation, assuming the drift is not sensitive to the external forcing <ref type="bibr">(Hobbs et al., 2016)</ref>.</p><p>Then, we emulate GMTSLR based on the T and T 0 from FaIR-2LM following the approach described in <ref type="bibr">Kuhlbrodt and Gregory (2012)</ref>:</p><p>where is the expansion efficiency of heat in units of 10 -24 m J -1 . The value is calibrated by optimizing GMTSLR emulated from FaIR-2LM to match the GMTSLR simulated from the corresponding GCM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Two-Layer Pattern Scaling</head><p>Univariate pattern scaling is based on a linear relation between regional changes in a climate variable (DSL for this study) and global mean responses of climate change, such as GSAT (T ):</p><p>, , , , , , DSL t x y x y T t b x y t x y (4)</p><p>where x and y denote longitudes and latitudes, t represents the time, b is an intercept term, and is the residual term. Here, &#945; captures the scaling relationship between DSL and GSAT (Figure <ref type="figure">1</ref>). The five GCMs agree that the linear response of DSL to surface warming is positive over the Arctic and sub-polar Atlantic, and negative over the southeastern Pacific and the southern areas of Southern Ocean.</p><p>In the bivariate pattern scaling approach, we regress the DSL anomaly on both T (GSAT anomaly) and 0 T (deep-ocean temperature anomaly) from FaIR-2LM:</p><p>where i t denotes years in three scenarios, i = 1, 2, 3. For each GCM, we estimate the fields of &#945;, &#946;, b, and &#949; by regressing projections from all three emissions scenarios (RCPs 2.6, 4.5, and 8.5) on T and 0 T on a grid cell- GCMs.</p><p>Wm K , ,</p><p>Wyrm K C , and <ref type="bibr">Geoffroy et al. (2013)</ref>. The units for F 2 &#215; co 2 and afpd are W m -2 same period (Figure <ref type="figure">1</ref>). Consistent with the univariate scaling pattern, the five GCMs agree that the upper-layer response, represented by , is positively correlated with warming over the most areas of Arctic and northern edge of the Southern Ocean, and negatively correlated with warming over the southeastern Pacific YUAN AND KOPP 10.1029/2020MS002323 5 of 17 and the southern areas of Southern Ocean. The deep-layer response represented by is positively correlated with warming over the Indian and tropical and southern Pacific Oceans, and negatively correlated with warming over most areas of the Southern Ocean and Arctic. These reflect opposite behaviors between rapid and sustained changes in DSL over the Arctic, the Indian and tropical and southern Pacific Oceans, and a consistent DSL fall in both rapid and sustained changes over the Southern Ocean.</p><p>There is little agreement on either surface-or deep-layer slopes across the five GCMs over most parts of the Atlantic basin (Figure <ref type="figure">1</ref>). This may reflect limited skill in simulating strong western boundary currents (e.g., the Atlantic Meridional Overturning Circulation (AMOC)) in the GCMs, which have a relatively coarse (&#8764;1&#176;) spatial resolution in ocean component <ref type="bibr">(Small et al., 2014)</ref> and so poorly capture non-linear mesoscale processes in the ocean current <ref type="bibr">(Hallberg, 2013)</ref>. Near the eastern coast of North America, DSL is closely related to AMOC <ref type="bibr">(Goddard et al., 2015)</ref>, which is expected to weaken in a warming climate <ref type="bibr">(Caesar et al., 2018)</ref>. Low skill in capturing AMOC behavior can affect the DSL projections in the Atlantic basin as well as its coasts <ref type="bibr">(van Westen et al., 2020)</ref>. As the coefficients of pattern scaling depend on the simulations by the GCMs, they also do not explicitly resolve the non-linear mesoscale process of the ocean current. Therefore, we should interpret the DSL changes predicted by the two-layer emulator with cautions over the regions where non-linear mesoscale effects of ocean current are strong.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Projecting DSL Using FaIR-2LM and Patterns</head><p>We use two steps to generate a probabilistic ensemble of DSL projections. First, we generate an ensemble of surface and deep-ocean temperature pairs using FaIR-2LM. The planetary energy balance at the top of the atmosphere <ref type="bibr">(Zelinka et al., 2020)</ref> is:</p><p>where N is the radiative imbalance at the top of the atmosphere. The equilibrium climate sensitivity (ECS) is given by T when N = 0, and =</p><p>2 2 CO F . Therefore, is related to 2 2 CO F and ECS by 2 2 / CO F ECS (7) The uncertainty of 2 2 CO F is small relative to the spread of , while ECS largely determine the uncertainty of . Therefore, we adopt the best estimation in the Intergovernmental Panel on Climate Change Fifth Assessment Report (AR5) for 2 2 CO F = 3.71 2 W m <ref type="bibr">(Collins et al., 2013)</ref>. We produce initial distributions of ECS, , and based on the literature constraints (Fig. <ref type="figure">S4</ref>) outlined below: ECS: Based on multiple lines of evidence, the uncertainties of ECS estimated by AR5 are likely in the range 1.5&#176;C-4.5&#176;C with high confidence, extremely unlikely less than 1&#176;C and very unlikely greater than 6&#176;C <ref type="bibr">(Collins et al., 2013)</ref>. In the AR5 terminology, likely denotes a probability of at least 66%, very unlikely a probability of less than 10%, and extremely unlikely a probability of less than 5% <ref type="bibr">(Mastrandrea et al., 2010)</ref>. Therefore, we construct a log-normal distribution for ECS with parameters optimized to match a 5th percentile of 1&#176;C, a 17th percentile of 1.5&#176;C, an 83rd percentile of 4.5&#176;C, and a 90th percentile of 6&#176;C.</p><p>: We treat as normally distribution, with mean 0.67</p><p>W m K and standard deviation 0.15</p><p>W m K derived from the 16 GCMs in the CMIP5 archive <ref type="bibr">(Geoffroy et al., 2013)</ref>.</p><p>: As the efficacy factor of heat uptake is related to deep-ocean heat uptake <ref type="bibr">(Held et al., 2010)</ref>, we use instead of to maintain the covariance between and . We calculate the mean of 0.86</p><p>W m K and standard deviation 0.29</p><p>W m K of based on the products of and from 16 GCMs in CMIP5 archive <ref type="bibr">(Geoffroy, et al., 2013a)</ref>, The distribution of is constructed as a normal distribution with the multi-model mean and the multi-model standard deviation.</p><p>TCR: Under a zero-layer approximation which considers the 1%/yr increase in CO 2 until doubling scenario occurring on a timescale long enough that the upper ocean is in approximate equilibrium and short enough that the deep-ocean temperature has not yet responded substantially, Transient Climate Response (TCR) can be obtained by (Jim&#233;nez-de-la-Cuesta &amp; Mauritsen, 2019): W m K based on the multi-model mean of GCMs from Coupled Model Intercomparison Project Phase 5 (CMIP5) archive <ref type="bibr">(Geoffroy, et al., 2013a)</ref>.</p><p>We then generate a 100,000-member ensemble of ECS, &#947; and &#947;&#949; based on these distributions via Monte Carlo sampling. As should be larger than 0, we discard parameter sets in which &lt; 0 or &gt; 2 0.86 to keep the mean of in parameter sets to be 0.86</p><p>W m K . Therefore, 99734 parameter sets are kept. An ensemble of is then computed by the best estimation of 2 2 XCO</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>F</head><p>and the ensemble of ECS based on Equation 6 (Fig. <ref type="figure">S4</ref>). The median (central 66% range) of is -1.39 (-2.4 to -0.8)</p><p>W m K . As the likely range of ECS estimated by AR5 is equivalent to the central 90% range of ECS estimated by CMIP5 GCMs, the uncertainty range of estimated by FaIR-2LM is larger than that estimated by ensemble of GCMs <ref type="bibr">(Geoffroy et al., 2013)</ref>. The spread of TCR is estimated as a diagnostic by substituting the ensemble of , , and best estimation of 2 2 XCO</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>F</head><p>into Equation <ref type="formula">8</ref>. The uncertainty of TCR is in a central 66% range of 1.1-2.3 C, with a 95th percentile of 2.9 C. This is consistent with but slightly narrower than the TCR estimated by AR5, which is likely between 1 C and 2.5 C, and is extremely unlikely greater than 3&#8451;.</p><p>We apply Latin hypercube sampling (LHS, <ref type="bibr">Stein, 1987)</ref> to the parameter sets of , , by sampling 1,000 sets from the 99734 parameter sets. For each parameter, LHS divides the probability density function of the 99,734 samples into 1,000 portions that have equal area. A sample is taken from each portion randomly so that the 1,000 sample sets cover the multidimensional distribution of the three parameters. Finally, we applied 1,000 parameter sets together with the fixed parameters ( 2 2</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>XCO</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>F</head><p>, C, C 0 ) to the FAIR-2LM and generate a 1,000-member probabilistic ensemble of temperature pair time-series.</p><p>We compare the spread in GSAT projected by FaIR-2LM with the likely ranges estimated by AR5 for four different periods <ref type="bibr">(Collins et al., 2013)</ref> (Table <ref type="table">3</ref> and Figure <ref type="figure">S5</ref>). The mean of the probabilistic ensemble is slightly lower than the mean estimate of GSAT from AR5 in all four periods of RCP2.6 and RCP4.5, and in the 21st century for RCP8.5. Compared with AR5 likely ranges, the central 66% probability range of GSAT from FaIR-2LM is generally consistent: narrower in all four periods of RCP2.6, narrower in the first two periods but wider in the last two periods in RCP4.5, and wider in the first two periods but narrower in the last two periods in RCP8.5.</p><p>We project GMTSLR based on Equation 3 using the probabilistic ensemble of surface and deep-ocean temperature projections from FaIR-2LM. The C, C 0 and expansion efficiency of heat ( 24m 0.113 10</p><p>) used here are adopted from the multi-model ensemble mean of CMIP5 archive <ref type="bibr">(Geoffroy, et al., 2013a;</ref><ref type="bibr">Kuhlbrodt &amp; Gregory, 2012)</ref>.</p><p>A projection of DSL is constructed as follows: 1) a pair of and is randomly picked with replacement from the pool of two-layer patterns produced in Section 2.3; 2) a temperature pair from the 1000 members is combined with the pair of and in an equation:</p><p>0 , , , , , DSL t x y x y T t x y T t b x y (9)</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Evaluation of Two-Layer Pattern Scaling</head><p>To evaluate the prediction skill of the two-layer pattern scaling, we compare the DSL changes simulated from a GCM with the DSL changes emulated by the two-layer pattern scaling ( DSL) based on FAIR-2LM using the key parameters (i.e. parameters in Table <ref type="table">1</ref>) in the same GCM. Two metrics are used: (1) absolute values of the residual differences between DSL changes and DSL changes during a period at each grid point, and ( <ref type="formula">2</ref>) global average of the absolute values obtained from the metric 1 (Table <ref type="table">S1</ref>). These two metrics are applied to both bivariate pattern scaling and univariate pattern scaling, to examine the improvement of bivariate approach comparing with the univariate approach.</p><p>In 2271-2290, for instance, the global-averaged climatology of DSL DSL (score obtained by the second metric) from the two-layer pattern scaling is less than that from the univariate pattern scaling (bottom row</p><p>Figure <ref type="figure">2</ref>), with a reduction of 36%, 24%, and 34% in RCP2.6, RCP4.5, and RCP8.5, respectively. The spatial pattern of R = DSL DSL is derived from both approaches are various across GCMs (Figures <ref type="figure">S6-S10</ref>). The 5-model ensemble averaged climatology of R in both approaches is higher over high latitudes (e.g., Arctic, subpolar Northern Atlantic, Southern Ocean) than over middle to low latitudes, but is generally lower in two-layer pattern scaling than in univariate pattern scaling (first two rows Figure <ref type="figure">2</ref>). As the pattern-scaling method cannot resolve DSL change due to unforced variability, the relatively large |R| over the high latitudes may be due to the relatively high unforced variability over these regions <ref type="bibr">(Bilbao et al., 2015)</ref>.</p><p>We further compare the time evolution of DSL predicted by the two-layer pattern scaling approaches with the evolution of DSL in corresponding GCMs through the period 1981-2290. As case studies, we pick two grid cells: one in the western Pacific near the Philippines (14.5&#176;N, 127&#176;E), and the other over the North Atlantic near the coast of New York City [NYC] (40&#176;N, 73&#176;W) (solid black dots in Figure <ref type="figure">2</ref>). The grid point near the Philippines is selected because it is in the tropical Pacific, where DSL rise associated with the deep ocean temperature rise is strongest, while the grid point near the NYC is selected represent a coastal area that some projections find experiences significant DSL changes in response to changes in AMOC.</p><p>At the western Pacific grid cell, in RCP 2.6, the relationship between DSL and GSAT anomaly displays a hook-like shape, indicating continued rise in DSL as GSAT stabilizes and declines in response to negative emissions (Figure <ref type="figure">3a</ref>). The delayed adjustment of DSL may be due to the continuous warming of deep layer (T 0 ) when GSAT is stabilized, because the ocean is not yet equilibrated with the elevated forcing. In response to changes in T 0 , the deep ocean density is still changing even without changing circulation in the deep ocean <ref type="bibr">(England, 1995)</ref>, so DSL continues to change. This hook-like shape is captured by the two-layer pattern scaling approach but not by the univariate pattern scaling. Compare to the DSL simulated by a GCM, the RMSE of the predicted DSL is smaller if using the two-layer pattern scaling approach than using the univariate pattern scaling approach. The average RMSE across the five GCMs is reduced 26% if we use the approach from the two-layer pattern scaling instead of the univariate pattern scaling (Table <ref type="table">3</ref>). Across the five GCMs, although the relationship between DSL and GSAT is diverse in RCP4.</p><p>5 and RCP8.5, DSL YUAN AND KOPP 10.1029/2020MS002323 9 of 17 projected by the two-layer technique is consistently closer than that predicted by the univariate technique to the DSL simulated by the GCMs. The sole exception is for bcc-csm1-1 in RCP8.5, for which the simulated DSL projection is quite linearly associated with GSAT. The average of RMSEs across the 5 GCMs decrease from the univariate pattern scaling approach to the two-layer pattern scaling approach by 35% in RCP4.5 and by 33% in RCP8.5 (Table <ref type="table">3</ref>).</p><p>At the North Atlantic grid cell, the relationship between DSL and GSAT also displays non-linear features for all the five models, especially in low-and moderate-emission scenarios (Figure <ref type="figure">3b</ref>). These non-linear features, which may arise from the delayed response of deep branch of AMOC, cannot be captured by univariate pattern scaling but are captured to a large extent by the two-layer pattern scaling (lines in Figure <ref type="figure">3</ref>). The value of the two-layer approach is highlighted by the clear non-linearity of the DSL response when viewed as a function of GSAT anomaly. Compare to the univariate approach, the RMSE between the DSL simulated by GCMs and the DSL predicted by the two-layer pattern scaling is smaller, with a reduction of 19%, 16%, and 13% for RCP2.6, RCP4.5, and RCP8.5, respectively. The method of two-layer pattern scaling generally has a better performance in emulating the DSL from the corresponding GCM than the univariate pattern scaling, as the two-layer pattern scaling includes one more predictor than univariate pattern scaling, allowing it to capture the delayed adjustment of DSL. The delayed adjustment of DSL due to the deep-ocean warming is important for the DSL projections, as different areas present different features that may reveal the regional variation in deep-ocean circulation <ref type="bibr">(Held et al., 2010)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Projections of DSL</head><p>The procedure described in Section 2.4 allows us to produce a 1000-member probabilistic ensemble of DSL projections not only for the three CMIP5 scenarios: RCP2.6, RCP4.5, and RCP8.5, but also for any other scenarios with an emission pathway between these three scenarios. We demonstrate this capability using SSP3-7.0, a CMIP6 scenario that has forcing intermediate between RCP4.5 and RCP8.5 <ref type="bibr">(O'Neill et al., 2016)</ref> and is closer than either to no-policy reference scenarios from most integrated assessment models <ref type="bibr">(Riahi et al., 2017)</ref>. The emission pathway of SSP3-7.0 used to drive the FaIR-2LM is taken from the Reduced Complexity Model Intercomparison Project <ref type="bibr">(Nicholls et al., 2020)</ref>.</p><p>The five projections using parameters calibrated to the five GCMs respectively are within the 66% range of the 1000-member ensemble for both surface and deep-ocean temperature in the three RCPs (Figure <ref type="figure">4</ref>). By 2300, the median estimates (66% range) of the surface temperature relative the period of 1986-2005 are 0.5 C (0.2-1. 0 C) for RCP2.6, 2. 2 C (1.2-3.6 C) for RCP4.5, 7. 4 C (4.5-11. 7 C) for RCP8.5, and 5. 3 C (3.2-8.6 C) for SSP3-7.0.</p><p>Based on the projections of temperature pairs, we also produced projections of GMTSLR for the four scenarios (Figure <ref type="figure">4</ref>). The spread of GMTSLR ensemble encapsulates the GMTSLR time series from the 5 GCMs (Figure <ref type="figure">4</ref>). During the period of 2081-2100, the median estimates (66% range) of the GMTSLR relative the period of 1986-2005 are 0.12 (0.07-0.18) m for RCP2.6, 0.16 (0.10-0.24) m for RCP4.5, 0.19 (0.12-0.27) m for SSP3-7.0, and 0.24 (0.15-0.34) m for RCP8.5. This compares to Oppenheimer et al. ( <ref type="formula">2019</ref>)'s projected median estimates (66% ranges) of 0.14 (0.10-0.18) m for RCP2.6, 0.19 (0.14-0.23) m for RCP4.5, and 0.27 (0.21-0.33) m for RCP8.5. By 2300, the median estimates (66% range) of GMTSLR relative to the period of 1986-2005 are 0.20 (0.12-0.33) m for RCP2.6, 0.43 (0.25-0.68) m for RCP4.5, 0.85 (0.50-1.33) m for SSP3-7.0, and 1.15 (0.69-1.76) m for RCP8.5. As climate warms, the projections of DSL changes increase along with the increase in GMTSLR. In the five GCMs, although the contributions of DSL changes to the local sterodynamic sea level (DSL + GMTSLR; <ref type="bibr">Gregory et al., 2019)</ref> changes are small at some locations (i.e., regions marked by the light and dark gray shadings in Fig. <ref type="figure">S11</ref>), in others the ratio of DSL change with respect to the GMTSLR changes are fairly significant. For instance, DSL changes at some regions (e.g., Arctic, North Atlantic, and Southern Ocean) are greater than 50% of the GMTSLR during the period of 2271-2290 (identified by yellow contours in Fig. <ref type="figure">S11</ref>).</p><p>Compared with the GSAT and GMTSLR spread in 2300 estimated by M. D. <ref type="bibr">Palmer et al. (2018)</ref>, the FaIR-2LM projections have a slightly lower median for all the three RCPs. The 66% range of both surface temperature and GMTSLR estimated by FaIR-2LM is comparable to the 90% range of that estimated by M. Comparing the DSL projections between the period of 2081-2100 and the period of 2271-2290 (Figure <ref type="figure">5</ref>), the median estimate is lower and the 66% range of uncertainty is narrower at the end of 21st century than that at the end of 23rd century in moderate-to high-emission scenarios (RCP4.5, SSP3-7.0 and YUAN AND KOPP RCP8.5). But in RCP2.6, the median estimate and 66% uncertainty range are comparable in magnitude between these two periods. In both periods, the median DSL anomaly projections across the four scenarios share many similar features (Figure <ref type="figure">5</ref>). Over the Arctic region, a weak increase in DSL is observed over the Chukchi Sea and the Beaufort Sea in RCP2.6. In the higher emission scenarios, the increase in DSL extends to the whole Arctic basin with intensified amplitudes. The changes in DSL over the North Atlantic are dominated by a negative anomaly under RCP2.6, and display positive anomalies over much of the North Atlantic under RCP8.5 and SSP3-7.0. The ensemble spread of the 5th-95th range of DSL projections are relatively large over the Southern Ocean, Arctic and Subpolar Atlantic than other areas.</p><p>The large uncertainties over these areas, consistent with previous literatures (M. D. <ref type="bibr">Palmer et al., 2020;</ref><ref type="bibr">Perrette et al., 2013;</ref><ref type="bibr">Yin, 2012)</ref>, may be interpreted by the diverse characteristics simulated by GCMs that do not explicitly resolve non-linear mesoscale processes of the ocean current over these areas <ref type="bibr">(van Westen et al., 2020)</ref>.</p><p>At the illustrative grid point near Philippines over western Pacific (Figure <ref type="figure">6a</ref>), the 66% range of the probabilistic ensemble encapsulates DSL projections from 2 of the 5 GCMs in the three RCPs, while the 90% range of the probabilistic ensemble contains DSL projections from all the 5 GCMs in the three RCPs, except for HadGEM2-ES in RCP2.6. At the grid point near NYC, the projected DSL changes estimated by the probabilistic ensemble exhibits a fat tail, with a median projection in RCP 8.5 of 0.13 m and a 95th percentile projection of 0.8 m by the end of 23rd century. By contrast, RCP 2.6 exhibits a much narrower range, with a median of 0 m and a 95th percentile of 0.08 m. The 66% range of the projected DSL uncertainties encapsulates 2 of 5 GCM projections. The 90% range of the probabilistic ensemble only encapsulates the DSL projections from three over five GCMs in RCP2.6 and RCP4.5, but encapsulates the DSL projections from all five GCMs in RCP8.5. The emulator fails to capture multidecadal variability in DSL, a limitation which would be expected because the emulator is constructed based on the pattern scaling approach.</p><p>YUAN AND KOPP 10.1029/2020MS002323 To compare with the DSL projections derived from two-layer pattern scaling, we also produced the DSL projections based on univariate pattern scaling following the same procedure (Figures <ref type="figure">S12</ref> and <ref type="figure">S14</ref>). The median and 17th-83rd range of DSL projections derived from univariate pattern scaling are similar in patterns (Figure <ref type="figure">S12</ref>). However, differences of both median and spreads of the DSL projections between univariate and two-layer pattern scaling vary across regions, especially over high latitudes in high-emission scenarios (Figure <ref type="figure">S13</ref>). Specifically, compared to the univariate emulator, the median of DSL projected by the two-layer emulator is lower over the Pacific and Indian Ocean, and higher over Arctic, Atlantic and Southern Ocean in the period of 2081-2100, but the opposite in the period of 2271-2290. In addition, the 17th-83rd range of the DSL projected by the two-layer emulator is wider than that by the univariate emulator over the Arctic. The area-weighted increases in DSL spread over the Arctic are 0.02 m in RCP2.6, 0.03 m in RCP4.5, and 0.04 m in RCP8.5 during 2081-2100, and are 0.006 m in RCP2.6, 0.008 m in RCP4.5, 0.009 m in RCP8.5 during 2271-2290. For the grid cell near Philippines, despite the shift in median, the distributions of DSL projections derived from univariate pattern scaling exhibit a different shape from that derived from two-layer pattern scaling (Figure <ref type="figure">6</ref>). In 2290, the 90% range of DSL projection from the univariate emulator is close to that from the two-layer emulator, slightly narrower by 0.03 m for RCP2.6, 0.02 m for RCP4.5, and 0.01 m for SSP3-7.0 and RCP8.5. The two-layer pattern scaling leads to not only a shift of the distribution but also a different shape of the distribution of the DSL projections, compared to that derived from univariate pattern scaling. There are more DSL projections simulated by GCMs encapsulated within the 90% range of the probabilistic ensemble of DSL projections derived from two-layer pattern scaling than that by the univariate pattern scaling (Figure <ref type="figure">6</ref>). For the grid cell near the NYC, the shape of DSL distributions derived from univariate pattern scaling is similar to that derived from two-layer pattern scaling, except the spread of the DSL projections derived from univariate pattern scaling is slightly narrower than that derived from the two-layer pattern scaling, with the 90% range of DSL projection in 2290 narrower by 0.03 m for RCP2.6, 0.01 m for RCP4.5, and 0.03 m for SSP3-7.0 and RCP8.5 (Figure <ref type="figure">6</ref>). The greatest difference is in RCP 2.6, where the difference in the 90% range is by far the largest compared to the overall range.</p><p>YUAN AND KOPP 10.1029/2020MS002323</p><p>13 of 17 </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion and Conclusions</head><p>We have developed a probabilistic ensemble of DSL projections through 2300 using a novel two-layer emulator. Replacing the climate module in the FaIR simple climate model with a two-layer energy-balance model, we developed FaIR-2LM, which produces projections of global average temperature in the wellmixed upper layer (T) for rapid responses to radiative forcing, and in the deep ocean layer (T 0 ) for delayed responses. Calibrated by the parameters for each GCMs, the GSAT (Figure <ref type="figure">S2</ref>) and GMTSLR (Figure <ref type="figure">S3</ref>) emulated by FaIR-2LM generally follow that from the corresponding GCM, with RMSE &lt;0.43 K for GSAT and &lt;0.05 m for GMTSLR. A two-layer pattern scaling based on surface and deep-ocean temperature is used to project DSL. During the period 2271-2290, for instance, the DSL predicted by the two-layer pattern scaling are closer to the DSL simulated by the corresponding GCM than that predicted by the univariate pattern scaling (Figure <ref type="figure">2</ref>). At two selected grid cells (near the coast of Philippines and NYC), the time evolution of DSL projections predicted by the two-layer emulator more accurately reflects GCM behavior and captures non-linearities and non-stationarity in the relationship between DSL and global-mean warming, comparing with that predicted by the univariate technique (Figure <ref type="figure">3</ref>).</p><p>By perturbating the key parameters, FaIR-2LM allows emulation of projected global-mean surface and deep-ocean temperature pairs and GMT-SLR for emissions scenarios (e.g., SSP3-7.0; Figures <ref type="figure">4</ref> and <ref type="figure">5</ref>) beyond those run by the GCMs to which it is calibrated. Compared with the likely ranges assessed by AR5 in the RCP 2.6, 4.5 and 8.5, the FaIR-2LM performs well in emulating the GSAT spread (Table <ref type="table">2</ref> and Figure <ref type="figure">S5</ref>). By 2300, the ensembles of GSAT and GMTSLR estimated by FaIR-2LM have a slightly lower median and a slightly wider 90% range than the estimations by M. D. <ref type="bibr">Palmer et al. (2018)</ref>, likely because we use the uncertainty of ECS from AR5, which has a larger range than that estimated by CMIP5 multi-model ensemble.</p><p>We produce probabilistic ensembles of DSL projections for four different emissions scenarios. Characteristics of median DSL projections during 2271-2290 include increases in DSL along most of the coast around the Pacific and Indian Oceans and a decrease in DSL over the Southern Ocean in all four scenarios, as well as increased DSL over the Arctic and along the North Atlantic Current in moderate to high emissions scenarios (Figure <ref type="figure">5</ref>). The 66% range (17th-83rd percentile) of uncertainties are small over the middle and low latitudes, and are relatively large over the Southern Ocean, Arctic and North Atlantic, where the simulations of GCMs are diverse due to the challenges of capturing the complex physical processes, such as deep water formation in the subpolar Atlantic, the Antarctic circumpolar current, and ice-albedo feedback in polar regions</p><p>YUAN AND KOPP 10.1029/2020MS002323 14 of 17 Scenarios Western Pacific North Atlantic Univariate Two-layer Reduction Univariate Two-layer Reduction RCP2.6 0.01604 0.01192 25.7% 0.027 0.022 18.5% RCP4.5 0.01782 0.01152 35.4% 0.0246 0.0206 16.3% RCP8.5 0.02126 0.01422 33.1% 0.0305 0.0264 13.4%</p><p>Note. The averaged RMSEs and the reduction of RMSE from univariate pattern scaling approach to two-layer pattern scaling are calculated for the three RCP scenarios, respectively. Table 3 Comparison of the Distributions of GSAT Anomaly (Relative to 1986-2005) Projected by FaIR-2LM With the Distributions of Global-Mean Surface Temperature Assessed by AR5 <ref type="bibr">(Collins et al., 2013)</ref> in <ref type="bibr">RCP 2.6,</ref><ref type="bibr">RCP 4.5,</ref> Columns, units: C) <ref type="bibr">(Flato et al., 2013;</ref><ref type="bibr">Landerer et al., 2014;</ref><ref type="bibr">Wang et al., 2014)</ref>. The ensemble of DSL projections also allows us to examine the trajectories of the DSL projections and their uncertainties at specific locations (Figure <ref type="figure">6</ref>). At selected locations in the North Atlantic and Western Pacific, the 90% range of DSL spread generally encapsulates the time series of DSL changes relative to the baseline period from the 5 GCMs.</p><p>The two-layer emulator provides a useful tool to explore the uncertainty of DSL projections over multiple centuries with computational resources that are much less than a GCM requires. It can be calibrated to match assessments of key values like the equilibrium climate sensitivity, and allows the flexibility of simulating forcing conditions intermediate between the RCPs as the patterns are common for different scenarios. However, we should note that the errors between the DSL predicted by two-layer emulator and DSL simulated by the corresponding GCMs are small in middle and low latitudes but relatively large in high latitudes (e.g., the Southern Ocean, Arctic, and subpolar Atlantic). In addition, the two-layer emulator cannot explicitly resolve the non-linear mesoscale effects of the ocean current due to the coarse resolutions of the CMIP5 GCMs that the two-layer pattern scaling relies on. Comparing with the predicted DSL derived from univariate approach, the improvement of using the two-layer approach on predicting DSL has a similar magnitude with the uncertainty of DSL projections over the middle and low latitudes in RCP2.6 and RCP4.5 scenarios during the period of 2271-2290. But the improvement is limited comparing to the uncertainty of DSL projections in RCP8.5. As the non-linear responses of DSL are more obvious in the RCP2.6 and RCP4.5 than in RCP8.5, the notable improvement of using the two-layer approach over the middle-and low-latitudes in the RCP2.6 and RCP4.5 highlight the advantage on improving the DSL projections in these two scenarios.</p><p>Loss of land ice (e.g., Greenland Ice Sheet and Antarctic Ice Sheet) is an important contributor not only to GMSL but also to RSL. Glacio-isostatic adjustment (GIA) caused by changes in ice sheet mass loading induces local vertical land motion and associated changes in local sea level. Mass loss of an ice sheet also reduces the gravitational attraction that pulls sea water toward it, causing water to migrate away with a distinct spatial pattern, or "fingerprint", of sea level change to the global ocean <ref type="bibr">(Mitrovica et al., 2009)</ref>. Freshwater flux from a melting ice sheet may drastically alter the salinity profile of the near-by ocean, bringing about complex feedbacks involving near-surface ocean stratification, sea ice formation, and corresponding changes in surface temperature, winds, and ocean currents <ref type="bibr">(Bronselaer et al., 2018;</ref><ref type="bibr">Sadai et al., 2020)</ref>-each process could have its impact in RSL. Among these factors, only freshwater flux from the ice sheet can significantly affect both global climate and DSL (e.g., <ref type="bibr">Golledge et al., 2019)</ref>. Despite the importance of polar ice sheets, their contributions to DSL are not included in the current generation coupled climate models.</p><p>Our study, relying on outputs from climate models participating the CMIP5 project, thus cannot take into account the effects of evolving ice sheets on DSL. In more comprehensive analyses, the effect of land-ice loss should be considered.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0"><p>CO</p></note>
		</body>
		</text>
</TEI>
