<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Exploring the Atmospheric Responses to Arctic Sea‐Ice Loss in Google's NeuralGCM</title></titleStmt>
			<publicationStmt>
				<publisher>AGU</publisher>
				<date>11/01/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10656226</idno>
					<idno type="doi">10.1029/2025MS005264</idno>
					<title level='j'>Journal of Advances in Modeling Earth Systems</title>
<idno>1942-2466</idno>
<biblScope unit="volume">17</biblScope>
<biblScope unit="issue">11</biblScope>					

					<author>Yu‐Chiao Liang</author><author>Nicholas J Lutsko</author><author>Young‐Oh Kwon</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[The rapid loss of Arctic sea ice is a striking consequence of anthropogenic global warming. Its remote impacts on mid‐latitude weather and climate have attracted scientific and media attention. In this study, we use a hybrid (dynamical plus machine‐learning) atmospheric model—Google's NeuralGCM—to investigate the mid‐latitude atmospheric circulation responses to Arctic sea‐ice loss for the first time. We conduct experiments in which NeuralGCM is forced with pre‐industrial and future sea‐ice concentrations following the protocol of the Polar Amplification Model Intercomparisom Project. To assess the performance of NeuralGCM, we compare the results with those simulated by two physics‐based climate models. NeuralGCM produces a comparable response of near‐surface warming to sea‐ice loss and the subsequent weakened zonal wind in mid‐latitudes. However, there is a substantial discrepancy between the two models' stratospheric responses, where different temperature responses in these models are associated with different zonal wind and geopotential height responses. Further investigation of North Atlantic blocking shows that NeuralGCM produces stronger, more frequent, and more realistic blocking events. Our results demonstrate the capability of NeuralGCM in simulating the tropospheric responses to Arctic sea‐ice loss, but improvements may be needed for the stratospheric representation.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The rapid decline of Arctic sea ice, observed by space-borne microwave satellites beginning in 1979, is one of the most robust indicators of anthropogenic climate change <ref type="bibr">(Cavalieri &amp; Parkinson, 2012;</ref><ref type="bibr">Serreze &amp; Stroeve, 2015;</ref><ref type="bibr">Stroeve &amp; Notz, 2018)</ref>. As a result, the radiative effect of Arctic sea ice has been estimated to weaken by 0.04-0.05 W&#8901;m -2 &#8901; decade -1 <ref type="bibr">(Duspayev et al., 2024)</ref>, affecting the global energy balance. The Special Report on Ocean and Cryosphere in a Changing Climate <ref type="bibr">(P&#246;rtner et al., 2019)</ref> and the Sixth Assessment Report of the Intergovernmental Panel on Climate Change <ref type="bibr">(IPCC, 2021)</ref> both highlighted that such dramatic sea-ice changes not only have caused detrimental influences on weather, ecosystems, and socioeconomics within the Arctic, but could also have exerted far-reaching impacts on mid-latitude atmospheric circulation and extreme events. Furthermore, in all future warming scenarios, state-of-the-art global climate models project the Arctic to be likely ice-free in September by 2050 <ref type="bibr">(Jahn et al., 2024)</ref>, and the first ice-free day could even occur before 2030 <ref type="bibr">(Heuz&#233; &amp; Jahn, 2024)</ref>.</p><p>The loss of Arctic sea ice, via a series of feedback processes, amplifies warming of Arctic near-surface air temperature by a factor of 3-4 times the global or tropical average <ref type="bibr">(Chylek et al., 2022;</ref><ref type="bibr">Rantanen et al., 2022)</ref>. This phenomenon is called Arctic amplification (AA). In response to sea-ice loss and the accompanied AA, which intrinsically manifests itself as the reduced meridional temperature gradient in the lower troposphere, a weakening of mid-latitude jet stream due to the thermal wind adjustment is expected. Some studies further suggested that the strength and propagation of tropospheric planetary waves can be altered, and subsequently affect the occurrence of extreme events in mid-latitudes <ref type="bibr">(Cohen et al., 2014</ref><ref type="bibr">(Cohen et al., , 2018;;</ref><ref type="bibr">J. A. Francis &amp; Vavrus, 2012</ref><ref type="bibr">, 2015;</ref><ref type="bibr">J. Francis et al., 2018)</ref>. Another branch of studies highlighted the role of the stratosphere <ref type="bibr">(Cohen et al., 2014;</ref><ref type="bibr">Jaiser et al., 2016;</ref><ref type="bibr">Kim et al., 2014;</ref><ref type="bibr">Liang et al., 2024;</ref><ref type="bibr">Nakamura et al., 2016;</ref><ref type="bibr">Sun et al., 2015;</ref><ref type="bibr">Xu et al., 2021</ref><ref type="bibr">Xu et al., , 2023))</ref>: the sea-ice loss and the associated diabatic heating excite vertically-propagating of planetary waves that disrupt the stratospheric polar vortex via eddy-mean flow interactions, and then the stratospheric anomalies gradually move downward to the troposphere, projecting onto a negative Northern Annular Mode (NAM, <ref type="bibr">Thompson &amp; Wallace, 2000)</ref> or a negative North Atlantic Oscillation (NAO, <ref type="bibr">Barnston &amp; Livezey, 1987)</ref>. These proposed dynamical pathways link Arctic sea-ice loss to large-scale atmospheric circulation changes in midlatitudes. Climate models exhibit inconsistent atmospheric responses to Arctic sea-ice loss, hindering the interpretation of the underlying mechanisms <ref type="bibr">(Cohen et al., 2020;</ref><ref type="bibr">Screen et al., 2018)</ref>. For example, a group of studies using atmospheric general circulation models (AGCMs) forced by sea-ice loss conditions showed that the largescale atmospheric circulation response in the Northern Hemisphere resembles a negative NAM-like or NAO-like pattern <ref type="bibr">(Liang et al., 2021;</ref><ref type="bibr">Peings &amp; Magnusdottir, 2014;</ref><ref type="bibr">Seierstad &amp; Bader, 2009)</ref>, whereas another group presented an opposite or muted response <ref type="bibr">(Cassano et al., 2014;</ref><ref type="bibr">Screen et al., 2014;</ref><ref type="bibr">Singarayer et al., 2006;</ref><ref type="bibr">Strey et al., 2010)</ref>. This inconsistency not only reveals that large internal variability in the atmosphere can overwhelm the sea ice-forced circulation responses, but also reflects varied temporal and spatial distributions of sea-ice forcings prescribed to drive models, different experimental configurations, or model biases.</p><p>To address these issues, several coordinated multi-model projects have been performed (e.g., <ref type="bibr">Liang et al., 2020;</ref><ref type="bibr">Ogawa et al., 2018;</ref><ref type="bibr">D. M. Smith et al., 2019)</ref>. Among them, the Polar Amplification Model Intercomparison Project (PAMIP, <ref type="bibr">Danabasoglu, 2019;</ref><ref type="bibr">D. M. Smith et al., 2019)</ref> of the Coupled Model Intercomparison Project phase 6 (CMIP6, <ref type="bibr">Eyring et al., 2016)</ref> provides simulated results using multiple climate models forced by the same future and pre-industrial sea-ice concentration (SIC) conditions. Following the protocol of PAMIP, modeling centers simulated at least 100 ensemble members for each experiment to address the effects of internal variability. The products of PAMIP serve as a perfect testbed for studying Arctic-mid-latitude linkages and can shed insights into the origin of the inconsistencies. In this study, we use the PAMIP simulation sets from the Whole Atmosphere Community Climate Model version 6 (WACCM6, <ref type="bibr">Gettelman et al., 2019)</ref>, developed by the National Center for Atmospheric Research, as a reference simulation set of fully physics-based modeling. WACCM6 has a high-top configuration that includes more vertical layers in the stratosphere, presumably better capturing the stratospheric pathway linking Arctic sea-ice loss to mid-latitude circulation changes.</p><p>More recently, machine learning (ML) techniques have been applied to forecasting weather and projecting future climates <ref type="bibr">(Barnes et al., 2020;</ref><ref type="bibr">Beucler et al., 2023;</ref><ref type="bibr">Eyring et al., 2024;</ref><ref type="bibr">Lai et al., 2024;</ref><ref type="bibr">Li et al., 2023;</ref><ref type="bibr">Mamalakis et al., 2020;</ref><ref type="bibr">Reichstein et al., 2019)</ref>. Data-driven approaches have achieved remarkable success in forecasting weather on time-scales of a few days to a week <ref type="bibr">(Bi et al., 2023;</ref><ref type="bibr">Keisler, 2022;</ref><ref type="bibr">Lam et al., 2023;</ref><ref type="bibr">Rasp et al., 2024;</ref><ref type="bibr">Schreck et al., 2024;</ref><ref type="bibr">Watt-Meyer et al., 2024)</ref>. However, simulations of these ML models on longer timescale beyond the weather prediction remain challenging and have not demonstrated better performance than existing dynamical climate models. This hinders their applicability to study subseasonal-to-seasonal forecasts, interannual and decadal variability, and climate change <ref type="bibr">(Brenowitz &amp; Bretherton, 2019;</ref><ref type="bibr">Watt-Meyer et al., 2023)</ref>.</p><p>Instead of using purely data-driven approaches, recent studies have begun to adopt a "hybrid" modeling approach to construct climate models that can be integrated over an extended period of time <ref type="bibr">(Kochkov et al., 2024;</ref><ref type="bibr">Rasp et al., 2018;</ref><ref type="bibr">Yuval &amp; O'Gorman, 2020)</ref>. The "hybrid" configuration typically combines two components: (a) a physics-based dynamical core that solves the primitive equations for atmospheric circulation and thermodynamics, and (b) a blending of trained parametrization schemes using neural networks that emulate radiation, convective processes, subgrid-scale dynamics, etc. The coupling of dynamical and ML components with additional training efforts can alleviate the instabilities and model drift that previous data-driven models encountered <ref type="bibr">(Brenowitz &amp; Bretherton, 2019;</ref><ref type="bibr">Kochkov et al., 2024)</ref>, thus allowing for longer, more stable integrations. Furthermore, hybrid models are more interpretable than pure data-driven models, as the large-scale dynamics and thermodynamics are physically resolved in the dynamical core component.</p><p>NeuralGCM, a recently developed hybrid climate model by Google Research <ref type="bibr">(Kochkov et al., 2024)</ref>, represents a significant advancement in atmospheric modeling. In addition to its enhanced skill in ensemble weather forecasting, NeuralGCM exhibits strong capability in simulating atmospheric variability across a broad range of time-scales-from subseasonal to decadal. Owing to its flexible architecture, which permits modification of lower boundary conditions such as SIC, NeuralGCM could be an atmospheric model used to explore atmospheric responses to Arctic sea-ice loss.</p><p>In this study, we use NeuralGCM to perform, for the first time, a set of pre-industrial and future sea-ice experiments and compare the results with simulations conducted using WACCM6. Section 2 provides an overview of the model structure and experimental design. Section 3 presents the large-scale atmospheric responses to Arctic sea-ice loss in both NeuralGCM and WACCM6, including analyses of atmospheric blocking and storm tracks over the North Atlantic and its underlying mechanisms. To test the physical-based climate model dependence, simulations from another GCM-Taiwan Earth System Model Version1 (TaiESM1)-are also analyzed. A broader discussion of the findings, future outlook, and concluding remarks are provided in Section 4. We anticipate that our analysis framework could pave a new avenue for future climate modeling studies using both new generation hybrid or ML models and physics-based models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Models, Numerical Experiments, and Methodology</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">NeuralGCM</head><p>NeuralGCM, recently released by Google Research, aims to provide a deep-learning based atmospheric model for both weather forecasting and climate projections. As reviewed in Section 1, hybrid models, such as NeuralGCM, integrate data-driven approaches with physical modeling, offering enhanced numerical stability and reduced model drift, thereby enabling simulations that extend beyond short-term weather forecasts <ref type="bibr">(Brenowitz &amp; Bretherton, 2019;</ref><ref type="bibr">Kochkov et al., 2024)</ref>. The model architecture consists of a differentiable dynamical core <ref type="bibr">(Gelbrecht et al., 2023)</ref>, which solves a set of hydrostatic primitive equations with moisture for atmospheric dynamics and thermodynamics <ref type="bibr">(Bourke, 1974)</ref>. This dynamical core is coupled to a neural network-based physics module that parameterizes unresolved subgrid-scale processes, including radiative transfer, moist convection, precipitation, and turbulent mixing (see Figure <ref type="figure">1</ref> in <ref type="bibr">Kochkov et al., 2024)</ref>.</p><p>In this study, we utilize the deterministic configuration of NeuralGCM at horizontal resolutions of 2.8&#176;and 1.4&#176;, with 37 interpolated vertical levels extending from 1,000 to 1 hPa (in the dynamical core, 32 equidistant sigma levels for vertical discretization from the surface to the top of atmosphere are implemented), following the configuration described in <ref type="bibr">Kochkov et al. (2024)</ref>. Therefore, NeuralGCM can be categorized as the low-top atmospheric model. Importantly, NeuralGCM's encoder-decoder interface permits modification of surface boundary conditions, such as SIC and sea-surface temperature (SST). Leveraging this flexibility, we perform sea ice-forced experiments by prescribing varying SIC conditions, as detailed in Section 2.3. We show the results of NeuralGCM with 2.8&#176;resolution in the main text and presents the results with 1.4&#176;resolution in the Supporting Information.</p><p>For weather forecasting with a 10-day lead time, NeuralGCM demonstrates prediction skill comparable to that of purely data-driven models such as Pangu <ref type="bibr">(Bi et al., 2023)</ref> and GraphCast <ref type="bibr">(Lam et al., 2023)</ref>. For medium-range forecasts extending up to 15 days, NeuralGCM exhibits superior predictive performance relative to the ensemble prediction products from the European Center for Medium-Range Weather Forecasts (ECMWF). In terms of seasonal prediction, NeuralGCM realistically reproduces the seasonal cycle of global mean temperature at 850 hPa, consistent with both the ERA5 reanalysis data set <ref type="bibr">(Hersbach et al., 2020)</ref> and simulations from the global cloud-resolving climate model X-SHiELD <ref type="bibr">(Cheng et al., 2022)</ref>. On decadal time-scales, Atmospheric Model Intercomparison Project (AMIP)-style simulations for the period 1981-2014 <ref type="bibr">(Eyring et al., 2016)</ref> indicate that NeuralGCM effectively captures the observed global warming trends. Moreover, NeuralGCM exhibits reduced spatial biases compared to AMIP simulations from the CMIP6 archive. These findings, as comprehensively documented in <ref type="bibr">Kochkov et al. (2024)</ref>, reinforce the potential of NeuralGCM as a reliable tool for a wide range of climate modeling applications. More recent studies further showed that NeuralGCM has the ability to capture the Arctic amplification <ref type="bibr">(Rucker et al., 2025)</ref> and extreme heatwaves <ref type="bibr">(Duan et al., 2025)</ref>, and investigated the atmospheric responses to uniform SST warming <ref type="bibr">(Zhang &amp; Merlis, 2025)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Whole Atmosphere Community Climate Model Version 6 (WACCM6)</head><p>To provide a fully dynamical modeling reference for the evaluation of NeuralGCM's performance, we use WACCM6 configuration of the Community Earth System Model version 2 (CESM2, <ref type="bibr">Danabasoglu et al., 2020)</ref> with a horizontal resolution of 0.9&#176;latitude &#215; 1.25&#176;longitude and 70 vertical layers extending from the surface to 6x10 -6 hPa (140 km) <ref type="bibr">(Gettelman et al., 2019)</ref>. The dynamical core in WACCM6 is based on the Godunov-type finite volume scheme in which noise-free numerical solutions can be obtained <ref type="bibr">(Lin &amp; Rood, 1997)</ref>. The physical parameterization schemes of WACCM6 are the same as those in the Community Atmosphere Model version 6 (CAM6, <ref type="bibr">Danabasoglu et al., 2020)</ref>, except for the nonorographic gravity wave drag parameterization scheme <ref type="bibr">(Gettelman et al., 2019)</ref>. The updated gravity wave drag scheme improves the simulation of frontogenesis, and gives rise to a realistic self-generated quasi-biennial oscillation (QBO). Recent studies reported that WACCM6 simulates more realistic sudden warming events and QBO in the stratosphere, as well as storm tracks, stationary waves, and blockings in mid-latitude troposphere, compared with those simulated in CAM6 <ref type="bibr">(Ayarzag&#252;ena et al., 2020;</ref><ref type="bibr">Gettelman et al., 2019;</ref><ref type="bibr">Simpson et al., 2020)</ref>.</p><p>In addition, WACCM6 is developed to resolve a full range of chemical reactions and the interactions between chemical processes and atmospheric dynamics and thermodynamics <ref type="bibr">(Gettelman et al., 2019)</ref>. However, we do not activate the interactive chemistry module in WACCM6 due to its high computational costs (see Table <ref type="table">2</ref> in <ref type="bibr">Gettelman et al., 2019)</ref>, instead we use the version with specified chemistry. In this configuration, for example, the ozone profile is prescribed; therefore, interactions between ozone and atmospheric circulation are not permitted. Thorough documentation and discussions on the effect of chemistry-atmosphere interactions can be found in <ref type="bibr">Gettelman et al. (2019)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Taiwan Earth System Model Version 1 (TaiESM1)</head><p>We also use the atmospheric component of Taiwan Earth System Model Version 1 (TaiESM1), developed by the Research Center for Environmental Change, Academia Sinica, to test the physical model dependence. TaiESM1 shares the same spatial resolution as WACCM6, except its vertical layer only contains 30 layers with a model top at 2 hPa. Thus, TaiESM1 represents a low-top model with respect to the high-top configuration of WACCM6. The model is based on CAM version 5.3 <ref type="bibr">(Neale et al., 2010)</ref> with a finite-volume dynamical core <ref type="bibr">(Lin, 2004)</ref>. A newly-developed triggering functions for the deep convection scheme (Y.-C. <ref type="bibr">Wang et al., 2015)</ref>, a revised cloud macrophysics scheme <ref type="bibr">(Shiu et al., 2021)</ref>, and a three-moment aerosol scheme <ref type="bibr">(Chen et al., 2013)</ref> were implemented into CAM version 5.3. The model also considers the three-dimensional radiation-topography interactions <ref type="bibr">(Lee et al., 2013)</ref>. More details and model performance were documented in <ref type="bibr">Lee et al. (2020)</ref>. The results of TaiESM1 are shown in Supporting Information S1 and discussed throughout the manuscript.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Pre-Industrial and Future Sea-Ice Simulations</head><p>We follow the experimental protocol designed by the PAMIP (D. M. <ref type="bibr">Smith et al., 2019)</ref> to perform time-slice simulations using NeuralGCM, WACCM6, and TaiESM1. Each simulation is integrated from April 1st of the initial year through 31 May of the following year, yielding 14-month simulations for analysis. For the WACCM6 and TaiESM1 simulations, the CMIP6 radiative forcing at year 2000 is fixed during the integration period. To be consistent, the NeuralGCM is initialized with conditions from ERA5 for 1 April 2000. The simulations use prescribed SIC fields that represent pre-industrial and future conditions, specified as lower boundary conditions. The pre-industrial SIC fields are derived from the Coupled Model Intercomparison Project Phase 5 (CMIP5) historical simulations, selected based on their consistency with the observed global-mean surface temperature (about 13.67&#176;C) during the pre-industrial era <ref type="bibr">(Haustein et al., 2017)</ref>. Future SIC fields, in contrast, are obtained from CMIP5 projections under the Representative Concentration Pathway 8.5 (RCP8.5) scenario <ref type="bibr">(Hausfather, 2020)</ref>, corresponding to a climate state in which global-mean surface temperature exceeds the preindustrial level by approximately 2&#176;C. In the Southern Hemisphere, SIC is held fixed at present-day values, estimated using the 1979-2008 mean SIC from the Hadley Center Sea Ice and Sea Surface Temperature (HadISST) data set <ref type="bibr">(Rayner et al., 2003)</ref>. The spatial distributions of SIC for March and September, as well as the temporal evolution of Arctic-mean monthly SIC, are illustrated in Figure <ref type="figure">1</ref> of <ref type="bibr">Liang et al. (2024)</ref>. The preindustrial and future experiments are also referred to as experiment 1.5 pdSST-piArcSIC and 1.6 pdSST-futArcSIC listed in Tabel 1 in D. M. <ref type="bibr">Smith et al. (2019)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Journal of Advances in Modeling Earth Systems</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>10.1029/2025MS005264</head><p>To account for the influence of internal variability, the PAMIP protocol recommends that participating models generate 100 ensemble members for each simulation set (D. M. <ref type="bibr">Smith et al., 2019)</ref>. However, recent studies have indicated that even larger ensemble sizes may be necessary to robustly quantify the uncertainties associated with internal variability <ref type="bibr">(Labe et al., 2019;</ref><ref type="bibr">Liang et al., 2020;</ref><ref type="bibr">Peings et al., 2021;</ref><ref type="bibr">Sun et al., 2022)</ref>. In light of this, we expand the ensemble size to 200 members for the NeuralGCM, WACCM6, TaiESM1 simulations. To generate ensemble members, we add small perturbations to one grid point of the temperature fields in both models when starting integration. The ensemble mean is subsequently computed across all members to effectively suppress the noise arising from internal variability, thereby enhancing the signal-to-noise ratio of the sea-ice forced response <ref type="bibr">(Deser et al., 2020;</ref><ref type="bibr">Maher et al., 2021)</ref>. The atmospheric response to Arctic sea-ice loss is quantified by taking the difference between the ensemble-mean fields from the future sea-ice simulation and those from the pre-industrial counterpart.</p><p>Using GPUs (NVIDIA GeForce RTX 4060 Ti), to complete one 2.8 degree NeuralGCM simulation requires about 8 min, while using CPU 224 cores on supercomputer Forerunner1, where we ran WACCM6 and TaiESM1 simulations, to complete one WACCM6 simulation takes about 8 hr. It is noted that NeuralGCM with 2.8&#176;r esolution successfully generates 200 ensemble members without the stability issue. However, the version with 1.4&#176;suffers from the stability problem: about one-third of the ensemble members crashes during the integration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5.">Reanalysis Data</head><p>We use the ERA5 reanalysis data set <ref type="bibr">(Hersbach et al., 2020)</ref> in 1 April of 2000 as the initial conditions to integrate NeuralGCM. We also analyze daily ERA5 500-hPa geopotential height (hereafter Z500) for observational estimates of blocking during the 1979-2023 period. A global mean cold bias in the lower stratosphere during the 2000-2006 period was documented <ref type="bibr">(Simmons et al., 2020)</ref>, so caution is needed when analyzing stratospheric fields.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.6.">Blocking Detection, Jet Latitude, Storm Track, Eady Growth Rate</head><p>Blocking, a key atmospheric phenomenon examined in this study, is characterized by a persistent high-pressure system that disrupts the prevailing jet stream and alters synoptic weather patterns in mid-latitudes <ref type="bibr">(Arnheim et al., 2025;</ref><ref type="bibr">Booth et al., 2017;</ref><ref type="bibr">Rex, 1950;</ref><ref type="bibr">Woollings et al., 2018)</ref>. Given its strong association with extreme weather events-including cold spells and heatwaves-blocking has wide socioeconomic implications <ref type="bibr">(Kautz et al., 2021;</ref><ref type="bibr">Woollings et al., 2018)</ref>. However, previous studies have reported inconsistent responses of blocking to Arctic warming. Some studies suggested an increase in blocking frequency (J. A. <ref type="bibr">Francis &amp; Vavrus, 2015;</ref><ref type="bibr">Liu et al., 2012;</ref><ref type="bibr">Mori et al., 2014)</ref>, while others indicated minimal changes or even a decrease in blocking occurrence <ref type="bibr">(Kennedy et al., 2016)</ref>.</p><p>To investigate the response of blocking in the North Atlantic sector (90&#176;W-60&#176;E and 35&#176;N-75&#176;N) to Arctic seaice loss, we detect a blocking event using Z500. Specifically, we follow the blocking detection algorithm developed by <ref type="bibr">Scherrer et al. (2006)</ref> to calculate the Z500 meridional gradients to the north and south of a target longitude and latitude (x 0 , y 0 ):</p><p>where y S = y 0 -16&#176;and y N = y 0 + 16&#176;. If the Z500 gradients satisfy two criteria, we identify it as an instantaneous blocking: (a) &#8711;Z500 S &gt; 0 and (b) &#8711;Z500 N &lt; -10 m/&#176;. The first criterion characterizes the reversal of the Z500 gradient to the south, which occurs when a block emerges <ref type="bibr">(Lejen&#228;s &amp; &#216;kland, 1983)</ref>; whereas the second criterion ensures that a westerly wind to the north of the blocking occurs as the surrounding jet stream is split <ref type="bibr">(Barnes et al., 2012;</ref><ref type="bibr">Tibaldi &amp; Molteni, 1990)</ref>. Finally, if instantaneous blocking lasts for five or more consecutive days, we count those days as the blocking days.</p><p>To define the jet latitude associated with blocks, we take the zonal average of the daily 850-hPa zonal wind (hereafter U850) across different latitudes over the longitudinal domain of 75&#176;W to 15&#176;E, following previous</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Journal of Advances in Modeling Earth Systems</head><p>10.1029/2025MS005264</p><p>studies <ref type="bibr">(Davini &amp; Cagnazzo, 2014;</ref><ref type="bibr">Kwon et al., 2020;</ref><ref type="bibr">Woollings et al., 2010)</ref>. A 5-day running average is then performed over the time dimension, and the latitude with highest wind speed is determined as the jet latitude.</p><p>We also investigate the storm track response to Arctic sea-ice loss. The storm track is defined as the transient eddy heat flux (&lt;v &#697;T&#697;&gt; ) at 850 hPa following <ref type="bibr">Kwon et al. (2020)</ref>. We first apply the Lanczos high-pass filter at 8 days with 15 weights to the daily meridional wind (v) and the air temperature t. The strength of the storm track is calculated as the covariance of filtered v and T during extended winter from December to March.</p><p>To enhance our understanding of the underlying causes of storm track changes, we calculate the Eady growth rate <ref type="bibr">(Eady, 1949;</ref><ref type="bibr">Lindzen &amp; Farrell, 1980;</ref><ref type="bibr">Simmonds &amp; Lim, 2009)</ref>, which characterizes the development of baroclinic instability <ref type="bibr">(Kwon et al., 2020)</ref>. The Eady growth rate can be formulated as:</p><p>where f is the Coriolis parameter, z the vertical coordinate, U is the wind component, N the Brunt-V&#228;&#305;s&#228;l&#228; frequency (N 2 = g &#952; &#8706;&#952; &#8706;z , g is the acceleration due to gravity and &#952; is the potential temperature). We multiply the rate by 86,400 (second per day) to convert to units of 1/day.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.7.">Climate Mean States and Extremes in NeuralGCM, WACCM6, and TaiESM1</head><p>We briefly investigate the climate mean states and extremes in the three models, as the mean state differences could influence the responses to Arctic sea-ice loss. We first look into the 60&#176;N zonal-mean zonal wind at 10 hPa (U10) and the number of sudden stratospheric warmings (SSWs), which are identified following the algorithm from <ref type="bibr">Charlton and Polvani (2007)</ref>. NeuralGCM has higher U10 during October-January and similar U10 compared with WACCM6, while TaiESM1 simulates stronger U10 throughout the analysis period (Figure <ref type="figure">S1a</ref> in Supporting Information S1). Correspondingly, NeuralGCM produces similar SSWs as WACCM6 in most months, except November, while TaiESM1 underestimates the occurrence of SSWs (Figure <ref type="figure">S1b</ref> in Supporting Information S1). These results suggest NeuralGCM is as good at capturing the strength of mean stratospheric circulation and extreme events, as WACCM6.</p><p>We next analyze the zonal-mean zonal wind and temperature in height-latitude coordinates. In the troposphere, the three models generally agree with each other (Figure <ref type="figure">S2</ref> in Supporting Information S1). Evident difference occurs in the stratosphere where the temperatures between equator and 30&#176;N, between 10 and 1 hPa in Neu-ralGCM and WACCM6 are stronger and extend further to the north than TaiESM1 (Figures S3a-S3c in Supporting Information S1). The stratospheric zonal winds in mid-latitudes are strongest in TaiESM1, followed by WACCM6 and NeuralGCM (Figures S3e and S3f in Supporting Information S1).</p><p>To further examine the longitude-latitude spatial structure of tropospheric and stratospheric dynamical and thermodynamical variables, we select near-surface air temperature at 1,000, 500, 10 hPa (hereafter referred to as T1000, T500, T10), and geopotential height at 1,000, 500, 10 hPa (hereafter referred to as Z1000, Z500, Z10). The spatial patterns of temperature are similar in the three models, but the magnitude of temperature in NeuralGCM is overall weaker than those in the other two models, particularly at 10 and 1,000 hPa (Figure <ref type="figure">S4</ref> in Supporting Information S1). The spatial structures of geopotential height are similar in the three models, for example, the low height values above Barents-Kara Seas at 10 hPa, but NeuralGCM has overall lower values at 10 hPa and higher values at 500 and 1,000 hPa (Figure <ref type="figure">S5</ref> in Supporting Information S1).</p><p>To investigate extreme events simulated in the three models, We follow <ref type="bibr">Johnson et al. (2018)</ref> to identify cold and warm extremes. Without the hourly data output, we instead use daily data and consider the temperature anomalies falling and above below the tenth percentile during consecutive 5 days. We show in Figure <ref type="figure">S6</ref> in Supporting Information S1 the spatial distributions of cold and warm extreme frequency during the December-January-February (DJF) period. For the warm extremes, the three models present similar large-scale patterns. Neu-ralGCM somehow overestimates the low frequency regions, including the western coasts of the North America, North Europe, and north of Iceland. For the cold extremes, NeuralGCM generally underestimates the extreme frequency. Therefore, NeuralGCM may find it challenging to capture cold extremes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Journal of Advances in Modeling Earth Systems</head><p>10.1029/2025MS005264</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Large-Scale Atmospheric Responses to Arctic Sea-Ice Loss</head><p>We begin by examining the prescribed Arctic sea-ice area (SIA) in both the pre-industrial and future simulations. As shown in Figure <ref type="figure">1a</ref>, the seasonal cycle of SIA in both scenarios displays a minimum in September and a maximum in March. As designed, the SIA in the future scenario is lower than that in the pre-industrial scenario throughout the year, with the largest difference (approximately 9.3 &#215; 10 6 km 2 ) in October (gray bars in Figure <ref type="figure">1a</ref>).</p><p>In NeuralGCM, WACCM6, and TaiESM1, the Arctic T1000 exhibits a delayed seasonal cycle, with minimum values occurring in January or February and maximum values in July (Figure <ref type="figure">1b</ref>). This phase lag between SIA and T1000 reflects the influence of ocean-atmosphere heat exchange and associated local feedback processes, as highlighted in previous studies <ref type="bibr">(Bintanja &amp; Van der Linden, 2013;</ref><ref type="bibr">Boeke &amp; Taylor, 2018;</ref><ref type="bibr">Chung et al., 2021;</ref><ref type="bibr">Hahn et al., 2022;</ref><ref type="bibr">Liang et al., 2022;</ref><ref type="bibr">D. M. Smith et al., 2017;</ref><ref type="bibr">Wu et al., 2023)</ref>. The T1000 values simulated by NeuralGCM are higher during the cold season compared to those simulated by WACCM6 and TaiESM1, and a reduction in the amplitude of the near-surface air temperature seasonal cycle in NeuralGCM. We also notice that the ensemble spread, estimated as one standard deviation across 200 members (color shadings in Figure <ref type="figure">1b</ref>), in the three models are similar, suggesting that NeuralGCM produces internal variability comparable to WACCM6 and TaiESM1.</p><p>Focusing on the atmospheric response to Arctic sea-ice loss, the T1000 response simulated by NeuralGCM begins to increase during the warm season, peaks in November-approximately one month after the maximum sea-ice forcing shown in October-and subsequently declines into the cold season (Figure <ref type="figure">1c</ref>). This lagged response indicates that NeuralGCM effectively captures the delayed atmospheric adjustment to Arctic sea-ice loss, consistent with findings from previous studies <ref type="bibr">(Liang et al., 2021;</ref><ref type="bibr">Peings &amp; Magnusdottir, 2014)</ref>. A similar seasonal evolution in T1000 responses are also simulated by WACCM6 and TaiESM1. The T1000 responses in WACCM6 and TaiESM1 are slightly stronger than that of NeuralGCM after November. In terms of internal variability of the temperature response, the ensemble spreads tend to increase during middle to late winter (post-January), indicating enhanced atmospheric variability in the deep cold season in both models. The ensemble spread range of NeuralGCM is mostly within the range of WACCM6 and TaiESM1, with the former smaller than the latter.</p><p>We next examine the relationship between Arctic and global T1000 responses to assess the extent of AA in NeuralGCM, WACCM6, and TaiESM1 (Figure <ref type="figure">1d</ref>). In all models, the Arctic T1000 response is consistently stronger than the global-mean response, presenting AA. While NeuralGCM simulates a slightly smaller Arctic T1000 response and a similar global response compared to WACCM6 and TaiESM1, the magnitude of ensemble spread is overall similar. This result confirms again that the effect of internal variability is similar in the three models.</p><p>The resolution of the model has been discussed to affect the simulated atmosphere and ocean <ref type="bibr">(Chang et al., 2020;</ref><ref type="bibr">Scaife et al., 2019)</ref>. To test the sensitivity of the response to model spatial resolution, we conduct the same NeuralGCM experiments with a higher horizontal resolution (1.4&#176;). The seasonal evolution remains largely consistent, with the response magnitude in NeuralGCM weakened and the variability more comparable to WACCM6's (Figure <ref type="figure">S7c</ref> in Supporting Information S1). Notably, the higher-resolution version of NeuralGCM enlarges the internal variability of the global T1000 response (red dots in Figure <ref type="figure">1d</ref>). However, the Arctic and global responses become somewhat weaker. We further discuss the impact of resolution in the Discussion section.</p><p>We next look into the zonal-mean temperature and zonal wind responses simulated by NeuralGCM. Two prominent warming regions emerge in the mid-to high-latitude troposphere and stratosphere (Figure <ref type="figure">2a</ref>). The first likely corresponds to the direct thermal response to Arctic sea-ice loss, characterized by a strong near-surface warming whose amplitude decreases both southward and upward. A similar bottom-heavy warming structure is also evident in the WACCM6 simulation (Figure <ref type="figure">2c</ref>). The associated zonal-mean zonal wind responses in both models exhibit a weakening centered around 60&#176;N (Figures <ref type="figure">3a</ref> and <ref type="figure">3c</ref>), broadly consistent with the thermal-wind balance adjustment arising from hydrostatic and geostrophic balance of forces. The second warming region in NeuralGCM appears in the high-latitude stratosphere, centered near 50 hPa (Figure <ref type="figure">2b</ref>). In contrast, WACCM6 shows a different response, with stratospheric cooling north of 60&#176;N and warming to the south, resulting in opposing stratospheric wind responses between the two models (Figures <ref type="figure">3b</ref> and <ref type="figure">3d</ref>). We note that the zonal-mean temperature and wind responses in the higher-resolution version of NeuralGCM are generally stronger than those in the lower-resolution counterpart (Figures S8 and S10 in Supporting Information S1). We also find that TaiESM1 simulates a warming and weakening of zonal wind in the stratosphere (Figures S9 and S11 in Supporting Information S1), but the weakening center is different from NeuralGCM's. The tropospheric responses in TaiESM1 are similar to those in NeuralGCM. Taken together, these results suggest that NeuralGCM produces a reasonable simulation of the large-scale atmospheric temperature and circulation responses to Arctic sea-ice loss in the troposphere when compared with WACCM6 and TaiESM1. However, its representation of stratospheric responses appears to deviate from that of WACCM6 and TaiESM1, highlighting potential limitations in simulating stratospheric processes.</p><p>In addition to the thermal wind adjustment, the propagation of Rossby waves and the associated eddy-mean flow interactions also modulate the tropospheric and stratospheric jets <ref type="bibr">(Andrews et al., 1987)</ref>. To assess this mechanism, we calculate the Eliassen-Palm (EP) flux and its divergence (proportional to positive zonal-wind acceleration) <ref type="bibr">(Edmon et al., 1980)</ref>. In both models, the EP flux responses predominantly propagate upward from the surface between 40&#176;N and 50&#176;N, turn poleward near 300 hPa, and descend around 60&#176;N (Figures <ref type="figure">4a</ref> and <ref type="figure">4c</ref>). In NeuralGCM, this downward propagation is particularly pronounced, resulting in a positive zonal wind tendency due to EP flux divergence between 800 and 500 hPa over the middle-to-high latitudes. In contrast, WACCM6 exhibits an upward EP flux response in the lower atmosphere that counteracts the descending flux, leading to EP flux convergence and a negative zonal wind tendency. This likely contributes to the stronger negative zonal wind response in WACCM6 compared to NeuralGCM (Figures <ref type="figure">3c</ref> and <ref type="figure">3a</ref>).</p><p>In the upper troposphere and stratosphere, EP flux responses display upward behaviors in the two models, but the opposite wind tendencies (Figures <ref type="figure">4b</ref> and <ref type="figure">4d</ref>): in NeuralGCM a negative wind tendency occurs above 50 hPa to the north of 60&#176;but there is a positive tendency in WACCM6. To investigate how the opposite wind tendency occurs, we calculate the total EP fluxes entering the high-latitude stratosphere region (the blue box in Figures <ref type="figure">4b</ref> and <ref type="figure">4d</ref>). We find that large EP fluxes leave the southern boundary of the selected region in NeuralGCM (the blue number, which is calculated following Sigmond and Scinocca (2010)), favoring an enhanced zonal wind tendency. But this is not seen in the wind tendency. In WACCM6, the net loss of EP fluxes, although smaller than that in NeuralGCM, corresponds to enhanced zonal-wind tendency. These results indicate the EP fluxes may not be the main process in NeuralGCM to modulate the zonal wind, or the eddy-mean flow interaction is not well represented. In addition, the high-resolution simulation of NeuralGCM shows opposite EP fluxes and zonal wind tendency responses with low-resolution version in the stratosphere (Figures S12a and S12b in Supporting Information S1). TaiESM1 also shows different EP flux responses from NeuralGCM and WACCM6 in the Journal of Advances in Modeling Earth Systems 10.1029/2025MS005264</p><p>stratosphere, but the enhanced wind tendency locates at levels higher than 5 hPa (Figures <ref type="figure">S13a</ref> and <ref type="figure">S13b</ref> in Supporting Information S1). These discrepancies, again, highlight the limitations of stratospheric representation in NeuralGCM.</p><p>To investigate stratosphere-troposphere interactions in response to Arctic sea-ice forcing during the cold season in the Northern Hemisphere, we analyze the polar-cap (65&#176;N-90&#176;N) averaged geopotential height (hereafter Z) responses from 1 September to 31 March of the following year. NeuralGCM exhibits strong vertical coupling between the troposphere and the entire stratosphere, characterized by persistent positive height anomalies extending from November through March, except for a period of weak anomalies from the middle of December to the middle of January, (Figure <ref type="figure">5a</ref>). In contrast, WACCM6 displays weaker stratosphere-troposphere coupling, with positive height responses largely confined to the lower stratosphere during this period (Figure <ref type="figure">5b</ref>). As a result, notable discrepancies between the two models emerge above 50 hPa, with the largest differences occurring in October, November, February, and March (Figure <ref type="figure">5c</ref>), a pattern also shown in the high-resolution version of NeuralGCM (Figure <ref type="figure">S14c</ref> in Supporting Information S1). Similarly, TaiESM1 presents a weaker stratospheretroposphere coupling (Figure <ref type="figure">S15</ref> in Supporting Information S1), compared with NeuralGCM. These Journal of Advances in Modeling Earth Systems The zonal-mean and polar-cap averages offer an informative dynamical perspective on the large-scale atmospheric circulation responses to Arctic sea-ice loss. However, zonally asymmetric features also deserve examination, as they reveal horizontal patterns that may relate to specific weather regimes and regional weather phenomena. We look at the geographic distribution of T1000 responses in the two models. The warming amplitude of NeuralGCM is overall weaker than that of WACCM6. NeuralGCM captures the warming hotspots associated with large sea-ice loss in the Barents-Kara Seas, the Bering Strait, the Sea of Okhotsk, and the Hudson Bay (Figure <ref type="figure">6c</ref>), which are also shown in WACCM6 (Figure <ref type="figure">6f</ref>), although the temperature response is generally weaker in NeuralGCM than in WACCM6. At 500 hPa, a dipolar temperature response with warming above Greenland and North America, and cooling over eastern Asia and North Pacific appears in both NeuralGCM and WACCM6, although the former has some sporadic cooling to the north of Canadian archipelago and less cooling in the North Pacific (Figures <ref type="figure">6d</ref> and <ref type="figure">6e</ref>). Moving upward to stratosphere, the discrepancy between the two models becomes evident. NeuralGCM exhibits a warming response over Siberia, while WACCM6 shows dominant Journal of Advances in Modeling Earth Systems  <ref type="figure">6a</ref> and <ref type="figure">6d</ref>). In TaiESM1, tropospheric temperature responses are similar, except the warming over land is weaker (Figure <ref type="figure">S17f</ref> in Supporting Information S1). The stratospheric responses, on the other hand, present a dipole pattern with positive center in Eurasia and negative one in North Alantic, distinct from NeuralGCM and WACCM6 (Figure <ref type="figure">S17d</ref> in Supporting Information S1).</p><p>Similar to the temperature responses, the geopotential height responses in NeuralGCM and WACCM6 exhibit similar spatial distributions, except in the stratosphere. At 1,000 hPa, NeuralGCM effectively captures the characteristic heat-low circulation response to local warming induced by sea-ice loss <ref type="bibr">(Deser et al., 2010;</ref><ref type="bibr">Peings &amp; Magnusdottir, 2014)</ref>, along with the elevated height extending from Greenland and the North Atlantic to Siberia (Figure <ref type="figure">7c</ref>), consistent with the WACCM6 simulations (Figure <ref type="figure">7f</ref>). Notably, in the North Atlantic sector, the height response resembles a negative NAO pattern. At 500 hPa, both models show an overall increase in geopotential height, with the strongest anomalies centered over the Barents-Kara Seas, where sea-ice loss is most pronounced. NeuralGCM simulates a slightly weaker Z500 response over the Barents-Kara Seas but exhibits a pronounced height increase over eastern Siberia (Figure <ref type="figure">7b</ref>), a feature absent in WACCM6 (Figure <ref type="figure">7e</ref>). Corresponding to these geopotential height responses, a general weakening of zonal wind is presented over midlatitudes in the lower and middle troposphere (Figure <ref type="figure">8</ref>). Compared to WACCM6, NeuralGCM produces a stronger zonal wind weakening over the North Atlantic but a weaker response over the North Pacific (Figures 8b, In panels (a) and (b), the contour line indicates that the response is statistically significant with a twotailed Student's t test at 95% confidence level, while that in (c) presents the difference of the responses between the two models is statistically significant.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Journal of Advances in Modeling Earth Systems</head><p>10.1029/2025MS005264 8c, 8e, and 8f). As expected, the geopotential height and zonal wind responses in the stratosphere differ substantially between NeuralGCM, WACCM6, and TaiESM1 (Figures <ref type="figure">7a</ref>, <ref type="figure">7d</ref>, 8a, and 8d and Figures <ref type="figure">S19a</ref>, <ref type="figure">S19d</ref>, S21a, and S21d in Supporting Information S1).</p><p>In summary, NeuralGCM effectively simulates key atmospheric responses to Arctic sea-ice loss in the troposphere, broadly consistent with WACCM6 and TaiESM1 simulations. These include localized warming to large </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Journal of Advances in Modeling Earth Systems</head><p>10.1029/2025MS005264 sea-ice decline, the heat-low circulation patterns, and a hemispheric negative NAM response or a negative NAO response in the North Atlantic, accompanied by a general weakening of mid-latitude jets. However, substantial discrepancies emerge in the stratospheric responses between the (the three models). The NeuralGCM may be improved in representing stratospheric dynamical and thermodynamical processes, although a careful assessment of its response within the inter-model range or not is needed. Overall, NeuralGCM serves as a valuable tool for investigating Arctic-midlatitude linkages through tropospheric pathways. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Blocking Response in the North Atlantic to Arctic Sea-Ice Loss</head><p>We now examine the response of atmospheric blocking in the North Atlantic sector to Arctic sea-ice loss. We first analyze the DJF ensemble-mean number of blocking days and the zonal wind at 200 hPa simulated by Neu-ralGCM under future and pre-industrial sea-ice conditions (Figures <ref type="figure">9a</ref> and <ref type="figure">9b</ref>). Blocking events predominantly occur along the periphery of the prevailing 200-hPa jet, forming an arc-shaped pattern with centers over Greenland, the British Isles, southern Norway, and the eastern Atlantic. Under future sea-ice conditions, blocking frequency increases north of this arc while decreasing to the south, relative to the pre-industrial simulations (Figures <ref type="figure">9a</ref> and <ref type="figure">9b</ref>). The blocking changes reflect the deceleration to the north and acceleration to the south of the North Atlantic jet (dashed contour lines in Figure <ref type="figure">9c</ref>). In WACCM6 and TaiESM1, the spatial distribution of blocking events exhibits a similar arc pattern, but with the high-frequency region (Figures <ref type="figure">9d</ref> and <ref type="figure">9e</ref> and Figures S23d and S23e in Supporting Information S1) is different from NeuralGCM's, a well-known bias in AGCMs <ref type="bibr">(Kwon et al., 2018)</ref>. WACCM6 and TaiESM1 show comparable blocking and zonal wind responses, but their magnitudes are notably weaker (Figure <ref type="figure">9f</ref> and Figure <ref type="figure">S23f</ref> in Supporting Information S1). In terms of the spatial pattern and magnitude of blocking days in ERA5, it is clear that NeuralGCM simulates more comparable features with ERA5 than WACCM6 and TaiESM1 (cf., Figures <ref type="figure">9a</ref> and <ref type="figure">9d</ref>, Figures <ref type="figure">S23</ref> and <ref type="figure">S28</ref> in Supporting Information S1). In particular, NeuralGCM simulates more frequent blockings above the British Isles and above southern Greenland, which are closer to those in ERA5. This result suggests that NeuralGCM, even using coarser spatial resolution, may be more reasonably capturing the spatiotemporal structure of extremes than traditional physics-based climate models. It is encouraging to see that the NeuralGCM exhibits much more realistic blocking pattern than typical AGCMs.</p><p>To investigate the mechanisms driving the blocking response to sea-ice loss, we examine the meridional temperature gradient (-dT /dy), the Eady growth rate, and the transient eddy heat flux (&lt;v &#697;T&#697;&gt; ) at 850 hPa simulated by NeuralGCM (Figure <ref type="figure">10</ref>). A weakened meridional temperature gradient extends from northeastern Canada to Iceland and the Barents-Kara Seas (Figure <ref type="figure">10a</ref>), indicating a reduction in baroclinic instability under sea-ice forcing. These regions largely coincide with or are adjacent to areas of lower tropospheric warming induced by regional Arctic sea-ice loss. Consistently, the Eady growth rate (Figure <ref type="figure">10b</ref>) is also reduced in these regions, leading to a weakening of transient eddy heat fluxes (Figure <ref type="figure">10c</ref>) and suggesting suppression of storm track activity. WACCM6 and TaiESM1 exhibit similar responses <ref type="bibr">(Figures 10d,</ref><ref type="bibr">10e,</ref><ref type="bibr">and 10f,</ref><ref type="bibr">and Figures S25d,</ref><ref type="bibr">S25e,</ref><ref type="bibr">and</ref> S25f in Supporting Information S1), indicating robust dynamical consistency across these models. Journal of Advances in Modeling Earth Systems 10.1029/2025MS005264</p><p>To further assess the response of the eddy-driven jet, we analyze the 850-hPa jet latitude in NeuralGCM simulations. The jet latitude of the ensemble-mean wind fields during DJF exhibits a southward shift in the future seaice simulations compared to the pre-industrial simulations, as indicated by the shifted histogram (Figure <ref type="figure">11a</ref>). The DJF-averaged ensemble-mean jet latitude in the future simulations is located at 47.64&#176;N (red vertical line in Figure <ref type="figure">11a</ref>), whereas in the pre-industrial simulations, it is positioned at 48.68&#176;N (blue vertical line in Figure <ref type="figure">11a</ref>). In contrast, WACCM6 simulates jet latitudes that are generally positioned farther north, with less latitudinal variability and a smaller difference between future and pre-industrial simulations (Figure <ref type="figure">11c</ref>). These findings suggest that NeuralGCM produces a stronger jet latitude response than WACCM6, consistent with its more pronounced blocking and stronger T responses (Figures <ref type="figure">9c</ref> and <ref type="figure">9f</ref>). We further examine the frequency distribution of jet latitude occurrence across DJF days and 200 ensemble members in both models (Figures <ref type="figure">11b</ref> and <ref type="figure">11d</ref>). Both future and pre-industrial simulations exhibit two distinct peaks in jet latitude, consistent with reanalysis data sets <ref type="bibr">(Kwon et al., 2018</ref><ref type="bibr">(Kwon et al., , 2020))</ref>. Under future sea-ice conditions, the southern peak becomes more pronounced, reflecting the southward shift of the ensemble-mean jet latitude. Additionally, the separation between the two peaks is more distinct in NeuralGCM than in WACCM6, highlighting differences in the ability of these models to simulate the eddy-driven jet in the North Atlantic sector. Also, the more realistic southerly jet position in Neu-ralGCM is consistent with more realistic number of blocking days over Greenland, British Isle and the North Sea. In TaiESM1, the jet position is biased northwards, but the difference between future and pre-industrial simulations is closer to NeuralGCM (Figure <ref type="figure">S27c</ref> in Supporting Information S1).</p><p>In summary, the above results reveal a mechanistic link between Arctic sea-ice loss and blocking responses in the North Atlantic sector can be simulated by NeuralGCM. The sea-ice loss, particularly in Hudson Bay and the periphery of Greenland, induces localized warming, which weakens the meridional temperature gradient and reduces baroclinicity from northeastern North America to the Barents-Kara Seas. This weakening of baroclinicity suppresses storm track activity, as indicated by a reduction in synoptic transient eddy heat flux, leading to a southward shift of the eddy-driven jet through eddy-mean flow interactions <ref type="bibr">(Novak et al., 2015;</ref><ref type="bibr">Shaw, 2019)</ref>. The resulting atmospheric circulation regime favors Rossby wave breaking on the poleward flank of the jet <ref type="bibr">(Barnes &amp; Hartmann, 2010;</ref><ref type="bibr">Davini &amp; Cagnazzo, 2014;</ref><ref type="bibr">Kwon et al., 2018;</ref><ref type="bibr">Rivi&#232;re, 2009;</ref><ref type="bibr">Woollings et al., 2008)</ref>, increasing the frequency of blocking events over Greenland and Iceland while decreasing blocking occurrences along the western European coast. All these processes are well-represented in NeuralGCM that give rise to more realistic blockings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusions and Discussion</head><p>In this study, we utilize NeuralGCM, a hybrid ML atmospheric model, to investigate atmospheric responses to Arctic sea-ice loss for the first time. Following the PAMIP protocol, we conduct simulations forced by preindustrial and future sea-ice conditions, generating 200 ensemble members. Although NeuralGCM simulates an overall warmer near-surface temperature than WACCM6, a fully dynamical climate model, the internal variability (estimated by the spread across 200 ensemble members) in both models are comparable. NeuralGCM simulates Arctic near-surface warming by 2.64 K during boreal winter in response to sea-ice loss, aligning with the warming by 2.90 K produced by WACCM6. In line with the reduced meridional temperature gradient, the tropospheric jet stream weakens in both models. The spatial distribution of the geopotential height response presents a localized heat-low circulation in the lower troposphere, corresponding to regional Arctic warming hotspots, and a negative NAO-like pattern in the North Atlantic sector in the lower and middle tropospheres, and a weakened midlatitude jet stream. These findings demonstrate the capability of NeuralGCM as a viable hybrid atmospheric model for studying the tropospheric impacts of Arctic climate change. Journal of Advances in Modeling Earth Systems 10.1029/2025MS005264</p><p>Substantial discrepancies emerge between NeuralGCM and WACCM6 in the stratosphere, however. In high latitudes above 50 hPa, the temperature and the associated circulation responses exhibit opposite patterns in the two models. Analysis of EP flux divergence further reveals that it also differs in sign, due to more meridional fluxes leaving the high-latitude stratosphere in NeuralGCM, contributing to the differences in zonal wind response. Additionally, these inconsistencies in stratospheric responses lead to distinct stratosphere-troposphere coupling characteristics, with NeuralGCM simulating an overly strong interaction between the two atmospheric layers. These pronounced differences underscore the need for caution when applying NeuralGCM to study atmospheric responses to Arctic sea-ice loss via stratospheric pathways and highlight the necessity of improving its stratospheric representation.</p><p>A key question immediately arises: what factors contribute to the discrepancies between NeuralGCM and WACCM6? To address this question, we first examine the influence of spatial and vertical resolution, as Neu-ralGCM operates at a horizontal resolution of 2.8&#176;in both latitude and longitude with a low-top vertical configuration, whereas WACCM6 employs a higher resolution of 0.9&#176;latitude &#215; 1.25&#176;longitude and a high-top configuration. Previous studies have suggested that model resolution significantly affects stratospheric representation and extratropical atmospheric variability (e.g., <ref type="bibr">Serva et al., 2024)</ref>. To investigate this effect, we conduct parallel NeuralGCM simulations with an increased horizontal resolution of 1.4&#176;in both latitude and longitude. This refinement leads to a decrease in the Arctic and global ensemble-mean T1000 responses and an increase in the internal variability of the global T1000 response (red dots in Figure <ref type="figure">1d</ref>), making them more comparable to the WACCM6 results. The effect of internal variability on other dynamical variables require further examinations. In the troposphere, the overall response weakens slightly (Figures <ref type="figure">S16</ref>, <ref type="figure">S18</ref>, and S20 in Supporting Information S1). Conversely, in the stratosphere, the zonal-mean atmospheric responses become more pronounced (Figures S8 and S10 in Supporting Information S1), accompanied by enhanced stratosphere-troposphere coupling (Figure <ref type="figure">S14</ref> in Supporting Information S1). We hypothesize that the differences could stem from (a) the low-top configuration of NeuralGCM dynamical core, (b) the absence of a gravity-wave parameterization scheme, (c) differences in basic mean state, and (d) the regionality of sea-ice forcing.</p><p>To assess the first factor, we conduct similar experiments using TaiESM1 with a low-top configuration. Similar and consistent tropospheric responses are simulated by TaiESM1 (Figures S17f, S19f, and S21f in Supporting Information S1), but the stratospheric responses are still inconsistent with NeuralGCM (Figures S9, S11, and S15 in Supporting Information S1). Regarding the second factor, incorporating a neural network or ML-based gravitywave parameterization scheme (e.g., <ref type="bibr">Espinosa et al., 2022;</ref><ref type="bibr">Hardiman et al., 2023)</ref> or a well-trained stratosphere in future generations of NeuralGCM could help address this limitation. Indeed, the second factor can also be applied to any other parameterization scheme absent in NeuralGCM. For the third factor, comparing results from Neu-ralGCM to those from WACCM6 should be considered with caution, as WACCM6 could be biased in simulating the mean state and atmospheric responses to sea-ice loss. For example, the zonal-mean structure of temperature persistence in WACCM6 is smaller than the other models <ref type="bibr">(Lewis, Seviour, et al., 2024)</ref>. To address this issue, we also look into the results from TaiESM1. Indeed, the difference between the mean states and responses in the stratosphere in WACCM6 and TaiESM1 can be similar to or larger than the difference between these variables in WACCM6 and NeuralGCM. The tropospheric responses, on the other hand, are mostly consistent across these models. This again illustrates that the stratospheric responses to Arctic sea-ice loss are distinct in the three models. In future study, it would be informative and useful to use PAMIP simulations from different GCMs to provide a range of model structural uncertainty. For the last factor, we notice that the surface warming in the Pacific sector is stronger in WACCM6 than in NeuralGCM (Figures <ref type="figure">6c</ref> and <ref type="figure">6f</ref>), which could potentially lead to a stronger stratospheric polar vortex. <ref type="bibr">Sun et al. (2015)</ref> showed this connection between warming in the Pacific side and the stratospheric polar vortex using previous version of CESM. An extended study could use the regional sea-ice experiments designed in PAMIP to address this factor more carefully.</p><p>In NeuralGCM simulations, we find that wintertime blocking frequency increases in the northern branch of North Atlantic while decreasing in the southern part of the basin. This pattern is driven by Arctic sea-ice loss, which reduces baroclinic instability and weakens the eddy-driven jet. Additionally, a positive feedback mechanism between blocking and enhanced Rossby wave breaking may further reinforce these responses <ref type="bibr">(Michel &amp; Rivi&#232;re, 2011)</ref>. Notably, both the blocking response and the eddy-driven jet shift are stronger in NeuralGCM than in WACCM6 and in TaiESM1, and increasing the horizontal resolution in NeuralGCM weakens both the blocking response and the latitudinal shift of the jet (Figures S22-S25 in Supporting Information S1). One of the most promising findings from NeuralGCM is that its simulated blocking frequency more closely resembles that of Journal of Advances in Modeling Earth Systems <ref type="bibr">10.1029/2025MS005264 ERA5 data, compared to WACCM6 and</ref><ref type="bibr">TaiESM1 (cf. Figures 9a, 9d, Figures S22, S23, and</ref><ref type="bibr">S28 in Supporting Information S1)</ref>. This result suggests that NeuralGCM may be more capable of capturing highly impactful weather regimes, whereas WACCM6 and TaiESM1 might underestimate the these weather patterns and their responses to sea-ice loss. However, further rigorous analysis of other AI-based and traditional climate models is required to substantiate this conclusion.</p><p>Our results demonstrate that NeuralGCM is largely capable of simulating tropospheric responses to Arctic sea-ice loss. In the current NeuralGCM configuration, SST and SIC are prescribed as lower boundary conditions, thermodynamically forcing the atmosphere. However, previous studies have suggested that the absence of threeway coupling between sea ice, the ocean, and the atmosphere in AMIP-style simulations could influence atmospheric responses to Arctic sea-ice loss (see Figure <ref type="figure">1</ref> in <ref type="bibr">Screen &amp; Blackport, 2019)</ref>. This highlights the importance of investigating the role of atmosphere-ocean-sea ice coupling in Arctic-midlatitude linkages. The PAMIP framework includes coupled atmosphere-ocean simulations with sea ice constrained to pre-industrial and future states, similar to those used in this study. However, NeuralGCM does not yet support a fully coupled configuration. We are also aware that prescribed SIC could cause the spurious warming documented in recent literature <ref type="bibr">(Lewis, England, et al., 2024)</ref>. Caution is needed for this spurious warming in the sea-ice experiments using NeuralGCM in the future. Recent advances in ML Earth system models have enabled the development of data-driven coupled atmosphere-ocean configurations that can simulate climate dynamics and variability on seasonal to centennial timescales <ref type="bibr">(Cresswell-Clay et al., 2025;</ref><ref type="bibr">C. Wang et al., 2024)</ref>. Future sea-ice experiments using these coupled AI-based models hold promise for advancing our understanding of atmosphere-ocean-sea ice interactions and their role in Arctic-midlatitude linkages. Such efforts could provide valuable contributions to the PAMIP research community and stimulate novel experiment designs.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>19422466, 2025, 11, Downloaded from https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2025MS005264 by Yu-Chiao Liang -National Taiwan University , Wiley Online Library on [11/11/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License</p></note>
		</body>
		</text>
</TEI>
