<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Cosmology from CMB lensing and delensed &lt;math display='inline'&gt;&lt;mi&gt;E&lt;/mi&gt;&lt;mi&gt;E&lt;/mi&gt;&lt;/math&gt; power spectra using 2019–2020 SPT-3G polarization data</title></titleStmt>
			<publicationStmt>
				<publisher>American Physical Society</publisher>
				<date>04/01/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10608111</idno>
					<idno type="doi">10.1103/PhysRevD.111.083534</idno>
					<title level='j'>Physical Review D</title>
<idno>2470-0010</idno>
<biblScope unit="volume">111</biblScope>
<biblScope unit="issue">8</biblScope>					

					<author>F Ge</author><author>M Millea</author><author>E Camphuis</author><author>C Daley</author><author>N Huang</author><author>Y Omori</author><author>W Quan</author><author>E Anderes</author><author>A J Anderson</author><author>B Ansarinejad</author><author>M Archipley</author><author>L Balkenhol</author><author>K Benabed</author><author>A N Bender</author><author>B A Benson</author><author>F Bianchini</author><author>L E Bleem</author><author>F R Bouchet</author><author>L Bryant</author><author>J E Carlstrom</author><author>C L Chang</author><author>P Chaubal</author><author>G Chen</author><author>P M Chichura</author><author>A Chokshi</author><author>T-L Chou</author><author>A Coerver</author><author>T M Crawford</author><author>T de_Haan</author><author>K R Dibert</author><author>M A Dobbs</author><author>M Doohan</author><author>A Doussot</author><author>D Dutcher</author><author>W Everett</author><author>C Feng</author><author>K R Ferguson</author><author>K Fichman</author><author>A Foster</author><author>S Galli</author><author>A E Gambrel</author><author>R W Gardner</author><author>N Goeckner-Wald</author><author>R Gualtieri</author><author>F Guidi</author><author>S Guns</author><author>N W Halverson</author><author>E Hivon</author><author>G P Holder</author><author>W L Holzapfel</author><author>J C Hood</author><author>D Howe</author><author>A Hryciuk</author><author>F Kéruzoré</author><author>A R Khalife</author><author>L Knox</author><author>M Korman</author><author>K Kornoelje</author><author>C-L Kuo</author><author>A T Lee</author><author>K Levy</author><author>A E Lowitz</author><author>C Lu</author><author>A Maniyar</author><author>E S Martsen</author><author>F Menanteau</author><author>J Montgomery</author><author>Y Nakato</author><author>T Natoli</author><author>G I Noble</author><author>Z Pan</author><author>P Paschos</author><author>K A Phadke</author><author>A W Pollak</author><author>K Prabhu</author><author>M Rahimi</author><author>A Rahlin</author><author>C L Reichardt</author><author>D Riebel</author><author>M Rouble</author><author>J E Ruhl</author><author>E Schiappucci</author><author>J A Sobrin</author><author>A A Stark</author><author>J Stephen</author><author>C Tandoi</author><author>B Thorne</author><author>C Trendafilova</author><author>C Umilta</author><author>J D Vieira</author><author>A Vitrier</author><author>Y Wan</author><author>N Whitehorn</author><author>W_L K Wu</author><author>M R Young</author><author>J A Zebrowski</author><author>SPT-3G_Collaboration</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[From CMB polarization data alone, we reconstruct the CMB lensing power spectrum, comparable in overall constraining power to previous temperature-based reconstructions, and an unlensed E-mode power spectrum, with clear detections of the third through the tenth acoustic peaks. The observations, taken in 2019 and 2020 with the South Pole Telescope (SPT) and the SPT-3G camera, cover 1500 deg 2 at 95, 150, and 220 GHz with arcminute resolution and roughly 4.9 μK-arcmin coadded noise in polarization. The power spectrum estimates, together with systematic parameter estimates and a joint covariance matrix, follow from a Bayesian analysis using the marginal unbiased score expansion (MUSE) method. The E-mode spectrum at l > 2000 and lensing spectrum at L > 350 are the most precise to date. Assuming the ΛCDM model, and using only these SPT data and priors on τ and absolute calibration from Planck, we find H 0 ¼ 66.81 AE 0.81 km=s=Mpc, comparable in precision to the Planck determination and in 5.4σ tension with the most precise H 0 inference derived via the distance ladder. We also find S 8 ≡ σ 8 ðΩ m =0.3Þ 0.5 ¼ 0.850 AE 0.017, providing further independent evidence of a slight tension with low-redshift structure probes. The ΛCDM model provides a good simultaneous fit to the combined Planck, ACT, and SPT data, and thus passes a powerful test. Combining these CMB datasets with BAO observations, we explore extensions to the ΛCDM model. We find that the effective number of neutrino species, spatial curvature, and primordial helium fraction are consistent with standard model values, and that the 95% confidence upper limit on the neutrino mass sum is 0.075 eV, close to the minimum sum expected from observations of solar and atmospheric neutrino oscillations. The SPT data are consistent with the somewhat weak (<3σ) preference for excess lensing power seen in Planck and ACT data relative to predictions of the ΛCDM model given the combined Planck, ACT, and BAO datasets. We also detect at greater than 3σ the influence of nonlinear evolution in the CMB lensing power spectrum and discuss it in the context of the S 8 tension. Forthcoming SPT-3G analyses will feature deeper and wider observations in temperature and polarization, providing even tighter constraints and more powerful tests of the ΛCDM model.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>Since the successful completion of the Planck satellite's sky surveys <ref type="bibr">[1]</ref>, the frontier in the study of the cosmic microwave background (CMB) anisotropies has been its polarization. Here, we present the strongest cosmological constraints to date derived from polarization data alone, constraints that are, by some measures, comparable to those from temperature data alone. Low-noise polarization measurements allow for new tests of the standard cosmological model, as the cosmological parameter inference relies on different signals, and they allow for additional robustness to many sources of potential systematic error.</p><p>The data we use come from the SPT-3G camera <ref type="bibr">[2]</ref> installed on the South Pole Telescope (SPT) in early 2017. By October of 2024, SPT-3G had been used to survey 10;000 deg 2 , with the bulk of the Austral winter observing time spent on a 1500 deg 2 patch. The resulting data allow for powerful tests of the standard cosmological model, &#923;CDM, via the primary and CMB lensing signals <ref type="bibr">[3]</ref>, improved constraints on primordial gravitational waves via delensing of the degree-scale observations of the BICEP/Keck Collaboration <ref type="bibr">[4]</ref>, and many other science applications <ref type="bibr">[5]</ref><ref type="bibr">[6]</ref><ref type="bibr">[7]</ref><ref type="bibr">[8]</ref><ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref>.</p><p>This paper is the first of a series of papers on CMB power spectra, the CMB lensing power spectrum, and cosmological parameter estimates inferred from the 2019 and 2020 Austral winter observations of the 1500 deg 2 patch. The temperature and polarization maps used in these analyses are the result of 4 times as much observing time and twice as many detectors as were used for similar analyses that used a half season of data taken in 2018 with a partial SPT-3G focal plane <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref><ref type="bibr">[15]</ref>.</p><p>This paper is distinct from the others in the coming series by its focus on polarization data only, and by its use of a novel algorithm for optimally and simultaneously estimating the CMB lensing power spectrum, the unlensed<ref type="foot">foot_0</ref> CMB E-mode polarization power spectrum (EE), and systematics parameters. This algorithm, the marginal unbiased score expansion (MUSE), was developed by Millea and Seljak <ref type="bibr">[16]</ref> and Millea <ref type="bibr">[17]</ref>, with its precursors discussed in <ref type="bibr">[18,</ref><ref type="bibr">19]</ref>. This is the first time the MUSE method has been applied to real data. We present the analysis in some detail and emphasize its advantages, including straightforward incorporation of systematic effects into a statistical model, a lensing potential map with lower noise than if it were produced by a quadratic estimator [e.g., <ref type="bibr">[20]</ref> ], and an unlensed EE power spectrum with reduced sample variance relative to any inference of the lensed EE power spectrum <ref type="bibr">[21]</ref>.</p><p>Delensed power spectra from quadratic estimators have been estimated previously from data by Planck Collaboration et al. <ref type="bibr">[22]</ref> and Han et al. <ref type="bibr">[23]</ref>. The methodology for optimal lensing reconstruction has been principally developed in <ref type="bibr">[16,</ref><ref type="bibr">[24]</ref><ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref>; and an application to SPTpol data was described in <ref type="bibr">[30]</ref>. With the SPT-3G data we analyze here, we are crossing the threshold of polarization noise level below which optimal methods lead to significant improvements in lensing reconstruction.</p><p>We focus solely on polarization anisotropies in order to avoid modeling extragalactic foregrounds and their non-Gaussian properties. Such modeling would likely be necessary for a similar analysis which included temperature anisotropies, where the relative impact of foregrounds is larger. Due to the low noise and high angular resolution of the SPT-3G measurements, this initial conservative approach still leaves us with an inferred CMB lensing power spectrum with high signal-to-noise ratio. In the part of sky we observe, over the angular scales used in the analysis, galactic foregrounds are also negligibly small <ref type="bibr">[12,</ref><ref type="bibr">31]</ref>.</p><p>We compare with other CMB measurements in both temperature and polarization, including those from the Planck Collaboration <ref type="bibr">[32]</ref> and the Atacama Cosmology Telescope (ACT) Collaboration <ref type="bibr">[33]</ref><ref type="bibr">[34]</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref>. We check for consistency of our results in the context of the &#923;CDM model. We also determine cosmological parameters assuming the &#923;CDM model and extensions from SPT data alone and in combination with these other datasets. Although the SPT observations we use here cover much less sky relative to Planck and ACT, the very low noise (4.9 &#956;K-arcmin in coadded polarization maps) and arc-minute angular resolution enable measurements of the EE power spectrum and CMB lensing potential power spectrum (&#981;&#981;) that are the most precise measurements ever made at l &gt; 2000 for EE and L &gt; 350 for &#981;&#981;.</p><p>We discuss how the signals in our data allow for the determination of cosmological parameters assuming the &#923;CDM model. The origins of these determinations have interesting differences compared to those from Planck CMB data. For example, the Hubble constant (H 0 ) inference depends heavily on the lensing information; without it, the error increases by more than 10 times. Consistency between the Planck and SPT inferences of H 0 places further constraints on attempts to solve the H 0 tension <ref type="bibr">[37]</ref> with changes to cosmological models <ref type="bibr">[38,</ref><ref type="bibr">39]</ref>.</p><p>We pay particular attention to the implications of our data for the "negative neutrino mass" problem and its relation to excess lensing power pointed out by Craig et al. <ref type="bibr">[40]</ref>, and to the suggestion from Amon and Efstathiou <ref type="bibr">[41]</ref> that the &#963; 8 tension is associated with nonlinear evolution, either due to mismodeling the &#923;CDM predictions or by neglect of some new physics that modifies evolution on nonlinear scales.</p><p>The rest of this paper is constructed as follows. In Sec. II, we describe the process for making CMB maps from real and mock observations. We give an overview of MUSE and our data modeling in Sec. III. We present our pipeline and data validation processes, the blinding procedure, and results in Sec. IV. We show the resulting bandpowers and cosmological parameter constraints in Sec. V. We conclude in Sec. VI. Throughout this work, we use a series of abbreviations for external datasets tabulated in Table <ref type="table">III</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. DATA AND MAPMAKING</head><p>We use data taken with the SPT-3G camera during the 2019 and 2020 winter observing seasons. The 1500 deg 2 SPT-3G winter footprint extends from -42&#176;to -70&#176;in declination and -50&#176;to 50&#176;in right ascension. The footprint is divided into four subfields with declination centers of -44.75&#176;; -52.25&#176;; -59.75&#176;, and -67.25&#176;, and that allows for fitting different systematics parameters between patches that improve detector response consistency and linearity across the entire field. Observations are performed in twohour periods and map a single subfield at a time. During one observation, the telescope performs 72 constant-elevation (and thus constant-declination) scans with a scan speed of 1 deg =s in right ascension and an elevation step of 12.5 arcmin between scans. The subfields are observed 1034, 902, 772, and 578 times, respectively, across the two seasons, which produces approximately uniform map noise levels across subfields.</p><p>Each of the &#8764;16; 000 detectors in the SPT-3G camera samples the sky as the telescope scans, producing onedimensional "timestreams" that record sky brightness as a function of right ascension. The timestreams used in this analysis have on-sky sample rates between 0.3 arcmin=sample and 0.6 arcmin=sample depending on declination, corresponding to an effective Nyquist frequency well beyond our maximum multipole of interest.</p><p>The timestreams are low-pass filtered to reduce aliasing when later binning into maps, and high-pass filtered to remove large-scale instrumental and atmospheric noise. The high-pass filter is performed by deprojecting a set of Legendre polynomials up to 30th order and a set of sines and cosines up to an equivalent angular scale of l &#8804; 300.</p><p>The filtering described above can produce undesired "ringing" features in the maps around the locations of point sources (a term we will loosely use here to describe both emissive galaxies and the Sunyaev-Zeldovich effect from galaxy clusters, the latter which can be spatially extended). To reduce these features, when fitting the deprojection coefficients, we ignore regions around point sources, specifically around all emissive sources with 150 GHz flux &gt;6 mJy and all galaxy clusters with a signal-to-noise &gt;10, for a total of 2655 objects. This procedure reduces ringing but does not mask the point sources, which instead happens at the map level later in the pipeline.</p><p>After filtering, we bin the timestream samples using inverse-variance weights into 0.5625 0 pixels given a Lambert projection centered on a right ascension of 0&#176;a nd a declination of -59.55&#176;. Weights are determined from the detector noise in the range 320 &lt; l &lt; 4000. The pixel size was chosen so that the 1500 deg 2 patch fits into maps with a number of pixels on each side that are powers of two, which maximizes the efficiency of FFTs. For this resolution, this yields 4096 &#215; 8192 maps.</p><p>After maps have been made for each observation in the 2019 and 2020 winter observing seasons, we sum these maps into a full-depth "coadd" with a small number of data cuts. In addition to full-depth coadds, we also construct 500 "sign-flip" noise realizations by summing the observations with a random half of them multiplied by -1, canceling the signal but leaving the statistical properties of the noise unchanged. We use these to quantify the noise covariance, and to directly add to simulated signal maps to produce highly realistic simulations of the full dataset. We estimate that the noise realizations are correlated with each other at the percent level due to the finite number of observations in the dataset, but this is below the level which can significantly impact this analysis.</p><p>Next, we apply an isotropic antialiasing filter to the 0.5625 0 maps and rebin them to the final 2.25 0 resolution, 1024 &#215; 2048 pixel analysis maps. We note that the reason for not binning the timestreams directly into the final 2.25 0 resolution is exactly to be able to apply this antialiasing filter, as otherwise nothing would prevent aliasing in the scan-perpendicular direction. The final Nyquist frequency for these maps corresponds to l &#188; 4800, high enough to accommodate the l max &#188; 4000 of our analysis while at the same time minimizing the pixel count and hence, the computational cost of MUSE.</p><p>Finally, we apply the pipeline masks described in Sec. III C. These processed and masked 1024 &#215; 2048-pixel maps of Q and U stokes polarization at 95, 150, and 220 GHz are the basic data, d, which is input to our analysis pipeline.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. METHODOLOGY: BAYESIAN INFERENCE WITH MUSE A. Bandpower and systematics inference</head><p>We perform a map-level simultaneous Bayesian inference of the gravitational lensing potential bandpowers, the unlensed CMB EE bandpowers and systematic parameters. This procedure is described in Millea and Seljak <ref type="bibr">[16]</ref> and Millea <ref type="bibr">[17]</ref>, and summarized below with a few additions specific to this work. The central piece of the analysis is the joint posterior probability function, P&#240;f; &#981;; &#952;jd&#222;; &#240;1&#222; where f refers to the unlensed CMB maps (here, just polarization), &#981; to the gravitational lensing potential, &#952; to the bandpower and systematics parameter vector, and d to the maps described in the previous section. The exact details of our modeling and construction of this function are given in the next section. Here, we first discuss how we use it to perform inference. The goal of our analysis is to marginalize over the "latent" f and &#981; variables to produce the marginal posterior on just the parameters of interest, &#952;,</p><p>The constraints on &#952; described by this function are formally optimal, satisfying our goal of an optimal analysis. This optimality is achieved because, implicitly, the marginalization sources information from moments of the data at all orders, unlike, e.g., quadratic estimators, which only use second-order moments. In particular, this means our results use information from all combinations of the data such as EE, EB, BB, EEE, EEB, etc&#8230; Various techniques exist for performing the difficult highdimensional integration in Eq. <ref type="bibr">(2)</ref>. A popular but computationally expensive choice is to use Hamiltonian Monte Carlo to yield the exact answer, up to sampling errors. MUSE instead performs an approximate but much faster marginalization, which yields a multivariate Gaussian approximation to the marginal posterior,</p><p>for posterior mean &#952;, covariance &#931; MUSE , and unimportant constant, C. The Gaussian approximation is well-motivated due to the central limit theorem as long as many modes of f and &#981; contribute to the constraint on each &#952;, which is well-satisfied here.</p><p>The marginal posterior mean, &#952;, is given by solving the following equation for &#952;:</p><p>where s MAP i &#240;&#952;; d&#222; is the gradient of the joint posterior with respect to parameter &#952; i , evaluated at the maximum a posteriori ("MAP") estimate of z &#8801; &#240;f; &#981;&#222;, i.e.,</p><p>MAP i &#240;&#952;; d&#222; &#188; d d&#952; i log P&#240;&#7825;&#240;d; &#952;&#222;; &#952;jd&#222;j &#952; ; &#240;5&#222; with &#7825;&#240;d; &#952;&#222; &#188; arg max z P&#240;z; &#952;jd&#222;:</p><p>One way to understand this definition is to consider s MAP i as a data compression, which, for a given value of &#952;, takes as input our &#8764; million-dimensional data vector and returns another vector of the same dimensionality as &#952;, which in our case is around a hundred bandpower and systematics parameters. The MUSE algorithm then proceeds by picking a starting guess for &#952; and compressing the data according to this, generating a suite of data simulations given the same &#952; and compressing them as well, and finally iteratively searching for the value of &#952; which make the data and simulations most similar, as compared via this data compression [Eq. ( <ref type="formula">4</ref>)]. Intuitively, this yields the parameters &#952;, which are most likely to have actually generated the real data. More formally, Millea and Seljak <ref type="bibr">[16]</ref> show that following this procedure leads to an asymptotically unbiased estimate of parameters, &#952;, for any Gaussian or non-Gaussian latent space, z. For a Gaussian latent space, the data compression is a sufficient statistic, and therefore, &#952; gives the exact marginal posterior mean. For non-Gaussian latent spaces such as the one here, the data compression can be considered "lossy," but this leads only to a possible loss of optimality, with the asymptotic bias remaining zero. For a lensing analysis similar to the one performed here, Millea and Seljak <ref type="bibr">[16]</ref> bounds any loss of optimality to less than 10% of the statistical uncertainty.</p><p>The covariance, &#931; MUSE , is computed from</p><p>where</p><p>and &#931; prior is the covariance of any assumed prior. Note that this is not a pure Monte Carlo covariance of the MUSE estimate, which would be prohibitive computationally, but rather a cheaper Monte Carlo covariance of just s MAP , with analytic propagation to the resulting covariance in parameters. All derivatives in practice are computed with automatic differentiation, and Monte Carlo error stemming from J is reduced using a method described in Appendix C. One aforementioned aspect of MUSE, which is key for this analysis, is that it is unbiased even for "lossy" data compressions. This holds as long as the simulation-generating distribution, P&#240;dj&#952;&#222;, is an accurate representation of the real data. There is not actually any requirement on the form of s MAP i for MUSE to remain asymptotically unbiased and for the covariance expressions to be valid. The default choice for s MAP i given in Eq. ( <ref type="formula">5</ref>) is simply the one which makes MUSE exact for Gaussian problems.</p><p>Here, we keep this choice largely intact, but modify the joint posterior appearing in Eqs. ( <ref type="formula">5</ref>) and (6), P&#240;f; &#981;; &#952;jd&#222;, to be slightly different from the likelihood function used to generate the simulations, P&#240;dj&#952;&#222;. Normally, the two distributions would be related by</p><p>Instead, here we have one statistical model denoted as the posterior model, P&#240;f; &#981;; &#952;jd&#222;, and another called the simulation model, P&#240;dj&#952;&#222;.</p><p>In the analysis, the posterior model is used in computationally costly high-dimensional maximizations, while the simulation model is only ever used to generate a few hundred samples from P&#240;dj&#952;&#222;. This hints at the motivation for this choice of separate models, namely to allow defining a simulation model which fully meets our accuracy requirements, while keeping a simpler "surrogate" posterior model which is much faster to compute and makes the analysis more tractable. With an accurate simulation model, the COSMOLOGY FROM CMB LENSING AND DELENSED EE &#8230; PHYS. REV. D 111, 083534 (2025) MUSE analysis will remain unbiased, and we find little loss of optimality due to approximations in the posterior model. We note that our analysis can also be considered a form of "simulation-based inference" (SBI). This term is used to describe a class of algorithms which leverage the ability to generate samples from P&#240;dj&#952;&#222; directly into the ability to perform inference and has been of increasing interest in various domains in cosmology [see, e.g., <ref type="bibr">[42]</ref><ref type="bibr">[43]</ref><ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref><ref type="bibr">[47]</ref><ref type="bibr">[48]</ref> ]. In typical high-dimensional SBI applications to-date, an important first step is to obtain a suitable data compression function to reduce the dimensionality of the problem, often by training a neural network on the generated samples themselves <ref type="bibr">[49]</ref>. The compressed data is then used to infer parameters with something like approximate Bayesian computation (ABC) <ref type="bibr">[50,</ref><ref type="bibr">51]</ref>, or by training a neural network surrogate model for the likelihood which can be inverted to obtain the posterior [see <ref type="bibr">[52]</ref> ].</p><p>MUSE is similar in that it also involves a data compression function, but rather than learned with a neural network, the generic and semianalytic s MAP i is used instead. Furthermore, rather than using something like ABC or training surrogate models for inference, we exploit our access to gradients and Jacobians through the posterior and simulation-generating function to derive semianalytic expressions for the posterior mean and covariance. In this way, MUSE can be considered one of the possible augmentations of SBI suggested by <ref type="bibr">[52]</ref> which use additional tractable quantities from the simulator.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Cosmological parameter inference</head><p>The output of MUSE is a parameter mean, &#952;, and its covariance, &#931; MUSE . The inferred parameters for this analysis include the &#981;&#981; and unlensed EE spectra binned into bandpowers and systematics parameters. The systematicsmarginalized CMB bandpowers and covariance can be obtained by simply dropping the systematics from the estimate and dropping the corresponding block from the covariance (which is equivalent to marginalizing them). That is, part of the output of MUSE is automatically a set of "CMB-compressed" bandpowers such as those which have been derived for other experiments <ref type="bibr">[53]</ref><ref type="bibr">[54]</ref><ref type="bibr">[55]</ref>. We can then estimate cosmological parameters, &#947;, as usual, by forming a cosmological parameter likelihood,</p><p>where &#952;&#240;&#947;&#222; is our theory &#981;&#981; and unlensed EE spectra binned according to our specified bandpower window functions (Appendix A 1). For our main results we sample P&#240; &#952;j&#947;&#222; with Cobaya<ref type="foot">foot_1</ref>  <ref type="bibr">[56]</ref> and use CAMB<ref type="foot">foot_2</ref>  <ref type="bibr">[57]</ref> to compute theory spectra. To get the accurate theory CMB spectra at large l, we set lens_potential_accuracy=4 and lens_margin=1250 in CAMB. We also set halofit_version=mead2020 for the nonlinear correction in CAMB. The convergence criterion of MCMC is set to R -1 &lt; 0.01 of the Gelman-Rubin statistic.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Map-level model summary</head><p>The posterior and simulation models are specified in Eqs. ( <ref type="formula">12</ref>)- <ref type="bibr">(19)</ref>, with a summary given below and full details given in Appendix A.</p><p>Simulation model:</p><p>Posterior model:</p><p>where</p><p>Reading these models top-to-bottom describes how to produce a random sample from the model, with "&#8764;" used to represent generation of a random sample from the distribution on the rhs and N &#240;&#956;; &#931;&#222; a multivariate Gaussian with mean and covariance &#956; and &#931;.</p><p>To summarize, in the simulation model, we generate curved-sky unlensed CMB maps (f) and lensing potential maps (&#981;) from appropriate covariance operators (C), which are parametrized by the bandpower amplitudes which we will infer (A EE b and A &#981;&#981; b ). We apply the lensing operation (L) to produce lensed CMB maps and project to the flat-sky Lambert pixelization of the data (P). We apply an additional small deflection (G) to account for relativistic aberration due to our proper motion relative to the CMB rest frame. We apply the beam convolution operator [B&#240;&#946; n ; &#946; &#957; pol &#222;], which is parametrized by a set of beam eigenmode amplitudes (&#946; n ) and the polarization fraction of the beam sidelobes at each frequency (&#946; &#957; pol ). We rotate the global polarization angle of the maps [R&#240;&#968; pol &#222;] by an angle &#968; pol to model systematic errors in the overall polarization angle of our detectors. In each subfield, i, we scale the maps by A &#957;;i cal , to model the uncertain absolute calibration of the instrument. We apply the model transfer function (TF), and apply the pixel window function (PWF ). We add in temperature to polarization monopole leakage Q and U templates determined directly from our temperature observations (t Q and t U ) with subfield-dependent amplitudes (&#1013; i Q and &#1013; i U ), as well as add in a sample of the noise (n &#957; ) from one of the sign-flip noise realizations. Finally, we apply our chosen pixel, "trough," and Fourier masks (see Appendix A 9) to produce a simulated data vector, d &#957; .</p><p>The posterior model is nearly the same, with the only differences being that the covariances, C, assume flat-sky statistics, the noise is generated from a covariance and is unmasked, and aberration is ignored. For the posterior model, it is possible to calculate the posterior probability function, given in Eq. <ref type="bibr">(20)</ref>. Here, we also choose to include a "supersample" prior, P&#240;&#981;&#222;, in order to reduce the lensing mean-field (before its impact is automatically removed entirely in the process of obtaining the MUSE estimate).</p><p>The final inferred parameter vector is</p><p>containing 77 bandpower parameters and 47 systematics parameters. The MUSE estimate is a joint best-fit and a covariance between all of them. We note that the approximation of leaving the noise unmasked, while seemingly benign, is actually critical to obtaining a sufficiently fast posterior function since it avoids an additional large matrix inversion, while having little impact on the final results (see also Sec. 3.7 of <ref type="bibr">[30]</ref>). We also note that the ability to directly use the sign-flip noise realizations in the simulation model is extremely advantageous. Without this, a Bayesian analysis would depend on an accurate statistical model of all moments of the noise (power spectrum, bispectrum, etc&#8230;) and would require both sampling and evaluating the likelihood of a noise map given such a model. Here, noise modeling is essentially a nonissue since MUSE requires only sampling the noise, and the sign-flip realizations provide these samples and empirically match the true noise distribution.</p><p>There are a few effects which we considered but ultimately chose not to include the model. The first effect is polarized foregrounds. The dominant expected foreground comes from polarized point sources. For the same point source flux cut used here, Dutcher et al. <ref type="bibr">[12]</ref> construct a set of priors on the foreground amplitudes based on measurements of temperature foregrounds scaled to polarization with very conservative priors on the polarization fraction. Based on this, we find that the maximum possible contamination has a power 2 orders of magnitude lower than our noise spectra at all frequencies. That the foreground contribution is small is also consistent with <ref type="bibr">[58]</ref>, in which it is found that even for much deeper observations than ours, the bias to lensing from polarized foregrounds is fairly marginal. As mentioned, this finding that the foreground contribution is negligible in polarization for this analysis was key in our decision to focus only on polarization in this paper.</p><p>The second effect we ignore is the bias due to spatial correlation between the lensing potential and the location of our masked point source holes <ref type="bibr">[59]</ref>. Using the Agora simulations <ref type="bibr">[60]</ref>, which include a model for these spatial correlations, we find that for our point source masking threshold, the maximum bias to C &#981;&#981; L would be &#8764;0.5%. However, Lembo et al. <ref type="bibr">[59]</ref> also show that this maximum bias is highly suppressed in an optimally filtered &#981; map such as the MAP &#981; maps which enter our analysis. We thus conclude this effect is also negligible here.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. VALIDATION OF PIPELINE AND DATA</head><p>Before computing our main results, we conducted a set of validation tests which we present in this section. To mitigate and quantify confirmation bias, we performed these tests "blind", that is, without comparison to data from other experiments or to theory models derived from these experiments. After unblinding, we allowed ourselves to make further changes to the pipeline before producing our final baseline results, but committed to reporting in this paper the parameters that we initially found before any such changes. This motivated and focused the validation work, and allows us to quantify any level of confirmation bias; the initial unblinded results could not be impacted by confirmation bias, only the shifts after unblinding could be. A full discussion of changes made after unblinding is at the end of this section (Sec. IV C).</p><p>Our chosen set of rules for blinding were 1) do not plot our bandpowers on the same plot as bandpowers or theory curves from any other previous analysis or experiment (including from SPT itself), or show bandpowers relative to any theory curves, 2) if we do plot bandpowers, remove all tick marks and tick labels and do not look at numerical values of bandpowers, 3) do not look at absolute cosmological parameters derived from bandpowers (internal null parameter differences, e.g., the parameters from 95 GHz minus parameters from 150 GHz were allowed).</p><p>This procedure left us blind to all but very gross departures from &#923;CDM and did not allow us to know if bandpowers were consistent with any other experiment before unblinding, while not being overly burdensome. We now describe the set of validation tests of our pipeline and data which we performed while blinded.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Pipeline tests 1. Simulated data</head><p>To validate our full bandpower and cosmological parameter estimation pipeline, we use an ensemble of highly realistic simulations of the full dataset which we call "mock simulations." The signal part of the mock simulations consists of a lensed CMB realization generated assuming a fiducial Planck &#923;CDM cosmology and a Gaussian foreground model based on Agora simulations <ref type="bibr">[60]</ref> that is calibrated against data at very high-l. These maps are processed through a procedure called mock observation, which generates simulated timestreams and simulates the observation and processing of the timestreams into maps at each frequency, using the identical code which processes the real data. <ref type="foot">4</ref> The noise part of the mock simulations comes from sign-flipped noise realizations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">End-to-end pipeline tests</head><p>To check that our pipeline is unbiased, we start with an end-to-end test of the pipeline on a set of 100 mock simulations. First, we compute MUSE estimates of bandpowers and systematic parameters on each mock simulation using all three 95 &#254; 150 &#254; 220 GHz polarization maps (hereafter, "all-frequency"). Then we repeat the same procedures on single-frequency polarization maps with the corresponding systematic parameters fixed to the all-frequency results (this is how we handle systematics in the single-frequency runs on real data as well).</p><p>In the left two panels of Fig. <ref type="figure">1</ref>, we show the mean bandpowers over 100 mock simulations compared to the input truth. The PTE of the difference between the mean and input truth and its conversion to a number of &#963; is given in the legend. 5 No bias is detected at 3&#963; significance from the 100 mock simulations. The scatter of the mean bandpowers is at the level of 10% of the statistical uncertainty, as expected from a test with 100 realizations. In the right panel of Fig. <ref type="figure">1</ref>, we also show the mean systematics parameters over 100 all-frequency mock simulations, here scaled by their 1&#963; uncertainty. These are also recovered with no significant bias. Single-frequency runs do not vary systematics parameters; hence, those estimates are not shown.</p><p>We next compare the empirical covariance of the 100 allfrequency estimates with the MUSE covariance given in Eq. <ref type="bibr">(7)</ref>. The left column of Fig. <ref type="figure">2</ref> compares the square root of the diagonal covariance entries. We see good consistency between the two except for A cal . This is because the mock simulations were made without varying systematic parameters within their prior, while the MUSE covariance takes the prior uncertainty of all parameters into account. This difference is particularly evident for prior-dominated parameters such as A cal .</p><p>In the right column of Fig. <ref type="figure">2</ref>, we compare the first and second off-diagonal elements in the bandpower covariance matrices, this time fixing the systematics in &#931; MUSE for a more like-to-like comparison. We see good agreement FIG. <ref type="figure">1</ref>. Verification that our pipeline recovers unbiased bandpower and systematics parameters using 100 mock simulations. The colored lines are the average MUSE parameter estimates over simulations after subtracting the input simulation truth (in the final panel, we also divide by the error bars for one simulation due to the otherwise large dynamic range). The first two panels show &#981;&#981; and unlensed EE bandpower amplitude parameters (and also consider single-frequency runs), and the last panel groups together all the systematics parameters (see Sec. III C and Appendix A for description of the systematics considered). The PTEs of the colored lines relative to the expected scatter and 2-tailed numbers of &#963; are shown in the legends. To get the PTEs, we Monte Carlo sample the expected distribution of &#967; 2 values, accounting both for the look-elsewhere effect of four tests (three single-frequency runs and one all-frequency run) and the fact that we compute the expected scatter itself from 100 mocks. For reference, gray bands are the all-frequency 1 and 2&#963; errors (note that this is not the expected scatter of any of the lines, rather is a way to judge the size of any potential bias).</p><p>between the empirical covariance and the MUSE covariance. The anticorrelation in the first off-diagonal elements is caused by projection effects, masking, delensing, and the transfer function, where the projection effects explain about half of the anticorrelation. Here, the projection effects refers to our use of the flat-sky approximation in the posterior model. This effect has been found in the previous SPT-3G analysis <ref type="bibr">[31]</ref>.</p><p>In Fig. <ref type="figure">3</ref>, we show the MUSE covariance with its diagonal entries rescaled to unity (i.e., the correlation matrix between all bandpowers and systematics parameters). We find no significant correlation between lensing bandpowers and any other parameter, except for A &#981;&#981; at the lowest L bin, which shows correlations with A EE at &#8818;10% level and with &#946; pol at &#8764;12% level. This suggests a robust determination of the lensing spectrum with little uncertainty from systematic effects. The correlations within the unlensed EE block are due to masking, delensing, projection and correlation with systematics parameters. The EE bandpowers are significantly correlated with both calibration uncertainty and the sidelobe polarization fraction, both of which add appreciable systematic uncertainty to our final EE estimates.</p><p>We next obtain &#923;CDM cosmological parameter estimates from the 100 all-frequency mock simulations. Unlike the main results which use Cobaya and CAMB to perform parameter estimation, here, for speed, we have trained and verified the accuracy of an emulator using Capse.jl <ref type="bibr">[61]</ref>, and sample using Hamiltonian Monte Carlo.</p><p>The one-dimensional cosmological parameter posterior distributions inferred from each mock simulation and the product over all of these posteriors is shown in Fig. <ref type="figure">4</ref>. It is expected that the product over all posteriors should cover the input truth, which is found to be the case. The nondetection of any bias with 100 sims suggests that, at 68% confidence, we have bounded any possible pipelineinduced systematic error to be less than &#8818;0.1&#963; for each cosmological parameter.</p><p>To summarize, we have validated the pipeline using 100 highly realistic mock simulations. With 100 simulations, we are able to detect biases at the level of 10% of the statistical uncertainty. We do not detect any such biases to within 3&#963; of the expected scatter on the mean in either our estimates of &#981;&#981; lensing spectra, unlensed EE bandpowers, systematics, or in the cosmological parameters derived from them. We find good agreement of the bandpower covariance matrices between the one derived from mock simulations and the one calculated in MUSE using Eq. <ref type="bibr">(7)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Data tests</head><p>Next, we turn to tests of the data itself. These include intrafrequency null tests using different splits of the data, inter-frequency comparison of &#981;&#981; and unlensed EE bandpower results, pixel-level &#967; 2 tests of our model fits, and internal comparison of our systematics estimates with those derived with more traditional methods. We report the results of these tests in this section.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Intrafrequency null tests</head><p>Null tests involve splitting the full dataset into two parts and differencing them such that the signal cancels, then checking whether the remaining portion is consistent with FIG. 2. Verification of the accuracy of the all-frequency MUSE covariance, &#931; MUSE . (Top left) Comparison of the square root of the diagonal elements between the MUSE covariance, &#963; MUSE &#8801; ffiffiffiffiffiffiffiffi ffi diag p &#240;&#931; MUSE &#222;, and the empirical covariance from running the estimate on mock simulations. (Bottom left) The ratio of the two curves in the upper panel. The shaded band shows the 1 and 2&#963; standard error on &#963; empirical from the 100 simulations used. These are expected to match for fully likelihood-dominated parameters, and do so for all parameters except the partially priordominated A cal parameters. (Top and bottom right) Comparisons of the first and second off-diagonal elements of the MUSE and empirical correlation matrices. In this case, systematics are assumed fixed in the MUSE covariance, since they are fixed in the simulations.</p><p>the expected statistical properties of noise. Finding something outside of this expectation could indicate the presence of a systematic, which might then also be present in the full dataset coadd at some level. By canceling the signal and thus, the sample variance from the signal, these tests can probe systematics to levels well below even what could impact the final results, which are themselves limited by sample variance. Since temperature data does enter our analysis via leakage templates, we also present null TE spectra to more thoroughly vet the full temperature and polarization dataset. Further details of the null tests will be presented in a future work.</p><p>We consider six types of data splits: sun, moon, azimuth, year, scan, and wafer. The sun and moon tests are designed to check whether a portion of the dataset is significantly contaminated by the radiation from the sun or moon. In these tests, one part of the dataset comprises the data we took when the sun or moon was above the horizon, and the other part below the horizon. The azimuth test is designed to check whether a portion of the dataset is contaminated by thermal radiation from a building near the telescope. One part of the dataset comprises the data we took when the telescope's azimuth angle was within 90&#176;of that of the building, and the other part azimuth angles outside that range. The year test is designed to check whether the data we took during the 2019 season are significantly different from the data we took in 2020. The scan test is designed to check whether the data we took when the telescope was scanning in one direction are significantly different from the data we took when the scanning was in the opposite direction. Lastly, the wafer test is designed to check whether the data from one half of the full detector array are significantly different from the data from the other half.</p><p>For the scan and wafer tests, we do not expect the null spectra to perfectly cancel the signal due to details of our map-making pipeline. For the wafer test, this is because we filter time-ordered data from different detector wafers differently. For the scan test, this is mainly because we do not correct for each individual detector's time constant effect when converting its time-ordered data into map pixel values. The time constant effects refers to the fact that each detector does not respond to changing radiation from the sky instantaneously. As a result, a map made from leftgoing scans contain a version of the CMB shifted to the left, and vice versa for right-going scans. For these two tests, we calculate the expected null spectra through simulations of the map-making pipeline and subtract it. This is computed assuming a fiducial cosmology, but because these expected null spectra are very small compared to the signal, it does not significantly imprint model dependence to these null tests or degrade their usefulness.</p><p>For each null test type, we divide each of the two halves of the data into 25 smaller parts and call them bundles. We create 25 null maps by differencing bundles from each half one-by-one, then compute all 300 cross spectra from these 25 null maps and bin them into the same bandpowers as the analysis. The average of these is what we call a null spectrum, and the standard error is our estimate of the expected scatter of the null spectrum.</p><p>The maps used for these tests are curved-sky projections of the same timestreams which enter the main analysis, with spectra computed with PolSpice <ref type="bibr">[62]</ref>. For convenience, most of the tests have been computed using slightly different masks than the ones used to produce our main results. These tests, which we do not expect to depend significantly on the masking choice, use a pixel mask which includes slightly more sky, and a harmonic space mask, which zeros all a lm for l &#8712; &#189;500; 680 and m &#8712; &#189;350; 425. This harmonic mask removes contamination from a narrow-band signal around 1.1 Hz, which is similar, but not exactly the same as the "trough" mask used to remove the contamination in the MUSE pipeline. As the 1.1 Hz signal presents itself mainly in detector wafer and scan direction tests, we have recomputed these with the exact MUSE pixel and Fourier masking to verify the MUSE masking is sufficient to remove it. FIG. <ref type="figure">4</ref>. Verification that our pipeline recovers unbiased cosmological parameters using 100 mock simulations. We infer &#923;CDM parameters from each simulated all-frequency dataset and plot the one-dimensional marginalized posteriors above as different colored lines for each simulation. The shaded gray region in each panel is the product over all posteriors, which should cover the simulation truth, denoted as vertical black lines. Note that the smaller scatter for &#964; is because this parameter is mainly constrained by our prior. Across the other five parameters, the shaded region is consistent with the input truth with significance equivalent to 1.1&#963;, suggesting no detection of any bias at the level afforded by our 100 simulations.</p><p>After masking, computing null spectra, and subtracting expected contributions as necessary, we calculate a &#967; 2 with respect to the expected scatter, assuming no correlation between neighboring bandpowers. To compute a threshold probability-to-exceed (PTE) for the observed &#967; 2 , we Monte Carlo sample the expected distribution of &#967; 2 values, accounting both for the look-elsewhere effect of 54 tests and the fact that we compute the expected scatter itself from a finite 300 simulations. We consider the test passed if the PTE is &gt;5% (corresponding to a &lt;2&#963; deviation) with respect to this Monte Carlo distribution. This corresponds to a PTE &gt;0.07% on any single individual test amongst our suite (this is roughly 5%=54 with an additional correction for the slightly noisy expected scatter).</p><p>A summary of all tests and their PTEs is given in Fig. <ref type="figure">5</ref>. All tests pass at our threshold of 0.07%, and no obvious features are otherwise visible.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Inter-frequency agreement</head><p>We next check for inter-frequency agreement of the &#981;&#981; and unlensed EE bandpowers inferred from single-frequency maps. We follow the same procedure for inference as described in Sec. III, except that the data, d, includes only a single-frequency map and the systematic parameters are fixed to the value derived from the all-frequency run; only the bandpowers A EE b and A &#981;&#981; b are free parameters. To quantify the expected scatter, we run this identical procedure on 100 mock simulations.</p><p>In Fig. <ref type="figure">6</ref>, we show the ratios of bandpowers inferred from maps at different frequencies relative to the expected scatter. The blue lines show our result prior to unblinding, which showed good overall agreement. The legends give 2-tailed effective number of &#963; for the &#967; 2 values, and all are consistent with statistical fluctuations to within AE2&#963;. We also note that with model changes after unblinding, shown as orange lines, agreement improved slightly, particularly in the comparison between 95 GHz and 150 GHz.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Goodness-of-fit</head><p>We next compare the pixel-level goodness-of-fit of our model to the observed data. To do so, we define &#967; 2 &#188; -2 log P&#240;&#7825;&#240; &#952;; d&#222;; &#952;jd&#222;, that is, minus twice the posterior probability function evaluated at the MUSE estimate of parameters, &#952;, and the MAP given that &#952;. Because of our decoupling of the posterior and simulation models, we cannot compute the expected &#967; 2 simply from counting degrees of freedom. Instead, we use the distribution of &#967; 2 obtained from mock simulations to quantify the expected distribution.</p><p>Table <ref type="table">I</ref> gives the data &#967; 2 values observed for singlefrequency and all-frequency runs before unblinding, as compared to this expectation. We see that the values are on the order of 10 8 , despite that the number of pixels in d is close to 10 7 for the all-frequency case, confirming the fact that the expectation is not given simply by counting degrees of freedom. That the single-frequency expected &#967; 2 are larger than the all-frequencies case is due to the fixing of systematic parameters in those cases. Regardless, with the expected distribution mapped out by simulations, we find that our observed values in all cases are consistent with a FIG. <ref type="figure">5</ref>. Spectra of null maps computed from various data splits divided by the expected 1&#963; scatter due to noise. Detector wafer and scan direction tests also have had a small expected contribution from signal leakage subtracted, and in EE and BB have been computed with the exact pixel and Fourier space masking used in the main results pipeline (indicated as "MUSE masking"; the other tests are not expected to depend significantly on masking). Legends give the PTE for each test without correcting for the look-elsewhere effect. The 2&#963; threshold when considering the look-elsewhere effect is a PTE &gt; 0.07% on any individual test, which is passed in all cases. statistical fluctuation within 3&#963;. This suggests our model is sufficient to describe the observed data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Agreement of systematics with alternative estimates</head><p>Our analysis infers systematic parameters jointly with bandpowers by incorporating systematics into a map-level model for the data. In this section, we compare our systematics estimates to independent determinations from alternate methods which have traditionally been used to estimate systematics in CMB analyses.</p><p>In this comparison, we consider the global polarization rotation angle, &#968; pol , the amplitude of monopole temperature-to-polarization leakage, &#1013; i Q and &#1013; i U , and the calibration at each frequency, A &#957;;i cal . Details of how these are estimated with alternate methods are given in Appendix A. Briefly here, &#968; pol can be estimated by assuming the sky EB spectrum is zero, and searching for the angle correction which achieves this in the data. &#1013; i Q and &#1013; i U can be estimated by assuming that the sky TQ and TU spectra are zero and finding the deprojection of leakage templates, which achieves this. A &#957;;i cal can be estimated taking ratios of power spectra to a known calibrated spectrum and averaging over very broad multipole ranges. Here, A 150 cal is fit via ratio to Planck, which performs its own absolute calibration, and A 95 cal and A 220 cal are fit from internal comparison to the Planck -calibrated SPT 150 GHz spectrum.</p><p>The source of MUSE estimates of these systematics is generally the same as described above, but happens implicitly and jointly inside the global MUSE fit, driven by our model definition (e.g., the model assumes the CMB EB spectrum is zero and allows a global rotation controlled by &#968; pol to absorb any EB power observed in the data during the fit).</p><p>The best-fit systematic parameters inferred with MUSE from the SPT data are summarized in Table <ref type="table">II</ref>. Broadly speaking, we find around 1% temperature to polarization leakage, a little less than half of a degree of global polarization angle miscalibration, and beam sidelobes (see Appendix A 4) which are polarized at around the 50% level.</p><p>In Fig. <ref type="figure">7</ref>, we compare to systematic parameters derived from the alternative methods. Up to small differences expected due to slightly different choices of masking and filtering made in these alternative estimates, we find good agreement between our two sets of estimates. We also note that in general the MUSE systematic estimate is slightly tighter, likely due to simultaneously fitting for all systematics and increased optimality of the Bayesian approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Higher order T-to-P leakage</head><p>In our baseline model, only monopole T-to-P leakage is modeled. However, it is possible that higher-order leakage exists. To verify our baseline choice, we add leakage templates up to octopole in the MUSE model and infer their leakage amplitudes. These templates corresponds to higher-order spatial gradients of the observed temperature maps. The results are shown in Fig. <ref type="figure">8</ref>, where no higherorder T-to-P leakage amplitude parameters are detected at &gt;3&#963;. FIG. <ref type="figure">6</ref>. Tests of the inter-frequency bandpower agreement of our data. Each panel shows the ratio of &#981;&#981; or unlensed EE bandpowers from single-frequency MUSE runs at different frequencies, normalized by the 1&#963; expected scatter as computed from simulations. Legends show the &#967; 2 and 2-tailed effective number of &#963; fluctuation of this &#967; 2 given the number of degrees of freedom. The blue line shows the result using our model before unblinding, which yielded acceptable &#967; 2 in all cases. The orange line shows the result after the post-unblinding changes to our model, which notably slightly improves the agreement between 95 GHz and 150 GHz. TABLE I. The &#967; 2 of the best-fit pixel-level model against the data, as compared to the expectation computed from mock simulations, as described in Sec. IV B 3. The number of &#963; is 1-tailed. We find values within 3&#963; of the expectation for individual frequency fits, as well as for our baseline combination of all frequencies.</p><p>220 93.33 93.50 AE 0.12 4.0% 2.1 95 108.21 108.28 AE 0.18 37.0% 0.9 150 109.32 109.51 AE 0.19 7.0% 1.8 220 100.97 101.00 AE 0.04 24.0% 1.2</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">MAPs</head><p>We note that prior to unblinding we did look at MAP estimates of the lensing potential and of the lensed and unlensed CMB polarization, and thus present them in this section of the paper. These are shown in Fig. <ref type="figure">9</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Post-unblinding changes</head><p>With all of these validation tests passed, we unblinded results, fit cosmological models, and compared to data from other experiments. We found &#923;CDM was a good fit to our own data and that our &#981;&#981; spectrum agreed very well with other experiments. However, the EE spectrum disagreed with the Planck theory prediction by a smooth slope (shown in the middle panel of Fig. <ref type="figure">10</ref>), leading to an overall 3.5&#963; disagreement across five &#923;CDM parameters (excluding &#964;) between Planck and SPT datasets. The slope was most significant at small scales and would have been undetectable by Planck, ACT, or previous SPT measurements, thus could have indicated a newly discovered departure from &#923;CDM. We were thus motivated to exhaustively check the robustness of the slope. Doing so, however, we discovered that its likely origin was an overly strong assumption we had been making in our polarization beam modeling.</p><p>Prior to unblinding, we had assumed the polarized and temperature beams were the same at the scales used in the analysis. While the temperature beam sidelobes are mapped with high signal-to-noise via observations of bright point sources and planets, we lack sufficiently bright polarized sources to similarly map the sidelobe response of the polarized beams. To check the robustness of the EE slope to this assumption, we devised a model of the polarized beams to realistically account for how much these might differ from the temperature beams. Appendix A 4 describes the model in more detail, but in summary, we took the polarized beams to be the sum of two components. The first is a central main lobe which can be modeled physically using the basic optics of the telescope and is expected from physical modeling to be fully polarized and thus, the same</p><p>TABLE II. Posteriors on systematics parameters given our allfrequency MUSE run (top section) and priors which were input (bottom section). Blank entries or parameters not listed under priors have uniform priors on &#240;-&#8734;; &#8734;&#222;. Superscripts denote each of the four subfields. Note that no cosmological model is assumed here. Posterior 95 GHz 150 GHz 220 GHz A 1 cal 0.8977 AE 0.0074 0.9276 AE 0.0074 0.8612 AE 0.0084 A 2 cal 0.8928 AE 0.0072 0.9133 AE 0.0071 0.864 AE 0.0079 A 3 cal 0.8861 AE 0.0073 0.9399 AE 0.0075 0.849 AE 0.0083 A 4 cal 0.8775 AE 0.008 0.9272 AE 0.0081 0.8405 AE 0.0097 100&#1013; 1 Q 0.291 AE 0.025 0.319 AE 0.021 0.492 AE 0.059 100&#1013; 2 Q 0.402 AE 0.024 0.39 AE 0.022 0.418 AE 0.06 100&#1013; 3 Q 0.555 AE 0.025 0.838 AE 0.021 2.096 AE 0.066 100&#1013; 4 Q 0.603 AE 0.033 0.912 AE 0.03 2.221 AE 0.084 100&#1013; 1 U 0.584 AE 0.027 0.74 AE 0.025 0.735 AE 0.064 100&#1013; 2 U 0.648 AE 0.025 0.748 AE 0.023 0.642 AE 0.058 100&#1013; 3 U 0.851 AE 0.027 1.238 AE 0.023 1.33 AE 0.063 100&#1013; 4 U 0.83 AE 0.035 1.174 AE 0.03 1.121 AE 0.092 &#968; pol &#189;&#176; 0.393 AE 0.024 0.419 AE 0.021 -0.188 AE 0.079 &#946; pol 0.44 AE 0.20 0.60 AE 0.28 0.51 AE 0.26 Prior 95 GHz 150 GHz 220 GHz A 1 cal 0.9275 AE 0.0094 A 2 cal 0.9095 AE 0.0089 A 3 cal 0.9386 AE 0.0096 A 4 cal 0.927 AE 0.011 &#946; pol U&#240;0; 1&#222; U&#240;0; 1&#222; U&#240;0; 1&#222; FIG. 7. Agreement of polarization angle rotation angle (&#968; pol ), monopole T &#8594; P leakage (&#1013; Q and &#1013; U ) and calibration estimates (A cal ), between MUSE estimates and alternative methods which have been previously used. The groups of four points refer to the four different subfields in cases where those are estimated separately. Details of the alternative estimates are given in Appendix A. FIG. 8. No evidence for dipole, quadrupole, or octopole T &#8594; P leakage. These are MUSE posteriors from a run on all-frequency data that includes these extra leakage templates in addition to the other usual bandpower and systematics parameters. In our baseline run, we thus do not include such higher-order templates.</p><p>in polarization as in temperature. This model is fit to match temperature beams themselves at small scales where the model is a good description of the beam. The second component is a diffuse sidelobe stemming from effect such as scattering or reflection from of optical elements in the camera and scattering from panel gaps in the primary mirror, which is not necessarily expected to be fully polarized. We marginalize over the polarization fraction of the sidelobe component with a new parameter at each frequency, &#946; &#957; pol . In this definition, our earlier assumption had corresponded to &#946; &#957; pol &#188; 1, which makes the polarized beam identical to the temperature beam.</p><p>With these free parameters introduced, we reran the MUSE fit, finding &#946; &#957; pol &#188; &#189;0.44 AE 0.20; 0.60 AE 0.28; 0.51 AE 0.26 at <ref type="bibr">[95,</ref><ref type="bibr">150,</ref><ref type="bibr">220]</ref> GHz. That is, our CMB data alone are able to internally constrain the sidelobe polarization fractions, and it constrains them away from our previously assumed value of unity with mild significance. The ability to FIG. 9. Maximum a posteriori (MAP) maps from 95 &#254; 150 &#254; 220 GHz data at the MUSE estimate of theory and systematics parameters, &#952;. MAPs correspond to a filtering of the data which maximizes signal relative to noise (akin to a linear Wiener filter, but in the case of the MAP &#954;, a nonlinear filter). The statistical anisotropy visible particularly in the E map originates from this filtering. No other additional smoothing has been applied, although we have subtracted the simulation-mean of &#954;. Except for this subtraction, these MAPs correspond exactly to &#7825; which enters the MUSE estimate in Eq. ( <ref type="formula">5</ref>). We have chosen to show only unlensed E and lensed B (derived from the &#954; and unlensed E in the top two panels), as lensed and unlensed E look nearly identical at the scale of this figure, and unlensed B is effectively zero by assumption.</p><p>constrain these parameters at all arises from inter-frequency differences, as changing the &#946; &#957; pol changes the beam at each frequency differently. Indeed, if we reexamine the inter-frequency agreement of our data with the &#946; &#957; pol marginalized, we find slightly improved agreement. This is shown as the orange line in Fig. <ref type="figure">6</ref>. Thus, even though our original blind inter-frequency tests did not show any statistically significant evidence for disagreement, there was indeed some small systematic disagreement hidden below the noise, which is alleviated by this improved beam modeling.</p><p>Regarding the slope in EE relative to Planck &#923;CDM, the change from &#946; &#957; pol &#188; 1 to the new marginalized posterior values largely removes it. It also increases our EE uncertainties at l &gt; 2000 by as much as a factor of 2 for some bins, which, however, is highly correlated between bins, with the effect much like an additional relative calibration between high-l and low-l. The impact to our &#981;&#981; spectrum of this post-unblinding change is negligible, as optimal lensing estimation is largely insensitive to smooth changes to beams. Since much of our cosmological information is coming from lensing, it also means that our &#923;CDM constraints from SPT data are not that strongly affected; for example, we originally found H 0 &#188; 67.3 AE 0.77 with the sidelobe polarization fraction fixed to unity and H 0 &#188; 66.81 AE 0.81 with it marginalized. A larger impact occurs in the &#240;n s ; &#937; b h 2 &#222; plane, with both parameters shifting just over 1&#963; and the posteriors broadening. These changes are all summarized in Fig. <ref type="figure">10</ref>. Our final agreement with Planck across five &#923;CDM parameters is 0.5&#963;, with the change roughly coming from equal parts central value shifts and posterior broadening in the &#240;n s ; &#937; b h 2 &#222; plane.</p><p>We consider this beam model sufficient to capture our polarized beam uncertainty because we find no significant shifts in results if we change the main lobe model to a simple Gaussian, suggesting the exact details of the main lobe modeling are not relevant. Additionally, allowing for a free linear dependence of the sidelobe polarization fraction as a function of angular distance from the peak response does not significantly shift results either, suggesting the model of a constant sidelobe polarization fraction is sufficient.</p><p>With this improvement to the polarized beam modeling, some additional small tweaks mentioned in Appenidx A which lead to insignificant changes, and all validation tests passing at the same or better levels than they were prior to unblinding, we adopted this as our final baseline model.</p><p>All results we will now present in the following section utilize this baseline model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. RESULTS</head><p>In this section, we present our &#981;&#981; and unlensed EE bandpowers and resulting cosmological parameter constraints in various model spaces, with and without inclusion of some external datasets. Nomenclature for the external datasets, as well as our SPT-3G dataset, is described in Table <ref type="table">III</ref>. We follow the convention of naming cosmological parameters in <ref type="bibr">[63]</ref> (see Table . 1 there for detail). The setup for MCMC analysis is described in Sec. III B. For simplicity, we adopt a Gaussian prior, &#964; &#8764; N &#240;0.051; 0.006&#222;, using the result from the joint processing of Planck HFI and LFI data <ref type="bibr">[64]</ref> as baseline unless otherwise specified, instead of using Planck low-lEE likelihood or WMAP TE data at l &lt; 24. Unless otherwise specified, all uncertainties quoted in this paper represent the 68% confidence interval.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Bandpowers</head><p>In Fig. <ref type="figure">11</ref>, we show our inference of &#981;&#981; and unlensed EE bandpowers. The &#923;CDM model that best fits these data provides a good fit with &#967; 2 &#188; 75.8 given 71 degrees of freedom (76 bandpowers minus 5 cosmological parameters constrained beyond their prior), which corresponds to a PTE of 32.7%. By this test, the SPT data are consistent with the standard cosmological model. We also show as an orange band the 68% confidence level predictions of the &#923;CDM model for the CMB lensing and unlensed EE power spectra given these SPT data. FIG. <ref type="figure">10</ref>. Changes to our results made after unblinding, and their origin in changes to beams. In the left two panels, blue and orange data points show &#981;&#981; and unlensed EE bandpowers relative to Planck &#923;CDM theory for our final result and our initial unblinded result, respectively. Colored lines in the EE panel show ratios of temperature (dashed lines) and polarization beams (solid lines) between final and initial unblinded versions. For the polarized beam changes, we also show the posterior uncertainty from the final polarized beam fit with shaded bands indicating 1&#963;. In the right panel, the same two colors show the corresponding changes to cosmological parameter results from the SPT dataset. TABLE III. Glossary of dataset abbreviations used in the text and in figure captions. When combining SPT with WMAP and Planck data, we sum the likelihood directly assuming no correlation, an assumption justified by the small sky area of the SPT main field (1500 deg 2 ). When combining ACT DR4 with WMAP and Planck data, we followed the procedure in <ref type="bibr">[35]</ref>. When using both Planck NPIPE PR4 lensing and ACT DR6 lensing, we used the built-in likelihood implementation of act_dr6_lenslike package with variant=actplanck_baseline which accounts for correlations between ACT and Planck lensing due to overlapping sky coverage. a For the BAO dataset, we followed the proposal in Sec. III.3 of Adame et al. <ref type="bibr">[65]</ref> to replace the DESI BGS and lowest redshift LRG measurements with SDSS MGS (z eff &#8764; 0.15) and two SDSS DR12 LRG points (z eff &#8764; 0.38, 0.51), as well as replacing the DESI Ly&#945; point with joint eBOSS &#254; DESI Ly&#945; BAO. With the exception of Sec. V I, any CMB dataset combinations we use includes a prior on &#964; of 0.051 AE 0.006 <ref type="bibr">[64]</ref>. In Sec. V I, we instead drop the &#964; prior and PlanckT&amp;E is to be understood as including the Planck low-lEE likelihood.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Name</head><p>Dataset SPT This work, i.e., SPT-3G 2019=2020 b unlensed EE &#254; optimal &#981;&#981; from 95 &#254; 150 &#254; 220 GHz SPT&#981;&#981; Just the &#981;&#981; spectrum from the SPT dataset, with systematics and unlensed EE still marginalized over PlanckT&amp;E Planck 2018 high-lTT; TE; EE &#254; low-lTT [66] Planck&#981;&#981; Planck NPIPE PR4 &#981;&#981; [67,68] Planck PlanckT&amp;E &#254; Planck&#981;&#981; ACT&#981;&#981; ACT DR6 &#981;&#981; [33,34,69] ACT ACT DR4 TT; TE; EE [35,36,70] &#254; ACT&#981;&#981; WMAP WMAP9 [71] (l TE min &#188; 24, i.e., no &#964; information) [72] <ref type="bibr">[65,</ref><ref type="bibr">77,</ref><ref type="bibr">78]</ref> a When combining SPT with other CMB datasets, we ignore any error correlations. We expect these to be very small for two reasons. First, the relatively small area of the SPT-Main survey (at 3.5% of the full sky) means that the ACT and Planck errors predominantly arise from sources of fluctuation which are not in the SPT-Main survey region. Second, for the lensing likelihood specifically, SPT lensing bandpowers are reconstructed from CMB polarization only, while other CMB lensing reconstructions are predominantly informed by CMB temperature maps.</p><p>b The likelihood can be found in <ref type="bibr">[79]</ref>. We refer to this data in the figures as "2 yr-main"; the "main" field is the 1500 deg 2 field as defined in <ref type="bibr">[3]</ref>. We can also check if the SPT data are consistent with the &#923;CDM predictions given Planck data. The 68% confidence level predictions of the &#923;CDM model given Planck data are shown as a gray band, with the best-fit &#923;CDM model given Planck data shown as a black line. The &#967; 2 of the SPT bandpowers given the Planck best-fit &#923;CDM model is 80.7, which corresponds to a PTE of 20.2%. Thus, the SPT bandpowers agree with the Planck &#923;CDM best-fit prediction at the 1.3&#963; level.</p><p>In Fig. <ref type="figure">12</ref>, we compare the SPT bandpowers to prior measurements. The SPT lensing bandpower measurements extend to L &#8764; 3000 and are more precise than prior published measurements at L &gt; 350. The unlensed EE bandpowers extend to l &#8764; 3500 and are more precise than prior published measurements at l &gt; 2000 where we have clear detections of the eighth, ninth and tenth acoustic peaks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Origins of &#923;CDM parameter constraints</head><p>The SPT bandpowers represent a significant advance in our capacity to constrain cosmological parameters with polarization data alone. In Fig. <ref type="figure">13</ref>, we show constraints on four &#923;CDM parameters from SPT, ACT, and Planck, assuming the &#923;CDM model, and limiting to only EE spectra and polarization-based lensing reconstruction, or in the case of ACT, also including temperature in the lensing reconstruction. We see significantly tighter constraints on three of the parameters from SPT, while the higher dynamic range of the Planck EE data (which extends to lower l than shown in Fig. <ref type="figure">12</ref>) leads to a tighter constraint on n s for Planck.</p><p>Constraints on &#923;CDM parameters from SPT data and other dataset combinations, now including both temperature and polarization as detailed in Table <ref type="table">III</ref>, are summarized in Table <ref type="table">IV</ref>. The SPT constraints are comparable to full Planck results in the matter density, &#937; m h 2 , and angular scale of the sound horizon, &#952; &#8902; . The constraining power on &#937; m h 2 from SPT is mainly due to the CMB lensing bandpowers, which are sensitive to the amount of matter in the Universe. This is evidenced by the fact that adding just SPT lensing to WMAP data reduces the error on &#937; m h 2 by over a factor of 3 from 0.0045 <ref type="bibr">[72]</ref> to 0.0014, and that further adding SPT EE to this combination leaves the error unchanged.</p><p>That the sensitivity to &#937; m h 2 primarily comes from the lensing reconstruction rather than primary power spectra (in this case unlensed EE) is a unique feature of these data compared to prior CMB datasets. The sensitivity of primary power spectra to matter density is weakened due to its weighting toward small scales that are sensitive to conditions at horizon crossing; at such early times the matter density is negligible compared to the radiation density. Conversely, the lensing power spectrum, if probed precisely enough, has high sensitivity to &#937; m h 2 . This comes mainly (but not solely) from how CMB lensing power depends on the scale of matter-radiation equality. FIG. <ref type="figure">13</ref>. Posterior distributions of &#923;CDM cosmological parameters derived only from polarization EE spectra and polarization-based lensing reconstruction, or in the case of ACT [For the ACT DR4 EE likelihood, we adopted their default prior for yp2 &#8764; &#189;0.5; 1.5], also including temperature in the lensing reconstruction. We see tighter constraints on &#937; b h 2 , &#937; c h 2 , and H 0 from the SPT data. We show the SH0ES constraint on H 0 for comparison. See also Table <ref type="table">III</ref> for more details on dataset definitions. TABLE IV. Parameter estimates for various datasets assuming &#923;CDM. The datasets are defined in Table <ref type="table">III</ref>.</p><p>.045 AE 0.024 3.042 AE 0.011 3.046 AE 0.011 3.047 AE 0.012 3.042 AE 0.013 n s 0.944 AE 0.027 0.965 AE 0.004 0.9647 AE 0.0037 0.9637 AE 0.0076 0.961 AE 0.011 100&#952; MC 1.04162 AE 0.00066 1.04092 AE 0.00031 1.04104 AE 0.00028 1.04154 AE 0.00057 1.0394 AE 0.0022 &#937; b h 2 0.02255 AE 0.00057 0.02237 AE 0.00014 0.02239 AE 0.00013 0.02248 AE 0.00027 0.02226 AE 0.00045 &#937; c h 2 0.1227 AE 0.0018 0.1199 AE 0.0011 0.12039 AE 0.00094 0.1215 AE 0.0014 0.1210 AE 0.0014 &#964; reio 0.052 AE 0.006 0.0532 AE 0.0056 0.0547 AE 0.0058 0.052 AE 0.006 0.0515 AE 0.0059 H 0 [km=s=Mpc] 66.81 AE 0.81 67.4 AE 0.5 6 7 .28 AE 0.42 67.1 AE 0.6 6 6 .4 AE 1.1 100&#952; &#8902; 1.04181 AE 0.00069 1.0411 AE 0.0003 1.04125 AE 0.00027 1.04173 AE 0.00057 1.0396 AE 0.0022 &#937; m h 2 0.1459 AE 0.0019 0.143 AE 0.001 0.14340 AE 0.00089 0.1446 AE 0.0015 0.1439 AE 0.0015 &#937; m 0.327 AE 0.011 0.3147 AE 0.0068 0.3169 AE 0.0058 0.3211 AE 0.0085 0.326 AE 0.011 &#963; 8 0.8143 AE 0.0088 0.8104 AE 0.0051 0.8137 AE 0.0046 0.8177 AE 0.0065 0.8118 AE 0.0086 S 8 &#188; &#963; 8 ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi &#937; m =0.3 p 0.850 AE 0.017 0.830 AE 0.012 0.8363 AE 0.0097 0.846 AE 0.016 0.847 AE 0.016</p><p>Additionally, the sensitivity increases at the small scales probed here, with C &#981;&#981; L &#8733; &#240;&#937; m h 2 &#222; 2 at L &#8764; 200 and asymptoting to &#240;&#937; m h 2 &#222; 3 by L &#8764; 2000 <ref type="bibr">[80]</ref>.</p><p>The SPT data are also sensitive to the angular scale of the sound horizon, &#952; &#8902; . This sensitivity comes from the EE bandpowers, where the peak and trough locations of the EE spectrum are highly sensitive to &#952; &#8902; . The importance of the EE bandpowers for this constraint, as opposed to the lensing bandpowers, can be seen in the comparison of &#952; &#8902; constraints from SPT, WMAP &#254; SPT, and WMAP &#254; SPT&#981;&#981;; the exclusion of SPT EE significantly degrades the &#952; &#8902; determination. We note that the constraint on &#952; &#8902; is known to be improved by delensing <ref type="bibr">[81]</ref>, a result which applies to our unlensed spectra inferences here as well, although we do not explicitly attempt to quantify exactly how much improvement we have achieved.</p><p>The constraints on &#964; from the SPT dataset are driven almost entirely by Planck low-l polarization data, included here as a &#964; prior. Although lensing can, in theory, help constrain &#964; via degeneracy breaking, the contribution at current noise levels is negligible. The constraint on log&#240;10 10 A s &#222; is similarly largely driven by Planck, since it is degenerate both with &#964; and the absolute calibration, the latter which also comes from Planck.</p><p>The SPT constraints on &#937; b h 2 and n s are weaker than those inferred from Planck. Both n s and &#937; b h 2 cause tilts of the CMB EE spectrum, since &#937; b h 2 affects the photon diffusion damping at small angular scales and n s affects the shape of the primordial spectrum. Our observations lack the very largest scales probed by Planck, which, logarithmically, contain a significant range of scales useful for constraining a tilt. Additionally, at small scales our uncertainties are increased by polarized beam uncertainty, preventing what could otherwise be a slightly better constraint. Our final result is a 2.5% determination of the baryon density.</p><p>With the baryon density, matter density, and &#952; &#8902; determined, one has everything necessary, in &#923;CDM, for calculating both H 0 and &#937; m [e.g., <ref type="bibr">[82]</ref> ]. Adding in A s and n s one can also calculate &#963; 8 and S 8 . We show constraints on these four derived parameters in Table <ref type="table">IV</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Agreement with other CMB datasets in &#923;CDM</head><p>Before providing interpretations, it is useful to establish the consistency level of our &#923;CDM parameter results with other CMB measurements. Such comparison provides tests of the &#923;CDM model that are complementary to and stronger than the bandpower-level tests above.</p><p>We perform a simple consistency check applicable to uncorrelated datasets by calculating the &#967; 2 between the two sets of parameters,</p><p>where x is a vector of parameter estimates, C are the parameter covariances from each, and the subscripts 1 and 2 distinguish datasets. We then compute the PTE for the given &#967; 2 and quote a 1-tailed conversion to number of &#963;. Since, as discussed, our estimates of &#964; and A s are highly correlated to Planck, we consider only x &#188; &#240;&#952; &#8902; ; &#937; b h 2 ; &#937; c h 2 ; n s &#222;. For these parameters, the SPT results are very close to independent of both Planck and ACT due to non-overlapping sky area and multipole ranges. We find &#923;CDM parameters from SPT and Planck datasets are consistent at 0.8&#963;, and from SPT and ACT datasets at 0.8&#963;.</p><p>A triangle plot comparing marginal posterior distributions from SPT &#254; WMAP, ACT &#254; WMAP, and Planck is shown in Fig. <ref type="figure">14</ref>.</p><p>We now turn to estimates of three parameters of particular contemporary interest in cosmology: H 0 , S 8 , and &#937; m .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. The Hubble constant, H 0</head><p>The Hubble constant inferred from Planck data, assuming the &#923;CDM model, has been found to be in tension with the most precise "distance ladder" measurement <ref type="bibr">[83,</ref><ref type="bibr">84]</ref>, which uses Cepheid-calibrated SNe Ia, at the 5.8&#963; level. <ref type="foot">6</ref>In Fig. <ref type="figure">15</ref>, we show the Hubble constant inferred from various observations. Using the SPT data alone, we report</p><p>This is consistent with the Hubble constant inferred from Planck, and is in 5.4&#963; tension with <ref type="bibr">[83]</ref>. This confirms previous hints of a tension given only CMB polarization data, where Planck, ACTpol, and SPTpol yielded H 0 &#188; 68.7 AE 1.3 <ref type="bibr">[88]</ref>, but almost halves the error bar and increases the significance of the tension from less than 3&#963; to greater than 5&#963;.</p><p>The agreement with previous CMB constraints, that rely heavily on temperature data, is also significant as a passed test of &#923;CDM. Of course, the continued disagreement with distance ladder methods remains a mystery. We expect that these new SPT data will be a significant hurdle for many of the proposed models <ref type="bibr">[37]</ref>, and models yet to come, that address the Hubble tension.</p><p>Combining Planck, ACT, and SPT, we report</p><p>which results in tension with the H 0 reported in <ref type="bibr">[83]</ref> at 6.2&#963;. This is the tightest constraint on H 0 inferred from CMB observations assuming the &#923;CDM model. We also note that the combination WMAP &#254; ACT &#254; SPT achieves H 0 &#188; 67.18 AE 0.45 km=s=Mpc, a result with precision nearly the same as that of the Planck result and nearly independent of Planck data. Overall, inferences of the Hubble constant from a wide range of CMB observations are in good agreement. While the potential for confirmation bias to drive this agreement could be a concern, we highlight that both the unlensed EE and &#981;&#981; spectra which enter our SPT result here were obtained blindly, and while there were some changes to H 0 after we unblinded, these changes were small compared to the magnitude of the tension.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>E. The matter density, &#937; m</head><p>There has recently emerged some degree of tension between &#937; m as inferred from BAO, and values inferred from supernova surveys <ref type="bibr">[65]</ref>. We summarize the situation  <ref type="table">III</ref>. We see that either ACT or SPT, when combined with WMAP for constraints on larger angular scales, achieve constraints on cosmological parameters with similar constraining power as the constraints from Planck.</p><p>in Fig. <ref type="figure">16</ref>. The biggest discrepancy is between DESI BAO and the DES-Y5 determination of &#937; m which yields a 2.5&#963; difference. Inferences from CMB data, with consistency among different observations, are more precise than those from the supernovae and BAO, lying in between the low BAO value and the higher supernova values. From SPT data alone, we infer</p><p>This is the highest central value of the inferences from different choices of CMB dataset, though consistent with all of them, and with an uncertainty only 65% greater than the Planck uncertainty.</p><p>F. The structure amplitude parameter, S 8</p><p>The amplitude of structure measured from galaxy weak lensing surveys also shows mild tension with the &#923;CDM inference using CMB data. In Fig. <ref type="figure">17</ref>, we show the value of S 8 &#8801; &#963; 8</p><p>ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi &#937; m =0.3 p inferred from various observations. From SPT, we find</p><p>which is in agreement with the result from Planck at a similar level of precision. Our result is in 2&#963;, 3.2&#963;, and 3&#963; disagreement with HSC-Y3 <ref type="bibr">[92]</ref>, KiDs-1000 <ref type="bibr">[93]</ref>, and DES-Y3 <ref type="bibr">[94]</ref>, respectively. From the joint fit of Planck &#254; ACT &#254; SPT, we get</p><p>which is at 1.8&#963;, 3.3&#963;, and 3.3&#963; tension with HSC-Y3, KiDs, and DES-Y3 results. We also note that the constraint on S 8 from a combination of WMAP &#254; ACT &#254; SPT is consistent with the S 8 inferred from Planck alone, which provides a nearly-independent check of the Planck result.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>G. Amplitude of nonlinear structure growth</head><p>One proposed phenomenological resolution to the S 8 tension is a suppression of the matter power spectrum at nonlinear scales, potentially due to baryonic feedback or dark matter structure growth which differs from the standard picture <ref type="bibr">[41,</ref><ref type="bibr">95]</ref>. The CMB lensing reconstruction we present in this paper is the most precise measurement of the CMB lensing spectrum at the smaller angular scales where nonlinear effects are more important and can be used to further constrain the space of possible models which FIG. 16. Comparison of the &#923;CDM constraints on &#937; m inferred from different CMB observations (blue) with those from SNe observations (orange) and DESI BAO (green). The SNe observations include Pantheon&#254; [89], DES-Y5 [90], and Union 3 [91]. The biggest discrepancy is between DESI BAO and the DES-Y5 determination of &#937; m , a 2.5&#963; difference. FIG. 17. Comparison of the &#923;CDM constraints on S 8 inferred from different CMB observations (blue) with those from galaxy surveys of HSC Y3, KiDs-1000 and DES-Y3 (orange). The estimate from Planck &#254; ACT &#254; SPT differs with the one from HSC-Y3 at 2&#963;, the one from KiDs at 3.3&#963;, and with the one from DES-Y3 at 3&#963;.</p><p>resolve this tension. To do so, we introduce a new parameter, A CMB mod , equivalent to the A mod parameter of <ref type="bibr">[41]</ref> under the Limber approximation, but here measured at CMB lensing scales and redshifts. We implement this by directly scaling the non-linear correction to the lineartheory CMB lensing spectrum,</p><p>where C &#981;&#981; L;lin and C &#981;&#981; L;nonlin are the linear and nonlinear CMB lensing spectra, the latter computed via our default Halofit. This modified lensing spectrum is then also propagated to changes in the lensed primary spectra.</p><p>As a small aside, we have verified that constraints on A CMB mod are insensitive to the choice of nonlinear and baryonic feedback models. Given that A CMB mod is the parameter most sensitive to the nonlinear prediction, our constraints on other cosmological parameters in this paper are insensitive as well.</p><p>Figure <ref type="figure">18</ref> shows results of fitting the &#923;CDM &#254; A CMB mod model to various CMB datasets. We take the Planck primary CMB measurements as a baseline, which tightly limit the linear contribution to lensing, then explore the impact to constraints on the nonlinear contribution, A CMB mod , from adding lensing measurements from Planck, ACT, or the SPT results from this work. As expected from the spectra shown in Fig. <ref type="figure">12</ref>, there are significant improvements in constraining the nonlinear contribution from Planck to ACT, and again from ACT to SPT. The Planck primary CMB together with SPT detect a non-zero A CMB mod at &gt;3&#963; for the first time.</p><p>All measurements are consistent with the standard value A CMB mod &#188; 1, and the tightest combination of all lensing measurements yields A CMB mod &#188; 1.60 AE 0.39. The A mod inferred from DES-Y3 and KiDs is A mod &#188; 0.820 AE 0.042 <ref type="bibr">[95]</ref>. This latter result measures the amplitude of nonlinear structure growth at redshift z &#8764; 0.3 and scales of k &#8764; 1 Mpc -1 , whereas our CMB lensing measurements are sensitive to higher redshifts and larger scales. Using a Fisher calculation based on our actual achieved error bars, in the top panel of Fig. <ref type="figure">18</ref>, we have computed the range of &#240;k; z&#222; which contribute most to our constraints on total lensing power, as well as to just the non-linear contribution to the lensing power, i.e., to A mod .</p><p>This plot, taken together with our A mod constraints, suggests (at &#8764;2&#963;) that if the S 8 tension is to be explained by some nonstandard physics suppressing the nonlinear matter power spectrum, these effects have not started at the higher redshift and larger scales of CMB lensing as compared to galaxy lensing, and evolve in the decade of separation in &#240;k; z&#222; before suppressing the galaxy lensing power by &#8764;20% at later times and smaller scales. We note that with even better future measurements of small-scale CMB lensing, sensitivity to A CMB mod will push to smaller scales; e.g., significantly tightening up measurement of lensing power in a broad band centered on L &#188; 4000 will lead to sensitivity to galaxy weak lensing scales, although at z &#8771; 1 to 2 rather than z &#188; 0.3.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>H. Massive neutrinos</head><p>The &#923;CDM model, as usually understood, includes a strong assumption about the spectrum of neutrino masses, namely that the sum is 0.058 eV. This particular value follows from assuming that the lightest neutrino mass is effectively zero, that normal ordering applies, and that the usual interpretation of atmospheric and solar neutrino oscillations is correct. Here, we relax this assumption and FIG. <ref type="figure">18</ref>. Top panel: Redshifts and scales to which our lensing measurements are sensitive. Colored regions are iso-contours in the contribution to the square root of the Fisher information from logarithmic bins in k and z. The outermost contour corresponds to 10% of the peak. Red is for the Fisher information in the total lensing power, whereas blue is for the Fisher information just in A CMB mod (and is weighted more towards higher L). Under the Limber approximation, a given &#240;k; z&#222; contributes to a single CMB lensing multipole, L, and dashed lines denote this mapping for scales of L &#188; 30 and L &#188; 4000. Bottom panel: Constraints on A CMB mod , defined as the amplitude of nonlinear contributions to the CMB lensing spectrum [Eq. ( <ref type="formula">29</ref>)], from Planck primary CMB measurements (labeled PlanckT&amp;E) in combination with lensing measurements from Planck, ACT DR6, and the SPT dataset presented here. The orange band shows the 68% and 95% constraints of A mod inferred from DES-Y3 and KiDs datasets in <ref type="bibr">[95]</ref>. study constraints on the sum of neutrino masses, &#931;m &#957; . For a review of neutrino physics and its cosmological significance, see <ref type="bibr">[96]</ref>.</p><p>Relative to the case of massless neutrinos, neutrino rest masses increase the mean energy density and therefore, the expansion rate. To keep the angular size of the sound horizon fixed, the cosmological constant must then decrease, with a result that the expansion rate is boosted at z &gt; 1 and decreased at z &lt; 1 <ref type="bibr">[97]</ref>. The increased expansion rate at z &gt; 1 reduces growth on scales below the neutrino free-streaming length. Increasing mass also decreases the neutrino free-streaming length. On scales above the free-streaming length, neutrinos can cluster, approximately canceling the impact on growth of the increased expansion rate. For neutrino masses consistent with the data we consider, the free-streaming lengths are on very large scales so that the clustering of neutrinos only impacts the CMB lensing power spectrum at L &#8818; 200 <ref type="bibr">[98]</ref>. For this reason, nearly all of the constraining power of CMB data on neutrino masses comes from the changes to the expansion rate, rather than the changed free-streaming scale <ref type="bibr">[99]</ref>. This is not to say that CMB lensing does not contribute to the constraints, as it is indeed sensitive to the reduction in growth that occurs as a result of the increased expansion rate.</p><p>In Table <ref type="table">V</ref>, we show the constraints on &#931;m &#957; from various datasets. Combining the CMB observations of Planck, SPT, and ACT, we report the following constraint on neutrino mass:</p><p>Additionally including BAO measurements yields</p><p>which puts extreme pressure on the inverted hierarchy minimal mass sum of about 0.098 eV. These constraints on &#931;m &#957; should be interpreted with some caution. The need for that caution is well-illustrated by Fig. <ref type="figure">3</ref> in Loverde and Weiner <ref type="bibr">[98]</ref>. Here, they show that, due to constraints from primary CMB data on &#952; s , &#969; bc , and r d (where &#969; bc &#188; &#969; b &#254; &#969; c ), and constraints from BAO data on &#937; m and &#969; m r 2 d (where here &#969; m &#188; &#969; b &#254; &#969; c &#254; &#969; &#957; and likewise for &#937; m ), the preference of the combined constraints is for &#937; m &lt; &#937; bc ; i.e., for an (unphysical) negative neutrino mass. Further, the Pantheon &#254; SH0ES constraint in this plane is completely at odds with both the CMB and BAO (either DESI or SDSS) constraints. Loverde and Weiner <ref type="bibr">[98]</ref> argue that the main cause of DESI BAO data leading to tighter constraints on &#931;m &#957; is the central value of their constraints in the &#937; m -&#969; m r 2 d plane, rather than a decrease in the uncertainty in this plane.</p><p>Similar reason for caution is given by Craig et al. <ref type="bibr">[40]</ref> and Green and Meyers <ref type="bibr">[100]</ref>. They point out that there is some weak (2.5&#963;) evidence for an excess in lensing power present in CMB data, relative to &#923;CDM predictions given CMB and BAO data. This is, again, opposite the expected influence of an increase to &#931;m &#957; from its minimal (&#923;CDM) value of 0.058 eV. We turn to this question of excess lensing power in the next subsection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. Excess lensing power?</head><p>As mentioned, Craig et al. <ref type="bibr">[40]</ref> pointed out that the lensing power inferred from the Planck lensing reconstruction and from the ACT DR6 lensing reconstruction are somewhat in excess of &#923;CDM predictions given DESI BAO and primary CMB data. Here, we investigate this claim, first with L-independent lensing template parameters and then with the phenomenological "negative neutrino mass" parameter used in <ref type="bibr">[40]</ref>, &#931; m&#957; , which can be thought of as an L-dependent lensing template parameter. TABLE V. Summary of constraints on selected extensions to the &#923;CDM model. The errors indicate the 68% confidence region, and the upper limits on the sum of neutrino masses are 95% upper limits.</p><p>&#254;0.014 -0.011 -0.0070 AE 0.0056 -0.0098 AE 0.0052 -0.0076 &#254;0.0053 -0.0047 0.0014 AE 0.0014 w 0 -1.42 &#254;0.29 -0.47 -1.48 &#254;0.20 -0.37 -1.52 &#254;0.18 -0.35 -1.52 &#254;0.19 -0.35 -1.075 AE 0.046 w 0 -1.10 &#254;0.50 -0.69 -1.18 &#254;0.46 -0.62 -1.16 &#254;0.48 -0.60 -1.18 &#254;0.48 -0.58 -0.65 AE 0.23 w a -1.1 &#254;0.6 -1.9 -1.06 &#254;0.61 -1.94 -1.18 &#254;0.52 -1.82 -1.21 &#254;0.57 -1.79 -1.15 &#254;0.71 -0.59 N eff 2.95 AE 0.33 2.89 AE 0.18 2.85 AE 0.16 2.66 AE 0.14 2.86 AE 0.13 &#931;m &#957; &lt;0.38 eV &lt;0.31 eV &lt;0.17 eV &lt;0.20 eV &lt;0.075 eV N eff 2.95 AE 0.32 2.89 AE 0.19 2.84 AE 0.17 2.67 AE 0.14 2.83 AE 0.13 &#931;m &#957; &lt;0.38 eV &lt;0.31 eV &lt;0.17 eV &lt;0.21 eV &lt;0.061 eV N eff 2.79 AE 0.52 2.8 AE 0.3 2 .77 AE 0.26 2.60 AE 0.23 2.89 AE 0.23 Y P 0.256 AE 0.028 0.248 AE 0.018 0.250 AE 0.016 0.246 AE 0.014 0.241 AE 0.015</p><p>To conduct our analysis, we define three L-independent lensing template parameters:</p><p>(1) A recon scales the &#923;CDM model lensing power used to predict the reconstructed lensing power, (2) A 2pt scales the model lensing power used to compute any lensed power spectra (2-point correlation functions), and (3) A lens scales both of these model power spectra; i.e., it is what we call A recon and A 2pt when we force them to be equal to each other. Note that what we call A 2pt has often been called A L .</p><p>We consider four model spaces, each an extension of &#923;CDM. They are &#923;CDM&#254;A recon , &#923;CDM &#254; A 2pt , &#923;CDM&#254;A lens , and &#923;CDM &#254; A recon &#254; A 2pt . Taking &#952; to be the &#923;CDM parameters, the extensions to the model spaces that include these template parameters are done via</p><p>where X &#188; recon, 2pt, or lens and C &#981;&#981; L &#240;&#952;&#222; on the right-hand side is the &#923;CDM model evaluated at &#952;. 7 To assist in comparison with <ref type="bibr">[40,</ref><ref type="bibr">100]</ref>, in this subsection only we take "PlanckT&amp;E" to include the Planck low-lEE likelihood and remove the &#964; prior, and we restrict our use of ACT data to ACT DR6 lensing only. Our choice of BAO data can be found in Table <ref type="table">III</ref>.</p><p>In the dash-dotted curves of the top panel of Fig. <ref type="figure">19</ref>, we see the well-known "A lens" or "A L " anomaly, which in our language is an A 2pt anomaly driven by Planck PR3TT data <ref type="bibr">[101]</ref>. This is a preference, in the lensed TT, TE, and EE spectra, for excess lensing power beyond what one gets from the &#923;CDM model. In the lower left panel, we can see that if both A recon and A 2pt are allowed to vary freely, the 95% confidence regions for the four considered data combinations have central values greater than 1 in both dimensions and no overlap with the &#923;CDM point of</p><p>Turning to the question of an excess of reconstructed lensing power, we see in the solid lines of the right panel that if we marginalize over A 2pt (solid lines) then there is some preference for A recon &gt; 1. The significances of these preferences are shown as ranging from 1.8&#963; to 2.6&#963;. From these significance levels and from the blue, orange, and green curves we see that we get a very similar result whether we use Planck, ACT DR6, or SPT lensing reconstructions, with the ACT DR6 and SPT curves nearly overlapping. If we remove DESI BAO data 8 (a case not shown in the figure), the preferences for A recon &gt; 1 persist, although with somewhat reduced significance ranging from 1.4&#963; to 2.2&#963;. The preference for A recon &gt; 1 is weakened much more if we fix A 2pt to 1 rather than marginalize over it (dashed curves); any evidence for excess reconstructed lensing power is then quite weak.</p><p>This dependence of the A recon posteriors on the treatment of A 2pt (fixing to 1 or marginalizing) makes sense given the correlation we see in the lower left panel. The lower the assumed A 2pt , the lower the distribution of A recon values. Physically, this correlation emerges because as A 2pt is increased, the &#923;CDM parameters used to fit the Planck T&amp;E data adjust in a way that decreases lensing power (both &#937; m h 2 and A s decrease). With the &#923;CDM lensing power decreased, a given level of reconstructed CMB lensing power needs a larger A recon to fit it.</p><p>Readers who are familiar with <ref type="bibr">[100]</ref> may be surprised at this sensitivity of A recon to choice of treatment of A 2pt (marginalizing it or fixing it to 1), given arguments made in that paper about the unimportance of the 2-point lensing anomaly to their conclusion of excess lensing power. But these two different results do not contradict each other. Green and Meyers <ref type="bibr">[100]</ref> define a slightly L-dependent lensing template parameter, A L &#240;&#931; m&#957; &#222;, and an L-independent template parameter, B lens , which are FIG. <ref type="figure">19</ref>. Posteriors of the lensing power phenomenological scaling parameters A 2pt and A recon . Different combinations of CMB lensing data added to the Planck primary CMB and BAO are shown in different colors. The 2D contours and solid lines show the marginal posterior from the &#923;CDM &#254; A 2pt &#254; A recon model; the dot-dashed and dashed lines show the results from &#923;CDM &#254; A 2pt and &#923;CDM &#254; A recon , respectively. The &#963; values give the level at which A recon (A 2pt ) values less than 1 are excluded, for the case of marginalization over A 2pt (A recon ); i.e., for the solid curves. 7 Recall that in this paper the &#923;CDM model has &#931;m &#957; fixed to 0.058 eV. 8 When removing DESI, we also add back in BAO data we had removed (see Table <ref type="table">III</ref>), namely the SDSS DR16 LRG, ELG, and QSO BAO <ref type="bibr">[76]</ref> and the DR12 BAO bin with z eff &#188; 0.61 <ref type="bibr">[75]</ref>.</p><p>approximately related to our parameters by A recon &#188; A L &#240;&#931; m&#957; &#222; and A 2pt &#188; A L &#240;&#931; m&#957; &#222;B lens . <ref type="foot">9</ref> The resulting correlation between B lens and &#931; m&#957; is much smaller than the correlation we see in Fig. <ref type="figure">19</ref>, which explains why they see the posterior for &#931; m&#957; not change much if they switch from fixing B lens &#188; 1 to marginalizing over it.</p><p>We have a strong theoretical prior that the lensing power one would infer from lensed CMB spectra (given the right cosmological model) should be equal, within the expected uncertainties, to what one would infer in a CMB lensing reconstruction. <ref type="foot">10</ref> So, even though the bulk of the probability lies to one side of the A recon &#188; A 2pt line, it is worth exploring what we find if we force these two parameters to be equal. This is the model space we explore with an even greater variety of dataset combinations in Fig. <ref type="figure">20</ref>, and it is also very similar to the model space chiefly studied by Green and Meyers <ref type="bibr">[100]</ref> (with their B lens &#188; 1).</p><p>One of the dataset variations we consider throughout Fig. <ref type="figure">20</ref> is the replacement of the Planck PR3 Plik T&amp;E likelihood with the CamSpec PR4 T&amp;E likelihood <ref type="bibr">[102]</ref>. <ref type="foot">11</ref>The Planck PR4 maps use more timestream data than does PR3 and the CamSpec PR4 T&amp;E likelihood uses more sky than does the Plik PR3 T&amp;E likelihood. If the A 2pt anomaly is driven by some fluctuations that drive the spectra away from the mean (in a manner similar to enhanced lensing) then we would expect the support for it to go down as we add in more data. For Plick PR3 we have, from TT; TE; EE &#254; lowE, A L &#188; 1.180 AE 0.065 <ref type="bibr">[101]</ref>. With the additional PR4 data, we instead have A L &#188; 1.095 AE 0.056 <ref type="bibr">[102]</ref>. Note that there is one other PR4 T&amp;E likelihood <ref type="bibr">[103]</ref>, in which they find an even lower A L &#188; 1.039 AE 0.052.</p><p>The lower half of the entries in Fig. <ref type="figure">20</ref> do not include any BAO data. We see that without the BAO data any conclusion of A lens &gt; 1 by more than 2&#963; is not very robust, occurring only for one of the dataset combinations. With the addition of BAO data (top half of the figure), such a conclusion becomes much more robust. The combinations that include Planck T&amp;E, BAO, and all three lensing reconstructions become robust to the difference between PR3 and PR4. With PR3 specifically we find</p><p>and a 2.7&#963; exclusion of A lens &lt; 1. Note that the constraints are weakened by removing DESI from the BAO dataset. Also, if one does so, the results become dependent again on the choice of Planck T&amp;E. Removing DESI (in the same manner as done above) and switching from Planck PR3 T&amp;E to Planck PR4 T&amp;E, we find A lens &#188; 1.064 AE 0.032.</p><p>We have checked that switching to the L-dependent lensing template used in <ref type="bibr">[40,</ref><ref type="bibr">100]</ref> in place of A lens delivers a similar result. Their template depends on their neutrinomass-like parameter</p><p>and a 2.7&#963; exclusion of &#931; m&#957; &gt; 0.058 eV. We note that this evidence for excess lensing power arises from three types of constraints: 1) lensing power measurements, 2) inferences of &#923;CDM parameters (other than &#964;) from the combination of CMB and BAO data, and 3) inference of &#964; from large angular-scale CMB polarization. We have seen that the preference for A lens &gt; 1 is robust to selection of different datasets for <ref type="bibr">(1)</ref>. Of particular relevance for this paper, the preference is present when the Planck and ACT lensing measurements are replaced with the SPT one. The SPT lensing is inferred entirely from polarization, making it much less susceptible to possible biases from extragalactic foregrounds. Regarding (2), we see that a more than 2&#963; preference for A lens &gt; 1 requires either BAO data (including DESI) or PR3 T&amp;E. The next release of BAO data from DESI should be informative, as will the forthcoming lensed TT, TE, and EE spectra from SPT-3G. Note that &#964; (item 3 above) is inferred from Planck low-lEE data. The importance of &#964; comes from its implications for A s as the combination A s e -2&#964; is very well determined by CMB data: a higher &#964; would lead to higher A s FIG. <ref type="figure">20</ref>. Constraints on the lensing power phenomenological scaling parameter A lens (68% C.L.) using Planck PR3 Plik T&amp;E likelihood (in blue) and CamSpec PR4 T&amp;E likelihood (in orange). and hence, a higher lensing power prediction. Independent and competitive constraints on &#964; may come via the kinematic SZ effect <ref type="bibr">[11,</ref><ref type="bibr">104]</ref>.</p><p>The conclusion that there is an excess of lensing power also relies on assumption of the &#923;CDM model. Craig et al. <ref type="bibr">[40]</ref> introduced a number of changes to the model that could possibly restore concordance. Lynch et al. <ref type="bibr">[105]</ref> noted that nonstandard recombination can simultaneously reduce the H 0 tension and lead to a higher predicted lensing power. The DESI Collaboration points out that with free w 0 and w a the constraints on &#931;m &#957; are significantly relaxed <ref type="bibr">[65]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>J. Other &#923;CDM extensions</head><p>We also explore &#923;CDM extensions with curvature, the density of light relics, primordial abundance of helium, and a time-varying dark energy equation of state. Constraints on additional parameters are summarized in Table <ref type="table">V</ref>.</p><p>For the &#923;CDM extension with free curvature and for the ones with changing dark energy equation of state, we find that all CMB datasets are consistent within 2&#963; with the fiducial values of the &#923;CDM model. The constraints from different CMB datasets are consistent. Adding the BAO data significantly tightens the constraints. This is due to the BAO sensitivity to the expansion rate and the distanceredshift relation over redshifts when the dark energy begins to dominate the density.</p><p>The energy density of additional relativistic particles other than photons is parameterized by the effective number of neutrino species, N eff , defined via</p><p>where &#961; &#947; is the photon energy density. Assuming the standard cosmological model, N eff &#188; 3.044 <ref type="bibr">[106]</ref><ref type="bibr">[107]</ref><ref type="bibr">[108]</ref>.</p><p>The CMB power spectrum is sensitive to N eff <ref type="bibr">[109]</ref><ref type="bibr">[110]</ref><ref type="bibr">[111]</ref><ref type="bibr">[112]</ref><ref type="bibr">[113]</ref>.</p><p>By increasing the density of light relics, one increases the expansion rate during the radiation-dominated era. To keep the precisely measured angular scale of the sound horizon unchanged, the expansion rate after last scattering also increases [e.g., <ref type="bibr">[114]</ref> ]. Therefore, there is a positive correlation between H 0 and N eff . The marginal posterior distribution of N eff and H 0 is shown in the left panel of Fig. <ref type="figure">21</ref>. Combining Planck, SPT, ACT, and BAO, we get</p><p>where N eff is consistent with 3.044 from the standard model prediction, but H 0 is still in 5.4&#963; tension with the SH0ES <ref type="bibr">[83]</ref> measurement. One can further extend the model to allow for a free &#931;m &#957; as well. The marginal posterior of N eff and &#931;m &#957; is shown in the middle panel of Fig. <ref type="figure">21</ref>. There is nearly no degeneracy between N eff and &#931;m &#957; . Adding SPT data to Planck improves the constraints on N eff slightly and &#931;m &#957; by about a factor of 2 as discussed in Sec. V H. The lensing information in SPT data is useful for constraining the neutrino mass sum, as discussed in Sec. V H, and the improvement of the N eff constraint is mostly from the more precisely measured EE bandpowers.</p><p>The density of light relics is also partially degenerate with the primordial fraction of baryonic mass in helium, Y P . Both N eff and Y P affect the photon diffusion damping of the CMB power spectra at small angular scales. In the right FIG. <ref type="figure">21</ref>. Marginal posterior distributions of the &#923;CDM extension models involving N eff . The horizontal red dashed line shows N eff &#188; 3.044. The yellow vertical band in the left panel shows the constraint on H 0 from Breuval et al. <ref type="bibr">[83]</ref>. The vertical brown dashed line shows &#931;m &#957; &#188; 0.058 eV. The purple vertical band shows the inference of the primordial helium fraction from Aver et al. <ref type="bibr">[115]</ref>. panel of Fig. <ref type="figure">21</ref>, we show the marginal posterior of N eff and Y P . Compared to the constraints using only Planck data, the uncertainty is reduced by &#8764;41% with the addition of SPT data. 12 When we fit jointly for N eff and Y P using Planck, ACT, SPT, and BAO measurements, we find values of N eff that are consistent with the standard model prediction of 3.044, N eff -Y P contours that are consistent with BBN predictions, and Y P constraints that are consistent with direct measurements of HII regions in metalpoor galaxies <ref type="bibr">[115]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>VI. CONCLUSIONS</head><p>In this paper, we presented an optimal joint inference of the CMB lensing potential power spectrum and the unlensed CMB EE power spectrum, derived entirely from CMB polarization maps. These maps were made from observations of 1500 deg 2 taken during the SPT-3G 2019 and 2020 Austral winter observing seasons. The inference was performed using a Bayesian framework called the marginal unbiased score expansion (MUSE) method, which also enables propagation of systematic uncertainties into the estimated bandpower uncertainties. The inferred bandpowers have smaller uncertainties than previously published results at l &gt; 2000 for the EE spectrum and at L &gt; 350 for the lensing spectrum, with a 38&#963; detection of lensing power.</p><p>After checking that the &#923;CDM model provides a good fit to the SPT data (combined with a prior on &#964;), we reported constraints on the &#923;CDM parameters. We found H 0 &#188; 66.81 AE 0.81 km=s=Mpc, a significantly tighter constraint than from prior polarization-only measurements. It is consistent with the Planck TT=TE=EE plus Planck lensing constraints assuming &#923;CDM, and in 5.4&#963; tension with the most precise distance ladder measurement <ref type="bibr">[83]</ref>. The SPT constraints on &#937; c h 2 and &#937; b h 2 are also tighter than from prior polarization-only or polarization plus CMB lensing measurements. We also found S 8 &#188; 0.850 AE 0.017, consistent with the result from Planck data and with comparable uncertainty.</p><p>We estimated &#923;CDM parameters for several other CMB datasets and checked for consistency between these estimates. These consistency tests are stringent tests of the &#923;CDM model. All differences investigated were within expectations given the measurement uncertainties and the &#923;CDM model. The comparison between SPT and Planck provides a particularly interesting test of the &#923;CDM model, because the SPT constraints are more heavily weighted toward small-scale polarization power, and especially lensing power, than is the case for Planck.</p><p>We estimated &#923;CDM parameters from SPT data in combination with other CMB datasets. When combined with primary CMB and lensing from Planck and ACT, we find H 0 &#188; 67.33 AE 0.37 km=s=Mpc, a 6.2&#963; tension with the distance ladder <ref type="bibr">[83]</ref>, and S 8 &#188; 0.8380 AE 0.0084, in 3.3&#963; tension with S 8 from the 3 &#215; 2 pt analysis of KiDs-1000 <ref type="bibr">[93]</ref> and also from the 3 &#215; 2 pt analysis of DES-Y3 <ref type="bibr">[94]</ref>.</p><p>Motivated by the work of Amon and Efstathiou <ref type="bibr">[41]</ref> and Preston et al. <ref type="bibr">[95]</ref> regarding the S 8 tension, we investigated the sensitivity of our lensing measurement to nonlinear corrections. By introducing the A CMB mod parameter that scales the amplitude of nonlinear corrections to the CMB lensing spectrum, we found A CMB mod &#188; 1.60 AE 0.39. This is the first &gt;3&#963; "detection" of the nonlinear correction for CMB lensing, and is &#8764;2&#963; higher than the value needed for resolving the S 8 tension between galaxy lensing and CMB measurements. Our data clearly do not favor any suppression of the nonlinear contribution, though the uncertainties are still large. A suppressed S 8 could arise at lower redshifts and from slightly smaller scales than the ones we are probing.</p><p>The CMB lensing potential is also sensitive to the neutrino mass sum, &#931;m &#957; . We investigated constraints on &#931;m &#957; with the SPT lensing spectrum. In the &#923;CDM &#254; &#931;m &#957; model, we find &#931;m &#957; &lt; 0.20 eV &#240;95% C:L:&#222; from the joint fit to Planck, ACT, and SPT data. Adding BAO data, the constraint is further tightened to &#931;m &#957; &lt; 0.075 eV &#240;95% C:L:&#222;, placing pressure on the inverted hierarchy minimal mass sum of 0.098 eV.</p><p>With the addition of BAO data to the CMB datasets, the strongest challenge to the standard model we see in our results comes from an investigation of lensing power amplitudes. We introduced A 2pt to scale the &#923;CDM model CMB lensing spectrum used to calculate lensed model TT=TE=EE spectra and A recon to scale that same &#923;CDM model CMB lensing spectrum, but for comparison to the reconstructed CMB lensing spectrum instead. With both of these parameters free, the &#923;CDM point in the A recon ; A 2pt plane is outside the 95% confidence region for a variety of dataset combinations. After marginalization over A 2pt , Planck lensing, ACT DR6 lensing, and SPT lensing all favor A recon &gt; 1 at &#8764;2&#963;.</p><p>We also set A recon &#188; A 2pt and called the single template parameter A lens . We investigated the posterior distribution of A lens for a variety of datasets. For our most comprehensive combination of datasets, we find A lens &#188; 1.083 AE 0.032 and a 2.7&#963; exclusion of A lens &lt; 1. Following Craig et al. <ref type="bibr">[40]</ref> and Green and Meyers <ref type="bibr">[100]</ref>, we found a similar constraint on their neutrino-mass-like L-dependent lensing template parameter, &#931; m&#957; . For the same dataset, we found a 2.7&#963; exclusion of &#931; m&#957; &gt; 0.058 eV. We also saw that these constraints are weakened by dropping DESI BAO, or by switching from Planck PR3 T&amp;E data to Planck PR4 T&amp;E data. Doing both, we find A lens &#188; 1.064 AE 0.032.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>12</head><p>Although prior work has shown that delensing can improve constraints on N eff and Y P <ref type="bibr">[81,</ref><ref type="bibr">116]</ref>, we suspect that the improvement we see here is largely unrelated to the unlensed nature of our bandpowers; i.e., it would be quite similar even if our bandpowers were lensed bandpowers.</p><p>We also discussed the cosmological inference with SPT CMB lensing potential bandpowers and the unlensed EE bandpowers on other one-and two-parameter extensions of the &#923;CDM model. These model extensions include varying the amount of spatial curvature, the dark energy equation of state and its evolution, the density of light relics, and the primordial fraction of baryonic mass in helium. The constraints on the extended parameters are summarized in Table <ref type="table">V</ref>. Combining the SPT results with Planck, ACT, and BAO, we saw no significant preference for any of these extensions.</p><p>Although the 2019-2020 SPT-3G CMB polarization data analyzed here primarily constrain H 0 and S 8 , SPT-3G constraints on other &#923;CDM parameters and extensions will be improved significantly in the near future. The addition of temperature information to both the power spectrum and lensing analyses will significantly improve the constraints on the inferred cosmological parameters, in particular the effective number of neutrino species, &#931;m &#957; , and curvature <ref type="bibr">[3]</ref>. With polarized beam uncertainties identified as an important contribution to our systematic error budget, we have planned additional deep observations of polarized point sources to better map out the polarized beam response and reduce this source of uncertainty in future analyses. The 1500 deg 2 field has since been observed for three more seasons. The noise level of the full 2019-2023 1500 deg 2 data is expected to be 1.7 times lower than the 2019-20 observations, and the figure of merit for &#923;CDM parameters is expected to be 2 times better <ref type="bibr">[3]</ref>. The addition of &#8764;8500 deg 2 more sky mapped with SPT-3G by the end of 2024 will strengthen constraints even further, eventually surpassing those from Planck <ref type="bibr">[3]</ref>. The analysis of these data, particularly the ultradeep 1500 deg 2 data, will benefit even more from optimal analyses such as the MUSE method demonstrated here. This work has demonstrated the strength of joint Bayesian CMB lensing potential bandpower reconstruction and unlensed CMB EE bandpower estimation, in particular its ability to propagate systematic uncertainties, and paves the way for future analyses of SPT-3G observations and similarly deep datasets in the future.</p><p>these operations for the autodifferentiation (AD) system. The Jacobian of a spherical harmonic transformation (SHT), f &#188; Y a and the Jacobian with respect to f of the lensing operation, f &#188; L&#240;&#981;&#222;f, are trivially just Y and L themselves since they are linear operators. The Jacobian of lensing with respect to &#981; is derived from</p><p>where x is a pixel location and &#1013; is a small perturbation to the lensing potential. The second term demonstrates that the Jacobian is the operator &#8711;f&#240;x &#254; &#8711;&#981;&#222; &#8226; &#8711;, and involves the spatial gradient of a lensed map, which itself is computed with Lenspyx. With these AD rules implemented, our curved-sky lensing model is fully compatible with the MUSE covariance calculation.</p><p>Posterior model.-In the posterior model, we assume the flat-sky approximation, so simulated maps are generated from white-noise maps FFT-convolved with appropriate kernels. We implement lensing with LenseFlow <ref type="bibr">[120]</ref>, which is GPU-compatible and automatically differentiable up to second-order as needed by the MUSE covariance calculation.</p><p>Bandpower parameters.-Both simulation and posterior models take as input a theory unlensed CMB spectrum and gravitational lensing potential spectrum. We introduce free parameters which we will infer, A EE b and A &#981;&#981; b , which control the amplitude of these spectra relative to a fiducial model within each bin. Our EE bandpowers have linearly spaced bins with &#916;l &#188; 50 between 350 and 3500, and our &#981;&#981; bandpowers have 12 logarithmically spaced bins between 20 and 3000. We take the fiducial model to be the Planck best-fit spectrum, but this introduces no dependence on Planck, rather is just a choice of definition of the bandpower amplitude parameters. Within a bin, the parameters scale the theory spectrum uniformly across l's. One can show that with this choice, our inferred A EE b and A &#981;&#981; b parameters correspond to a constraint on an inversevariance weighted average of the theory spectrum across the bin. Since this inverse-variance weighting arises only implicitly, we also construct explicit bandpower window functions (used in parameter estimation) from an interpolation of the diagonal of the MUSE covariance, &#931; MUSE , to every l. The tests on simulations in Sec. IVA 2, which demonstrate no bias and correct scatter of cosmological parameters derived from simulations, provide verification of this procedure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">NFFT-based projections</head><p>Within our model, we perform interpolation between maps in different projections in four different places: 1) as mentioned, in the simulation model, the lensed CMB is projected from HEALpix N side &#188; 2048 to a Lambert pixelization, 2) in both simulation and posterior models, the transfer function model involves projecting from a Lambert to an Equirectangular pixelization, applying some weights, then projecting back to Lambert, 3) when computing the transfer function, a set of end-to-end pipeline simulations are compared to a direct projection of the HEALpix N side &#188; 8192 maps which are input to the endto-end simulations, and 4) to model deflections due to relativistic aberration.</p><p>To ensure easy differentiability, GPU compatibility, and the best possible accuracy, we perform these interpolations using nonuniform FFTs (NFFTs). NFFT interpolation implicitly uses information from all pixels to compute the interpolated value at a given location, as opposed to other typically-used interpolations such as bilinear interpolation, which uses only the nearest pixel neighbors. It is exact for a band-limited signal (i.e., when there is no power beyond the Nyquist frequency) and remains very accurate for nearly band-limited signals such as ours, where power beyond the Nyquist frequency is highly suppressed by the beam.</p><p>We use the CPU and GPU compatible NFFT library NFFT.jl <ref type="bibr">[121]</ref>. An interpolation is performed by a pair of forward and adjoint NFFTs, F , with appropriate grid points, &#947;, which describe the relative location of the pixel centers in the map projections being interpolated between</p><p>The adjoint of the projection operator, which arises in automatic differentiation, is then simply</p><p>We verify these projections are accurate enough in Fig. <ref type="figure">22</ref>. For use cases (1) and ( <ref type="formula">2</ref>), in the left panel, we generate a simulated lensed CMB map in the Lambert pixelization. We then project to equirectangular and back to Lambert, as well as to HEALpix N side &#188; 2048 and back to Lambert. We then examine the change to the spectrum of the original map. Figure <ref type="figure">22</ref> demonstrates this change is &#8818;0.3% up to the maximum multipole used in the analysis, l &lt; 3500. For use case <ref type="bibr">(3)</ref>, in the right panel, we compare the spectrum of the projected HEALpix N side &#188; 8192 maps to the input theory spectrum, again finding &#8818;0.5% changes up to the maximum multipole. These differences are well below our uncertainties; for example, typical EE bandpower errors are at the 5% to 10% level.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Relativistic aberration</head><p>Relativistic aberration distorts the CMB maps due to our proper motion relative to the CMB rest frame <ref type="bibr">[122]</ref>. This is a small but non-negligible effect, which we expect could bias &#952; s by as much as &#8764;0.3&#963; &#952; s if unaccounted for. One method for modeling aberration is to apply a first-order correction to the power spectrum <ref type="bibr">[122]</ref>,</p><p>where &#946; is our peculiar Lorentz factor and hcos &#968;i measures the average angle to the direction of proper motion for pixels in the field. We note that this correction applies to a lensed spectrum and does not apply to the unlensed spectra inferred here. In particular, some a priori unknown part of the aberration deflection is interpreted in the posterior as very long wavelength gravitational lensing deflection, and then is automatically removed when producing inferences of the unlensed spectrum. Equation (A4) is thus an upper bound on the impact aberration might have on our particular analysis, but it could be much smaller. Without knowing the exact magnitude, we choose to simply include the aberration effect directly in the map-level model. Since the effect is small, it is sufficient to only include it in the simulation model to prevent biases. The deflection angle, &#916;&#968;, is given by</p><p>where &#968; is the angle on the sphere to the direction of proper motion. We assume a fixed velocity of &#946; &#188; 0.00128 and a direction of &#240;l; b&#222; &#188; &#240;264&#176;; 48&#176;&#222; <ref type="bibr">[1]</ref>, ignoring next-order effects from uncertainties on these values. The deflection is performed via NFFTs.</p><p>We note that we do not model the CMB dipole also induced by our proper motion, as this is filtered out by timestream and map-level high-pass filters.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Beams</head><p>Next, the CMB maps are convolved with a model for the polarized instrument beams at each frequency, B &#957; P &#240;&#946; n ; &#946; &#957; pol &#222;. We assume a rotationally symmetric convolution kernel, performed on the Lambert maps with FFTs in both simulation and posterior models. The shape of the convolution kernel in real space, B&#240;&#952;&#222;, or in multipole space, B l , is controlled by free parameters which are marginalized over in the analysis. These parameters include a set of temperature beam eigenmode amplitudes, &#946; n , and the polarization fraction of the beam sidelobes at each frequency, &#946; &#957; pol . We first describe the temperature beams, then how we derive the polarization beams from them. More detail on the beam determination will be presented in Huang et al., (to be published).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>a. Temperature beams</head><p>The first step in determining our polarized beams is to determine the temperature beams. At low radii (&#8818;2 arcmin), temperature beams are determined from observations of several bright AGN in our observing field. The AGN, however, do not have sufficient brightness to fully map out the beam at larger scales (referred to as the beam "sidelobes"), where instead we use observations of Saturn. In turn, observations of Saturn cannot be used at small scales, since its intensity saturates detectors and changes the response as compared to typical CMB observations. Therefore, we stitch the AGN and Saturn maps into projections of the HEALpix N side &#188; 8192 maps, which are input to mock observation (labeled "projskies") vs the input theory which generated those HEALpix maps. Blue bands represent 1 and 2&#963; Monte Carlo error from the finite number of simulations used to compute the blue curve. This confirms the "projskies" sufficiently recover the input theory so as not to bias the transfer function calculation, where they are used. For reference, this plot also shows the expected change to the power spectra if bilinear interpolation was instead used, which shows much worse performance than NFFT interpolation. a composite beam map, which uses the AGN measurements at low radii, and Saturn for the sidelobes.</p><p>To create the composite map, we must account for several systematic effects. First, Saturn is very slightly extended. Before stitching, we convolve the AGN maps with a disk of 8.55 arcsec (the average angular diameter of Saturn during our observations). We later deconvolve the same disk from the final beam. The SPT-3G detectors have a finite response time, which spreads signals along the scan direction. While the response time is constant, the on-sky scan speed varies with declination. Therefore, we deconvolve the map-space effect of the time constants from each map (both for individual AGN and Saturn), and then reconvolve with the mean effect of the time constant over the SPT-3G winter field footprint. We also attempt to remove background signals (primarily the CMB) from both the Saturn maps and AGN maps. Since Saturn moves across the sky, we are able to subtract direct observations of the backgrounds from the Saturn maps. To do this, we mock observe Planck maps for each of our Saturn observations, and subtract the mockobserved maps from our Saturn maps. For the AGN maps, we are unable to do this. However, we are only concerned with small scale (&#8818;7 arcmin) information in the AGN maps. On these scales, the CMB varies very little, and can be modeled with only low-order modes. Therefore, we fit a simple background model to each AGN consisting of a slope in radius and an offset. We fit these parameters (using the Saturn maps as a reference for the beam) at 95 and 150 GHz, since the CMB is expected to be common between bands, and the 220 GHz maps are significantly noisier (both from instrumental noise, and higher astrophysical backgrounds). We subtract the background model from all three bands.</p><p>Finally, we apply a very basic model of intra-band frequency dependence to the beam. We model the beam at each band as a Gaussian made of two components, one which is fixed across the band, and one which scales with frequency. We model the frequency scaling as &#963;&#240;&#957;&#222; &#8733; 1=&#957;, which is the expected behavior for a diffraction-limited beam. We use the same constant of proportionality across all three bands. In order to fit these parameters, we assume that the 95 GHz beam is diffraction limited (i.e., the frequency independent component is nonexistent). We approximate the "effective beam frequency" for each band using the SPT-3G mean bandpass functions, and the source spectrum. Then, we fit the three remaining parameters (the constant of proportionality in the frequency dependent portion, and the two frequency independent portions of the 150 and 220 GHz beams) using observations of the brightest AGN. This simple model allows us to approximate the change in the beam by calculating the effective beam frequency for different source spectra, and convolving the beam maps (or multiplying their spectra) with the appropriate Gaussian. We apply this model to transform the final beam from the mean AGN spectrum to the CMB spectrum, and to transform the Saturn maps to match the AGN spectrum before stitching.</p><p>With the systematic effects accounted for, the stitching process is relatively simple. For each pair of AGN and Saturn maps, we fit for an amplitude to account for the difference in intensity between the sources. As discussed above, we cannot use the innermost portion of the Saturn maps. The AGN maps also become noise-dominated at large distances from the source, so we perform this fit over an annulus with inner radii of (2.25, 1.5, 1.0) arcmin, and outer radii of (3.5, 3.0, 2.5) arcmin at <ref type="bibr">(95,</ref><ref type="bibr">150,</ref><ref type="bibr">220)</ref> GHz. Finally, each composite map is constructed by linearly transitioning from the AGN map to the Saturn map over the same annulus.</p><p>We make composite maps from all pairs of AGN and Saturn maps. We also vary several systematic parameters: the choice of stitching radii, uncertainty in the time constants, the choice of the background fitting region. For each of these variations, we create the full set of composite maps. Then, we construct cross spectra of composite maps such that none of the component maps are the same. If we included the same component map more than once, we would incur a noise bias in the beam. To measure the final beam, we take the mean of these cross spectra and apply the two final effects described above (specifically, deconvolving the Saturn disk, and converting from the AGN spectrum to the CMB spectrum).</p><p>The beam uncertainty is measured from the cross spectra. By including both different Saturn and AGN maps, as well as the systematic variations, we capture both the map noise, systematic uncertainty from our analysis choices, and systematic differences between the AGN. Using the Monte Carlo method, this uncertainty is propagated to a covariance between temperature B &#957; l 's at each frequency, &#957;, and multipole, l. Finally, we add one additional uncertainty directly to the covariance matrix, which accounts for uncertainty in the detector bandpasses and the AGN spectral index. This covariance is decomposed into eigenmodes, and in the analysis the amplitude of five eigenmodes, &#946; n is free to vary jointly with the bandpower and other systematics parameters. We note that these are eigenmodes of the joint covariance between frequencies and multipoles, so each &#946; n changes the beam simultaneously at each frequency (in different ways at each frequency). The total uncertainty in the determination of the temperature beams is 0.2% at l &#188; 3000, and is subdominant to the uncertainty on polarized beams which we now describe.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>b. Polarization beams</head><p>While the temperature beams are mapped out to very high signal-to-noise at angular scales relevant to this analysis via observations of bright point sources, we lack sufficiently bright polarized sources to similarly map the sidelobe response of the polarized beams. Prior to unblinding, we had assumed the polarized and temperature beams were identical at the scales used in the analysis. However, the temperature beam has significant diffuse sidelobes due to diffraction and scattering that may not be fully polarized. To account for this, we devised a model of the beams which allows the polarization of the beam sidelobes to vary. This model assumes the beams are made up of a fully polarized main central beam, and a diffuse beam sidelobe which may be partially unpolarized.</p><p>We consider four components in the model for the main beam 1) Gaussian illumination pattern of the primary by the detector lenslets, 2) truncation of that illumination pattern by the Lyot stop, 3) averaging the frequency dependent beam over the detector bandpass weighted by a CMB spectrum, and finally 4) Gaussian broadening of the diffracted beam due to a frequency independent geometric aberration.</p><p>We model the electric field illumination pattern on the primary mirror as a truncated Gaussian,</p><p>where &#963; 0 is the width of the Gaussian beam from the lenslets at some arbitrary fiducial frequency &#957; 0 , and &#928;&#240;R&#222; is a disk of radius R,</p><p>For SPT-3G, R &#8776; 3.75 m. The illumination on the sky can then be computed from the Fraunhofer diffraction equation,</p><p>U&#240;&#952;; &#957;&#222; &#188; 2&#960;</p><p>where z is the distance to the image plane, &#961; is the distance from the origin on the image plane, and J 0 is the zeroth order Bessel function. In the second line, we have taken the small angle approximation &#961;=z &#188; &#952; to put this in terms of angle on the sky. The beam on the sky is the intensity which is the square of the electric field pattern, B&#240;&#952;; &#957;&#222; ideal &#188; U&#240;&#952;; &#957;&#222; 2 : &#240;A10&#222;</p><p>The beam for our wide frequency bands is then integrated over the bandpass and source spectrum. We integrate over the beam intensity, since the electric field phases will decohere at any significant frequency separation. Adjacent frequencies where phase coherence matters have very nearly the same illumination pattern, because both the spectra of our sources and bandpasses vary slowly. Thus, we have</p><p>where t&#240;&#957;&#222; is the bandpass function, and I&#240;&#957;&#222; &#188; dB dT j T&#188;T CMB is the observed spectrum.</p><p>Finally, we need to account for the effect of geometric aberrations. This is simply a matter of convolving B&#240;&#952;&#222; ideal with a Gaussian, a representation of geometric aberrations, so the final beam is</p><p>In principle, we expect &#963; geom to be the same for all three bands. This model for the main beam has five free parameters, &#963; 0 , &#963; geom , and an overall amplitude at each frequency, A &#957; , and predicts the main beam simultaneously in each of our bands. We fit this model to the observed temperature beams as determined in the previous section, limiting to an inner radius of &#952; &lt; 0.75 arcmin where we observe a good fit, suggesting the main beam dominates and the fit is not biased by the presence of temperature beam sidelobes. For this fit, we construct a &#967; 2 of the form</p><p>2 , which ignores uncertainties in the temperature beams, since we expect the systematic uncertainty from our choice of physical model to dominate to the total uncertainties in fit.</p><p>With this fiducial main beam determined, we denote the difference between this and the temperature beam as the sidelobes, and model the polarized beams as the sum of the fixed main beam and the sidelobes, the latter scaled by some unknown scale-independent polarization fractions, &#946; &#957; pol . That is, the polarized beams used in our analysis are</p><p>where the temperature beam at each frequency is</p><p>where B 0 is the best-fit beam and B n is the nth eigenmode of the beam covariance. The &#946; n is the amplitude of the nth eigenmode, which is the free parameter. The B n is normalized such that &#946; i has a unit normal prior. In Fig. <ref type="figure">23</ref>, we show the beam used in this analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Absolute calibration</head><p>In the analysis, we apply calibration factors, A &#957;;i cal , to each frequency map, &#957;, and subfield, i, independently. We assume a Gaussian prior on the A 150;i cal based on calibrating our maps to Planck and leave uniform priors for A 95;i cal and A 220;i cal . Appendix B describes our choice of prior for the 150 GHz calibration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Global polarization angle</head><p>To model a systematic error on the global polarization angle calibration, we rotate the global polarization angle of the entire field by an angle, &#968; &#957; pol , at each frequency, &#957;. The rotation is given by</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Transfer function</head><p>The transfer function accounts for the filtering of the timestreams during mapmaking described in Sec. II. In our transfer function model TF , we must approximate the impact of this filtering directly to a Lambert-projected map, since our model is not at the level of timestreams. The nontrivial procedure of sequentially deprojecting templates while masking point sources still corresponds to a linear operator, but not one which is diagonal in any easily accessible basis.</p><p>Our model begins by first reprojecting the map to an equirectangular projection using NFFTs, such that individual scans are straight lines in the horizontal direction. We then create a single map-level deprojection template by multiplying the point source mask assumed in mapmaking and applying a 1D filtering in the horizontal direction using FFTs where the 1D filter is a free function which we will fit for using simulations. This template is subtracted from the entire map (including the masked regions), before projecting back to Lambert. The operation can be represented as</p><p>where P is the NFFT projection from Lambert to equirectangular, M ptsrc is the point source mask and TF&#240;l x &#222; is the fitted FFT kernel in the horizontal direction.</p><p>To fit for the appropriate function, TF&#240;l x &#222;, we compare 1) mock observations of the signal, m, which simulate the entire timestream filtering and mapmaking process, and 2) direct projections of the simulated signal maps which were fed into the mock observations, p. The relation between these maps are</p><p>The factor of the HEALpix N side &#188; 8192 pixel window function is due to the interpolation of p onto mock timestreams during the simulation process, and does not arise in the real data. The factor of Lambert 2.25 0 pixel window function, PWF 2.25 0 arises from the binning of these timestreams into 2.25 0 resolution maps. We then minimize the function km truem model k 2 by fitting TF&#240;l x &#222; at a set of spline points. We find that a single pair of m and p simulations is sufficient to obtain a stable estimate and FIG. <ref type="figure">23</ref>. Beams used in this analysis, in real space (top row) and multipole space (bottom row), at each frequency (each column). Black lines show the temperature beams. Blue and orange lines show our best fits to the main beam, where the fitting region is denoted by the gray shaded region. We used the physical main beam for our final result, but we also verified that using the Gaussian main beam leads to no significant shifts in results (see Sec. IV C). Green lines and green bands represent the posterior mean and 1&#963; uncertainty on the polarized beams from the MUSE analysis, which includes marginalization over &#946; n , &#946; pol , and all other systematics and bandpower parameters.</p><p>have verified the result changes negligibly between a few different simulations.</p><p>The left panel of Fig. <ref type="figure">24</ref> shows the fitted TF&#240;l x &#222;. As expected, it is roughly zero until around l x &#188; 300, then transitions to unity at higher multipoles. We choose to allow some small negative values as it gives a better overall fit with no other observed impact. The right two panels of the figure then show the comparison of EE and BB spectra of the mock observations with those computed by applying the best-fit model transfer function to p. Across the multipole range used for the analysis, the model is accurate to &#8818;0.2% in EE and &#8818;2% in BB, both less than 10% of our uncertainties (which are denoted as gray bands).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">Temperature-to-polarization leakage</head><p>T-to-P leakage can be caused by a variety of sources. Gain mismatch between detector pairs leaks a scaled copy of the temperature map into polarization ("monopole" leakage), whereas differential detector pointing and beam ellipticity introduce copies of the first and second derivatives of the temperature map ["dipole" and "quadrupole" leakage; <ref type="bibr">[123,</ref><ref type="bibr">124]</ref> ]. We find no evidence of dipole or quadrupole leakage in the SPT-3G data (Fig. <ref type="figure">8</ref>), so we only correct for monopole leakage.</p><p>In the MUSE simulation and posterior model, we construct the T-to-P leakage as t &#957; Q &#188; &#240;T &#957; ; 0&#222; and t &#957; U &#188; &#240;0; T &#957; &#222;, where T &#957; is the temperature map at frequency &#957;. The temperature maps are made following the same mapmaking procedure in Sec. II. The amplitudes of the T-to-P templates, &#1013; &#957; Q and &#1013; &#957; U , are free parameters. Note that we used independent leakage parameters per frequency per subfield to be consistent with the choice of calibration parameters.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="9.">Masking</head><p>We apply a pixel mask (M pix ) in the model, which includes a border mask and a point source mask. The border mask is set by zeroing out the areas with weights less than 30% of the median, padding the border of the mask with zero within the radius of 120 arcmin and apodizing with a cosine apodization function of 60 arcmin radius. The point source mask contains emissive sources with brightness greater than 50 mJy and clusters with signal-to-noise ratio greater than 15 at 150 GHz.</p><p>We also apply a trough mask (M trough ) to exclude specific modes from analysis. As discussed in Sec. A 7, the pixel mask aliases the power to higher l x modes in equirectangular projection due to the TOD filtering, and smears out the power when the maps are projected to Lambert. Therefore, we cut out the modes with l x &lt; 400 in equirectangular projection to avoid this aliasing power. We also masked out the region that has been contaminated in the observation by a narrow-band signal around 1.1 Hz.</p><p>The mid-pass filter in the Fourier space (M fourier ) constrains the modes used in the analysis. We include modes between 350 &#8804; l &#8804; 4000.</p><p>The total mask used in the analysis is M &#188; M fourier &#8226; M trough &#8226; M pix , that is, M pix is applied first. The mask is also applied to the real data map before passed down into the analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="10.">Monte Carlo correction</head><p>With the aforementioned modeling ingredients specified, we can compare the pseudo power spectrum of the signal part of d &#957; between simulation-model simulations, as in Eq. ( <ref type="formula">15</ref>), and mock observations. We find a remaining 0.3% discrepancy at l &#8818; 1250 in the EE spectrum between mock observations and MUSE simulations. We believe this is due to residual transfer function modeling errors. Before unblinding, we implement this as an empirical multiplicative Monte Carlo (MC) correction to the A EE b parameters, with a shape fit from these simulations. We note that this correction is small relative to our uncertainties and corresponds to a 0.01&#963; shift in H 0 or a 0.07&#963; shift in &#937; b h 2 . FIG. <ref type="figure">24</ref>. Left panel: The filtering applied in the scan direction as part of the transfer function model. This function is fit on simulations at the spline interpolation points denoted with markers. Note the nonlinear y-axis scaling. Right two panels: Comparison of the EE and BB power spectra of an end-to-end pipeline simulation of the CMB as compared to the model transfer function applied to the same input sky which generated the simulation. Across the multipole range used for the analysis, the model is accurate to &#8818;0.2% in EE and &#8818;2% in BB, both less than 10% of our uncertainties (denoted as gray bands).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="11.">Noise modeling</head><p>Simulation model.-In the simulation model, we use sign-flip noise realizations generated empirically from the real data. This guarantees the noise in the simulation model has the same statistical properties at the true data noise, up to the small correlation between sign-flip realizations, which is not large enough to appreciably impact our final results.</p><p>Posterior model.-In the posterior model, we must be able to compute the posterior probability function, thus must explicitly specify a random distribution for the noise. We chose a Gaussian model with a covariance derived from the sign-flip noise realizations themselves. We model the noise as diagonal in Fourier space with isotropic noise spectra,</p><p>where A and &#945; are parameters to fit to 20 sign-flip noise realizations across the region 500 &#8804; l &#8804; 3000, separately at each frequency. We find this gives an acceptable fit, and note that any deviations of this model from true noise distribution will only lead to slight loss of optimality, not any bias.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>APPENDIX B: CALIBRATION PRIORS AND ALTERNATIVE SYSTEMATICS ESTIMATES</head><p>Here, we describe the priors on A 150;i cal which enter the MUSE analysis, and the alternative estimates of A 95;i cal , A 220;i cal , &#1013; Q , &#1013; U , and &#968; pol which are compared against the MUSE results.</p><p>In practice, calibrating to Planck is performed as two steps. First, we determine a calibration factor for temperature maps in each subfield, T i cal . Then, we determine a correction on top of this for polarization maps, P cal , which is assumed the same across all subfields. By our convention, these factors multiply the SPT data, rather than multiplying the theory model like the MUSE definition of A &#957;;i cal . The relation is thus</p><p>The temperature calibration is determined by computing the cross-spectrum of the full-depth 150 GHz SPT-3G data with the full-depth 143 GHz Planck map passed through the SPT-3G observing pipeline normalized by the crossspectrum of two half-depth SPT-3G maps,</p><p>150 cal;external &#188; SPT 150 full &#215; Planck 143 full SPT 150 half1 &#215; SPT 150 half2 :</p><p>The uncertainty on T cal;external is computed from the scatter in the quantity when it is computed using 100 different subsets of SPT observations entering the SPT maps. After unblinding, we also realized we had neglected to fold in a 0.25% uncertainty <ref type="bibr">[125]</ref> on the Planck absolute calibration <ref type="bibr">[126]</ref>, which leads so insignificant changes, but is fixed in our final results.</p><p>Although not used in MUSE, we also obtain temperature calibrations of 95 and 220 GHz SPT-3G maps by crosscorrelating against the 150 GHz map (this yields tighter constraints than calibrating these against Planck). The calibrations are</p><p>95 cal;internal &#188; SPT 150 half1 &#215; SPT 150 half2 SPT 95 half1 &#215; SPT 150 half2 T 150 cal;external ; T 220 cal;internal &#188; SPT 150 half1 &#215; SPT 150 half2 SPT 220 half1 &#215; SPT 150 half2 T 150 cal;external :</p><p>We recalibrate the Q and U maps by this factor, then correct them for temperature-to-polarization leakage and polarization angle calibration as described below. Then, we determine P 150 cal by cross-correlating the SPT 150 GHz polarization maps to Planck 143 GHz polarization maps in a manner analogous to the absolute temperature calibration in Eq. (B3). The estimated P 150 cal is then corrected by an additional factor of ffiffiffiffiffiffiffiffiffiffiffi 0.966 p to match the final recalibration factor applied to the 143 GHz Planck polarization bandpowers [Eq. ( <ref type="formula">45</ref>), <ref type="bibr">[125]</ref> ], and additionally has a 0.5% uncertainty folded in corresponding to the uncertainty on the Planck polarization calibration. Similarly used only for comparison, we then determine P 95  cal and P 220 cal by internally cross-correlating the SPT maps.</p><p>Next, we describe the alternative estimate of temperature-to-polarization leakage and global polarization angle correction which is used above in determining the calibration factors and is compared against the MUSE results in Fig. <ref type="figure">7</ref>. The main difference from the MUSE estimate is that these are performed at the power spectrum rather than at the map level, and are done with an iterative procedure which assumes a theory cosmological model rather than simultaneously and jointly with all systematics and bandpower parameters.</p><p>Independent estimates of the monopole leakage coefficients, &#1013; Q;TT , &#1013; Q;TQ , &#1013; U;TT , are formed by fitting a template to the C TQ l and C TU l cross-spectra using cross-spectra of the half-depth data maps and cross-spectra from simulations that have no monopole leakage, C TQ l;template &#188; &#1013; Q;TT C TT l;data &#254; &#1013; Q;TQ C TQ l;sim ; &#240;B4&#222; C TU l;template &#188; &#1013; U;TT C TT l;data &#254; &#1013; U;TU C TU l;sim :</p><p>In practice, C TU l;sim scatters around null, and therefore &#1013; U;TU can take arbitrary values. For this reason, the second term is</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>We note a subtle distinction between a delensed spectrum, which contains a residual lensing component which must be modeled, and our inference of an unlensed spectrum, which contains no such residual, independent of cosmological model. Although of minor impact, this explains our usage of "unlensed" throughout the text.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1"><p>https://cobaya.readthedocs.io/en/latest/</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2"><p>https://camb.info/</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3"><p>To reduce computational cost and because many observations have identical filtering, we use only</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_4"><p>5% of the total observations in this step.<ref type="bibr">5</ref> All PTEs we quote are corrected for look-elsewhere in the case that there are multiple tests. We will specify if the conversion to number of &#963; is 2-tailed, in which case negative values correspond to a PTE better than the mean expectation; otherwise, the quoted &#963; number is 1-tailed.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5"><p>See also, e.g., Freedman and Madore<ref type="bibr">[85]</ref>, Freedman et al.<ref type="bibr">[86]</ref>, and Riess et al.<ref type="bibr">[87]</ref> for comparison of alternative "rung 2" methods with the Cepheid method.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_6"><p>This is approximate because 1) we have ignored the slight L dependence of A L &#240;&#931; m&#957; &#222; and 2) their template is defined with respect to &#923;CDM models with &#931;m &#957; &#188; 0 eV.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_7"><p>Although there are models in which this is violated too, see Craig et al.<ref type="bibr">[40]</ref>.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_8"><p>This replacement does not change the low-lTT and EE likelihood, which remain PR3.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="13" xml:id="foot_9"><p>http://healpix.sourceforge.net</p></note>
		</body>
		</text>
</TEI>
