Abstract While the Lomb–Scargle periodogram is foundational to astronomy, it has a significant shortcoming: the variance in the estimated power spectrum does not decrease as more data are acquired. Statisticians have a 60 yr history of developing variance-suppressing power spectrum estimators, but most are not used in astronomy because they are formulated for time series with uniform observing cadence and without seasonal or daily gaps. Here we demonstrate how to apply the missing-data multitaper power spectrum estimator to spacecraft data with uniform time intervals between observations but missing data during thruster fires or momentum dumps. TheF-test for harmonic components may be applied to multitaper power spectrum estimates to identify statistically significant oscillations that would not rise above a white noise–based false alarm probability. Multitapering improves the dynamic range of the power spectrum estimate and suppresses spectral window artifacts. We show that the multitaper–F-test combination applied to Kepler observations of KIC 6102338 detects differential rotation without requiring iterative sinusoid fitting and subtraction. Significant signals reside at harmonics of both fundamental rotation frequencies and suggest an antisolar rotation profile. Next we use the missing-data multitaper power spectrum estimator to identify the oscillation modes responsible for the complex “scallop-shell” shape of the K2 light curve of EPIC 203354381. We argue that multitaper power spectrum estimators should be used for all time series with regular observing cadence.
more »
« less
This content will become publicly available on June 1, 2026
Multitaper Magnitude‐Squared Coherence for Time Series With Missing Data: Understanding Oscillatory Processes Traced by Multiple Observables
Abstract To explore the hypothesis of a common source of variability in two time series, observers may estimate the magnitude‐squared coherence (MSC), which is a frequency‐domain view of the cross correlation. For time series that do not have uniform observing cadence, MSC can be estimated using Welch's overlapping segment averaging. However, multitaper has superior statistical properties to Welch's method in terms of the tradeoff between bias, variance, and bandwidth. The classical multitaper technique has recently been extended to accommodate time series with underlying uniform observing cadence from which some observations are missing. This situation is common for solar and geomagnetic data sets, which may have gaps due to breaks in satellite coverage, instrument downtime, or poor observing conditions. We demonstrate the scientific use of missing‐data multitaper magnitude‐squared coherence by detecting known solar mid‐term oscillations in simultaneous, missing‐data time series of solar Lyman flux and geomagnetic Disturbance Storm Time index. Due to their superior statistical properties, we recommend that multitaper methods be used for all heliospheric time series with underlying uniform observing cadence.
more »
« less
- Award ID(s):
- 2307978
- PAR ID:
- 10625827
- Publisher / Repository:
- Wiley
- Date Published:
- Journal Name:
- Earth and Space Science
- Volume:
- 12
- Issue:
- 6
- ISSN:
- 2333-5084
- Subject(s) / Keyword(s):
- spectral analysis missing data solar midterm periodicities
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Geomagnetically induced currents (GICs) pose a significant space weather hazard, driven by geomagnetic field variation due to the coupling of the solar wind to the magnetosphere-ionosphere system. Extensive research has been dedicated to understanding ground-level geomagnetic field perturbations as a GIC proxy. Still, the non-uniform aspect of geomagnetic fluctuations make it difficult to fully characterize the ground-level magnetic field across large regions of the globe. Here, we focus on localized geomagnetic disturbances (LGMDs) in the North American region and specify the degree to which these disturbances are localized. Employing the electrodynamics-informed Spherical Elementary Current Systems (SECS) method, we spatially interpolate magnetic field perturbations between ground-based magnetometer stations. In this way, we represent the ground magnetic field as a series of heatmaps at high temporal and spatial resolution. We leverage heatmaps from storm time during solar cycle 24 to automatically identify LGMDs. We build a statistical picture of the frequency with which LGMDs occur, their scale sizes, and their latitude-longitude aspect ratios. Additionally, we use an information theory approach to quantify the dependence of these three attributes on the phase of the solar cycle. We find no clear influence of the solar cycle on any of the three attributes. We offer some avenues toward explaining why LGMDs might behave broadly the same whether they arise during solar maximum or solar minimum.more » « less
-
Abstract Solar flare prediction studies have been recently conducted with the use of Space-Weather MDI (Michelson Doppler Imager on board Solar and Heliospheric Observatory) Active Region Patches (SMARPs) and Space-Weather HMI (Helioseismic and Magnetic Imager on board Solar Dynamics Observatory) Active Region Patches (SHARPs), which are two currently available data products containing magnetic field characteristics of solar active regions (ARs). The present work is an effort to combine them into one data product, and perform some initial statistical analyses in order to further expand their application in space-weather forecasting. The combined data are derived by filtering, rescaling, and merging the SMARP and SHARP parameters, which can then be spatially reduced to create uniform multivariate time series. The resulting combined MDI–HMI data set currently spans the period between 1996 April 4 and 2022 December 13, and may be extended to a more recent date. This provides an opportunity to correlate and compare it with other space-weather time series, such as the daily solar flare index or the statistical properties of the soft X-ray flux measured by the Geostationary Operational Environmental Satellites. Time-lagged cross correlation indicates that a relationship may exist, where some magnetic field properties of ARs lead the flare index in time. Applying the rolling-window technique makes it possible to see how this leader–follower dynamic varies with time. Preliminary results indicate that areas of high correlation generally correspond to increased flare activity during the peak solar cycle.more » « less
-
Abstract The Vera C. Rubin Observatory is expected to start the Legacy Survey of Space and Time (LSST) in early to mid-2025. This multiband wide-field synoptic survey will transform our view of the solar system, with the discovery and monitoring of over five million small bodies. The final survey strategy chosen for LSST has direct implications on the discoverability and characterization of solar system minor planets and passing interstellar objects. Creating an inventory of the solar system is one of the four main LSST science drivers. The LSST observing cadence is a complex optimization problem that must balance the priorities and needs of all the key LSST science areas. To design the best LSST survey strategy, a series of operation simulations using the Rubin Observatory scheduler have been generated to explore the various options for tuning observing parameters and prioritizations. We explore the impact of the various simulated LSST observing strategies on studying the solar system’s small body reservoirs. We examine what are the best observing scenarios and review what are the important considerations for maximizing LSST solar system science. In general, most of the LSST cadence simulations produce ±5% or less variations in our chosen key metrics, but a subset of the simulations significantly hinder science returns with much larger losses in the discovery and light-curve metrics.more » « less
-
Analyzing sequential data is crucial in many domains, particularly due to the abundance of data collected from the Internet of Things paradigm. Time series classification, the task of categorizing sequential data, has gained prominence, with machine learning approaches demonstrating remarkable performance on public benchmark datasets. However, progress has primarily been in designing architectures for learning representations from raw data at fixed (or ideal) time scales, which can fail to generalize to longer sequences. This work introduces a \textit{compositional representation learning} approach trained on statistically coherent components extracted from sequential data. Based on a multi-scale change space, an unsupervised approach is proposed to segment the sequential data into chunks with similar statistical properties. A sequence-based encoder model is trained in a multi-task setting to learn compositional representations from these temporal components for time series classification. We demonstrate its effectiveness through extensive experiments on publicly available time series classification benchmarks. Evaluating the coherence of segmented components shows its competitive performance on the unsupervised segmentation task.more » « less
An official website of the United States government
