Abstract While the Lomb–Scargle periodogram is foundational to astronomy, it has a significant shortcoming: the variance in the estimated power spectrum does not decrease as more data are acquired. Statisticians have a 60 yr history of developing variance-suppressing power spectrum estimators, but most are not used in astronomy because they are formulated for time series with uniform observing cadence and without seasonal or daily gaps. Here we demonstrate how to apply the missing-data multitaper power spectrum estimator to spacecraft data with uniform time intervals between observations but missing data during thruster fires or momentum dumps. TheF-test for harmonic components may be applied to multitaper power spectrum estimates to identify statistically significant oscillations that would not rise above a white noise–based false alarm probability. Multitapering improves the dynamic range of the power spectrum estimate and suppresses spectral window artifacts. We show that the multitaper–F-test combination applied to Kepler observations of KIC 6102338 detects differential rotation without requiring iterative sinusoid fitting and subtraction. Significant signals reside at harmonics of both fundamental rotation frequencies and suggest an antisolar rotation profile. Next we use the missing-data multitaper power spectrum estimator to identify the oscillation modes responsible for the complex “scallop-shell” shape of the K2 light curve of EPIC 203354381. We argue that multitaper power spectrum estimators should be used for all time series with regular observing cadence.
more »
« less
This content will become publicly available on June 1, 2026
Multitaper Magnitude‐Squared Coherence for Time Series With Missing Data: Understanding Oscillatory Processes Traced by Multiple Observables
Abstract To explore the hypothesis of a common source of variability in two time series, observers may estimate the magnitude‐squared coherence (MSC), which is a frequency‐domain view of the cross correlation. For time series that do not have uniform observing cadence, MSC can be estimated using Welch's overlapping segment averaging. However, multitaper has superior statistical properties to Welch's method in terms of the tradeoff between bias, variance, and bandwidth. The classical multitaper technique has recently been extended to accommodate time series with underlying uniform observing cadence from which some observations are missing. This situation is common for solar and geomagnetic data sets, which may have gaps due to breaks in satellite coverage, instrument downtime, or poor observing conditions. We demonstrate the scientific use of missing‐data multitaper magnitude‐squared coherence by detecting known solar mid‐term oscillations in simultaneous, missing‐data time series of solar Lyman flux and geomagnetic Disturbance Storm Time index. Due to their superior statistical properties, we recommend that multitaper methods be used for all heliospheric time series with underlying uniform observing cadence.
more »
« less
- Award ID(s):
- 2307978
- PAR ID:
- 10625827
- Publisher / Repository:
- Wiley
- Date Published:
- Journal Name:
- Earth and Space Science
- Volume:
- 12
- Issue:
- 6
- ISSN:
- 2333-5084
- Subject(s) / Keyword(s):
- spectral analysis missing data solar midterm periodicities
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Solar flare prediction studies have been recently conducted with the use of Space-Weather MDI (Michelson Doppler Imager on board Solar and Heliospheric Observatory) Active Region Patches (SMARPs) and Space-Weather HMI (Helioseismic and Magnetic Imager on board Solar Dynamics Observatory) Active Region Patches (SHARPs), which are two currently available data products containing magnetic field characteristics of solar active regions (ARs). The present work is an effort to combine them into one data product, and perform some initial statistical analyses in order to further expand their application in space-weather forecasting. The combined data are derived by filtering, rescaling, and merging the SMARP and SHARP parameters, which can then be spatially reduced to create uniform multivariate time series. The resulting combined MDI–HMI data set currently spans the period between 1996 April 4 and 2022 December 13, and may be extended to a more recent date. This provides an opportunity to correlate and compare it with other space-weather time series, such as the daily solar flare index or the statistical properties of the soft X-ray flux measured by the Geostationary Operational Environmental Satellites. Time-lagged cross correlation indicates that a relationship may exist, where some magnetic field properties of ARs lead the flare index in time. Applying the rolling-window technique makes it possible to see how this leader–follower dynamic varies with time. Preliminary results indicate that areas of high correlation generally correspond to increased flare activity during the peak solar cycle.more » « less
-
Abstract The Vera C. Rubin Observatory is expected to start the Legacy Survey of Space and Time (LSST) in early to mid-2025. This multiband wide-field synoptic survey will transform our view of the solar system, with the discovery and monitoring of over five million small bodies. The final survey strategy chosen for LSST has direct implications on the discoverability and characterization of solar system minor planets and passing interstellar objects. Creating an inventory of the solar system is one of the four main LSST science drivers. The LSST observing cadence is a complex optimization problem that must balance the priorities and needs of all the key LSST science areas. To design the best LSST survey strategy, a series of operation simulations using the Rubin Observatory scheduler have been generated to explore the various options for tuning observing parameters and prioritizations. We explore the impact of the various simulated LSST observing strategies on studying the solar system’s small body reservoirs. We examine what are the best observing scenarios and review what are the important considerations for maximizing LSST solar system science. In general, most of the LSST cadence simulations produce ±5% or less variations in our chosen key metrics, but a subset of the simulations significantly hinder science returns with much larger losses in the discovery and light-curve metrics.more » « less
-
Analyzing sequential data is crucial in many domains, particularly due to the abundance of data collected from the Internet of Things paradigm. Time series classification, the task of categorizing sequential data, has gained prominence, with machine learning approaches demonstrating remarkable performance on public benchmark datasets. However, progress has primarily been in designing architectures for learning representations from raw data at fixed (or ideal) time scales, which can fail to generalize to longer sequences. This work introduces a \textit{compositional representation learning} approach trained on statistically coherent components extracted from sequential data. Based on a multi-scale change space, an unsupervised approach is proposed to segment the sequential data into chunks with similar statistical properties. A sequence-based encoder model is trained in a multi-task setting to learn compositional representations from these temporal components for time series classification. We demonstrate its effectiveness through extensive experiments on publicly available time series classification benchmarks. Evaluating the coherence of segmented components shows its competitive performance on the unsupervised segmentation task.more » « less
-
Abstract Electron fluxes (20 eV–2 MeV, RBSP‐A satellite) show reasonable simple correlation with a variety of parameters (solar wind, IMF, substorms, ultralow frequency (ULF) waves, geomagnetic indices) over L‐shells 2–6. Removing correlation‐inflating common cycles and trends (using autoregressive and moving average terms in an ARMAX analysis) results in a 10 times reduction in apparent association between drivers and electron flux, although many are still statistically significant (p < 0.05). Corrected influences are highest in the 20 eV–1 keV and 1–2 MeV electrons, more modest in the midrange (2–40 keV). Solar wind velocity and pressure (but not number density), IMF magnitude (with lower influence ofBz), SME (a substorm measure), a ULF wave index, and geomagnetic indices Kp and SymH all show statistically significant associations with electron flux in the corrected individual ARMAX analyses. We postulate that only pressure, ULF waves, and substorms are direct drivers of electron flux and compare their influences in a combined analysis. SME is the strongest influence of these three, mainly in the eV and MeV electrons. ULF is most influential on the MeV electrons. Pressure shows a smaller positive influence and some indication of either magnetopause shadowing or simply compression on the eV electrons. While strictly predictive models may improve forecasting ability by including indirect driver and proxy parameters, and while these models may be made more parsimonious by choosing not to explicitly model time series behavior, our present analyses include time series variables in order to draw valid conclusions about the physical influences of exogenous parameters.more » « less
An official website of the United States government
