skip to main content

Title: Identifying Different Classes of Seismic Noise Signals Using Unsupervised Learning

Proper classification of nontectonic seismic signals is critical for detecting microearthquakes and developing an improved understanding of ongoing weak ground motions. We use unsupervised machine learning to label five classes of nonstationary seismic noise common in continuous waveforms. Temporal and spectral features describing the data are clustered to identify separable types of emergent and impulsive waveforms. The trained clustering model is used to classify every 1 s of continuous seismic records from a dense seismic array with 10–30 m station spacing. We show that dominate noise signals can be highly localized and vary on length scales of hundreds of meters. The methodology demonstrates the complexity of weak ground motions and improves the standard of analyzing seismic waveforms with a low signal‐to‐noise ratio. Application of this technique will improve the ability to detect genuine microseismic events in noisy environments where seismic sensors record earthquake‐like signals originating from nontectonic sources.

more » « less
Award ID(s):
Author(s) / Creator(s):
 ;  ;  ;  
Publisher / Repository:
DOI PREFIX: 10.1029
Date Published:
Journal Name:
Geophysical Research Letters
Medium: X
Sponsoring Org:
National Science Foundation
More Like this

    Sedimentary basins can strongly amplify seismic waves from earthquakes. To better predict future ground motions, detailed knowledge of the sediment thickness and internal structure of basins is required. We image the sediment-to-bedrock interface of the Kanto Basin in Japan using the P-wave reflectivity response from earthquake and ambient seismic noise autocorrelation functions (ACFs) at 286 shallow borehole stations. Earthquake ACFs are computed using P-wave records from 50 Mw 6+ teleseismic events. Noise ACFs are obtained using 1 month of continuous data. Both methods are used to retrieve P-wave traveltimes between the surface and the bedrock interface and map the basin basement geometry. Our prediction of the basement depth agrees generally well with that from a reference velocity model, except for smoother variations in the central part of the basin. Using full-wavefield simulations, we show that the nature of the autocorrelated wavefield has a significant impact on the shape of the ACF waveforms and that earthquake ACFs yield more accurate results in the Kanto Basin.

    more » « less

    Seismograms contain multiple sources of seismic waves, from distinct transient signals such as earthquakes to continuous ambient seismic vibrations such as microseism. Ambient vibrations contaminate the earthquake signals, while the earthquake signals pollute the ambient noise’s statistical properties necessary for ambient-noise seismology analysis. Separating ambient noise from earthquake signals would thus benefit multiple seismological analyses. This work develops a multitask encoder–decoder network named WaveDecompNet to separate transient signals from ambient signals directly in the time domain for 3-component seismograms. We choose the active-volcanic Big Island in Hawai’i as a natural laboratory given its richness in transients (tectonic and volcanic earthquakes) and diffuse ambient noise (strong microseism). The approach takes a noisy 3-component seismogram as input and independently predicts the 3-component earthquake and noise waveforms. The model is trained on earthquake and noise waveforms from the STandford EArthquake Dataset (STEAD) and on the local noise of seismic station IU.POHA. We estimate the network’s performance by using the explained variance metric on both earthquake and noise waveforms. We explore different neural network designs for WaveDecompNet and find that the model with long-short-term memory (LSTM) performs best over other structures. Overall, we find that WaveDecompNet provides satisfactory performance down to a signal-to-noise ratio (SNR) of 0.1. The potential of the method is (1) to improve broad-band SNR of transient (earthquake) waveforms and (2) to improve local ambient noise to monitor the Earth’s structure using ambient noise signals. To test this, we apply a short-time average to a long-time average filter and improve the number of detected events. We also measure single-station cross-correlation functions of the recovered ambient noise and establish their improved coherence through time and over different frequency bands. We conclude that WaveDecompNet is a promising tool for a broad range of seismological research.

    more » « less
  3. ABSTRACT The ability to monitor seismicity and structural integrity of a mine using seismic noise can have great implication for detecting and managing ground-control hazards. The noise wavefield, however, is complicated by induced seismicity and heavy machinery associated with mining operations. In this study, we investigate the nature of time-dependent noise cross-correlations functions (CCFs) across an active underground longwall coal mine. We analyze one month of continuous data recorded by a surface 17 geophone array with an average station spacing of ∼200 m. To extract coherent seismic signals, we calculate CCFs between all stations for each 5-min window. Close inspection of all 5-min CCFs reveals waveforms that can be categorically separated into two groups, one with strong and coherent 1–5 Hz signals and one without. Using a reference station pair, we statistically isolate time windows within each group based on the correlation coefficient between each 5-min CCF and the monthly stacked CCF. The daily stacked CCFs associated with a high correlation coefficient show a clear temporal variation that is consistent with the progression of mining activity. In contrast, the daily stacked CCFs associated with a low correlation coefficient remain stationary throughout the recording period in line with the expected persistent background noise. To further understand the nature of the high correlation coefficient CCFs, we perform 2D and 3D back projection to determine and track the dominant noise source location. Excellent agreement is observed on both short (5-min) and long (daily) time scales between the CCF determined source locations, the overall migration of the active mining operation, and cataloged seismic event locations. The workflow presented in this study demonstrates an effective way to identify and track mining induced signals, in which CCFs associated with background noise can be isolated and used for further temporal structural integrity investigation. 
    more » « less
  4. Abstract

    Continuous seismograms contain a wealth of information with a large variety of signals with different origin. Identifying these signals is a crucial step in understanding physical geological objects. We propose a strategy to identify classes of signals in continuous single‐station seismograms in an unsupervised fashion. Our strategy relies on extracting meaningful waveform features based on a deep scattering network combined with an independent component analysis. Based on the extracted features, agglomerative clustering then groups these waveforms in a hierarchical fashion and reveals the process of clustering in a dendrogram. We use the dendrogram to explore the seismic data and identify different classes of signals. To test our strategy, we investigate a two‐day‐long seismogram collected in the vicinity of the North Anatolian Fault, Turkey. We analyze the automatically inferred clusters' occurrence rate, spectral characteristics, cluster size, and waveform and envelope characteristics. At a low level in the cluster hierarchy, we obtain three clusters related to anthropogenic and ambient seismic noise and one cluster related to earthquake activity. At a high level in the cluster hierarchy, we identify a seismic burst that includes around 200 events with similar waveforms and high‐frequent signals with correlating envelopes and an anthropogenic origin. The application shows that the cluster hierarchy helps to identify particular families of signals and to extract subclusters for further analysis. This is valuable when certain types of signals, such as earthquakes, are under‐represented in the data. The proposed method may also successfully discover new types of signals since it is entirely data‐driven.

    more » « less
  5. ABSTRACT Earthquake ground motions in the vicinity of receivers couple with the atmosphere to generate pressure perturbations that are detectable by infrasound sensors. These so-called local infrasound signals traverse very short source-to-receiver paths, so that they often exhibit a remarkable correlation with seismic velocity waveforms at collocated seismic stations, and there exists a simple relationship between vertical seismic velocity and pressure time series. This study leverages the large regional network of infrasound sensors in Alaska to examine local infrasound from several light to great Alaska earthquakes. We estimate seismic velocity time series from infrasound pressure records and use these converted infrasound recordings to compute earthquake magnitudes. This technique has potential utility beyond the novelty of recording seismic velocities on pressure sensors. Because local infrasound amplitudes from ground motions are small, it is possible to recover seismic velocities at collocated sites where the broadband seismometers have clipped. Infrasound-derived earthquake magnitudes exhibit good agreement with seismically derived values. This proof-of-concept demonstration of computing seismic magnitudes from infrasound sensors illustrates that infrasound sensors may be utilized as proxy vertical-component seismometers, making a new data set available for existing seismic techniques. Because single-sensor infrasound stations are relatively inexpensive and are becoming ubiquitous, this technique could be used to augment existing regional seismic networks using a readily available sensor platform. 
    more » « less