<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>False Positives in the Identification of Dynamic Earthquake Triggering</title></titleStmt>
			<publicationStmt>
				<publisher>AGU</publisher>
				<date>07/01/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10648617</idno>
					<idno type="doi">10.1029/2025JB031566</idno>
					<title level='j'>Journal of Geophysical Research: Solid Earth</title>
<idno>2169-9313</idno>
<biblScope unit="volume">130</biblScope>
<biblScope unit="issue">7</biblScope>					

					<author>Jeanne L Hardebeck</author><author>Nicolas D DeSalvio</author><author>Wenyuan Fan</author><author>Andrew J Barbour</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>Abstract</title> <p>Dynamic earthquake triggering is commonly identified through the temporal correlation between increased seismicity rates and global earthquakes that are possible triggering events. However, correlation does not imply causation. False positives may occur when unrelated seismicity rate changes coincidently occur at around the time of candidate triggers. We investigate the expected false positive rate in Southern California with global<italic>M</italic>≥6 earthquakes as candidate triggers. We compute the false positive rate by applying the statistical tests used by DeSalvio and Fan (2023),<ext-link href='https://doi.org/10.1029/2023jb026487'>https://doi.org/10.1029/2023jb026487</ext-link>to synthetic earthquake catalogs with no real dynamic triggering. We find a false positive rate of ∼3.5%–8.5% when realistic earthquake clustering is present, consistent with the 95% confidence typically used in seismology. However, when this false positive rate is applied to the tens of thousands of spatial‐temporal windows in Southern California tested in DeSalvio and Fan (2023),<ext-link href='https://doi.org/10.1029/2023jb026487'>https://doi.org/10.1029/2023jb026487</ext-link>, thousands of false positives are expected. The expected false positive occurrence is large enough to explain the observed apparent triggering following 70% of large global earthquakes (DeSalvio & Fan, 2023,<ext-link href='https://doi.org/10.1029/2023jb026487'>https://doi.org/10.1029/2023jb026487</ext-link>), without requiring any true dynamic triggering. Aside from the known triggering from the nearby El Mayor‐Cucapah, Mexico, earthquake, the spatial and temporal characteristics of the reported triggering are indistinguishable from random false positives. This implies that best practice for dynamic triggering studies that depend on temporal correlation is to estimate the false positive rate and investigate whether the observed apparent triggering is distinguishable from the correlations that may occur by chance.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Dynamic earthquake triggering due to the passing seismic waves from large distant earthquakes has been welldocumented around the globe (e.g., <ref type="bibr">Brodsky et al., 2000;</ref><ref type="bibr">Gomberg et al., 2004;</ref><ref type="bibr">Hill et al., 1993)</ref>. In case studies, it is possible to visually inspect the triggered earthquakes to observe their sudden onset or their correlation with the passing surface waves (e.g., <ref type="bibr">Hill et al., 1993;</ref><ref type="bibr">Velasco et al., 2008)</ref>. Such temporal correlations are often used to establish a causal connection between the large event and the triggered earthquakes. This inferred connection can be further evaluated with statistical tests to examine the significance of the changes in seismicity rate (e.g., <ref type="bibr">van der Elst &amp; Brodsky, 2010)</ref>. The temporal-correlation premise underlines almost all dynamic triggering studies (e.g., <ref type="bibr">Brodsky &amp; van der Elst, 2014;</ref><ref type="bibr">Hill &amp; Prejean, 2015)</ref>. However, <ref type="bibr">Pankow and Kilb (2020)</ref> found that typical approaches for determining statistically significant seismicity rate changes are inadequate for reliably identifying temporal correlations, mainly due to poor accounting for the intrinsic variability of seismicity rate. Furthermore, it has not been systematically tested whether a temporal correlation is sufficient to establish causal connections.</p><p>Since triggered seismicity is not a direct measurement of the triggering process but rather its outcome, and because natural fault systems are complex and heterogeneous, any observed coincidence could be due to the intrinsic variability of local earthquake rates. Additionally, earthquake catalog completeness and regional seismic network configuration can have a strong impact in triggering identification. Therefore, it is critical to understand and account for the possibility of false negatives (failure to detect real dynamic triggering) and false positives (identifying dynamic triggering where there is none, e.g. mistaking unrelated earthquake rate changes for triggering) in dynamic triggering identifications. However, it is challenging to analyze false identifications in case studies due to their limited samples. Systematic studies covering a large number of candidate triggering earthquakes and a large seismically active area are better suited to evaluate false identifications. However, with large data sets, it is prohibitively time-consuming to visually inspect each instance of potential triggering. Systematic dynamic triggering studies primarily use statistical tests for triggering identifications and apply the same temporal-correlation premises to draw connections between the passing seismic waves and statistically significant seismicity rate anomalies.</p><p>Commonly used statistics to determine the statistical significance of apparent seismicity rate changes are the &#946;statistic <ref type="bibr">(Matthews &amp; Reasenberg, 1988)</ref>:</p><p>and the Z-statistic <ref type="bibr">(Habermann, 1983)</ref>:</p><p>where for both equations, T a and T b are the test and reference time periods, N a and N b are the number of events in these time periods, and &#963; a and &#963; b are the standard deviations of N a and N b . The observed change in rate is considered significant at 95% confidence if the absolute value of the given statistic exceeds 1.96, assuming earthquake occurrence is random and independent. However, there are several difficulties when applying these statistics in practice: they assume independent events while earthquakes are known to cluster, earthquake rate variations make it difficult to define the reference rate (N b /T b ), standard deviations must be estimated in the context of the variable rate, and rates may be affected by catalog incompleteness. <ref type="bibr">Fan et al. (2021)</ref> and <ref type="bibr">DeSalvio and Fan (2023)</ref> implemented multiple improvements to the &#946;-statistic and Zstatistic to account for these difficulties. Following <ref type="bibr">Prejean and Hill (2018)</ref>, the significance of each statistic is evaluated empirically by comparison with the distribution of statistics for random time periods of the same duration, drawn from the reference time period, rather than using a set threshold of 1.96. This accounts for intrinsic earthquake rate variability, as long as that variability is represented in the reference time period, as well as relaxing the assumption that earthquakes are independent. It also makes the tests less sensitive to the estimated standard deviations, because the denominators of Equations 1 and 2 become scaling factors common to the statistics of the real and random samples. DeSalvio and Fan (2023) also estimated standard deviations empirically through repeated sampling of the catalog. DeSalvio and <ref type="bibr">Fan (2023)</ref> introduced the &#946; m -and Z m -statistics, which are analogous to the &#946;-and Z-statistics except applied to total moment instead of earthquake count, decreasing the sensitivity to catalog incompleteness at lower magnitudes. When a statistically significant seismicity rate (or moment release rate) change is identified in conjunction with passing seismic waves, a dynamic triggering case is inferred <ref type="bibr">(DeSalvio &amp; Fan, 2023)</ref>. Applying these statistical tests to Southern California, DeSalvio and <ref type="bibr">Fan (2023)</ref> reported that global M &#8805; 6 earthquakes frequently trigger small earthquakes. Of the 1,388 global candidate triggering events they considered, 70% were associated with a significant increase in seismicity rate within the next day, somewhere in a set of 185 spatial bins spanning the Southern California fault system.</p><p>This statistical procedure provides a framework to test for false positives by looking for apparent triggering in catalogs without dynamically triggered events, or in time windows not associated with the occurrence of large global earthquakes. For example, DeSalvio and Fan (2023) introduced a check on the reliability of the &#946;-, Z-, &#946; m -, and Z m -statistics for dynamic triggering, by estimating the false positive rate through application of the statistical tests to synthetic catalogs that contain no true dynamic triggering. They generated synthetic earthquake catalogs using a Poisson model, an Epidemic-Type Aftershock Sequence (ETAS) model <ref type="bibr">(Ogata, 1988)</ref>, and by randomly shifting the observed catalog in time to break the temporal associations between the catalog and candidate triggering earthquakes. Because there is no true dynamic triggering in these synthetic catalogs, the rate of apparent triggering gives the false positive rate. There are two types of errors that can lead to false positives: erroneous statistical identification of a seismicity rate change, and mistaking the random temporal correlation of a candidate event with a true rate change for causation through dynamic triggering. While only the first type of error will occur for the Poisson catalogs, both types will be present in the ETAS, shifted, and real catalogs.</p><p>These tests lead to reported false positive rates of 0.31%-2.27% for Poisson and ETAS synthetic catalogs, and 2.26%-4.53% for shifted synthetic catalogs <ref type="bibr">(DeSalvio &amp; Fan, 2023)</ref>. These are reasonably low false positive rates, consistent with the 95% confidence threshold used in many seismological studies. However, this rate of false positives, when applied to every combination of 1,388 candidate triggering events and 185 spatial bins, could result in thousands of false positives.</p><p>Here we use the statistical procedure proposed in DeSalvio and <ref type="bibr">Fan (2023)</ref> to systematically evaluate the commonly adopted assumption in dynamic triggering studies that a temporal correlation with a statistically significant seismicity rate anomaly is sufficient to establish dynamic triggering. We generate synthetic catalogs to examine the false positive rate in dynamic triggering identification and compare it to the observed rate of apparent dynamic triggering in the real Southern California catalog. We then look for any differences in the spatial and temporal characteristics of apparent triggering in the real and synthetic catalogs. Our results show that the commonly used temporal-correlation premise can lead to many false positives in dynamic triggering identification when the fault system has frequent seismicity clusters that may have been caused by processes other than the passing seismic waves.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data and Methods</head><p>We use the 10-year (2008-2017) Quake Template Matching (QTM) seismicity catalog <ref type="bibr">(Ross et al., 2019a)</ref> and 185 spatial bins representing active faults in Southern California (e.g., <ref type="bibr">Plesch et al., 2007)</ref> to analyze false positive rates in dynamic triggering due to the temporal-correlation assumption. We generate synthetic catalogs for Southern California that are different from those in DeSalvio and <ref type="bibr">Fan (2023)</ref>. Here, synthetic Poisson and ETAS earthquake catalogs are generated for each spatial bin, and each spatial bin is analyzed separately, as in the analysis of the real data in DeSalvio and <ref type="bibr">Fan (2023)</ref>. In contrast, DeSalvio and Fan (2023) used a single Poisson or ETAS catalog for all of Southern California, and analyzed the entire catalog together. Using identical analyses for the real and synthetic catalogs will lead to a more accurate assessment of the false positive rate. The smaller subcatalogs used here and for the real data are more prone to rate variability because a few temporal clusters can produce relatively large rate changes, whereas in the large entire catalog many clusters will stack to produce a more consistent rate, muting the true variability (Figure <ref type="figure">1</ref>). Analysis of the synthetics as entire catalogs therefore may underestimate the potential for false positives in the real data.</p><p>We generate synthetic Poisson and ETAS earthquake catalogs in a manner consistent with the real Southern California data for 2008 to 2017, as used in DeSalvio and <ref type="bibr">Fan (2023)</ref>, and analyze the individual spatial bins independently. For each spatial bin, we compute R, the rate of earthquakes in the QTM catalog <ref type="bibr">(Ross et al., 2019a)</ref> above the magnitude of completeness M c for that bin reported by <ref type="bibr">DeSalvio and Fan (2023)</ref>. For the Poisson catalog, we randomly select event times given this rate, and randomly select magnitudes from a Gutenberg-Richter distribution with b = 0.99. The ETAS <ref type="bibr">(Ogata, 1988)</ref> simulations use the temporal decay parameters c = 0.0001 days and p = 1, productivity parameter k = 0.0027, and productivity scaling with magnitude a = b = 0.99, following DeSalvio and <ref type="bibr">Fan (2023)</ref>. The branching ratio, n, (e.g., <ref type="bibr">Helmstetter &amp; Sornette, 2003)</ref>, the fraction of earthquakes that are aftershocks of a prior event, is computed from the other parameters, assuming M max = 6 and T = 10 years. The branching ratio varies from 0.36 to 0.69, depending on the M c of the bin. We randomly select independent event times for each bin given the independent event rate (1-n)*R. We iteratively simulate each generation of aftershocks using the inverse transform method of <ref type="bibr">Felzer et al. (2002)</ref>, which randomly selects the time until each aftershock from an inverse cumulative distribution function based on Omori's Law, and its magnitude from an inverse cumulative distribution function based on the Gutenberg-Richter distribution. To simulate the global candidate triggering events, we generate 1,500 global events occurring at random times uniformly spanning a 10 year period, and remove the first and last 0.5 years.</p><p>Additionally, we expand the shifted synthetic catalog tests from the 8 spatial bins analyzed by <ref type="bibr">DeSalvio and Fan (2023)</ref> to include all 185 bins. We produce 6 versions of the shifted catalog, considering the relative time shift between the QTM and International Seismological Center (ISC) catalogs. For the first 4 shifted synthetic catalog tests, we randomly select a positive or negative time shift between 1 week and 1 year and add this time shift to the origin time of each global event of the ISC catalog before performing the seismicity rate anomaly analysis. For 3 of these tests, we wrap around synthetic candidate events that are shifted out of the study time period, to maintain the same number of global events. For the 5th catalog, we randomly select origin times for the global events, breaking any patterns in the occurrence times of global events. For the 6th catalog, we use different origin time shifts for each spatial bin.</p><p>For all synthetic catalogs, each spatial bin is analyzed for each global event, for 4 test time periods following the event (T a = 2-, 6-, 12-and 24-hr). We compute the &#946;-, Z-, &#946; m -, and Z m -statistics following DeSalvio and Fan (2023), with a reference time period of T b = 60 days for &#946;-and &#946; m -statistics and T b = 30 days for Z-and Z mstatistics. The false positive rate is taken as the fraction of trials with enough events for analysis (&gt;3 events in the test time period, and &#8805;10 in the reference period) for which the given statistic is significant at 95% confidence. The 95% significance value ranges from &#8764;2.2 to &#8764;3.3 for both the real and synthetic catalogs, varying for each test because it's based on the distribution of values for randomly selected time periods.</p><p>Unlike the Poisson and ETAS catalogs, the shifted synthetic catalogs capture any correlation between adjacent spatial bins due to their overlap, as well as any spatial-temporal variations in earthquake rate due to processes other than earthquake-to-earthquake triggering. Using the shifted catalogs, we investigate the number of false positives as a function of spatial bin, global event, and test time period. We compare these spatial and temporal characteristics of the false positives to the characteristics of the apparent triggering in the real catalog.</p><p>In the synthetic Poissonian catalog, the seismicity is independent and uncorrelated; therefore, any identified seismicity rate anomaly is random rather than a true cluster, causing false identifications of dynamic triggering. In the synthetic ETAS catalog, there are temporal seismicity clusters from the simulated mainshock-aftershock sequences, some of which would be identified with 95% confidence as seismicity rate anomalies when the time windows of interest overlap with these clusters. In this sense, false identification in dynamic triggering can result from either misidentifying seismicity rate anomalies or misattributing temporal correlations.</p><p>For the shifted catalog tests, the local earthquake catalog (QTM) has intrinsic variability and contains seismicity clusters, including mainshock-aftershocks, swarms, and triggered sequences due to external forcings. Similar to the ETAS clusters, some of these clusters would be identified as seismicity rate anomalies at a 95% confidence level using the procedure proposed in DeSalvio and <ref type="bibr">Fan (2023)</ref>. If such a seismicity rate anomaly identification coincides with a selected test time window (i.e., 2-24 hr after a global earthquake), a dynamic triggering case is Journal of Geophysical Research: Solid Earth 10.1029/2025JB031566 inferred in the region. However, if the seismicity rate anomaly was not due to the passing waves but caused by intrinsic variability, this coincidence would lead to a false positive identification. In this case, the false positive results from the temporal-correlation premise mentioned above. As the seismicity rate anomaly may appear the same for intrinsically clustered and triggered seismicity, the likelihood of a false positive due to the temporalcorrelation premise depends on the intrinsic variability of local seismicity. In other words, for a region with frequent swarms or correlated seismicity, there is a higher chance of identifying seismicity rate anomalies for randomly selected time windows, and it would be more challenging to identify the true triggering versus false positives.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">False Positive Rate</head><p>Using our new synthetic catalogs and the same procedure proposed in DeSalvio and Fan (2023), we find false positive rates for dynamic triggering (Table <ref type="table">1</ref>) larger than those reported in DeSalvio and <ref type="bibr">Fan (2023)</ref>. For tests with the Poisson catalog, the overall false positive rate for all test time periods T a is &#8764;1.5%-2.5%. For tests with the ETAS catalog, which contains simulated earthquake clustering, the overall rate of seismicity-rate anomalies temporally correlated with global earthquakes is &#8764;5%-8%, which we consider representative of the false positive rate. The shifted catalogs, which contain all the true temporal clustering characteristics of the QTM catalog, as well as the correlation between adjacent spatial bins, has an overall rate of seismicity-rate anomalies temporally correlated with global earthquakes of &#8764;3.5%-8.5%, which we take as the false positive rate for the following discussion. Therefore, when realistic clustering is included, and the individual bins are considered individually, the overall false positive rate approximates a 5% rate, meeting the expectation for a test at 95% confidence.</p><p>For short test time periods, however, the false positive rate of the suite of tests can greatly exceed 5%. False positives for the &#946;-and Z-statistics for T a = 2 hr range from &#8764;9% for the Poisson catalogs to &#8764;20% for the ETAS and shifted catalogs. Similarly, the &#946; m -and Z m -statistics for T a = 2 hr exhibit false positives of around 12% for the ETAS catalog. Therefore, the false positive rate is large enough to explain all of the apparent triggering in the real Southern California catalog, without requiring any true dynamic triggering.</p><p>While the rate of apparent triggering in the shifted synthetic catalogs is similar to that of the real catalog, it is not the result of sampling the same earthquake clusters. For the earthquakes found to be apparently triggered in the shifted catalog tests, typically &#8764;25% were also identified as triggered in the real catalog. We repeat the false positive calculations for the shifted catalogs, this time removing all combinations of global candidate events and spatial bins in which apparent triggering was identified in the real and shifted catalogs in overlapping test time periods. The resulting false positive rates slightly decrease, with an overall false positive rate of &#8764;2.5%-6.5% (Table <ref type="table">2</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Spatial-Temporal Characteristics of Apparent Triggering</head><p>When only considering the seismicity occurrence, intrinsically clustered and triggered seismicity may yield similar seismicity rate anomalies. True dynamic triggering may be distinguishable from false positives through spatial-temporal characteristics other than the total observed rate of occurrence (e.g., <ref type="bibr">Aiken et al., 2016</ref><ref type="bibr">Aiken et al., , 2018;;</ref><ref type="bibr">Pankow &amp; Kilb, 2020)</ref>. In this section, we compare the spatial-temporal characteristics of the apparent triggering in the real catalog with that of the six shifted synthetic catalogs (in which the global ISC catalog was shifted relative to the local QTM catalog). The shifted catalogs retain the spatial-temporal earthquake rate variability and clustering of the QTM catalog, and the correlated behavior of overlapping spatial bins, and so reflect any characteristics of false positives that result from those properties.</p><p>The spatial distribution of apparent triggering identified in the shifted catalog tests (Figure <ref type="figure">2</ref>) closely resembles that of the real catalog shown in Figure <ref type="figure">5</ref> of <ref type="bibr">DeSalvio and Fan (2023)</ref>. Spatial bins along the San Jacinto, southern Elsinore, and southern San Andreas Faults, as well as the Coso geothermal field, exhibit the most triggering occurrences, up to &#8764;70 in the most active regions when the &#946;-statistic is considered. In contrast, many bins in the western half of the study area exhibit &lt;10 triggering occurrences. DeSalvio and <ref type="bibr">Fan (2023)</ref> found that the number of triggering observations for a bin roughly correlates with the earthquake rate in that bin, which may explain the similar spatial patterns.</p><p>The number of bins apparently triggered per global event in the shifted catalogs (Figure <ref type="figure">3</ref>) has similar characteristics to that of the real catalog shown in Figure <ref type="figure">13</ref> of <ref type="bibr">DeSalvio and Fan (2023)</ref>. For both the real and shifted catalogs, &#8764;70% of global events exhibit some apparent triggering when using the &#946;-statistic (Figure <ref type="figure">4a</ref>). Most of these global events are associated with apparent triggering in only a few (&lt;10) bins. A few global events have more apparent triggers, up to &#8764;15. In the 10-years period analyzed, the only substantial difference is the &#8764;30 triggered bins following the 2010 M7.2 El Mayor-Cucapah, Mexico, earthquake in the real catalog. Widespread triggering in Southern California from the El Mayor-Cucapah earthquake has also been observed in earlier studies (e.g., <ref type="bibr">Hauksson et al., 2011;</ref><ref type="bibr">Meng &amp; Peng, 2014)</ref>.</p><p>The distribution of the number of apparent triggering occurrences per global candidate triggering event is very similar between the real and shifted catalogs (Figures <ref type="figure">4a</ref> and <ref type="figure">4d</ref>), as is the distribution of the number of apparent triggering occurrences per bin (Figures <ref type="figure">4b</ref> and <ref type="figure">4e</ref>), and the number of apparent triggering occurrences for each of the test time period durations (Figures <ref type="figure">4c</ref> and <ref type="figure">4f</ref>). This is the case for both the &#946;-statistic, which has the highest rate of false positives, and the Z m -statistic, which has the lowest rate. Therefore, aside from the nearby El Mayor-Cucapah, Mexico, earthquake, the reported triggering in the real Southern California catalog appears to lack any general spatial-temporal characteristics that would distinguish it from random false positives. Journal of Geophysical Research: Solid Earth </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion</head><p>False positive rates depend on both the statistical test used and on the characteristics of the catalog studied. For the QTM catalog and the bin configuration used here, we find that the modified Z-statistic has a somewhat lower false positive rate than the modified &#946;-statistic. The moment-based &#946; m -and Z m -statistics have substantially lower false positive rates than the corresponding &#946;-and Z-statistics for short test time windows (T a = 2 hr, 6 hr). The false positive rates for the &#946; m -and Z m -statistics are also lower for the shifted catalogs, which are the only synthetic catalogs that may be incomplete at low magnitudes, suggesting that considering moment rate rather than earthquake rate better accounts for possible incompleteness in real catalog data. Catalogs with fewer earthquakes during the test time windows generally have higher false positive rates, for example, the higher rate for shorter test time windows, and the higher rate when analyzing individual spatial bins compared to analyzing the entire catalog together. This is likely a consequence of allowing statistics to be computed for a relatively small sample size (N a &gt; 3 and N b &#8805; 10). Catalogs with more earthquake clustering also have a higher false positive rate, for example, the higher rates for the ETAS and shifted catalogs compared to the Poisson catalog. This can also be seen by performing additional ETAS simulations with varying productivity, where an increase in productivity and branching ratio (more clustering) leads to an increase in false positive rate (Table <ref type="table">3</ref>). Seismicity clustering has similarly been shown to increase false positives in the identification of other phenomena, such as proposed earthquake precursors (e.g., <ref type="bibr">Hardebeck et al., 2008;</ref><ref type="bibr">Michael, 1997)</ref>.</p><p>Our results imply that best practice is to estimate the false positive rate for each dynamic triggering study based on the characteristics of the catalog used and the study design. True detections could be inferred if the number of detections exceeds the expected number of false positives. For case studies, best practice would be to apply the same visual inspections consistently to an extended period beyond the selected triggering windows for falsepositive evaluations, as was done in <ref type="bibr">Pankow and Kilb (2020)</ref>.</p><p>For the catalog and study design of DeSalvio and <ref type="bibr">Fan (2023)</ref>, the false positive rate is about 3.5%-8.5%. This is not a very high false positive rate, and consistent with the 95% confidence usually used in seismological studies.</p><p>However, this false positive rate implies that 1 out of approximately every 20 tests will return a positive Journal of Geophysical Research: Solid Earth 10.1029/2025JB031566</p><p>identification in seismicity rate anomaly, likely representing a false positive in dynamic triggering. For each candidate triggering event, 185 spatial bins and 4 test time periods are considered. On average only &#8764;6.4% of these combinations have enough data for a statistical test, meaning that on average &#8764;47 tests are performed per candidate global event. If each test were independent (no spatial or temporal overlap), we could use the binomial distribution to determine that the probability of any global candidate earthquake randomly appearing to trigger at least once would be &#8764;91%. The observed apparent triggering by &#8764;70% of global events is therefore not surprisingly high. It is likely less than 91% because of the non-independence of the tests as well as spatially variable seismicity rates.</p><p>When tens of thousands of spatial-temporal windows are tested, even a low false positive rate can lead to thousands of false positives. The problem of large numbers of false positives will likely only worsen in the future, as it becomes possible to test larger numbers of spatial-temporal bins on increasingly larger earthquake catalogs.</p><p>Testing more combinations of candidate triggering events and target grid cells can increase the number of erroneous statistical identifications of a seismicity rate change, and increase the opportunity to mistake intrinsic rate changes for dynamically triggered changes. Additionally, new enhanced seismicity catalogs may reveal additional intrinsic rate complexity.</p><p>The statistically identified seismicity-rate anomalies can be further evaluated under additional empirical assumptions and requirements to reduce positive cases, thereby suppressing false positives. Given a lack of wellagreed physical models for dynamic triggering, these additional assumptions might be conditional on specific regions of interest or reflect a cautious preference. For example, <ref type="bibr">Pankow and Kilb (2020)</ref> consider an earthquake rate increase to be significant only if it is in the highest 1% or 5% of earthquake rates in the full duration of the catalog for that location. This is analogous to expanding the resampling done by DeSalvio and Fan (2023) beyond the 30-to 60-day reference time period to the entire duration of the catalog at the given grid cell. This requirement implies an assumption that true dynamic triggering cases would cause extreme changes in seismicity rates over a large region. Consequently, this requirement will lead to fewer identifications, and may also yield a higher false negative rate. Even with these requirements, however, <ref type="bibr">Pankow and Kilb (2020)</ref> found on inspection that few of the statistically significant rate increases were unambiguous cases of a clear step increase at the time of the global earthquake, and concluded that statistically significant rate changes alone are inadequate to demonstrate dynamic triggering. <ref type="bibr">Pankow and Kilb (2020)</ref> find no dynamic triggering in the Anza region of the San Jacinto Fault in Southern California from 1985 to 2000 (while the magnitude of completeness of their catalog is relatively high) and very little during 2000-2017. Only four global earthquakes have triggering significant at 95% confidence and satisfy their holistic compatibility assessment <ref type="bibr">(2001</ref><ref type="bibr">M7.0 Guam, 2005</ref><ref type="bibr">M7.2 Nicobar Islands, 2010</ref><ref type="bibr">M7.2 El Mayor-Cucapah, and 2013 M8.3 Sea of Okhotsk)</ref>, in contrast with the &#8764;40 triggers for spatial bins in the Anza region reported in DeSalvio and <ref type="bibr">Fan (2023)</ref>.</p><p>Another approach to study triggered earthquakes is to stack observations to enhance any signals and reduce the random noise from false positives, albeit losing resolution in identifying individual triggering cases. <ref type="bibr">Miyazawa et al. (2021)</ref> also studied the QTM catalog to investigate dynamic triggering in Southern California from 2008 to 2017. They stack results of triggering intensity to identify triggered local earthquakes in the region. This approach focuses on the populational behavior of the triggered earthquakes, such as a slower decaying rate for these earthquakes. Additionally, triggering behaviors can be compared with respect to the peak ground velocity (PGV), as a proxy for the peak strain change from the candidate global earthquakes. These ground motion measurements can serve as additional constraints. For example, only considering candidate earthquakes that have Using PGV as a proxy for strain does not account for variations in phase velocity or other focusing and defocusing of the strain wavefield. Where strainmeter data are available, the measured peak dynamic strain can be used directly <ref type="bibr">(DeSalvio et al., 2025)</ref>.</p><p>The false positive rates reported in this study suggests that the dynamic triggering reported by DeSalvio and <ref type="bibr">Fan (2023)</ref> likely consists partially to almost entirely of false positives, which includes many cases reported in previous case studies <ref type="bibr">(Aiken &amp; Peng, 2014;</ref><ref type="bibr">Alfaro-Diaz et al., 2020;</ref><ref type="bibr">Li et al., 2019)</ref>. Aside from the nearby El Mayor-Cucapah, Mexico, earthquake, the reported triggering in the real Southern California catalog appears to lack any general spatial-temporal characteristics that would distinguish it from random false positives. However, we also cannot exclude the reported cases in DeSalvio and Fan (2023) being true dynamic triggering without independent geophysical observations other than earthquake catalogs. We have not visually inspected each statistically identified case to establish whether or not a clear temporal correlation with the global earthquake is present during the passing seismic waves.</p><p>In examining the temporal-correlation premise that underlines almost all dynamic triggering studies, we have shown that a statistically temporal correlation of an apparent seismicity rate increase with a global candidate triggering event is not sufficient establish causal connection. False positives may occur either due to erroneous statistical identification of a seismicity rate change, or mistaking intrinsic seismicity rate complexity for dynamic triggering. However, the larger, more difficult problem how to establish causality in dynamic triggering studies remains.</p><p>Independent geophysical observations such as deformation anomalies could address the problem of erroneous identification of rate changes, by providing evidence that the rate changes reflect real anomalous behavior. However, this does not resolve the problem of whether the real seismicity and deformation changes are dynamically triggered, or are local phenomena that occurred coincidently following a teleseismic event.</p><p>To address the problem of coincidence, one could reduce the probability that the observations could occur by chance. e.g., a seismicity rate change over a large area (e.g., <ref type="bibr">Hauksson et al., 2011)</ref> is less likely occur by chance. If there is a small probability (p) a seismicity anomaly randomly occurring at a single then there is a very small probability (p N ) that an anomaly would occur randomly at many (N) locations at the same time. <ref type="bibr">Pankow and Kilb (2020)</ref> also advised that apparent triggering over a large area makes the case for dynamic triggering more convincing, and found that triggering with greater statistical significance often does trigger larger spatial areas. Similarly, because unrelated seismicity rate changes would have random amplitude with respect to the dynamic strains, a correlation of rate change with strain from the teleseismic event (e.g., van der Elst &amp; Brodsky, 2010) would have a very small probability of happening by chance. However, emphasizing a coherent seismicity rate change over a large region could lead to a high false negative rate, as spatially localized true triggering cases would be disqualified by the evaluation. We note that any empirical, predefined selection procedure will impact the identification of seismicity rate anomalies, compromising the fidelity in different ways.</p><p>Here we considered false positives in seismicity catalogs where delayed dynamic triggering may occur up to 24 hr after a large global earthquake. Other dynamic triggering studies focus on detecting immediately triggered events within the surface wave train (e.g., <ref type="bibr">Hill et al., 1993;</ref><ref type="bibr">Miyazawa &amp; Mori, 2006;</ref><ref type="bibr">West et al., 2005)</ref>. Such studies also rely on temporal correlation to establish dynamic triggering relationships, and also may experience false positives. Best practice would be to apply the same event detection techniques to a long time period to establish the reference seismicity rate and typical variability, and to estimate false positive rates using synthetic catalogs tailored to the specific experiment. We found for catalog-based studies that short test time intervals can have a higher rate of false positives due to the smaller number of events. Waveform-based studies usually have short test time periods, although this may be balanced by higher rates of detected small events. Waveform-based methods additionally have the advantage that they can relate the precise times of the detected events to the characteristics of the passing surface waves, albeit locations of these detected events are often unresolved. There is a very small probability that unrelated seismicity would occur, e.g., in synchronization with the peak extension of the Rayleigh waves as has been observed (e.g., <ref type="bibr">Miyazawa &amp; Mori, 2006;</ref><ref type="bibr">West et al., 2005)</ref>.</p><p>Journal of Geophysical Research: Solid Earth 10.1029/2025JB031566</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>Dynamic earthquake triggering is commonly identified through the temporal correlation between increased seismicity rates and global earthquakes. Correlation does not imply causation, and false positives (dynamic triggering identified when none occurred, e.g. mistaking unrelated earthquake rate changes for triggering) may occur. We estimate the rate of false positives in Southern California, with global M &#8805; 6 earthquakes as candidate triggers. We apply the same statistical tests as DeSalvio and <ref type="bibr">Fan (2023)</ref> to synthetic earthquake catalogs with no true dynamic triggering. The rate of apparent triggering gives an estimate of the false positive rate. We find that the false positive rate varies for different statistical tests, and depends on characteristics of the catalog such as the rate of earthquakes and the extent of clustering. This suggests that best practice is to estimate the false positive rate for each study using synthetic catalogs with similar characteristics.</p><p>For the Southern California catalog, we find the false positive rate to be about 3.5%-8.5%, consistent with the 95% confidence often used in seismology. However, when large numbers of spatial-temporal windows are tested, many false positives are expected. This rate of false positives can the reported triggering in the real catalog <ref type="bibr">(DeSalvio &amp; Fan, 2023)</ref>, including the observation that &#8764;70% of global M &#8805; 6 earthquakes are associated with apparent triggering. The spatial-temporal distribution of triggering in the most realistic synthetic catalogs is indistinguishable from that of the real catalog, implying that all of the apparent triggering of the real catalog could be explained by false positives. The only exception is the nearby 2010 M7.2 El Mayor-Cucapah, Mexico, earthquake, which is known to have triggered earthquakes across much of Southern California (e.g., <ref type="bibr">Hauksson et al., 2011)</ref>. Therefore, the apparent dynamic triggering in Southern California might be largely false positives.</p><p>The problem of false positives requires careful evaluations when using statistical identification as larger earthquake catalogs make it possible to test larger numbers of spatial-temporal bins. When accurate physical insights are available, they can be imposed as additional constraints to further select the identified seismicity rate anomalies as triggering cases. To argue for causation will require going beyond individual temporal correlations between seismicity rate changes and candidate triggering events. To infer the triggering process, one can focus on the extreme cases, e.g., by studying correlations that exhibit larger patterns (e.g., seismicity rate changes over a large area, correlation between seismicity rate change and dynamic strain), which may have a low probability of occurring by chance.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>21699356, 2025, 7, Downloaded from https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2025JB031566 by University Of California, Wiley Online Library on [20/11/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License</p></note>
		</body>
		</text>
</TEI>
