Given N geo-located point instances (e.g., crime or disease cases) in a spatial domain, we aim to detect sub-regions (i.e., hotspots) that have a higher probability density of generating such instances than the others. Hotspot detection has been widely used in a variety of important urban applications, including public safety, public health, urban planning, equity, etc. The problem is challenging because its societal applications often have low-tolerance for false positives, and require significance testing which is computationally intensive. In related work, the spatial scan statistic introduced a likelihood ratio based framework for hotspot evaluation and significance testing. However, it fails to consider the effect of spatial nondeterminism, causing many missing detections. Our previous work introduced a nondeterministic normalization based scan statistic to mitigate this issue. However, its robustness against false positives is not stably controlled. To address these limitations, we propose a unified framework which can improve the completeness of results without incurring more false positives. We also propose a reduction algorithm to improve the computational efficiency. Experiment results confirm that the unified framework can greatly improve the recall of hotspot detection without increasing the number of false positives, and the reduction algorithm can greatly reduce execution time.
more »
« less
Automatic identification and quantification of volcanic hotspots in Alaska using HotLINK: the hotspot learning and identification network
An increase in volcanic thermal emissions can indicate subsurface and surface processes that precede, or coincide with, volcanic eruptions. Space-borne infrared sensors can detect hotspots—defined here as localized volcanic thermal emissions—in near-real-time. However, automatic hotspot detection systems are needed to efficiently analyze the large quantities of data produced. While hotspots have been automatically detected for over 20 years with simple thresholding algorithms, new computer vision technologies, such as convolutional neural networks (CNNs), can enable improved detection capabilities. Here we introduce HotLINK: the Hotspot Learning and Identification Network, a CNN trained to detect hotspots with a dataset of −3,800 satellite-based, Visible Infrared Imaging Radiometer Suite (VIIRS) images from Mount Veniaminof and Mount Cleveland volcanoes, Alaska. We find that our model achieves an accuracy of 96% (F1-score 0.92) when evaluated on −1,700 unseen images from the same volcanoes, and 95% (F1-score 0.67) when evaluated on −3,000 images from six additional Alaska volcanoes (Augustine Volcano, Bogoslof Island, Okmok Caldera, Pavlof Volcano, Redoubt Volcano, Shishaldin Volcano). In comparison with an existing threshold-based hotspot detection algorithm, MIROVA (Coppola et al., Geological Society, London, Special Publications, 2016, 426, 181–205), our model detects 22% more hotspots and produces 12% fewer false positives. Additional testing on −700 labeled Moderate Resolution Imaging Spectroradiometer (MODIS) images from Mount Veniaminof demonstrates that our model is applicable to this sensor’s data as well, achieving an accuracy of 98% (F1-score 0.95). We apply HotLINK to 10 years of VIIRS data and 22 years of MODIS data for the eight aforementioned Alaska volcanoes and calculate the radiative power of detected hotspots. From these time series we find that HotLINK accurately characterizes background and eruptive periods, similar to MIROVA, but also detects more subtle warming signals, potentially related to volcanic unrest. We identify three advantages to our model over its predecessors: 1) the ability to detect more subtle volcanic hotspots and produce fewer false positives, especially in daytime images; 2) probabilistic predictions provide a measure of detection confidence; and 3) its transferability, i.e., the successful application to multiple sensors and multiple volcanoes without the need for threshold tuning, suggesting the potential for global application.
more »
« less
- Award ID(s):
- 1855126
- PAR ID:
- 10518543
- Publisher / Repository:
- Frontiers in Earth Science
- Date Published:
- Journal Name:
- Frontiers in Earth Science
- Volume:
- 12
- ISSN:
- 2296-6463
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
null (Ed.)The Cook-Austral volcanic lineament extends from Macdonald Seamount (east) to Aitutaki Island (west) in the South Pacific Ocean and consists of hotspot-related volcanic islands, seamounts, and atolls. The Cook-Austral volcanic lineament has been characterized as multiple overlapping, age-progressive hotspot tracks generated by at least two mantle plumes, including the Arago and Macdonald plumes, which have fed volcano construction for ~20 m.y. The Arago and Macdonald hotspot tracks are argued to have been active for at least 70 m.y. and to extend northwest of the Cook-Austral volcanic lineament into the Cretaceous-aged Tuvalu-Gilbert and Tokelau Island chains, respectively. Large gaps in sampling exist along the predicted hotspot tracks, complicating efforts seeking to show that the Arago and Macdonald hotspots have been continuous, long-lived sources of hotspot volcanism back into the Cretaceous. We present new major- and trace-element concentrations and radiogenic isotopes for three seamounts (Moki, Malulu, Dino) and one atoll (Rose), and new clinopyroxene 40Ar/39Ar ages for Rose (24.81 ± 1.02 Ma) and Moki (44.53 ± 10.05 Ma). All volcanoes are located in the poorly sampled region between the younger Cook-Austral and the older, Cretaceous portions of the Arago and Macdonald hotspot tracks. Absolute plate motion modeling indicates that the Rose and Moki volcanoes lie on or near the reconstructed traces of the Arago and Macdonald hotspots, respectively, and the 40Ar/39Ar ages for Rose and Moki align with the predicted age progression for the Arago (Rose) and Macdonald (Moki) hotspots, thereby linking the younger Cook-Austral and older Cretaceous portions of the long-lived (>70 m.y.) Arago and Macdonald hotspot tracks.more » « less
-
Objectives: This study introduces MetaBIDx, a computational method designed to enhance species prediction in metagenomic environments. The method addresses the challenge of accurate species identification in complex microbiomes, which is due to the large number of generated reads and the ever-expanding number of bacterial genomes. Bacterial identification is essential for disease diagnosis and tracing outbreaks associated with microbial infections. Methods: MetaBIDx utilizes a modified Bloom filter for efficient indexing of reference genomes and incorporates a novel strategy for reducing false positives by clustering species based on their genomic coverages by identified reads. The approach was evaluated and compared with several well-established tools across various datasets. Precision, recall, and F1-score were used to quantify the accuracy of species prediction. Results: MetaBIDx demonstrated superior performance compared to other tools, especially in terms of precision and F1-score. The application of clustering based on approximate coverages significantly improved precision in species identification, effectively minimizing false positives. We further demonstrated that other methods can also benefit from our approach to removing false positives by clustering species based on approximate coverages. Conclusion: With a novel approach to reducing false positives and the effective use of a modified Bloom filter to index species, MetaBIDx represents an advancement in metagenomic analysis. The findings suggest that the proposed approach could also benefit other metagenomic tools, indicating its potential for broader application in the field. The study lays the groundwork for future improvements in computational efficiency and the expansion of microbial databases.more » « less
-
GitGuardian monitored secrets exposure in public GitHub repositories and reported that developers leaked over 12 million secrets (database and other credentials) in 2023, indicating a 113% surge from 2021. Despite the availability of secret detection tools, developers ignore the tools' reported warnings because of false positives (25%−99%). However, each secret protects assets of different values accessible through asset identifiers (a DNS name and a public or private IP address). The asset information for a secret can aid developers in filtering false positives and prioritizing secret removal from the source code. However, existing secret detection tools do not provide the asset information, thus presenting difficulty to developers in filtering secrets only by looking at the secret value or finding the assets manually for each reported secret. The goal of our study is to aid software practitioners in prioritizing secrets removal by providing the assets information protected by the secrets through our novel static analysis tool. We present AssetHarvester, a static analysis tool to detect secret-asset pairs in a repository. Since the location of the asset can be distant from where the secret is defined, we investigated secret-asset co-location patterns and found four patterns. To identify the secret-asset pairs of the four patterns, we utilized three approaches (pattern matching, data flow analysis, and fast-approximation heuristics). We curated a benchmark of 1,791 secret-asset pairs of four database types extracted from 188 public GitHub repositories to evaluate the performance of AssetHarvester. AssetHarvester demonstrates precision of (97%), recall (90 %), and F1-score (94 %) in detecting secret-asset pairs. Our findings indicate that data flow analysis employed in AssetHarvester detects secret-asset pairs with 0 % false positives and aids in improving the recall of secret detection tools. Additionally, AssetHarvester shows 43 % increase in precision for database secret detection compared to existing detection tools through the detection of assets, thus reducing developer's alert fatigue.more » « less
-
Abstract Since the 1919 foundation of the International Association of Volcanology and Chemistry of the Earth’s Interior (IAVCEI), the fields of volcano seismology and acoustics have seen dramatic advances in instrumentation and techniques, and have undergone paradigm shifts in the understanding of volcanic seismo-acoustic source processes and internal volcanic structure. Some early twentieth-century volcanological studies gave equal emphasis to barograph (infrasound and acoustic-gravity wave) and seismograph observations, but volcano seismology rapidly outpaced volcano acoustics and became the standard geophysical volcano-monitoring tool. Permanent seismic networks were established on volcanoes (for example) in Japan, the Philippines, Russia, and Hawai‘i by the 1950s, and in Alaska by the 1970s. Large eruptions with societal consequences generally catalyzed the implementation of new seismic instrumentation and led to operationalization of research methodologies. Seismic data now form the backbone of most local ground-based volcano monitoring networks worldwide and play a critical role in understanding how volcanoes work. The computer revolution enabled increasingly sophisticated data processing and source modeling, and facilitated the transition to continuous digital waveform recording by about the 1990s. In the 1970s and 1980s, quantitative models emerged for long-period (LP) event and tremor sources in fluid-driven cracks and conduits. Beginning in the 1970s, early models for volcano-tectonic (VT) earthquake swarms invoking crack tip stresses expanded to involve stress transfer into the wall rocks of pressurized dikes. The first deployments of broadband seismic instrumentation and infrasound sensors on volcanoes in the 1990s led to discoveries of new signals and phenomena. Rapid advances in infrasound technology; signal processing, analysis, and inversion; and atmospheric propagation modeling have now established the role of regional (15–250 km) and remote (> 250 km) ground-based acoustic systems in volcano monitoring. Long-term records of volcano-seismic unrest through full eruptive cycles are providing insight into magma transport and eruption processes and increasingly sophisticated forecasts. Laboratory and numerical experiments are elucidating seismo-acoustic source processes in volcanic fluid systems, and are observationally constrained by increasingly dense geophysical field deployments taking advantage of low-power, compact broadband, and nodal technologies. In recent years, the fields of volcano geodesy, seismology, and acoustics (both atmospheric infrasound and ocean hydroacoustics) are increasingly merging. Despite vast progress over the past century, major questions remain regarding source processes, patterns of volcano-seismic unrest, internal volcanic structure, and the relationship between seismic unrest and volcanic processes.more » « less
An official website of the United States government

