skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: LinkingLines: Using the Hough Transform to Cluster LineSegments and Mesoscale Feature Extraction
Linear feature analysis plays a fundamental role in geospatial applications, from detecting infrastructure networks to characterizing geological formations. In this paper, we introduce linkinglines, an open-source Python package tailored for the clustering and feature extrac- tion of linear structures in geospatial data. Our package leverages the Hough Transform, commonly used in image processing, performs clustering of line segments in the Hough Space, and then provides unique feature extraction methods and visualization. linkinglines em- powers researchers, data scientists, and analysts across diverse domains to efficiently process, understand, and extract valuable insights from linear features, contributing to more informed decision-making and enhanced data-driven exploration. We have used linkinglines to map dike swarms with thousands of segments associated with Large Igneous Provinces in Kubo Hutchison et al. (2023).  more » « less
Award ID(s):
1848554
PAR ID:
10515278
Author(s) / Creator(s):
; ;
Publisher / Repository:
Journal of Open Source Software
Date Published:
Journal Name:
Journal of Open Source Software
Volume:
9
Issue:
98
ISSN:
2475-9066
Page Range / eLocation ID:
6147
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    This paper introduces a new GeoAI solution to support automated mapping of global craters on the Mars surface. Traditional crater detection algorithms suffer from the limitation of working only in a semiautomated or multi-stage manner, and most were developed to handle a specific dataset in a small subarea of Mars’ surface, hindering their transferability for global crater detection. As an alternative, we propose a GeoAI solution based on deep learning to tackle this problem effectively. Three innovative features are integrated into our object detection pipeline: (1) a feature pyramid network is leveraged to generate feature maps with rich semantics across multiple object scales; (2) prior geospatial knowledge based on the Hough transform is integrated to enable more accurate localization of potential craters; and (3) a scale-aware classifier is adopted to increase the prediction accuracy of both large and small crater instances. The results show that the proposed strategies bring a significant increase in crater detection performance than the popular Faster R-CNN model. The integration of geospatial domain knowledge into the data-driven analytics moves GeoAI research up to the next level to enable knowledge-driven GeoAI. This research can be applied to a wide variety of object detection and image analysis tasks. 
    more » « less
  2. Dike swarms are ubiquitous on terrestrial planets and represent the frozen remnants of magma transport networks. However, spatial complexity, protracted emplacement history, and uneven surface exposure typically make it difficult to quantify patterns in dike swarms on different scales. In this study, we address this challenge using the Hough transform (HT) to objectively link dissected dike segments and analyze multiscale spatial structure in dike swarms. We apply this method to swarms of three scales: the Spanish Peaks, USA; the Columbia River Flood Basalt Group (CRBG), USA; the Deccan Traps Flood Basalts, India. First, we cluster dike segments in HT space, recognizing prevalent linearly aligned structures that represent single dikes or dike packets, with lengths up to 10 −  30x the mapped mean segment length. Second, we identify colinear and radial dike segment mesoscale structures within each data set, using the HT to segment swarms into constituent spatial patterns. We show that for both the CRBG and Deccan Traps, a single radial or circumferential swarm does not well characterize the data. Instead, multiple and sometimes overlapping mesoscale linear and radial features are prevalent suggesting a complex history of crustal stresses. The HT can provide useful insights in a variety of geologic settings where many quasi‐linear features, at any scale, are superimposed spatially. 
    more » « less
  3. This data package includes an ArcMap geodatabase for the Chihuahuan Desert Rangeland Research Center (CDRRC) pastures 1, 4, 14, and 15: one polygon feature class, one point feature class, associated attribute tables and metadata. The spatial data, CDRRC1_4_14_15_StateMap_v1.gdb.zip, represents the ecological sites and states on Pastures 1, 4, 14 and 15 on the Chihuahuan Desert Rangeland Research Center, and includes field traverse data. CDRRC1_4_14_15_StateMapMetadata.pdf and TraversePointsMetadata.pdf contain the geospatial metadata provided by ArcMap. CDRRC1_4_14_15_StateMap_v1.csv is the attribute table associated with the state map’s polygon feature class, and TraversePoints.xlsx is the attribute table associated with the traverse points feature class and includes a sheet containing detailed attribute metadata. 
    more » « less
  4. Sun, Xiaoyong (Ed.)
    Convolutional neural network (CNN)-based deep learning (DL) methods have transformed the analysis of geospatial, Earth observation, and geophysical data due to their ability to model spatial context information at multiple scales. Such methods are especially applicable to pixel-level classification or semantic segmentation tasks. A variety of R packages have been developed for processing and analyzing geospatial data. However, there are currently no packages available for implementing geospatial DL in the R language and data science environment. This paper introduces the geodl R package, which supports pixel-level classification applied to a wide range of geospatial or Earth science data that can be represented as multidimensional arrays where each channel or band holds a predictor variable. geodl is built on the torch package, which supports the implementation of DL using the R and C++ languages without the need for installing a Python/PyTorch environment. This greatly simplifies the software environment needed to implement DL in R. Using geodl, geospatial raster-based data with varying numbers of bands, spatial resolutions, and coordinate reference systems are read and processed using the terra package, which makes use of C++ and allows for processing raster grids that are too large to fit into memory. Training loops are implemented with the luz package. The geodl package provides utility functions for creating raster masks or labels from vector-based geospatial data and image chips and associated masks from larger files and extents. It also defines a torch dataset subclass for geospatial data for use with torch dataloaders. UNet-based models are provided with a variety of optional ancillary modules or modifications. Common assessment metrics (i.e., overall accuracy, class-level recalls or producer’s accuracies, class-level precisions or user’s accuracies, and class-level F1-scores) are implemented along with a modified version of the unified focal loss framework, which allows for defining a variety of loss metrics using one consistent implementation and set of hyperparameters. Users can assess models using standard geospatial and remote sensing metrics and methods and use trained models to predict to large spatial extents. This paper introduces the geodl workflow, design philosophy, and goals for future development. 
    more » « less
  5. ABSTRACT Recent breakthroughs in spatially resolved transcriptomics (SRT) technologies have enabled comprehensive molecular characterization at the spot or cellular level while preserving spatial information. Cells are the fundamental building blocks of tissues, organized into distinct yet connected components. Although many non-spatial and spatial clustering approaches have been used to partition the entire region into mutually exclusive spatial domains based on the SRT high-dimensional molecular profile, most require an ad hoc selection of less interpretable dimensional-reduction techniques. To overcome this challenge, we propose a zero-inflated negative binomial mixture model to cluster spots or cells based on their molecular profiles. To increase interpretability, we employ a feature selection mechanism to provide a low-dimensional summary of the SRT molecular profile in terms of discriminating genes that shed light on the clustering result. We further incorporate the SRT geospatial profile via a Markov random field prior. We demonstrate how this joint modeling strategy improves clustering accuracy, compared with alternative state-of-the-art approaches, through simulation studies and 3 real data applications. 
    more » « less