skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Deep Spatial Prediction via Heterogeneous Multi-Source Self-Supervision
Spatial prediction is to predict the values of the targeted variable, such as PM2.5 values and temperature, at arbitrary locations based on the collected geospatial data. It greatly affects the key research topics in geoscience in terms of obtaining heterogeneous spatial information (e.g., soil conditions, precipitation rates, wheat yields) for geographic modeling and decision-making at local, regional, and global scales. In-situ data, collected by ground-level in-situ sensors, and remote sensing data, collected by satellite or aircraft, are two important data sources for this task. In-situ data are relatively accurate while sparse and unevenly distributed. Remote sensing data cover large spatial areas but are coarse with low spatiotemporal resolution and prone to interference. How to synergize the complementary strength of these two data types is still a grand challenge. Moreover, it is difficult to model the unknown spatial predictive mapping while handling the trade-off between spatial autocorrelation and heterogeneity. Third, representing spatial relations without substantial information loss is also a critical issue. To address these challenges, we propose a novel Heterogeneous Self-supervised Spatial Prediction (HSSP) framework that synergizes multi-source data by minimizing the inconsistency between in-situ and remote sensing observations. We propose a new deep geometric spatial interpolation model as the prediction backbone that automatically interpolates the values of the targeted variable at unknown locations based on existing observations by taking into account both distance and orientation information. Our proposed interpolator is proven to both be the general form of popular interpolation methods and preserve spatial information. The spatial prediction is enhanced by a novel error-compensation framework to capture the prediction inconsistency due to spatial heterogeneity. Extensive experiments have been conducted on real-world datasets and demonstrated our model’s superiority in performance over state-of-the-art models.  more » « less
Award ID(s):
2113350 2318831 2103592 1907805 1942594
PAR ID:
10434550
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
ACM Transactions on Spatial Algorithms and Systems
ISSN:
2374-0353
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The movement of animals is strongly influenced by external factors in their surrounding environment such as weather, habitat types, and human land use. With advances in positioning and sensor technologies, it is now possible to capture animal locations at high spatial and temporal granularities. Likewise, modern space-based remote sensing technology provides us with an increasing access to large volumes of environmental data, some of which changes on an hourly basis. Environmental data are heterogeneous in source and format, and are usually obtained at different scales and granularities than movement data. Indeed, there remain scientific and technical challenges in developing linkages between the growing collections of animal movement data and the large repositories of heterogeneous remote sensing observations, as well as in the developments of new statistical and computational methods for the analysis of movement in its environmental context. These challenges include retrieval, indexing, efficient storage, data integration, and analytic techniques. We have developed a new system - the Environmental-Data Automated Track Annotation (Env-DATA) - that automates annotation of movement trajectories with remote-sensing environmental information, including high resolution topography, weather from global and regional reanalysis datasets, climatology, human geography, ocean currents and productivity, land use, vegetation and land surface variables, precipitation, fire, and other global datasets. The system automates the acquisition of data from open web resources of remote sensing and weather data and provides several interpolation methods from the native grid resolution and structure to a global regular grid linked with the movement tracks in space and time. Env-DATA provides an easy-to-use platform for end users that eliminates technical difficulties of the annotation processes, including data acquisition, data transformation and integration, resampling, interpolation and interpretation. The new Env-DATA system enhances Movebank (www.movebank.org), an open portal of animal tracking data. The aim is to facilitate new understanding and predictive capabilities of spatiotemporal patterns of animal movement in response to dynamic and changing environments from local to global scales. The system is already in use by scientists worldwide, and by several conservation managers, such as the consortium of federal and private institution that manage the endangered Californian Condor populations. 
    more » « less
  2. Abstract The 2015 spring flood of the Sagavanirktok River inundated large swaths of tundra as well as infrastructure near Prudhoe Bay, Alaska. Its lasting impact on permafrost, vegetation, and hydrology is unknown but compels attention in light of changing Arctic flood regimes. We combined InSAR and optical satellite observations to quantify subdecadal permafrost terrain changes and identify their controls. While the flood locally induced quasi‐instantaneous ice‐wedge melt, much larger areas were characterized by subtle, spatially variable post‐flood changes. Surface deformation from 2015 to 2019 estimated from ALOS‐2 and Sentinel‐1 InSAR varied substantially within and across terrain units, with greater subsidence on average in flooded locations. Subsidence exceeding 5 cm was locally observed in inundated ice‐rich units and also in inactive floodplains. Overall, subsidence increased with deposit age and thus ground ice content, but many flooded ice‐rich units remained stable, indicating variable drivers of deformation. On average, subsiding ice‐rich locations showed increases in observed greenness and wetness. Conversely, many ice‐poor floodplains greened without deforming. Ice wedge degradation in flooded locations with elevated subsidence was mostly of limited intensity, and the observed subsidence largely stopped within 2 years. Based on remote sensing and limited field observations, we propose that the disparate subdecadal changes were influenced by spatially variable drivers (e.g., sediment deposition, organic layer), controls (ground ice and its degree of protection), and feedback processes. Remote sensing helps quantify the heterogeneous interactions between permafrost, vegetation, and hydrology across permafrost‐affected fluvial landscapes. Interdisciplinary monitoring is needed to improve predictions of landscape dynamics and to constrain sediment, nutrient, and carbon budgets. 
    more » « less
  3. null (Ed.)
    Survival data is often collected in medical applications from a heterogeneous population of patients. While in the past, popular survival models focused on modeling the average effect of the covariates on survival outcomes, rapidly advancing sensing and information technologies have provided opportunities to further model the heterogeneity of the population as well as the non-linearity of the survival risk. With this motivation, we propose a new semi-parametric Bayesian Survival Rule List model in this paper. Our model derives a rule-based decision-making approach, while within the regime defined by each rule, survival risk is modelled via a Gaussian process latent variable model. Markov Chain Monte Carlo with a nested Laplace approximation on the Gaussian process posterior is used to search over the posterior of the rule lists efficiently. The use of ordered rule lists enables us to model heterogeneity while keeping the model complexity in check. Performance evaluations on a synthetic heterogeneous survival dataset and a real world sepsis survival dataset demonstrate the effectiveness of our model. 
    more » « less
  4. This study introduces a new automated system that blends multi-satellite information and citizen science data for reliable and timely observations of lake and river ice in under-observed northern regions. The system leverages the Google Earth Engine resources to facilitate the analysis and visualization of ice conditions. The adopted approach utilizes a combination of moderate and high-resolution optical data, along with radar observations. The results demonstrate the system’s capability to accurately detect and monitor river ice, particularly during key periods, such as the freeze-up and the breakup. The integration citizen science data showed added values in the validation of remote sensing products, as well as filling gaps whenever satellite observations cannot be collected due to cloud obstruction. Moreover, it was shown that citizen science data can be converted to valuable quantitative information, such as the case of ice thickness, which is very useful when combined with ice extent derived from remote sensing. In this study, citizen science data were employed for the quantitative assessment of the remote sensing product. Obtained results showed a good agreement between the product and observed river status, with a Critical Success Index of 0.82. Notably, the system has shown effectiveness in capturing the spatial and temporal evolution of snow and ice conditions, as evidenced by its application in analyzing specific ice jam events in 2023. The study concludes that the developed system marks a significant advancement in river ice monitoring, combining technological innovation with community engagement. 
    more » « less
  5. High performance computing (HPC) system runs compute-intensive parallel applications requiring large number of nodes. An HPC system consists of heterogeneous computer architecture nodes, including CPUs, GPUs, field programmable gate arrays (FPGAs), etc. Power capping is a method to improve parallel application performance subject to variable power constraints. In this paper, we propose a parallel application power and performance prediction simulator. We present prediction model to predict application power and performance for unknown power-capping values considering heterogeneous computing architecture. We develop a job scheduling simulator based on parallel discrete-event simulation engine. The simulator includes a power and performance prediction model, as well as a resource allocation model. Based on real-life measurements and trace data, we show the applicability of our proposed prediction model and simulator. 
    more » « less