A geologic map is both a visual depiction of the lithologies and structures occurring at the Earth’s surface and a representation of a conceptual model for the geologic history in a region. The work needed to capture such multifaced information in an accurate geologic map is time consuming. Remote sensing can complement traditional primary field observations, geochemistry, chronometry, and subsurface geophysical data in providing useful information to assist with the geologic mapping process. Two novel sources of remote sensing data are particularly relevant for geologic mapping applications: decameter-resolution imaging spectroscopy (spectroscopic imaging) and meter-resolution multispectral shortwave infrared (SWIR) imaging. Decameter spectroscopic imagery can capture important mineral absorptions but is frequently unable to spatially resolve important geologic features. Meter-resolution multispectral SWIR images are better able to resolve fine spatial features but offer reduced spectral information. Such disparate but complementary datasets can be challenging to integrate into the geologic mapping process. Here, we conduct a comparative analysis of spatial and spectral scaling for two such datasets: one Airborne Visible/Infrared Imaging Spectrometer—Classic (AVIRIS-classic) flightline, and one WorldView-3 (WV3) scene, for a geologically complex landscape in Anza-Borrego Desert State Park, California. To do so, we use a two-stage framework that synthesizes recent advances in the spectral mixture residual and joint characterization. The mixture residual uses the wavelength-explicit misfit of a linear spectral mixture model to capture low variance spectral signals. Joint characterization utilizes nonlinear dimensionality reduction (manifold learning) to visualize spectral feature space topology and identify clusters of statistically similar spectra. For this study area, the spectral mixture residual clearly reveals greater spectral dimensionality in AVIRIS than WorldView (99% of variance in 39 versus 5 residual dimensions). Additionally, joint characterization shows more complex spectral feature space topology for AVIRIS than WorldView, revealing information useful to the geologic mapping process in the form of mineralogical variability both within and among mapped geologic units. These results illustrate the potential of recent and planned imaging spectroscopy missions to complement high-resolution multispectral imagery—along with field and lab observations—in planning, collecting, and interpreting the results from geologic field work.
more »
« less
An information-theoretic approach to study spatial dependencies in small datasets
From epidemiology to economics, there is a fundamental need of statistically principled approaches to unveil spatial patterns and identify their underpinning mechanisms. Grounded in network and information theory, we establish a non-parametric scheme to study spatial associations from limited measurements of a spatial process. Through the lens of network theory, we relate spatial patterning in the dataset to the topology of a network on which the process unfolds. From the available observations of the spatial process and a candidate network topology, we compute a mutual information statistic that measures the extent to which the measurement at a node is explained by observations at neighbouring nodes. For a class of networks and linear autoregressive processes, we establish closed-form expressions for the mutual information statistic in terms of network topological features. We demonstrate the feasibility of the approach on synthetic datasets comprising 25–100 measurements, generated by linear or nonlinear autoregressive processes. Upon validation on synthetic processes, we examine datasets of human migration under climate change in Bangladesh and motor vehicle deaths in the United States of America. For both these real datasets, our approach is successful in identifying meaningful spatial patterns, begetting statistically-principled insight into the mechanisms of important socioeconomic problems.
more »
« less
- Award ID(s):
- 1561134
- PAR ID:
- 10294999
- Date Published:
- Journal Name:
- Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
- Volume:
- 476
- Issue:
- 2242
- ISSN:
- 1364-5021
- Page Range / eLocation ID:
- 20200113
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Time synchronized measurements of voltage magnitudes or phasors are increasingly common in electrical networks. Voltage measurement statistics are informative of the underlying network structure or topology making them useful for grid monitoring. However, this connection is poorly understood and many proposed voltage analytics are purely heuristic. We use graph theory to establish sound theoretical connections between voltage measurements and the structure of the underlying network. Our results are important for many applications, from topology estimation to missing data recovery. Based on this new theory, we discuss existing analytics, transforming them from heuristic to theoretically justified approaches, and introduce new analytics. We clarify all assumptions made, to indicate when analytics may fail or perform poorly. Our work enables voltage measurement streams to be transformed into physically meaningful, intuitive, visualizable, actionable information through simple algorithms.more » « less
-
In recent years there has been significant interest in evolutionary analysis of large-scale networks. Researchers study network evolution rate and mechanisms, the impact of specific events on evolution, and spatial and spatio-temporal patterns. To support data scientists who are studying network evolution, there is a need to develop scalable and generalizable systems. Tangible systems progress in turn depends on the availability of standardized datasets on which performance can be tested. In this work, we make progress towards a data generator for evolving property graphs, which represent evolution of graph topology, and of vertex and edge attributes. We propose an attribute-based model of preferential attachment, and instantiate this model on a co-authorship network derived from DBLP, with attributes representing publication venues of the authors. We show that this attribute-based model predicts which edges are created more accurately than a structure-only model. Finally, we demonstrate that synthetic graphs are indeed useful for evaluating performance of evolving graph query primitives.more » « less
-
Chaudhuri, K (Ed.)How can we efficiently gather information to optimize an unknown function, when presented with multiple, mutually dependent information sources with different costs? For example, when optimizing a physical system, intelligently trading off computer simulations and real-world tests can lead to significant savings. Existing multi-fidelity Bayesian optimization methods, such as multi-fidelity GP-UCB or Entropy Search-based approaches, either make simplistic assumptions on the interaction among different fidelities or use simple heuristics that lack theoretical guarantees. In this paper, we study multifidelity Bayesian optimization with complex structural dependencies among multiple outputs, and propose MF-MI-Greedy, a principled algorithmic framework for addressing this problem. In particular, we model different fidelities using additive Gaussian processes based on shared latent relationships with the target function. Then we use cost-sensitive mutual information gain for efficient Bayesian optimization. We propose a simple notion of regret which incorporates the varying cost of different fidelities, and prove that MF-MI-Greedy achieves low regret. We demonstrate the strong empirical performance of our algorithm on both synthetic and real-world datasets.more » « less
-
Abstract Researchers theorize that many real-world networks exhibit community structure where within-community edges are more likely than between-community edges. While numerous methods exist to cluster nodes into different communities, less work has addressed this question: given some network, does it exhibitstatistically meaningfulcommunity structure? We answer this question in a principled manner by framing it as a statistical hypothesis test in terms of a general and model-agnostic community structure parameter. Leveraging this parameter, we propose a simple and interpretable test statistic used to formulate two separate hypothesis testing frameworks. The first is an asymptotic test against a baseline value of the parameter while the second tests against a baseline model using bootstrap-based thresholds. We prove theoretical properties of these tests and demonstrate how the proposed method yields rich insights into real-world datasets.more » « less
An official website of the United States government

