skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Editors contains: "Zhi"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Zhou, Zhi (Ed.)
  2. Zhi (Ed.)
    Network representation learning aims at transferring node proximity in networks into distributed vectors, which can be leveraged in various downstream applications. Recent research has shown that nodes in a network can often be organized in latent hierarchical structures, but without a particular underlying taxonomy, the learned node embedding is less useful nor interpretable. In this work, we aim to improve network embedding by modeling the conditional node proximity in networks indicated by node labels residing in real taxonomies. In the meantime, we also aim to model the hierarchical label proximity in the given taxonomies, which is too coarse by solely looking at the hierarchical topologies. Comprehensive experiments and case studies demonstrate the utility of TAXOGAN. 
    more » « less
  3. Banerjee, Arindam; Zhou, Zhi-Hua (Ed.)
    To characterize the location (mean, median) of a set of graphs, one needs a notion of centrality that is adapted to metric spaces, since graph sets are not Euclidean spaces. A standard approach is to consider the Fr ́echet mean. In this work, we equip a set of graph with the pseudometric defined by the l2 norm between the eigenvalues of their respective adjacency matrix. Unlike the edit distance, this pseudometric reveals structural changes at multiple scales, and is well adapted to studying various statistical problems for graph-valued data. We describe an algorithm to compute an approximation to the sample Fr ́echet mean of a set of undirected unweighted graphs with a fixed size using this pseudometric. 
    more » « less
  4. JIA, Zhi-Yun (Ed.)
    Abstract Understanding how alien species assemble is crucial for predicting changes to community structure caused by biological invasions and for directing management strategies for alien species, but patterns and drivers of alien species assemblages remain poorly understood relative to native species. Climate has been suggested as a crucial filter of invasion-driven homogenization of biodiversity. However, it remains unclear which climatic factors drive the assemblage of alien species. Here, we compiled global data at both grid scale (2,653 native and 2,806 current grids with a resolution of 2° × 2°) and administrative scale (271 native and 297 current nations and sub-nations) on the distributions of 361 alien amphibians and reptiles (herpetofauna), the most threatened vertebrate group on the planet. We found that geographical distance, a proxy for natural dispersal barriers, was the dominant variable contributing to alien herpetofaunal assemblage in native ranges. In contrast, climatic factors explained more unique variation in alien herpetofaunal assemblage after than before invasions. This pattern was driven by extremely high temperatures and precipitation seasonality, 2 hallmarks of global climate change, and bilateral trade which can account for the alien assemblage after invasions. Our results indicated that human-assisted species introductions combined with climate change may accelerate the reorganization of global species distributions. 
    more » « less
  5. Shekhar, Shashi; Zhou, Zhi-Hua; Chiang, Yao-Yi; Stiglic, Gregor (Ed.)
    Rapid advancement in inverse modeling methods have brought into light their susceptibility to imperfect data. This has made it imperative to obtain more explainable and trustworthy estimates from these models. In hydrology, basin characteristics can be noisy or missing, impacting streamflow prediction. We propose a probabilistic inverse model framework that can reconstruct robust hydrology basin characteristics from dynamic input weather driver and streamflow response data. We address two aspects of building more explainable inverse models, uncertainty estimation (uncertainty due to imperfect data and imperfect model) and robustness. This can help improve the trust of water managers, handling of noisy data and reduce costs. We also propose an uncertainty based loss regularization that offers removal of 17% of temporal artifacts in reconstructions, 36% reduction in uncertainty and 4% higher coverage rate for basin characteristics. The forward model performance (streamflow estimation) is also improved by 6% using these uncertainty learning based reconstructions. 
    more » « less
  6. Shekhar, Shashi; Zhou, Zhi-Hua; Chiang, Yao-Yi; Stiglic, Gregor (Ed.)
    In many environmental applications, recurrent neural networks (RNNs) are often used to model physical variables with long temporal dependencies. However, due to minibatch training, temporal relationships between training segments within the batch (intra-batch) as well as between batches (inter-batch) are not considered, which can lead to limited performance. Stateful RNNs aim to address this issue by passing hidden states between batches. Since Stateful RNNs ignore intra-batch temporal dependency, there exists a trade-off between training stability and capturing temporal dependency. In this paper, we provide a quantitative comparison of different Stateful RNN modeling strategies, and propose two strategies to enforce both intra- and inter-batch temporal dependency. First, we extend Stateful RNNs by defining a batch as a temporally ordered set of training segments, which enables intra-batch sharing of temporal information. While this approach significantly improves the performance, it leads to much larger training times due to highly sequential training. To address this issue, we further propose a new strategy which augments a training segment with an initial value of the target variable from the timestep right before the starting of the training segment. In other words, we provide an initial value of the target variable as additional input so that the network can focus on learning changes relative to that initial value. By using this strategy, samples can be passed in any order (mini-batch training) which significantly reduces the training time while maintaining the performance. In demonstrating the utility of our approach in hydrological modeling, we observe that the most significant gains in predictive accuracy occur when these methods are applied to state variables whose values change more slowly, such as soil water and snowpack, rather than continuously moving flux variables such as streamflow. 
    more » « less
  7. Shekhar, Shashi; Zhou, Zhi-Hua; Chiang, Yao-Yi; Stiglic, Gregor (Ed.)
    Creating separable representations via representation learning and clustering is critical in analyzing large unstructured datasets with only a few labels. Separable representations can lead to supervised models with better classification capabilities and additionally aid in generating new labeled samples. Most unsupervised and semisupervised methods to analyze large datasets do not leverage the existing small amounts of labels to get better representations. In this paper, we propose a spatiotemporal clustering paradigm that uses spatial and temporal features combined with a constrained loss to produce separable representations. We show the working of this method on the newly published dataset ReaLSAT, a dataset of surface water dynamics for over 680,000 lakes across the world, making it an essential dataset in terms of ecology and sustainability. Using this large unlabelled dataset, we first show how a spatiotemporal representation is better compared to just spatial or temporal representation. We then show how we can learn even better representations using a constrained loss with few labels. We conclude by showing how our method, using few labels, can pick out new labeled samples from the unlabeled data, which can be used to augment supervised methods leading to better classification. 
    more » « less
  8. General Chair: Marc Moreno Chair: Lihong Zhi (Ed.)
    We introduce a new type of reduction in a free difference module over a difference field that uses a generalization of the concept of effective order of a difference polynomial. Then we define the concept of a generalized characteristic set of such a module, establish some properties of these characteristic sets and use them to prove the existence, outline a method of computation and find invariants of a dimension polynomial in two variables associated with a finitely generated difference module. As a consequence of these results, we obtain a new type of bivariate dimension polynomials of finitely generated difference field extensions. We also explain the relationship between these dimension polynomials and the concept of Einstein’s strength of a system of difference equations. 
    more » « less
  9. Zhi-Yun (Ed.)
    Abstract Escape theory has been exceptionally successful in conceptualizing and accurately predicting effects of numerous factors that affect predation risk and explaining variation in flight initiation distance (FID; predator–prey distance when escape begins). Less explored is the relative orientation of an approaching predator, prey, and its eventual refuge. The relationship between an approaching threat and its refuge can be expressed as an angle we call the “interpath angle” or “Φ,” which describes the angle between the paths of predator and prey to the prey’s refuge and thus expresses the degree to which prey must run toward an approaching predator. In general, we might expect that prey would escape at greater distances if they must flee toward a predator to reach its burrow. The “race for life” model makes formal predictions about how Φ should affect FID. We evaluated the model by studying escape decisions in yellow-bellied marmots Marmota flaviventer, a species which flees to burrows. We found support for some of the model’s predictions, yet the relationship between Φ and FID was less clear. Marmots may not assess Φ in a continuous fashion; but we found that binning angle into 4 45° bins explained a similar amount of variation as models that analyzed angle continuously. Future studies of Φ, especially those that focus on how different species perceive relative orientation, will likely enhance our understanding of its importance in flight decisions. 
    more » « less