skip to main content


Title: Supervised learning of large perceptual organization: graph spectral partitioning and learning automata
Perceptual organization offers an elegant framework to group low-level features that are likely to come from a single object. We offer a novel strategy to adapt this grouping process to objects in a domain. Given a set of training images of objects in context, the associated learning process decides on the relative importance of the basic salient relationships such as proximity, parallelness, continuity, junctions, and common region toward segregating the objects from the background. The parameters of the grouping process are cast as probabilistic specifications of Bayesian networks that need to be learned. This learning is accomplished using a team of stochastic automata in an N-player cooperative game framework. The grouping process, which is based on graph partitioning is able to form large groups from relationships defined over a small set of primitives and is fast. We statistically demonstrate the robust performance of the grouping and the learning frameworks on a variety of real images. Among the interesting conclusions is the significant role of photometric attributes in grouping and the ability to form large salient groups from a set of local relations, each defined over a small number of primitives.  more » « less
Award ID(s):
9907141
NSF-PAR ID:
10346818
Author(s) / Creator(s):
;
Date Published:
Journal Name:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume:
22
Issue:
5
ISSN:
0162-8828
Page Range / eLocation ID:
504 to 525
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This work in progress paper presents an example of conducting a systematic literature review (SLR) to understand students’ affective response to active learning practices, and it focuses on the development and testing of a coding form for analyzing the literature. Specifically, the full paper seeks to answer: (1) what affective responses do instructors measure, (2) what evidence is used to study those responses, and (3) how are course features connected with student response. We conducted database searches with carefully-defined search queries which resulted in 2,365 abstracts from 1990 to 2015. Each abstract was screened by two researchers based on meeting inclusion criteria, with an adjudication round in the case of disagreement. We used RefWorks, an online citation management program, to track abstracts during this process. We identified over 480 abstracts which satisfied our criteria. Following abstract screening, we developed and tested a manuscript coding guide to capture the salient characteristics of each paper. We created an initial coding form by determining what paper topics would address our research questions and reviewing the literature to determine the most frequent response categories. We then piloted and tested the reliability of the form over three rounds of independent pair-coding, with each round resulting in clarifications to the form and mutual agreement on terms’ meanings. This process of developing a manuscript coding guide demonstrates how to use free online tools, such as Google Forms and Google Sheets, to inexpensively manage a large SLR team with significant turnover. Currently, we are in the process of applying the coding guide to the full texts. When complete, the resulting data will be synthesized by creating and testing relationships between variables, using each primary source as a case study to support or refute the hypothesized relationship. 
    more » « less
  2. Abbott, Derek (Ed.)
    Abstract

    Human vision, thought, and planning involve parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using neural networks, but a generative model formulation has been lacking. Generative models that leverage compositionality, recursion, and part-whole hierarchies are thought to underlie human concept learning and the ability to construct and represent flexible mental concepts. We introduce Recursive Neural Programs (RNPs), a neural generative model that addresses the part-whole hierarchy learning problem by modeling images as hierarchical trees of probabilistic sensory-motor programs. These programs recursively reuse learned sensory-motor primitives to model an image within different spatial reference frames, enabling hierarchical composition of objects from parts and implementing a grammar for images. We show that RNPs can learn part-whole hierarchies for a variety of image datasets, allowing rich compositionality and intuitive parts-based explanations of objects. Our model also suggests a cognitive framework for understanding how human brains can potentially learn and represent concepts in terms of recursively defined primitives and their relations with each other.

     
    more » « less
  3. Interactive visual analytics over distributed systems housing voluminous datasets is hindered by three main factors - disk and network I/O, and data processing overhead. Requests over geospatial data are prone to erratic query load and hotspots due to users’ simultaneous interest over a small sub-domain of the overall data space at a time. Interactive analytics in a distributed setting is further hindered in cases of voluminous datasets with large/high-dimensional data objects, such as multi-spectral satellite imagery. The size of the data objects prohibits efficient caching mechanisms that could significantly reduce response latencies. Additionally, extracting information from these large data objects incurs significant data processing overheads and they often entail resource-intensive computational methods. Here, we present our framework, ARGUS, that extracts low- dimensional representation (embeddings) of high-dimensional satellite images during ingestion and houses them in the cache for use in model-driven analysis relating to wildfire detection. These embeddings are versatile and are used to perform model- based extraction of analytical information for a set of dif- ferent scenarios, to reduce the high computational costs that are involved with typical transformations over high-dimensional datasets. The models for each such analytical process are trained in a distributed manner in a connected, multi-task learning fashion, along with the encoder network that generates the original embeddings. 
    more » « less
  4. Human visual grouping processes consolidate independent visual objects into grouped visual features on the basis of shared characteristics; these visual features can themselves be grouped, resulting in a hierarchical representation of visual grouping information. This “grouping hierarchy“ promotes ef- ficient attention in the support of goal-directed behavior, but improper grouping of elements of a visual scene can also re- sult in critical behavioral errors. Understanding of how visual object/features characteristics such as size and form influences perception of hierarchical visual groups can further theory of human visual grouping behavior and contribute to effective in- terface design. In the present study, participants provided free- response groupings of a set of stimuli that contained consistent structural relationships between a limited set of visual features. These grouping patterns were evaluated for relationships be- tween specific characteristics of the constituent visual features and the distribution of features across levels of the indicated grouping hierarchy. We observed that while the relative size of the visual features differentiated groupings across levels of the grouping hierarchy, the form of visual objects and features was more likely to distinguish separate groups within a partic- ular level of hierarchy. These consistent relationships between visual feature characteristics and placement within a grouping hierarchy can be leveraged to advance computational theories of human visual grouping behavior, which can in turn be ap- plied to effective design for interfaces such as voter ballots. 
    more » « less
  5. Human visual grouping processes consolidate independent visual objects into grouped visual features on the basis of shared characteristics; these visual features can themselves be grouped, resulting in a hierarchical representation of visual grouping information. This “grouping hierarchy“ promotes ef- ficient attention in the support of goal-directed behavior, but improper grouping of elements of a visual scene can also re- sult in critical behavioral errors. Understanding of how visual object/features characteristics such as size and form influences perception of hierarchical visual groups can further theory of human visual grouping behavior and contribute to effective in- terface design. In the present study, participants provided free- response groupings of a set of stimuli that contained consistent structural relationships between a limited set of visual features. These grouping patterns were evaluated for relationships be- tween specific characteristics of the constituent visual features and the distribution of features across levels of the indicated grouping hierarchy. We observed that while the relative size of the visual features differentiated groupings across levels of the grouping hierarchy, the form of visual objects and features was more likely to distinguish separate groups within a partic- ular level of hierarchy. These consistent relationships between visual feature characteristics and placement within a grouping hierarchy can be leveraged to advance computational theories of human visual grouping behavior, which can in turn be ap- plied to effective design for interfaces such as voter ballots. 
    more » « less