skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Limited information-processing capacity in vision explains number psychophysics.
Humans and other animals are able to perceive and represent a number of objects present in a scene, a core cognitive ability thought to underlie the development of mathematics. However, the perceptual mechanisms that underpin this capacity remain poorly understood. Here, we show that our visual sense of number derives from a visual system designed to efficiently encode the location of objects in scenes. Using a mathematical model, we demonstrate that an efficient but information-limited encoding of objects’ locations can explain many key aspects of number psychophysics, including subitizing, Weber’s law, underestimation, and effects of exposure time. In two experiments (N = 100 each), we find that this model of visual encoding captures human performance in both a change-localization task and a number estimation task. In a third experiment (N = 100), we find that individual differences in change-localization performance are highly predictive of differences in number estimation, both in terms of overall performance and inferred model parameters, with participants having numerically indistinguishable inferred information capacities across tasks. Our results therefore indicate that key psychophysical features of numerical cognition do not arise from separate modules or capacities specific to number, but rather as by-products of lower level constraints on perception.  more » « less
Award ID(s):
2201843
PAR ID:
10516751
Author(s) / Creator(s):
; ;
Publisher / Repository:
American Psychological Assocition
Date Published:
Journal Name:
Psychological Review
ISSN:
0033-295X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    People can identify the number of objects in small sets rapidly and without error but become increasingly noisy for larger sets. However, the cognitive mechanisms underlying these ubiquitous psychophysics are poorly understood. We present a model of a limitedcapacity visual system optimized to individuate and remember the location of objects in a scene which gives rise to all key aspects of number psychophysics, including error-free small number perception and scalar variability for larger numbers. We therefore propose that number psychophysics can be understood as an emergent property of primitive perceptual mechanisms — namely, the process of identifying and representing individual objects in a scene. To test our theory, we ran two experiments: a change-localization task to measure participants’ memory for the locations of objects (Experiment 1) and a numerical estimation task (Experiment 2). Our model accounts well for participants’ performance in both experiments, despite only being optimized to efficiently encode where objects are present in a scene. Our results demonstrate that the key psychophysical features of numerical cognition do not arise from separate modules or capacities specific to number, but rather from lower-level constraints on perception which are manifested even in non-numerical tasks 
    more » « less
  2. Collaborative localization is an essential capability for a team of robots such as connected vehicles to collaboratively estimate object locations from multiple perspectives with reliant cooperation. To enable collaborative localization, four key challenges must be addressed, including modeling complex relationships between observed objects, fusing observations from an arbitrary number of collaborating robots, quantifying localization uncertainty, and addressing latency of robot communications. In this paper, we introduce a novel approach that integrates uncertainty-aware spatiotemporal graph learning and model-based state estimation for a team of robots to collaboratively localize objects. Specifically, we introduce a new uncertainty-aware graph learning model that learns spatiotemporal graphs to represent historical motions of the objects observed by each robot over time and provides uncertainties in object localization. Moreover, we propose a novel method for integrated learning and model-based state estimation, which fuses asynchronous observations obtained from an arbitrary number of robots for collaborative localization. We evaluate our approach in two collaborative object localization scenarios in simulations and on real robots. Experimental results show that our approach outperforms previous methods and achieves state-of-the-art performance on asynchronous collaborative localization. 
    more » « less
  3. Collaborative object localization aims to collaboratively estimate locations of objects observed from multiple views or perspectives, which is a critical ability for multi-agent systems such as connected vehicles. To enable collaborative localization, several model-based state estimation and learning-based localization methods have been developed. Given their encouraging performance, model-based state estimation often lacks the ability to model the complex relationships among multiple objects, while learning-based methods are typically not able to fuse the observations from an arbitrary number of views and cannot well model uncertainty. In this paper, we introduce a novel spatiotemporal graph filter approach that integrates graph learning and model-based estimation to perform multi-view sensor fusion for collaborative object localization. Our approach models complex object relationships using a new spatiotemporal graph representation and fuses multi-view observations in a Bayesian fashion to improve location estimation under uncertainty. We evaluate our approach in the applications of connected autonomous driving and multiple pedestrian localization. Experimental results show that our approach outperforms previous techniques and achieves the state-of-the-art performance on collaborative localization. 
    more » « less
  4. Integral imaging (InIm) is useful for passive ranging and 3D visualization of partially-occluded objects. We consider 3D object localization within a scene and in occlusions. 2D localization can be achieved using machine learning and non-machine learning-based techniques. These techniques aim to provide a 2D bounding box around each one of the objects of interest. A recent study uses InIm for the 3D reconstruction of the scene with occlusions and utilizes mutual information (MI) between the bounding box in this 3D reconstructed scene and the corresponding bounding box in the central elemental image to achieve passive depth estimation of partially occluded objects. Here, we improve upon this InIm method by using Bayesian optimization to minimize the number of required 3D scene reconstructions. We evaluate the performance of the proposed approach by analyzing different kernel functions, acquisition functions, and parameter estimation algorithms for Bayesian optimization-based inference for simultaneous depth estimation of objects and occlusion. In our optical experiments, mutual-information-based depth estimation with Bayesian optimization achieves depth estimation with a handful of 3D reconstructions. To the best of our knowledge, this is the first report to use Bayesian optimization for mutual information-based InIm depth estimation. 
    more » « less
  5. Bortfeld, Heather; de Haan, Michelle; Nelson, Charles A.; Quinn, Paul C. (Ed.)
    Abstract Children's ability to discriminate nonsymbolic number (e.g., the number of items in a set) is a commonly studied predictor of later math skills. Number discrimination improves throughout development, but what drives this improvement is unclear. Competing theories suggest that it may be due to a sharpening numerical representation or an improved ability to pay attention to number and filter out non‐numerical information. We investigate this issue by studying change in children's performance (N = 65) on a nonsymbolic number comparison task, where children decide which of two dot arrays has more dots, from the middle to the end of 1st grade (mean age at time 1 = 6.85 years old). In this task, visual properties of the dot arrays such as surface area are either congruent (the more numerous array has more surface area) or incongruent. Children rely more on executive functions during incongruent trials, so improvements in each congruency condition provide information about the underlying cognitive mechanisms. We found that accuracy rates increased similarly for both conditions, indicating a sharpening sense of numerical magnitude, not simply improved attention to the numerical task dimension. Symbolic number skills predicted change in congruent trials, but executive function did not predict change in either condition. No factor predicted change in math achievement. Together, these findings suggest that nonsymbolic number processing undergoes development related to existing symbolic number skills, development that appears not to be driving math gains during this period.Children's ability to discriminate nonsymbolic number improves throughout development. Competing theories suggest improvement due to sharpening magnitude representations or changes in attention and inhibition.The current study investigates change in nonsymbolic number comparison performance during first grade and whether symbolic number skills, math skills, or executive function predict change.Children's performance increased across visual control conditions (i.e., congruent or incongruent with number) suggesting an overall sharpening of number processing.Symbolic number skills predicted change in nonsymbolic number comparison performance. 
    more » « less