NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Sensorimotor adaptation reveals systematic biases in 3D perception

https://doi.org/10.1038/s41598-025-88214-x

Lim, Chaeeun; Vishwanath, Dhanraj; Domini, Fulvio (January 2025, Scientific Reports)
Embeddedness of Earth's gravity in visual perception

https://doi.org/10.1167/jov.24.11.4

Deeb, Abdul-Rahim; Domini, Fulvio (October 2024, Journal of Vision)

Full Text Available
On Human-like Biases in Convolutional Neural Networks for the Perception of Slant from Texture

https://doi.org/10.1145/3613451

Wang, Yuanhao; Zhang, Qian; Aubuchon, Celine; Kemp, Jovan; Domini, Fulvio; Tompkin, James (August 2023, ACM Transactions on Applied Perception)

Depth estimation is fundamental to 3D perception, and humans are known to have biased estimates of depth. This study investigates whether convolutional neural networks (CNNs) can be biased when predicting the sign of curvature and depth of surfaces of textured surfaces under different viewing conditions (field of view) and surface parameters (slant and texture irregularity). This hypothesis is drawn from the idea that texture gradients described by local neighborhoods—a cue identified in human vision literature—are also representable within convolutional neural networks. To this end, we trained both unsupervised and supervised CNN models on the renderings of slanted surfaces with random Polka dot patterns and analyzed their internal latent representations. The results show that the unsupervised models have similar prediction biases as humans across all experiments, while supervised CNN models do not exhibit similar biases. The latent spaces of the unsupervised models can be linearly separated into axes representing field of view and optical slant. For supervised models, this ability varies substantially with model architecture and the kind of supervision (continuous slant vs. sign of slant). Even though this study says nothing of any shared mechanism, these findings suggest that unsupervised CNN models can share similar predictions to the human visual system. Code: github.com/brownvc/Slant-CNN-Biases
more » « less
Full Text Available
Perceiving depth from texture and disparity cues: Evidence for a non-probabilistic account of cue integration

https://doi.org/10.1167/jov.23.7.13

Kemp, Jovan T.; Cesanek, Evan; Domini, Fulvio (July 2023, Journal of Vision)

Full Text Available
The case against probabilistic inference: a new deterministic theory of 3D visual processing

https://doi.org/10.1098/rstb.2021.0458

Domini, Fulvio (January 2023, Philosophical Transactions of the Royal Society B: Biological Sciences)

How the brain derives 3D information from inherently ambiguous visual input remains the fundamental question of human vision. The past two decades of research have addressed this question as a problem of probabilistic inference, the dominant model being maximum-likelihood estimation (MLE). This model assumes that independent depth-cue modules derive noisy but statistically accurate estimates of 3D scene parameters that are combined through a weighted average. Cue weights are adjusted based on the system representation of each module's output variability. Here I demonstrate that the MLE model fails to account for important psychophysical findings and, importantly, misinterprets the just noticeable difference, a hallmark measure of stimulus discriminability, to be an estimate of perceptual uncertainty. I propose a new theory, termed Intrinsic Constraint, which postulates that the visual system does not derive the most probable interpretation of the visual input, but rather, the most stable interpretation amid variations in viewing conditions. This goal is achieved with the Vector Sum model, which represents individual cue estimates as components of a multi-dimensional vector whose norm determines the combined output. This model accounts for the psychophysical findings cited in support of MLE, while predicting existing and new findings that contradict the MLE model. This article is part of a discussion meeting issue ‘New approaches to 3D vision’.
more » « less
Full Text Available

Search for: All records