NSF PAR Search | NSF Public Access Repository

Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems

https://doi.org/10.1145/3654703

Nguyen, Viet Dung; Bailey, Reynold; Diaz, Gabriel J; Ma, Chengyi; Fix, Alexander; Ororbia, Alexander (May 2024, Proceedings of the ACM on Computer Graphics and Interactive Techniques)

Eye image segmentation is a critical step in eye tracking that has great influence over the final gaze estimate. Although segmentation models trained using supervised machine learning can excel at this task, their effectiveness is determined by the degree of overlap between the narrow distributions of image properties defined by the target dataset and highly specific training datasets, of which there are few. Attempts to broaden the distribution of existing eye image datasets through the inclusion of synthetic eye images have found that a model trained on synthetic images will often fail to generalize back to real-world eye images. To remedy this, we use dimensionality-reduction techniques to measure the overlap between the target eye images and synthetic training data, and to prune the training dataset in a manner that maximizes distribution overlap. We demonstrate that our methods result in robust, improved performance when tackling the discrepancy between simulation and real-world data samples.
Full Text Available
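
The pruning idea described in the abstract of "Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems" can be illustrated with a minimal sketch. The abstract does not say which dimensionality-reduction technique or overlap measure the authors use, so everything here is an assumption made for illustration: PCA (via SVD) as the reduction, nearest-neighbor distance to the real samples as the overlap proxy, and the hypothetical names `prune_synthetic`, `n_components`, and `keep_fraction`. Random vectors stand in for image features.

```python
import numpy as np


def prune_synthetic(real, synth, n_components=2, keep_fraction=0.5):
    """Return indices of the synthetic samples closest to the real data.

    Assumption: PCA plus nearest-neighbor distance stands in for the
    unspecified dimensionality-reduction and overlap measure in the paper.
    """
    # Fit PCA on the pooled data so both sets share one low-dimensional space.
    pooled = np.vstack([real, synth])
    mean = pooled.mean(axis=0)
    _, _, vt = np.linalg.svd(pooled - mean, full_matrices=False)
    basis = vt[:n_components].T

    real_z = (real - mean) @ basis
    synth_z = (synth - mean) @ basis

    # Distance from each synthetic sample to its nearest real sample.
    pairwise = np.linalg.norm(synth_z[:, None, :] - real_z[None, :, :], axis=2)
    nearest = pairwise.min(axis=1)

    # Keep the fraction of synthetic samples with the smallest distances.
    n_keep = max(1, int(round(keep_fraction * len(synth))))
    keep = np.sort(np.argsort(nearest)[:n_keep])
    return keep, nearest


# Toy data (hypothetical): half of the synthetic set matches the real
# distribution, the other half is shifted far away from it.
rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=(100, 16))
synth_near = rng.normal(0.0, 1.0, size=(50, 16))
synth_far = rng.normal(5.0, 1.0, size=(50, 16))
synth = np.vstack([synth_near, synth_far])

keep, nearest = prune_synthetic(real, synth, keep_fraction=0.5)
print("kept", len(keep), "of", len(synth), "synthetic samples")
```

In this toy setup the shifted half of the synthetic set lies far from every real sample in the reduced space, so it is the half that gets pruned. A real pipeline would replace the random vectors with features extracted from eye images and would need a principled way to choose `keep_fraction`.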
A Robust Backpropagation-Free Framework for Images

Zee, Timothy; Ororbia, Alexander G; Mali, Ankur; Nwogu, Ifeoma (November 2023, Transactions on machine learning research)
Richards, Blake A (Ed.)
While current deep learning algorithms have been successful for a wide variety of artificial intelligence (AI) tasks, including those involving structured image data, they present deep neurophysiological conceptual issues due to their reliance on the gradients that are computed by backpropagation of errors (backprop). Gradients are required to obtain synaptic weight adjustments but require knowledge of feed-forward activities in order to conduct backward propagation, a biologically implausible process. This is known as the “weight transport problem”. Therefore, in this work, we present a more biologically plausible approach towards solving the weight transport problem for image data. This approach, which we name the error-kernel driven activation alignment (EKDAA) algorithm, accomplishes this through the introduction of locally derived error transmission kernels and error maps. Like standard deep learning networks, EKDAA performs the standard forward process via weights and activation functions; however, its backward error computation involves adaptive error kernels that propagate local error signals through the network. The efficacy of EKDAA is demonstrated by performing visual-recognition tasks on the Fashion MNIST, CIFAR-10, and SVHN benchmarks, along with demonstrating its ability to extract visual features from natural color images. Furthermore, in order to demonstrate its non-reliance on gradient computations, results are presented for an EKDAA-trained CNN that employs a non-differentiable activation function.
Full Text Available
Online evolutionary neural architecture search for multivariate non-stationary time series forecasting

https://doi.org/10.1016/j.asoc.2023.110522

Lyu, Zimeng; Ororbia, Alexander; Desell, Travis (September 2023, Applied Soft Computing)

Full Text Available
A neural active inference model of perceptual-motor learning

https://doi.org/10.3389/fncom.2023.1099593

Yang, Zhizhuo; Diaz, Gabriel J.; Fajen, Brett R.; Bailey, Reynold; Ororbia, Alexander G. (February 2023, Frontiers in Computational Neuroscience)

The active inference framework (AIF) is a promising new computational framework grounded in contemporary neuroscience that can produce human-like behavior through reward-based learning. In this study, we test the ability of the AIF to capture the role of anticipation in the visual guidance of action in humans through the systematic investigation of a well-explored visual-motor task: intercepting a target moving over a ground plane. Previous research demonstrated that humans performing this task resorted to anticipatory changes in speed intended to compensate for semi-predictable changes in target speed later in the approach. To capture this behavior, our proposed “neural” AIF agent uses artificial neural networks to select actions on the basis of a very short-term prediction of the information about the task environment that these actions would reveal, along with a long-term estimate of the resulting cumulative expected free energy. Systematic variation revealed that anticipatory behavior emerged only when required by limitations on the agent's movement capabilities, and only when the agent was able to estimate accumulated free energy over sufficiently long durations into the future. In addition, we present a novel formulation of the prior mapping function that maps a multi-dimensional world-state to a uni-dimensional distribution of free-energy/reward. Together, these results demonstrate the use of AIF as a plausible model of anticipatory visually guided behavior in humans.
Full Text Available
Multimodal Modeling of Task-Mediated Confusion

https://doi.org/10.18653/v1/2022.naacl-srw.24

Mince, Camille; Rhomberg, Skye; Alm, Cecilia; Bailey, Reynold; Ororbia, Alexander (January 2022, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop)

In order to build more human-like cognitive agents, systems capable of detecting various human emotions must be designed to respond appropriately. Confusion, the combination of an emotional and cognitive state, is under-explored. In this paper, we build upon prior work to develop models that detect confusion from three modalities: video (facial features), audio (prosodic features), and text (transcribed speech features). Our research improves the data collection process by allowing for continuous (as opposed to discrete) annotation of confusion levels. We also craft models based on recurrent neural networks (RNNs) given their ability to predict sequential data. In our experiments, we find that text and video modalities are the most important in predicting confusion while the explored audio features are relatively unimportant predictors of confusion in our data.
Full Text Available
Like a Baby: Visually Situated Neural Language Acquisition

Ororbia, Alexander G; Mali, Ankur; Kelly, Matthew A; Reitter, David (January 2019, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL))

Full Text Available
