NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Regression with Uncertainty Quantification in Large Scale Complex Data

https://doi.org/10.1109/SMC53654.2022.9945367

Wilkins, Nicholas; Johnson, Michael; Nwogu, Ifeoma (October 2022, 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC))

While several methods for predicting uncertainty on deep networks have been recently proposed, they do not always readily translate to large and complex datasets without significant overhead. In this paper we utilize a special instance of the Mixture Density Networks (MDNs) to produce an elegant and compact approach to quantity uncertainty in regression problems. When applied to standard regression benchmark datasets, we show an improvement in predictive log-likelihood and root-mean-square-error when compared to existing state-of-the-art methods. We demonstrate the efficacy and practical usefulness of the method for (i) predicting future stock prices from stochastic, highly volatile time-series data; (ii) anomaly detection in real-life highly complex video segments; and (iii) the task of age estimation and data cleansing on the challenging IMDb-Wiki dataset of half a million face images.
more » « less
Full Text Available
Dynamic Cross-Feature Fusion for American Sign Language Translation

https://doi.org/10.1109/FG52635.2021.9667027

Ananthanarayana, Tejaswini; Kotecha, Nikunj; Srivastava, Priyanshu; Chaudhary, Lipisha; Wilkins, Nicholas; Nwogu, Ifeoma (December 2021, 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021))

While a significant amount of work has been done on the commonly used, tightly -constrained weather-based, German sign language (GSL) dataset, little has been done for continuous sign language translation (SLT) in more realistic settings, including American sign language (ASL) translation. Also, while CNN - based features have been consistently shown to work well on the GSL dataset, it is not clear whether such features will work as well in more realistic settings when there are more heterogeneous signers in non-uniform backgrounds. To this end, in this work, we introduce a new, realistic phrase-level ASL dataset (ASLing), and explore the role of different types of visual features (CNN embeddings, human body keypoints, and optical flow vectors) in translating it to spoken American English. We propose a novel Transformer-based, visual feature learning method for ASL translation. We demonstrate the explainability efficacy of our proposed learning methods by visualizing activation weights under various input conditions and discover that the body keypoints are consistently the most reliable set of input features. Using our model, we successfully transfer-learn from the larger GSL dataset to ASLing, resulting in significant BLEU score improvements. In summary, this work goes a long way in bringing together the AI resources required for automated ASL translation in unconstrained environments.
more » « less
Full Text Available
Analyzing the Extent of Rapport in Groups of Triads Via Interactional Synchrony

https://doi.org/10.1109/SMC42975.2020.9283474

Wilkins, Nicholas; Nwogu, Ifeoma (October 2020, IEEE International Conference on Systems, Man, and Cybernetics (SMC 2020))
null (Ed.)
Full Text Available
Modeling Global Body Configurations in American Sign Language

https://doi.org/10.21437/Interspeech.2020-2873

Wilkins, Nicholas; Galbraith, Max Cordes; Nwogu, Ifeoma (October 2020, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association)
null (Ed.)
Full Text Available

Search for: All records