NSF PAR Search | NSF Public Access Repository

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering

https://doi.org/10.1109/CVPR52688.2022.00502

Gupta, Vipul; Li, Zhuowan; Kortylewski, Adam; Zhang, Chenyu; Li, Yingwei; Yuille, Alan (June 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
Learning from Temporal Gradient for Semi-supervised Action Recognition

https://doi.org/10.1109/CVPR52688.2022.00325

Xiao, Junfei; Jing, Longlong; Zhang, Lin; He, Ju; She, Qi; Zhou, Zongwei; Yuille, Alan; Li, Yingwei (June 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
Unsupervised learning of optical flow with patch consistency and occlusion estimation

https://doi.org/10.1016/j.patcog.2019.107191

Ren, Zhe; Yan, Junchi; Yang, Xiaokang; Yuille, Alan; Zha, Hongyuan (July 2020, Pattern Recognition)
Full Text Available
DASZL: Dynamic Action Signatures for Zero-shot Learning

Kim, Tae Soo; Jones, Jonathan; Peven, Michael; Xiao, Zihao; Bai, Jin; Zhang, Yi; Qiu, Weichao; Yuille, Alan; Hager, Gregory D (February 2021, Proceedings of the AAAI Conference on Artificial Intelligence)

There are many realistic applications of activity recognition where the set of potential activity descriptions is combinatorially large. This makes end-to-end supervised training of a recognition system impractical as no training set is practically able to encompass the entire label set. In this paper, we present an approach to fine-grained recognition that models activities as compositions of dynamic action signatures. This compositional approach allows us to reframe fine-grained recognition as zero-shot activity recognition, where a detector is composed “on the fly” from simple first-principles state machines supported by deep-learned components. We evaluate our method on the Olympic Sports and UCF101 datasets, where our model establishes a new state of the art under multiple experimental paradigms. We also extend this method to form a unique framework for zero-shot joint segmentation and classification of activities in video and demonstrate the first results in zero-shot decoding of complex action sequences on a widely-used surgical dataset. Lastly, we show that we can use off-the-shelf object detectors to recognize activities in completely de-novo settings with no additional training.
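To illustrate what composing a detector "on the fly" from a simple state machine could look like, here is a minimal sketch. It is not the DASZL implementation: the attribute names, the `detect_activity` helper, and the fixed threshold are hypothetical, and the per-frame scores stand in for the outputs of deep-learned components.

```python
from typing import Dict, List, Sequence

# Hypothetical per-frame attribute scores, standing in for the outputs of
# deep-learned detectors (attribute names are illustrative only).
Frame = Dict[str, float]


def detect_activity(frames: Sequence[Frame], signature: List[str],
                    threshold: float = 0.5) -> bool:
    """Return True if the attributes in `signature` fire in the given order.

    The state machine keeps an index into `signature` and advances by one
    state whenever the attribute it is waiting for reaches `threshold`.
    Each frame can advance the machine by at most one state.
    """
    state = 0
    for frame in frames:
        if state == len(signature):
            break  # every state has been visited; the activity is detected
        if frame.get(signature[state], 0.0) >= threshold:
            state += 1
    return state == len(signature)


frames = [
    {"hand_near_door": 0.9, "door_open": 0.1},
    {"hand_near_door": 0.8, "door_open": 0.7},
    {"hand_near_door": 0.1, "door_open": 0.9},
]

# Two activities described as ordered attribute sequences, no training needed.
open_door = ["hand_near_door", "door_open"]
reverse_order = ["door_open", "hand_near_door"]

print(detect_activity(frames, open_door))      # True
print(detect_activity(frames, reverse_order))  # False
```

Because an activity is only an ordered list of attributes in this sketch, a new activity can be described without retraining anything, which is the sense in which recognition becomes zero-shot.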
Full Text Available
STFlow: Self-Taught Optical Flow Estimation Using Pseudo Labels

https://doi.org/10.1109/TIP.2020.3024015

Ren, Zhe; Luo, Wenhan; Yan, Junchi; Liao, Wenlong; Yang, Xiaokang; Yuille, Alan; Zha, Hongyuan (January 2020, IEEE Transactions on Image Processing)
Full Text Available
Learning to Refine 3D Human Pose Sequences

Mei, Jieru; Chen, Xingyu; Wang, Chunyu; Yuille, Alan; Lan, Xuguang; Zeng, Wenjun (January 2019, 3DV)

We present a basis approach to refine noisy 3D human pose sequences by jointly projecting them onto a non-linear pose manifold, which is represented by a number of basis dictionaries, each covering a small manifold region. We learn the dictionaries by jointly minimizing the distance between the original poses and their projections onto the dictionaries, along with the temporal jitter of the projected poses. During testing, given a sequence of noisy poses that are probably off the manifold, we project them onto the manifold using the same strategy as in training. We apply our approach to monocular 3D pose estimation and long-term motion prediction. The experimental results on the benchmark dataset show that the estimated 3D poses are notably improved in both tasks. In particular, the smoothness constraint helps generate more robust refinement results even when some poses in the original sequence have large errors.
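The idea of projecting a pose sequence onto a basis while penalizing temporal jitter can be sketched in a heavily simplified form. The assumptions here are not taken from the paper: a single fixed orthonormal basis replaces the learned local dictionaries, and jitter is measured as the squared difference between consecutive frames. With an orthonormal basis, the jitter of the projected poses equals the jitter of their coefficients, so each coefficient trajectory can be smoothed independently by solving a tridiagonal system. The names `smooth_coefficients`, `refine`, and `lam` are hypothetical.

```python
from typing import List

Vector = List[float]


def smooth_coefficients(values: Vector, lam: float) -> Vector:
    """Minimise sum_t (c_t - values_t)^2 + lam * sum_t (c_{t+1} - c_t)^2.

    The optimum solves (I + lam * D^T D) c = values, where D is the
    first-difference operator. The matrix is tridiagonal, so the Thomas
    algorithm solves it in O(T).
    """
    n = len(values)
    if n <= 1 or lam == 0.0:
        return list(values)
    diag = [1.0 + 2.0 * lam] * n
    diag[0] = diag[-1] = 1.0 + lam
    off = -lam  # constant sub- and super-diagonal entry
    # Forward elimination.
    c_prime = [0.0] * n
    d_prime = [0.0] * n
    c_prime[0] = off / diag[0]
    d_prime[0] = values[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - off * c_prime[i - 1]
        c_prime[i] = off / denom
        d_prime[i] = (values[i] - off * d_prime[i - 1]) / denom
    # Back substitution.
    out = [0.0] * n
    out[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):
        out[i] = d_prime[i] - c_prime[i] * out[i + 1]
    return out


def refine(poses: List[Vector], basis: List[Vector], lam: float) -> List[Vector]:
    """Project poses onto an orthonormal basis with temporal smoothing."""
    num_basis, dim = len(basis), len(basis[0])
    # coeffs[t][k] = <pose_t, basis_k>
    coeffs = [[sum(p * b for p, b in zip(pose, vec)) for vec in basis]
              for pose in poses]
    # Smooth each coefficient trajectory over time.
    smoothed = [smooth_coefficients([row[k] for row in coeffs], lam)
                for k in range(num_basis)]
    # Reconstruct the refined poses from the smoothed coefficients.
    return [[sum(smoothed[k][t] * basis[k][j] for k in range(num_basis))
             for j in range(dim)]
            for t in range(len(poses))]


# Toy data: the "manifold" is the x-y plane; z holds off-manifold noise.
basis = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
poses = [[0.0, 0.0, 0.3], [1.0, 0.0, -0.2], [0.0, 0.0, 0.1]]
refined = refine(poses, basis, lam=1.0)
print(refined)
```

In this toy run the off-plane z component is removed and the spike in x is spread across neighbouring frames, which mirrors, in miniature, how a smoothness term can limit the effect of a single badly estimated pose.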
Full Text Available
