VideoSSL: Semi-Supervised Learning for Video Classiﬁcation

Jing, L; Parag, T; Wu, Z; Tian, Y; Wang, H.

Citation Details

We propose a semi-supervised learning approach for video classiﬁcation, VideoSSL, using convolutional neural networks (CNN). Like other computer vision tasks, existing supervised video classiﬁcation methods demand a large amount of labeled data to attain good performance. However, annotation of a large dataset is expensive and time consuming. To minimize the dependence on a large annotated dataset, our proposed semi-supervised method trains from a small number of labeled examples and exploits two regulatory signals from unlabeled data. The ﬁrst signal is the pseudo-labels of unlabeled examples computed from the conﬁdences of the CNN being trained. The other is the normalized probabilities, as predicted by an image classiﬁer CNN, that captures the information about appearances of the interesting objects in the video. We show that, under the supervision of these guiding signals from unlabeled examples, a video classiﬁcation CNN can achieve impressive performances utilizing a small fraction of annotated examples on three publicly available datasets: UCF101, HMDB51, and Kinetics. more »

Award ID(s):: 2041307

PAR ID:: 10279258

Author(s) / Creator(s):: Jing, L; Parag, T; Wu, Z; Tian, Y; Wang, H.

Date Published:: 2021-01-05

Journal Name:: Winter Conference on Applications of Computer Vision (WACV), 2021.

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this