Improving Label Noise Robustness with Data Augmentation and Semi-Supervised Learning (Student Abstract)

Nishi, Kento; Ding, Yi; Rich, Alex; Höllerer, Tobias

Citation Details

Modern machine learning algorithms typically require large amounts of labeled training data to fit a reliable model. To minimize the cost of data collection, researchers often employ techniques such as crowdsourcing and web scraping. However, web data and human annotations are known to exhibit high margins of error, resulting in sizable amounts of incorrect labels. Poorly labeled training data can cause models to overfit to the noise distribution, crippling performance in real-world applications. In this work, we investigate the viability of using data augmentation in conjunction with semi-supervised learning to improve the label noise robustness of image classification models. We conduct several experiments using noisy variants of the CIFAR-10 image classification dataset to benchmark our method against existing algorithms. Experimental results show that our augmentative SSL approach improves upon the state-of-the-art. more »

Award ID(s):: 1911230 1845587

PAR ID:: 10332461

Author(s) / Creator(s):: Nishi, Kento; Ding, Yi; Rich, Alex; Höllerer, Tobias

Date Published:: 2021-05-01

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 35

Issue:: 18

ISSN:: 2159-5399

Page Range / eLocation ID:: 15855-15856

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this