People learning American Sign Language (ASL) and practicing their comprehension skills will often encounter complex ASL videos that may contain unfamiliar signs. Existing dictionary tools require users to isolate a single unknown sign before initiating a search by selecting linguistic properties or performing the sign in front of a webcam. This process is challenging: extracting and reproducing unfamiliar signs is difficult, the video-watching experience is disrupted, and learners must rely on external dictionaries. We explore a technology that allows users to select and view dictionary results for one or more unfamiliar signs while watching a video. We interviewed 14 ASL learners to understand their challenges in understanding ASL videos, their strategies for dealing with unfamiliar vocabulary, and their expectations for an in situ dictionary system. We then conducted an in-depth analysis with eight learners to examine their interactions with a Wizard-of-Oz prototype during a video comprehension task. Finally, we conducted a comparative study with six additional ASL learners to evaluate the speed, accuracy, and workload benefits of an embedded dictionary-search feature within a video player. Our tool outperformed a baseline, an existing online dictionary, across all three metrics. The integration of a search tool and span selection offered advantages for video comprehension. Our findings have implications for designers, computer vision researchers, and sign language educators.
ASL Citizen: A Community-Sourced Dataset for Advancing Isolated Sign Language Recognition
Sign languages are used as a primary language by approximately 70 million D/deaf people worldwide. However, most communication technologies operate in spoken and written languages, creating inequities in access. To help tackle this problem, we release ASL Citizen, the first crowdsourced Isolated Sign Language Recognition (ISLR) dataset, collected with consent and containing 83,399 videos for 2,731 distinct signs filmed by 52 signers in a variety of environments. We propose that this dataset be used for sign language dictionary retrieval for American Sign Language (ASL), where a user demonstrates a sign to their webcam to retrieve matching signs from a dictionary. Through our generalizable baselines, we show that training supervised machine learning classifiers with our dataset achieves competitive performance on metrics relevant for dictionary retrieval, with 63% accuracy and a recall-at-10 of 91%, evaluated entirely on videos of users who are not present in the training or validation sets.
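The dictionary-retrieval metrics quoted above (accuracy and recall-at-10) are standard top-k statistics over a classifier's scores for the 2,731-sign vocabulary. The sketch below is a minimal illustration of how such metrics could be computed; it is not the authors' evaluation code, and all function and variable names are hypothetical.

```python
import numpy as np

def retrieval_metrics(scores, labels, k=10):
    """Top-1 accuracy and recall-at-k for dictionary retrieval.

    scores: (num_videos, num_signs) array of classifier scores,
            one row per query video, one column per dictionary sign.
    labels: (num_videos,) array holding the index of the correct sign.
    """
    # Rank dictionary signs from highest to lowest score for each query.
    ranked = np.argsort(-scores, axis=1)
    top1 = ranked[:, 0] == labels
    # Recall-at-k: the correct sign appears anywhere in the top k results.
    topk = (ranked[:, :k] == labels[:, None]).any(axis=1)
    return top1.mean(), topk.mean()

# Toy usage with random scores over a 2,731-sign vocabulary.
rng = np.random.default_rng(0)
scores = rng.normal(size=(100, 2731))
labels = rng.integers(0, 2731, size=100)
accuracy, recall_at_10 = retrieval_metrics(scores, labels, k=10)
print(f"accuracy={accuracy:.2f}, recall@10={recall_at_10:.2f}")
```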
- Award ID(s): 2234787
- PAR ID: 10535436
- Publisher / Repository: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks
- Date Published:
- Format(s): Medium: X
- Location: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks
- Sponsoring Org: National Science Foundation
More Like this
- In this paper, we propose a machine learning-based multi-stream framework to recognize American Sign Language (ASL) manual signs and nonmanual gestures (face and head movements) in real time from RGB-D videos. Our approach is based on 3D Convolutional Neural Networks (3D CNNs), fusing multi-modal features including hand gestures, facial expressions, and body poses from multiple channels (RGB, Depth, Motion, and Skeleton joints). To learn the overall temporal dynamics in a video, a proxy video is generated by selecting a subset of frames for each video, which are then used to train the proposed 3D CNN model (a toy sketch of this frame-selection step appears after this list). We collected a new ASL dataset, ASL-100-RGBD, which contains 42 RGB-D videos captured by a Microsoft Kinect V2 camera. Each video consists of 100 ASL manual signs, along with the RGB channel, depth maps, skeleton joints, face features, and HD face. The dataset is fully annotated for each semantic region (i.e., the time duration of each sign that the human signer performs). Our proposed method achieves 92.88% accuracy for recognizing 100 ASL sign glosses in our newly collected ASL-100-RGBD dataset. The effectiveness of our framework for recognizing hand gestures from RGB-D videos is further demonstrated on the large-scale ChaLearn IsoGD dataset, achieving state-of-the-art results.
- Sign words are the building blocks of any sign language. In this work, we present wSignGen, a word-conditioned 3D American Sign Language (ASL) generation model dedicated to synthesizing realistic and grammatically accurate motion sequences for sign words. Our approach leverages a transformer-based diffusion model trained on a curated dataset of 3D motion meshes from word-level ASL videos. By integrating CLIP, wSignGen offers two advantages: image-based generation, which is particularly useful for children who are learning sign language but cannot yet read, and the ability to generalize to unseen synonyms (a sketch of the shared CLIP embedding space behind both advantages appears after this list). Experiments demonstrate that wSignGen significantly outperforms the baseline model on the sign-word generation task. Moreover, human evaluation shows that wSignGen can generate high-quality, grammatically correct ASL signs that are effectively conveyed through 3D avatars.
- Picture-naming tasks provide critical data for theories of lexical representation and retrieval and have been performed successfully in sign languages. However, the specific influences of lexical or phonological factors and stimulus properties on sign retrieval are poorly understood. To examine lexical retrieval in American Sign Language (ASL), we conducted a timed picture-naming study using 524 pictures (272 objects and 251 actions). We also compared ASL naming with previous data for spoken English for a subset of 425 pictures. Deaf ASL signers named object pictures faster and more consistently than action pictures, as previously reported for English speakers. Lexical frequency, iconicity, better name agreement, and lower phonological complexity each facilitated naming reaction times (RTs). RTs were also faster for pictures named with shorter signs (measured by average response duration). Target name agreement was higher for pictures with more iconic and shorter ASL names. The visual complexity of pictures slowed RTs and decreased target name agreement (a toy regression illustrating this kind of item-level analysis appears after this list). RTs and target name agreement were correlated for ASL and English, but agreement was lower for ASL, possibly due to the English bias of the pictures. RTs were faster for ASL, which we attributed to a smaller lexicon. Overall, the results suggest that models of lexical retrieval developed for spoken languages can be adopted for signed languages, with the exception that iconicity should be included as a factor. The open-source picture-naming data set for ASL serves as an important, first-of-its-kind resource for researchers, educators, or clinicians for a variety of research, instructional, or assessment purposes.
- Efthimiou, Eleni; Fotinea, Stavroula-Evita; Hanke, Thomas; Hochgesang, Julie A.; Mesch, Johanna; Schulder, Marc (Eds.) We propose a multimodal network using skeletons and handshapes as input to recognize individual signs and detect their boundaries in American Sign Language (ASL) videos. Our method integrates a spatio-temporal Graph Convolutional Network (GCN) architecture to estimate human skeleton keypoints; it uses a late-fusion approach for both forward and backward processing of video streams (a minimal sketch of score-level late fusion appears after this list). Our core method is designed to extract and analyze features from ASL videos to enhance the accuracy and efficiency of recognizing individual signs. A gating module based on per-channel multi-layer convolutions is employed to evaluate significant frames for recognition of isolated signs. Additionally, an auxiliary multimodal branch network, integrated with a transformer, is designed to estimate the linguistic start and end frames of an isolated sign within a video clip. We evaluated the performance of our approach on multiple datasets that include isolated, citation-form signs and signs pre-segmented from continuous signing based on linguistic annotations of the start and end points of signs within sentences. We achieved very promising results when both types of sign videos were combined for training, with overall sign recognition accuracy of 80.8% Top-1 and 95.2% Top-5 for citation-form signs, and 80.4% Top-1 and 93.0% Top-5 for signs pre-segmented from continuous signing.
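For the multi-stream 3D CNN paper above, the proxy video is described as a fixed subset of frames chosen to capture a clip's overall temporal dynamics. Below is a minimal sketch of one plausible frame-selection strategy, uniform sampling; the paper may select frames differently, and all names are hypothetical.

```python
import numpy as np

def make_proxy_video(frames, num_proxy_frames=32):
    """Pick a fixed-length subset of frames to summarize a variable-length clip.

    frames: array of shape (T, H, W, C) for one stream (e.g., RGB or depth).
    Returns an array of shape (num_proxy_frames, H, W, C).
    """
    T = frames.shape[0]
    # Evenly spaced indices across the clip preserve its overall temporal dynamics.
    idx = np.linspace(0, T - 1, num_proxy_frames).round().astype(int)
    return frames[idx]
```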
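For wSignGen, the abstract credits CLIP integration with enabling both image-based prompts and generalization to unseen synonyms. The sketch below only illustrates the shared text/image embedding space that makes this plausible, using the public openai/clip-vit-base-patch32 checkpoint via the Hugging Face transformers library; it is not the authors' conditioning pipeline.

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor  # assumed dependency

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

picture = Image.new("RGB", (224, 224))  # placeholder for a real picture of the concept
inputs = processor(text=["book"], images=picture, return_tensors="pt", padding=True)

with torch.no_grad():
    # A written word and a picture land in the same embedding space, so either
    # can serve as the conditioning vector, and synonyms land near one another.
    text_vec = model.get_text_features(input_ids=inputs["input_ids"],
                                       attention_mask=inputs["attention_mask"])
    image_vec = model.get_image_features(pixel_values=inputs["pixel_values"])

print(text_vec.shape, image_vec.shape)  # both (1, 512) for this checkpoint
```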
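The picture-naming study relates naming reaction times (RTs) to lexical and stimulus properties. The toy regression below, run on simulated data, illustrates that style of item-level analysis; it is not the authors' dataset or modeling pipeline.

```python
import numpy as np

# Simulated item-level data: one row per picture (not the study's data).
rng = np.random.default_rng(1)
n = 200
freq, iconicity, phon_complexity, vis_complexity = rng.normal(size=(4, n))
# RTs are simulated so that frequency and iconicity speed naming while
# phonological and visual complexity slow it, mirroring the reported directions.
rt = (900 - 40 * freq - 25 * iconicity
      + 30 * phon_complexity + 20 * vis_complexity
      + rng.normal(0, 50, n))

# Ordinary least squares fit of RT on the four predictors.
X = np.column_stack([np.ones(n), freq, iconicity, phon_complexity, vis_complexity])
coef, *_ = np.linalg.lstsq(X, rt, rcond=None)
print(dict(zip(["intercept", "freq", "iconicity", "phon", "visual"], coef.round(1))))
```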
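The last item describes a late-fusion approach over forward and backward processing of the video stream. A minimal sketch of score-level late fusion under that reading is shown below; the actual system is a spatio-temporal GCN with gating and a transformer branch, and score_fn is a hypothetical stand-in for it.

```python
import numpy as np

def late_fusion(score_fn, frames, weights=(0.5, 0.5)):
    """Average per-class scores from forward and time-reversed passes over a clip.

    score_fn: callable mapping a (T, ...) frame or skeleton sequence to per-class scores.
    """
    forward_scores = score_fn(frames)
    backward_scores = score_fn(frames[::-1])  # same model applied to the reversed stream
    return weights[0] * forward_scores + weights[1] * backward_scores

# The recognized sign would be the argmax of the fused scores:
# predicted_sign = late_fusion(model_scores, clip).argmax()
```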