Blind Users Accessing Their Training Images in Teachable Object Recognizers

Hong, Jonggi; Gandhi, Jaina; Essuah Mensah, Ernest; Zeraati, Farnaz Z.; Jarjue, Ebrima H.; Lee, K.; Kacorri, Hernisa

doi:10.1145/3517428.3544824

Citation Details

Blind Users Accessing Their Training Images in Teachable Object Recognizers

Teachable object recognizers provide a solution for a very practical need for blind people – instance level object recognition. They assume one can visually inspect the photos they provide for training, a critical and inaccessible step for those who are blind. In this work, we engineer data descriptors that address this challenge. They indicate in real time whether the object in the photo is cropped or too small, a hand is included, the photos is blurred, and how much photos vary from each other. Our descriptors are built into open source testbed iOS app, called MYCam. In a remote user study in (N = 12) blind participants’ homes, we show how descriptors, even when error-prone, support experimentation and have a positive impact in the quality of training set that can translate to model performance though this gain is not uniform. Participants found the app simple to use indicating that they could effectively train it and that the descriptors were useful. However, many found the training being tedious, opening discussions around the need for balance between information, time, and cognitive load. more »

Award ID(s):: 1816380

NSF-PAR ID:: 10344780

Author(s) / Creator(s):: Hong, Jonggi; Gandhi, Jaina; Essuah Mensah, Ernest; Zeraati, Farnaz Z.; Jarjue, Ebrima H.; Lee, K.; Kacorri, Hernisa

Date Published:: 2022-10-01

Journal Name:: ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3517428.3544824

More Like this