NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MAC-U-Vision+: An Improved Application for Individuals with AMD

Cen, Wilmer_Chang; Li, Haolan; Sehaumpai, Max; Seiple, William; Zhu, Zhigang (May 2025, Journal on Technology & Persons with Disabilities)

Free, publicly-accessible full text available May 8, 2026
SMDAF: A Scalable Sidewalk Material Data Acquisition Framework with Bidirectional Cross-Modal Knowledge Distillation

https://doi.org/10.1109/WACV61041.2025.00295

Liu, Jiawei; Lam, Wayne; Zhu, Zhigang; Tang, Hao (February 2025, IEEE)

Free, publicly-accessible full text available February 26, 2026
Medical Image Denosing via Explainable AI Feature Preserving Loss

https://doi.org/10.1007/978-3-031-82475-3_2

Dong, Guanfang; Basu, Anup (January 2025, Springer Nature Switzerland)

Full Text Available
Enhancing Virtual Mobility for Individuals Who Are Blind or Have Low Vision: A Stationary Exploration Method

https://doi.org/10.1145/3696762.3698058

Zhao, Hong; Oyekoya, Oyewole; Tang, Hao (October 2024, ACM)

Full Text Available
User-Centric Crowdsourcing Approach to Improve Urban Accessibility Data Collection

https://doi.org/10.1109/URTC65039.2024.10937632

Ortiz, Tyler; Tang, Vicky; Sutton, Karla (October 2024, IEEE)

Full Text Available
Mapping Urban Obstacles: Improving Route Accessibility for Blind and Low-Vision Pedestrians

https://doi.org/10.1109/URTC65039.2024.10937619

Tang, Victor; Liu, Jiawei (October 2024, IEEE)

Full Text Available
Absolute-ROMP: Recovering Multi-person 3D Poses and Shapes with Absolute Scales from a Single RGB Image

Abdulrahman, B; Zhu, Z (August 2024, springer)

One of the grand challenges in computer vision is to recover 3D poses and shapes of multiple human bodies with absolute scales from a single RGB image. The challenge stems from the inherent depth and scale ambiguity from a single view. The state of the art on 3D human pose and shape estimation mainly focuses on estimating the 3D joint locations relative to the root joint, defined as the pelvis joint. In this paper, a novel approach called Absolute-ROMP is proposed, which builds upon a one-stage multi-person 3D mesh predictor network, ROMP, to estimate multi-person 3D poses and shapes, but with absolute scales from a single RGB image. To achieve this, we introduce absolute root joint localization in the camera coordinate frame, which enables the estimation of 3D mesh coordinates of all persons in the image and their root joint locations normalized by the focal point. Moreover, a CNN and transformer hybrid network, called TransFocal, is proposed to predict the focal length of the image’s camera. This enables Absolute-ROMP to obtain absolute depth information of all joints in the camera coordinate frame, further improving the accuracy of our proposed method. The Absolute-ROMP is evaluated on the root joint localization and root-relative 3D pose estimation tasks on publicly available multi-person 3D pose datasets, and TransFocal is evaluated on a dataset created from the Pano360 dataset. Our proposed approach achieves state-of-the-art results on these tasks, outperforming existing methods or has competitive performance. Due to its real-time performance, our method is applicable to in-the-wild images and videos.
more » « less
Full Text Available
GMC: A general framework of multi-stage context learning and utilization for visual detection tasks

https://doi.org/10.1016/j.cviu.2024.103944

Wang, Xuan; Tang, Hao; Zhu, Zhigang (April 2024, Computer Vision and Image Understanding)

Full Text Available
Surveying Sidewalk Materials for and by Individuals Who Are Blind or Have Low Vision: Audio Data Collection and Classification

Liu, J; Lam, W P; Zhu, Z; Tang, H (March 2024, International Conference on SMART MULTIMEDIA)

Navigating safely and independently presents considerable challenges for people who are blind or have low vision (BLV), as it re- quires a comprehensive understanding of their neighborhood environments. Our user study reveals that understanding sidewalk materials and objects on the sidewalks plays a crucial role in navigation tasks. This paper presents a pioneering study in the field of navigational aids for BLV individuals. We investigate the feasibility of using auditory data, specifically the sounds produced by cane tips against various sidewalk materials, to achieve material identification. Our approach utilizes ma- chine learning and deep learning techniques to classify sidewalk materials solely based on audio cues, marking a significant step towards empowering BLV individuals with greater autonomy in their navigation. This study contributes in two major ways: Firstly, a lightweight and practical method is developed for volunteers or BLV individuals to autonomously collect auditory data of sidewalk materials using a microphone-equipped white cane. This innovative approach transforms routine cane usage into an effective data-collection tool. Secondly, a deep learning-based classifier algorithm is designed that leverages a dual architecture to enhance audio feature extraction. This includes a pre-trained Convolutional Neural Network (CNN) for regional feature extraction from two-dimensional Mel-spectrograms and a booster module for global feature enrichment. Experimental results indicate that the optimal model achieves an accuracy of 80.96% using audio data only, which can effectively recognize sidewalk materials.
more » « less
Full Text Available
Surveying Sidewalk Materials for and by Individuals Who Are Blind or Have Low Vision: Audio Data Collection and Classification

Liu, J; Lam, W P; Zhu, Z; Tang, H (March 2024, International Conference on SMART MULTIMEDIA)

Navigating safely and independently presents considerable challenges for people who are blind or have low vision (BLV), as it re- quires a comprehensive understanding of their neighborhood environments. Our user study reveals that understanding sidewalk materials and objects on the sidewalks plays a crucial role in navigation tasks. This paper presents a pioneering study in the field of navigational aids for BLV individuals. We investigate the feasibility of using auditory data, specifically the sounds produced by cane tips against various sidewalk materials, to achieve material identification. Our approach utilizes ma- chine learning and deep learning techniques to classify sidewalk materials solely based on audio cues, marking a significant step towards empowering BLV individuals with greater autonomy in their navigation. This study contributes in two major ways: Firstly, a lightweight and practical method is developed for volunteers or BLV individuals to autonomously collect auditory data of sidewalk materials using a microphone-equipped white cane. This innovative approach transforms routine cane usage into an effective data-collection tool. Secondly, a deep learning-based classifier algorithm is designed that leverages a dual architecture to enhance audio feature extraction. This includes a pre-trained Convolutional Neural Network (CNN) for regional feature extraction from two-dimensional Mel-spectrograms and a booster module for global feature enrichment.
more » « less
Full Text Available

« Prev Next »

Search for: All records