Search for: All records

Creators/Authors contains: "Ramanan, D."

« Prev Next »

Total Resources

6

Resource Type
Conference Paper

6

Conference Proceeding

0

Dataset

0

Journal Article

0

Workshop Report

0

Availability
Full Text / Resource Available

4

Citation Only

2

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Streaming Motion Forecasting for Autonomous Driving

Pang, Z. ; Ramanan, D. ; Li, M. ; Wang, Y.-X. ( October 2023 , IEEE/RSJ International Conference on Intelligent Robots and Systems)

Free, publicly-accessible full text available October 1, 2024
Learning lightweight object detectors via progressive knowledge distillation

Cao, S. ; Li, M. ; Hays, J. ; Ramanan, D. ; Wang, Y.-X. ; Gui, L.-Y. ( July 2023 , International Conference on Machine Learning)

Free, publicly-accessible full text available July 23, 2024
Towards Long-Tailed 3D Detection

Peri, N. ; Dave, A. ; Ramanan, D. ; Kong, S. ( December 2022 , International Conference on Robot Learning)

Contemporary autonomous vehicle (AV) benchmarks have advanced techniques for training 3D detectors, particularly on large-scale lidar data. Surprisingly, although semantic class labels naturally follow a long-tailed distribution, contemporary benchmarks focus on only a few common classes (e.g., pedestrian and car) and neglect many rare classes in-the-tail (e.g., debris and stroller). However, AVs must still detect rare classes to ensure safe operation. Moreover, semantic classes are often organized within a hierarchy, e.g., tail classes such as child and construction-worker are arguably subclasses of pedestrian. However, such hierarchical relationships are often ignored, which may lead to misleading estimates of performance and missed opportunities for algorithmic innovation. We address these challenges by formally studying the problem of Long-Tailed 3D Detection (LT3D), which evaluates on all classes, including those in-the-tail. We evaluate and innovate upon popular 3D detection codebases, such as CenterPoint and PointPillars, adapting them for LT3D. We develop hierarchical losses that promote feature sharing across common-vs-rare classes, as well as improved detection metrics that award partial credit to "reasonable" mistakes respecting the hierarchy (e.g., mistaking a child for an adult). Finally, we point out that fine-grained tail class accuracy is particularly improved via multimodal fusion of RGB images with LiDAR; simply put, small fine-grained classes are challenging to identify from sparse (lidar) geometry alone, suggesting that multimodal cues are crucial to long-tailed 3D detection. Our modifications improve accuracy by 5% AP on average for all classes, and dramatically improve AP for rare classes (e.g., stroller AP improves from 3.6 to 31.6)! Our code is available at this https URL.
more » « less
Full Text Available
Learning lightweight object detectors via progressive knowledge distillation

Cao, S. ; Li, M. ; Hays, J. ; Ramanan, D. ; Wang, Y.-X. ; Gui L.-Y. ( January 2023 , International Conference on Machine Learning)

Full Text Available
Long-tailed recognition via weight balancing

Alshammari, S ; Wang, Y-W ; Ramanan, D ; Kong, S ( January 2022 , IEEE Conference on Computer Vision and Pattern Recognition)

Full Text Available
Unconstrained Face Detection and Open-Set Face Recognition Challenge

https://doi.org/10.1109/BTAS.2017.8272759

Gunther, M. ; Hu, P. ; Herrmann, C. ; Chan, C. H. ; Jiang, M. ; Yang, S. ; Dhamija, A. R. ; Ramanan, D. ; Beyerer, J. ; Kittler, J. ; et al ( October 2017 , International Joint Conference on Biometrics)

Face detection and recognition benchmarks have shifted toward more difficult environments. The challenge presented in this paper addresses the next step in the direction of automatic detection and identification of people from outdoor surveillance cameras. While face detection has shown remarkable success in images collected from the web, surveillance cameras include more diverse occlusions, poses, weather conditions and image blur. Although face verification or closed-set face identification have surpassed human capabilities on some datasets, open-set identification is much more complex as it needs to reject both unknown identities and false accepts from the face detector. We show that unconstrained face detection can approach high detection rates albeit with moderate false accept rates. By contrast, open-set face recognition is currently weak and requires much more attention.
more » « less
Full Text Available