Search for: All records

Creators/Authors contains: "Pan, Tai-Yu"

Learning with Free Object Segments for Long-Tailed Instance Segmentation

Zhang, Cheng ; Pan, Tai-Yu ; Chen, Tianle ; Zhong, Jike ; Fu, Wenjin ; Chao, Wei-Lun ( October 2022 , European Conference on Computer Vision (ECCV))

Full Text Available
Learning with Free Object Segments for Long-Tailed Instance Segmentation

Zhang, Cheng ; Pan, Tai-Yu ; Chen, Tianle ; Zhong, Jike ; Fu, Wenjin ; Chao, Wei-Lun ( October 2022 , European Conference on Computer Vision)

One fundamental challenge in building an instance segmentation model for a large number of classes in complex scenes is the lack of training examples, especially for rare objects. In this paper, we explore the possibility to increase the training examples without laborious data collection and annotation. We find that an abundance of instance segments can potentially be obtained freely from object-centric images, according to two insights: (i) an object-centric image usually contains one salient object in a simple background; (ii) objects from the same class often share similar appearances or similar contrasts to the background. Motivated by these insights, we propose a simple and scalable framework FreeSeg for extracting and leveraging these “free” object foreground segments to facilitate model training in long-tailed instance segmentation. Concretely, we investigate the similarity among object-centric images of the same class to propose candidate segments of foreground instances, followed by a novel ranking of segment quality. The resulting high-quality object segments can then be used to augment the existing long-tailed datasets, e.g., by copying and pasting the segments onto the original training images. Extensive experiments show that FreeSeg yields substantial improvements on top of strong baselines and achieves state-of-the-art accuracy for segmenting rare object categories. Our code is publicly available at https://github.com/czhang0528/FreeSeg.
Full Text Available
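
The copy-and-paste augmentation mentioned in the abstract above can be illustrated with a minimal sketch. This is not the FreeSeg implementation (see the authors' repository for that); it only shows the final pasting step, assuming a segment and its binary mask have already been extracted. The function name `paste_segment`, its parameters, and the nested-list image representation are all hypothetical choices made for illustration.

```python
def paste_segment(image, segment, mask, top, left):
    """Return a copy of `image` with `segment` pasted where `mask` is truthy.

    image:   2D list of pixel values (the training image), assumed rectangular
    segment: 2D list of pixel values (the object crop)
    mask:    2D list of 0/1 flags with the same shape as `segment`
    top, left: position of the crop's upper-left corner in `image`
    All names here are illustrative assumptions, not part of FreeSeg.
    """
    out = [row[:] for row in image]  # copy so the input image is not modified
    height = len(out)
    width = len(out[0]) if out else 0
    for i, mask_row in enumerate(mask):
        for j, flag in enumerate(mask_row):
            y, x = top + i, left + j
            # Only overwrite foreground pixels that fall inside the image.
            if flag and 0 <= y < height and 0 <= x < width:
                out[y][x] = segment[i][j]
    return out


background = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
crop = [[5, 6], [7, 8]]
crop_mask = [[1, 0], [1, 1]]
augmented = paste_segment(background, crop, crop_mask, top=1, left=1)
```

In a real pipeline the pasted segment's mask and class label would also be added to the image's annotations; that bookkeeping is omitted here.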
One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones

https://doi.org/10.1109/CVPR52688.2022.01504

Song, Chan Hee ; Kil, Jihyung ; Pan, Tai-Yu ; Sadler, Brian M. ; Chao, Wei-Lun ; Su, Yu ( June 2022 , Conference on Computer Vision and Pattern Recognition)

Full Text Available
Learning with Free Object Segments for Long-Tailed Instance Segmentation

Zhang, Cheng ; Pan, Tai-Yu ; Chen, Tianle ; Zhong, Jike ; Fu, Wenjin ; and Chao, Wei-Lun ( January 2022 , L3D-IVU: Workshop on Learning with Limited Labeled Data for Image and Video Understanding, in conjunction with the IEEE / CVF Computer Vision and Pattern Recognition Conference)

In this paper, we explore the possibility to increase the training examples without laborious data collection and annotation for long-tailed instance segmentation. We find that an abundance of instance segments can potentially be obtained freely from object-centric images, according to two insights: (i) an object-centric image usually contains one salient object in a simple background; (ii) objects from the same class often share similar appearances or similar contrasts to the background. Motivated by these insights, we propose a simple and scalable framework FREESEG for extracting and leveraging these “free” object segments to facilitate model training. Concretely, we investigate the similarity among object-centric images of the same class to propose candidate segments of foreground instances, followed by a novel ranking of segment quality. The resulting high quality object segments can then be used to augment the existing long-tailed datasets, e.g., by copying and pasting the segments onto the original training images. Extensive experiments show that FREESEG yields substantial improvements on top of strong baselines and achieves state-of-the-art accuracy for segmenting rare object categories.
Full Text Available
One Step at a Time: Long-Horizon Vision-and-Language Navigation With Milestones

Song, Chan Hee ; Kil, Jihyung ; Pan, Tai-Yu ; Sadler, Brian M. ; Chao, Wei-Lun ; and Su, Yu ( January 2022 , IEEE / CVF Computer Vision and Pattern Recognition Conference)

We study the problem of developing autonomous agents that can follow human instructions to infer and perform a sequence of actions to complete the underlying task. Significant progress has been made in recent years, especially for tasks with short horizons. However, when it comes to long-horizon tasks with extended sequences of actions, an agent can easily ignore some instructions or get stuck in the middle of the long instructions and eventually fail the task. To address this challenge, we propose a model-agnostic milestone-based task tracker (M-TRACK) to guide the agent and monitor its progress. Specifically, we propose a milestone builder that tags the instructions with navigation and interaction milestones which the agent needs to complete step by step, and a milestone checker that systemically checks the agent’s progress in its current milestone and determines when to proceed to the next. On the challenging ALFRED dataset, our M-TRACK leads to a notable 33% and 52% relative improvement in unseen success rate over two competitive base models.
Full Text Available
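
The step-by-step milestone tracking described in the abstract above can be sketched as a small state machine. This is only an illustration under stated assumptions, not the M-TRACK implementation: the class `MilestoneTracker`, the `is_reached` callback, and the string milestones are hypothetical, and the paper's milestone builder and checker are learned components that a toy predicate merely stands in for.

```python
class MilestoneTracker:
    """Holds an ordered list of milestones and advances one at a time."""

    def __init__(self, milestones):
        self.milestones = list(milestones)
        self.index = 0  # position of the milestone currently being pursued

    @property
    def current(self):
        """The active milestone, or None once all are completed."""
        if self.index < len(self.milestones):
            return self.milestones[self.index]
        return None

    def update(self, observation, is_reached):
        """Advance if `is_reached(current, observation)` is true.

        `is_reached` is a hypothetical stand-in for a milestone checker.
        Returns the milestone that is active after the update.
        """
        if self.current is not None and is_reached(self.current, observation):
            self.index += 1
        return self.current

    def done(self):
        return self.index >= len(self.milestones)


# Toy checker: a milestone counts as reached when the observation names it.
def reached(milestone, observation):
    return milestone == observation


tracker = MilestoneTracker(["go to fridge", "open fridge"])
first = tracker.update("looking around", reached)
second = tracker.update("go to fridge", reached)
third = tracker.update("open fridge", reached)
```

Because the tracker only exposes the current milestone, the agent cannot skip ahead in the instruction, which is the behavior the abstract attributes to tracking progress milestone by milestone.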
On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

Pan, Tai-Yu ; Zhang, Cheng ; Li, Yandong ; Hu, Hexiang ; Xuan, Dong ; Changpinyo, Soravit ; Gong, Boqing ; Chao, Wei-Lun ( December 2021 , Conference on Neural Information Processing Systems)

Full Text Available
On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

Pan, Tai-Yu ; Zhang, Cheng ; Li, Yandong ; Hu, Hexiang ; Xuan, Dong ; Changpinyo, Soravit ; Gong, Boqing ; Chao, Wei-Lun ( January 2021 , Advances in neural information processing systems)

Vanilla models for object detection and instance segmentation suffer from the heavy bias toward detecting frequent objects in the long-tailed setting. Existing methods address this issue mostly during training, e.g., by re-sampling or re-weighting. In this paper, we investigate a largely overlooked approach: post-processing calibration of confidence scores. We propose NORCAL, Normalized Calibration for long-tailed object detection and instance segmentation, a simple and straightforward recipe that reweighs the predicted scores of each class by its training sample size. We show that separately handling the background class and normalizing the scores over classes for each proposal are keys to achieving superior performance. On the LVIS dataset, NORCAL can effectively improve nearly all the baseline models not only on rare classes but also on common and frequent classes. Finally, we conduct extensive analysis and ablation studies to offer insights into various modeling choices and mechanisms of our approach. Our code is publicly available at https://github.com/tydpan/NorCal.
Full Text Available
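
The calibration recipe described in the abstract above (reweigh each class score by its training sample size, keep the background class separate, then normalize per proposal) can be sketched as follows. This is a minimal illustration, not the code from the authors' repository: the function name `calibrate_scores`, the power-law factor `count ** gamma`, and the `gamma` parameter are assumptions made for this sketch.

```python
def calibrate_scores(fg_scores, bg_score, class_counts, gamma=1.0):
    """Reweigh one proposal's foreground scores by class frequency.

    fg_scores:    predicted scores for each foreground class
    bg_score:     predicted background score (handled separately, not rescaled)
    class_counts: number of training samples for each foreground class
    gamma:        assumed strength of the reweighing; 0 disables it
    """
    if len(fg_scores) != len(class_counts):
        raise ValueError("fg_scores and class_counts must have the same length")
    # Divide each foreground score by a factor that grows with class frequency.
    scaled = [s / (n ** gamma) for s, n in zip(fg_scores, class_counts)]
    # Normalize over all classes, with the background score left unscaled.
    denom = sum(scaled) + bg_score
    return [s / denom for s in scaled]


# One proposal, two classes: a frequent class (1000 samples) and a rare one (10).
calibrated = calibrate_scores([0.6, 0.1], bg_score=0.3, class_counts=[1000, 10])
unchanged = calibrate_scores([0.6, 0.1], bg_score=0.3, class_counts=[1000, 10], gamma=0.0)
```

In this example the rare class ends up with the higher calibrated score even though its raw score was lower, which is the intended effect of down-weighting frequent classes at inference time.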