

Search for: All records

Creators/Authors contains: "Wu, Mingyuan"


  1. Efficient single-instance segmentation is critical for unlocking features in on-the-fly mobile imaging applications such as photo capture and editing. Existing mobile solutions often restrict segmentation to portraits or salient objects due to computational constraints. Recent models such as the Segment Anything Model improve accuracy but remain computationally expensive on mobile devices because they process the entire image with heavy transformer backbones. To address this, we propose TraceNet, a one-click-driven single-instance segmentation model. TraceNet segments a user-specified instance by back-tracing the receptive field of a ConvNet backbone, focusing computation on the relevant region and thereby reducing inference cost and memory usage on device (a minimal sketch of the back-tracing idea follows this entry). Starting from user needs in real mobile applications, we define the efficient single-instance segmentation task and introduce two novel metrics that evaluate both accuracy and robustness to low-quality input clicks. Extensive evaluations on the MS-COCO and LVIS datasets highlight TraceNet's ability to generate high-quality instance masks efficiently and accurately while remaining robust to imperfect user inputs.
    Free, publicly-accessible full text available August 5, 2026
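    A minimal sketch, in Python, of the click-driven receptive-field back-tracing idea described above. The backbone specification, the analytic receptive-field formula, and the crop heuristic are illustrative assumptions, not TraceNet's published design:

    ```python
    # Sketch: map a user click to the input-space window that feeds the
    # feature cell under the click, so segmentation can run on a crop
    # instead of the full image. The layer specs below are hypothetical.
    from dataclasses import dataclass

    @dataclass
    class ConvSpec:
        kernel: int
        stride: int
        padding: int

    # Hypothetical backbone: three conv stages, each downsampling by 2.
    BACKBONE = [ConvSpec(7, 2, 3), ConvSpec(3, 2, 1), ConvSpec(3, 2, 1)]

    def receptive_field(layers):
        """Analytic receptive-field size and effective stride of a stack."""
        rf, jump = 1, 1
        for layer in layers:
            rf += (layer.kernel - 1) * jump
            jump *= layer.stride
        return rf, jump

    def backtrace_crop(click_xy, image_hw, layers):
        """Back-trace the feature cell under a click to its input window."""
        rf, jump = receptive_field(layers)
        x, y = click_xy
        h, w = image_hw
        # Snap the click to the center of its feature cell in input space.
        cx = (x // jump) * jump + jump // 2
        cy = (y // jump) * jump + jump // 2
        # Clamp the receptive-field window to the image bounds.
        half = rf // 2
        x0, x1 = max(0, cx - half), min(w, cx + half)
        y0, y1 = max(0, cy - half), min(h, cy + half)
        return x0, y0, x1, y1

    if __name__ == "__main__":
        # A click near the image center yields a small window to segment.
        print(backtrace_crop((320, 240), (480, 640), BACKBONE))
    ```

    In practice the window would presumably be grown until it covers the whole instance; the sketch only illustrates how a single click pins down which input region the backbone actually needs.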
  2. Free, publicly-accessible full text available November 4, 2025
  3. Bulterman, Dick; Kankanhalli, Mohan; Muehlhaeuser, Max; Persia, Fabio; Sheu, Philip; Tsai, Jeffrey (Eds.)
    The emergence of 360-video streaming systems has brought about new possibilities for immersive video experiences while requiring significantly higher bandwidth than traditional 2D video streaming. Viewport prediction is commonly used to address this problem, but interesting storylines outside the viewport are ignored. To address this limitation, we present SAVG360, a novel viewport guidance system that utilizes global content information available on the server side to enhance streaming with the best saliency-captured storyline of a 360-video. The saliency analysis is performed offline on the media server with a powerful GPU, and the resulting guidance information is encoded and shared with clients through the Saliency-aware Guidance Descriptor. This enables the system to proactively guide users to switch between storylines of the video, while a novel user interface lets them follow or break away from the guided storyline. Additionally, we present a viewing-mode prediction algorithm to enhance video delivery in SAVG360. Evaluation on user viewport traces in 360-videos demonstrates that SAVG360 outperforms existing tiled streaming solutions in overall viewport prediction accuracy and in the ability to stream high-quality 360-videos under bandwidth constraints. Furthermore, a user study highlights the advantages of our proactive guidance approach over approaches that merely predict and stream where users look (a sketch of a guidance descriptor in this spirit follows this entry).
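    A minimal sketch, in Python, of how a per-segment saliency-aware guidance descriptor might be built on the server and consumed on the client. Field names, the argmax reduction of the saliency grid, and the follow/break logic are assumptions for illustration and may differ from the paper's actual Saliency-aware Guidance Descriptor format:

    ```python
    # Sketch: reduce offline saliency analysis to one recommended viewport
    # center per media segment, then let the client follow or break away.
    from dataclasses import dataclass

    @dataclass
    class GuidanceEntry:
        segment: int      # media segment index
        yaw_deg: float    # recommended viewport center (computed offline
        pitch_deg: float  # from per-segment saliency maps on the server)

    def build_descriptor(saliency_grids):
        """Server side: take the argmax cell of each segment's coarse
        saliency grid (rows x cols, equirectangular) as the guided center."""
        descriptor = []
        for seg, grid in enumerate(saliency_grids):
            rows, cols = len(grid), len(grid[0])
            r, c = max(((r, c) for r in range(rows) for c in range(cols)),
                       key=lambda rc: grid[rc[0]][rc[1]])
            yaw = (c + 0.5) / cols * 360.0 - 180.0   # [-180, 180)
            pitch = 90.0 - (r + 0.5) / rows * 180.0  # [-90, 90]
            descriptor.append(GuidanceEntry(seg, yaw, pitch))
        return descriptor

    def next_viewport(entry, user_yaw, user_pitch, following):
        """Client side: snap to the guided storyline while the user
        follows it; respect free viewing once they break away."""
        if following:
            return entry.yaw_deg, entry.pitch_deg
        return user_yaw, user_pitch

    if __name__ == "__main__":
        grids = [[[0.1, 0.9], [0.2, 0.3]], [[0.4, 0.1], [0.8, 0.2]]]
        for entry in build_descriptor(grids):
            print(entry, "->", next_viewport(entry, 0.0, 0.0, True))
    ```

    A real descriptor would likely carry several candidate storylines per segment so the client can switch between them; a single center per segment keeps the sketch minimal.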