NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DeltaStream: 2D-Inferred Delta Encoding for Live Volumetric Video Streaming

https://doi.org/10.1145/3711875.3729131

Lee, Hojeong; Kim, Yu Hong; Ryu, Sangwoo; Hong, James Won-Ki; Ha, Sangtae; Kim, Seyeon (June 2025, ACM)

Free, publicly-accessible full text available June 23, 2026
An automated Cryo-EM computational environment on the HPC system using Pegasus WMS

https://doi.org/10.1109/WORKS56498.2022.00013

Osinski, Tomasz; Rynge, Mats; Hong, James K.; Vahi, Karan; Chu, Ruilin; Sul, Cesar; Deelman, Ewa; Kim, Byoung-Do (November 2022, 2022 IEEE/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS))

Full Text Available
Spotting Temporally Precise, Fine-Grained Events in Video

https://doi.org/10.1007/978-3-031-19833-5_3

Hong, James; Zhang, Haotian; Gharbi, Michaël; Fisher, Matthew; Fatahalian, Kayvon (January 2022, Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel)

We introduce the task of spotting temporally precise, fine-grained events in video (detecting the precise moment in time events occur). Precise spotting requires models to reason globally about the full-time scale of actions and locally to identify subtle frame-to-frame appearance and motion differences that identify events during these actions. Surprisingly, we find that top performing solutions to prior video understanding tasks such as action detection and segmentation do not simultaneously meet both requirements. In response, we propose E2E-Spot, a compact, end-to-end model that performs well on the precise spotting task and can be trained quickly on a single GPU. We demonstrate that E2E-Spot significantly outperforms recent baselines adapted from the video action detection, segmentation, and spotting literature to the precise spotting task. Finally, we contribute new annotations and splits to several fine-grained sports action datasets to make these datasets suitable for future work on precise spotting.
more » « less
Full Text Available
Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition

https://doi.org/10.1109/ICCV48922.2021.00912

Hong, James; Fisher, Matthew; Gharbi, Michael; Fatahalian, Kayvon (October 2021, International Conference on Computer Vision (ICCV))

Full Text Available
Analysis of Faces in a Decade of US Cable TV News

https://doi.org/10.1145/3447548.3467134

Hong, James; Crichton, Will; Zhang, Haotian; Fu, Daniel Y.; Ritchie, Jacob; Barenholtz, Jeremy; Hannel, Ben; Yao, Xinwei; Murray, Michaela; Moriba, Geraldine; et al (August 2021, KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining)

Full Text Available
Learning in situ: a randomized experiment in video streaming

Yan, Francis Y.; Ayers, Hudson; Zhu, Chenzhi; Fouladi, Sadjad; Hong, James; Zhang, Keyi; Levis, Philip; Winstein, Keith (February 2020, 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI '20))

We describe the results of a randomized controlled trial of video-streaming algorithms for bitrate selection and network prediction. Over the last year, we have streamed 38.6 years of video to 63,508 users across the Internet. Sessions are randomized in blinded fashion among algorithms. We found that in this real-world setting, it is difficult for sophisticated or machine-learned control schemes to outperform a "simple" scheme (buffer-based control), notwithstanding good performance in network emulators or simulators. We performed a statistical analysis and found that the heavy-tailed nature of network and user behavior, as well as the challenges of emulating diverse Internet paths during training, present obstacles for learned algorithms in this setting. We then developed an ABR algorithm that robustly outperformed other schemes, by leveraging data from its deployment and limiting the scope of machine learning only to making predictions that can be checked soon after. The system uses supervised learning in situ, with data from the real deployment environment, to train a probabilistic predictor of upcoming chunk transmission times. This module then informs a classical control policy (model predictive control). To support further investigation, we are publishing an archive of data and results each week, and will open our ongoing study to the community. We welcome other researchers to use this platform to develop and validate new algorithms for bitrate selection, network prediction, and congestion control.
more » « less
Full Text Available

Search for: All records