NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Reality Check of Positioning in Multiuser Mobile Augmented Reality: Measurement and Analysis

https://doi.org/10.1145/3551626.3564966

Wang, Na; Wang, Haoliang; Petrangeli, Stefano; Swaminathan, Viswanathan; Li, Fei; Chen, Songqing (December 2022, The 4th ACM International Conference on Multimedia in Asia)

Full Text Available
Towards Accurate Positioning in Multiuser Augmented Reality on Mobile Devices

https://doi.org/10.1109/ISM55400.2022.00048

Wang, Na; Wang, Haoliang; Petrangeli, Stefano; Swaminathan, Viswanathan; Li, Fei; Chen, Songqing (December 2022, IEEE International Symposium on Multimedia (ISM))

Full Text Available
Full UHD 360-Degree Video Dataset and Modeling of Rate-Distortion Characteristics and Head Movement Navigation

https://doi.org/10.1145/3458305.3478447

Chakareski, Jacob; Aksu, Ridvan; Swaminathan, Viswanathan; Zink, Michael (June 2021, Proceedings of the 12th ACM Multimedia Systems Conference)

We investigate the rate-distortion (R-D) characteristics of full ultra-high definition (UHD) 360° videos and capture corresponding head movement navigation data of virtual reality (VR) headsets. We use the navigation data to analyze how users explore the 360° look-around panorama for such content and formulate related statistical models. The developed R-D characteristics and modeling capture the spatiotemporal encoding efficiency of the content at multiple scales and can be exploited to enable higher operational efficiency in key use cases. The high quality expectations for next generation immersive media necessitate the understanding of these intrinsic navigation and content characteristics of full UHD 360° videos.
more » « less
Full Text Available
Closing-the-Loop: A Data-Driven Framework for Effective Video Summarization

https://doi.org/10.1109/ISM.2020.00042

Xu, Ran; Wang, Haoliang; Petrangeli, Stefano; Swaminathan, Viswanathan; Bagchi, Saurabh (December 2020, IEEE International Symposium on Multimedia (ISM))
null (Ed.)
Full Text Available
Rotationally-Consistent Novel View Synthesis for Humans

https://doi.org/10.1145/3394171.3413754

Kwon, Youngjoong; Petrangeli, Stefano; Kim, Dahun; Wang, Haoliang; Fuchs, Henry; Swaminathan, Viswanathan (October 2020, MM '20: Proceedings of the 28th ACM International Conference on Multimedia)
null (Ed.)
Human novel view synthesis aims to synthesize target views of a human subject given input images taken from one or more reference viewpoints. Despite significant advances in model-free novel view synthesis, existing methods present two major limitations when applied to complex shapes like humans. First, these methods mainly focus on simple and symmetric objects, e.g., cars and chairs, limiting their performances to fine-grained and asymmetric shapes. Second, existing methods cannot guarantee visual consistency across different adjacent views of the same object. To solve these problems, we present in this paper a learning framework for the novel view synthesis of human subjects, which explicitly enforces consistency across different generated views of the subject. Specifically, we introduce a novel multi-view supervision and an explicit rotational loss during the learning process, enabling the model to preserve detailed body parts and to achieve consistency between adjacent synthesized views. To show the superior performance of our approach, we present qualitative and quantitative results on the Multi-View Human Action (MVHA) dataset we collected (consisting of 3D human models animated with different Mocap sequences and captured from 54 different viewpoints), the Pose-Varying Human Model (PVHM) dataset, and ShapeNet. The qualitative and quantitative results demonstrate that our approach outperforms the state-of-the-art baselines in both per-view synthesis quality, and in preserving rotational consistency and complex shapes (e.g. fine-grained details, challenging poses) across multiple adjacent views in a variety of scenarios, for both humans and rigid objects.
more » « less
Full Text Available
Rotationally-Temporally Consistent Novel View Synthesis of Human Performance Video

https://doi.org/10.1007/978-3-030-58548-8_23

Kwon, Youngjoong; Petrangeli, Stefano; Kim, Dahun; Wang, Haolian; Park, Eunbyung; Swaminathan, Viswanathan; Fuchs, Henry (October 2020, European Conference on Computer Vision ECCV 2020: Computer Vision – ECCV 2020)
Vedaldi, Andrea; Bischof, Horst; Brox, Thomas; Frahm, Jan-Michael (Ed.)
Novel view video synthesis aims to synthesize novel viewpoints videos given input captures of a human performance taken from multiple reference viewpoints and over consecutive time steps. Despite great advances in model-free novel view synthesis, existing methods present three limitations when applied to complex and time-varying human performance. First, these methods (and related datasets) mainly consider simple and symmetric objects. Second, they do not enforce explicit consistency across generated views. Third, they focus on static and non-moving objects. The fine-grained details of a human subject can therefore suffer from inconsistencies when synthesized across different viewpoints or time steps. To tackle these challenges, we introduce a human-specific framework that employs a learned 3D-aware representation. Specifically, we first introduce a novel siamese network that employs a gating layer for better reconstruction of the latent volumetric representation and, consequently, final visual results. Moreover, features from consecutive time steps are shared inside the network to improve temporal consistency. Second, we introduce a novel loss to explicitly enforce consistency across generated views both in space and in time. Third, we present the Multi-View Human Action (MVHA) dataset, consisting of near 1200 synthetic human performance captured from 54 viewpoints. Experiments on the MVHA, Pose-Varying Human Model and ShapeNet datasets show that our method outperforms the state-of-the-art baselines both in view generation quality and spatio-temporal consistency.
more » « less
Full Text Available
Towards field-of-view prediction for augmented reality applications on mobile devices

https://doi.org/10.1145/3386293.3397114

Wang, Na; Wang, Haoliang; Petrangeli, Stefano; Swaminathan, Viswanathan; Li, Fei; Chen, Songqing (June 2020, MMSys '20: 11th ACM Multimedia Systems Conference)

By allowing people to manipulate digital content placed in the real world, Augmented Reality (AR) provides immersive and enriched experiences in a variety of domains. Despite its increasing popularity, providing a seamless AR experience under bandwidth fluctuations is still a challenge, since delivering these experiences at photorealistic quality with minimal latency requires high bandwidth. Streaming approaches have already been proposed to solve this problem, but they require accurate prediction of the Field-Of-View of the user to only stream those regions of scene that are most likely to be watched by the user. To solve this prediction problem, we study in this paper the watching behavior of users exploring different types of AR scenes via mobile devices. To this end, we introduce the ACE Dataset, the first dataset collecting movement data of 50 users exploring 5 different AR scenes. We also propose a four-feature taxonomy for AR scene design, which allows categorizing different types of AR scenes in a methodical way, and supporting further research in this domain. Motivated by the ACE dataset analysis results, we develop a novel user visual attention prediction algorithm that jointly utilizes information of users' historical movements and digital objects positions in the AR scene. The evaluation on the ACE Dataset show the proposed approach outperforms baseline approaches under prediction horizons of variable lengths, and can therefore be beneficial to the AR ecosystem in terms of bandwidth reduction and improved quality of users' experience.
more » « less
Full Text Available
QuRate: power-efficient mobile immersive video streaming

Jiang, Nan; Liu, Yao; Guo, Tian; Xu, Wenyao; Swaminathan, Viswanathan; Xu, Lisong; Wei, Sheng (January 2020, ACM Multimedia Systems Conference)
null (Ed.)
Full Text Available
QuRate: Power-Efficient Mobile Immersive Video Streaming

Jiang, Nan; Liu, Yao; Guo, Tian; Xu, Wenyao; Swaminathan, Viswanathan; Xu, Lisong; Wei, Sheng (January 2020, ACM Multimedia Systems Conference 2020 (MMSys'20))

Smartphones have recently become a popular platform for deploying the computation-intensive virtual reality (VR) applications, such as immersive video streaming (a.k.a., 360-degree video streaming). One specific challenge involving the smartphone-based head mounted display (HMD) is to reduce the potentially huge power consumption caused by the immersive video. To address this challenge, we first conduct an empirical power measurement study on a typical smartphone immersive streaming system, which identifies the major power consumption sources. Then, we develop QuRate, a quality-aware and user-centric frame rate adaptation mechanism to tackle the power consumption issue in immersive video streaming. QuRate optimizes the immersive video power consumption by modeling the correlation between the perceivable video quality and the user behavior. Specifically, QuRate builds on top of the user’s reduced level of concentration on the video frames during view switching and dynamically adjusts the frame rate without impacting the perceivable video quality. We evaluate QuRate with a comprehensive set of experiments involving 5 smartphones, 21 users, and 6 immersive videos using empirical user head movement traces. Our experimental results demonstrate that QuRate is capable of extending the smartphone battery life by up to 1.24X while maintaining the perceivable video quality during immersive video streaming. Also, we conduct an Institutional Review Board (IRB)- approved subjective user study to further validate the minimum video quality impact caused by QuRate.
more » « less
Full Text Available
Viewport-Driven Rate-Distortion Optimized Scalable Live 360° Video Network Multicast

https://doi.org/10.1109/ICMEW.2018.8551577

Aksu, Ridvan; Chakareski, Jacob; Swaminathan, Viswanathan (July 2018, IEEE International Conference on Multimedia and Expo Hot 3D Workshop)

Full Text Available

« Prev Next »

Search for: All records