NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Augmented Photogrammetry: 3D Object Scanning and Appearance Editing in Mobile Augmented Reality

https://doi.org/10.1145/3586182.3616638

Lohn, Daniel; Höllerer, Tobias; Sra, Misha (October 2023, UIST '23 Adjunct: Adjunct Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology)

Full Text Available
OCTOPUS: Open-vocabulary Content Tracking and Object Placement Using Semantic Understanding in Mixed Reality

Yoffe, Luke; Sharma, Aditya; and Höllerer, Tobias (October 2023, IEEE International Symposium on Mixed and Augmented Reality ISMARAdjunct)

Full Text Available
The Impact of Navigation Aids on Search Performance and Object Recall in Wide-Area Augmented Reality

https://doi.org/10.1145/3544548.3581413

Kumaran, Radha; Kim, You-Jin; Milner, Anne E; Bullock, Tom; Giesbrecht, Barry; Höllerer, Tobias (April 2023, CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems)

Full Text Available
Level-of-Detail AR: Dynamically Adjusting Augmented Reality Level of Detail Based on Visual Angle

https://doi.org/10.1109/VR55154.2023.00022

Wysopal, Abby; Ross, Vivian; Passananti, Joyce; Yu, Kangyou; Huynh, Brandon; Höllerer, Tobias (March 2023, 2023 IEEE Conference Virtual Reality and 3D User Interfaces (VR))

Full Text Available
Benefits of Synthetically Pre-trained Depth-Prediction Networks for Indoor/Outdoor Image Classification

https://doi.org/10.1109/WACVW58289.2023.00040

Lin, Kelly X.; Cho, Irene; Walimbe, Amey; Zamora, Bryan A.; Rich, Alex; Zhang, Sirius Z.; Höllerer, Tobias (January 2023, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW))

Ground truth depth information is necessary for many computer vision tasks. Collecting this information is chal-lenging, especially for outdoor scenes. In this work, we propose utilizing single-view depth prediction neural networks pre-trained on synthetic scenes to generate relative depth, which we call pseudo-depth. This approach is a less expen-sive option as the pre-trained neural network obtains ac-curate depth information from synthetic scenes, which does not require any expensive sensor equipment and takes less time. We measure the usefulness of pseudo-depth from pre-trained neural networks by training indoor/outdoor binary classifiers with and without it. We also compare the difference in accuracy between using pseudo-depth and ground truth depth. We experimentally show that adding pseudo-depth to training achieves a 4.4% performance boost over the non-depth baseline model on DIODE, a large stan-dard test dataset, retaining 63.8% of the performance boost achieved from training a classifier on RGB and ground truth depth. It also boosts performance by 1.3% on another dataset, SUN397, for which ground truth depth is not avail-able. Our result shows that it is possible to take information obtained from a model pre-trained on synthetic scenes and successfully apply it beyond the synthetic domain to real-world data.
more » « less
Full Text Available
Impact of Annotator Demographics on Sentiment Dataset Labeling

https://doi.org/10.1145/3555632

Ding, Yi; You, Jacob; Machulla, Tonja-Katrin; Jacobs, Jennifer; Sen, Pradeep; Höllerer, Tobias (November 2022, Proceedings of the ACM on Human-Computer Interaction)

As machine learning methods become more powerful and capture more nuances of human behavior, biases in the dataset can shape what the model learns and is evaluated on. This paper explores and attempts to quantify the uncertainties and biases due to annotator demographics when creating sentiment analysis datasets. We ask >1000 crowdworkers to provide their demographic information and annotations for multimodal sentiment data and its component modalities. We show that demographic differences among annotators impute a significant effect on their ratings, and that these effects also occur in each component modality. We compare predictions of different state-of-the-art multimodal machine learning algorithms against annotations provided by different demographic groups, and find that changing annotator demographics can cause >4.5 in accuracy difference when determining positive versus negative sentiment. Our findings underscore the importance of accounting for crowdworker attributes, such as demographics, when building datasets, evaluating algorithms, and interpreting results for sentiment analysis.
more » « less
Full Text Available
Investigating Search Among Physical and Virtual Objects Under Different Lighting Conditions

https://doi.org/10.1109/TVCG.2022.3203093

Kim, You-Jin; Kumaran, Radha; Sayyad, Ehsan; Milner, Anne; Bullock, Tom; Giesbrecht, Barry; Hollerer, Tobias (November 2022, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
Layerable Apps: Comparing Concurrent and Exclusive Display of Augmented Reality Applications

https://doi.org/10.1109/ISMAR55827.2022.00104

Huynh, Brandon; Wysopal, Abby; Ross, Vivian; Orlosky, Jason; Hollerer, Tobias (October 2022, Proceedings International Symposium on Mixed and Augmented Reality ISMAR)

Full Text Available
VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion

https://doi.org/10.1109/3DV53792.2021.00042

Stier, Noah; Rich, Alexander; Sen, Pradeep; Hollerer, Tobias (December 2021, 2021 International Conference on 3D Vision (3DV))

Recent volumetric 3D reconstruction methods can produce very accurate results, with plausible geometry even for unobserved surfaces. However, they face an undesirable trade-off when it comes to multi-view fusion. They can fuse all available view information by global averaging, thus losing fine detail, or they can heuristically cluster views for local fusion, thus restricting their ability to consider all views jointly. Our key insight is that greater detail can be retained without restricting view diversity by learning a view-fusion function conditioned on camera pose and image content. We propose to learn this multi-view fusion using a transformer. To this end, we introduce VoRTX, 1 an end-to-end volumetric 3D reconstruction network using transformers for wide-baseline, multi-view feature fusion. Our model is occlusion-aware, leveraging the transformer architecture to predict an initial, projective scene geometry estimate. This estimate is used to avoid back-projecting image features through surfaces into occluded regions. We train our model on ScanNet and show that it produces better reconstructions than state-of-the-art methods. We also demonstrate generalization without any fine-tuning, outperforming the same state-of-the-art methods on two other datasets, TUM-RGBD and ICL-NUIM.
more » « less
Full Text Available
3DVNet: Multi-View Depth Prediction and Volumetric Refinement

https://doi.org/10.1109/3DV53792.2021.00079

Rich, Alexander; Stier, Noah; Sen, Pradeep; Hollerer, Tobias (December 2021, 2021 International Conference on 3D Vision (3DV))

We present 3DVNet, a novel multi-view stereo (MVS) depth-prediction method that combines the advantages of previous depth-based and volumetric MVS approaches. Our key idea is the use of a 3D scene-modeling network that iteratively updates a set of coarse depth predictions, resulting in highly accurate predictions which agree on the underlying scene geometry. Unlike existing depth-prediction techniques, our method uses a volumetric 3D convolutional neural network (CNN) that operates in world space on all depth maps jointly. The network can therefore learn meaningful scene-level priors. Furthermore, unlike existing volumetric MVS techniques, our 3D CNN operates on a feature-augmented point cloud, allowing for effective aggregation of multi-view information and flexible iterative refinement of depth maps. Experimental results show our method exceeds state-of-the-art accuracy in both depth prediction and 3D reconstruction metrics on the ScanNet dataset, as well as a selection of scenes from the TUM-RGBD and ICL-NUIM datasets. This shows that our method is both effective and generalizes to new settings.
more » « less
Full Text Available

« Prev Next »

Search for: All records