Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Amodal segmentation is an image-based algorithm that aims to predict masks for both visible and occluded parts of objects. Existing methods typically rely on supervised learning with annotated amodal masks or synthetic data. The effectiveness of these methods relies heavily on the quality of the datasets. This dependence can unintentionally restrict their generalization capabilities due to insufficient diversity and size. Although existing zero-shot methods perform well on their reported datasets, their performance does not necessarily transfer to other datasets. We propose a tuning-free approach that re-purposes diffusion-based inpainting foundation models for amodal segmentation. Our approach is motivated by the “occlusion-free bias” of inpainting models, i.e., the inpainted objects tend to be complete and without occlusions. We reconstruct the occluded regions of an object via inpainting and then apply segmentation, all without additional training or fine-tuning. Experiments on five datasets, three previously unreported, demonstrate the generalizability of our approach. On average, our approach achieves 5.3% more accurate masks in mIoU compared to the publicly available state-of-the-art, pix2gestalt.more » « less
-
Finding where the trees are in a city and monitoring any changes are essential for sustainable urban management. Historically, urban forests are mainly inventoried via manual processes often limited to public lands. Leveraging advances in computing, we present a novel generative artificial intelligence (AI) method along with a first-ever national-scale dataset, to automatically localize trees in cities across the nation using satellite imagery. Our monitoring approach is fully automated and can be completed for 330 U.S. cities within less than a day of computing, enabling actionable knowledge of changes in urban trees and supporting sustainable development decisions. We successfully localized and counted over 278 million trees, achieving an average tree count accuracy of 92.5% and spatial accuracy of 1.5m for 2024–2025. Our computational approach allows for novel nationwide analysis to be performed. For example, we can localize approximately 117 million trees on private lands and 161 million on public lands. Further, we show and quantify that urban tree distribution exhibits strong spatial disparity, with low-income communities having substantially fewer trees and less canopy cover than others. In addition, we compare tree count and layouts before and after multiple major events (e.g., major fires and destructive weather phenomena). Overall, our approach enhances computational urban planning, including weather and extreme event forecasting, for the development of sustainable cities.more » « less
-
Not AvailableExtended Reality (XR) learning environments generate rich behavioral data through embodied interaction—head movements, gaze patterns, and spatial navigation—that could enable passive assessment without interrupting the learning experience. We investigate spatial exploration behavior in ACHIEVE, an XR environment for neural network visualization learning. In a between-subjects study (N = 56), XR participants exhibited significantly different spatial behavior than desktop users: over 14 times greater pointer movement (M = 132.5 m vs. M = 9.5 m), extensive head rotation (M = 9,818◦ ), and M = 22.5 m of head translation during the learning session. We visualize individual exploration patterns through head and pointer trajectory traces, revealing substantial variation in how learners navigate the 3D content. These spatial metrics, automatically captured during learning, represent a promising avenue for passive assessment of embodied engagement—enabling educators to identify struggling learners, provide personalized feedback, and adapt content delivery without intrusive testing. Full learning outcomes and user experience metrics are reported in companion publications; here we focus on spatial behavior as a novel contribution toward spatialized learning analytics in educationmore » « less
-
Accurate tree species identification through bark characteristics is essential for effective forest management, but traditionally requires extensive expertise. This study leverages artificial intelligence (AI), specifically the EfficientNet-B3 convolutional neural network, to enhance AI-based tree bark identification, focusing on northern red oak (Quercus rubra), hackberry (Celtis occidentalis), and bitternut hickory (Carya cordiformis) using the CentralBark dataset. We investigated three environmental variables—time of day (lighting conditions), bark moisture content (wet or dry), and cardinal direction of observation—to identify sources of classification inaccuracies. Results revealed that bark moisture significantly reduced accuracy by 8.19% in wet conditions (89.32% dry vs. 81.13% wet). In comparison, the time of day had a significant impact on hackberry (95.56% evening) and northern red oak (80.80% afternoon), with notable chi-squared associations (p < 0.05). Cardinal direction had minimal effect (4.72% variation). Bitternut hickory detection consistently underperformed (26.76%), highlighting morphological challenges. These findings underscore the need for targeted dataset augmentation with wet and afternoon images, alongside preprocessing techniques like illumination normalization, to improve model robustness. Enhanced AI tools will streamline forest inventories, support biodiversity monitoring, and bolster conservation in dynamic forest ecosystems.more » « less
-
We introduce Arenite, a novel physics-based approach for modeling sandstone structures. The key insight of our work is that simulating a combination of stress and multi-factor erosion enables the generation of a wide variety of sandstone structures observed in nature. We isolate the key shape-forming phenomena: multi-physics fabric interlocking, wind and fluvial erosion, and particle-based deposition processes. Complex 3D structures such as arches, alcoves, hoodoos, or buttes can be achieved by creating simple 3D structures with user-painted erodable areas and vegetation and running the simulation. We demonstrate the algorithm on a wide variety of structures, and our GPU-based implementation achieves the simulation in less than 5 minutes on a desktop computer for our most complex example.more » « less
-
We propose a novel approach for the computational modeling of lignified tissues, such as those found in tree branches and timber. We leverage a stateof the-art strand-based representation for tree form, which we extend to describe biophysical processes at short and long time scales. Simulations at short time scales enable us to model different breaking patterns due to branch bending, twisting, and breaking. On long timescales, our method enables the simulation of realistic branch shapes under the influence of plausible biophysical processes, such as the development of compression and tension wood. We specifically focus on computationally fast simulations of woody material, enabling the interactive exploration of branches and wood breaking. By leveraging Cosserat rod physics, our method enables the generation of a wide variety of breaking patterns. We showcase the capabilities of our method by performing and visualizing numerous experiments.more » « less
-
We introduce RGB2Point, an unposed single-view RGB image to a 3D point cloud generation based on Transformer. RGB2Point takes an input image of an object and generates a dense 3D point cloud. Contrary to prior works based on CNN layers and diffusion-denoising approaches, we use pre-trained Transformer layers that are fast and generate high-quality point clouds with consistent quality over available categories. Our generated point clouds demonstrate high quality on a real-world dataset, as evidenced by improved Chamfer distance (51.15%) and Earth Mover’s distance (36.17%) metrics compared to the current state-of the-art. Additionally, our approach shows a better quality on a synthetic dataset, achieving better Chamfer distance (39.26%), Earth Mover’s distance (26.95%), and F-score (47.16%). Moreover, our method produces 63.1% more consistent high-quality results across various object categories compared to prior works. Furthermore, RGB2Point is computationally efficient, requiring only 2.3GB of VRAM to reconstruct a 3D point cloud from a single RGB image, and our implementation generates the results 15,133× faster than a SOTA diffusion-based model.more » « less
An official website of the United States government
