NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MiRODES: Mini Intelligent Robot for On-campus Domain-specific Event Support

https://doi.org/10.1145/3610978.3641272

Manzano, Mila; O'Donnell, Kristen; Espinosa, Eden; Yoon, Sejong (March 2024, ACM)

Full Text Available
Microscopic modeling of attention-based movement behaviors

https://doi.org/10.1016/j.trc.2024.104583

Li, Danrui; Schwartz, Mathew; Sohn, Samuel S; Yoon, Sejong; Pavlovic, Vladimir; Kapadia, Mubbasir (May 2024, Transportation Research Part C: Emerging Technologies)

Full Text Available
Learning from Synthetic Human Group Activities

Chang, Che-Jui; Li, Danrui; Patel, Deep; Goes, Parth; Zhou, Honglu; Moon, Seonghyeon; Sohn, Samuel S; Yoon, Sejong; Pavlovic, Vladimir; Kapadia, Mubbasir (July 2024, The IEEE/CVF Conference on Computer Vision and Pattern Recognition)

The study of complex human interactions and group activities has become a focal point in human-centric computer vision. However, progress in related tasks is often hindered by the challenges of obtaining large-scale labeled datasets from real-world scenarios. To address the limitation, we introduce M3Act, a synthetic data generator for multi-view multi-group multi-person human atomic actions and group activities. Powered by Unity Engine, M3Act features multiple semantic groups, highly diverse and photorealistic images, and a comprehensive set of annotations, which facilitates the learning of human-centered tasks across singleperson, multi-person, and multi-group conditions. We demonstrate the advantages of M3Act across three core experiments. The results suggest our synthetic dataset can significantly improve the performance of several downstream methods and replace real-world datasets to reduce cost. Notably, M3Act improves the state-of-the-art MOTRv2 on DanceTrack dataset, leading to a hop on the leaderboard from 10th to 2nd place. Moreover, M3Act opens new research for controllable 3D group activity generation. We define multiple metrics and propose a competitive baseline for the novel task. Our code and data are available at our project page: http://cjerry1243.github.io/M3Act.
more » « less
Full Text Available
Toward Realistic Human Crowd Simulations with Data-Driven Parameter Space Exploration

https://doi.org/10.1109/AIxVR59861.2024.00035

Hu, Kaidong; Yoon, Sejong; Pavlovic, Vladimir; Kapadia, Mubbasir (January 2024, IEEE)

Full Text Available
MSI: Maximize Support-Set Information for Few-Shot Segmentation

Moon, Seonghyeon; Sohn, Samuel S.; Zhou, Honglu; Yoon, Sejong; Pavlovic, Vladimir; Khan, Muhammad Haris; Kapadia, Mubbasir (October 2023, International Conference on Computer Vision (ICCV))

FSS (Few-shot segmentation) aims to segment a target class using a small number of labeled images (support set). To extract information relevant to the target class, a dominant approach in best performing FSS methods removes background features using a support mask. We observe that this feature excision through a limiting support mask introduces an information bottleneck in several challenging FSS cases, e.g., for small targets and/or inaccurate target boundaries. To this end, we present a novel method (MSI), which maximizes the support-set information by exploiting two complementary sources of features to generate super correlation maps. We validate the effectiveness of our approach by instantiating it into three recent and strong FSS methods. Experimental results on several publicly available FSS benchmarks show that our proposed method consistently improves performance by visible margins and leads to faster convergence. Our code and trained models are available at: https://github.com/moonsh/MSI-Maximize-Support-Set-Information
more » « less
Full Text Available
Constructivist Approaches for Computational Emotions: A Systematic Survey

Viola, Alexander; Pavlovic, Vladimir; Yoon, Sejong (January 2023, Computational Theory of Mind for Human-Machine Teams. AAAI-FSS 2021. Lecture Notes in Computer Science)
Gurney, Nikolos; Sukthankar, Gita (Ed.)
Computational emotion, is naturally predicated on an operating theory of emotion. This paper seeks to explore the prevalence of three different approaches in the literature, namely basic emotion, dimensional emotion, and constructed emotion. Basic emotion maintains that there exists a discrete set of primitive emotions evolved as responses to certain stimuli; dimensional emotion sees different emotions as systematically related by two or more dimensions (typically valence and arousal); and constructed emotion describes emotional experience as a function of the brain’s general predictive faculties applied to learned social concepts of different emotions. In order to see how these approaches are represented in affective computing literature, we conduct a systematic survey spanning the IEEE, ACM, ScienceDirect, and Engineering Village databases. Out of 204 selected papers, 151 apply basic emotion theory, 48 apply dimensional emotion, and 5 apply constructed emotion. We find promising representation of the constructed emotion theory in the affective computing literature and conclude that it provides a theoretical basis worth pursuing for affective engagement human computer interaction (HCI) applications.
more » « less
Full Text Available
A2X: An end-to-end framework for assessing agent and environment interactions in multimodal human trajectory prediction

https://doi.org/10.1016/j.cag.2022.05.010

Sohn, Samuel S.; Lee, Mihee; Moon, Seonghyeon; Qiao, Gang; Usman, Muhammad; Yoon, Sejong; Pavlovic, Vladimir; Kapadia, Mubbasir (August 2022, Computers & Graphics)

Full Text Available
Harnessing Fourier Isovists and Geodesic Interaction for Long-Term Crowd Flow Prediction

https://doi.org/10.24963/ijcai.2022/185

Sohn, Samuel S.; Moon, Seonghyeon; Zhou, Honglu; Lee, Mihee; Yoon, Sejong; Pavlovic, Vladimir; Kapadia, Mubbasir (January 2022, Thirty-First International Joint Conference on Artificial Intelligence (IJCAI))

Full Text Available
Laying the Foundations of Deep Long-Term Crowd Flow Prediction

Sohn, Sam; Zhou, Honglu; Moon, Seonghyeon; Yoon, Sejong; Pavlovic, Vladimir; Kapadia, Mubbasir (October 2020, European Conference on Computer Vision)

Predicting the crowd behavior in complex environments is a key requirement for crowd and disaster management, architectural design, and urban planning. Given a crowd’s immediate state, current approaches must be successively repeated over multiple time-steps for long-term predictions, leading to compute expensive and error-prone results. However, most applications require the ability to accurately predict hundreds of possible simulation outcomes (e.g., under different environment and crowd situations) at real-time rates, for which these approaches are prohibitively expensive. We propose the first deep framework to instantly predict the long-term flow of crowds in arbitrarily large, realistic environments. Central to our approach are a novel representation CAGE, which efficiently encodes crowd scenarios into compact, fixed-size representations that losslessly represent the environment, and a modified SegNet architecture for instant long-term crowd flow prediction. We conduct comprehensive experiments on novel synthetic and real datasets. Our results indicate that our approach is able to capture the essence of real crowd movement over very long time periods, while generalizing to never-before-seen environments and crowd contexts. The associated Supplementary Material, models, and datasets are available at github.com/SSSohn/LTCF.
more » « less
Full Text Available
Predicting Crowd Egress and Environment Relationships to Support Building Design Optimization

https://doi.org/10.1016/j.cag.2020.03.005

Hu, Kaidong; Yoon, Sejong; Pavlovic, Vladimir; Faloutsos, Petros; Kapadia, Mubbasir (May 2020, Computers & Graphics)

Full Text Available

« Prev Next »

Search for: All records