Towards Open Domain Text-Driven Synthesis of Multi-person Motions

Shan, Mengyi; Dong, Lu; Han, Yutao; Yao, Yuan; Liu, Tao; Nwogu, Ifeoma; Qi, Guo_Jun; Hill, Mitch

Citation Details

This work aims to generate natural and diverse group motions of multiple humans from textual descriptions. While singleperson text-to-motion generation is extensively studied, it remains challenging to synthesize motions for more than one or two subjects from in-the-wild prompts, mainly due to the lack of available datasets. In this work, we curate human pose and motion datasets by estimating pose information from large-scale image and video datasets. Our models use a transformer-based diffusion framework that accommodates multiple datasets with any number of subjects or frames. Experiments explore both generation of multi-person static poses and generation of multiperson motion sequences. To our knowledge, our method is the first to generate multi-subject motion sequences with high diversity and fidelity from a large variety of textual prompts. more »

Award ID(s):: 2223507

PAR ID:: 10569922

Author(s) / Creator(s):: Shan, Mengyi; Dong, Lu; Han, Yutao; Yao, Yuan; Liu, Tao; Nwogu, Ifeoma; Qi, Guo_Jun; Hill, Mitch

Publisher / Repository:: Springer_Science+Business_Media

Date Published:: 2024-10-04

ISBN:: 978-3-031-73650-6

Format(s):: Medium: X

Location:: Milan,Italy

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this