Joint Learning of Speech-Driven Facial Motion with Bidirectional Long-Short Term Memory

Sadoughi, Najmeh; and Busso, Carlos

doi:10.1007/978-3-319-67401-8_49

Citation Details

Joint Learning of Speech-Driven Facial Motion with Bidirectional Long-Short Term Memory

Thefaceconveysablendofverbalandnonverbalinformation playing an important role in daily interaction. While speech articulation mostly affects the orofacial areas, emotional behaviors are externalized across the entire face. Considering the relation between verbal and non-verbal behaviors is important to create naturalistic facial movements for conversational agents (CAs). Furthermore, facial muscles connect areas across the face, creating principled relationships and dependencies between the movements that have to be taken into account. These relationships are ignored when facial movements across the face are sep- arately generated. This paper proposes to create speech-driven models that jointly capture the relationship not only between speech and facial movements, but also across facial movements. The input to the models are features extracted from speech that convey the verbal and emotional states of the speakers. We build our models with bidirectional long-short term memory (BLSTM) units which are shown to be very successful in modeling dependencies for sequential data. The objective and subjective evaluations of the results demonstrate the benefits of joint modeling of facial regions using this framework. more »

Award ID(s):: 1718944

PAR ID:: 10072817

Author(s) / Creator(s):: Sadoughi, Najmeh; and Busso, Carlos

Date Published:: 2017-01-01

Journal Name:: International Conference on Intelligent Virtual Agents (IVA 2017)

Volume:: 10498

Page Range / eLocation ID:: 389-402

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1007/978-3-319-67401-8_49

More Like this