Gesture Generation

Gesture generation focuses on creating realistic and contextually appropriate movements to accompany speech or text, primarily for virtual agents and robots to enhance human-computer interaction. Current research heavily utilizes deep learning models, particularly diffusion models and transformers, often incorporating multimodal data (audio, text, video) to improve the naturalness and semantic coherence of generated gestures. This field is significant for advancing human-robot interaction, virtual character animation, and accessibility technologies by enabling more natural and expressive communication.

Papers

December 26, 2023

Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control
Zunnan Xu, Yachao Zhang, Sicheng Yang, Ronghui Li, Xiu Li
Faithful Generation Side Chain High Quality Gesture Hand Motion Gesture Generation Gesture Synthesis Conditional Control Modal Prior

October 4, 2023

Large language models in textual analysis for gesture selection
Laura B. Hensel, Nutchanon Yongsatianchot, Parisa Torshizi, Elena Minucci, Stacy Marsella
Large Language Model Gesture Recognition High Quality Gesture Gesture Generation Content Analysis Gesture Sequence Pre Existing Gesture

September 17, 2023

August 29, 2023

C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model
Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin
Latent Diffusion Model Hand Gesture Gesture Generation

August 24, 2023

The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings
Taras Kucherenko, Rajmund Nagy, Youngwoo Yoon, Jieyeon Woo, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter
Human Motion Motion Capture Gesture Generation Large Scale Evaluation Dyadic Interaction Speech Driven Gesture Agent Task

August 8, 2023

TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body Gestures Generation
Mireille Fares, Catherine Pelachaud, Nicolas Obin
Style Transfer Multimodal Transformer Behavior Expressivity Style Gesture Generation Multimodal Behavior Semantic CLIPstyler

June 20, 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Lianying Yin, Yijun Wang, Tianyu He, Jinming Liu, Wei Zhao, Bohan Li, Xin Jin, Jianxin Lin
Diffusion Model Joint Modeling Gesture Generation Co Speech Gesture Generation Gesture Synthesis Correlation Transformer Co Speech 3D Gesture

March 26, 2023

GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents
Tenglong Ao, Zeyi Zhang, Libin Liu
Fine Grained Contrastive Language Image High Quality Gesture Co Speech Gesture Gesture Generation Synthetic Gesture

March 23, 2023

GesGPT: Speech Gesture Synthesis With Text Parsing from ChatGPT
Nan Gao, Zeyu Zhao, Zhi Zeng, Shuwu Zhang, Dongdong Weng, Yihua Bao
ChatGPT Generated Conversation Text Modality Gesture Generation Body Gesture Speech Driven Gesture Gesture Synthesis

March 15, 2023

Evaluating gesture generation in a large-scale open challenge: The GENEA Challenge 2022
Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter
Open Challenge Gesture Generation Synthetic Gesture

January 13, 2023

A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Simbarashe Nyatsanga, Taras Kucherenko, Chaitanya Ahuja, Gustav Eje Henter, Michael Neff
Comprehensive Review Co Speech Gesture Gesture Generation User Utterance Gesture Synthesis

October 13, 2022

Deep Gesture Generation for Social Robots Using Type-Specific Libraries
Hitoshi Teshima, Naoki Wake, Diego Thomas, Yuta Nakashima, Hiroshi Kawasaki, Katsushi Ikeuchi
Social Robot High Quality Gesture Gesture Generation Gesture Sequence Open Source Library

September 15, 2022

ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech
Saeed Ghorbani, Ylva Ferstl, Daniel Holden, Nikolaus F. Troje, Marc-André Carbonneau
Zero Shot Speech Analysis Style Representation Gesture Generation Motion Style Speech Driven Gesture

August 25, 2022

The ReprGesture entry to the GENEA Challenge 2022
Sicheng Yang, Zhiyong Wu, Minglei Li, Mengchen Zhao, Jiuxin Lin, Liyang Chen, Weihong Bao
Gesture Recognition Modality Specific Multimodal Representation Learning Gesture Generation

August 22, 2022

The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter
Gesture Generation Large Scale Evaluation Human Gesture

August 5, 2022

Real-time Gesture Animation Generation from Speech for Virtual Human Interaction
Manuel Rebol, Christian Gütl, Krzysztof Pietroszek
Speech Analysis Generative Adversarial Gesture Generation Virtual Co Presence Speech Driven Gesture Gesture Control

August 3, 2022

Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding
Mireille Fares, Michele Grimaldi, Catherine Pelachaud, Nicolas Obin
Speech Analysis Style Transfer Multimodal Representation Hand Gesture Gesture Generation Style Encoder Speaker Variability Multimodal Behavior