Human Object Interaction

Human-object interaction (HOI) research focuses on understanding and modeling how humans interact with objects in images and videos, aiming to accurately detect, classify, and even generate these interactions. Current research emphasizes developing robust models, often leveraging transformer architectures and diffusion models, to handle challenges like occlusion, diverse object categories, and limited training data, particularly in zero-shot and few-shot learning scenarios. This field is crucial for advancing computer vision, robotics, and human-computer interaction, with applications ranging from improved activity recognition and virtual/augmented reality to more intuitive human-robot collaboration and assistive technologies. The development of large-scale, high-quality datasets with detailed annotations is also a significant area of focus.

Papers

June 30, 2024

Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics
Fan Yang, Wenrui Chen, Kailun Yang, Haoran Lin, DongSheng Luo, Conghui Tang, Zhiyong Li, Yaonan Wang
LeArning Abstract Affordance Learning Human Object Interaction

June 28, 2024

EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting
Daiwei Zhang, Gengyan Li, Jiajie Li, Mickaël Bressieux, Otmar Hilliges, Marc Pollefeys, Luc Van Gool, Xi Wang
Gaussian Splatting Human Object Interaction Scene Understanding Egocentric Data Depth Camera Human Scene Interaction

June 27, 2024

CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
Yun Liu, Chengwen Zhang, Ruofan Xing, Bingda Tang, Bowen Yang, Li Yi
Human Object Interaction Motion Sequence Object Rearrangement Human Motion Forecasting

June 26, 2024

Geometric Features Enhanced Human-Object Interaction Detection
Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang, Hubert P. H. Shum
Human Object Interaction Human Object Interaction Detection Geometric Feature

June 25, 2024

Human-Object Interaction from Human-Level Instructions
Zhen Wu, Jiaman Li, Pei Xu, C. Karen Liu
Human Instruction Human Object Interaction Motion Generator Planning Perspective Human Object Spatial Relation

June 20, 2024

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
Jiawei Gao, Ziqin Wang, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
Human Object Interaction Multi Agent Learning Object Dynamic Humanoid Control Multi Character Interaction

May 22, 2024

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Yuhang Yang, Wei Zhai, Chengfeng Wang, Chengjun Yu, Yang Cao, Zheng-Jun Zha
Affordance Learning Human Object Interaction Egocentric View Object Interaction Ego4D AudioVisual Human Centric Perception

May 16, 2024

April 19, 2024

Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model
Jihao Dong, Renjie Pan, Hua Yang
Vision Language Model Human Object Interaction Semantic Alignment Human Object Pair

April 9, 2024

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection
Ting Lei, Shaofeng Yin, Yang Liu
Foundation Model Full Potential Human Object Interaction Human Object Pair

April 2, 2024

Disentangled Pre-training for Human-Object Interaction Detection
Zhuolong Li, Xingao Li, Changxing Ding, Xiangmin Xu
Human Object Interaction Human Object Interaction Detection Interaction Decoder Disentangled Haze

March 30, 2024

HOI-M3:Capture Multiple Humans and Objects Interaction within Contextual Environment
Juze Zhang, Jingyan Zhang, Zining Song, Zhanhe Shi, Chengfeng Zhao, Ye Shi, Jingyi Yu, Lan Xu, Jingya Wang
Human Object Interaction Object Interaction HOI M3 Dataset Monocular Capture

March 28, 2024

InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
Sirui Xu, Ziyin Wang, Yu-Xiong Wang, Liang-Yan Gui
Text Modality Human Object Interaction Human Motion Generation Text to Motion Interaction Datasets

March 22, 2024

InterFusion: Text-Driven Generation of 3D Human-Object Interaction
Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu
Human Object Interaction Text to 3D Generation Text to 3D Text Driven 3D Text Driven Generation

March 17, 2024

March 12, 2024

Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration
Weiying Xue, Qi Liu, Qiwei Xiong, Yuxiao Wang, Zhenao Wei, Xiaofen Xing, Xiangmin Xu
Zero Shot Human Object Interaction Human Object Interaction Detection Human Object Pair Interaction Decoder

March 4, 2024

FreeA: Human-object Interaction Detection using Free Annotation Labels
Yuxiao Wang, Zhenao Wei, Xinyu Jiang, Yu Lei, Weiying Xue, Jinxiu Liu, Qi Liu
Human Object Interaction Human Object Interaction Detection Group Annotation Human Object Pair

January 18, 2024

ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions
Jeonghwan Kim, Jisoo Kim, Jeonghyeon Na, Hanbyul Joo
Human Object Interaction Motion Capture 3D Generative Hand Motion

Human Object Interaction

Papers

Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics

EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting

CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement

Geometric Features Enhanced Human-Object Interaction Detection

Human-Object Interaction from Human-Level Instructions

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views

VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing

Learning from Observer Gaze:Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition

Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

Disentangled Pre-training for Human-Object Interaction Detection

HOI-M3:Capture Multiple Humans and Objects Interaction within Contextual Environment

InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

FORCE: Physics-aware Human-object Interaction

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention

Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration

FreeA: Human-object Interaction Detection using Free Annotation Labels

ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions