Human Object Interaction

Human-object interaction (HOI) research focuses on understanding and modeling how humans interact with objects in images and videos, aiming to accurately detect, classify, and even generate these interactions. Current research emphasizes developing robust models, often leveraging transformer architectures and diffusion models, to handle challenges like occlusion, diverse object categories, and limited training data, particularly in zero-shot and few-shot learning scenarios. This field is crucial for advancing computer vision, robotics, and human-computer interaction, with applications ranging from improved activity recognition and virtual/augmented reality to more intuitive human-robot collaboration and assistive technologies. The development of large-scale, high-quality datasets with detailed annotations is also a significant area of focus.

Papers

September 28, 2024

1st Place Solution to the 8th HANDS Workshop Challenge -- ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction
Jeongwan On, Kyeonghwan Gwak, Gunyoung Kang, Hyein Hwang, Soohyun Hwang, Junuk Cha, Jaewook Han, Seungryul Baek
3D Reconstruction Human Object Interaction Hand Object Interaction Place Solution Bimanual Manipulation POLAR Keywords Workshop Challenge

September 16, 2024

Highly dynamic physical interaction for robotics: design and control of an active remote center of compliance
Christian Friedrich, Patrick Frank, Marco Santin, Matthias Haag
Robotics Domain Human Robot Interaction External Control Human Object Interaction Regulatory Compliance Remote Human Operator Hybrid Control Interaction Control

September 15, 2024

A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities
Jungpil Shin, Najmul Hassan, Abu Saleh Musa Miah1, Satoshi Nishimura
Computer Vision Activity Recognition Human Object Interaction Human Activity Recognition Activity Detection

September 12, 2024

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors
Thomas Hanwen Zhu, Ruining Li, Tomas Jakab
Zero Shot Human Object Interaction 3D Human Diffusion Prior Subject Driven Generation MeSH Data

September 3, 2024

A New People-Object Interaction Dataset and NVS Benchmarks
Shuai Guo, Houqiang Zhong, Qiuwen Wang, Ziyu Chen, Yijie Gao, Jiajing Yuan, Chenyu Zhang, Rong Xie, Li Song
Human Object Interaction RGB D Sequence

August 25, 2024

InterTrack: Tracking Human Object Interaction without Object Templates
Xianghui Xie, Jan Eric Lenssen, Gerard Pons-Moll
Human Object Interaction Object Tracking Pose Tracking Object Model Interaction Understanding

August 20, 2024

A Review of Human-Object Interaction Detection
Yuxiao Wang, Qiwei Xiong, Yu Lei, Weiying Xue, Qi Liu, Zhenao Wei
Human Object Interaction Visual Understanding Human Object Interaction Detection

August 14, 2024

UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection
Mu Chen, Minghan Chen, Yi Yang
Human Object Interaction Interaction Prediction Interaction Learning

August 13, 2024

Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision
Tsung-Shan Yang, Yun-Cheng Wang, Chengwei Wei, Suya You, C. -C. Jay Kuo
Data Detection Human Object Interaction Conditional Reasoning Hoi Detection Interaction Decoder Frozen Feature

August 12, 2024

Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection
Yixin Guo, Yu Liu, Jianghao Li, Weimin Wang, Qi Jia
Full Potential Single CLIP Human Object Interaction CLIP Embeddings

August 11, 2024

An analysis of HOI: using a training-free method with multimodal visual foundation models when only the test set is available, without the training set
Chaoyi Ai
General Analysis Human Object Interaction Training Free Shot Scenario Training Set Multimodal Foundation

August 5, 2024

Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
Ting Lei, Shaofeng Yin, Yuxin Peng, Yang Liu
Human Object Interaction Multi Modal PromPt Human Object Pair

July 31, 2024

A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap
Lijun Zhang, Wei Suo, Peng Wang, Yanning Zhang
Human Object Interaction Plug and Play Domain Gap Human Object Interaction Detection Human Object Pair

July 30, 2024

July 25, 2024

ReCorD: Reasoning and Correcting Diffusion for HOI Generation
Jian-Yu Jiang-Lin, Kang-Yang Huang, Ling Lo, Yi-Ning Huang, Terence Lin, Jhih-Ciang Wu, Hong-Han Shuai, Wen-Huang Cheng
Generative Model Raw Data Complex Reasoning Image Generation Faithful Generation Latent Diffusion Model Human Object Interaction

July 19, 2024

Kinematics-based 3D Human-Object Interaction Reconstruction from Single View
Yuhang Chen, Chenxing Wang
Human Object Interaction Single View End Effector Forward Kinematics Hand Object Contact Contact Based Object Representation

July 17, 2024

July 16, 2024

CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation
Yisen Wang, Yao Teng, Limin Wang
Data Detection Faithful Generation Human Object Interaction Human Object Interaction Detection Cycle Consistency

Human Object Interaction

Papers

1st Place Solution to the 8th HANDS Workshop Challenge -- ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction

Highly dynamic physical interaction for robotics: design and control of an active remote center of compliance

A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors

A New People-Object Interaction Dataset and NVS Benchmarks

InterTrack: Tracking Human Object Interaction without Object Templates

A Review of Human-Object Interaction Detection

UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection

Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision

Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection

An analysis of HOI: using a training-free method with multimodal visual foundation models when only the test set is available, without the training set

Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection

A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap

Monocular Human-Object Reconstruction in the Wild

StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset

ReCorD: Reasoning and Correcting Diffusion for HOI Generation

Kinematics-based 3D Human-Object Interaction Reconstruction from Single View

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation