Egocentric Video
Egocentric video, captured from a first-person perspective, is reshaping computer vision by enabling the analysis of human activities and interactions in their natural context. Current research focuses on robust multimodal models, often built on transformer architectures and large language models, that understand and generate information from egocentric video, addressing challenges such as motion estimation, action recognition, and affordance prediction. The field matters for embodied AI and human-computer interaction, with applications ranging from assistive technologies and virtual reality to robotics and the study of human behavior. Progress is further driven by large-scale datasets and standardized evaluation metrics.
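To make the multimodal pretraining idea concrete: video-language pretraining of the kind listed below typically aligns video clips with their text narrations via a contrastive objective. The sketch that follows is a minimal, generic InfoNCE-style formulation, not the exact objective of any paper here; the encoder dimensions, class name, and temperature value are illustrative assumptions.

```python
# Minimal sketch of video-text contrastive pretraining (InfoNCE-style).
# Assumes pre-extracted per-clip video features and per-narration text
# features; all names and dimensions are illustrative, not from the papers.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VideoTextContrastive(nn.Module):
    def __init__(self, video_dim=768, text_dim=512, embed_dim=256, temperature=0.07):
        super().__init__()
        # Project each modality into a shared embedding space.
        self.video_proj = nn.Linear(video_dim, embed_dim)
        self.text_proj = nn.Linear(text_dim, embed_dim)
        self.temperature = temperature

    def forward(self, video_feats, text_feats):
        # L2-normalize so dot products are cosine similarities.
        v = F.normalize(self.video_proj(video_feats), dim=-1)
        t = F.normalize(self.text_proj(text_feats), dim=-1)
        logits = v @ t.T / self.temperature        # (batch, batch) similarities
        targets = torch.arange(len(v), device=v.device)  # matched pairs on the diagonal
        # Symmetric cross-entropy: video-to-text and text-to-video retrieval.
        return (F.cross_entropy(logits, targets)
                + F.cross_entropy(logits.T, targets)) / 2

# Usage with random stand-in features for a batch of 8 clip-narration pairs.
model = VideoTextContrastive()
loss = model(torch.randn(8, 768), torch.randn(8, 512))
loss.backward()
```

Each clip is pulled toward its own narration and pushed away from the other narrations in the batch, which is what makes the learned embeddings useful for downstream retrieval and recognition tasks.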
Papers
Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou
Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Rui Yan, Eric Zhongcong Xu, Rongcheng Tu, Yanru Zhu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Wei Liu, Mike Zheng Shou