Arbitrary Object

Arbitrary object processing in computer vision aims to develop algorithms that can understand, manipulate, and reason about objects of any type, including categories not seen in training data. Current research focuses on robust models, often built on transformer architectures and diffusion models, for accurate object detection, segmentation, pose estimation, and manipulation in diverse and complex scenes, including scenes with occlusions and interactions between multiple objects. These advances matter for robotics, autonomous systems, and augmented/virtual reality applications, where they enable more flexible and adaptable interaction with the physical world. The need for efficient and generalizable methods is also driving innovation in self-supervised learning and knowledge distillation techniques.

272 papers

Papers - Page 10

December 13, 2023

Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
Unveiling Camouflaged Object Fine Grained Object Fine Grained Vision Language Expression Segmentation Object Centric Task Arbitrary Object

December 11, 2023

December 8, 2023

SKT-Hang: Hanging Everyday Objects via Object-Agnostic Semantic Keypoint Trajectory Generation
Semantic Keypoint Arbitrary Object

December 5, 2023

November 30, 2023

HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video
Hand Interaction Hand Object Reconstruction Source Video Arbitrary Object 3D Hand

November 28, 2023

Agents meet OKR: An Object and Key Results Driven Agent System with Hierarchical Self-Collaboration and Self-Evaluation
Self Feedback Reflex Prediction Robust Task Agent System Arbitrary Object Hierarchical Agent Agent Smith Bionic Reflex

November 23, 2023

November 12, 2023

Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding
Multi View Context Information Multiple View Language Grounding Arbitrary Object

November 9, 2023

Reconstructing Objects in-the-wild for Realistic Sensor Simulation
Sensor Simulation View Synthesis Sensor Data Object Reconstruction Sparse Input View Arbitrary Object

November 5, 2023

Rotation Invariant Transformer for Recognizing Object in UAVs
Various Fast Moving Drone Arbitrary Object Rotation Augmentation Vision Transformer Architecture

October 29, 2023

Women Wearing Lipstick: Measuring the Bias Between an Object and Its Related Gender
Gender Prediction Gender Information Absolute Stance Bias Arbitrary Object Multi Temporal Lip Audio Memory Female Speaker Image Captioning

October 27, 2023

Learning to recognize occluded and small objects with partial inputs
Masked Supervised Learning Partial Input LeArning Abstract Small Object Multi Label Multi Label Image Recognition Arbitrary Object

October 26, 2023

6-DoF Stability Field via Diffusion Models
Stable Pose Object Placement Arbitrary Object 6 DoF Motion Prediction Robot Manipulation Diffusion Model

October 20, 2023

Higher or Lower: Challenges in Object based SLAM
SLAM Framework SLAM Algorithm Pin Slam Object Feature Object SLAM Arbitrary Object Technical Challenge

October 19, 2023

Putting the Object Back into Video Object Segmentation
Video Object Segmentation Implicit Memory Arbitrary Object Pixel Level

October 18, 2023

Papers - Page 10

Encoding Surgical Videos as Latent Spatiotemporal Graphs for Object and Anatomy-Driven Reasoning

Learning Polynomial Representations of Physical Objects with Application to Certifying Correct Packing Configurations

Fine-grained Controllable Video Generation via Object Appearance and Context

SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints

FViT-Grasp: Grasping Objects With Using Fast Vision Transformers

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

REVAMP: Automated Simulations of Adversarial Attacks on Arbitrary Objects in Realistic Scenes

Forward Kinematics of Object Transporting by a Multi-Robot System with a Deformable Sheet