Computer Vision

Computer vision, a field focused on enabling computers to "see" and interpret images and videos, aims to develop algorithms that can perform tasks such as object detection, image classification, and scene understanding. Current research heavily utilizes deep learning, particularly convolutional neural networks (CNNs) and vision transformers (ViTs), often combined with techniques like multi-modal fusion (integrating data from different sensors) and transfer learning to improve efficiency and accuracy. These advancements are driving significant progress in diverse applications, including precision agriculture, robotics, medical imaging analysis, and autonomous systems, by providing automated, efficient, and objective solutions to complex visual tasks.

1005papers

Papers - Page 13

August 5, 2024

Tensorial template matching for fast cross-correlation with rotations and its application for tomography
Antonio Martinez-Sanchez, Ulrike Homberg, José María Almira, Harold Phelippeau
Tensorial Template Application Proficiency Template Matching Computer Vision Cross Correlation

August 2, 2024

Autonomous Integration of Bench-Top Wet Lab Equipment
Zachary Logan, Kam Undieh, Mohammad Goli
Laboratory Automation Color Detection Circle Detection Computer Vision Autonomous Framework

August 1, 2024

July 31, 2024

Segment Anything for Videos: A Systematic Survey
Chunhui Zhang, Yawen Cui, Weilin Lin, Guanjie Huang, Yan Rong, Li Liu, Shiguang Shan
Gameplay Video Segment Anything Comprehensive Survey Multi Modal Segment Anything Model Visual Foundation Model Video Understanding Computer Vision

July 29, 2024

July 25, 2024

July 23, 2024

3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images
Jie Zhao, Jianing Li, Weihan Chen, Wentong Wang, Pengfei Yuan, Xu Zhang, Deshu Peng
RGB Image Spatial Temporal Graph Human Pose Skeleton Sequence 3D Human Pose Estimation Computer Vision Graph Convolutional Network

July 18, 2024

July 16, 2024

July 15, 2024

July 14, 2024

Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi, Yuchen Zhu, Sibei Yang
3D Object Detection Computer Vision Large Vision Model

Computer Vision

Papers - Page 13

Tensorial template matching for fast cross-correlation with rotations and its application for tomography

Autonomous Integration of Bench-Top Wet Lab Equipment

Explainable Emotion Decoding for Human and Computer Vision

Improving Image De-raining Using Reference-Guided Transformers

Segment Anything for Videos: A Systematic Survey

Global Structure-from-Motion Revisited

More precise edge detections

TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework

Mpox Detection Advanced: Rapid Epidemic Response Through Synthetic Data

3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images

GroupMamba: Efficient Group-Based Visual State Space Model

Beyond Augmentation: Empowering Model Robustness under Extreme Capture Environments

CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference

CerberusDet: Unified Multi-Dataset Object Detection

Motion and Structure from Event-based Normal Flow

Learning to Make Keypoints Sub-Pixel Accurate

TCFormer: Visual Recognition via Token Clustering Transformer

Interpreting Hand gestures using Object Detection and Digits Classification

Detecting Omissions in Geographic Maps through Computer Vision

Plain-Det: A Plain Multi-Dataset Object Detector