Computer Vision
Computer vision is the field concerned with enabling computers to "see" and interpret images and videos. It develops algorithms for tasks such as object detection, image classification, and scene understanding. Current research relies heavily on deep learning, particularly convolutional neural networks (CNNs) and vision transformers (ViTs), often combined with multi-modal fusion (integrating data from different sensors) and transfer learning to improve efficiency and accuracy. These advances are driving progress in applications including precision agriculture, robotics, medical imaging analysis, and autonomous systems, where they provide automated, efficient, and objective solutions to complex visual tasks.
Papers
Video-Based Rendering Techniques: A Survey
Rafael Kuffner dos Anjos, João Madeiras Pereira, José Antonio Gaspar
Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Jianhua Wu, Bingzhao Gao, Jincheng Gao, Jianhao Yu, Hongqing Chu, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen
CalliPaint: Chinese Calligraphy Inpainting with Diffusion Model
Qisheng Liao, Zhinuo Wang, Muhammad Abdul-Mageed, Gus Xia
Computer Vision for Increased Operative Efficiency via Identification of Instruments in the Neurosurgical Operating Room: A Proof-of-Concept Study
Tanner J. Zachem, Sully F. Chen, Vishal Venkatraman, David AW Sykes, Ravi Prakash, Koumani W. Ntowe, Mikhail A. Bethell, Samantha Spellicy, Alexander D Suarez, Weston Ross, Patrick J. Codd
Deep Metric Learning for Computer Vision: A Brief Overview
Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraj
Infrared Image Super-Resolution via GAN
Yongsong Huang, Shinichiro Omachi
Learning to Estimate Critical Gait Parameters from Single-View RGB Videos with Transformer-Based Attention Network
Quoc Hung T. Le, Hieu H. Pham
Study and Survey on Gesture Recognition Systems
Kshitij Deshpande, Varad Mashalkar, Kaustubh Mhaisekar, Amaan Naikwadi, Archana Ghotkar
RadioGalaxyNET: Dataset and Novel Computer Vision Algorithms for the Detection of Extended Radio Galaxies and Infrared Hosts
Nikhel Gupta, Zeeshan Hayder, Ray P. Norris, Minh Huynh, Lars Petersson
Adaptability of Computer Vision at the Tactical Edge: Addressing Environmental Uncertainty
Hayden Moore