Computer Vision
Computer vision, a field focused on enabling computers to "see" and interpret images and videos, aims to develop algorithms that can perform tasks such as object detection, image classification, and scene understanding. Current research heavily utilizes deep learning, particularly convolutional neural networks (CNNs) and vision transformers (ViTs), often combined with techniques like multi-modal fusion (integrating data from different sensors) and transfer learning to improve efficiency and accuracy. These advancements are driving significant progress in diverse applications, including precision agriculture, robotics, medical imaging analysis, and autonomous systems, by providing automated, efficient, and objective solutions to complex visual tasks.
Papers
The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation
Christel Chappuis, Eliot Walt, Vincent Mendez, Sylvain Lobry, Bertrand Le Saux, Devis Tuia
Large Language Models Meet Computer Vision: A Brief Survey
Raby Hamadi
Integration of Robotics, Computer Vision, and Algorithm Design: A Chinese Poker Self-Playing Robot
Kuan-Huang Yu
Next-gen traffic surveillance: AI-assisted mobile traffic violation detection system
Dila Dede, Mehmet Ali Sarsıl, Ata Shaker, Olgu Altıntaş, Onur Ergen
UniHPE: Towards Unified Human Pose Estimation via Contrastive Learning
Zhongyu Jiang, Wenhao Chai, Lei Li, Zhuoran Zhou, Cheng-Yen Yang, Jenq-Neng Hwang
Trainwreck: A damaging adversarial attack on image classifiers
Jan Zahálka
Exploring Lip Segmentation Techniques in Computer Vision: A Comparative Analysis
Pietro B. S. Masur, Francisco Braulio Oliveira, Lucas Moreira Medino, Emanuel Huber, Milene Haraguchi Padilha, Cassio de Alcantara, Renata Sellaro
PhytNet -- Tailored Convolutional Neural Networks for Custom Botanical Data
Jamie R. Sykes, Katherine Denby, Daniel W. Franks
Cross-view and Cross-pose Completion for 3D Human Understanding
Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez
Applications of Computer Vision in Autonomous Vehicles: Methods, Challenges and Future Directions
Xingshuai Dong, Massimiliano L. Cappuccio