Paper ID: 2411.06776
Machine vision-aware quality metrics for compressed image and video assessment
Mikhail Dremin (1), Konstantin Kozhemyakov (1), Ivan Molodetskikh (1), Malakhov Kirill (2), Artur Sagitov (2 and 3), Dmitriy Vatolin (1) ((1) Lomonosov Moscow State University, (2) Huawei Technologies Co., Ltd., (3) Independent Researcher Linjianping)
A main goal in developing video-compression algorithms is to enhance human-perceived visual quality while maintaining file size. But modern video-analysis efforts such as detection and recognition, which are integral to video surveillance and autonomous vehicles, involve so much data that they necessitate machine-vision processing with minimal human intervention. In such cases, the video codec must be optimized for machine vision. This paper explores the effects of compression on detection and recognition algorithms (objects, faces, and license plates) and introduces novel full-reference image/video-quality metrics for each task, tailored to machine vision. Experimental results indicate our proposed metrics correlate better with the machine-vision results for the respective tasks than do existing image/video-quality metrics.
Submitted: Nov 11, 2024