Near Duplicate
Near-duplicate detection focuses on identifying highly similar items, whether images, videos, text, or code, across vast datasets. Current research emphasizes developing robust algorithms and model architectures, such as Siamese networks and vision transformers, to effectively capture subtle semantic similarities beyond exact matches, often incorporating techniques like embedding refinement and graph-theoretic approaches. This field is crucial for managing large datasets, mitigating copyright infringement, improving search and recommendation systems, and ensuring fair evaluation in machine learning, with applications ranging from biometric security to software development and online learning platforms.
Papers
November 14, 2024
October 25, 2024
August 14, 2024
July 11, 2024
June 10, 2024
April 17, 2024
April 15, 2024
March 19, 2024
February 15, 2024
January 25, 2024
January 10, 2024
December 22, 2023
December 12, 2023
December 6, 2023
October 20, 2023
September 27, 2023
September 19, 2023
July 16, 2023
April 10, 2023