High Similarity
High similarity research focuses on developing methods to effectively measure and leverage similarities between data points, whether they are images, text, or neural network representations. Current research emphasizes the use of transformer models, graph neural networks, and various similarity metrics (e.g., cosine similarity, embedding similarity) to achieve this, often within the context of specific applications like image retrieval, anomaly detection, and multi-task learning. This work is significant because improved similarity assessment enhances the efficiency and accuracy of numerous machine learning tasks, impacting fields ranging from computer vision and natural language processing to copyright protection and personalized recommendations.
Papers
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models
Tianyu Fu, Tengxuan Liu, Qinghao Han, Guohao Dai, Shengen Yan, Huazhong Yang, Xuefei Ning, Yu Wang
Position Information Emerges in Causal Transformers Without Positional Encodings via Similarity of Nearby Embeddings
Chunsheng Zuo, Pavel Guerzhoy, Michael Guerzhoy
Interpretable Company Similarity with Sparse Autoencoders
Marco Molinari, Vladimir Tregubiak, Victor Shao, Abhimanyu Pandey, Mateusz Mikolajczak, Sebastião Kuznetsov Ryder Torres Pereira
TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity
Xi Cao, Quzong Gesang, Yuan Sun, Nuo Qun, Tashi Nyima