Polyglot File

Polyglot files, encompassing data valid in multiple formats or languages, present challenges and opportunities across diverse fields. Research focuses on improving the detection of malicious polyglot files used to evade security systems, employing machine learning models to achieve high accuracy in identification and sanitization. Simultaneously, advancements in multilingual large language models and multimodal deepfake datasets are exploring how to leverage polyglot capabilities for improved cross-lingual understanding and the detection of sophisticated AI-generated media, while also addressing inherent biases in information retrieval. These efforts are crucial for enhancing cybersecurity, advancing AI capabilities, and promoting equitable access to information across languages.

Papers