Different Embeddings
Different embeddings, numerical representations of data points, are crucial for various machine learning tasks, particularly in natural language processing and computer vision. Current research focuses on improving embedding quality through techniques like dimensionality reduction, domain adaptation (e.g., fine-tuning pre-trained models for specific domains), and novel architectures such as Siamese networks and those leveraging geometric algebra for convex optimization. These advancements aim to enhance the accuracy and robustness of downstream applications, including information retrieval, text summarization, and multimodal data fusion, while addressing challenges like adversarial attacks and modality shifts. The ultimate goal is to create more effective and reliable embeddings for a wide range of applications.
Papers
Pattern or Artifact? Interactively Exploring Embedding Quality with TRACE
Edith Heiter, Liesbet Martens, Ruth Seurinck, Martin Guilliams, Tijl De Bie, Yvan Saeys, Jefrey Lijffijt
Towards Understanding Domain Adapted Sentence Embeddings for Document Retrieval
Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Vansh Chhabra, Neeraj Gunda, Shashank Gautam, Subhadip Bandyopadhyay, Sai Krishna Bala