Long Tail

The "long tail" problem in machine learning is the challenge of achieving robust performance on rare or under-represented data points, which are common in real-world datasets. Current research aims to improve model accuracy and generalization on these infrequent instances using techniques such as parameter-efficient fine-tuning, mixture-of-experts models, knowledge distillation, and contrastive learning, often combined with large language models or generative models. Addressing the long tail is crucial for building reliable and fair AI systems in applications ranging from autonomous driving and medical diagnosis to e-commerce and natural language processing, because models must perform well not only on common scenarios but also on critical, less frequent ones.
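
As a rough illustration of why aggregate accuracy can hide poor performance on rare cases, the sketch below reports accuracy separately for frequent ("head") and rare ("tail") classes. The function names, the `tail_threshold` cutoff, and the toy labels are all hypothetical and chosen only for this example; none of them comes from the papers listed here.

```python
from collections import Counter


def per_class_accuracy(y_true, y_pred):
    """Return {class: accuracy}, computed separately for each true class."""
    total = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: correct[c] / total[c] for c in total}


def head_tail_report(y_true, y_pred, tail_threshold=5):
    """Average per-class accuracy over head and tail classes.

    tail_threshold is an assumed cutoff: classes with at most this many
    examples count as tail classes, all others as head classes.
    """
    total = Counter(y_true)
    acc = per_class_accuracy(y_true, y_pred)
    head = [acc[c] for c in total if total[c] > tail_threshold]
    tail = [acc[c] for c in total if total[c] <= tail_threshold]
    overall = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

    def mean(xs):
        return sum(xs) / len(xs) if xs else float("nan")

    return {"overall": overall, "head": mean(head), "tail": mean(tail)}


# Toy labels: one frequent class and two rare ones.
y_true = ["car"] * 90 + ["deer"] * 5 + ["cone"] * 5
# A hypothetical model that predicts the frequent class for every input.
y_pred = ["car"] * 100

report = head_tail_report(y_true, y_pred)
print(report)  # {'overall': 0.9, 'head': 1.0, 'tail': 0.0}
```

In this toy case the overall accuracy is 90% even though every rare-class example is misclassified, which is the kind of gap that long-tail methods aim to close.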

Papers

April 25, 2023

GARCIA: Powering Representations of Long-tail Query with Multi-granularity Contrastive Learning
Weifan Wang, Binbin Hu, Zhicheng Peng, Mingjie Zhong, Zhiqiang Zhang, Zhongyi Liu, Guannan Zhang, Jun Zhou
Contrastive Learning Long Tail Generalizable Representation Intention Information

March 31, 2023

SuperDisco: Super-Class Discovery Improves Visual Recognition for the Long-Tail
Yingjun Du, Jiayi Shen, Xiantong Zhen, Cees G. M. Snoek
Visual Recognition Long Tailed Recognition Novel Class Discovery Long Tail

March 29, 2023

FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-tail Trajectory Prediction
Yuning Wang, Pu Zhang, Lei Bai, Jianru Xue
Contrastive Learning Trajectory Prediction Long Tail Long Term Trajectory Prediction

March 23, 2023

Temperature Schedules for Self-Supervised Contrastive Methods on Long-Tail Data
Anna Kukleva, Moritz Böhle, Bernt Schiele, Hilde Kuehne, Christian Rupprecht
Self Supervised Learning Contrastive Loss Learned Representation Long Tail Instance Discrimination

November 20, 2022

When Noisy Labels Meet Long Tail Dilemmas: A Representation Calibration Method
Manyi Zhang, Xuyang Zhao, Jun Yao, Chun Yuan, Weiran Huang
Contrastive Learning Representation Learning Noisy Label Robust Representation Long Tail Apprenticeship Learning Representation Calibration

November 7, 2022

PeSOTIF: a Challenging Visual Dataset for Perception SOTIF Problems in Long-tail Traffic Scenarios
Liang Peng, Jun Li, Wenbo Shao, Hong Wang
Autonomous Driving Long Tail Perception Algorithm Safety of the Intended Functionality

October 3, 2022

The Long Tail of Context: Does it Exist and Matter?
Konstantin Bauman, Alexey Vasilev, Alexander Tuzhilin
Recommender System Context Information Recommendation Performance Long Tail Contextual Variable

August 22, 2022

LTE4G: Long-Tail Experts for Graph Neural Networks
Sukwon Yun, Kibum Kim, Kanghoon Yoon, Chanyoung Park
Graph Neural Network Long Tail Long Tailed Data Tail Class Degree Distribution Graph Imbalance

July 27, 2022

Identifying Hard Noise in Long-Tailed Sample Distribution
Xuanyu Yi, Kaihua Tang, Xian-Sheng Hua, Joo-Hwee Lim, Hanwang Zhang
Industrial Disturbing Noise Long Tail Long Tailed Distribution Long Tailed Classification De Noising

June 8, 2022

How unfair is private learning?
Amartya Sanyal, Yaxi Hu, Fanny Yang
Machine Learning Algorithm Sensitive Data Private Learning Long Tail Privacy Level Accurate Learning

March 11, 2022

Can I see an Example? Active Learning the Long Tail of Attributes and Relations
Tyler L. Hayes, Maximilian Nickel, Christopher Kanan, Ludovic Denoyer, Arthur Szlam
Active Learning Single Example Long Tail Rich Attribute Inter Relation Visual Genome Relation Annotation

February 27, 2022

Taming the Long Tail of Deep Probabilistic Forecasting
Jedrzej Kozerawski, Mayank Sharan, Rose Yu
Probabilistic Forecast Heavy Tailed Long Tail Pareto Optimality Deep Probabilistic Forecasting Kurtosis Concentration

December 9, 2021

Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior
Davis Rempe, Jonah Philion, Leonidas J. Guibas, Sanja Fidler, Or Litany
Autonomous Vehicle Multi Scenario Long Tail Scenario Generation Explanation Plausibility Traffic Flow Learning Conditional VAEs

November 27, 2021

Targeted Supervised Contrastive Learning for Long-Tailed Recognition
Tianhong Li, Peng Cao, Yuan Yuan, Lijie Fan, Yuzhe Yang, Rogerio Feris, Piotr Indyk, Dina Katabi
Contrastive Learning Supervised Contrastive Learning Long Tailed Recognition Long Tail