Cross-Entropy Loss
Cross-entropy loss is a widely used objective function in machine learning, primarily for training classification models: it measures the discrepancy between a model's predicted probability distribution and the true distribution, and training minimizes that discrepancy. Current research focuses on addressing its limitations, particularly in large-scale applications such as recommender systems and large language models, where scalable or reduced variants of cross-entropy are being developed to improve efficiency and memory usage. Research also explores alternative loss functions, or combinations with other methods (e.g., contrastive learning, Wasserstein loss), to enhance model performance, calibration, and robustness, especially in scenarios with limited data or imbalanced classes. These advancements have significant implications for improving the accuracy, efficiency, and reliability of a wide range of machine learning applications.
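Concretely, for a true distribution p and a predicted distribution q over C classes, the cross-entropy is H(p, q) = -sum_i p_i log q_i; with one-hot labels this reduces to the negative log-probability assigned to the correct class. The following is a minimal NumPy sketch of that reduction, not the implementation used by any particular library; the function name and the epsilon clipping are illustrative choices.

```python
import numpy as np

def cross_entropy(probs: np.ndarray, targets: np.ndarray) -> float:
    """Mean cross-entropy between predicted probabilities and integer labels.

    probs:   (N, C) array of predicted class probabilities (rows sum to 1).
    targets: (N,) array of true class indices.
    """
    eps = 1e-12  # clip to avoid log(0) for zero-probability predictions
    # Pick out the probability each row assigns to its true class.
    picked = probs[np.arange(len(targets)), targets]
    return float(-np.mean(np.log(np.clip(picked, eps, 1.0))))

# A confident correct prediction yields a small loss; an uncertain one,
# a larger loss.
probs = np.array([[0.9, 0.05, 0.05],
                  [0.3, 0.4,  0.3]])
targets = np.array([0, 1])
print(cross_entropy(probs, targets))  # (-ln 0.9 - ln 0.4) / 2, about 0.51
```

In practice, frameworks fuse the softmax and the log into a single numerically stable operation (e.g., PyTorch's `torch.nn.CrossEntropyLoss` takes raw logits rather than probabilities), and the scalable variants mentioned above target exactly this step when the number of classes is very large.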