Cross-Entropy Loss
Cross-entropy loss is a widely used objective function in machine learning, primarily for training classification models by minimizing the divergence between the predicted probability distribution and the true label distribution. Current research focuses on addressing its limitations in large-scale applications such as recommender systems and large language models, where scalable or reduced variants of cross-entropy are being developed to improve efficiency and memory usage. Other work explores alternative loss functions, or combinations with complementary methods (e.g., contrastive learning, Wasserstein loss), to improve model performance, calibration, and robustness, especially in settings with limited data or imbalanced classes. Together, these advances aim to improve the accuracy, efficiency, and reliability of a broad range of machine learning applications.
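For a one-hot target, the categorical cross-entropy reduces to the negative log-probability that the model's softmax assigns to the true class. The NumPy sketch below illustrates this for a single example; the function name and values are purely illustrative, not taken from any of the papers listed here.

```python
import numpy as np

def cross_entropy(logits, target_index):
    """Cross-entropy loss for one example, computed from raw logits.

    Uses the log-sum-exp trick for numerical stability:
    CE = -log softmax(logits)[target]
       = log(sum(exp(logits))) - logits[target]
    """
    shifted = logits - np.max(logits)              # stabilize exponentials
    log_sum_exp = np.log(np.sum(np.exp(shifted)))  # log of the softmax denominator
    return log_sum_exp - shifted[target_index]     # negative log-prob of true class

# Example: a 3-class prediction where class 1 is the true label.
logits = np.array([2.0, 1.0, 0.1])
print(cross_entropy(logits, target_index=1))       # ~1.42
```

In practice, frameworks compute this loss directly from logits (rather than from probabilities) for exactly this numerical-stability reason, and average it over a batch.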
Papers
PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions
Zhaoqi Leng, Mingxing Tan, Chenxi Liu, Ekin Dogus Cubuk, Xiaojie Shi, Shuyang Cheng, Dragomir Anguelov
Hybridised Loss Functions for Improved Neural Network Generalisation
Matthew C. Dickson, Anna S. Bosman, Katherine M. Malan