One Hot Encoding
One-hot encoding is a common technique for representing categorical data as numerical vectors, crucial for machine learning algorithms that require numerical input. Recent research focuses on addressing its limitations, such as high dimensionality and inability to capture semantic relationships between categories, exploring alternatives like hierarchical encodings, dense vector representations, and incorporating label uncertainty. These efforts aim to improve model performance, particularly in tasks involving complex categorical data or out-of-vocabulary items, and enhance robustness and generalization capabilities across various applications.
Papers
November 1, 2024
April 28, 2024
March 20, 2024
December 28, 2023
November 10, 2023
September 29, 2023
August 1, 2023
June 30, 2023
April 26, 2023
June 15, 2022
May 30, 2022