Feature Selection
Feature selection aims to identify the most relevant subset of features from a larger dataset, improving model performance, interpretability, and efficiency. Current research emphasizes developing novel algorithms, including those based on neural networks (e.g., RelChaNet), genetic algorithms, and large language models (LLMs), to select features effectively and efficiently, often incorporating techniques like causal inference and uncertainty quantification. These advancements are crucial for various applications, such as medical diagnosis, financial prediction, and recommender systems, where reducing dimensionality and improving model explainability are paramount. The field is also actively exploring new evaluation metrics and addressing challenges like fairness and privacy in feature selection.
Papers
An information theoretic approach to quantify the stability of feature selection and ranking algorithms
Alaiz-Rodriguez, R., Parnell, A. C
A comparative study on feature selection for a risk prediction model for colorectal cancer
N. Cueto-López, M. T. García-Ordás, V. Dávila-Batista, V. Moreno, N. Aragonés, R. Alaiz-Rodríguez
LLpowershap: Logistic Loss-based Automated Shapley Values Feature Selection Method
Iqbal Madakkatel, Elina Hyppönen
Feature Selection via Robust Weighted Score for High Dimensional Binary Class-Imbalanced Gene Expression Data
Zardad Khan, Amjad Ali, Saeed Aldahmani
Binary Feature Mask Optimization for Feature Selection
Mehmet E. Lorasdagi, Mehmet Y. Turali, Ali T. Koc, Suleyman S. Kozat
Analysing the Needs of Homeless People Using Feature Selection and Mining Association Rules
José M. Alcalde-Llergo, Carlos García-Martínez, Manuel Vaquero-Abellán, Pilar Aparicio-Martínez, Enrique Yeguas-Bolívar
Cost-sensitive Feature Selection for Support Vector Machines
Sandra Benítez-Peña, Rafael Blanquero, Emilio Carrizosa, Pepa Ramírez-Cobo
Feature Selection via Maximizing Distances between Class Conditional Distributions
Chunxu Cao, Qiang Zhang
A Contrast Based Feature Selection Algorithm for High-dimensional Data set in Machine Learning
Chunxu Cao, Qiang Zhang