Content Based Feature

Content-based features, extracted from data such as text, images, or source code, are crucial for various machine learning tasks, aiming to represent the inherent information within the data for improved model performance. Current research focuses on developing effective feature extraction methods, often employing deep learning architectures like transformers and convolutional neural networks, and exploring techniques like feature embedding, selection, and disentanglement to enhance model accuracy and efficiency. These advancements have significant implications across diverse fields, improving applications ranging from fake news detection and medical image analysis to software defect prediction and video object segmentation.

Papers