Data Representation
Data representation research focuses on developing effective methods to encode information for machine learning tasks, aiming to improve model accuracy, efficiency, and fairness while preserving crucial data characteristics. Current efforts concentrate on optimizing data representations for various modalities (text, images, time series) using architectures like transformers and autoencoders, and exploring novel approaches such as binary and hybrid representations, along with techniques to mitigate biases and enhance privacy. These advancements have significant implications for diverse fields, improving the performance of models in applications ranging from natural language processing and computer vision to healthcare and finance.
Papers
Comparison of Data Representations and Machine Learning Architectures for User Identification on Arbitrary Motion Sequences
Christian Schell, Andreas Hotho, Marc Erich Latoschik
Metric Distribution to Vector: Constructing Data Representation via Broad-Scale Discrepancies
Xue Liu, Dan Sun, Xiaobo Cao, Hao Ye, Wei Wei