Table Representation
Table representation research focuses on effectively encoding tabular data for machine learning and natural language processing tasks, aiming to improve information extraction, question answering, and data manipulation. Current efforts concentrate on developing flexible formats adaptable to diverse table structures and incorporating contextual information like headers and surrounding text, often leveraging large language models (LLMs) and graph neural networks (GNNs) for enhanced representation learning. These advancements are significant because they enable more robust and efficient processing of tabular data, impacting diverse fields from document understanding and database management to scientific knowledge discovery and spreadsheet automation.
Papers
Enhancing Table Representations with LLM-powered Synthetic Data Generation
Dayu Yang, Natawut Monaikul, Amanda Ding, Bozhao Tan, Kishore Mosaliganti, Giri Iyengar
TableGPT2: A Large Multimodal Model with Tabular Data Integration
Aofeng Su, Aowen Wang, Chao Ye, Chen Zhou, Ga Zhang, Guangcheng Zhu, Haobo Wang, Haokai Xu, Hao Chen, Haoze Li, Haoxuan Lan, Jiaming Tian, Jing Yuan, Junbo Zhao, Junlin Zhou, Kaizhe Shou, Liangyu Zha, Lin Long, Liyao Li, Pengzuo Wu, Qi Zhang, Qingyi Huang, Saisai Yang, Tao Zhang, Wentao Ye, Wufang Zhu, Xiaomeng Hu, Xijun Gu, Xinjie Sun, Xiang Li, Yuhang Yang, Zhiqing Xiao