Text-to-SQL
Text-to-SQL research aims to automatically translate natural language questions into SQL queries, so that non-experts can access and analyze data in relational databases. Current research relies heavily on large language models (LLMs) and focuses on improving accuracy through techniques such as chain-of-thought prompting, multi-agent systems, and data augmentation, which target challenges like complex schemas and ambiguous questions. The field matters because it broadens data access: it streamlines data analysis and reduces reliance on specialized SQL knowledge. Ongoing work also addresses the need for more robust evaluation metrics and benchmarks that reflect real-world complexity, particularly in enterprise settings.
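To make the task and its evaluation concrete, here is a minimal sketch in Python using the standard-library sqlite3 module. It is not taken from any of the papers listed here. The schema, the rows, the question, and both SQL strings are hypothetical, and PREDICTED_SQL is a hard-coded stand-in for what a model might generate. The comparison shown, running both queries and checking that they return the same rows, is one common way to score a prediction; it is assumed here for illustration, and a real benchmark would define its own metric.

```python
import sqlite3

# Hypothetical schema and rows, invented for illustration only.
SCHEMA = """
CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, dept TEXT, salary REAL);
INSERT INTO employees VALUES
    (1, 'Ada', 'Engineering', 120.0),
    (2, 'Grace', 'Engineering', 130.0),
    (3, 'Linus', 'Sales', 90.0);
"""

QUESTION = "What is the average salary in each department?"
GOLD_SQL = "SELECT dept, AVG(salary) FROM employees GROUP BY dept"
# Stand-in for a model's output; a real system would generate this string.
PREDICTED_SQL = "SELECT dept, SUM(salary) / COUNT(*) FROM employees GROUP BY dept"


def build_db():
    """Create an in-memory SQLite database holding the toy schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def run_query(conn, sql):
    """Return the result rows as a sorted list, or None if the SQL fails."""
    try:
        return sorted(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_match(conn, predicted_sql, gold_sql):
    """True when both queries run and return the same rows (order ignored)."""
    predicted = run_query(conn, predicted_sql)
    gold = run_query(conn, gold_sql)
    return predicted is not None and predicted == gold


if __name__ == "__main__":
    conn = build_db()
    print(QUESTION)
    print(execution_match(conn, PREDICTED_SQL, GOLD_SQL))
```

The two queries differ as text but return the same rows on this data, which is why comparing results can be more forgiving than comparing SQL strings. The reverse risk also exists: two queries can agree on one small database and still differ in general, which is part of the motivation for more robust metrics and benchmarks.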
Papers
Know What I don't Know: Handling Ambiguous and Unanswerable Questions for Text-to-SQL
Bing Wang, Yan Gao, Zhoujun Li, Jian-Guang Lou
Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Yiyun Zhao, Jiarong Jiang, Yiqun Hu, Wuwei Lan, Henry Zhu, Anuj Chauhan, Alexander Li, Lin Pan, Jun Wang, Chung-Wei Hang, Sheng Zhang, Marvin Dong, Joe Lilien, Patrick Ng, Zhiguo Wang, Vittorio Castelli, Bing Xiang
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play
Qi Liu, Zihuiwen Ye, Tao Yu, Phil Blunsom, Linfeng Song
STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing
Zefeng Cai, Xiangyu Li, Binyuan Hui, Min Yang, Bowen Li, Binhua Li, Zheng Cao, Weijie Li, Fei Huang, Luo Si, Yongbin Li