Text to SQL
Text-to-SQL research aims to automatically translate natural language queries into structured SQL queries, enabling non-experts to access and analyze data in relational databases. Current research heavily utilizes large language models (LLMs), focusing on improving accuracy through techniques like chain-of-thought prompting, multi-agent systems, and data augmentation strategies to address challenges such as complex schemas and ambiguous queries. This field is significant because it democratizes data access, impacting various sectors by streamlining data analysis and reducing the reliance on specialized SQL knowledge. Furthermore, ongoing work addresses the need for more robust evaluation metrics and benchmarks that reflect real-world complexities, particularly within enterprise settings.
Papers
UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Wuwei Lan, Zhiguo Wang, Anuj Chauhan, Henghui Zhu, Alexander Li, Jiang Guo, Sheng Zhang, Chung-Wei Hang, Joseph Lilien, Yiqun Hu, Lin Pan, Mingwen Dong, Jun Wang, Jiarong Jiang, Stephen Ash, Vittorio Castelli, Patrick Ng, Bing Xiang
Uncovering and Categorizing Social Biases in Text-to-SQL
Yan Liu, Yan Gao, Zhe Su, Xiaokang Chen, Elliott Ash, Jian-Guang Lou
CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset
Hanchong Zhang, Jieyu Li, Lu Chen, Ruisheng Cao, Yunyan Zhang, Yu Huang, Yefeng Zheng, Kai Yu