Text to SQL
Text-to-SQL research aims to automatically translate natural language queries into structured SQL queries, enabling non-experts to access and analyze data in relational databases. Current research heavily utilizes large language models (LLMs), focusing on improving accuracy through techniques like chain-of-thought prompting, multi-agent systems, and data augmentation strategies to address challenges such as complex schemas and ambiguous queries. This field is significant because it democratizes data access, impacting various sectors by streamlining data analysis and reducing the reliance on specialized SQL knowledge. Furthermore, ongoing work addresses the need for more robust evaluation metrics and benchmarks that reflect real-world complexities, particularly within enterprise settings.
Papers
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong, Zheng Yuan, Qinggang Zhang, Hao Chen, Junnan Dong, Feiran Huang, Xiao Huang
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain
Rahul Kumar, Amar Raja Dibbu, Shrutendra Harsola, Vignesh Subrahmaniam, Ashutosh Modi
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
Hanchong Zhang, Ruisheng Cao, Hongshen Xu, Lu Chen, Kai Yu
Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large Language Models
Xiaojun Chen, Tianle Wang, Tianhao Qiu, Jianbin Qin, Min Yang
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records
Gyubok Lee, Sunjun Kweon, Seongsu Bae, Edward Choi