Paper ID: 2503.21602 • Published Mar 27, 2025
GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise
Karime Maamari, Connor Landy, Amine Mhedhbi
Distyl AI•Polytechnique Montreal
TL;DR
Get AI-generated summaries with premium
Get AI-generated summaries with premium
Recent advancements in Text-to-SQL, driven by large language models, are
democratizing data access. Despite these advancements, enterprise deployments
remain challenging due to the need to capture business-specific knowledge,
handle complex queries, and meet expectations of continuous improvements. To
address these issues, we designed and implemented GenEdit: our Text-to-SQL
generation system that improves with user feedback. GenEdit builds and
maintains a company-specific knowledge set, employs a pipeline of operators
decomposing SQL generation, and uses feedback to update its knowledge set to
improve future SQL generations.
We describe GenEdit's architecture made of two core modules: (i) decomposed
SQL generation; and (ii) knowledge set edits based on user feedback. For
generation, GenEdit leverages compounding operators to improve knowledge
retrieval and to create a plan as chain-of-thought steps that guides
generation. GenEdit first retrieves relevant examples in an initial retrieval
stage where original SQL queries are decomposed into sub-statements, clauses or
sub-queries. It then also retrieves instructions and schema elements. Using the
retrieved contextual information, GenEdit then generates step-by-step plan in
natural language on how to produce the query. Finally, GenEdit uses the plan to
generate SQL, minimizing the need for model reasoning, which enhances complex
SQL generation. If necessary, GenEdit regenerates the query based on syntactic
and semantic errors. The knowledge set edits are recommended through an
interactive copilot, allowing users to iterate on their feedback and to
regenerate SQL queries as needed. Each generation uses staged edits which
update the generation prompt. Once the feedback is submitted, it gets merged
after passing regression testing and obtaining an approval, improving future
generations.
Figures & Tables
Unlock access to paper figures and tables to enhance your research experience.