Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions

@article{Zhang2019EditingBasedSQ,
  title={Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions},
  author={Rui Zhang and Tao Yu and He Yang Er and Sungrok Shim and Eric Xue and Xi Victoria Lin and Tianze Shi and Caiming Xiong and Richard Socher and Dragomir R. Radev},
  journal={ArXiv},
  year={2019},
  volume={abs/1909.00786}
}
We focus on the cross-domain context-dependent text-to-SQL generation task. Based on the observation that adjacent natural language questions are often linguistically dependent and their corresponding SQL queries tend to overlap, we utilize the interaction history by editing the previous predicted query to improve the generation quality. Our editing mechanism views SQL as sequences and reuses generation results at the token level in a simple manner. It is flexible to change individual tokens… Expand
PG-GSQL: Pointer-Generator Network with Guide Decoding for Cross-Domain Context-Dependent Text-to-SQL Generation
TLDR
An encoder-decoder model called PG-GSQL based on the interaction-level encoder and with two effective innovations in decoder to solve cross-domain context-dependent text-to-SQL generation task is presented. Expand
Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing
TLDR
Two kinds of interaction states are defined based on schema items and SQL keywords separately, and a relational graph neural network and a non-linear layer are designed to update the representations of these two states respectively. Expand
IGSQL: Database Schema Interaction Graph Based Neural Model for Context-Dependent Text-to-SQL Generation
TLDR
The proposed database schema interaction graph encoder uses a gate mechanism to weigh the importance of different vocabularies and then make the prediction of SQL tokens and achieves new state-of-the-art results on the two datasets. Expand
CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases
TLDR
CoSQL is presented, a corpus for building cross-domain, general-purpose database (DB) querying dialogue systems that includes SQL-grounded dialogue state tracking, response generation from query results, and user dialogue act prediction and a set of strong baselines are evaluated. Expand
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing
TLDR
This work presents BRIDGE, a powerful sequential architecture for modeling dependencies between natural language questions and relational databases in cross-DB semantic parsing that effectively captures the desired cross-modal dependencies and has the potential to generalize to more text-DB related tasks. Expand
RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases
TLDR
A neural network approach called RYANSQL (Recursively Yielding Annotation Network for SQL) is presented to solve complex Text-to-SQL tasks for cross-domain databases to improve generation performance further. Expand
SPARQLing Database Queries from Intermediate Question Decompositions
TLDR
This work observes that the execution accuracy of queries constructed by the model on the challenging Spider dataset is comparable with the state-of-the-art text-to-SQL methods trained with annotated SQL queries. Expand
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers
TLDR
It is shown that KaggleDBQA presents a challenge to state-ofthe-art zero-shot parsers but a more realistic evaluation setting and creative use of associated database documentation boosts their accuracy by over 13.2%, doubling their performance. Expand
GuideSQL: Utilizing Tables to Guide the Prediction of Columns for Text-to-SQL Generation
TLDR
A novel approach to text-to-SQL called GuideSQL is presented which predicts tables first and uses a pruning algorithm for removing the columns which don’t belong to the predicted tables to avoid errors caused by misprediction of table-column dependencies. Expand
An Interactive NL2SQL Approach with Reuse Strategy
TLDR
A new model is proposed which first generates an SQL grammar tree, called Tree-SQL, as the intermediate representation, and then infers an SQL query from the Tree- SQL with domain knowledge, and for semantic dependency among context-dependent questions, a reuse strategy that assigns a probability for each sub-tree of historicaltree-SQLs is proposed. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 57 REFERENCES
Recursive and Clause-Wise Decoding for Complex and Cross-Domain Text-to-SQL Generation
TLDR
This paper proposes a SQL clause-wise decoding neural architecture with a self-attention based database schema encoder to address Spider task, and shows that the model is significantly more effective to predict complex and nested queries than previous works. Expand
Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation
TLDR
The proposed IRNet aims to address two challenges: the mismatch between intents expressed in natural language (NL) and the implementation details in SQL and the challenge in predicting columns caused by the large number of out-of-domain words. Expand
DialSQL: Dialogue Based Structured Query Generation
TLDR
DialSQL is a dialogue-based structured query generation framework that leverages human intelligence to boost the performance of existing algorithms via user interaction and is capable of identifying potential errors in a generated SQL query and asking users for validation via simple multi-choice questions. Expand
SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task
TLDR
Experimental results show that SyntaxSQLNet can handle a significantly greater number of complex SQL examples than prior work, outperforming the previous state-of-the-art model by 9.5% in exact matching accuracy. Expand
Semantic Parsing with Syntax- and Table-Aware SQL Generation
TLDR
The quality of the generated SQL query is significantly improved through learning to replicate content from column names, cells or SQL keywords; and improving the generation of WHERE clause by leveraging the column-cell relation. Expand
CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases
TLDR
CoSQL is presented, a corpus for building cross-domain, general-purpose database (DB) querying dialogue systems that includes SQL-grounded dialogue state tracking, response generation from query results, and user dialogue act prediction and a set of strong baselines are evaluated. Expand
SParC: Cross-Domain Semantic Parsing in Context
TLDR
An in-depth analysis of SParC is provided and it is shown that it introduces new challenges compared to existing datasets and requires generalization to unseen domains due to its cross-domain nature and the unseen databases at test time. Expand
IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles
TLDR
A sequence-to-action parsing approach for the natural language to SQL task that incrementally fills the slots of a SQL query with feasible actions from a pre-defined inventory and sets a new state-of-the-art performance at an execution accuracy of 87.1%. Expand
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
TLDR
This work proposes Seq2 SQL, a deep neural network for translating natural language questions to corresponding SQL queries, and releases WikiSQL, a dataset of 80654 hand-annotated examples of questions and SQL queries distributed across 24241 tables fromWikipedia that is an order of magnitude larger than comparable datasets. Expand
TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation
TLDR
This paper presents a novel approach TypeSQL which formats the problem as a slot filling task in a more reasonable way and utilizes type information to better understand rare entities and numbers in the questions. Expand
...
1
2
3
4
5
...