Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing

@inproceedings{Bogin2019RepresentingSS,
  title={Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing},
  author={Ben Bogin and Matt Gardner and Jonathan Berant},
  booktitle={ACL},
  year={2019}
}
Research on parsing language to SQL has largely ignored the structure of the database (DB) schema, either because the DB was very simple, or because it was observed at both training and test time. [...] Key Method In this paper, we present an encoder-decoder semantic parser, where the structure of the DB schema is encoded with a graph neural network, and this representation is later used at both encoding and decoding time. Evaluation shows that encoding the schema structure improves our parser accuracy from 33…Expand
Hierarchical Schema Representation for Text-to-SQL Parsing With Decomposing Decoding
TLDR
A schema-aware neural network with decomposing architecture, namely HSRNet, which aims to address the complex and cross-domain Text-to-SQL generation task and demonstrates the effectiveness of the hierarchical schema representation by incorporating the information into different baselines. Expand
Graph Enhanced Cross-Domain Text-to-SQL Generation
TLDR
This paper improves upon a state-of-the-art Spider model, SyntaxSQLNet, by constructing a graph of column names for all databases and using graph neural networks to compute their embeddings, which offer better cross-domain representations and SQL queries. Expand
Global Reasoning over Database Structures for Text-to-SQL Parsing
TLDR
This work uses message-passing through a graph neural network to softly select a subset of database constants for the output query, conditioned on the question, and trains a model to rank queries based on the global alignment ofdatabase constants to question words. Expand
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing
TLDR
This work presents BRIDGE, a powerful sequential architecture for modeling dependencies between natural language questions and relational databases in cross-DB semantic parsing that effectively captures the desired cross-modal dependencies and has the potential to generalize to more text-DB related tasks. Expand
A Pilot Study for Chinese SQL Semantic Parsing
TLDR
A Spider dataset for Chinese is built, showing that word-based semantic parser is subject to segmentation errors and cross-lingual word embeddings are useful for text-to-SQL. Expand
ValueNet: A Neural Text-to-SQL Architecture Incorporating Values
TLDR
The main idea of the approach is to use not only metadata information about the underlying database but also information on the base data as input for the neural network architecture for the text-to-SQL system incorporating values on the challenging Spider dataset. Expand
IGSQL: Database Schema Interaction Graph Based Neural Model for Context-Dependent Text-to-SQL Generation
TLDR
The proposed database schema interaction graph encoder uses a gate mechanism to weigh the importance of different vocabularies and then make the prediction of SQL tokens and achieves new state-of-the-art results on the two datasets. Expand
An Investigation Between Schema Linking and Text-to-SQL Performance
TLDR
This study provides the ground-truth annotation of schema linking information onto the Spider dataset and hypothesizes that the internal behavior of models at hand becomes much easier to analyze if the detailed performance of schema links simultaneously as the additional information of the text-to-SQL performance. Expand
Re-examining the Role of Schema Linking in Text-to-SQL
TLDR
This work provides a schema linking corpus based on the Spider text-to-SQL dataset, and builds a simple BERT-based baseline to perform a data-driven study on schema linking, finding when schema linking is done well, SLSQL demonstrates good performance on Spider despite its structural simplicity. Expand
End-to-End Cross-Domain Text-to-SQL Semantic Parsing with Auxiliary Task
TLDR
This work proposes a column selection auxiliary task to empower the encoder with the relevance matching capability by using explicit learning targets, and proposes two value filling methods to build the bridge from the existing zero-shot semantic parsers to real-world applications. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 21 REFERENCES
SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task
TLDR
Experimental results show that SyntaxSQLNet can handle a significantly greater number of complex SQL examples than prior work, outperforming the previous state-of-the-art model by 9.5% in exact matching accuracy. Expand
Grammar-based Neural Text-to-SQL Generation
The sequence-to-sequence paradigm employed by neural text-to-SQL models typically performs token-level decoding and does not consider generating SQL hierarchically from a grammar. Grammar-basedExpand
Neural Semantic Parsing with Type Constraints for Semi-Structured Tables
TLDR
A new semantic parsing model for answering compositional questions on semi-structured Wikipedia tables with a state-of-the-art accuracy and type constraints and entity linking are valuable components to incorporate in neural semantic parsers. Expand
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
TLDR
This work defines a new complex and cross-domain semantic parsing and text-to-SQL task so that different complicated SQL queries and databases appear in train and test sets and experiments with various state-of-the-art models show that Spider presents a strong challenge for future research. Expand
Modeling Semantics with Gated Graph Neural Networks for Knowledge Base Question Answering
TLDR
This paper proposes to use Gated Graph Neural Networks to encode the graph structure of the semantic parse, and shows on two data sets that the graph networks outperform all baseline models that do not explicitly model the structure. Expand
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
TLDR
This work proposes Seq2 SQL, a deep neural network for translating natural language questions to corresponding SQL queries, and releases WikiSQL, a dataset of 80654 hand-annotated examples of questions and SQL queries distributed across 24241 tables fromWikipedia that is an order of magnitude larger than comparable datasets. Expand
Sequence-based Structured Prediction for Semantic Parsing
TLDR
An approach for semantic parsing that uses a recurrent neural network to map a natural language question into a logical form representation of a KB query and shows how grammatical constraints on the derivation sequence can easily be integrated inside the RNN-based sequential predictor. Expand
Abstract Syntax Networks for Code Generation and Semantic Parsing
Tasks like code generation and semantic parsing require mapping unstructured (or partially structured) inputs to well-formed, executable outputs. We introduce abstract syntax networks, a modelingExpand
Learning to Parse Database Queries Using Inductive Logic Programming
TLDR
Experimental results with a complete database-query application for U.S. geography show that CHILL is able to learn parsers that outperform a preexisting, hand-crafted counterpart, and provide direct evidence of the utility of an empirical approach at the level of a complete natural language application. Expand
Learning a Neural Semantic Parser from User Feedback
We present an approach to rapidly and easily build natural language interfaces to databases for new domains, whose performance improves over time based on user feedback, and requires minimalExpand
...
1
2
3
...