A comparative survey of recent natural language interfaces for databases

  title={A comparative survey of recent natural language interfaces for databases},
  author={Katrin Affolter and Kurt Stockinger and Abraham Bernstein},
  journal={The VLDB Journal},
  pages={793 - 819}
Over the last few years, natural language interfaces (NLI) for databases have gained significant traction both in academia and industry. These systems use very different approaches as described in recent survey papers. However, these systems have not been systematically compared against a set of benchmark questions in order to rigorously evaluate their functionalities and expressive power. In this paper, we give an overview over 24 recently developed NLIs for databases. Each of the systems is… 

State of the Art and Open Challenges in Natural Language Interfaces to Data

This tutorial will review natural language interface solutions in terms of their interpretation approach, as well as the complexity of the queries they can generate, and discuss open research challenges.

Neural Approaches for Natural Language Interfaces to Databases: A Survey

This survey focuses on the key design decisions behind current state of the art neural approaches, which are group into encoder and decoder improvements, and highlights the three most important directions, namely linking question tokens to database schema elements, better architectures for encoding the textual query taking into account the schema, and improved generation of structured queries using autoregressive neural models.

A Natural Language and Interactive End-to-End Querying and Reporting System

This paper presents an end-to-end system Query Enterprise Data (QED) towards making enterprise descriptive analytics and reporting easier and natural and highlights the role of an assistive user interface that provides a better user experience.

ValueNet : a natural language-to-SQL system that learns from database information

This paper proposes a novel architecture sketch to extract values from a user question and come up with possible value candidates which are not explicitly mentioned in the question, and uses a neural model based on an encoder-decoder architecture to synthesize the SQL query.

ATHENA++: Natural Language Querying for Complex Nested SQL Queries

This paper presents ATHENA++, an end-to-end system that can answer complex queries in natural language by translating them into nested SQL queries, and combines linguistic patterns from NL queries with deep domain reasoning using ontologies to enable nested query detection and generation.

Towards a Natural Language Query Processing System

The novelty in the study lies in defining a graph database as a middle layer to store necessary metadata needed to transform a natural language query into structured query language that can be executed on backend databases.

xDBTagger: Explainable Natural Language Interface to Databases Using Keyword Mappings and Schema Graph

Translating natural language queries (NLQ) into structured query language (SQL) in interfaces to relational databases is a challenging task that has been widely studied by researchers from both the

Querying knowledge graphs in natural language

The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query.

Generating Grammars from Lemon Lexica for Questions Answering over Linked Data: a Preliminary Analysis

Very preliminary results are presented showing that a large percentage of the questions of the training set of QALD-7 can be rephrased in terms of questions that the grammar can parse, and that with a hand-crafted lexicon, this model-based approach comes with a number of advantages compared to a machine learning approach.

Weakly Supervised Mapping of Natural Language to SQL through Question Decomposition

This work uses the recently proposed question decomposition representation called QDMR, an intermediate between NL and formal query languages, and uses NL-QDMR pairs, along with the question answers, as supervision for automatically synthesizing SQL queries.



Constructing an Interactive Natural Language Interface for Relational Databases

The architecture of an interactive natural language query interface for relational databases is described, able to correctly interpret complex natural language queries, in a generic manner across a range of domains, and is good enough to be usable in practice.

TR Discover: A Natural Language Interface for Querying and Analyzing Interlinked Datasets

The TR Discover system, a natural language-based system that allows non-technical users to create well-formed questions, is developed for future use with Thomson Reuters Cortellis and is shown to be usable and portable, and report on the relative performance of queries using SQL and SPARQL back ends.

Natural Language Interfaces to Ontologies: Combining Syntactic Analysis and Ontology-Based Lookup through the User Interaction

This work presents FREyA, which combines syntactic parsing with the knowledge encoded in ontologies in order to reduce the customisation effort, and is evaluated using Mooney Geoquery dataset with very high precision and recall.

NaLIX: an interactive natural language interface for querying XML

It is shown that NaLIX, while far from being able to pass the Turing test, is perfectly usable in practice, and able to handle even quite complex queries in a variety of application domains.

Knowledge Rich Natural Language Queries over Structured Biological Databases

A multi-level knowledge-based middleware to facilitate such mappings that separate the conceptual level from the physical level is proposed and a Datalog based prototype system is presented that can compute responses to arbitrary natural language queries over arbitrary databases once a syntactic classification of the natural language query is made.

Natural language Interface for Database: A Brief review

This paper is an introduction to Intelligent Database System and Natural Language Interface to Databases, which is a step towards the development of intelligent database systems (IDBS) to enhance the users in performing flexible querying in databases.

AMUSE: Multilingual Semantic Parsing for Question Answering over Linked Data

This work presents the first multilingual QALD pipeline that induces a model from training data for mapping a natural language question into logical form as probabilistic inference and presents a novel combination of machine translation and word embedding approaches for this purpose.

A Comprehensive Study of Natural Language Interface To Database

A brief review of various NLIDB systems developed so far is provided and the idea of using natural language instead of SQL triggered the development of new type of processing method Natural Language Interface to Database.

Natural Language Data Management and Interfaces: Recent Development and Open Challenges

This tutorial explores two more relevant areas of overlap to the database community: (1) managing natural language text data in a relational database, and (2) developing natural language interfaces to databases.

AskNow: A Framework for Natural Language Query Formalization in SPARQL

This paper proposes a framework, called AskNow, where users can pose queries in English to a target RDF knowledge base e.g. DBpedia, and empirically evaluated the framework with respect to the syntactic robustness of NQS and semantic accuracy of the SPARQL translator on standard benchmark datasets.