Provenance for Natural Language Queries
@article{Deutch2017ProvenanceFN, title={Provenance for Natural Language Queries}, author={Daniel Deutch and Nave Frost and Amir Gilad}, journal={Proc. VLDB Endow.}, year={2017}, volume={10}, pages={577-588} }
Multiple lines of research have developed Natural Language (NL) interfaces for formulating database queries. We build upon this work, but focus on presenting a highly detailed form of the answers in NL. The answers that we present are importantly based on the provenance of tuples in the query result, detailing not only the results but also their explanations. We develop a novel method for transforming provenance information to NL, by leveraging the original NL query structure. Furthermore…
Figures and Tables from this paper
29 Citations
Explaining Natural Language query results
- Computer ScienceThe VLDB Journal
- 2019
This work develops a novel method for transforming provenance information to NL, by leveraging the original NL query structure, and presents two solutions for its effective presentation as NL text: one based on provenance factorization, with novel desiderata relevant to the NL case and one that is based on summarization.
Explaining Queries Over Web Tables to Non-experts
- Computer Science2019 IEEE 35th International Conference on Data Engineering (ICDE)
- 2019
This work augments a state-of-the-art NL interface over web tables, enhancing it in both its training and deployment phase, and introduces novel query explanations that provide a graphic representation of the query cell-based provenance in its execution on a given table.
Reverse-Engineering ConjunctiveQueries from Provenance Examples
- Computer Science
- 2019
The theoretical analysis shows that there may be many (for some models, even infinitely many in presence of self-joins) consistent queries, yet the algorithms provided provide practically efficient algorithms to find (best-fit) such queries.
Provenance for Non-Experts
- Computer ScienceIEEE Data Eng. Bull.
- 2018
This paper outlines in this paper the ongoing research and preliminary results, addressing the challenges towards developing provenance solutions that serve to explain computation results to non-expert users.
Explaining Missing Query Results in Natural Language
- Computer ScienceEDBT
- 2020
This paper proposes a novel approach to “marry" NLIDBs with an existing model for explaining missing query results by pinpointing the last query operator that is “responsible" for the missing result.
Provenance Summaries for Answers and Non-Answers
- Computer ScienceProc. VLDB Endow.
- 2018
PUG limits provenance capture to what is relevant to explain a ( Missing) result of interest and uses an efficient sampling-based summarization method to produce compact explanations for (missing) answers.
From Natural Language Questions to SPARQL Queries: A Pattern-based Approach
- Computer ScienceBTW
- 2019
The main contribution of the proposed approach constitutes the simple replaceability of the underlying knowledge base, which is based on general question and query patterns and only accesses the knowledge base for the actual query generation and execution.
Putting Things into Context: Rich Explanations for Query Answers using Join Graphs
- Computer ScienceSIGMOD Conference
- 2021
This work proposes a new approach for explaining query results by augmenting provenance with information from other related tables in the database by using a suite of optimization techniques.
ML Based Lineage in Databases
- Computer ScienceArXiv
- 2021
A novel approach for approximating lineage tracking, using a Machine Learning (ML) and Natural Language Processing (NLP) technique; namely, word embedding, and designs an alternative lineage tracking mechanism, that of keeping track of and querying lineage at the column (“gene”) level to better distinguish between the provenance features and the textual characteristics of a tuple.
Fragment-Driven Natural Language Interaction with Databases
- Computer ScienceCIDR
- 2020
This work proposes an alternative fragment-driven interaction model, where the system provides an explanation as to how the natural language produced the resulting SQL, which enables the user to interact with the system purely in natural language and to make incremental modifications to their resulting database query without having to learn any SQL.
References
SHOWING 1-10 OF 49 REFERENCES
NLProv: Natural Language Provenance
- Computer ScienceProc. VLDB Endow.
- 2016
This work develops a novel method for transforming provenance information to NL, by leveraging the original NL question structure, and presents two solutions for its effective presentation as NL text: one based on provenance factorization with novel desiderata relevant to the NL case, and one that is based on summarization.
Selective Provenance for Datalog Programs Using Top-K Queries
- Computer ScienceProc. VLDB Endow.
- 2015
A novel top-k query language for querying datalog provenance, supporting selection criteria based on tree patterns and ranking based on the rules and database facts used in derivation, and an efficient novel algorithm based on instrumenting the datalog program so that it generates only relevant provenance.
Querying data provenance
- Computer ScienceSIGMOD Conference
- 2010
A query language for provenance is developed, which can express all of the aforementioned types of queries, as well as many more, and the feasibility of provenance querying and the benefits of the indexing techniques across a variety of application classes and queries are experimentally validated.
TR Discover: A Natural Language Interface for Querying and Analyzing Interlinked Datasets
- Computer ScienceSEMWEB
- 2015
The TR Discover system, a natural language-based system that allows non-technical users to create well-formed questions, is developed for future use with Thomson Reuters Cortellis and is shown to be usable and portable, and report on the relative performance of queries using SQL and SPARQL back ends.
Explaining structured queries in natural language
- Computer Science2010 IEEE 26th International Conference on Data Engineering (ICDE 2010)
- 2010
This paper represents various forms of structured queries as directed graphs and annotate the graph edges with template labels using an extensible template mechanism and presents different graph traversal strategies for efficiently exploring these graphs and composing textual query descriptions.
Using SQL for Efficient Generation and Querying of Provenance Information
- Computer ScienceIn Search of Elegance in the Theory and Practice of Computation
- 2013
This chapter reviews some of the main contributions of Perm, a DBMS that generates different types of provenance information for complex SQL queries (including nested and correlated subqueries and aggregation).
Quelo Natural Language Interface: Generating queries and answer descriptions
- Computer Science
- 2014
This work describes Quelo NLI functionality and presents a grammar-based natural language generation approach that better supports the domain-independent generation of fluent queries and naturally extends for the generation of answers descriptions.
Approximated Summarization of Data Provenance
- Computer ScienceCIKM
- 2015
The notion of approximated summarized provenance is introduced, which provides a compact representation of the provenance at the possible cost of information loss, and a novel provenance summarization algorithm is presented which outputs a summary of the input provenance.
Provenance: On and Behind the Screens
- Computer ScienceSIGMOD Conference
- 2016
The second part of this tutorial focuses on enabling users to leverage provenance through adapted visualizations, and will present some fundamental concepts of visualization before discussing possible visualizations for provenance.
A Natural Language Interface for Querying General and Individual Knowledge
- Computer ScienceProc. VLDB Endow.
- 2015
A modular translation framework that employs new solutions along with state-of-the art NL parsing tools is designed and implemented that provides a high-quality translation for many questions that are not handled by previous translation tools.