• Corpus ID: 3624235

SQL Query Completion for Data Exploration

@article{Guilly2018SQLQC,
  title={SQL Query Completion for Data Exploration},
  author={Marie Le Guilly and Jean-Marc Petit and Vasile-Marian Scuturici},
  journal={ArXiv},
  year={2018},
  volume={abs/1802.02872}
}
Within the big data tsunami, relational databases and SQL are still there and remain mandatory in most of cases for accessing data. On the one hand, SQL is easy-to-use by non specialists and allows to identify pertinent initial data at the very beginning of the data exploration process. On the other hand, it is not always so easy to formulate SQL queries: nowadays, it is more and more frequent to have several databases available for one application domain, some of them with hundreds of tables… 
Designing Scientific SPARQL Queries Using Autocompletion by Snippets
TLDR
This work introduces the first (to the authors' knowledge) autocompletion approach able to consider snippets based on queries expressed by previous users, enriching the user experience and demonstrates the interest of the approach on real biomedical queries involving services offered by the Wikidata collaborative knowledge base.
Efficient SPARQL Autocompletion on Large Knowledge Bases
TLDR
An autocompletion system, that is able to help users formulate complex SPARQL queries on large knowledge bases using a template-based approach that allows the automatic generation of the AC queries, and shows that context-sensitive autocom completion leads to significantly better suggestions than alternative approaches which do not consider the context.
INODE: Building an End-to-End Data Exploration System in Practice [Extended Vision]
TLDR
INODE is introduced an end-to-end data exploration system that leverages Machine Learning and semantics for the purpose of Data Management (DM) and is uniquely accessible to a wide range of users from larger scientific communities to the public.
Understanding Data Exploration Search A brief Study of User’s Exploratory Search facets
  • Samit Kumar, Vikram Singh
  • Computer Science
    2018 4th International Conference on Computing Communication and Automation (ICCCA)
  • 2018
TLDR
The investigation is intended to examine the core-research issues and tentative directions, and envisioned the challenges exist in the current exploratory techniques, in two brackets: User’s inability and limitation in DE system.
Augmented Business Intelligence
TLDR
This paper proposes an approach that, based on the sensed augmented context, proposes a set of relevant analytical queries to the user based on a mapping between the entities that can be recognized by the devices and the elements of the enterprise data, and also taking into account the queries preferred by users during previous interactions that occurred in similar contexts.

References

SHOWING 1-10 OF 26 REFERENCES
Data Exploration with SQL using Machine Learning Techniques
TLDR
A pseudo-polynomial heuristic to pick the negation closest in size to the initial query, and construct a balanced learning set whose positive examples correspond to the results desired by analysts, and negative examples to those they do not want are described.
ATLAS: A Small but Complete SQL Extension for Data Mining and Data Streams
SnipSuggest: Context-Aware Autocompletion for SQL
TLDR
SnipSuggest is able to recommend useful snippets with up to 93.7% average precision, at interactive speed, and it is shown that SnipSuggest outperforms naive approaches, such as recommending popular snippets.
Guided Interaction: Rethinking the Query-Result Paradigm
TLDR
This paper examines the conventional query-result paradigm employed by databases and demonstrates challenges encountered when following this paradigm for an information seeking task, and proposes a new paradigm, guided interaction, to solve the noted challenges.
Data Mining and Database Systems: Where is the Intersection?
TLDR
It is argued that (1) the authors need to focus on generic scalability requirements (rather than on features tuned to specific algorithms) wherever possible and (2) data mining systems that are not just scalable, but “SQL-aware” are needed.
Query Steering for Interactive Data Exploration
TLDR
This work proposes database systems be augmented with an automated "database navigator" (DBNav) service that assists as a "tour guide" to facilitate Interactive Data Exploration (IDE) applications.
Interactive Join Query Inference with JIM
TLDR
Jim (Join Inference Machine), a system for interactive join specification tasks, where the user infers an n-ary join predicate by selecting tuples that are part of the join result via Boolean membership queries, is presented.
Data Mining: Concepts and Techniques
TLDR
This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Discovering queries based on example tuples
TLDR
This work studies the problem of discovering the minimal project join query that contains the given example tuples in its output and proposes novel algorithms to solve this problem.
AIDE: An Active Learning-Based Approach for Interactive Data Exploration
TLDR
AIDE, an Automatic Interactive Data Exploration framework that assists users in discovering new interesting data patterns and eliminate expensive ad-hoc exploratory queries, and provides interactive performance as it limits the user wait time per iteration of exploration to less than a few seconds.
...
...