Divide & Conquer-based Inclusion Dependency Discovery

@article{Papenbrock2015DivideC,
  title={Divide & Conquer-based Inclusion Dependency Discovery},
  author={Thorsten Papenbrock and Sebastian Kruse and Jorge-Arnulfo Quian{\'e}-Ruiz and Felix Naumann},
  journal={PVLDB},
  year={2015},
  volume={8},
  pages={774-785}
}
The discovery of all inclusion dependencies (INDs) in a dataset is an important part of any data profiling effort. Apart from the detection of foreign key relationships, INDs can help to perform data integration, query optimization, integrity checking, or schema (re-)design. However, the detection of INDs gets harder as datasets become larger in terms of number of tuples as well as attributes. To this end, we propose Binder, an IND detection system that is capable of detecting both unary and… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 20 CITATIONS

Improving the Efficiency of Inclusion Dependency Detection

VIEW 10 EXCERPTS
CITES METHODS, BACKGROUND & RESULTS
HIGHLY INFLUENCED

Incrementally updating unary inclusion dependencies in dynamic data

  • Distributed and Parallel Databases
  • 2018
VIEW 6 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Incremental Discovery of Inclusion Dependencies

VIEW 8 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

A Survey of Database Dependency Concepts

VIEW 2 EXCERPTS
CITES BACKGROUND & METHODS

Data Profiling: A Tutorial

VIEW 1 EXCERPT
CITES METHODS

Data curation with ontology functional dependences

VIEW 2 EXCERPTS
CITES RESULTS & BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 17 REFERENCES

The plista dataset

VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

Unary and n-ary inclusion dependency discovery in relational databases

  • Journal of Intelligent Information Systems
  • 2007
VIEW 10 EXCERPTS
HIGHLY INFLUENTIAL

Efficiently Computing Inclusion Dependencies for Schema Discovery

  • 22nd International Conference on Data Engineering Workshops (ICDEW'06)
  • 2006
VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

Data profiling revisited

  • SIGMOD Record
  • 2013
VIEW 1 EXCERPT

Discover Dependencies from Data—A Review

  • IEEE Transactions on Knowledge and Data Engineering
  • 2012
VIEW 1 EXCERPT

CLIM: Closed Inclusion Dependency Mining in Databases

  • 2011 IEEE 11th International Conference on Data Mining Workshops
  • 2011
VIEW 1 EXCERPT

Discovery of high-dimensional inclusion dependencies

  • Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405)
  • 2003
VIEW 1 EXCERPT

Similar Papers