Auto-Join: Joining Tables by Leveraging Transformations

@article{Zhu2017AutoJoinJT,
  title={Auto-Join: Joining Tables by Leveraging Transformations},
  author={Erkang Zhu and Yeye He and Surajit Chaudhuri},
  journal={Proc. VLDB Endow.},
  year={2017},
  volume={10},
  pages={1034--1045}
}
Traditional equi-join relies solely on string equality comparisons to perform joins. However, in scenarios such as ad-hoc data analysis in spreadsheets, users increasingly need to join tables whose join-columns are from the same semantic domain but use different textual representations, for which transformations are needed before equi-join can be performed. We developed Auto-Join, a system that can automatically search over a rich space of operators to compose a transformation program, whose…
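
The abstract's core idea, transforming one join column so that a plain equi-join becomes possible, can be illustrated with a minimal sketch. The tables, values, and the hand-written `transform` function below are invented for illustration; Auto-Join searches for such a program automatically, which this sketch does not attempt.

```python
# Hypothetical example tables; names and values are invented for illustration.
presidents = ["Barack Obama", "George W. Bush", "Bill Clinton"]
records = [("Obama, Barack", 2009), ("Bush, George W.", 2001), ("Clinton, Bill", 1993)]


def equi_join(left, right):
    """Plain equi-join: keep left rows whose key equals a right key exactly."""
    index = {key: value for key, value in right}
    return [(row, index[row]) for row in left if row in index]


def transform(name: str) -> str:
    """Hand-written transformation: split on the last space, emit 'Last, First'."""
    first, last = name.rsplit(" ", 1)
    return f"{last}, {first}"


def transform_join(left, right):
    """Apply the transformation to the left keys, then equi-join as usual."""
    index = {key: value for key, value in right}
    return [(row, index[transform(row)]) for row in left if transform(row) in index]


plain = equi_join(presidents, records)        # no exact string matches
joined = transform_join(presidents, records)  # every row matches after transforming
```

Here the two columns name the same people in different textual forms, so the plain equi-join finds nothing while the transformed join matches each row.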
25 Citations
Auto-transform
Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations
  • 19
Transform-Data-by-Example (TDE): Extensible Data Transformation in Excel
  • 7
PEXESO: Finding Joinable Tables by Distance-based Similarities
Putting Things into Context: Rich Explanations for Query Answers using Join Graphs (extended version)
Interactive rule correction, imputation and execution in rule-driven database completion system
  • K. Reddy
  • Computer Science
  • 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
  • 2020
Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation
Auto-Transform: Learning-to-Transform by Patterns
Lazo: A Cardinality-Based Method for Coupled Estimation of Jaccard Similarity and Containment
  • 10