Interactive and Deterministic Data Cleaning

Jian He, Enzo Veltri, Donatello Santoro, Guoliang Li, Giansalvatore Mecca, Paolo Papotti, Nan Tang
SIGMOD Conference
We present Falcon, an interactive, deterministic, and declarative data cleaning system that uses SQL update queries as its repair language. Falcon does not rely on a set of pre-defined data quality rules. Instead, it encourages users to explore the data, identify possible problems, and make updates to fix them. Bootstrapped by a single user update, Falcon guesses a set of possible SQL update queries that can be used to repair the data. The main technical challenge…
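To make the idea of generalizing one user update into a set of candidate SQL update queries concrete, here is a minimal sketch. It is not Falcon's algorithm, which the abstract does not spell out. It assumes a naive strategy: every subset of the repaired row's original attribute values becomes an equality predicate in the WHERE clause. The function name `candidate_updates`, the table `customers`, and the sample row are all hypothetical.

```python
from itertools import combinations


def candidate_updates(table, row, attr, new_value):
    """Enumerate candidate UPDATE queries that generalize one cell repair.

    Assumption (not from the paper): each candidate uses a subset of the
    row's original values as equality predicates. Values are rendered with
    repr(), which is only adequate for simple literals without quotes.
    """
    attrs = list(row)  # includes the repaired attribute with its old value
    queries = []
    for k in range(len(attrs) + 1):
        for subset in combinations(attrs, k):
            sql = f"UPDATE {table} SET {attr} = {new_value!r}"
            if subset:
                where = " AND ".join(f"{a} = {row[a]!r}" for a in subset)
                sql += f" WHERE {where}"
            queries.append(sql)
    return queries


# Hypothetical repair: the user changes city from 'Pari' to 'Paris' in one row.
row = {"name": "Ann", "city": "Pari", "country": "FR"}
queries = candidate_updates("customers", row, "city", "Paris")
for q in queries:
    print(q)
```

With three attributes this yields 2^3 = 8 candidates, ranging from the most general query (no WHERE clause) to the most specific one (all three predicates). The number of candidates grows exponentially with the number of attributes, which suggests why choosing among them interactively with the user is a non-trivial problem.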