Corpus ID: 6296617

PROBER: Ad-Hoc Debugging of Extraction and Integration Pipelines

@article{Sarma2010PROBERAD,
  title={PROBER: Ad-Hoc Debugging of Extraction and Integration Pipelines},
  author={A. Sarma and A. Jain and P. Bohannon},
  journal={ArXiv},
  year={2010},
  volume={abs/1004.1614}
}
Complex information extraction (IE) pipelines assembled by plumbing together off-the-shelf operators, specially customized operators, and operators re-used from other text processing pipelines are becoming an integral component of most text processing frameworks. A critical task faced by the IE pipeline user is to run a post-mortem analysis on the output. Due to the diverse nature of extraction operators (often implemented by independent groups), it is time-consuming and error-prone to describe…
