Joint unsupervised structure discovery and information extraction

@inproceedings{Vilarinho2011JointUS,
  title={Joint unsupervised structure discovery and information extraction},
  author={Eli Cortez C. Vilarinho and Daniel Arcuschin de Oliveira and Altigran Soares da Silva and Edleno Silva de Moura and Alberto H. F. Laender},
  booktitle={SIGMOD Conference},
  year={2011}
}
In this paper we present JUDIE (Joint Unsupervised Structure Discovery and Information Extraction), a new method for automatically extracting semi-structured data records in the form of continuous text (e.g., bibliographic citations, postal addresses, classified ads, etc.) and having no explicit delimiters between them. While in state-of-the-art Information Extraction methods the structure of the data records is manually supplied the by user as a training step, JUDIE is capable of detecting the… CONTINUE READING
Highly Cited
This paper has 25 citations. REVIEW CITATIONS
16 Citations
7 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 16 extracted citations

References

Publications referenced by this paper.
Showing 1-7 of 7 references

Automatic Segmentation of Text into Structured Records

  • V. Borkar et. al
  • In Proc. ACM SIGMOD Intl. Conf. on Management of…
  • 2001
Highly Influential
6 Excerpts

Similar Papers

Loading similar papers…