A Large Public Corpus of Web Tables containing Time and Context Metadata

@inproceedings{Lehmberg2016ALP,
  title={A Large Public Corpus of Web Tables containing Time and Context Metadata},
  author={Oliver Lehmberg and Dominique Ritze and Robert Meusel and Christian Bizer},
  booktitle={WWW},
  year={2016}
}
The Web contains vast amounts of HTML tables. Most of these tables are used for layout purposes, but a small subset of the tables is relational, meaning that they contain structured data describing a set of entities [2]. As these relational Web tables cover a very wide range of different topics, there is a growing body of research investigating the utility of Web table data for completing cross-domain knowledge bases [6], for extending arbitrary tables with additional attributes [7, 4], as well… CONTINUE READING
Highly Cited
This paper has 42 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.

Citations

Publications citing this paper.

Similar Papers

Loading similar papers…