Web Data Extraction, Applications and Techniques: A Survey

@article{Ferrara2014WebDE,
  title={Web Data Extraction, Applications and Techniques: A Survey},
  author={Emilio Ferrara and Pasquale De Meo and Giacomo Fiumara and Robert Baumgartner},
  journal={Knowl. Based Syst.},
  year={2014},
  volume={70},
  pages={301-323}
}
  • Emilio Ferrara, Pasquale De Meo, +1 author Robert Baumgartner
  • Published 2014
  • Computer Science
  • Knowl. Based Syst.
  • Web Data Extraction is an important problem that has been studied by means of different scientific tools and in a broad range of applications. [...] Key Method We provided a simple classification framework in which existing Web Data Extraction applications are grouped into two main classes, namely applications at the Enterprise level and at the Social Web level.Expand Abstract

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 278 CITATIONS

    Analysis Of Different Web Data Extraction Techniques

    Towards data extraction of dynamic content from JavaScript Web applications

    VIEW 1 EXCERPT
    CITES METHODS

    Wrapper approaches for web data extraction : A review

    VIEW 1 EXCERPT
    CITES METHODS

    Practical Web Data Extraction: Are We There Yet? - A Short Survey

    VIEW 1 EXCERPT
    CITES METHODS

    Extraction Rule Language for Web Information Extraction and Integration

    VIEW 2 EXCERPTS
    CITES BACKGROUND

    Articulating the construction of a web scraper for massive data extraction

    Trend of Supervised Web Data Extraction

    VIEW 2 EXCERPTS
    CITES BACKGROUND & METHODS

    DEiXTo: a web data extraction suite

    VIEW 1 EXCERPT
    CITES BACKGROUND

    FILTER CITATIONS BY YEAR

    2012
    2020

    CITATION STATISTICS

    • 8 Highly Influenced Citations

    • Averaged 32 Citations per year from 2018 through 2020

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 181 REFERENCES

    A Survey of Web Information Extraction Systems

    VIEW 1 EXCERPT

    Structured Data Extraction from the Web Based on Partial Tree Alignment

    • Yanhong Zhai, B. Liu
    • Computer Science
    • IEEE Transactions on Knowledge and Data Engineering
    • 2006
    VIEW 2 EXCERPTS

    DeepWeb Navigation in Web Data Extraction

    • Robert Baumgartner, Michal Ceresna, Gerald Ledermuller
    • Computer Science
    • International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06)
    • 2005
    VIEW 1 EXCERPT