Corpus ID: 38160065

A Proof-of-Concept of D³ Record Mining using Domain-Dependent Data

  title={A Proof-of-Concept of D³ Record Mining using Domain-Dependent Data},
  author={Y. S. Lee and Michaela Geierhos and Sa-Kwang Song and Hanmin Jung},
  • Y. S. Lee, Michaela Geierhos, +1 author Hanmin Jung
  • Published 2012
  • Computer Science
  • Our purpose is to perform data record extraction from onlineevent calendars exploiting sublanguage and domain characteristics. [...] Key Method One of the most remarkable advantages of our method is that it does not require any additional classification steps based on machine learning algorithms or keyword extraction methods; it is a so-called one-step mining technique. Moreover, another important criteria is that our system is robust to DOM and layout modifications made by web designers. Thus, preliminary…Expand Abstract


    Efficient record-level wrapper induction
    • 48
    • PDF
    Web data extraction based on partial tree alignment
    • 592
    • PDF
    Vision-based Web Data Records Extraction
    • 68
    • PDF
    Mining data records in Web pages
    • 547
    • PDF
    Mining Data Regions from Web Pages
    • 17
    • PDF
    Visual Clue Based Extraction of Web Data from Flat and Nested Data Records
    • 5
    • PDF
    Extracting structured data from Web pages
    • 559
    • PDF
    RoadRunner: Towards Automatic Data Extraction from Large Web Sites
    • 1,150
    • PDF
    Efficient approaches for record level web information extraction systems
    • International Journal of Advanced Engineering & Application 2(1)
    • 2011