SOF: a semi‐supervised ontology‐learning‐based focused crawler

  title={SOF: a semi‐supervised ontology‐learning‐based focused crawler},
  author={Hai Dong and F. Hussain},
  journal={Concurrency and Computation: Practice and Experience},
  • Hai Dong, F. Hussain
  • Published 2013
  • Computer Science
  • Concurrency and Computation: Practice and Experience
  • The rapid increase in the volume of data available on the Internet makes it increasingly impractical for a crawler to index the whole Web. Instead, many intelligent crawlers, known as ontology‐based semantic focused crawlers, have been designed by making use of Semantic Web technologies for topic‐centered Web information crawling. Ontologies, however, have constraints of validity and time, which may influence the performance of the crawlers. Ontology‐learning‐based focused crawlers are… CONTINUE READING
    A survey of Web crawlers for information retrieval
    • 16
    Extracting Event-Centric Document Collections from Large-Scale Web Archives
    • 9
    • PDF
    What Do You Want to Collect from the Web ? ?
    • 9
    • PDF
    Towards extracting event-centric collections from Web archives
    • 2
    • PDF


    Publications referenced by this paper.
    A translation approach to portable ontology specifications
    • 11,629
    • PDF
    Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery
    • 1,718
    • PDF
    Ontology-focused crawling of Web documents
    • 180
    • PDF
    Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language
    • 2,064
    • Highly Influential
    • PDF
    Intelligent crawling on the World Wide Web with arbitrary predicates
    • 293
    • PDF
    Ontology learning from text: A look back and into the future
    • 303
    • PDF
    A training algorithm for optimal margin classifiers
    • 9,320
    • PDF
    URBE: Web Service Retrieval Based on Similarity Evaluation
    • 238
    • PDF
    An efficient adaptive focused crawler based on ontology learning
    • 54