Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites

  title={Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites},
  author={R. Navigli and Paola Velardi},
  journal={Computational Linguistics},
We present a method and a tool, OntoLearn, aimed at the extraction of domain ontologies from Web sites, and more generally from documents shared among the members of virtual organizations. OntoLearn first extracts a domain terminology from available documents. Then, complex domain terms are semantically interpreted and arranged in a hierarchical fashion. Finally, a general-purpose ontology, WordNet, is trimmed and enriched with the detected domain concepts. The major novel aspect of this… 

Chapter I Contextual Hierarchy Driven Ontology Learning

The results show that the concept discovery approach improves the conceptual quality and the relevance of the extracted ontological concepts, provides a support for the domain experts and facilitates the evaluation task for them.

A Domain Ontology Learning from Web Documents

Experimental results show that the proposed approach can effectively construct a cancer domain ontology from unstructured text documents.

Ontology Engineering and Knowledge Extraction for Cross-Lingual Retrieval

In this paper, we show that by integrating existing NLP techniques and Semantic Web tools in a novel way, we can provide a valuable contribution to the solution of the knowledge acquisition

A semantic framework for extracting taxonomic relations from text corpus

A framework that combines Word Sense Disambiguation (WSD) and web approach to extract taxonomic relations from a domain-text corpus and outperforms the state of the art method in corpora having high WSD results is proposed.

OntOAIr: a method to construct lightweight ontologies from document collections

The OntOAIr method uses simplified representations of documents, an adaptation of the Frequent Itemset-based Hierarchical Clus- tering algorithm (FIHC), and ontological engineering techniques to construct lightweight ontologies from document collections.

Learning to Build a Semantic Thesaurus from Free Text Corpora without External Help

This chapter describes the knowledge-poor process of extracting ontological information in the economic domain mostly automatically from Modern Greek text using statistical filters and machine learning techniques.

OntOAIr: A Method to Construct Lightweight Ontologies from Document Collections

The OntOAIr method uses simplified representations of documents, an adaptation of the frequent item set-based hierarchical clustering algorithm (FIHC), and ontological engineering techniques to construct lightweight ontologies from document collections.

Automatic Algorithm for Extracting an Ontology for a Specific Domain Name

This study proposes a framework that separates the process of finding important concepts from linguistic analysis to extract more taxonomic and non-taxonomic relationships and compares the results of this framework with an existing ontology in the area of “biochemy”.



Enriching very large ontologies using the WWW

The overall goal is to overcome two shortcomings of WordNet: the lack of topical links among concepts, and the proliferation of senses, which are improved when the hierarchical clusters are used.

Semi-Automatic Engineering of Ontologies from Text

This work presents a general architecture for discovering conceptual structures and engineering ontologies, and proposes a new approach to extend current approaches, who mostly focus on the semi-automatic acquisition of taxonomies, by the discovery of non-taxonomic conceptual relations.

Structural semantic interconnections: a knowledge-based approach to word sense disambiguation

  • R. NavigliP. Velardi
  • Computer Science
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2005
Structural semantic interconnections (SSI) is presented, which creates structural specifications of the possible senses for each word in a context and selects the best hypothesis according to a grammar G, describing relations between sense specifications.

Ontology Learning from Text

Ontologies have shown their usefulness in application areas such as information integration, natural language processing, metadata for the world wide, to name but a few. However, there remains the

Ontology Learning and Its Application to Automated Terminology Translation

The OntoLearn system is an infrastructure for automated ontology learning from domain text that uses natural language processing and machine learning techniques, and is part of a more general ontology engineering architecture.

An analysis of ontology-based query expansion strategies

It is shown that expanding with synonyms or hyperonyms has a limited effect on web information retrieval performance, while other types of semantic information derivable from an ontology are much more effective at improving search results.

Extending, trimming and fusing WordNet for technical documents

This paper describes a tool for the automatic extension and trimming of a multilingual WordNet database for cross-lingual retrieval and multilingual ontology building in intranets and

Identification of Relevant Terms to Support the Construction of Domain Ontologies

A text mining technique to aid an Ontology Engineer to identify the important concepts in a Domain Ontology is described.

Sweetening Ontologies with DOLCE

In this paper we introduce the DOLCE upper level ontology, the first module of a Foundational Ontologies Library being developed within the WonderWeb project. DOLCE is presented here in an intuitive