• Corpus ID: 59336169

TiFi: Taxonomy Induction for Fictional Domains [Extended version]

@article{Chu2019TiFiTI,
  title={TiFi: Taxonomy Induction for Fictional Domains [Extended version]},
  author={Cuong Xuan Chu and Simon Razniewski and Gerhard Weikum},
  journal={ArXiv},
  year={2019},
  volume={abs/1901.10263}
}
Taxonomies are important building blocks of structured knowledge bases, and their construction from text sources and Wikipedia has received much attention. [] Key Method Our fiction-targeted approach, called TiFi, consists of three phases: (i) category cleaning, by identifying candidate categories that truly represent classes in the domain of interest, (ii) edge cleaning, by selecting subcategory relationships that correspond to class subsumption, and (iii) top-level construction, by mapping classes onto a…

References

SHOWING 1-10 OF 49 REFERENCES

Automatic taxonomy construction from keywords

This paper develops a Bayesian approach to build a hierarchical taxonomy for a given set of keywords and reduces the complexity of previous hierarchical clustering approaches, so that it can derive a domain specific taxonomy from one million keyword phrases in less than an hour.

MENTA: inducing multilingual taxonomies from wikipedia

This paper investigates how entities from all editions of Wikipedia as well as WordNet can be integrated into a single coherent taxonomic class hierarchy, resulting in MENTA (Multilingual Entity Taxonomy), a resource that describes 5.4 million entities and is presumably the largest multilingual lexical knowledge base currently available.

Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis

A novel approach to the automatic acquisition of taxonomies or concept hierarchies from a text corpus based on Formal Concept Analysis, which model the context of a certain term as a vector representing syntactic dependencies which are automatically acquired from the text corpus with a linguistic parser.

Large-Scale Taxonomy Mapping for Restructuring and Integrating Wikipedia

We present a knowledge-rich methodology for disambiguating Wikipedia categories with WordNet synsets and using this semantic information to restructure a taxonomy automatically generated from the

Yago: a core of semantic knowledge

YAGO builds on entities and relations and currently contains more than 1 million entities and 5 million facts, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASONEPRIZE).

Biperpedia: An Ontology for Search Applications

Bperpedia, an ontology with 1.6M (class, attribute) pairs and 67K distinct attribute names, can increase the number of Web tables whose semantics the authors can recover by more than a factor of 4 compared with Freebase.

Folksonomy-Based Visual Ontology Construction and Its Applications

This paper considers the problem of automatically constructing a folksonomy-based visual ontology (FBVO) from the user-generated annotated images and proposes a systematic framework consisting of three stages as concept discovery, concept relationship extraction, and concept hierarchy construction.

Taxonomy induction based on a collaboratively built knowledge repository

Learning Syntactic Patterns for Automatic Hypernym Discovery

This paper presents a new algorithm for automatically learning hypernym (is-a) relations from text, using "dependency path" features extracted from parse trees and introduces a general-purpose formalization and generalization of these patterns.

Leveraging Community-Built Knowledge for Type Coercion in Question Answering

A high-level overview of the TyCor framework is provided and how it is integrated in Watson is discussed, focusing on and evaluating three TyCor components that leverage the community built semi-structured and structured knowledge resources -- DBpedia, Wikipedia Categories and Lists.