TiFi: Taxonomy Induction for Fictional Domains [Extended version]
@article{Chu2019TiFiTI,
  title={TiFi: Taxonomy Induction for Fictional Domains [Extended version]},
  author={Cuong Xuan Chu and Simon Razniewski and Gerhard Weikum},
  journal={ArXiv},
  year={2019},
  volume={abs/1901.10263}
}
Taxonomies are important building blocks of structured knowledge bases, and their construction from text sources and Wikipedia has received much attention. […] Our fiction-targeted approach, called TiFi, consists of three phases: (i) category cleaning, by identifying candidate categories that truly represent classes in the domain of interest, (ii) edge cleaning, by selecting subcategory relationships that correspond to class subsumption, and (iii) top-level construction, by mapping classes onto a…
References
Automatic taxonomy construction from keywords
- Economics, Computer Science
- KDD
- 2012
This paper develops a Bayesian approach to build a hierarchical taxonomy for a given set of keywords and reduces the complexity of previous hierarchical clustering approaches, so that it can derive a domain specific taxonomy from one million keyword phrases in less than an hour.
MENTA: inducing multilingual taxonomies from wikipedia
- Biology
- CIKM
- 2010
This paper investigates how entities from all editions of Wikipedia as well as WordNet can be integrated into a single coherent taxonomic class hierarchy, resulting in MENTA (Multilingual Entity Taxonomy), a resource that describes 5.4 million entities and is presumably the largest multilingual lexical knowledge base currently available.
Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis
- Computer Science
- J. Artif. Intell. Res.
- 2005
A novel approach to the automatic acquisition of taxonomies or concept hierarchies from a text corpus based on Formal Concept Analysis, which models the context of a certain term as a vector representing syntactic dependencies which are automatically acquired from the text corpus with a linguistic parser.
Large-Scale Taxonomy Mapping for Restructuring and Integrating Wikipedia
- Computer Science
- IJCAI
- 2009
We present a knowledge-rich methodology for disambiguating Wikipedia categories with WordNet synsets and using this semantic information to restructure a taxonomy automatically generated from the…
Yago: a core of semantic knowledge
- Computer Science
- WWW '07
- 2007
YAGO builds on entities and relations and currently contains more than 1 million entities and 5 million facts, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASWONPRIZE).
Biperpedia: An Ontology for Search Applications
- Computer Science
- Proc. VLDB Endow.
- 2014
Biperpedia, an ontology with 1.6M (class, attribute) pairs and 67K distinct attribute names, can increase the number of Web tables whose semantics the authors can recover by more than a factor of 4 compared with Freebase.
Folksonomy-Based Visual Ontology Construction and Its Applications
- Computer Science
- IEEE Transactions on Multimedia
- 2016
This paper considers the problem of automatically constructing a folksonomy-based visual ontology (FBVO) from user-generated annotated images and proposes a systematic framework consisting of three stages: concept discovery, concept relationship extraction, and concept hierarchy construction.
Taxonomy induction based on a collaboratively built knowledge repository
- Computer Science
- Artif. Intell.
- 2011
Learning Syntactic Patterns for Automatic Hypernym Discovery
- Computer Science
- NIPS
- 2004
This paper presents a new algorithm for automatically learning hypernym (is-a) relations from text, using "dependency path" features extracted from parse trees and introduces a general-purpose formalization and generalization of these patterns.
Leveraging Community-Built Knowledge for Type Coercion in Question Answering
- Computer Science
- SEMWEB
- 2011
This paper provides a high-level overview of the TyCor framework and discusses how it is integrated in Watson, focusing on and evaluating three TyCor components that leverage community-built semi-structured and structured knowledge resources: DBpedia, Wikipedia Categories and Lists.