Corpus ID: 8108340

Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary

@inproceedings{Zesch2008ExtractingLS,
  title={Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary},
  author={Torsten Zesch and Christof M{\"u}ller and Iryna Gurevych},
  booktitle={LREC},
  year={2008}
}
Recently, collaboratively constructed resources such as Wikipedia and Wiktionary have been discovered as valuable lexical semantic knowledge bases with a high potential in diverse Natural Language Processing (NLP) tasks. Collaborative knowledge bases however significantly differ from traditional linguistic knowledge bases in various respects, and this constitutes both an asset and an impediment for research in NLP. This paper addresses one such major impediment, namely the lack of suitable… Expand
Extracting Lexical-Semantic Knowledge from the Portuguese Wiktionary
Public domain collaborative resources like Wiktionary and Wikipedia have recently become attractive sources for information extraction. To use these resources in natural languague processing (NLP)Expand
Extracting Arabic semantic graph from Aljazeera.net
TLDR
This paper presents a framework designed for mining the explicit and implicit lexical semantic information impeded in the structure and the content of Aljazeera.net and provides an efficient and structured access to the resulted semantic graph. Expand
Representational Interoperability of Linguistic and Collaborative Knowledge Bases
TLDR
A model of representational interoperability between LKBs and CKBs is developed, which abstracts over the differences in their structures, and enables a uniform representation of their content in terms of entities and lexical-semantic relations between them. Expand
An interactive semantic knowledge base unifying Wikipedia and HowNet
TLDR
An interactive, exoteric semantic knowledge base, which integrates HowNet and the online encyclopedia Wikipedia, which mainly builds on items, categories, attributes and relation between is presented. Expand
Wikitology: a novel hybrid knowledge base derived from wikipedia
TLDR
The value of the derived knowledge base is demonstrated by developing problem specific intelligent approaches that exploit Wikitology for a diverse set of use cases, namely, document concept prediction, cross document co-reference resolution, Entity Linking to KB entities defined as a part of Text Analysis Conference - Knowledge Base Population Track 2009 and interpreting tables. Expand
WikTDV: Data extraction and vector representation resource for Wiktionary senses
  • D. S. Carvalho, M. Nguyen
  • Computer Science
  • 2017 9th International Conference on Knowledge and Systems Engineering (KSE)
  • 2017
TLDR
A system for extracting information from Wiktionary to a machine-readable format and using this information to obtain vector representations that can be used for semantic similarity computation and basic word sense disambiguation is described. Expand
Dbnary: Wiktionary as a LMF based Multilingual RDF network
TLDR
A word net that has been extracted from French, English and German wiktionaries is presented and it is shown how the extracted data is represented as a Lexical Markup Framework (LMF) compatible lexical network represented in Resource Description Framework (RDF) format. Expand
Wiktionary for Natural Language Processing: Methodology and Limitations
TLDR
An in-depth study of synonymy networks extracted from Wiktionary is provided and two methods for semiautomatically improving this network by adding missing relations are described, using a kind of semantic proximity measure and using translation relations of Wiktionsary itself. Expand
Dbnary : Wiktionary as a Lemon Based RDF Multilingual Lexical Resource
Contributive resources, such as Wikipedia, have proved to be valuable to Natural Language Processing or multilingual Information Retrieval applications. This work focusses on Wiktionary, theExpand
An approach for building lexical-semantic resources based on heterogeneous information sources
TLDR
This work proposes a new approach to automatically build LSRs that are tailored to semantic search engines, i.e., the approach builds L SRs that favor disambiguation and faceted search. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 27 REFERENCES
WikiRelate! Computing Semantic Relatedness Using Wikipedia
TLDR
This work presents experiments on using Wikipedia for computing semantic relatedness and compares it to WordNet on various benchmarking datasets, and shows that Wikipedia outperforms WordNet when applied to the largest available dataset designed for that purpose. Expand
Automatic Assignment of Wikipedia Encyclopedic Entries to WordNet Synsets
TLDR
An approach taken for automatically associating entries from an on-line encyclopedia with concepts in an ontology or a lexical semantic network is described, which will be applied to enriching ontologies with encyclopedic knowledge. Expand
Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge
TLDR
It is proposed to enrich document representation through automatic use of a vast compendium of human knowledge--an encyclopedia, and empirical results confirm that this knowledge-intensive representation brings text categorization to a qualitatively new level of performance across a diverse collection of datasets. Expand
Measuring and Improving the Quality of World Knowledge extracted from WordNet
TLDR
This report describes the attempts to arrive at a quantitative measure of the quality of the information that can be extracted from WordNet by interpreting it as a formal taxonomy, and to design automatic techniques for improving the quality by filtering out dubious assertions. Expand
Analysis of the Wikipedia Category Graph for NLP Applications
TLDR
A graphtheoretic analysis of the category graph is performed, and it is shown that it is a scale-free, small world graph like other well-known lexical semantic networks. Expand
Comparing Wikipedia and German Wordnet by Evaluating Semantic Relatedness on Multiple Datasets
TLDR
The combination of wordnets and Wikipedia to improve the performance of semantic relatedness measures is investigated, showing that their performance depends on the definition of relatedness that was underlying the construction of the evaluation dataset and the knowledge source used for computing semanticrelatedness. Expand
What to be? - Electronic Career Guidance Based on Semantic Relatedness
TLDR
A study aimed at investigating the use of semantic information in a novel NLP application, Electronic Career Guidance (ECG), in German, and evaluating the performance of SR measures intrinsically on the tasks of computing SR, and solving Reader’s Digest Word Power questions. Expand
Using Encyclopedic Knowledge for Named entity Disambiguation
TLDR
A disambiguation SVM kernel is trained to exploit the high coverage and rich structure of the knowledge encoded in an online encyclopedia and significantly outperforms a less informed baseline. Expand
Using Wikipedia at the TREC QA Track
We describe our participation in the TREC 2004 Question Answering track. We provide a detailed account of the ideas underlying our approach to the QA task, especially to the so-called ?other?Expand
WordNet : an electronic lexical database
TLDR
The lexical database: nouns in WordNet, Katherine J. Miller a semantic network of English verbs, and applications of WordNet: building semantic concordances are presented. Expand
...
1
2
3
...