Corpus ID: 14264833

Single Document Keyphrase Extraction Using Neighborhood Knowledge

@inproceedings{Wan2008SingleDK,
  title={Single Document Keyphrase Extraction Using Neighborhood Knowledge},
  author={Xiaojun Wan and J. Xiao},
  booktitle={AAAI},
  year={2008}
}
Existing methods for single document keyphrase extraction usually make use of only the information contained in the specified document. This paper proposes to use a small number of nearest neighbor documents to provide more knowledge to improve single document keyphrase extraction. A specified document is expanded to a small document set by adding a few neighbor documents close to the document, and the graph-based ranking algorithm is then applied on the expanded document set to make use of… Expand

Figures, Tables, and Topics from this paper

Exploiting neighborhood knowledge for single document summarization and keyphrase extraction
TLDR
This article proposes using a small number of nearest neighbor documents to improve document summarization and keyphrase extraction for the specified document, under the assumption that the neighbor documents could provide additional knowledge and more clues. Expand
A Comparison of Supervised Keyphrase Extraction Models
TLDR
A comparison of existing supervised approaches to this task to determine the current best performing model is provided and uses research articles on the Web as the case study. Expand
Clustering to Find Exemplar Terms for Keyphrase Extraction
TLDR
This work proposes an unsupervised method for keyphrase extraction that outperforms sate-of-the-art graph-based ranking methods (TextRank) by 9.5% in F1-measure and guarantees the document to be semantically covered by these exemplar terms. Expand
A Graph-based Approach of Automatic Keyphrase Extraction
TLDR
This work proposes to take full advantage of the reinforcement between words and sentences by melting three kinds of relationships between them, so that the extracted keyphrases should be synthetic in the sense that they should deal with all the main topics in a document. Expand
WikiRank: Improving Keyphrase Extraction Based on Background Knowledge
TLDR
This paper proposes WikiRank, an unsupervised method for keyphrase extraction based on the background knowledge from Wikipedia, which obtains improvements over other state-of-art models by more than 2% in F1-score. Expand
Automatic Keyphrase Extraction via Topic Decomposition
TLDR
A Topical PageRank (TPR) is built on word graph to measure word importance with respect to different topics and shows that TPR outperforms state-of-the-art keyphrase extraction methods on two datasets under various evaluation metrics. Expand
Latent Keyphrase Extraction Using LDA Model
TLDR
A new approach is proposed that selects candidate keyphrases from the keyphRases of neighbor documents which are similar to the given document and evaluates the importance of the candidates with the individual words in the candidates. Expand
A review of keyphrase extraction
TLDR
This article introduces keyphrase extraction, provides a well‐structured review of the existing work, offers interesting insights on the different evaluation approaches, highlights open issues and presents a comparative experimental study of popular unsupervised techniques on five datasets. Expand
Distant Supervision for Keyphrase Extraction using Search Queries
TLDR
A widely applicable distant supervision framework based on auxiliary data from query logs that propagates information from queries and subsequent consumption of content, weak labels are produced, transforming the problem into the easier supervised task. Expand
Automatic Keyphrase Extraction by Bridging Vocabulary Gap
TLDR
The method is considered that a document and its keyphrases both describe the same object but are written in two different languages, and outperforms existing unsupervised methods on precision, recall and F-measure. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 26 REFERENCES
CorePhrase: Keyphrase Extraction for Document Clustering
TLDR
Subjective as well as quantitative evaluation show that the algorithm outperforms keyword-based cluster-labeling algorithms, and is capable of accurately discovering the topic, and often ranking it in the top one or two extracted keyphrases. Expand
Domain-Specific Keyphrase Extraction
TLDR
This paper shows that a simple procedure for keyphrase extraction based on the naive Bayes learning scheme performs comparably to the state of the art, and explains how this procedure's performance can be boosted by automatically tailoring the extraction process to the particular document collection at hand. Expand
Single Document Summarization with Document Expansion
TLDR
The experimental results on the DUC2002 dataset demonstrate the effectiveness of the proposed approach based on document expansion, and the cross-document relationships between sentences in the expanded document set are validated to be very important for single document summarization. Expand
KEA: practical automatic keyphrase extraction
TLDR
This paper uses a large test corpus to evaluate Kea’s effectiveness in terms of how many author-assigned keyphrases are correctly identified, and describes the system, which is simple, robust, and publicly available. Expand
Coherent Keyphrase Extraction via Web Mining
TLDR
This paper presents enhancements to the Kea keyphrase extraction algorithm that are designed to increase the coherence of the extracted keyphrases. Expand
Using Noun Phrase Heads to Extract Document Keyphrases
TLDR
The simple noun phrase-based system performs roughly as well as a state-of-the-art, corpus-trained keyphrase extractor; ratings for individual keyphrases do not necessarily correlate with ratings for sets of keyphRases for a document. Expand
Learning Algorithms for Keyphrase Extraction
TLDR
The experimental results support the claim that a custom-designed algorithm (GenEx), incorporating specialized procedural domain knowledge, can generate better keyphrases than a general-purpose algorithm (C4.5). Expand
Automatic Hypertext Keyphrase Detection
TLDR
It is found that hyperlink information can improve the effectiveness of automatic keyphrase extraction by 50% and shows promise for the applicability of these techniques to more far-reaching tasks. Expand
Keyphrase Extraction in Scientific Publications
TLDR
In the evaluation using a corpus of 120 scientific publications multiply annotated for keyphrases, the system significantly outperformed Kea at the p < .05 level. Expand
Thesaurus based automatic keyphrase indexing
  • Olena Medelyan, I. Witten
  • Computer Science
  • Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06)
  • 2006
We propose a new method that enhances automatic keyphrase extraction by using semantic information on terms and phrases gleaned from a domain-specific thesaurus. We evaluate the results againstExpand
...
1
2
3
...