CollabRank: Towards a Collaborative Approach to Single-Document Keyphrase Extraction

@inproceedings{Wan2008CollabRankTA,
  title={CollabRank: Towards a Collaborative Approach to Single-Document Keyphrase Extraction},
  author={Xiaojun Wan and J. Xiao},
  booktitle={COLING},
  year={2008}
}
Previous methods usually conduct the keyphrase extraction task for single documents separately without interactions for each document, under the assumption that the documents are considered independent of each other. This paper proposes a novel approach named CollabRank to collaborative single-document keyphrase extraction by making use of mutual influences of multiple documents within a cluster context. CollabRank is implemented by first employing the clustering algorithm to obtain appropriate… Expand
Clustering to Find Exemplar Terms for Keyphrase Extraction
TLDR
This work proposes an unsupervised method for keyphrase extraction that outperforms sate-of-the-art graph-based ranking methods (TextRank) by 9.5% in F1-measure and guarantees the document to be semantically covered by these exemplar terms. Expand
A Graph-based Approach of Automatic Keyphrase Extraction
TLDR
This work proposes to take full advantage of the reinforcement between words and sentences by melting three kinds of relationships between them, so that the extracted keyphrases should be synthetic in the sense that they should deal with all the main topics in a document. Expand
WordTopic-MultiRank: A New Method for Automatic Keyphrase Extraction
TLDR
This work proposes WordTopic-MultiRank as a new method for keyphrase extraction, based on the idea that words relate with each other via multiple relations, and treats various latent topics in documents as heterogeneous relations between words and construct a multi-relational word network. Expand
Collaborative Ranking between Supervised and Unsupervised Approaches for Keyphrase Extraction
TLDR
It is concluded that knowledge collaboration between supervised and unsupervised methods can produce higher-quality keyphrases than applying these methods individually. Expand
Automatic Keyphrase Extraction via Topic Decomposition
TLDR
A Topical PageRank (TPR) is built on word graph to measure word importance with respect to different topics and shows that TPR outperforms state-of-the-art keyphrase extraction methods on two datasets under various evaluation metrics. Expand
WikiRank: Improving Keyphrase Extraction Based on Background Knowledge
TLDR
This paper proposes WikiRank, an unsupervised method for keyphrase extraction based on the background knowledge from Wikipedia, which obtains improvements over other state-of-art models by more than 2% in F1-score. Expand
Exploiting neighborhood knowledge for single document summarization and keyphrase extraction
TLDR
This article proposes using a small number of nearest neighbor documents to improve document summarization and keyphrase extraction for the specified document, under the assumption that the neighbor documents could provide additional knowledge and more clues. Expand
Most Important First - Keyphrase Scoring for Improved Ranking in Settings With Limited Keyphrases
TLDR
This paper proposes a scoring of the extracted keyphrases as post-processing to rerank the list of extracted phrases in order to improve precision and recall particularly for the top phrases. Expand
Keyphrase extraction: from distributional feature engineering to distributed semantic composition
TLDR
This research aims to develop a keyphrase extraction approach that automatically learns features of phrases and documents via deep neural networks, which not only eliminates the effort of feature engineering, but also encodes the semantics of phrase and documents in the features. Expand
Accurate keyphrase extraction by discriminating overlapping phrases
TLDR
The document phrase maximality index (DPM-index) is defined, a new measure to discriminate overlapping keyphrase candidates in a text document that makes the DPM- index contribution comparable to that of the well-known TFIDF measure on such a system. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 33 REFERENCES
CorePhrase: Keyphrase Extraction for Document Clustering
TLDR
Subjective as well as quantitative evaluation show that the algorithm outperforms keyword-based cluster-labeling algorithms, and is capable of accurately discovering the topic, and often ranking it in the top one or two extracted keyphrases. Expand
A practical system of keyphrase extraction for web pages
TLDR
A regression model was first trained based on a set of human-labeled documents then used to extract keyphrases from new pages automatically, and a method was put forward in this paper in order to evaluate the similarity of phrases. Expand
Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering
  • H. Zha
  • Computer Science
  • SIGIR '02
  • 2002
A novel method for simultaneous keyphrase extraction and generic text summarization is proposed by modeling text documents as weighted undirected and weighted bipartite graphs. Spectral graphExpand
Coherent Keyphrase Extraction via Web Mining
TLDR
This paper presents enhancements to the Kea keyphrase extraction algorithm that are designed to increase the coherence of the extracted keyphrases. Expand
KEA: practical automatic keyphrase extraction
TLDR
This paper uses a large test corpus to evaluate Kea’s effectiveness in terms of how many author-assigned keyphrases are correctly identified, and describes the system, which is simple, robust, and publicly available. Expand
Domain-specific keyphrase extraction
TLDR
A Keyphrase Identification Program (KIP) is described, which extracts document keyphrases by using prior positive samples of human identified domain key phrases to assign weights to the candidate keyphRases. Expand
Towards an Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction
TLDR
A novel iterative reinforcement approach to simultaneously extractingsummary and keywords from single document under the assumption that the summary and keywords of a document can be mutually boosted. Expand
Keyphrase Extraction in Scientific Publications
TLDR
In the evaluation using a corpus of 120 scientific publications multiply annotated for keyphrases, the system significantly outperformed Kea at the p < .05 level. Expand
Learning Algorithms for Keyphrase Extraction
TLDR
The experimental results support the claim that a custom-designed algorithm (GenEx), incorporating specialized procedural domain knowledge, can generate better keyphrases than a general-purpose algorithm (C4.5). Expand
KPSpotter: a flexible information gain-based keyphrase extraction system
TLDR
KPSpotter is a flexible web-enabled keyphrase extraction system, capable of processing various formats of input data and generating the extraction model as well as the list of keyphrases in XML, which is differentiated from other extraction systems in the followings. Expand
...
1
2
3
4
...