Venue Classification of Research Papers in Scholarly Digital Libraries

  title={Venue Classification of Research Papers in Scholarly Digital Libraries},
  author={Cornelia Caragea and Corina Florescu},
Open-access scholarly digital libraries crawl periodically a list of URLs in order to obtain appropriate collections of freely-available research papers. The metadata of the crawled papers, e.g., title, authors, and references, are automatically extracted before the papers are indexed in a digital library. The venue of publication is another important aspect about a scientific paper, which reflects its authoritativeness. However, the venue is not always readily available for a paper. Instead… 
1 Citations
FLY: Venue Recommendation using Limited Context
This paper proposes a system that uses very limited context, namely the paper title, abstract, and a venue network, which is constructed using only a small subset of authors' publication history, to do venue recommendation.


CiteSeer x : A Scholarly Big Dataset
This work proposes an approach to CiteSeer x metadata cleaning that incorporates information from an external data source, which is substantially cleaner than the entire set of data, and makes the new dataset available to the research community.
Citation-Enhanced Keyphrase Extraction from Research Papers: A Supervised Approach
Novel features based on citation network information are designed and used in conjunction with traditional features for keyphrase extraction to obtain remarkable improvements in performance over strong baselines.
Publication Venue Recommendation Based on Paper Abstract
The results show that the proposed venue recommendation system provides recommendations whose quality is aligned with previous works, while requiring much less information from both the paper and the knowledge base.
CiteSeer: an automatic citation indexing system
CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations.
Publication Venue Recommendation Using Author Network's Publication History
This paper presents a new approach allowing researchers to automatically find appropriate publication venues for their research paper by exploring author's network of related co-authors and other researchers in the same domain, and shows that this new approach outperforms the content-based approach and provides accurate recommendation.
Extracting Keyphrases from Research Papers Using Citation Networks
This work proposes CiteTextRank for keyphrase extraction from research articles, a graph-based algorithm that incorporates evidence from both a document's content as well as the contexts in which the document is referenced within a citation network.
PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents
An unsupervised model for keyphrase extraction from scholarly documents that incorporates information from all positions of a word’s occurrences into a biased PageRank, which achieves remarkable improvements over PageRank models that do not take into account word positions.
Co-Training for Topic Classification of Scholarly Data
A co-training approach that uses the text and citation information of a research article as two different views to predict the topic of an article is described, showing that this method improves significantly over the individual classifiers, while also bringing a substantial reduction in the amount of labeled data required for training accurate classifiers.
Graph-Based Keyword Extraction for Single-Document Summarization
The experiments show that given a set of summarized training documents, the supervised classification provides the highest keyword identification accuracy, while the highest F-measure is reached with a simple degree-based ranking.
Detecting research topics via the correlation between graphs and texts
This paper presents a unique approach that uses the correlation between the distribution of a term that represents a topic and the link distribution in the citation graph where the nodes are limited to the documents containing the term.