Node similarity in the citation graph

@article{Lu2006NodeSI,
  title={Node similarity in the citation graph},
  author={Wangzhong Lu and Jeannette C. M. Janssen and Evangelos E. Milios and Nathalie Japkowicz and Yongzheng Zhang},
  journal={Knowledge and Information Systems},
  year={2006},
  volume={11},
  pages={105-129}
}
Published scientific articles are linked together into a graph, the citation graph, through their citations. This paper explores the notion of similarity based on connectivity alone, and proposes several algorithms to quantify it. Our metrics take advantage of the local neighborhoods of the nodes in the citation graph. Two variants of link-based similarity estimation between two nodes are described, one based on the separate local neighborhoods of the nodes, and another based on the joint local… 
A Comparative Analysis of Network-Based Similarity Measures for Scientific Paper Recommendations
TLDR
Three similarity measures for scientific papers are compared: bibliographic coupling, co-citation coupling and cosine similarity, which shows that some measures are structurally very similar, yet if nodes are ordered according to their similarity, the two measures do not always produce the same rankings.
Assessing Document Relevance by Modeling Citation Networks with Probabilistic Graphs
TLDR
This paper addressed the problems of assessing both the importance of the citations of a paper and to identify relevant papers by modeling the citation network with a probabilistic graph useful to infer unknown links among the nodes representing papers.
Measure the similarity of nodes in the complex networks
TLDR
The results in this paper show that the proposed method which is based on the degree centrality and the Relative-entropy is useful and reasonable to measure the similarity of the nodes in the complex networks.
Extended co-citation search: Graph-based document retrieval on a co-citation network containing citation context information
  • Masaki Eto
  • Computer Science
    Inf. Process. Manag.
  • 2019
TLDR
It is clarified that the combination of graph-based algorithms and co-citation contexts are effective in improving the performance of co-Citation search techniques, and that sole use of a graph- based algorithm is not enough to enhance search performances from the baselines.
Document similarity detection using semantic social network analysis on RDF citation graph
TLDR
This work has answered the question of how to identify document similarity measures by applying semantic social network analysis techniques on RDF citation graphs of research papers to identify the pair wise similarity between these papers.
Link-based similarity measurement techniques and applications
TLDR
This thesis proposes an item-based top-N recommendation algorithm called GCP, which refines the "1 item"-based traditional CP (Conditional Probability) algorithm by taking the "multi-item"-based conditional probabilities into account and presents the item-graph model, which is used for tracking the relationships between items.
Automatically Generating Citation Graphs (and Variants) for Systematic Reviews
TLDR
A tool for automatically generating citation graphs from a set of paper documents, which runs stand-alone or integrated in a systematic reviews application, and proposes several strategies to reduce the complexity of citation graphs and add additional information for in-depth analysis of the impact of single publications.
Finding Relevant Papers Based on Citation Relations
TLDR
A novel method to address the problem of recommending relevant papers to researchers by incorporating various citation relations for a proper set of papers, which are more relevant but with a very limited size is proposed.
Academic Paper Recommendation Based on Heterogeneous Graph
TLDR
This paper proposes a novel recommendation method with a heterogeneous graph in which both citation and content knowledge are included, and demonstrates that this approach outperforms traditional methods.
Direction Awareness in Citation Recommendation
Literature search is an important part of academic research. The increase in the number of published papers each year makes manual search inefficient, hence, automatic methods must be devised.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 32 REFERENCES
Characterizing and Mining the Citation Graph of the Computer Science Literature
TLDR
This work used a web robot to build several topic-specific citation graphs and their union graph from the digital library ResearchIndex, and discovered the existence of a single large weakly-connected and a singleLarge biconnected component, and confirmed the expected lack of a large strongly-connected component.
Co-citation in the scientific literature: A new measure of the relationship between two documents
  • H. Small
  • Computer Science
    J. Am. Soc. Inf. Sci.
  • 1973
TLDR
A new form of document coupling called co-citation is defined as the frequency with which two documents are cited together, and clusters of co- cited papers provide a new way to study the specialty structure of science.
Node similarity in networked information spaces
TLDR
This paper explores the definition of similarity based on connectivity only, and proposes several algorithms for this purpose, which take advantage of the local neighborhoods of the nodes in the networked information space.
Stable algorithms for link analysis
TLDR
The analysis is extended and it is shown how it gives insight into ways of designing stable link analysis methods and motivates two new algorithms, whose performance is studied empirically using citation data and web hyperlink data.
Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries
TLDR
Salient semantic structures and citation patterns are extracted from several collections of documents, including the ACM SIGCHI Conference Proceedings and ACM Hypertext Conference Proceedings, using Latent Semantic Indexing and Pathfinder Network Scaling.
The synthesis of specialty narratives from co-citation clusters
  • H. Small
  • Computer Science
    J. Am. Soc. Inf. Sci.
  • 1986
TLDR
A method is described for generating reviews or synopses of scientific fields called specialty narratives, applied to a co-citation cluster in the field of cancer virology, and the resulting specialty narrative is described in relation to the structure of the original cocitation map.
Citation analysis as a tool in journal evaluation.
TLDR
In 1971, the Institute for Scientfic Information decided to undertake a systematic analysis of journal citation patterns across the whole of science and technology.
Extracting Large-Scale Knowledge Bases from the Web
TLDR
This paper develops novel algorithms for enumerating and organizing all web occurrences of certain subgraphs that are signatures of web phenomena such as tightly-focused topic communities, webrings, taxonomy trees, keiretsus, etc, and argues that these algorithms run efficiently in this model.
Communities of interest
TLDR
This work introduces a data structure that captures, in an approximate sense, the graph and its evolution through time, as the union of small subgraphs, called Communities of Interest (COI), centered on every node.
AUTOMATIC TERM EXTRACTION AND DOCUMENT SIMILARITY IN SPECIAL TEXT CORPORA
This paper conflrms that the performance of a state-of-the-art automatic term extraction method on a computer science corpus is similar to previously published performance data on a medical corpus.
...
1
2
3
4
...