Characterizing and Mining the Citation Graph of the Computer Science Literature

@article{An2003CharacterizingAM,
  title={Characterizing and Mining the Citation Graph of the Computer Science Literature},
  author={Yuan An and Jeannette C. M. Janssen and Evangelos E. Milios},
  journal={Knowledge and Information Systems},
  year={2003},
  volume={6},
  pages={664-678}
}
Citation graphs representing a body of scientific literature convey measures of scholarly activity and productivity. [...] Key Method After verifying that the degree distributions follow a power law, we applied a series of graph theoretical algorithms to elicit an aggregate picture of the citation graph in terms of its connectivity. We discovered the existence of a single large weakly-connected and a single large biconnected component, and confirmed the expected lack of a large strongly-connected component. The…Expand
Analysis of Scientific Collaboration Networks : Social Factors , Evolution , and Topical Clustering Diploma Thesis of
In scientometrics, the quantitative study of science, network analysis has become a prominent tool. The kinds of networks most frequently examined have been citation networks (mapping links between
Knowledge and Information Systems REGULAR PAPER
Published scientific articles are linked together into a graph, the citation graph, through their citations. This paper explores the notion of similarity based on connectivity alone, and proposes
Node similarity in the citation graph
TLDR
This paper explores the notion of similarity based on connectivity alone, and proposes several algorithms to quantify it, and takes advantage of the local neighborhoods of the nodes in the citation graph to demonstrate the complementarity of link-based and text-based retrieval.
Stochastic Block Model Reveals the Map of Citation Patterns and Their Evolution in Time
TLDR
This study maps out the large-scale structure of citation networks of science journals and follows their evolution in time by using stochastic block models (SBMs), and illustrates how these block networks can be used as maps of science.
Patent citation network analysis: A perspective from descriptive statistics and ERGMs
TLDR
This paper identifies and analyze the most cited patents, the most innovative and the highly cited companies along with the structural properties of the network by providing in-depth descriptive analysis and employs Exponential Random Graph Models (ERGMs) to analyze the citation networks.
Finding seminal scientific publications with graph mining
TLDR
It is found that the backbone graph provides a way to possibly discover seminal publications with low citation count, and combining betweenness and burstiness gives results on par with citation count.
Visualizing evolving networks: minimum spanning trees versus pathfinder networks
  • Chaomei Chen, S. Morris
  • Computer Science
    IEEE Symposium on Information Visualization 2003 (IEEE Cat. No.03TH8714)
  • 2003
TLDR
This article examines the animated visualization models of the evolution of botulinum toxin research in terms of its co-citation structure across a 58-year span and suggests that the design of visualization and modeling tools for network evolution should take the cohesiveness of critical paths into account.
Power walk: revisiting the random surfer
TLDR
This article introduces a new method of transitioning a graph, called Power Walk, that can successfully compute centrality scores for graphs with real weighted edges, and shows that it satisfies the desired properties, and that its computation time and centrality ranking is similar to when using the Random Surfer model for non-negative matrices.
Graph Mining and Community Evaluation with Degeneracy
The study and analysis of social networks attract attention from a variety of Sciences (psychology, statistics, sociology). Among them, the field of Data Mining offers tools to automatically extract
Exploring the computing literature using temporal graph visualization
TLDR
A novel technique for visualization of large graphs that evolve through time is used, given a dynamic graph, that produces two-dimensional representations of each timeslice, while preserving the mental map of the graph from one slice to the next.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 54 REFERENCES
How popular is your paper? An empirical study of the citation distribution
Abstract:Numerical data for the distribution of citations are examined for: (i) papers published in 1981 in journals which are catalogued by the Institute for Scientific Information (783,339 papers)
Citation analysis as a tool in journal evaluation.
TLDR
In 1971, the Institute for Scientfic Information decided to undertake a systematic analysis of journal citation patterns across the whole of science and technology.
Authoritative sources in a hyperlinked environment
TLDR
This work proposes and test an algorithmic formulation of the notion of authority, based on the relationship between a set of relevant authoritative pages and the set of “hub pages” that join them together in the link structure, and has connections to the eigenvectors of certain matrices associated with the link graph.
Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries
TLDR
Salient semantic structures and citation patterns are extracted from several collections of documents, including the ACM SIGCHI Conference Proceedings and ACM Hypertext Conference Proceedings, using Latent Semantic Indexing and Pathfinder Network Scaling.
CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications
TLDR
A Web based information agent that assists the user in the process of performing a scientific literature search and can find papers which are similar to a given paper using word information and byanalyzing common citations made by the papers.
Mining the Web's Link Structure
TLDR
Clever is a search engine that analyzes hyperlinks to uncover two types of pages: authorities, which provide the best source of information on a given topic; and hubs, which provides collections of links to authorities.
Scale-free characteristics of random networks: the topology of the world-wide web
The world-wide web forms a large directed graph, whose vertices are documents and edges are links pointing from one document to another. Here we demonstrate that despite its apparent random
Graph structure in the Web
TLDR
The study of the web as a graph yields valuable insight into web algorithms for crawling, searching and community discovery, and the sociological phenomena which characterize its evolution.
The Anatomy of a Large-Scale Hypertextual Web Search Engine
TLDR
This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
Efficient identification of Web communities
TLDR
A focused crawler that crawls to a depth can approximate community membership by augmenting the graph induced by the cra wl with links to a virtual sink node.
...
1
2
3
4
5
...