Link mining: a survey

Abstract

Many datasets of interest today are best described as a linked collection of interrelated objects. These may represent homogeneous networks, in which there is a single-object type and link type, or richer, heterogeneous networks, in which there may be multiple object and link types (and possibly other semantic information). Examples of homogeneous networks include single mode social networks, such as people connected by friendship links, or the WWW, a collection of linked web pages. Examples of heterogeneous networks include those in medical domains describing patients, diseases, treatments and contacts, or in bibliographic domains describing publications, authors, and venues. <i>Link mining</i> refers to data mining techniques that explicitly consider these links when building predictive or descriptive models of the linked data. Commonly addressed link mining tasks include object ranking, group detection, collective classification, link prediction and subgraph discovery. While network analysis has been studied in depth in particular areas such as social network analysis, hypertext mining, and web analysis, only recently has there been a cross-fertilization of ideas among these different communities. This is an exciting, rapidly expanding area. In this article, we review some of the common emerging themes.

DOI: 10.1145/1117454.1117456

Extracted Key Phrases

050100'06'07'08'09'10'11'12'13'14'15'16'17
Citations per Year

802 Citations

Semantic Scholar estimates that this publication has 802 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Getoor2005LinkMA, title={Link mining: a survey}, author={Lise Getoor and Christopher P. Diehl}, journal={SIGKDD Explorations}, year={2005}, volume={7}, pages={3-12} }