Multi-view clustering with exemplars for scientific mapping

Abstract

Scientific mapping has now become an important subject in the scientometrics field. Journal clustering can provide insights into both the internal relations among journals and the evolution trend of studies. In this paper, we apply the affinity propagation (AP) algorithm to do scientific journal clustering. The AP algorithm identifies clusters by detecting their representative points through message passing within the data points. Compared with other clustering algorithms, it can provide representatives for each cluster and does not need to pre-specify the number of clusters. Because the input of the AP algorithm is the similarity matrix among data points, it can be applied to various forms of data sets with different similarity metrics. In this paper, we extract the similarity matrices from the journal data sets in both cross citation view and text view and use the AP algorithm to cluster the journals. Through empirical analysis, we conclude that these two clustering results by the two single views are highly complementary. Therefore, we further combine text information with cross citation information by using the simple average scheme and apply the AP algorithm to conduct multi-view clustering. The multi-view clustering strategy aims at obtaining refined clusters by integrating information from multiple views. With text view and citation view integrated, experiments on the Web of Science journal data set verify that the AP algorithm obtains better clustering results as expected.

DOI: 10.1007/s11192-015-1682-7

Extracted Key Phrases

19 Figures and Tables

Cite this paper

@article{Meng2015MultiviewCW, title={Multi-view clustering with exemplars for scientific mapping}, author={XiangFeng Meng and Xinhai Liu and Yunhai Tong and Wolfgang Gl{\"a}nzel and Shaohua Tan}, journal={Scientometrics}, year={2015}, volume={105}, pages={1527-1552} }