CLICK and EXPANDER: a system for clustering and visualizing gene expression data

Abstract

MOTIVATION: Microarrays have become a central tool in biological research. Their applications range from functional annotation to tissue classification and genetic network inference. A key step in the analysis of gene expression data is the identification of groups of genes that manifest similar expression patterns. This translates to the algorithmic problem of clustering genes based on their expression patterns.

RESULTS: We present a novel clustering algorithm, called CLICK, and its applications to gene expression analysis. The algorithm utilizes graph-theoretic and statistical techniques to identify tight groups (kernels) of highly similar elements, which are likely to belong to the same true cluster. Several heuristic procedures are then used to expand the kernels into the full clusters. We report on the application of CLICK to a variety of gene expression data sets. In all those applications it outperformed extant algorithms according to several common figures of merit. We also point out that CLICK can be successfully used for the identification of common regulatory motifs in the upstream regions of co-regulated genes. Furthermore, we demonstrate how CLICK can be used to accurately classify tissue samples into disease types, based on their expression profiles. Finally, we present a new Java-based graphical tool, called EXPANDER, for gene expression analysis and visualization, which incorporates CLICK and several other popular clustering algorithms.

AVAILABILITY: http://www.cs.tau.ac.il/~rshamir/expander/expander.html
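The two-phase idea described in the abstract (find tight kernels first, then expand them into full clusters) can be illustrated with a toy sketch. This is not the CLICK algorithm: the abstract does not specify its graph-theoretic or statistical procedures, so the sketch swaps in assumptions of its own, namely Pearson correlation as the similarity measure, connected components of a thresholded similarity graph as stand-in kernels, and a nearest-centroid adoption step as the expansion heuristic. The function names, thresholds, and gene profiles are all hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def find_kernels(profiles, kernel_threshold=0.9, min_size=2):
    """Group genes linked by very high pairwise similarity.

    Assumption: connected components of a thresholded similarity graph
    stand in for kernels; this is NOT the paper's kernel procedure.
    """
    genes = list(profiles)
    parent = {g: g for g in genes}

    def find(g):
        # Union-find lookup with path halving.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b in combinations(genes, 2):
        if pearson(profiles[a], profiles[b]) >= kernel_threshold:
            parent[find(a)] = find(b)

    groups = {}
    for g in genes:
        groups.setdefault(find(g), []).append(g)
    return [members for members in groups.values() if len(members) >= min_size]


def expand_kernels(profiles, kernels, adopt_threshold=0.6):
    """Attach each leftover gene to the kernel whose mean profile it matches best.

    Assumption: a single nearest-centroid pass with a looser threshold;
    genes that match no kernel well enough are returned as unassigned.
    """
    clusters = [list(k) for k in kernels]
    assigned = {g for k in kernels for g in k}
    centroids = [
        [sum(col) / len(k) for col in zip(*(profiles[g] for g in k))]
        for k in kernels
    ]
    unassigned = []
    for g in profiles:
        if g in assigned:
            continue
        scores = [pearson(profiles[g], c) for c in centroids]
        best = max(range(len(scores)), key=scores.__getitem__, default=None)
        if best is not None and scores[best] >= adopt_threshold:
            clusters[best].append(g)
        else:
            unassigned.append(g)
    return clusters, unassigned


# Hypothetical toy profiles: two rising genes, two falling genes,
# one noisier rising gene, and one oscillating gene.
profiles = {
    "g1": [1, 2, 3, 4],
    "g2": [2, 4, 6, 8.1],
    "g3": [4, 3, 2, 1],
    "g4": [8, 6, 4.2, 2],
    "g5": [1, 2, 3, 2.5],
    "g6": [1, -1, 1, -1],
}
kernels = find_kernels(profiles)
clusters, unassigned = expand_kernels(profiles, kernels)
print(kernels, clusters, unassigned)
```

Using a strict threshold for kernels and a looser one for adoption mirrors the abstract's intent only loosely: kernels should contain elements that almost certainly belong together, while expansion is allowed to be more permissive.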

DOI: 10.1093/bioinformatics/btg232

Semantic Scholar estimates that this publication has 681 citations based on the available data.


Cite this paper

@article{Sharan2003CLICKAE, title={CLICK and EXPANDER: a system for clustering and visualizing gene expression data}, author={Roded Sharan and Adi Maron-Katz and Ron Shamir}, journal={Bioinformatics}, year={2003}, volume={19}, number={14}, pages={1787--1799} }