Efficient clustering of large EST data sets on parallel computers.

  title={Efficient clustering of large EST data sets on parallel computers.},
  author={Anantharaman Kalyanaraman and Srinivas Aluru and Suresh C. Kothari and Volker Brendel},
  journal={Nucleic acids research},
  volume={31 11},
Clustering expressed sequence tags (ESTs) is a powerful strategy for gene identification, gene expression studies and identifying important genetic variations such as single nucleotide polymorphisms. To enable fast clustering of large-scale EST data, we developed PaCE (for Parallel Clustering of ESTs), a software program for EST clustering on parallel computers. In this paper, we report on the design and development of PaCE and its evaluation using Arabidopsis ESTs. The novel features of our… CONTINUE READING


Publications citing this paper.
Showing 1-10 of 65 extracted citations

A Parallel Algorithm for Finding All Pairs κ-Mismatch Maximal Common Substrings

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis • 2016
View 1 Excerpt


Publications referenced by this paper.
Showing 1-10 of 16 references

Re®ned annotation of the Arabidopsis thaliana genome by complete EST mapping

W. Zhu, S. D. Schlueter, V. Brendel
Plant Physiol., • 2003
View 1 Excerpt

SpliceNest: visualizing gene structure and alternative splicing based on EST clusters

E. Coward, S. A. Hass, M. Vingron
Trends Genet., • 2002
View 1 Excerpt

Similar Papers

Loading similar papers…