Sequence clustering

Known as: Blastclust, Sequence cluster, Sequence clusters 
In bioinformatics, sequence clustering algorithms attempt to group biological sequences that are somehow related. The sequences can be either of… (More)
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2012
Highly Cited
2012
The rapid advances of high-throughput sequencing technologies dramatically prompted metagenomic studies of microbial communities… (More)
  • table 1
  • table 2
  • table 3
  • figure 1
  • figure 2
Is this relevant?
Highly Cited
2010
Highly Cited
2010
UNLABELLED CD-HIT is a widely used program for clustering and comparing large biological sequence datasets. In order to further… (More)
  • figure 1
Is this relevant?
Highly Cited
2010
Highly Cited
2010
The number of gene sequences that are available for comparative genomics approaches is increasing extremely quickly. A current… (More)
  • figure 1
  • figure 2
  • table 1
  • figure 3
  • figure 4
Is this relevant?
Highly Cited
2010
Highly Cited
2010
Sequencing of environmental DNA (often called metagenomics) has shown tremendous potential to uncover the vast number of unknown… (More)
  • figure 1
  • figure 2
  • figure 3
  • table 1
  • table 2
Is this relevant?
2007
2007
In this paper, we explore the discriminating subsequencebased clustering problem. First, several effective optimization… (More)
  • table 1
  • figure 1
Is this relevant?
Highly Cited
2006
Highly Cited
2006
MOTIVATION In 2001 and 2002, we published two papers (Bioinformatics, 17, 282-283, Bioinformatics, 18, 77-82) describing an… (More)
Is this relevant?
2006
2006
In this paper, we first discuss issues in clustering biological sequences with graph properties, which inspired the design of our… (More)
  • figure 1
  • figure 2
  • figure 4
  • figure 5
  • figure 6
Is this relevant?
Highly Cited
2003
Highly Cited
2003
Analyzing sequence data has become increasingly important recently in the area of biological sequences, text documents, web… (More)
  • figure 1
  • figure 2
  • table 1
  • figure 3
  • figure 4
Is this relevant?
Highly Cited
2002
Highly Cited
2002
Clustering of sequential or temporal data is more challenging than traditional clustering as dynamic observations should be… (More)
  • table 1
Is this relevant?
Highly Cited
2000
Highly Cited
2000
MOTIVATION Efficient, accurate and automatic clustering of large protein sequence datasets, such as complete proteomes, into… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?