UniqueProt: creating representative protein sequence sets

@article{Mika2003UniqueProtCR,
  title={UniqueProt: creating representative protein sequence sets},
  author={Sven Mika and Burkhard Rost},
  journal={Nucleic acids research},
  year={2003},
  volume={31 13},
  pages={3789-91}
}
UniqueProt is a practical and easy to use web service designed to create representative, unbiased data sets of protein sequences. The largest possible representative sets are found through a simple greedy algorithm using the HSSP-value to establish sequence similarity. UniqueProt is not a real clustering program in the sense that the 'representatives' are not at the centres of well-defined clusters since the definition of such clusters is problem-specific. Overall, UniqueProt is a reasonable… CONTINUE READING

Citations

Publications citing this paper.
Showing 1-10 of 81 extracted citations

Predicting Metal-Binding Sites from Protein Sequence

IEEE/ACM Transactions on Computational Biology and Bioinformatics • 2012
View 5 Excerpts
Highly Influenced

References

Publications referenced by this paper.
Showing 1-10 of 18 references

Local alignment statistics.

Methods in enzymology • 1996
View 8 Excerpts
Highly Influenced

Sequence conserved for subcellular localization.

Protein science : a publication of the Protein Society • 2002

Similar Papers

Loading similar papers…