Statistical modeling of sequencing errors in SAGE libraries.

@article{Beibarth2004StatisticalMO,
  title={Statistical modeling of sequencing errors in SAGE libraries.},
  author={Tim Bei\ssbarth and Lavinia Hyde and Gordon K. Smyth and Chris Job and W. J. V. Van der Boon and S Z Tan and Hamish S. Scott and Terence P. Speed},
  journal={Bioinformatics},
  year={2004},
  volume={20 Suppl 1},
  pages={i31-9}
}
MOTIVATION Sequencing errors may bias the gene expression measurements made by Serial Analysis of Gene Expression (SAGE). They may introduce non-existent tags at low abundance and decrease the real abundance of other tags. These effects are increased in the longer tags generated in LongSAGE libraries. Current sequencing technology generates quite accurate estimates of sequencing error rates. Here we make use of the sequence neighborhood of SAGE tags and error estimates from the base-calling… CONTINUE READING