Splice site prediction using stochastic regular grammars.

Authors: André Yoshiaki Kashiwabara, Danielle Cristina Vieira, Ariane Machado-Lima, Alan M. Durham
Journal: Genetics and Molecular Research: GMR, 6(1)
This paper presents a novel approach to the problem of splice site prediction by applying stochastic grammar inference. We used four grammar inference algorithms to infer 1465 grammars, and used 10-fold cross-validation to select the best grammar for each algorithm. The corresponding grammars were embedded into a classifier and used to run splice site prediction and compare the results with those of NNSPLICE, the predictor used by the Genie gene finder. We indicate possible paths to improve…
This paper has 21 citations.

Publications citing this paper.

Self-Adaptive Web Service Composition Based on Stochastic Context-Free Grammar

2014 IEEE 11th International Conference on e-Business Engineering • 2014

Neural network based splice site prediction methods

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) • 2016

Stochastic Finite Automata for the translation of DNA to protein

2014 IEEE International Conference on Big Data (Big Data) • 2014

