Bag of Characters and SOM Clustering for Script Recognition and Writer Identification

@article{Marinai2010BagOC,
  title={Bag of Characters and SOM Clustering for Script Recognition and Writer Identification},
  author={Simone Marinai and Beatrice Miotti and Giovanni Soda},
  journal={2010 20th International Conference on Pattern Recognition},
  year={2010},
  pages={2182-2185}
}
In this paper, we describe a general approach for script (and language) recognition from printed documents and for writer identification in handwritten documents. The method is based on a bag of visual word strategy where the visual words correspond to characters and the clustering is obtained by means of Self Organizing Maps (SOM). Unknown pages (words in the case of script recognition) are classified comparing their vectorial representations with those of one training set using a cosine… CONTINUE READING
Highly Cited
This paper has 18 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.

Citations

Publications citing this paper.
Showing 1-10 of 11 extracted citations

A comparative study on clustering techniques for Urdu ligatures in nastaliq font

2017 13th International Conference on Emerging Technologies (ICET) • 2017
View 1 Excerpt

Khmer character recognition using artificial neural network

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific • 2014
View 1 Excerpt

OCR Performance Prediction Using a Bag of Allographs and Support Vector Regression

2014 11th IAPR International Workshop on Document Analysis Systems • 2014
View 1 Excerpt

References

Publications referenced by this paper.
Showing 1-10 of 10 references

On the Use of Textural Features for Writer Identification in Old Handwritten Music Scores

2009 10th International Conference on Document Analysis and Recognition • 2009
View 3 Excerpts
Highly Influenced

Information Retrieval Model for Online Handwritten Script Identification

2009 10th International Conference on Document Analysis and Recognition • 2009
View 1 Excerpt

Mathematical Symbol Indexing Using Topologically Ordered Clusters of Shape Contexts

2009 10th International Conference on Document Analysis and Recognition • 2009
View 1 Excerpt

Self-Organizing Maps for Clustering in Document Image Analysis

Machine Learning in Document Analysis and Recognition • 2008
View 1 Excerpt

Word-wise Sinhala Tamil and English script identification using Gaussian kernel SVM

2008 19th International Conference on Pattern Recognition • 2008
View 1 Excerpt

Clustering document images using a bag of symbols representation

Eighth International Conference on Document Analysis and Recognition (ICDAR'05) • 2005
View 1 Excerpt

Texture for script identification

IEEE Transactions on Pattern Analysis and Machine Intelligence • 2005
View 1 Excerpt

Similar Papers

Loading similar papers…