PHDIndic_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification

@article{Obaidullah2017PHDIndic_11PH,
  title={PHDIndic_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification},
  author={Sk Md Obaidullah and Chayan Halder and K. C. Santosh and Nibaran Das and Kaushik Roy},
  journal={Multimedia Tools and Applications},
  year={2017},
  volume={77},
  pages={1643-1678}
}
Without publicly available dataset, specifically in handwritten document recognition (HDR), we cannot make a fair and/or reliable comparison between the methods. Considering HDR, Indic script’s document recognition is still in its early stage compared to others such as Roman and Arabic. In this paper, we present a page-level handwritten document image dataset (PHDIndic_11), of 11 official Indic scripts: Bangla, Devanagari, Roman, Urdu, Oriya, Gurumukhi, Gujarati, Tamil, Telugu, Malayalam and… CONTINUE READING
BETA

Citations

Publications citing this paper.
SHOWING 1-8 OF 8 CITATIONS

Improved word-level handwritten Indic script identification by integrating small convolutional neural networks

  • Neural Computing and Applications
  • 2019
VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Deep learning for word-level handwritten Indic script identification

VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Extreme learning machine for handwritten Indic script identification in multiscript documents

  • J. Electronic Imaging
  • 2018
VIEW 3 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Using dynamic routing to extract intermediate features for developing scalable capsule networks

Bodhisatwa Mandal, Swarnendu Ghosh, Ritesh Sarkhel, Nibaran Das, Mita Nasipuri
  • ArXiv
  • 2019
VIEW 1 EXCERPT

Script identification algorithms: a survey

  • International Journal of Multimedia Information Retrieval
  • 2017
VIEW 1 EXCERPT
CITES BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 36 REFERENCES

A benchmarkKannada handwritten document dataset and its segmentation

A Aleai, P Nagabhushan, U Pal
  • Proceedings of the International Conference on Document Analysis and Recognition,
  • 2011
VIEW 7 EXCERPTS
HIGHLY INFLUENTIAL

CMATERdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image

  • International Journal on Document Analysis and Recognition (IJDAR)
  • 2011
VIEW 7 EXCERPTS
HIGHLY INFLUENTIAL

Script Recognition—A Review

  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2010
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

A Database for Handwritten Text Recognition Research

  • IEEE Trans. Pattern Anal. Mach. Intell.
  • 1994
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

QUWI: An Arabic and English Handwriting Dataset for Offline Writer Identification

  • 2012 International Conference on Frontiers in Handwriting Recognition
  • 2012
VIEW 3 EXCERPTS
HIGHLY INFLUENTIAL

Databases for research on recognition of handwritten characters of Indian scripts

  • Eighth International Conference on Document Analysis and Recognition (ICDAR'05)
  • 2005
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Similar Papers

Loading similar papers…