GROUNDTRUTH GENERATION AND DOCUMENT IMAGE DEGRADATION

@inproceedings{Zi2005GROUNDTRUTHGA,
  title={GROUNDTRUTH GENERATION AND DOCUMENT IMAGE DEGRADATION},
  author={Gang Zi},
  year={2005}
}
Abstract : The problem of generating synthetic data for the training and evaluation of document analysis systems has been widely addressed in recent years. With the increased interest in processing multilingual sources, however, there is a tremendous need to be able to rapidly generate data in new languages and scripts, without the need to develop specialized systems. We have developed a system, which uses language support of the MS Windows operating system combined with custom print drivers to… CONTINUE READING

Similar Papers

Citations

Publications citing this paper.
SHOWING 1-10 OF 15 CITATIONS

Document Image Quality Assessment: A Brief Survey

  • 2013 12th International Conference on Document Analysis and Recognition
  • 2013
VIEW 4 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval

  • 2013 12th International Conference on Document Analysis and Recognition
  • 2013
VIEW 2 EXCERPTS
CITES METHODS

References

Publications referenced by this paper.
SHOWING 1-10 OF 35 REFERENCES

Document image ground truth generation from electronic text

  • Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.
  • 2004

A line drawings degradation model for performance characterization

  • Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
  • 2003

Generation of synthetic training data for an HMM-based handwriting recognition system

  • Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
  • 2003

Training on severely degraded text-line images

  • Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
  • 2003

Attributed point matching for automatic groundtruth generation

  • International Journal on Document Analysis and Recognition
  • 2002

Synthetic data for Arabic OCR system development

  • Proceedings of Sixth International Conference on Document Analysis and Recognition
  • 2001