A Large Chinese Text Dataset in the Wild

@article{Yuan2019ALC,
  title={A Large Chinese Text Dataset in the Wild},
  author={Tai-Ling Yuan and Zhe Zhu and Kun Xu and Cheng-Jun Li and Tai-Jiang Mu and S. Hu},
  journal={Journal of Computer Science and Technology},
  year={2019},
  volume={34},
  pages={509-521}
}
In this paper, we introduce a very large Chinese text dataset in the wild. While optical character recognition (OCR) in document images is well studied and many commercial tools are available, the detection and recognition of text in natural images is still a challenging problem, especially for some more complicated character sets such as Chinese text. Lack of training data has always been a problem, especially for deep learning methods which require massive training data. In this paper, we… CONTINUE READING

Similar Papers

References

Publications referenced by this paper.
SHOWING 1-10 OF 41 REFERENCES

ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17)

  • 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)
  • 2017
VIEW 8 EXCERPTS
HIGHLY INFLUENTIAL

YOLO9000: Better, Faster, Stronger

  • 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2016
VIEW 13 EXCERPTS
HIGHLY INFLUENTIAL

Deep Residual Learning for Image Recognition

  • 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2015
VIEW 5 EXCERPTS
HIGHLY INFLUENTIAL

Detecting text in natural scenes with stroke width transform

  • 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
  • 2010
VIEW 5 EXCERPTS
HIGHLY INFLUENTIAL

EAST: An Efficient and Accurate Scene Text Detector

  • 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2017
VIEW 2 EXCERPTS