Corpus ID: 236428366

Comprehensive Studies for Arbitrary-shape Scene Text Detection

@article{Dai2021ComprehensiveSF,
  title={Comprehensive Studies for Arbitrary-shape Scene Text Detection},
  author={Pengwen Dai and Xiaochun Cao},
  journal={ArXiv},
  year={2021},
  volume={abs/2107.11800}
}
  • Pengwen Dai, Xiaochun Cao
  • Published 2021
  • Computer Science
  • ArXiv
Numerous scene text detection methods have been proposed in recent years. Most of them declare they have achieved state-of-the-art performances. However, the performance comparison is unfair, due to lots of inconsistent settings (e.g., training data, backbone network, multi-scale feature fusion, evaluation protocols, etc.). These various settings would dissemble the pros and cons of the proposed core techniques. In this paper, we carefully examine and analyze the inconsistent settings, and… Expand

Figures and Tables from this paper

References

SHOWING 1-10 OF 56 REFERENCES
EAST: An Efficient and Accurate Scene Text Detector
  • X. Zhou, C. Yao, +4 authors Jiajun Liang
  • Computer Science
  • 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2017
TLDR
This work proposes a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes, and significantly outperforms state-of-the-art methods in terms of both accuracy and efficiency. Expand
Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network
TLDR
This paper proposes an efficient and accurate arbitrary-shaped text detector, termed Pixel Aggregation Network (PAN), which is equipped with a low computational-cost segmentation head and a learnable post-processing. Expand
ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection
TLDR
The ContourNet is proposed, which effectively handles the problems taking a further step toward accurate arbitrary-shaped text detection and suppresses false positives by only outputting predictions with high response value in both orthogonal directions. Expand
Shape Robust Text Detection With Progressive Scale Expansion Network
TLDR
A novel Progressive Scale Expansion Network (PSENet) is proposed, which can precisely detect text instances with arbitrary shapes and is effective to split the close text instances, making it easier to use segmentation-based methods to detect arbitrary-shaped text instances. Expand
Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector
TLDR
The Mask Tightness Text Detector (Mask TTD) uses a tightness prior and text frontier learning to enhance pixel-wise mask prediction and achieves mutual promotion by integrating a branch for the polygonal boundary of each text region, which significantly improves the detection performance of arbitrarily shaped text. Expand
Real-time Scene Text Detection with Differentiable Binarization
TLDR
This paper proposes a module named Differentiable Binarization (DB), which can perform the binarization process in a segmentation network, and validate the performance improvements of DB on five benchmark datasets, which consistently achieves state-of-the-art results, in terms of both detection accuracy and speed. Expand
Character Region Awareness for Text Detection
TLDR
A new scene text detection method to effectively detect text area by exploring each character and affinity between characters by exploiting both the given character- level annotations for synthetic images and the estimated character-level ground-truths for real images acquired by the learned interim model. Expand
CRNet: A Center-aware Representation for Detecting Text of Arbitrary Shapes
TLDR
This work proposes an anchor-free scene text detector leveraging Center-aware Representation to achieve accurate arbitrary-shaped scene text detection namely CRNet and proposes a center-aware location algorithm to explicitly learn center regions and center points of text instances, which is able to separate adjacent text instances effectively. Expand
Learning Shape-Aware Embedding for Scene Text Detection
TLDR
This work treats text detection as instance segmentation and proposes a segmentation-based framework, which extracts each text instance as an independent connected component and maps pixels onto an embedding space to distinguish different text instances. Expand
Sequential Deformation for Accurate Scene Text Detection
TLDR
This paper proposes a novel sequential deformation method to effectively model the line-shape of scene text and demonstrates that the proposed method has outperformed previous state-of-the-art methods. Expand
...
1
2
3
4
5
...