Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation

@inproceedings{Wang2017ConvolutionalNN,
  title={Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation},
  author={Chunqi Wang and Bo Xu},
  booktitle={IJCNLP},
  year={2017}
}
Character-based sequence labeling framework is flexible and efficient for Chinese word segmentation (CWS). Recently, many character-based neural models have been applied to CWS. While they obtain good performance, they have two obvious weaknesses. The first is that they heavily rely on manually designed bigram feature, i.e. they are not good at capturing n-gram features automatically. The second is that they make no use of full word information. For the first weakness, we propose a… CONTINUE READING
Recent Discussions
This paper has been referenced on Twitter 10 times over the past 90 days. VIEW TWEETS