LCSTS: A Large Scale Chinese Short Text Summarization Dataset

@inproceedings{Hu2015LCSTSAL,
  title={LCSTS: A Large Scale Chinese Short Text Summarization Dataset},
  author={Baotian Hu and Qingcai Chen and Fangze Zhu},
  booktitle={EMNLP},
  year={2015}
}
Automatic text summarization is widely regarded as the highly difficult problem, partially because of the lack of large text summarization data set. Due to the great challenge of constructing the large scale summaries for full text, in this paper, we introduce a large corpus of Chinese short text summarization dataset constructed from the Chinese microblogging website Sina Weibo, which is released to the public1. This corpus consists of over 2 million real Chinese short texts with short… CONTINUE READING
Highly Influential
This paper has highly influenced a number of papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 64 citations. REVIEW CITATIONS

6 Figures & Tables

Topics

Statistics

020402015201620172018
Citations per Year

64 Citations

Semantic Scholar estimates that this publication has 64 citations based on the available data.

See our FAQ for additional information.