Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History

@inproceedings{Max2010MiningNC,
  title={Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History},
  author={Aur{\'e}lien Max and Guillaume Wisniewski},
  booktitle={LREC},
  year={2010}
}
Naturally-occurring instances of linguistic phenomena are important both for training and for evaluating automatic text processing. When available in large quantities, they also prove interesting material for linguistic studies. In this article, we present WiCoPaCo (Wikipedia Correction and Paraphrase Corpus), a new freely-available resource built by automatically mining Wikipedia’s revision history. The WiCoPaCo corpus focuses on local modifications made by human revisors and include various… CONTINUE READING
Highly Cited
This paper has 55 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 36 extracted citations

56 Citations

01020'12'14'16'18
Citations per Year
Semantic Scholar estimates that this publication has 56 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…