Using Mechanical Turk to Create a Corpus of Arabic Summaries

  title={Using Mechanical Turk to Create a Corpus of Arabic Summaries},
  author={Mahmoud El-Haj and Udo Kruschwitz and Chris Fox},
This paper describes the creation of a human-generated corpus of extractive Arabic summaries of a selection of Wikipedia and Arabic newspaper articles using Mechanical Turk—an online workforce. The purpose of this exercise was two-fold. First, it addresses a shortage of relevant data for Arabic natural language processing. Second, it demonstrates the application of Mechanical Turk to the problem of creating natural language resources. The paper also reports on a number of evaluations we have… CONTINUE READING
Highly Cited
This paper has 40 citations. REVIEW CITATIONS