Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora

@inproceedings{Gupta2013ImprovingMS,
  title={Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora},
  author={Rajdeep Gupta and Santanu Pal and Sivaji Bandyopadhyay},
  booktitle={BUCC@ACL},
  year={2013}
}
In this article, we present an automated approach of extracting English-Bengali parallel fragments of text from comparable corpora created using Wikipedia documents. Our approach exploits the multilingualism of Wikipedia. The most important fact is that this approach does not need any domain specific corpus. We have been able to improve the BLEU score of an existing domain specific EnglishBengali machine translation system by 11.14%. 
Highly Cited
This paper has 17 citations. REVIEW CITATIONS