Joan Albert Silvestre-Cerdà

This paper presents a proposal for extracting parallel corpora from Wikipedia on the basis of statistical machine translation techniques. We use IBM word-level alignment models to obtain phrase-level bilingual alignments between document pairs. We have manually annotated a test set of English-Spanish comparable documents in order to ...