WeBiText : Building Large Heterogeneous Translation Memories from Parallel Web Content

@inproceedings{Dsilets2008WeBiTextB,
  title={WeBiText : Building Large Heterogeneous Translation Memories from Parallel Web Content},
  author={Alain D{\'e}silets},
  year={2008}
}
This paper investigates the extent to which a useful general purpose Translation Memory (TM) can be built based on very large amounts of heterogeneous parallel texts mined from the Web. In particular, we evaluate whether such a TM could add value over TMs built from other large, publicly available parallel corpora, such as the Canadian Hansard. In the case of Canadian translators working with English and French, we show that the answer to both questions is a resounding yes. Using field data… CONTINUE READING
Highly Cited
This paper has 26 citations. REVIEW CITATIONS