Preparatory Work on Automatic Extraction of Bilingual Multi-Word Units from Parallel Corpora

@article{Chen2003PreparatoryWO,
  title={Preparatory Work on Automatic Extraction of Bilingual Multi-Word Units from Parallel Corpora},
  author={Boxing Chen and Limin Du},
  journal={IJCLCLP},
  year={2003},
  volume={8}
}
Automatic extraction of bilingual multi-word units is an important research topic in automatic bilingual corpus alignment. A single source word often corresponds to a target multi-word unit. This paper presents an algorithm for automatically aligning single source words with target multi-word units in a sentence-aligned parallel spoken-language corpus. The output can also be used to extract bilingual multi-word units. The problem with…

Extracted Numerical Results

  • We randomly sampled 1000 sentences for checking. When “un-listed words that are divided” were not counted as errors, the precision rate was 98.88%; when they were counted as errors, it was 88.74%.
