From Subtitles to Parallel Corpora

Mark Fishel, Yota Georgakopoulou, Sergio Penkale, Volha Petukhova, Matej Rojc, Martin Volk, and Andy Way
We describe the preparation of parallel corpora based on professional-quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.