Corpus ID: 21707939

Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German

@article{Honnet2018MachineTO,
  title={Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German},
  author={Pierre-Edouard Honnet and A. Popescu-Belis and C. Musat and Michael Baeriswyl},
  journal={ArXiv},
  year={2018},
  volume={abs/1710.11035}
}
The goal of this work is to design a machine translation (MT) system for a low-resource family of dialects, collectively known as Swiss German, which are widely spoken in Switzerland but seldom written. We collected a significant number of parallel written resources to start with, up to a total of about 60k words. Moreover, we identified several other promising data sources for Swiss German. Then, we designed and compared three strategies for normalizing Swiss German input in order to address… Expand
13 Citations
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German
  • PDF
Neural text normalization with adapted decoding and POS features
  • 1
  • PDF
Encoder-Decoder Methods for Text Normalization
  • 19
  • PDF
Unsupervised dialectal neural machine translation
  • 8
Statistical Machine Translation of Myanmar Dialects
  • PDF
A Survey of Orthographic Information in Machine Translation
  • 11
  • Highly Influenced
  • PDF
...
1
2
...

References

SHOWING 1-10 OF 34 REFERENCES
Machine translation into multiple dialects: The example of Swiss German
  • 5
ArchiMob - A Corpus of Spoken Swiss German
  • 45
  • Highly Influential
  • PDF
Normalising orthographic and dialectal variants for the automatic processing of Swiss German
  • 25
  • PDF
Automatic normalisation of the Swiss German ArchiMob corpus using character-level machine translation
  • 22
  • PDF
METIS-II: low resource machine translation
  • 20
  • PDF
WERD: Using social text spelling variants for evaluating dialectal speech recognition
  • 6
  • PDF
Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch
  • 8
  • Highly Influential
  • PDF
Combining Bilingual and Comparable Corpora for Low Resource Machine Translation
  • 63
  • PDF
Machine Translation of Arabic Dialects
  • 154
  • PDF
...
1
2
3
4
...