Corpus ID: 21707939

Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German

@article{Honnet2018MachineTO,
  title={Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German},
  author={Pierre-Edouard Honnet and A. Popescu-Belis and C. Musat and Michael Baeriswyl},
  journal={ArXiv},
  year={2018},
  volume={abs/1710.11035}
}
The goal of this work is to design a machine translation (MT) system for a low-resource family of dialects, collectively known as Swiss German, which are widely spoken in Switzerland but seldom written. We collected a significant number of parallel written resources to start with, up to a total of about 60k words. Moreover, we identified several other promising data sources for Swiss German. Then, we designed and compared three strategies for normalizing Swiss German input in order to address…
13 Citations
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German
Neural text normalization with adapted decoding and POS features
Encoder-Decoder Methods for Text Normalization
Unsupervised dialectal neural machine translation
Spoken Dialect Identification in Twitter using a Multi-filter Architecture

References

Machine translation into multiple dialects: The example of Swiss German
ArchiMob - A Corpus of Spoken Swiss German
METIS-II: low resource machine translation
WERD: Using social text spelling variants for evaluating dialectal speech recognition
Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch
Combining Bilingual and Comparable Corpora for Low Resource Machine Translation
Machine Translation of Arabic Dialects