Corpus ID: 216642164

Morphological Disambiguation of South Sámi with FSTs and Neural Networks

@inproceedings{Hmlinen2020MorphologicalDO,
  title={Morphological Disambiguation of South S{\'a}mi with FSTs and Neural Networks},
  author={M. H{\"a}m{\"a}l{\"a}inen and Linda Wiechetek},
  booktitle={SLTU/CCURL@LREC},
  year={2020}
}
  • M. Hämäläinen, Linda Wiechetek
  • Published in SLTU/CCURL@LREC 2020
  • Computer Science
  • We present a method for conducting morphological disambiguation for South S\'ami, which is an endangered language. Our method uses an FST-based morphological analyzer to produce an ambiguous set of morphological readings for each word in a sentence. These readings are disambiguated with a Bi-RNN model trained on the related North S\'ami UD Treebank and some synthetically generated South S\'ami data. The disambiguation is done on the level of morphological tags ignoring word forms and lemmas… CONTINUE READING

    Tables and Topics from this paper.

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 27 REFERENCES
    Morphosyntactic Disambiguation in an Endangered Language Setting
    2
    Annotation schemes in North Sámi dependency parsing
    13
    Cross-Lingual Morphological Tagging for Low-Resource Languages
    23
    UralicNLP: An NLP Library for Uralic Languages
    14
    Constraint Grammar As A Framework For Parsing Running Text
    226
    Bayesian Modeling of Lexical Resources for Low-Resource Settings
    4