Corpus ID: 233004275

Low-Resource Language Modelling of South African Languages

@article{Mesham2021LowResourceLM,
  title={Low-Resource Language Modelling of South African Languages},
  author={Stuart Mesham and Luc Hayward and Jared Shapiro and Jan Buys},
  journal={ArXiv},
  year={2021},
  volume={abs/2104.00772}
}
Language models are the foundation of current neural network-based models for natural language understanding and generation. However, research on the intrinsic performance of language models on African languages has been extremely limited, which is made more challenging by the lack of large or standardised training and evaluation sets that exist for English and other high-resource languages. In this paper, we evaluate the performance of open-vocabulary language models on lowresource South… Expand

Figures and Tables from this paper

References

SHOWING 1-10 OF 34 REFERENCES
Developing Text Resources for Ten South African Languages
Neural Machine Translation of Rare Words with Subword Units
An Analysis of Neural Language Modeling at Multiple Scales
Spell Once, Summon Anywhere: A Two-Level Open-Vocabulary Language Model
Regularizing and Optimizing LSTM Language Models
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
...
1
2
3
4
...