Higher Order n-gram Language Models for Arabic Diacritics Restoration

Abstract

Dynamic programming based Arabic diacritics restoration aims to assign diacritics to Arabic words. The technique is purely statistical approach and depends only on an Arabic corpus annotated with diacritics. The possible word sequences with diacritics are assigned scores using statistical n-gram language modeling approach. Using the assigned scores, it is… (More)

Topics

4 Figures and Tables

Slides referencing similar topics