Homograph Disambiguation through Selective Diacritic Restoration

@inproceedings{Alqahtani2019HomographDT,
  title={Homograph Disambiguation through Selective Diacritic Restoration},
  author={Sawsan Alqahtani and Hanan Aldarmaki and Mona T. Diab},
  booktitle={WANLP@ACL 2019},
  year={2019}
}
  • Sawsan Alqahtani, Hanan Aldarmaki, Mona T. Diab
  • Published in WANLP@ACL 2019
  • Computer Science
  • Lexical ambiguity, a challenging phenomenon in all natural languages, is particularly prevalent for languages with diacritics that tend to be omitted in writing, such as Arabic. Omitting diacritics leads to an increase in the number of homographs: different words with the same spelling. Diacritic restoration could theoretically help disambiguate these words, but in practice, the increase in overall sparsity leads to performance degradation in NLP applications. In this paper, we propose…

    References

    Publications referenced by this paper.

    Enriching Word Vectors with Subword Information

    Investigating the impact of various partial diacritization schemes on Arabic-English statistical machine translation

    • Sawsan Alqahtani, Mahmoud Ghoneim, Mona Diab
    • International Association for Machine Translation in the Americas
    • 2016
    Arabic diacritization in the context of statistical machine translation

    • Mona Diab, Mahmoud Ghoneim, Nizar Habash
    • Proceedings of MTSummit
    • 2007
    ARLEX: A large scale comprehensive lexical inventory for Modern Standard Arabic

    • Sawsan Alqahtani, Mona Diab, Wajdi Zaghouani
    • OSACT 3: The 3rd Workshop on Open-Source Arabic Corpora and Processing Tools
    • 2018
    The hitchhiker's guide to testing statistical significance in natural language processing

    • Rotem Dror, Gili Baumer, Segev Shlomov, Roi Reichart
    • Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, volume 1,
    • 2018