AIDA2: A Hybrid Approach for Token and Sentence Level Dialect Identification in Arabic

@inproceedings{AlBadrashiny2015AIDA2AH,
  title={AIDA2: A Hybrid Approach for Token and Sentence Level Dialect Identification in Arabic},
  author={Mohamed Al-Badrashiny and Heba Elfardy and Mona T. Diab},
  booktitle={CoNLL},
  year={2015}
}
In this paper, we present a hybrid approach for performing token and sentence levels Dialect Identification in Arabic. Specifically we try to identify whether each token in a given sentence belongs to Modern Standard Arabic (MSA), Egyptian Dialectal Arabic (EDA) or some other class and whether the whole sentence is mostly EDA or MSA. The token level component relies on a Conditional Random Field (CRF) classifier that uses decisions from several underlying components such as language models, a… CONTINUE READING
Highly Cited
This paper has 20 citations. REVIEW CITATIONS
13 Citations
22 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 13 extracted citations

References

Publications referenced by this paper.
Showing 1-10 of 22 references

Ldc standard arabic morphological analyzer (sama) version

  • Mohamed Maamouri, Dave Graff, Basma Bouziri, Sondos Krouna, Ann Bies, Seth Kulick
  • 2010
Highly Influential
5 Excerpts

Similar Papers

Loading similar papers…