Though much research has been conducted on Subjectivity and Sentiment Analysis (SSA) during the last decade, little work has focused on Arabic. In this work, we focus on SSA for both Modern Standard Arabic (MSA) news articles and dialectal Arabic microblogs from Twitter. We showcase some of the challenges associated with SSA on microblogs. We adopted a ...
Modern Standard Arabic (MSA) is the lingua franca of the Arab world. Arabic dialects are generally used in daily interactions and in social media. Dialects differ from MSA and from each other; the differences are lexical, morphological, phonological, and syntactic. Previous work: claims that word unigram models are sufficient and effective for the dialect ...
This paper presents a machine learning approach based on an SVM classifier coupled with preprocessing rules for cross-document named entity normalization. The classifier uses lexical, orthographic, phonetic, and morphological features. The process involves disambiguating different entities with shared name mentions and normalizing identical entities with ...
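To make the four feature families in the abstract above more concrete, here is a minimal Python sketch of pairwise features for two name mentions. Everything in it is an illustrative assumption rather than the paper's method: the function names (`strip_article`, `consonant_key`, `pair_features`), the use of transliterated Latin-script names, the vowel-dropping phonetic key, and the article-stripping stand-in for morphological analysis are all hypothetical choices. The sketch stops at feature extraction and does not train a classifier.

```python
from difflib import SequenceMatcher


def strip_article(token: str) -> str:
    """Toy morphological step (assumption): drop a transliterated definite article."""
    lowered = token.lower()
    for prefix in ("al-", "el-"):
        if lowered.startswith(prefix) and len(lowered) > len(prefix):
            return lowered[len(prefix):]
    return lowered


def consonant_key(name: str) -> str:
    """Crude phonetic key (assumption): keep letters, drop vowels,
    and skip a consonant equal to the last one kept."""
    key = []
    for ch in name.lower():
        if ch.isalpha() and ch not in "aeiou":
            if not key or key[-1] != ch:
                key.append(ch)
    return "".join(key)


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pair_features(mention_a: str, mention_b: str) -> dict:
    """Return one feature per family for a pair of name mentions."""
    raw_a = {t.lower() for t in mention_a.split()}
    raw_b = {t.lower() for t in mention_b.split()}
    stem_a = {strip_article(t) for t in mention_a.split()}
    stem_b = {strip_article(t) for t in mention_b.split()}
    return {
        # Lexical: exact token overlap.
        "lexical_jaccard": jaccard(raw_a, raw_b),
        # Orthographic: character-level similarity of the full strings.
        "orthographic_ratio": SequenceMatcher(
            None, mention_a.lower(), mention_b.lower()
        ).ratio(),
        # Phonetic: do the crude consonant keys agree?
        "phonetic_match": float(consonant_key(mention_a) == consonant_key(mention_b)),
        # Morphological: token overlap after stripping the article.
        "morph_jaccard": jaccard(stem_a, stem_b),
    }


if __name__ == "__main__":
    print(pair_features("Mohammed Sayed", "Mohamed Sayed"))
    print(pair_features("Al-Sayed", "Sayed"))
```

Feature dictionaries like these could be converted to numeric vectors and passed to an SVM implementation of one's choice; which features and preprocessing rules the paper actually uses is not recoverable from the truncated abstract.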