Subjectivity and Sentiment Analysis of Modern Standard Arabic

Muhammad Abdul-Mageed (1), Mona Diab (2), Mohammed Korayem (3)
(1) Department of Linguistics and School of Library & Information Science, Indiana University, Bloomington, USA
(2) Center for Computational Learning Systems, Columbia University, NYC, USA
(3) School of Informatics and Computing, Indiana University, Bloomington, USA


Abstract

Although Subjectivity and Sentiment Analysis (SSA) has seen a flurry of novel research, there have been few attempts to build SSA systems for Morphologically-Rich Languages (MRL). In the current study, we report efforts to partially fill this gap. We present a newly developed, manually annotated corpus of Modern Standard Arabic (MSA) together with a new polarity lexicon. The corpus is a collection of news documents annotated at the sentence level. We also describe an automatic SSA tagging system that exploits the annotated data. We investigate the impact of different levels of preprocessing on the SSA classification task. We show that by explicitly accounting for the rich morphology, the system achieves significantly higher performance.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-2103.pdf