Conference

Arabic dialect identification with deep learning and hybrid frequency based features

Abstract

Studies on Dialectal Arabic are growing more important by the day as it becomes the primary written and spoken form of Arabic online in informal settings. Among the important problems to explore is dialect identification. This paper describes several techniques that can be applied toward this goal and reports their performance on the Multi Arabic Dialect Applications and Resources (MADAR) Arabic Dialect Corpora. Our results show that improving on traditional systems that use frequency-based features and non-deep-learning classifiers is a challenging task. We propose several models based on different word and document representations. Our top model achieves a macro-averaged F1 score of 65.66 on MADAR's small-scale parallel corpus of 25 dialects and Modern Standard Arabic (MSA).

Authors

Fares Y; El-Zanaty Z; Abdel-Salam K; Ezzeldin M; Mohamed A; El-Awaad K; Torki M

Pagination

pp. 224-228

Publication Date

January 1, 2019

Conference proceedings

ACL 2019: 4th Arabic Natural Language Processing Workshop (WANLP 2019), Proceedings of the Workshop
