You are in:Home/Publications/Modeling of Arabic Language for Authorship Identification

Dr. Ahmed Taha Abd El-Fatah Taha Abd Allah :: Publications:

Modeling of Arabic Language for Authorship Identification
Authors: Heba M. Khalil, Ahmed Taha, Tarek El-Shistawy
Year: 2021
Keywords: Not Available
Journal: the International Journal of Scientific & Technology Research
Volume: 10
Issue: 5
Pages: 157–162
Publisher: Not Available
Local/International: International
Paper Link:
Full paper Not Available
Supplementary materials Not Available

With the vast volume of data processed in digital form today, the need for and capability of analysing and processing this data for forensic authorship authentication has increased. The focus of study has concentrated on English, Spanish, and German. Arabic language has received less attention from the academic community due to the difficulty and length of Arabic sentences. This article provides a set of stylometric features derived from the study of many articles' parts of expression, including adjectives ratio, sentence size, conjunctions, and others. This details is classified into two categories: statistical features and linguistic features. The AdaBoost and Bagging ensemble approaches have been proposed in this research to maximise predictive efficiency in Arabic articles by using multiple learning. The results indicate that the Bagging model achieves average accuracy of 91.5 %, while the AdaBoost model achieves the highest accuracy of 93.6 %.

Google ScholarAcdemia.eduResearch GateLinkedinFacebookTwitterGoogle PlusYoutubeWordpressInstagramMendeleyZoteroEvernoteORCIDScopus