You are in:Home/Publications/Development of Data Mining Models Based on Features Ranks Voting (FRV)

Dr. Prof. Mofreh Ahmed El Sayed Hegou :: Publications:

Title:
Development of Data Mining Models Based on Features Ranks Voting (FRV)
Authors: Mofreh A. Hogo
Year: 2022
Keywords: Evaluator; features selection; data mining; forward; backward; voting; feature rank
Journal: computers, materials and Continua
Volume: Vol.73, No.2
Issue: doi:10.32604/cmc.2022.027300
Pages: pp.2947-2966
Publisher: Tech Science Press
Local/International: International
Paper Link:
Full paper Not Available
Supplementary materials Not Available
Abstract:

Data size plays a significant role in the design and the performance of data mining models. A good feature selection algorithm reduces the problems of big data size and noise due to data redundancy. Features selection algorithms aim at selecting the best features and eliminating unnecessary ones, which in turn simplifies the structure of the data mining model as well as increases its performance. This paper introduces a robust features selection algorithm, named Features Ranking Voting Algorithm FRV. It merges the benefits of the different features selection algorithms to specify the features ranks in the dataset correctly and robustly; based on the feature ranks and voting algorithm. The FRV comprises of three different proposed techniques to select the minimum best feature set, the forward voting technique to select the best high ranks features, the backward voting technique, which drops the low ranks features (low importance feature), and the third technique merges the outputs from the forward and backward techniques to maximize the robustness of the selected features set. Different data mining models were built using obtained selected features sets from applying the proposed FVR on different datasets; to evaluate the success behavior of the proposed FRV. The high performance of these data mining models reflects the success of the proposed FRV algorithm. The FRV performance is compared with other features selection algorithms. It successes to develop data mining models for the Hungarian CAD dataset with Acc. of 96.8%, and with Acc. of 96% for the Z-Alizadeh Sani CAD dataset compared with 83.94% and 92.56% respectively in [48].

Google ScholarAcdemia.eduResearch GateLinkedinFacebookTwitterGoogle PlusYoutubeWordpressInstagramMendeleyZoteroEvernoteORCIDScopus