
Dr. Mohamed Sewalim El-sayed Hamed :: Publications:

Title:
The effect of the missing rate and its mechanism on the performance of the imputation methods on different real data sets.
Authors: Saber, M. M., Sara Javadi, Mehrdad Taghipour, Mohamed S. Hamed, Abdussalam Aljadani, Mahmoud M. Mansour, & Haitham M. Yousof
Year: 2026
Keywords: Imputation Methods, Missing Data, Multiple Imputation, Multiple Imputation by Chained Equations, Incomplete Data, K-Nearest Neighbor imputation, Random Forest, Single Imputation.
Journal: Statistics, Optimization & Information Computing
Volume: 14
Issue: 5
Pages: 2131-2141
Publisher: International Academic Press
Local/International: International
Paper Link:
Full paper Mohamed Sewalim El-sayed Hamed_The Effect of the Missing Rate and Its Mechanism on the Performance of the.pdf
Supplementary materials Not Available
Abstract:

The purpose of this paper is to explore the mechanisms of data missingness and evaluate various imputation techniques used to handle missing data. Missing data is a common issue in data analysis, and its treatment is crucial for accurate modeling and analysis. This paper assesses prevalent imputation methods, including mean imputation, median imputation, K-Nearest Neighbor imputation (KNN), Classification and Regression Trees (CART), and Random Forest (RF). These techniques were chosen for their widespread use and varying levels of complexity and accuracy. Simple methods like mean and median imputation are computationally efficient but may introduce bias, especially when the missingness is not random. In contrast, more advanced methods like KNN, CART, and RF offer better handling of complex missingness patterns by considering relationships among variables. This paper aims to provide guidance for data scientists and analysts in selecting the most appropriate imputation methods based on their data characteristics and analysis objectives. By understanding the strengths and weaknesses of each technique, practitioners can improve the quality and reliability of their analyses.
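
As an illustration of the contrast drawn in the abstract between simple and relationship-aware imputation, the following minimal Python sketch fills one missing value by mean, by median, and by a basic KNN rule. It is not the authors' implementation: the function names `impute_constant` and `knn_impute`, the toy data, the choice of k = 2, and the distance (root mean squared difference over the columns both rows observe) are all assumptions made for this example.

```python
import math
import statistics


def impute_constant(column, stat=statistics.mean):
    """Replace None entries with one statistic of the observed values."""
    observed = [v for v in column if v is not None]
    fill = stat(observed)
    return [fill if v is None else v for v in column]


def knn_impute(rows, k=2):
    """Replace None entries with the mean of the k nearest donor rows.

    Assumption for this sketch: distance is the root mean squared
    difference over the columns that both rows observe.
    """
    def distance(a, b):
        shared = [(x, y) for x, y in zip(a, b)
                  if x is not None and y is not None]
        if not shared:
            return math.inf  # no common columns, so treat as farthest
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    completed = []
    for i, row in enumerate(rows):
        new_row = list(row)
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Donors are other rows that actually observe column j.
            donors = sorted(
                (distance(row, other), other[j])
                for m, other in enumerate(rows)
                if m != i and other[j] is not None
            )
            new_row[j] = statistics.mean(v for _, v in donors[:k])
        completed.append(new_row)
    return completed


# Hypothetical toy data: the second column is ten times the first,
# and the value in the third row is missing.
rows = [[1.0, 10.0], [2.0, 20.0], [3.0, None],
        [4.0, 40.0], [10.0, 100.0], [11.0, 110.0]]
second_column = [r[1] for r in rows]

mean_fill = impute_constant(second_column, statistics.mean)[2]
median_fill = impute_constant(second_column, statistics.median)[2]
knn_fill = knn_impute(rows, k=2)[2][1]
print(mean_fill, median_fill, knn_fill)
```

On this toy data the mean and median fills ignore the first column entirely, while the KNN fill borrows from the two rows whose first-column values are closest to the incomplete row. A real comparison of the kind the paper reports would also require a chosen missingness mechanism, a range of missing rates, and an error metric, none of which this sketch attempts.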
