A Comparison of Different Estimation Methods to Handle Missing Data in Explanatory Variables

Authors

  • Manal Jabbar Salman, Department of Affairs of Higher Studies, University of Baghdad, Baghdad, Iraq

DOI:

https://doi.org/10.24996/ijs.2020.61.12.20

Keywords:

Missing data, Simulation, Recurrent Neural Networks, Expectation-Maximization, Multicycle Expectation-Conditional-Maximization, Expectation-Conditional-Maximization-Either

Abstract

Missing data is a common problem in regression models. It is usually handled by the deletion mechanisms available in statistical software, which weaken statistical inference because deleting cases reduces the sample size. In this paper, the Expectation-Maximization (EM) algorithm, the Multicycle Expectation-Conditional-Maximization (MC-ECM) algorithm, the Expectation-Conditional-Maximization-Either (ECME) algorithm, and Recurrent Neural Networks (RNN) are used to estimate multiple regression models when the explanatory variables have missing values. Experimental datasets were generated in the Visual Basic programming language, with values of the explanatory variables missing at random in a general pattern at rates of 10%, 20%, and 30%, error variances of 0.5, 1.5, and 2, and sample sizes of 25, 50, 100, and 500. The methods were evaluated using the Mean Squared Error (MSE). Simulation results show that RNN outperforms the other methods, followed by EM at small sample sizes.
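The simulation design described in the abstract can be illustrated with a minimal Python sketch. It is not the author's Visual Basic implementation: the coefficient values, the intercept, the logistic missingness rule that depends only on the observed response, and the choice to measure MSE on the coefficient estimates are all assumptions made for illustration. Only listwise deletion and a multivariate-normal EM fit are shown; MC-ECM, ECME, and RNN are omitted.

```python
import numpy as np

def simulate(n, miss_rate, sigma2, beta, rng):
    """Draw (X, y) from a linear model, then blank out entries of X."""
    p = len(beta)
    X = rng.normal(size=(n, p))
    y = 1.0 + X @ beta + rng.normal(scale=np.sqrt(sigma2), size=n)  # intercept 1.0 is assumed
    # Assumed MAR rule: missingness probability depends only on the observed y.
    z = (y - y.mean()) / y.std()
    prob = 2.0 * miss_rate / (1.0 + np.exp(-z))  # averages roughly miss_rate
    X_miss = X.copy()
    X_miss[rng.random((n, p)) < prob[:, None]] = np.nan
    return X_miss, y

def em_mvn(Z, n_iter=100, tol=1e-6):
    """EM estimates of the mean and covariance of a multivariate normal with NaNs."""
    n, d = Z.shape
    mu = np.nanmean(Z, axis=0)
    sigma = np.diag(np.nanvar(Z, axis=0))
    for _ in range(n_iter):
        sum_z, sum_zz = np.zeros(d), np.zeros((d, d))
        for row in Z:
            m = np.isnan(row)
            o = ~m
            z_hat, cov_add = row.copy(), np.zeros((d, d))
            if m.any():
                # E-step: conditional mean and covariance of missing given observed.
                reg = sigma[np.ix_(m, o)] @ np.linalg.inv(sigma[np.ix_(o, o)])
                z_hat[m] = mu[m] + reg @ (row[o] - mu[o])
                cov_add[np.ix_(m, m)] = sigma[np.ix_(m, m)] - reg @ sigma[np.ix_(o, m)]
            sum_z += z_hat
            sum_zz += np.outer(z_hat, z_hat) + cov_add
        # M-step: update the moments from the completed sufficient statistics.
        mu_new = sum_z / n
        sigma_new = sum_zz / n - np.outer(mu_new, mu_new)
        done = max(np.abs(mu_new - mu).max(), np.abs(sigma_new - sigma).max()) < tol
        mu, sigma = mu_new, sigma_new
        if done:
            break
    return mu, sigma

def coef_from_moments(mu, sigma):
    """Regression of the last variable on the others, from joint moments."""
    p = len(mu) - 1
    slope = np.linalg.solve(sigma[:p, :p], sigma[:p, p])
    return np.concatenate([[mu[p] - slope @ mu[:p]], slope])

def coef_listwise(X, y):
    """Ordinary least squares on the rows with no missing explanatory values."""
    keep = ~np.isnan(X).any(axis=1)
    A = np.column_stack([np.ones(keep.sum()), X[keep]])
    return np.linalg.lstsq(A, y[keep], rcond=None)[0]

def run(n=100, miss_rate=0.2, sigma2=1.0, reps=20, seed=0):
    """Average squared coefficient error of each method over repeated samples."""
    rng = np.random.default_rng(seed)
    beta = np.array([2.0, -1.0, 0.5])  # hypothetical true slopes
    truth = np.concatenate([[1.0], beta])
    err = {"listwise": [], "em": []}
    for _ in range(reps):
        X, y = simulate(n, miss_rate, sigma2, beta, rng)
        err["listwise"].append(np.mean((coef_listwise(X, y) - truth) ** 2))
        mu, sigma = em_mvn(np.column_stack([X, y]))
        err["em"].append(np.mean((coef_from_moments(mu, sigma) - truth) ** 2))
    return {k: float(np.mean(v)) for k, v in err.items()}

if __name__ == "__main__":
    print(run())
```

Calling `run` with the sample sizes, missing rates, and error variances listed in the abstract would reproduce the shape of the experimental grid, though not necessarily the paper's numbers.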

Published

2020-12-30

Issue

Vol. 61 No. 12 (2020)

Section

Mathematics

How to Cite

A Comparison of Different Estimation Methods to Handle Missing Data in Explanatory Variables. (2020). Iraqi Journal of Science, 61(12), 3327-3336. https://doi.org/10.24996/ijs.2020.61.12.20
