Articles Information
International Journal of Mathematics and Computational Science, Vol.4, No.3, Sep. 2018, Pub. Date: Jun. 14, 2018
On Estimation Methods for Binary Logistic Regression Model with Missing Values
Pages: 79-85 Views: 2087 Downloads: 1478
Authors
[01]
Mohamed Reda Abonazel, Department of Applied Statistics and Econometrics, Institute of Statistical Studies and Research, Cairo University, Cairo, Egypt.
[02]
Mohamed Gamal Ibrahim, Department of Applied Statistics and Econometrics, Institute of Statistical Studies and Research, Cairo University, Cairo, Egypt.
Abstract
This paper reviews some estimation methods for the binary logistic regression model with missing data in dependent and/or independent variables. Moreover, we present an empirical study for assessing the performance of these estimation methods under the existence of missing data. The results indicated that the regression imputation method is a very appropriate method for estimating the missing values in this model.
Keywords
EM Algorithm, Incomplete Data, Maximum Likelihood Estimation, Regression Imputation
References
[01]
Allison, P. D. (2002). Missing data: Quantitative applications in the social sciences. British Journal of Mathematical and Statistical Psychology, 55 (1), 193-196.
[02]
Byrne, B. M. (2013). Structural equation modeling with AMOS: Basic concepts, applications, and programming. Routledge.
[03]
Burns, R. P., & Burns, R. (2008). Business research methods and statistics using SPSS. Sage.
[04]
Consentino, F., & Claeskens, G. (2011). Missing covariates in logistic regression, estimation and distribution selection. Statistical Modelling, 11 (2), 159-183.
[05]
El-Sheikh, A. A., Abonazel, M. R., & Gamil, N. (2017). A Review of Software Packages for Structural Equation Modeling: A Comparative Study. Applied Mathematics and Physics, 5 (3), 85-94.
[06]
El-Sheikh, A. A., Abonazel, M. R., & Gamil, Noha (2017). A review of estimation methods for structural equation modeling. Working paper. Institute of Statistical Studies and Research. Cairo University, Egypt.
[07]
FitzGerald, P. E., & Knuiman, M. W. (1998). Theory and Methods: Estimation in Regressive Logistic Regression Analyses of Familial Data with Missing Outcomes. Australian & New Zealand Journal of Statistics, 40 (3), 305-316.
[08]
Fuchs, C. (1982). Maximum likelihood estimation and model selection in contingency tables with missing data. Journal of the American Statistical Association, 77 (378), 270-278.
[09]
Houchens, R. (2015). Missing Data Methods for the NIS and the SID. HCUP Methods Series Report # 2015-01. Agency for Healthcare Research and Quality [accessed on June 22, 2015]. Available: https://www.hcup-us.ahrq.gov/reports/methods/2015_01.pdf
[10]
Hsieh, F. Y., Bloch, D. A., & Larsen, M. D. (1998). A simple method of sample size calculation for linear and logistic regression. Statistics in medicine, 17 (14), 1623-1634.
[11]
Ibrahim, J. G. (1990). Incomplete data in generalized linear models. Journal of the American Statistical Association, 85 (411), 765-769.
[12]
Little, R. J. (1988). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83 (404), 1198-1202.
[13]
Little, R. J. (1992). Regression with missing X’s: a review. Journal of the American Statistical Association, 87, 1227-1237.
[14]
Little, R. J., & Rubin, D. B. (2002). Statistical Analysis with Missing Data. John Wiley & Sons.
[15]
Little, R. J., & Schluchter, M. D. (1985). Maximum likelihood estimation for mixed continuous and categorical data with missing values. Biometrika, 72 (3), 497-512.
[16]
McLachlan, G. J., & Krishnan, T. (1997). Wiley series in probability and statistics. The EM Algorithm and Extensions, Second Edition, 361-369.
[17]
Mood, C. (2010). Logistic regression: Why we cannot do what we think we can do, and what we can do about it. European sociological review, 26 (1), 67-82.
[18]
Nargundkar, S. (2015). Chapter 12: Logistic Regression for Classification and Prediction. https://www.coursehero.com/file/7073839/Logistic-Regression.
[19]
Peugh, J. L., & Enders, C. K. (2004). Missing data in educational research: A review of reporting practices and suggestions for improvement. Review of Educational Research, 74, 525-556.
[20]
Sabbe, N., Thas, O., & Ottoy, J. P. (2013). EMLasso: logistic lasso with missing data. Statistics in medicine, 32 (18), 3143-3157.
[21]
Schluchter, M. D., & Jackson, K. L. (1989). Log-linear analysis of censored survival data with partially observed covariates. Journal of the American Statistical Association, 84 (405), 42-52.
[22]
Stephenson, B., Cook, D., Dixon, P., Duckworth, W., Kaiser, M., Koehler, K., & Meeker, W. (2008). Binary response and logistic regression analysis. Available: http://www.stat.wisc.edu/~mchung/teaching/MIA/reading/GLM.logistic.Rpackage.pdf
[23]
Maity, A. K., Pradhan, V., & Das, U. (2017). Bias Reduction in Logistic Regression with Missing Responses when the Missing Data Mechanism is Nonignorable. The American Statistician, (just-accepted).
[24]
Meeyai, S. (2016). Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values. Journal of Traffic and Logistics Engineering, 4 (2).
[25]
Peng, C. Y. J., & Zhu, J. (2008). Comparison of two approaches for handling missing covariates in logistic regression. Educational and Psychological Measurement, 68 (1), 58-77.
[26]
Abonazel, M. R. (2018). A practical guide for creating Monte Carlo simulation studies using R. International Journal of Mathematics and Computational Science, 4 (1), 18-33.