A New Algorithm to Impute the Missing Values in the Multivariate Case

Authors

1 Department of Statistics, Faculty of Science, Razi University, Kermanshah, Iran.

2 Department of Statistics, Faculty of Science, University of Qom, Qom, Iran.

3 Razi University, Kermanshah, Iran.

10.52547/jirss.19.2.133

Abstract

There are several methods to make inferences about the parameters of the sampling distribution when we encounter the missing values and the censored data. In this paper, through the order statistics and the projection theorem, a novel algorithm is proposed to impute the missing values in the multivariate case. Then, the performance of this method is investigated through the simulation studies. In an attempt to validate the proposed method and compare it with some other methods a real data is used.

Keywords

  1. Almasi, I., Mohammadpour, A., and Mohammadi, M. (2017), Best linear unbiased inter-polation oforder statistics. Communicationsin Statistics-Simulationand Computation,46(5), 4161–4171. Allison, P. D. (2001). Missing data (Vol. 136). Sage publications. Asgharzadeh, A., Ahmadi, J., Mirzazadeh Ganji, Z., and Valiollahi, R. (2012), Reconstruction of the past failure times for the proportional reversed hazard rate model. Journal of Statistical Computation and Simulation, 82(3), 475–489. Balakrishnan, N., Cohen, A. C. (2014), Order statistics and inference: estimation methods. Elsevier. Brockwell, P. J., Davis, R. A. (1991), Time series: theory and methods. second ed., Springer Series in Statistics, Springer-Verlag, New York. Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977), Maximum likelihood from in-complete data via the em algorithm. Journal of the Royal Statistical society. Series B (methodological), 1–38. Fleiss, J. L., Levin, B., and Paik, M. C. (2013)], Statistical methods for rates and proportions. John Wiley and Sons. Gold, M. S., Bentler, P. M. (2000), Treatments of missing data: A monte carlo comparison of rbhdi, iterative stochastic regression imputation, and expectation-maximization. Structural Equation Modeling, 7(3), 319–355. Greenland, S., Finkle, W. D. (1995), A critical look at methods for handling missing covariates in epidemiologic regression analyses. American journal of epidemiology, 142(12), 1255–1264. Haitovsky, Y. (1968), Missing data in regression analysis. Journal of the Royal Statistical Society. Series B (Methodological), 67–82. Khatib, B., Ahmadi, J., and Razmkhah, M. (2013), Bayesian reconstruction of the missing failure times in exponential distribution. Journal of Statistical Computation and Simulation, 83(3), 501–517. Kim, J. K., and Yu, C. L. (2011), A semiparametric estimation of mean functionals with nonignorable missing data. Journal of the American Statistical Association, 106(493), 157–165. Klimczak, M., Rychlik, T. (2005), Reconstruction of previous failure times and records. Metrika, 61(3), 277–290. Little, R. J., Rubin, D. B. (2002), Single imputation methods. Statistical analysis with missing data. 59–74. McLachlan, G., Krishnan, T.(2007),Theemalgorithmandextensions(Vol.382).JohnWiley and Sons. Musil, C. M., Warner, C. B., Yobas, P. K., and Jones, S. L. (2002), A comparison of imputation techniques for handling missing data. Western Journal of Nursing Research,24(7), 815–829. Razmkhah, M., Khatib, B., and Ahmadi, J. (2010), Reconstruction of order statistics inexponential distribution. Journal of the Iranian Statistical Society. 9:21-40. Rubin, D. B. (1976), Inference and missing data. Biometrika, 63(3), 581–592. Snedecor, G., Cochran, W. G. (1967), Statistical Methods, 6th, ed. Ames: Iowa State University Press. Yuan, Y., Yin, G. (2010), Bayesian quantile regression for longitudinal studies with nonignor-able missing data. Biometrics, 66(1), 105–114.
  2. Allison, P. D. (2001), Missing data (Vol. 136). Sage publications. [DOI:10.4135/9781412985079]
  3. Allison, P. D. (2001), Missing data (Vol. 136). Sage publications. [DOI:10.4135/9781412985079]
  4. Almasi, I., Mohammadpour, A. and Mohammadi, M. (2017), Best linear unbiased interpolation of order statistics. Communications in Statistics-Simulation and Computation, 46(5), 4161-4171.
  5. Almasi, I., Mohammadpour, A. and Mohammadi, M. (2017), Best linear unbiased interpolation of order statistics. Communications in Statistics-Simulation and Computation, 46(5), 4161-4171.
  6. Asgharzadeh, A., Ahmadi, J., Mirzazadeh Ganji, Z. and Valiollahi, R. (2012), Reconstruction of the past failure times for the proportional reversed hazard rate model. Journal of Statistical Computation and Simulation, 82(3), 475-489. [DOI:10.1080/00949655.2010.542550]
  7. Asgharzadeh, A., Ahmadi, J., Mirzazadeh Ganji, Z. and Valiollahi, R. (2012), Reconstruction of the past failure times for the proportional reversed hazard rate model. Journal of Statistical Computation and Simulation, 82(3), 475-489. [DOI:10.1080/00949655.2010.542550]
  8. Balakrishnan, N. and Cohen, A. C. (2014), Order statistics and inference: estimation methods. Elsevier.
  9. Balakrishnan, N. and Cohen, A. C. (2014), Order statistics and inference: estimation methods. Elsevier.
  10. Brockwell, P. J. and Davis, R. A. (1991), Time series: theory and methods. second ed., Springer Series in Statistics, Springer-Verlag, New York. [DOI:10.1007/978-1-4419-0320-4]
  11. Brockwell, P. J. and Davis, R. A. (1991), Time series: theory and methods. second ed., Springer Series in Statistics, Springer-Verlag, New York. [DOI:10.1007/978-1-4419-0320-4]
  12. Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977), Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical {society. Series B (methodological), 1-38. [DOI:10.1111/j.2517-6161.1977.tb01600.x]
  13. Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977), Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical {society. Series B (methodological), 1-38. [DOI:10.1111/j.2517-6161.1977.tb01600.x]
  14. Fleiss, J. L., Levin, B. and Paik, M. C. (2013)], Statistical methods for rates and proportions. John Wiley and Sons.
  15. Fleiss, J. L., Levin, B. and Paik, M. C. (2013)], Statistical methods for rates and proportions. John Wiley and Sons.
  16. Gold, M. S. and Bentler, P. M. (2000), Treatments of missing data: A {Monte Carlo comparison of rbhdi, iterative stochastic regression imputation, and expectation-maximization. Structural Equation Modeling, 7(3), 319-355. [DOI:10.1207/S15328007SEM0703_1]
  17. Gold, M. S. and Bentler, P. M. (2000), Treatments of missing data: A {Monte Carlo comparison of rbhdi, iterative stochastic regression imputation, and expectation-maximization. Structural Equation Modeling, 7(3), 319-355. [DOI:10.1207/S15328007SEM0703_1]
  18. Greenland, S. and Finkle, W. D. (1995), A critical look at methods for handling missing covariates in epidemiologic regression analyses. American {Journal of Epidemiology, 142(12), 1255-1264. [DOI:10.1093/oxfordjournals.aje.a117592]
  19. Greenland, S. and Finkle, W. D. (1995), A critical look at methods for handling missing covariates in epidemiologic regression analyses. American {Journal of Epidemiology, 142(12), 1255-1264. [DOI:10.1093/oxfordjournals.aje.a117592]
  20. Haitovsky, Y. (1968), Missing data in regression analysis. Journal of the Royal Statistical Society. Series B (Methodological), 67-82. [DOI:10.1111/j.2517-6161.1968.tb01507.x]
  21. Haitovsky, Y. (1968), Missing data in regression analysis. Journal of the Royal Statistical Society. Series B (Methodological), 67-82. [DOI:10.1111/j.2517-6161.1968.tb01507.x]
  22. Khatib, B., Ahmadi, J. and Razmkhah, M. (2013), Bayesian reconstruction of the missing failure times in exponential distribution. Journal of Statistical Computation and Simulation, 83(3), 501-517. [DOI:10.1080/00949655.2011.621123]
  23. Khatib, B., Ahmadi, J. and Razmkhah, M. (2013), Bayesian reconstruction of the missing failure times in exponential distribution. Journal of Statistical Computation and Simulation, 83(3), 501-517. [DOI:10.1080/00949655.2011.621123]
  24. Kim, J. K. and Yu, C. L. (2011), A semiparametric estimation of mean functionals with nonignorable missing data. Journal of the American Statistical Association, 106(493), 157-165. [DOI:10.1198/jasa.2011.tm10104]
  25. Kim, J. K. and Yu, C. L. (2011), A semiparametric estimation of mean functionals with nonignorable missing data. Journal of the American Statistical Association, 106(493), 157-165. [DOI:10.1198/jasa.2011.tm10104]
  26. Klimczak, M. and Rychlik, T. (2005), Reconstruction of previous failure times and records. Metrika, 61(3), 277-290. [DOI:10.1007/s001840400344]
  27. Klimczak, M. and Rychlik, T. (2005), Reconstruction of previous failure times and records. Metrika, 61(3), 277-290. [DOI:10.1007/s001840400344]
  28. Little, R. J. and Rubin, D. B. (2019), Statistical analysis with missing data (Vol. 793). John Wiley and Sons. [DOI:10.1002/9781119482260]
  29. Little, R. J. and Rubin, D. B. (2019), Statistical analysis with missing data (Vol. 793). John Wiley and Sons. [DOI:10.1002/9781119482260]
  30. McLachlan, G. and Krishnan, T. (2007), The EM algorithm and extensions (Vol. 382). John Wiley and Sons. [DOI:10.1002/9780470191613]
  31. McLachlan, G. and Krishnan, T. (2007), The EM algorithm and extensions (Vol. 382). John Wiley and Sons. [DOI:10.1002/9780470191613]
  32. Musil, C. M., Warner, C. B., Yobas, P. K. and Jones, S. L. (2002), A comparison of imputation techniques for handling missing data. Western Journal of Nursing Research, 24(7), 815-829. [DOI:10.1177/019394502762477004]
  33. Musil, C. M., Warner, C. B., Yobas, P. K. and Jones, S. L. (2002), A comparison of imputation techniques for handling missing data. Western Journal of Nursing Research, 24(7), 815-829. [DOI:10.1177/019394502762477004]
  34. Razmkhah, M., Khatib, B. and Ahmadi, J. (2010), Reconstruction of order statistics in exponential distribution. Journal of the Iranian Statistical Society, 9, 21-40.
  35. Razmkhah, M., Khatib, B. and Ahmadi, J. (2010), Reconstruction of order statistics in exponential distribution. Journal of the Iranian Statistical Society, 9, 21-40.
  36. Rubin, D. B. (1976), Inference and missing data. Biometrika, 63(3), 581-592. [DOI:10.1093/biomet/63.3.581]
  37. Rubin, D. B. (1976), Inference and missing data. Biometrika, 63(3), 581-592. [DOI:10.1093/biomet/63.3.581]
  38. Snedecor, G. and Cochran, W. G. (1967), Statistical Methods, 6th, ed. Ames: Iowa State University Press.
  39. Snedecor, G. and Cochran, W. G. (1967), Statistical Methods, 6th, ed. Ames: Iowa State University Press.
  40. Yuan, Y. and Yin, G. (2010), Bayesian quantile regression for longitudinal studies with nonignorable missing data. Biometrics, 66(1), 105-114. [DOI:10.1111/j.1541-0420.2009.01269.x]
  41. Yuan, Y. and Yin, G. (2010), Bayesian quantile regression for longitudinal studies with nonignorable missing data. Biometrics, 66(1), 105-114. [DOI:10.1111/j.1541-0420.2009.01269.x]
Volume 19, Issue 2
December 2020
Pages 133-143
  • Receive Date: 23 July 2022
  • Revise Date: 10 May 2024
  • Accept Date: 23 July 2022