ON THE OCCURRENCE OF THE RANDOM MISSING DATA MECHANISMS

  • Деян Лазаров
Keywords: missing data mechanisms, missing values, EM clustering, logistic regression, multivariate multilayer distributions

Abstract

This research presents the missing data mechanisms in their classic
definitions. In many situations encountered in field research, however, the data do not
follow a homogeneous multivariate distribution. I call such distributions multivariate
multilayer distributions (MMD). The main question that arises is: do the available
definitions of missing data mechanisms work well in these situations? Some
clarifications are necessary. I demonstrate the problems with the missing data
mechanisms using simulations of MMD, EM clustering, and logistic regression.
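
A simulation of the kind the abstract mentions might look like the minimal sketch below. It is not the author's code, and everything in it is an assumption made for illustration: the two-layer bivariate normal sample, the layer means, the missingness rates (0.3 overall; 0.1 and 0.5 per layer), and the helper names simulate_layers and fit_logistic_slope. The layer labels are taken as known rather than recovered by EM clustering, and the logistic regression is fitted with a few hand-written Newton steps instead of a statistics package. The sketch regresses the missingness indicator of the second variable on the fully observed first variable.

```python
import numpy as np

def simulate_layers(n_per_layer=5000, seed=0):
    """Draw a hypothetical two-layer sample; each layer is bivariate normal with its own mean."""
    rng = np.random.default_rng(seed)
    layer0 = rng.normal(loc=0.0, scale=1.0, size=(n_per_layer, 2))
    layer1 = rng.normal(loc=4.0, scale=1.0, size=(n_per_layer, 2))
    X = np.vstack([layer0, layer1])
    layer = np.repeat([0, 1], n_per_layer)  # assumed known here, not estimated by EM
    return X, layer, rng

def fit_logistic_slope(x, y, n_iter=25):
    """Fit logit P(y=1) = b0 + b1*x by Newton-Raphson and return the slope b1."""
    A = np.column_stack([np.ones_like(x), x])
    beta = np.zeros(2)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(A @ beta)))
        w = p * (1.0 - p)
        hessian = (A * w[:, None]).T @ A
        beta = beta + np.linalg.solve(hessian, A.T @ (y - p))
    return beta[1]

X, layer, rng = simulate_layers()
x_obs = X[:, 0]  # fully observed variable; X[:, 1] is the one that goes missing

# Mechanism 1: every value of X[:, 1] is missing with the same probability.
miss_mcar = (rng.random(layer.size) < 0.3).astype(float)

# Mechanism 2: the probability is constant inside a layer but differs between layers.
layer_rates = np.array([0.1, 0.5])
miss_layered = (rng.random(layer.size) < layer_rates[layer]).astype(float)

slope_mcar = fit_logistic_slope(x_obs, miss_mcar)
slope_pooled = fit_logistic_slope(x_obs, miss_layered)
slope_within = [
    fit_logistic_slope(x_obs[layer == k], miss_layered[layer == k]) for k in (0, 1)
]

print("uniform missingness, pooled slope:", slope_mcar)
print("layer-dependent missingness, pooled slope:", slope_pooled)
print("layer-dependent missingness, within-layer slopes:", slope_within)
```

Under these assumptions the second mechanism is completely random inside each layer, yet in the pooled sample the missingness indicator is clearly associated with the observed variable. This is one possible reading of why the classic definitions may need clarification when the layers are not taken into account.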

Published
2023-02-04
How to Cite
Лазаров, Д. (2023). ON THE OCCURRENCE OF THE RANDOM MISSING DATA MECHANISMS. Vanguard Scientific Instruments in Management, 6(6). Retrieved from https://vsim-journal.info/index.php?journal=vsim&page=article&op=view&path[]=482