Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2012 | 884 | 143-156

Article title

Porównanie modeli regresji logistycznej odpornych na problem całkowitego rozdzielenia

Authors

Title variants

EN
Logistic Regression with Completely Separtated Data: A Comparison of Two Methods

Languages of publication

PL

Abstracts

EN
Applying logistic regression to small-sized data sets very often leads to the problem of complete separation. Generally speaking, separation is caused by a linear combination of covariates that perfectly separates successes (events) from failures (non-events). In such cases, results obtained by maximum likelihood method should not be trusted, since at least one parameter estimate diverges to infinity. A systematic review of the literature resulted in two theoretically sound procedures which always arrive at finite estimates, i.e. those of H. Heinze and S. Schemper (2002) and also R. Rousseeuw and C. Christmann (2003). The main goal of the paper is to compare them.

Keywords

Contributors

author
  • Uniwersytet Ekonomiczny w Krakowie, Katedra Statystyki, ul. Rakowicka 27, 31-510 Kraków, Poland

References

  • Albert A., Anderson J.A. [1984], On the Existence of Maximum Likelihood Estimates in Logistic Regression Models, „Biometrika”, nr 71.
  • Copas J.B. [1988], Binary Regression Models for Contaminated Data, with Discussion, „Journal of the Royal Statistical Society”, B, nr 50.
  • Ekholm A., Palmgren J. [1982], A Model for Binary Response with Misclassification, Proceedings of the International Conference on Generalized Linear Models, R. Gilchrist, GLIM-82, Springer, Heidelberg.
  • Firth D. [1993], Bias Reduction of Maximum Likelihood Estimates, „Biometrika”, nr 80.
  • Flury B., Riedwyl H. [1988], Multivariate Statistics: A Practical Approach, Chapman & Hall, London.
  • Greene W.H. [2003], Econometric Analysis, Pearson Education Inc.
  • Heinze G. [2006], A Comparative Investigation of Methods for Logistic Regression with Separated or Nearly Separated Data, „Statistics in Medicine”, nr 25.
  • Heinze G., Schemper M. [2002], A Solution to the Problem of Separation in Logistic Regression, „Statistics in Medicine”, nr 21.
  • Heinze G. [1999], Technical Report 10: The Application of Firth’s Procedure to Cox and Logistic Regression, Department of Medical Computer Sciences, Section of Clinical Biometrics, Vienna University, Vienna.
  • Long J.S. [1997], Regression Models for Categorical and Limited Dependent Variables, Sage.
  • Mehta C.R., Patel N.R. [1995], Exact Logistic Regression: Theory and Examples, „Statistics in Medicine”, nr 14.
  • R Development Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing [2010], Vienna, Austria, http://www.R-project.org.
  • Rousseeuw P.J., Christmann A. [2003], Robustness against Separation and Outliers in Logistic Regression, „Computational Statistics & Data Analysis”, nr 43.
  • Stryhn H., Christensen J. [2003], Confidence Intervals by the Profile Likelihood Method, with Applications in Veterinary Epidemiology, ISVEE X, Chile.
  • Tutz G., Leitenstorfer F. [2006], Response Shrinkage Estimators in Binary Regression, „Computational Statistics & Data Analysis”, nr 50.

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.desklight-1808d104-0c07-47d5-9175-c36ce3201a3c
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.