Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2014 | 15 | 2 | 403-413

Article title

CLASSIFICATION OF POLISH HOUSEHOLDS BASED ON THEIR INCOMES BY MEANS OF DECISION TREES

Content

Title variants

Languages of publication

EN

Abstracts

EN
Classification trees included in SQL Server 2008R2 Analysis Services package have been used to classify Polish households based on their incomes. The analysis has been performed by means of the three algorithms and their effectiveness has been measured. Using the best algorithm a groups of households with the lowest and the largest incomes have been distinguished. The most important attributes describing households with the lowest and the largest incomes were identified and discussed.

Year

Volume

15

Issue

2

Pages

403-413

Physical description

Dates

published
2014

Contributors

  • Faculty of Applied Informatics and Mathematics Warsaw University of Life Sciences – SGGW
  • Faculty of Applied Informatics and Mathematics Warsaw University of Life Sciences – SGGW
  • Faculty of Applied Informatics and Mathematics Warsaw University of Life Sciences – SGGW
  • Faculty of Applied Informatics and Mathematics Warsaw University of Life Sciences – SGGW

References

  • Anioła P., Gołaś Z. Zastosowanie wielowymiarowych metod statystycznych w typologii strategii oszczędnościowych gospodarstw domowych w Polsce, NBP, 2012.
  • Beckel Ch., Sadamori L., Santini S. (2013) Automatic SocioEconomic Classification of Households Using Electricity Consumption Data, e-Energy '13 Proceedings of the fourth International Conference on Future Energy Systems, 7586.
  • Breiman L., Friedman J.H., Olshen R.A., Stone C.J. Classification and Regression Trees, Wadsworth, Belmont, CA, 1984.
  • Chrzanowska M., Alfaro E., Witkowska D. (2009) The individual borrowers recognition: Single and ensemble trees, Expert Systems with Applications 36, 6409-6414.
  • Cooper G.F., Herskovits E. (1992) A Bayesian Method for the Induction of Probabilistic Networks from Data, Machine Learning 9, 309-347.
  • D’Ambrosio C. (2001) Household Characteristics and the Distribution of Income in Italy: an Application of Social Distance Measures, Review of Income and Wealth 47, No 1, 4364.
  • Dragulescu A.A., Yakovenko V.M. (2001) Exponential and power-law probability distributions of wealth and income in the United Kingdom and the United States, Physica A, 299, 213–221.
  • Dziechciarz-Duda M., Król A., Przybysz K. (2012) Minimum egzystencji a czynniki warunkujące skłonność do korzystania z pomocy społecznej. Klasyfikacja gospodarstw domowych, Taksonomia Nr 19 (242), 144152.
  • Gatnar E. Symboliczne metody klasyfikacji danych, PWN, Warszawa, 1998.
  • Heckerman D. (1997) Bayesian Networks for Data Mining, Data Mining and Knowledge Discovery 1, 79-119.
  • Heckerman D., Geiger D., Chickering D.M. (1995) Learning Bayesian networks: the combination of knowledge and statistical data, Machine Learning 20, 197-243.
  • Jagielski M., Kutner R. (2013) Modelling of income distribution in the European Union with the Fokker–Planck equation, Physica A 392, 2130–2138.
  • Kompa K., Witkowska D. (2013) Application of Classification Trees to Analyze Income Distribution in Poland, Quantitative Methods in Economics, Vol. XIV, No. 1, 265–275.
  • Koronacki J., Ćwik J. Statystyczne systemy uczące się, Exit, Warszawa, 2008.
  • Kozera A., Stanisławska J., Wysocki F. (2013) Klasyfikacja gospodarstw domowych ze względu na stopień zaspokojenia ich potrzeb mierzony kategorią minimum socjalnego, Marketing i Rynek 11, 31-38.
  • Lim T., Loh W., Shih Y. (2000) A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-three Old and New Classification Algorithms, Machine Learning, Vol. 40, 203–229.
  • Łukasiewicz P., Karpio K., Orłowski A. (2012) The Models of Personal Incomes in USA, Acta Physica Polonica A, Vol. 121, B-82–B-85 (2012).
  • Łukasiewicz P., Orłowski A. (2004) Probabilistic models of income distributions, Physica A 344, 146.
  • Morgan J.N., Sonquist J.A. (1963) Problems in the Analysis of Survey Data, and a Proposal, Journal of the American Statistical Association 58 (302), 415-434.
  • Olbryś J. (2007) Sieć bayesowska jako narzedzie pozyskiwania wiedzy z ekonomicznej bazy danych, Zeszyty Naukowe Politechniki Białostockiej, Informatyka 2, 93-107.
  • Quinlan J. R. (1993) C4.5: Programs for Machine Learning, Morgan Kaufmann, Los Altos.

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.desklight-8935dd27-3e7e-405f-aad5-4566a98604af
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.