Results found: 2

Search results

Search:
in the keywords: k-nearest neighbour

Sort By:

Limit search:

A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

100%

Zekić-Sušac M., Pfeifer S., Šarlija N.

Business Systems Research Journal

2014

vol. 5

issue 3

82-96

Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and in the post-processing stage. However, such a reduction usually provides less information and yields a lower accuracy of the model. Objectives: The aim of this paper is to assess the high-dimensional classification problem of recognizing entrepreneurial intentions of students by machine learning methods. Methods/Approach: Four methods were tested: artificial neural networks, CART classification trees, support vector machines, and k-nearest neighbour on the same dataset in order to compare their efficiency in the sense of classification accuracy. The performance of each method was compared on ten subsamples in a 10-fold cross-validation procedure in order to assess computing sensitivity and specificity of each model. Results: The artificial neural network model based on multilayer perceptron yielded a higher classification rate than the models produced by other methods. The pairwise t-test showed a statistical significance between the artificial neural network and the k-nearest neighbour model, while the difference among other methods was not statistically significant. Conclusions: Tested machine learning methods are able to learn fast and achieve high classification accuracy. However, further advancement can be assured by testing a few additional methodological refinements in machine learning methods.

Predicting Dropout Student: An Application of Data Mining Methods in an Online Education Program

100%

Yukselturk E., Ozekes S., Türel Y. K.

European Journal of Open, Distance and E-Learning

2014

vol. 17

issue 1

118-133

This study examined the prediction of dropouts through data mining approaches in an online program. The subject of the study was selected from a total of 189 students who registered to the online Information Technologies Certificate Program in 2007-2009. The data was collected through online questionnaires (Demographic Survey, Online Technologies Self-Efficacy Scale, Readiness for Online Learning Questionnaire, Locus of Control Scale, and Prior Knowledge Questionnaire). The collected data included 10 variables, which were gender, age, educational level, previous online experience, occupation, self efficacy, readiness, prior knowledge, locus of control, and the dropout status as the class label (dropout/not). In order to classify dropout students, four data mining approaches were applied based on k-Nearest Neighbour (k-NN), Decision Tree (DT), Naive Bayes (NB) and Neural Network (NN). These methods were trained and tested using 10-fold cross validation. The detection sensitivities of 3-NN, DT, NN and NB classifiers were 87%, 79.7%, 76.8% and 73.9% respectively. Also, using Genetic Algorithm (GA) based feature selection method, online technologies self-efficacy, online learning readiness, and previous online experience were found as the most important factors in predicting the dropouts.

Refine search results

1 Business Systems Research Journal

1 European Journal of Open, Distance and E-Learning

1 Ozekes S.

1 Pfeifer S.

1 Türel Y. K.

1 Yukselturk E.

1 Zekić-Sušac M.

1 Šarlija N.

2 2014

Search results

A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

Predicting Dropout Student: An Application of Data Mining Methods in an Online Education Program