D 2016

A Comparative Analysis of Techniques for Predicting Student Performance

BYDŽOVSKÁ, Hana

Basic information

Original name

A Comparative Analysis of Techniques for Predicting Student Performance

Edition

Raleigh, NC, USA, Proceedings of the 9th International Conference on Educational Data Mining, p. 306-311, 6 pp. 2016

Publisher

International Educational Data Mining Society

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States of America

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

electronic version available online

Organization unit

Faculty of Informatics

Keywords in English

student performance prediction; student similarity; classification; regression; collaborative filtering

Tags

International impact, Reviewed
Změněno: 10/11/2016 15:29, RNDr. Hana Bydžovská, Ph.D.

Abstract

V originále

The problem of student final grade prediction in a particular course has recently been addressed using data mining techniques. In this paper, we present two different approaches solving this task. Both approaches are validated on 138 courses which were offered to students of the Faculty of Informatics of Masaryk University between the years of 2010 and 2013. The first approach is based on classification and regression algorithms that search for patterns in study-related data and also data about students' social behavior. We prove that students’ social behavior characteristics improve prediction for a quarter of courses. The second approach is based on collaborative filtering techniques. We predict the final grades based on previous achievements of similar students. The results show that both approaches reached similar average results and can be beneficially utilized for student final grade prediction. The first approach reaches significantly better results for courses with a small number of students. In contrary, the second approach achieves significantly better results for mathematical courses. We also identified groups of courses for which we are not able to predict the grades reliably. Finally, we are able to correctly identify half of all failures (that constitute less than a quarter of all grades) and predict the final grades only with the error of one degree in the grade scale.

Links

MUNI/A/0945/2015, interní kód MU
Name: Rozsáhlé výpočetní systémy: modely, aplikace a verifikace V.
Investor: Masaryk University, Category A