Early Prediction and Variable Importance of Certificate Accomplishment in a MOOC
Date
2017-05-23
Abstract
The emergence of MOOCs (Massive Open Online Courses) has made available large amounts of data about students' interactions with online educational platforms. These data make it possible to predict students' future learning outcomes from their interactions. Predicting certificate accomplishment can enable the early detection of students at risk, so that interventions can be performed before it is too late. This study applies different machine learning techniques to predict which students will earn a certificate, using the data available at different timeframes of the course. The purpose is to analyze how the quality metrics change when the models have more data available. Of the four machine learning techniques applied, we select a boosted trees model, which provides stable predictions over the weeks with good quality metrics. We determine the variables that are most important for the prediction and how their importance changes during the weeks of the course.
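
The week-by-week evaluation described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic, the feature names (`videos`, `exercises`, `noise`) are hypothetical, and a small hand-written booster of one-split trees (stumps) with squared loss stands in for the boosted trees model, whose actual implementation and input variables are not given here. The sketch refits the model using only the activity available up to each week, then reports AUC on held-out students and a gain-based variable importance.

```python
import random

FEATURES = ["videos", "exercises", "noise"]  # hypothetical variable names

def make_students(n=300, weeks=5, seed=0):
    """Synthetic per-week activity plus a certificate label (all invented)."""
    rng = random.Random(seed)
    students = []
    for _ in range(n):
        engagement = rng.random()  # hidden trait driving activity and outcome
        videos = [engagement * 10 + rng.gauss(0, 2) for _ in range(weeks)]
        exercises = [engagement * 5 + rng.gauss(0, 2) for _ in range(weeks)]
        noise = [rng.gauss(0, 1) for _ in range(weeks)]  # uninformative control
        label = 1 if engagement + rng.gauss(0, 0.15) > 0.6 else 0
        students.append((videos, exercises, noise, label))
    return students

def features_up_to(student, week):
    """Cumulative activity using only the first `week` weeks."""
    return [sum(series[:week]) for series in student[:3]]

def best_stump(X, residuals):
    """Return the one-feature threshold split that best fits the residuals."""
    n, total, best = len(X), sum(residuals), None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        step = max(1, len(values) // 10)
        for t in values[step::step]:  # roughly ten candidate thresholds
            left = [r for row, r in zip(X, residuals) if row[j] < t]
            nl, sl = len(left), sum(left)
            nr, sr = n - nl, total - sl
            # Reduction in squared error from splitting at threshold t.
            gain = sl * sl / nl + sr * sr / nr - total * total / n
            if best is None or gain > best[0]:
                best = (gain, j, t, sl / nl, sr / nr)
    return best

def fit_boosted_stumps(X, y, rounds=30, lr=0.3):
    """Gradient boosting with squared loss; importance is accumulated gain."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps, importance = [], [0.0] * len(X[0])
    for _ in range(rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        found = best_stump(X, residuals)
        if found is None:
            break
        gain, j, t, left_val, right_val = found
        stumps.append((j, t, lr * left_val, lr * right_val))
        importance[j] += gain
        pred = [p + (lr * left_val if row[j] < t else lr * right_val)
                for p, row in zip(pred, X)]
    total = sum(importance) or 1.0
    return base, stumps, [g / total for g in importance]

def predict(model, row):
    base, stumps, _ = model
    return base + sum(l if row[j] < t else r for j, t, l, r in stumps)

def auc(scores, labels):
    """Probability that a random positive outranks a random negative."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def evaluate_by_week(weeks=5):
    """Refit at each week using only the data available so far."""
    students = make_students(weeks=weeks)
    train, test = students[:200], students[200:]
    results = []
    for week in range(1, weeks + 1):
        X_train = [features_up_to(s, week) for s in train]
        model = fit_boosted_stumps(X_train, [s[3] for s in train])
        scores = [predict(model, features_up_to(s, week)) for s in test]
        results.append((week, auc(scores, [s[3] for s in test]), model[2]))
    return results

for week, score, importance in evaluate_by_week():
    ranked = ", ".join(f"{n}={v:.2f}" for n, v in zip(FEATURES, importance))
    print(f"week {week}: AUC={score:.3f}  importance: {ranked}")
```

Each printed line corresponds to one timeframe, so the output shows both how the quality metric evolves as more weeks of data become available and how the relative importance of each variable shifts. In practice a library implementation of boosted trees would replace the hand-written booster.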