Insights from a socio-temporal approach to student failure prediction through discussion forum dynamics
Insights from a socio-temporal approach to student failure prediction through discussion forum dynamics
This research paper addresses the significant problem of identifying students-at-risk of failing or dropping out in educational settings. While extensively studied, improving early detection of students likely to fail or drop out remains essential for universities to provide support resources. Previous methods have relied on the students' academic performance to analyse and predict learning strategies, but the complexity of implementing predictive models is heightened by various factors influencing student outcomes. Notably, learning is both a dynamic and socially regulated process, with time and social interactions playing key roles for academic achievement. Nonetheless, despite the importance of these elements, educational research investigating their combined effect is scarce. As grade distribution is affected by several elements, including teaching modalities, grading policies, and course design, identifying students-at-risk and their learning strategies is generally an imbalanced problem, which can lead to biases in predictive algorithms. Our work addresses this issue from a social and temporal perspective, guided by two research questions: (1) To what extent is it possible to inform the early identification of students-at-risk of failing based on interaction data from online discussion forums?, and (2) How does the classification performance compare between traditional oversampling methods and oversampling methods that take the structure of the interactions into account? We based our research on data from an undergraduate course's online forum to build a temporal network of students' communication events across the 12 weeks of the course. Temporal sequences of centrality measures from these interactions served as input for time series classification algorithms. Two oversampling methods are compared: baseline minority oversampling, and a state-of-the-art graph oversampling method that accounts for network structure. Our results show that a temporal network approach, coupled with node oversampling, can enhance student-at-risk identification. However, due to the complexity of the problem and the interactions' sparsity the classification performance is limited when relying solely on this data. We discuss the impact of our findings and contributions, implications, limitations, and future research directions.
class imbalance, discussion forum, higher education, network dynamics, students-at-risk
López Flores, Nidia Guadalupe
1caf2b43-8f51-49ff-b1fa-974d213a8196
Uc-Cetina, Víctor
a313e15a-7773-422e-bc76-296eae0f2715
Islind, Anna Sigridur
46e6353f-a1b6-4628-916c-18e817695d03
Óskarsdóttir, María
d159ed8f-9dd3-4ff3-8b00-d43579ab71be
13 October 2024
López Flores, Nidia Guadalupe
1caf2b43-8f51-49ff-b1fa-974d213a8196
Uc-Cetina, Víctor
a313e15a-7773-422e-bc76-296eae0f2715
Islind, Anna Sigridur
46e6353f-a1b6-4628-916c-18e817695d03
Óskarsdóttir, María
d159ed8f-9dd3-4ff3-8b00-d43579ab71be
López Flores, Nidia Guadalupe, Uc-Cetina, Víctor, Islind, Anna Sigridur and Óskarsdóttir, María
(2024)
Insights from a socio-temporal approach to student failure prediction through discussion forum dynamics.
In 2024 IEEE Frontiers in Education Conference, FIE 2024 - Proceedings.
IEEE..
(doi:10.1109/FIE61694.2024.10893540).
Record type:
Conference or Workshop Item
(Paper)
Abstract
This research paper addresses the significant problem of identifying students-at-risk of failing or dropping out in educational settings. While extensively studied, improving early detection of students likely to fail or drop out remains essential for universities to provide support resources. Previous methods have relied on the students' academic performance to analyse and predict learning strategies, but the complexity of implementing predictive models is heightened by various factors influencing student outcomes. Notably, learning is both a dynamic and socially regulated process, with time and social interactions playing key roles for academic achievement. Nonetheless, despite the importance of these elements, educational research investigating their combined effect is scarce. As grade distribution is affected by several elements, including teaching modalities, grading policies, and course design, identifying students-at-risk and their learning strategies is generally an imbalanced problem, which can lead to biases in predictive algorithms. Our work addresses this issue from a social and temporal perspective, guided by two research questions: (1) To what extent is it possible to inform the early identification of students-at-risk of failing based on interaction data from online discussion forums?, and (2) How does the classification performance compare between traditional oversampling methods and oversampling methods that take the structure of the interactions into account? We based our research on data from an undergraduate course's online forum to build a temporal network of students' communication events across the 12 weeks of the course. Temporal sequences of centrality measures from these interactions served as input for time series classification algorithms. Two oversampling methods are compared: baseline minority oversampling, and a state-of-the-art graph oversampling method that accounts for network structure. Our results show that a temporal network approach, coupled with node oversampling, can enhance student-at-risk identification. However, due to the complexity of the problem and the interactions' sparsity the classification performance is limited when relying solely on this data. We discuss the impact of our findings and contributions, implications, limitations, and future research directions.
This record has no associated files available for download.
More information
Published date: 13 October 2024
Additional Information:
Publisher Copyright:
© 2024 IEEE.
Venue - Dates:
54th IEEE Frontiers in Education Conference, FIE 2024, , Washington, United States, 2024-10-13 - 2024-10-16
Keywords:
class imbalance, discussion forum, higher education, network dynamics, students-at-risk
Identifiers
Local EPrints ID: 508401
URI: http://eprints.soton.ac.uk/id/eprint/508401
ISSN: 1539-4565
PURE UUID: 4dfbeefb-b66e-4007-9de6-9cc144c23768
Catalogue record
Date deposited: 20 Jan 2026 17:58
Last modified: 21 Jan 2026 03:11
Export record
Altmetrics
Contributors
Author:
Nidia Guadalupe López Flores
Author:
Víctor Uc-Cetina
Author:
Anna Sigridur Islind
Author:
María Óskarsdóttir
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics