Fine-grained main ideas extraction and clustering of online course reviews
Fine-grained main ideas extraction and clustering of online course reviews
Online course reviews have been an essential way in which course providers could get insights into students’ perceptions about the course quality, especially in the context of massive open online courses (MOOCs), where it is hard for both parties to get further interaction. Analyzing online course reviews is thus an inevitable part for course providers towards the improvement of course quality and the structuring of future courses. However, reading through the often-time thousands of comments and extracting key ideas is not efficient and will potentially incur non-coverage of some important ideas. In this work, we propose a key idea extractor that is based on fine-grained aspect-level semantic units from comments, powered by different variations of state-of-the-art pre-trained language models (PLMs). Our approach differs from both previous topic modeling and keyword extraction methods, which lies in: First, we aim to not only eliminate the heavy reliance on human intervention and statistical characteristics that traditional topic models like LDA are based on, but also to overcome the coarse granularity of state-of-the-art topic models like top2vec. Second, different from previous keyword extraction methods, we do not extract keywords to summarize each comment, which we argue is not necessarily helpful for human readers to grasp key ideas at the course level. Instead, we cluster the ideas and concerns that have been most expressed throughout the whole course, without relying on the verbatimness of students’ wording. We show that this method provides high and stable coverage of students’ ideas.
294-306
Xiao, Chenghao
3f842843-9a48-4d52-bcf8-eb880669e114
Shi, Lei
676eb692-a53b-495a-a228-4b2d35d1703d
Cristea, Alexandra
e49d8136-3747-4a01-8fde-694151b7d718
Li, Zhaoxing
65935c45-a640-496c-98b8-43bed39e1850
Pan, Ziqi
83c72a73-ff6e-4865-9aac-f35d0dfbfa39
2022
Xiao, Chenghao
3f842843-9a48-4d52-bcf8-eb880669e114
Shi, Lei
676eb692-a53b-495a-a228-4b2d35d1703d
Cristea, Alexandra
e49d8136-3747-4a01-8fde-694151b7d718
Li, Zhaoxing
65935c45-a640-496c-98b8-43bed39e1850
Pan, Ziqi
83c72a73-ff6e-4865-9aac-f35d0dfbfa39
Xiao, Chenghao, Shi, Lei, Cristea, Alexandra, Li, Zhaoxing and Pan, Ziqi
(2022)
Fine-grained main ideas extraction and clustering of online course reviews.
Rodrigo, Maria Mercedes, Matsuda, Noburu, Cristea, Alexandra I. and Dimitrova, Vania
(eds.)
In Artificial Intelligence in Education: 23rd International Conference, AIED 2022, Durham, UK, July 27–31, 2022, Proceedings, Part I.
vol. 13355,
Springer Cham.
.
(doi:10.1007/978-3-031-11644-5_24).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Online course reviews have been an essential way in which course providers could get insights into students’ perceptions about the course quality, especially in the context of massive open online courses (MOOCs), where it is hard for both parties to get further interaction. Analyzing online course reviews is thus an inevitable part for course providers towards the improvement of course quality and the structuring of future courses. However, reading through the often-time thousands of comments and extracting key ideas is not efficient and will potentially incur non-coverage of some important ideas. In this work, we propose a key idea extractor that is based on fine-grained aspect-level semantic units from comments, powered by different variations of state-of-the-art pre-trained language models (PLMs). Our approach differs from both previous topic modeling and keyword extraction methods, which lies in: First, we aim to not only eliminate the heavy reliance on human intervention and statistical characteristics that traditional topic models like LDA are based on, but also to overcome the coarse granularity of state-of-the-art topic models like top2vec. Second, different from previous keyword extraction methods, we do not extract keywords to summarize each comment, which we argue is not necessarily helpful for human readers to grasp key ideas at the course level. Instead, we cluster the ideas and concerns that have been most expressed throughout the whole course, without relying on the verbatimness of students’ wording. We show that this method provides high and stable coverage of students’ ideas.
This record has no associated files available for download.
More information
Published date: 2022
Venue - Dates:
Artificial Intelligence in Education 23rd International Conference, Durham University, Durham, United Kingdom, 2022-07-27 - 2022-07-31
Identifiers
Local EPrints ID: 486480
URI: http://eprints.soton.ac.uk/id/eprint/486480
ISSN: 0302-9743
PURE UUID: 0141bcdd-b9e8-45c8-b869-74fe36099a2b
Catalogue record
Date deposited: 24 Jan 2024 17:35
Last modified: 18 Mar 2024 04:17
Export record
Altmetrics
Contributors
Author:
Chenghao Xiao
Author:
Lei Shi
Author:
Alexandra Cristea
Author:
Zhaoxing Li
Author:
Ziqi Pan
Editor:
Maria Mercedes Rodrigo
Editor:
Noburu Matsuda
Editor:
Alexandra I. Cristea
Editor:
Vania Dimitrova
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics