Dataset and Baseline for Automatic Student Feedback Analysis

Nilanga KHerath MMaduwantha HRanathunga SDataset and Baseline for Automatic Student Feedback AnalysisEuropean Language Resources Association (ELRA)2022Target-oriented Opinion Word ExtractionAspect-level Sentiment AnalysisDocument-level Sentiment AnalysisPre-Trained Language Models (PLM)Student FeedbackMy UniversityMy UniversityCalzolari NBéchet FBlache PChoukri KCieri CDeclerck TGoggi SIsahara HMaegaard BMariani JMazo HOdijk JPiperidis S2025-06-112025-06-112022-01-01conferenceNilanga K, Herath M, Maduwantha H, Ranathunga S. (2022). Dataset and Baseline for Automatic Student Feedback Analysis. Calzolari N, Béchet F, Blache P, Choukri K, Cieri C, Declerck T, Goggi S, Isahara H, Maegaard B, Mariani J, Mazo H, Odijk J, Piperidis S. 2022 Language Resources and Evaluation Conference, LREC 2022. (pp. 2042-2049). European Language Resources Association (ELRA).979-10-95546-72-6https://mro.massey.ac.nz/handle/10179/73025c-conference-paper-in-proceedings2042-2049In this paper, we present a student feedback corpus that contains 3000 instances of feedback written by university students. This dataset has been annotated for aspect terms, opinion terms, polarities of the opinion terms towards targeted aspects, and document-level opinion polarities. We developed a hierarchical taxonomy for aspect categorisation, which covers many aspects of the teaching-learning process. We annotated both implicit and explicit aspects using this taxonomy. Annotation methodology, difficulties faced during the annotation, and the details of the aspect term categorization are discussed in detail. Using state-of-the-art techniques, we have built baseline models for the following tasks: Target oriented Opinion Extraction, Aspect Level Sentiment Analysis, and Document Level Sentiment Analysis. These models reported 64%, 75%, and 86% F1 scores (respectively) for the considered tasks. These results illustrate the reliability and usability of the corpus for different tasks related to sentiment analysis.