Integrating music information retrieval and transfer learning for advanced emotion recognition in Sri Lankan crowd soundscapes : dataset creation and analysis

dc.contributor.advisorChithraranjan, C
dc.contributor.authorAriyathilake, PBSN
dc.date.accept2025
dc.date.accessioned2026-02-12T04:30:36Z
dc.date.issued2025
dc.description.abstractUnderstanding crowd emotions through sound is critical for applications in event monitoring, public safety, and mental health studies. However, there has been a notable gap in the availability of specialized datasets and novel robust models for classifying crowd sound emotions. To address this, a comprehensive Sri Lankan Crowd Sound Emotion Dataset (SLCSED) was developed, enriched with detailed annotations, to support future research. The study proposes a computational framework based on Music Information Retrieval (MIR) techniques combined with advanced machine learning algorithms to perform emotion classification in crowd. Feature extraction was performed using MIR methods, Wav2Vec 2.0 embeddings, and Emotion2Vec representation. PCA was applied as a dimensionality reduction technique. Various machine learning and transfer learning classifiers, including TabNet, LightGBM, Multi-Layer Perceptrons (MLP), wav2vec, and emotion2vec, were evaluated. Specific architectures were tuned for better accuracy, such as LightGBM with Gradient boosting and MLPs with hidden layers of (128, 64) units. Furthermore, emotion recognition models were developed using supervised learning methods, drawing inspiration from approaches tested on decision trees, random forests, XGBoost, and LightGBM in related studies. The results demonstrated highly promising outcomes, with the LightGBM classifier achieving up to 99.95% validation accuracy on the Emotional Crowd Sounds Data(ECSD) dataset and the MLP achieving 99.53% on the SLCSED dataset without dimensionality reduction. PCA was found to slightly reduce the performance in most cases. Additionally, the Emotion2Vec framework showed significant improvements after PCA application, reaching 99.99% accuracy. These findings highlight the effectiveness of MIR-based feature engineering combined with carefully selected classifiers for crowd emotion detection. This work not only fills a major gap by introducing a localized and richly annotated dataset but also presents a robust methodological pipeline for crowd sound emotion recognition, paving the way for future applications in real-world monitoring and psychological analysis.
dc.identifier.accnoTH6009
dc.identifier.citationAriyathilake, P.B.S.N. (2025). Integrating music information retrieval and transfer learning for advanced emotion recognition in Sri Lankan crowd soundscapes : dataset creation and analysis [Master\'s theses, University of Moratuwa]. Institutional Repository University of Moratuwa. https://dl.lib.uom.lk/handle/123/24845
dc.identifier.degreeMSc in Computer Science
dc.identifier.departmentDepartment of Computer Science & Engineering
dc.identifier.facultyEngineering
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/24845
dc.language.isoen
dc.subjectEMOTIONAL CROWD SOUND DATA
dc.subjectSRI LANKAN CROWD SOUND EMOTION DATASET
dc.subjectMUSIC INFORMATION RETRIEVAL
dc.subjectPRINCIPAL COMPONENT ANALYSIS
dc.subjectNEURAL NETWORKS-Multi-Layer Perceptions
dc.subjectCOMPUTER SCIENCE-Dissertation
dc.subjectCOMPUTER SCIENCE AND ENGINEERING-Dissertation
dc.subjectMSc in Computer Science
dc.titleIntegrating music information retrieval and transfer learning for advanced emotion recognition in Sri Lankan crowd soundscapes : dataset creation and analysis
dc.typeThesis-Abstract

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
TH6009-1.pdf
Size:
942.63 KB
Format:
Adobe Portable Document Format
Description:
Pre-text
Loading...
Thumbnail Image
Name:
TH6009-2.pdf
Size:
128.79 KB
Format:
Adobe Portable Document Format
Description:
Post-text
Loading...
Thumbnail Image
Name:
TH6009.pdf
Size:
9.04 MB
Format:
Adobe Portable Document Format
Description:
Full-thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: