Question classification for the travel domain using deep contextualised word embedding models

dc.contributor.advisorRanathunga S
dc.contributor.authorWeerakoon CY
dc.date.accept2021
dc.date.accessioned2021
dc.date.available2021
dc.date.issued2021
dc.description.abstractQuestion answering can be considered as a key area in Natural Language Processing and Information Retrieval, where users construct queries in natural language and receive suitable answers in return. In the travel domain, most questions are “content questions”, where the expected answer is not the equivalent of “yes” or “no”, but rather factual information. Replying to a free-form factual question based on a large collection of text is challenging. Previous research has shown that the accuracy of question answering systems can be improved by adding a classification phase based on the expected answer type. This research focuses on implementing a multi-level, multi-class question classification system focusing on the travel domain. Existing research for the travel domain is conducted using language-specific features and traditional Machine Learning models. In contrast, this research employs transformer-based state-of-the-art deep contextualized word embedding models for question classification. The proposed method improves the coarse class Micro F1-score by 5.43% compared to baseline. Fine-grain Micro F1-Score has also improved by 3.8%. This research also presents an empirical analysis of the effectiveness of different transformer-based deep contextualized word embedding models for multi-level multi-class classificationen_US
dc.identifier.accnoTH4587en_US
dc.identifier.citationWeerakoon, CY. (2021). Question classification for the travel domain using deep contextualised word embedding models [Master's theses, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/20757
dc.identifier.degreeMSc in Computer Science and Engineeringen_US
dc.identifier.departmentDepartment of Computer Science & Engineeringen_US
dc.identifier.facultyEngineeringen_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/20757
dc.language.isoenen_US
dc.subjectDEEP CONTEXTUALIZED WORD EMBEDDING MODELSen_US
dc.subjectQUESTION CLASSIFICATIONen_US
dc.subjectWORD EMBEDDINGen_US
dc.subjectNATURAL LANGUAGE PROCESSINGen_US
dc.subjectINFORMATION TECHNOLOGY -Dissertationen_US
dc.subjectCOMPUTER SCIENCE -Dissertationen_US
dc.subjectCOMPUTER SCIENCE & ENGINEERING -Dissertationen_US
dc.titleQuestion classification for the travel domain using deep contextualised word embedding modelsen_US
dc.typeThesis-Abstracten_US

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
TH4587-1.pdf
Size:
325.04 KB
Format:
Adobe Portable Document Format
Description:
Pre-text
Loading...
Thumbnail Image
Name:
TH4587-2.pdf
Size:
225.63 KB
Format:
Adobe Portable Document Format
Description:
Post-text
Loading...
Thumbnail Image
Name:
TH4587.pdf
Size:
1.34 MB
Format:
Adobe Portable Document Format
Description:
Full-thesis