Question classification for the travel domain using deep contextualised word embedding models

Weerakoon CY

UoM IR
→
Thesis & Dissertation
→
Faculty of Engineering, Computer Science & Engineering
→
Master of Science in Computer science and Engineering
→
View Item

dc.contributor.advisor	Ranathunga S
dc.contributor.author	Weerakoon CY
dc.date.accessioned	2021
dc.date.available	2021
dc.date.issued	2021
dc.identifier.citation	Weerakoon, CY. (2021). Question classification for the travel domain using deep contextualised word embedding models [Master's theses, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/20757
dc.identifier.uri	http://dl.lib.uom.lk/handle/123/20757
dc.description.abstract	Question answering can be considered as a key area in Natural Language Processing and Information Retrieval, where users construct queries in natural language and receive suitable answers in return. In the travel domain, most questions are “content questions”, where the expected answer is not the equivalent of “yes” or “no”, but rather factual information. Replying to a free-form factual question based on a large collection of text is challenging. Previous research has shown that the accuracy of question answering systems can be improved by adding a classification phase based on the expected answer type. This research focuses on implementing a multi-level, multi-class question classification system focusing on the travel domain. Existing research for the travel domain is conducted using language-specific features and traditional Machine Learning models. In contrast, this research employs transformer-based state-of-the-art deep contextualized word embedding models for question classification. The proposed method improves the coarse class Micro F1-score by 5.43% compared to baseline. Fine-grain Micro F1-Score has also improved by 3.8%. This research also presents an empirical analysis of the effectiveness of different transformer-based deep contextualized word embedding models for multi-level multi-class classification	en_US
dc.language.iso	en	en_US
dc.subject	DEEP CONTEXTUALIZED WORD EMBEDDING MODELS	en_US
dc.subject	QUESTION CLASSIFICATION	en_US
dc.subject	WORD EMBEDDING	en_US
dc.subject	NATURAL LANGUAGE PROCESSING	en_US
dc.subject	INFORMATION TECHNOLOGY -Dissertation	en_US
dc.subject	COMPUTER SCIENCE -Dissertation	en_US
dc.subject	COMPUTER SCIENCE & ENGINEERING -Dissertation	en_US
dc.title	Question classification for the travel domain using deep contextualised word embedding models	en_US
dc.type	Thesis-Abstract	en_US
dc.identifier.faculty	Engineering	en_US
dc.identifier.degree	MSc in Computer Science and Engineering	en_US
dc.identifier.department	Department of Computer Science & Engineering	en_US
dc.date.accept	2021
dc.identifier.accno	TH4587	en_US