Minimizing domain bias when adapting sentiment analysis techniques to the legal domain
dc.contributor.advisor | Perera AS | |
dc.contributor.advisor | De Silva N | |
dc.contributor.author | Ratnayaka G | |
dc.date.accept | 2022 | |
dc.date.accessioned | 2022 | |
dc.date.available | 2022 | |
dc.date.issued | 2022 | |
dc.description.abstract | Sentiment Analysis can be considered as an integral part of Natural Language Processing with a wide variety of significant use cases related to different application domains. Analyzing sentiments of descriptions that are given in Legal Opinion Texts has the potential to be applied in several legal information extraction tasks such as predicting the judgement of a legal case, predicting the winning party of a legal case, and identifying contradictory opinions and statements. However, the lack of annotated datasets for legal sentiment analysis imposes a major challenge when developing automatic approaches for legal sentiment analysis using supervised learning. In this work, we demonstrate an effective approach to develop reliable sentiment annotators for legal domain while utilizing a minimum number of resources. In that regard, we made use of domain adaptation techniques based on transfer learning, where a dataset from a high resource source domain is adapted to the target domain (legal opinion text domain). In this work, we have come up with a novel approach based on domain specific word representations to minimize the drawbacks that can be caused due to the differences in language semantics between the source and target domains when adapting a dataset from a source domain to a target domain. This novel approach is based on the observations that were derived using several word representational and language modelling techniques that were trained using legal domain specific copora. In order to evaluate different word representational techniques in the legal domain, we have prepared a legal domain specific context based verb similarity dataset named LeCoVe . The experiments carried out within this research work demonstrate that our approach to develop sentiment annotators for legal domain in a low resource setting is successful with promising results and significant improvements over existing works. | en_US |
dc.identifier.accno | TH5037 | en_US |
dc.identifier.citation | Ratnayaka, G. (2022). Minimizing domain bias when adapting sentiment analysis techniques to the legal domain [Master's theses, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/21669 | |
dc.identifier.degree | MSc In Computer Science and Engineering by Research | en_US |
dc.identifier.department | Department of Computer Science and Engineering | en_US |
dc.identifier.faculty | Engineering | en_US |
dc.identifier.uri | http://dl.lib.uom.lk/handle/123/21669 | |
dc.language.iso | en | en_US |
dc.subject | SENTIMENT ANALYSIS | en_US |
dc.subject | SEMANTIC ANALYSIS | en_US |
dc.subject | WORD REPRESENTATION | en_US |
dc.subject | DEEP LEARNING | en_US |
dc.subject | INFORMATION TECHNOLOGY -Dissertation | en_US |
dc.subject | COMPUTER SCIENCE -Dissertation | en_US |
dc.subject | COMPUTER SCIENCE & ENGINEERING -Dissertation | en_US |
dc.title | Minimizing domain bias when adapting sentiment analysis techniques to the legal domain | en_US |
dc.type | Thesis-Full-text | en_US |
Files
Original bundle
1 - 3 of 3
Loading...
- Name:
- TH5037-1.pdf
- Size:
- 247.24 KB
- Format:
- Adobe Portable Document Format
- Description:
- Pre-Text
Loading...
- Name:
- TH5037-2.pdf
- Size:
- 129.67 KB
- Format:
- Adobe Portable Document Format
- Description:
- Post-Text
Loading...
- Name:
- TH5037.pdf
- Size:
- 619.14 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full-theses
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: