Sentiment analysis for financial market prediction

dc.contributor.advisorRanathunga S
dc.contributor.authorMethmal THH
dc.date.accept2020
dc.date.accessioned2020
dc.date.available2020
dc.date.issued2020
dc.description.abstractToday’s world is highly dependent on financial markets. Financial markets are very dynamic, making it difficult to predict prices. If we can make good predictions, we can make financial gains without any risks. There are two main categories of data that we can use to predict the market price; historic market data and textual data. Most importantly, textual analysis on sources like news, social media and reports is more popular today among researchers. This process can be introduced as sentiment analysis. The whole idea behind sentiment analysis is checking the opinion behind the text; whether it is a positive, negative, or neutral polarity. This research focuses on sentiment-analysis-based financial market prediction using deep leaning. Market prediction using sentiment analysis is a very challenging task. There are complex linguistic issues to solve and using a microblog dataset like Twitter for the prediction task makes it even more difficult. However, the current prediction approach rarely exceeds the seventy percent accuracy mark. This research is based on the SEMEVAL 2017 fifth task and will use the same dataset shared by the SEMEVAL team. This thesis presents an improved version for above mention reported baseline. We experiment with different techniques in both machine leaning and deep learning domains. Lexicon based dictionaries are heavily used here in each model since this is a small dataset with train set (1693) and test set (793). We had to enlarge the dataset as much as possible to achieve good accuracy. We created mainly four models which are based on machine learning and deep leaning techniques. Support vector regression algorithm is used for the machine leaning model. Also we used convolutional neural network (CNN) , long short term memory (LSTM ) and gated recurrent unit (GRU ) as deep leaning architectures which are performed better than any of the baseline models on this dataset. Our deep leaning models are achieved maximum similarity scores than any of the single system. Finally, we experiment three main ensemble techniques which are Averaging Linear Regression and multilayer perceptron. We achieved best results from averaging ensemble model.en_US
dc.identifier.accnoTH4260en_US
dc.identifier.degreeMSc in Computer Scienceen_US
dc.identifier.departmentDepartment of Computer Science & Engineeringen_US
dc.identifier.facultyEngineeringen_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/16785
dc.language.isoenen_US
dc.subjectCOMPUTER SCIENCE AND ENGINEERING-Dissertationsen_US
dc.subjectCOMPUTER SCIENCE-Dissertationsen_US
dc.subjectSENTIMENT ANALYSIS-Financeen_US
dc.subjectFINANCE-Data Processingen_US
dc.subjectCOMPUTATIONAL LINGUISTICSen_US
dc.subjectMACHINE LEARNINGen_US
dc.subjectDEEP LEARNINGen_US
dc.subjectSEMEVAL 2017en_US
dc.titleSentiment analysis for financial market predictionen_US
dc.typeThesis-Full-texten_US

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
TH4260-1.pdf
Size:
299.17 KB
Format:
Adobe Portable Document Format
Description:
Pre-text
Loading...
Thumbnail Image
Name:
TH4260-2.pdf
Size:
263.47 KB
Format:
Adobe Portable Document Format
Description:
Post-text
Loading...
Thumbnail Image
Name:
TH4260.pdf
Size:
1.45 MB
Format:
Adobe Portable Document Format
Description:
Full-thesis