Loglearn : predicting computer node failures using continuous machine learning

dc.contributor.advisor: Perera, I
dc.contributor.author: Kabilesh, K
dc.date.accept: 2023
dc.date.accessioned: 2024-08-13T02:43:39Z
dc.date.available: 2024-08-13T02:43:39Z
dc.date.issued: 2023
dc.description.abstract: Ensuring reliability, availability, and fault tolerance is crucial in modern computer systems. Despite substantial effort put into development, testing, and operation, failures still occur at runtime, with significant consequences. To address this issue, a proactive approach is necessary to predict and prevent failures before they happen. System and software logs provide essential data for monitoring systems and their performance at runtime. However, processing this information in real time poses a unique challenge for machine learning because of the properties of streaming big data such as logs. Therefore, this study uses the continuous machine learning paradigm to develop a failure prediction model called LogLearn, which uses system log data. The design and development of LogLearn consider the drawbacks and limitations of current continuous machine learning models in order to provide a more efficient and accurate approach to predicting computer node failures and their potential root cause with a high lead time. The LogLearn model is implemented with an online failure prediction method, which is evaluated using multiple algorithms. Logistic regression showed the best prediction performance. The LogLearn model outperformed previous studies’ models in terms of accuracy, precision, recall, and F1-score. Additionally, an online time-series prediction model using the SNARIMAX algorithm was implemented to forecast the potential time of failure. Although previous studies have shown promising results, their lead times were insufficient to fix the underlying cause of failure in advance. Thus, LogLearn provides a viable alternative approach for failure prediction in computer systems. [en_US]
dc.identifier.accno: TH5306 [en_US]
dc.identifier.citation: Kabilesh, K. (2023). Loglearn : predicting computer node failures using continuous machine learning [Master’s thesis, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/22653
dc.identifier.degree: MSc in Computer Science [en_US]
dc.identifier.department: Department of Computer Science and Engineering [en_US]
dc.identifier.faculty: Engineering [en_US]
dc.identifier.uri: http://dl.lib.uom.lk/handle/123/22653
dc.language.iso: en [en_US]
dc.subject: SYSTEM LOGS [en_US]
dc.subject: CONTINUOUS MACHINE LEARNING [en_US]
dc.subject: DATA STREAMS [en_US]
dc.subject: FAILURE PREDICTION [en_US]
dc.subject: ANOMALY DETECTION [en_US]
dc.subject: COMPUTER SCIENCE & ENGINEERING – Dissertation [en_US]
dc.subject: COMPUTER SCIENCE- Dissertation [en_US]
dc.title: Loglearn : predicting computer node failures using continuous machine learning [en_US]
dc.title.alternative: predicting computer node failures using continuous machine learning [en_US]
dc.type: Thesis-Abstract [en_US]

Files

Original bundle

Name: TH5306-1.pdf
Size: 85.42 KB
Format: Adobe Portable Document Format
Description: Pre-text

Name: TH5306-2.pdf
Size: 93.41 KB
Format: Adobe Portable Document Format
Description: Post-text

Name: TH5306.pdf
Size: 5.22 MB
Format: Adobe Portable Document Format
Description: Full-thesis