CatBoost and random forest algorithms in binary classification tasks
| dc.contributor.author | Liyanage, CK | |
| dc.contributor.author | Thayasivam, U | |
| dc.contributor.editor | Gunawardena, S | |
| dc.date.accessioned | 2025-11-21T05:57:53Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Among the many ML techniques, ensemble learning methods have grabbed significant attention due to their enhanced predictive performance achieved by combining multiple learning algorithms. Notably, ensemble methods like Random Forest, CatBoost have demonstrated superior capabilities in various predictive tasks, including numerous Kaggle competitions. In this study, I conduct an analysis of these two ensemble learning algorithms within the context of a binary classification task. This paper details the dataset selected, followed by the development of classification models using Random Forest, and CatBoost algorithms. I systematically evaluate and compare the performance of these models, analyzing the impact of hyperparameter optimization by Bayesian Optimization, model suitability based on features present, for both algorithms. The findings offer insights into the suitability of specific preprocessing techniques and model selections for each dataset, contributing to the optimization of ensemble learning applica tions in classification tasks. | |
| dc.identifier.conference | Applied Data Science & Artificial Intelligence (ADScAI) Symposium 2025 | |
| dc.identifier.department | Department of Computer Science & Engineering | |
| dc.identifier.doi | https://doi.org/10.31705/ADScAI.2025.29 | |
| dc.identifier.email | chathurangi.22@cse.mrt.ac.lk | |
| dc.identifier.email | rtuthaya@cse.mrt.ac.lk | |
| dc.identifier.faculty | Engineering | |
| dc.identifier.place | Moratuwa, Sri Lanka | |
| dc.identifier.proceeding | Proceedings of Applied Data Science & Artificial Intelligence Symposium 2025 | |
| dc.identifier.uri | https://dl.lib.uom.lk/handle/123/24429 | |
| dc.language.iso | en | |
| dc.publisher | Department of Computer Science and Engineering | |
| dc.subject | CatBoost | |
| dc.subject | RandomForest | |
| dc.subject | Bayesian Optimization | |
| dc.subject | Model Architecture Selection | |
| dc.title | CatBoost and random forest algorithms in binary classification tasks | |
| dc.type | Conference-Extended-Abstract |
