Prediction of diabetes using cost sensitive learning and oversampling techniques on Bangladeshi and Indian female patients

dc.contributor.authorPranto, B
dc.contributor.authorMehnaz, SK
dc.contributor.authorMomen, S
dc.contributor.authorHuq, SM
dc.contributor.editorKarunananda, AS
dc.contributor.editorTalagala, PD
dc.date.accessioned2022-11-14T09:39:55Z
dc.date.available2022-11-14T09:39:55Z
dc.date.issued2020-12
dc.description.abstractDiabetes is a major non-communicable disease that is responsible for many associated health risks and is rapidly increasing in low and middle income countries like Bangladesh. Class imbalance existing in datasets is a dire issue that can result the predictions of diabetes to be biased towards the majority class - thus reducing the reliability of machine learning models. Considering the associated risks of diabetes, a decrease in recall can result in life threatening consequences. In order to tackle this problem, a cost-sensitive learning and synthetic minority oversampling technique (SMOTE) have been applied on the PIMA Indian dataset. After that, the models have been tested on PIMA test set as well as on dataset collected from Kurmitola General Hospital (KGH), Dhaka, Bangladesh. Our results demonstrate that this proposed approach has successfully improved the reliability of the previous ML models to predict diabetes among Bangladeshi female population.en_US
dc.identifier.citationB. Pranto, S. M. Mehnaz, S. Momen and S. M. Huq, "Prediction of diabetes using cost sensitive learning and oversampling techniques on Bangladeshi and Indian female patients," 2020 5th International Conference on Information Technology Research (ICITR), 2020, pp. 1-6, doi: 10.1109/ICITR51448.2020.9310892.en_US
dc.identifier.conference5th International Conference in Information Technology Research 2020en_US
dc.identifier.departmentInformation Technology Research Unit, Faculty of Information Technology, University of Moratuwa.en_US
dc.identifier.doidoi: 10.1109/ICITR51448.2020.9310892.en_US
dc.identifier.facultyITen_US
dc.identifier.placeMoratuwa, Sri Lankaen_US
dc.identifier.proceedingProceedings of the 5th International Conference in Information Technology Research 2020en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/19509
dc.identifier.year2020en_US
dc.language.isoenen_US
dc.publisherFaculty of Information Technology, University of Moratuwa.en_US
dc.relation.urihttps://ieeexplore.ieee.org/document/9310892en_US
dc.subjectDiabetes predictionen_US
dc.subjectImbalanced dataseten_US
dc.subjectCost-sensitive learningen_US
dc.subjectSMOTEen_US
dc.subjectPrecisionen_US
dc.subjectRecallen_US
dc.titlePrediction of diabetes using cost sensitive learning and oversampling techniques on Bangladeshi and Indian female patientsen_US
dc.typeConference-Full-texten_US

Files

Collections