Modelling the risk for type 2 diabetes using logistic regression approach
dc.contributor.advisor | Peiris, TSG | |
dc.contributor.advisor | Jayasundara, DDM | |
dc.contributor.author | Attanayake, AMCH | |
dc.date.accept | 2016 | |
dc.date.accessioned | 2017-03-28T06:14:52Z | |
dc.date.available | 2017-03-28T06:14:52Z | |
dc.description.abstract | Type 2 diabetes is one of the growing vitally fatal diseases all over the world. The knowledge of the significant risk factors for type 2 diabetes will be useful to keep the diabetes under control. This study has identified eight significant risk factors for type 2 diabetes in the data set of UCI machine learning repository by using point-biserial correlation. With the aim of developing an accurate predictive model to predict the presence of diabetes based on identified significant risk factors a binary logistic regression approach was applied. The performance of a predictive model is overestimated when simply determined on the sample of subjects that was used to construct the model. Therefore five-fold cross validation technique has applied in order to validate the predictive ability of the developed model. Results reveal that low value of optimism (0.0108) and high value of c-statistic (0.8512) in the fitted model indicating an acceptable discrimination power of type 2 diabetes. There is a significant influence by Glucose level, BMI and Pedigree for the diabetes on the classification of the patient as type 2 diabetes. | en_US |
dc.identifier.accno | TH3199 | en_US |
dc.identifier.degree | MSc in Business Statistics | en_US |
dc.identifier.department | Department of Mathematics | en_US |
dc.identifier.faculty | Engineering | en_US |
dc.identifier.uri | http://dl.lib.mrt.ac.lk/handle/123/12590 | |
dc.language.iso | en | en_US |
dc.subject | Binary logistic regression | en_US |
dc.subject | BMI | |
dc.subject | Five-fold cross validation | |
dc.subject | C-statistic | |
dc.subject | Glucose level | |
dc.subject | Optimism | |
dc.subject | Pedigree | |
dc.subject | Point-biserial correlation | |
dc.subject | Risk factors | |
dc.subject | Type 2 diabetes | |
dc.title | Modelling the risk for type 2 diabetes using logistic regression approach | en_US |
dc.type | Thesis-Full-text | en_US |