Show simple item record

dc.contributor.advisor Perera AS
dc.contributor.author Fernando ML
dc.date.accessioned 2019
dc.date.available 2019
dc.date.issued 2019
dc.identifier.citation Fernando, M.L. (2019). Spatio temporal forecasting of dengue outbreaks using machine learning [Master’s theses, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.mrt.ac.lk/handle/123/15856
dc.identifier.uri http://dl.lib.mrt.ac.lk/handle/123/15856
dc.description.abstract Dengue is one of the most critical public health concerns in Sri Lanka which imposes a severe economic and welfare burden on the nation annually. Prior work has shown that there are multiple factors that contribute to propagation of dengue, including sociological factors such as human mobility. Therefore, it is a non-trivial task to model the propagation of this disease accurately at a regional level. However, accurate quantitative modeling approaches that can predict dengue incidence for a public health administrative division would be invaluable in allocating valuable public health resources and preventing sudden disease outbreaks. In this study, we make use of large-scale pseudonymized call detail records of approximately 10 million mobile phone subscribers to derive human mobility patterns that can contribute towards disease propagation. We develop 3 distinct proxy indicators for human mobility based on different assumptions and evaluate the suitability of each indicator to accurately model the disease transmission dynamics of dengue. Using the proxy measures developed by us, we go on to show that human mobility has a significant impact on the disease incidence at a regional level, even if the disease is already endemic to a given region. Combining these proxy mobility indicators with other climatic factors that is known to affect dengue incidence, we build multiple predictive models using different machine learning methods to predict dengue incidence 2 weeks ahead of time for a given MOH division. By introducing an automated input feature selection method based on genetic algorithms, we show that we are able to improve the predictive accuracy of our models significantly, with predictive models based on XGBoost yielding the best performance, with an R2 of 0.935 and RMSE of 7.688. en_US
dc.language.iso en en_US
dc.subject COMPUTER SCIENCE AND ENGINEERING-Dissertations en_US
dc.subject BIG DATA en_US
dc.subject MACHINE LEARNING en_US
dc.subject MOBILE COMMUNICATION en_US
dc.subject DISEASES-Outbreak-Forecasting en_US
dc.subject DISEASES-Sri Lanka en_US
dc.title Spatio temporal forecasting of dengue outbreaks using machine learning en_US
dc.type Thesis-Full-text en_US
dc.identifier.faculty Engineering en_US
dc.identifier.degree MSc in Computer Science and Engineering by research en_US
dc.identifier.department Department of Computer Science & Engineering en_US
dc.date.accept 2019
dc.identifier.accno TH4058 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record