dc.contributor.advisor |
Perera AS |
|
dc.contributor.author |
Fernando ML |
|
dc.date.accessioned |
2019 |
|
dc.date.available |
2019 |
|
dc.date.issued |
2019 |
|
dc.identifier.citation |
Fernando, M.L. (2019). Spatio temporal forecasting of dengue outbreaks using machine learning [Master’s theses, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.mrt.ac.lk/handle/123/15856 |
|
dc.identifier.uri |
http://dl.lib.mrt.ac.lk/handle/123/15856 |
|
dc.description.abstract |
Dengue is one of the most critical public health concerns in Sri Lanka which imposes
a severe economic and welfare burden on the nation annually. Prior work has shown
that there are multiple factors that contribute to propagation of dengue, including
sociological factors such as human mobility. Therefore, it is a non-trivial task to
model the propagation of this disease accurately at a regional level. However, accurate
quantitative modeling approaches that can predict dengue incidence for a public health
administrative division would be invaluable in allocating valuable public health resources
and preventing sudden disease outbreaks.
In this study, we make use of large-scale pseudonymized call detail records of
approximately 10 million mobile phone subscribers to derive human mobility patterns
that can contribute towards disease propagation. We develop 3 distinct proxy indicators
for human mobility based on different assumptions and evaluate the suitability of each
indicator to accurately model the disease transmission dynamics of dengue. Using the
proxy measures developed by us, we go on to show that human mobility has a significant
impact on the disease incidence at a regional level, even if the disease is already endemic
to a given region.
Combining these proxy mobility indicators with other climatic factors that is known
to affect dengue incidence, we build multiple predictive models using different machine
learning methods to predict dengue incidence 2 weeks ahead of time for a given MOH
division. By introducing an automated input feature selection method based on genetic
algorithms, we show that we are able to improve the predictive accuracy of our models
significantly, with predictive models based on XGBoost yielding the best performance,
with an R2 of 0.935 and RMSE of 7.688. |
en_US |
dc.language.iso |
en |
en_US |
dc.subject |
COMPUTER SCIENCE AND ENGINEERING-Dissertations |
en_US |
dc.subject |
BIG DATA |
en_US |
dc.subject |
MACHINE LEARNING |
en_US |
dc.subject |
MOBILE COMMUNICATION |
en_US |
dc.subject |
DISEASES-Outbreak-Forecasting |
en_US |
dc.subject |
DISEASES-Sri Lanka |
en_US |
dc.title |
Spatio temporal forecasting of dengue outbreaks using machine learning |
en_US |
dc.type |
Thesis-Full-text |
en_US |
dc.identifier.faculty |
Engineering |
en_US |
dc.identifier.degree |
MSc in Computer Science and Engineering by research |
en_US |
dc.identifier.department |
Department of Computer Science & Engineering |
en_US |
dc.date.accept |
2019 |
|
dc.identifier.accno |
TH4058 |
en_US |