STHITHIKA cost-based query distribution with query re-writing distributed complex event processing systems

dc.contributor.advisor: Ranathunga S
dc.contributor.author: Dilrukshi IMT
dc.date.accept: 2019
dc.date.accessioned: 2019
dc.date.available: 2019
dc.date.issued: 2019
dc.description.abstract: Complex event processing (CEP) is useful for analyzing event streams and identifying patterns in them. Because of the distributed nature of existing applications, the high volume of events they generate, and the complexity of queries, a single-node CEP deployment becomes problematic. One way to overcome this is to introduce multiple CEP nodes and distribute the queries among them for load balancing. However, due to the stateful nature of events, distributing queries among CEP nodes is not an easy task; query distribution across CEP nodes is an NP-hard problem. This research focuses on optimally processing a large number of different event streams with a large number of CEP queries in a distributed manner. Query processing and distribution are optimized in two ways: individual queries are optimized through query rewriting, and query distribution across multiple nodes is optimized by introducing new factors into the query distribution algorithm. The factors considered for query distribution are the cost of individual queries, the number of event streams common to queries, the CPU and memory utilization of the nodes that run CEP queries, the type of queries, and the number of queries on each node. Usability is improved in two ways: standardizing communication by introducing JSON messages, and integrating the Firebase messaging service to standardize the event source. Experiments show that with these optimizations STHITHIKA provides higher system throughput than existing systems, without an adverse impact on event duplication or on the variance of processing load across nodes. It can handle a higher number of queries than existing systems and is more robust to event bursts. Because of the changes to query distribution and rewriting, the time taken for initial query distribution has increased. The usability improvements enable easy integration with other technologies and decouple the event source from the system. (en_US)
dc.identifier.accno: TH4068 (en_US)
dc.identifier.citation: Dilrukshi, I.M.T. (2019). STHITHIKA cost-based query distribution with query re-writing distributed complex event processing systems [Master’s thesis, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.mrt.ac.lk/handle/123/15984
dc.identifier.degree: MSc in Computer Science (en_US)
dc.identifier.department: Department of Computer Science & Engineering (en_US)
dc.identifier.faculty: Engineering (en_US)
dc.identifier.uri: http://dl.lib.mrt.ac.lk/handle/123/15984
dc.language.iso: en (en_US)
dc.subject: COMPUTER SCIENCE AND ENGINEERING-Dissertations (en_US)
dc.subject: COMPUTER SCIENCE-Dissertations (en_US)
dc.subject: COMPLEX EVENT PROCESSING SYSTEMS (en_US)
dc.subject: COMPLEX EVENT PROCESSING ENGINES (en_US)
dc.subject: OPTIMIZATION (en_US)
dc.title: STHITHIKA cost-based query distribution with query re-writing distributed complex event processing systems (en_US)
dc.type: Thesis-Full-text (en_US)

Files

Original bundle

Name: TH4068-1.pdf
Size: 283.66 KB
Format: Adobe Portable Document Format
Description: Pre-text

Name: TH4068-2.pdf
Size: 241 KB
Format: Adobe Portable Document Format
Description: Post-text

Name: TH4068.pdf
Size: 1.84 MB
Format: Adobe Portable Document Format
Description: Full-thesis