Modelling Website User Behaviors by Combining the EM and DBSCAN Algorithms

dc.contributor.authorUdantha, M
dc.contributor.authorRanathunga, S
dc.contributor.authorDias, G
dc.date.accessioned2017-01-17T10:05:56Z
dc.date.available2017-01-17T10:05:56Z
dc.description.abstractWeb logs can provide a wealth of information on user access patterns of a corresponding website, when they are properly analyzed. However, finding interesting patterns hidden in the low-level log data is non-trivial due to large log volumes, and the distribution of the log files in cluster environments. This paper presents a novel technique, the application of Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Expectation Maximization (EM) algorithms in an iterative manner for clustering web user sessions. Each cluster corresponds to one or more web user activities. The unique user access pattern of each cluster is identified by frequent pattern mining and sequential pattern mining techniques. When compared with the clustering output of EM, DBSCAN, and k-means algorithms, this technique shows better accuracy in web session mining, and it is more effective in identifying cluster changes with time. We demonstrate that the implemented system is capable of not only identifying common user behaviors, but also of identifying cyber-attacks.en_US
dc.identifier.conferenceMoratuwa Engineering Research Conference (MERCon) 2016en_US
dc.identifier.emailgihan@uom.lken_US
dc.identifier.facultyEngineeringen_US
dc.identifier.pgnos168-173en_US
dc.identifier.urihttp://dl.lib.mrt.ac.lk/handle/123/12247
dc.identifier.year2016en_US
dc.relation.urihttps://www.researchgate.net/profile/Madhuka_Udantha/publication/303688166_Modelling_website_user_behaviors_by_combining_the_EM_and_DBSCAN_algorithms/links/574f214708ae1880a8211a4b.pdfen_US
dc.source.urihttps://www.researchgate.net/profile/Madhuka_Udantha/publication/303688166_Modelling_website_user_behaviors_by_combining_the_EM_and_DBSCAN_algorithms/links/574f214708ae1880a8211a4b.pdfen_US
dc.titleModelling Website User Behaviors by Combining the EM and DBSCAN Algorithmsen_US
dc.typeConference-Abstracten_US

Files