Privacy preserving data publishing framework for unstructured textual social media data

dc.contributor.advisor: Uthyasanker T
dc.contributor.author: Abeywardana PBPA
dc.date.accept: 2020
dc.date.accessioned: 2020
dc.date.available: 2020
dc.date.issued: 2020
dc.description.abstract: Privacy has become an essential part of data science and analytics because of the potential for misuse of personal data. As a result of privacy breaches reported in various analytical studies, privacy preservation has become a legal responsibility rather than a simple social responsibility. Preserving the privacy of unstructured data is more challenging than preserving that of structured data. Social media platforms have become widely popular over the past couple of decades and are pumping huge amounts of data at high velocity into analytical systems. Social media profiles contain a wealth of personal and sensitive information, creating enormous opportunities for third parties to analyze them with different algorithms, draw conclusions, and use those conclusions in disinformation campaigns and micro-targeting-based dark advertising. The primary goal of this study is to provide a mitigation mechanism for privacy breaches that occur through disinformation campaigns built on insights extracted from the analysis of personal or sensitive data. Specifically, this research aims to build a privacy-preserving data publishing framework for unstructured, textual social media data without compromising the true analytical value of that data. A novel way of applying traditional structured privacy-preserving techniques to unstructured data is proposed. Creating a comprehensive Twitter corpus annotated with privacy attributes is another objective of this research, especially because the research community lacks one. An easily extensible framework that can be adopted by many domains is implemented here, integrating different concepts from the literature. A comprehensive set of experiments is also performed to assess the capabilities of the machine learning models and algorithms, as well as to simulate some real-world privacy-preserving data publishing use cases. [en_US]
dc.identifier.accno: TH4287 [en_US]
dc.identifier.degree: MSc in Computer Science and Engineering [en_US]
dc.identifier.department: Department of Computer Science and Engineering [en_US]
dc.identifier.faculty: Engineering [en_US]
dc.identifier.uri: http://dl.lib.uom.lk/handle/123/16486
dc.language.iso: en [en_US]
dc.subject: COMPUTER SCIENCE - Dissertation [en_US]
dc.subject: COMPUTER SCIENCE & ENGINEERING - Dissertation [en_US]
dc.subject: SOCIAL MEDIA - Twitter [en_US]
dc.subject: UNSTRUCTURED TEXTUAL DATA [en_US]
dc.subject: DATA - Privacy [en_US]
dc.subject: TWITTER [en_US]
dc.title: Privacy preserving data publishing framework for unstructured textual social media data [en_US]
dc.type: Thesis-Full-text [en_US]

Files

Original bundle

Name:
TH4287-1.pdf
Size:
158.97 KB
Format:
Adobe Portable Document Format
Description:
Pre-text
Name:
TH4287-2.pdf
Size:
164.35 KB
Format:
Adobe Portable Document Format
Description:
Post-text
Name:
TH4287.pdf
Size:
1.28 MB
Format:
Adobe Portable Document Format
Description:
Full-thesis