Bias lens: systemic bias detection with explainable analysis
| dc.contributor.author | Bhavaneetharan, L | |
| dc.contributor.author | Jawwadh, S | |
| dc.contributor.editor | Gunawardena, S | |
| dc.date.accessioned | 2025-11-24T05:45:57Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Natural Language Processing (NLP) models have revolutionized the way information is processed, yet they often inherit and even amplify societal biases present in training data [3]. These biases can lead to unfair, discriminatory outcomes in high-stakes applications. Traditional bias detection methods generally rely on binary classification, which oversimplifies the complex and nuanced nature of biased language. In response, we propose Bias Lens—a comprehensive framework that combines fine-grained multi-label token classification with a suite of Explainable AI (XAI) methods to not only detect but also elucidate bias in text. Our approach leverages advanced techniques, most notably Integrated Gap Gradients (IG2), to provide detailed neuron-level attributions, while also incorporating complementary methods (LIME, SHAP, DeepLIFT, Attention, Counterfactual, LRP, Occlusion, and Saliency) to create a multifaceted explanation ecosystem. This dual capability enhances transparency and facilitates trust among users such as content moderators and AI auditors. | |
| dc.identifier.conference | Applied Data Science & Artificial Intelligence (ADScAI) Symposium 2025 | |
| dc.identifier.department | Department of Computer Science & Engineering | |
| dc.identifier.doi | https://doi.org/10.31705/ADScAI.2025.17 | |
| dc.identifier.email | loganthan.20201212@iit.ac.lk | |
| dc.identifier.email | saadh.j@iit.ac.lk | |
| dc.identifier.faculty | Engineering | |
| dc.identifier.place | Moratuwa, Sri Lanka | |
| dc.identifier.proceeding | Proceedings of Applied Data Science & Artificial Intelligence Symposium 2025 | |
| dc.identifier.uri | https://dl.lib.uom.lk/handle/123/24453 | |
| dc.language.iso | en | |
| dc.publisher | Department of Computer Science and Engineering | |
| dc.subject | Bias Detection | |
| dc.subject | Explainable AI | |
| dc.subject | Fairness | |
| dc.subject | Multilabel Classification | |
| dc.subject | Natural Language Processing | |
| dc.title | Bias lens: systemic bias detection with explainable analysis | |
| dc.type | Conference-Extended-Abstract |
