Siamese networks for multilingual classified ad matching
| dc.contributor.author | Chathunka, P | |
| dc.contributor.author | Alwis, D | |
| dc.contributor.author | Rawishanka, R | |
| dc.contributor.author | Perera, I | |
| dc.date.accessioned | 2025-12-08T04:39:44Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | This paper presents a novel approach to semantically match “Resource Wanted” and “Resource Offering” classified ads within Sri Lanka’s complex multilingual digital marketplace. We introduce a Siamese neural network architecture specifically designed to effectively process both textual content and categorical metadata across English and Sinhala languages. Our model leverages advanced multilingual transformer models to create semantically rich embeddings, with a LaBSE-based implementation achieving superior performance, reaching a Recall@1 of 0.5813 and a Recall@10 of 0.9151. Crucially, the integration of categorical features with text embeddings yielded the best results, demonstrating a 1.5% improvement in Recall@1 over the text-only approach. Our methodology addresses the significant challenge of matching ads across linguistic boundaries in a low-resource setting, providing a method that can significantly improve transaction efficiency in Sri Lanka’s diverse digital marketplace. | |
| dc.identifier.conference | Moratuwa Engineering Research Conference 2025 | |
| dc.identifier.department | Engineering Research Unit, University of Moratuwa | |
| dc.identifier.email | pamudu.20@cse.mrt.ac.lk | |
| dc.identifier.email | damikaa.20@cse.mrt.ac.lk | |
| dc.identifier.email | rasindu.20@cse.mrt.ac.lk | |
| dc.identifier.email | indika@cse.mrt.ac.lk | |
| dc.identifier.faculty | Engineering | |
| dc.identifier.isbn | 979-8-3315-6724-8 | |
| dc.identifier.pgnos | pp. 758-763 | |
| dc.identifier.proceeding | Proceedings of Moratuwa Engineering Research Conference 2025 | |
| dc.identifier.uri | https://dl.lib.uom.lk/handle/123/24521 | |
| dc.language.iso | en | |
| dc.publisher | IEEE | |
| dc.subject | multilingual classified ads | |
| dc.subject | semantic matching | |
| dc.subject | LaBSE | |
| dc.subject | Siamese network | |
| dc.subject | ad classification | |
| dc.title | Siamese networks for multilingual classified ad matching | |
| dc.type | Conference-Full-text |
