Multimodal Search Exploration for E-Commerce

dc.contributor.authorRatnalingam, G
dc.contributor.authorJalangan, M
dc.contributor.editorAthuraliya, CD
dc.date.accessioned2025-11-20T09:04:38Z
dc.date.issued2025
dc.description.abstractThe landscape of electronic commerce search has been changing drastically in a constant manner with an exponentially growing number of products, user-generated content and complex consumer behaviour patterns [2]. These aspects have made e-commerce search a challenging problem in order to provide accurate, relevant, and personalized search results. The challenges in e-commerce search are complex where the misalignment between visual and textual modalities in multimodal search systems may lead to poor search experiences, especially when users submit detailed, ambiguous or more natural queries [2]. This research addresses these issues by introducing an integrated approach that fuses text and image data within a unified space utilizing the ColPali mechanismwith late interaction for seamless multimodal alignment. The existing search systems are moving towards the conversational search or chatbots which uses the natural queries like “Black Jacket” or “Black Jacket with fleets” and through this multimodal system proposed, the specific and compelling use case of using VLM comes into the effect where a user can upload an image of a specific piece of clothing while asking a query as “Find me some similar jackets in black with fleets and a waterproof lining”. Hence the traditional systems will still struggle to interpret such queries holistically.
dc.identifier.conferenceApplied Data Science & Artificial Intelligence (ADScAI) Symposium 2025
dc.identifier.departmentDepartment of Computer Science & Engineering
dc.identifier.doihttps://doi.org/10.31705/ADScAI.2025.38
dc.identifier.emailgajanhcc007@gmail.com
dc.identifier.emailmithushan.j@iit.ac.lk
dc.identifier.facultyEngineering
dc.identifier.placeMoratuwa, Sri Lanka
dc.identifier.proceedingProceedings of Applied Data Science & Artificial Intelligence Symposium 2025
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/24416
dc.language.isoen
dc.publisherDepartment of Computer Science and Engineering
dc.subjectproduct search(PS)
dc.subjectvision-language model(VLM)
dc.subjectcontextual embeddings(CE)
dc.subjectinformation retrieval(IR)
dc.titleMultimodal Search Exploration for E-Commerce
dc.typeConference-Extended-Abstract

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Paper 38 - ADScAI 2025.pdf
Size:
268.3 KB
Format:
Adobe Portable Document Format

Collections