Institutional-Repository, University of Moratuwa.  

Adversarial learning to improve question image embedding in medical visual question answering

Show simple item record

dc.contributor.author Silva, K
dc.contributor.author Maheepala, T
dc.contributor.author Tharaka, K
dc.contributor.author Ambegoda, TD
dc.contributor.editor Rathnayake, M
dc.contributor.editor Adhikariwatte, V
dc.contributor.editor Hemachandra, K
dc.date.accessioned 2022-10-27T08:24:41Z
dc.date.available 2022-10-27T08:24:41Z
dc.date.issued 2022-07
dc.identifier.citation K. Silva, T. Maheepala, K. Tharaka and T. D. Ambegoda, "Adversarial Learning to Improve Question Image Embedding in Medical Visual Question Answering," 2022 Moratuwa Engineering Research Conference (MERCon), 2022, pp. 1-6, doi: 10.1109/MERCon55799.2022.9906168. en_US
dc.identifier.uri http://dl.lib.uom.lk/handle/123/19266
dc.description.abstract Visual Question Answering (VQA) is a computer vision task in which a system produces an accurate answer to a given image and a question that is relevant to the image. Medical VQA can be considered as a subfield of general VQA, which focuses on images and questions in the medical domain. The VQA model’s most crucial task is to learn the question-image joint representation to reflect the information related to the correct answer. Medical VQA remains a difficult task due to the ineffectiveness of question-image embeddings, despite recent research on general VQA models finding significant progress. To address this problem, we propose a new method for training VQA models that utilizes adversarial learning to improve the question-image embedding and illustrate how this embedding can be used as the ideal embedding for answer inference. For adversarial learning, we use two embedding generators (question–image embedding and a question-answer embedding generator) and a discriminator to differentiate the two embeddings. The questionanswer embedding is used as the ideal embedding and the question-image embedding is improved in reference to that. The experiment results indicate that pre-training the question-image embedding generation module using adversarial learning improves overall performance, implying the effectiveness of the proposed method. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.relation.uri https://ieeexplore.ieee.org/document/9906168 en_US
dc.subject Medical visual question answering en_US
dc.subject Adversarial learning en_US
dc.title Adversarial learning to improve question image embedding in medical visual question answering en_US
dc.type Conference-Full-text en_US
dc.identifier.faculty Engineering en_US
dc.identifier.department Engineering Research Unit, University of Moratuwa en_US
dc.identifier.year 2022 en_US
dc.identifier.conference Moratuwa Engineering Research Conference 2022 en_US
dc.identifier.place Moratuwa, Sri Lanka en_US
dc.identifier.proceeding Proceedings of Moratuwa Engineering Research Conference 2022 en_US
dc.identifier.email silvamkc.17@cse.mrt.ac.lk
dc.identifier.email thanuja.17@cse.mrt.ac.lk
dc.identifier.email kasunt.17@cse.mrt.ac.lk
dc.identifier.email thanujaa@uom.lk
dc.identifier.doi 10.1109/MERCon55799.2022.9906168 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record