Automatic generation of research paper abstracts using deep-hybrid models

dc.contributor.advisorDe Silva, N
dc.contributor.authorKumarasinghe, RPD
dc.date.accept2025
dc.date.accessioned2026-02-12T05:07:18Z
dc.date.issued2025
dc.description.abstractCondensing important information into a summary is crucial for readers navigating lengthy documents. In the context of research papers, the abstract serves as a concise overview of the study. This thesis focuses on enhancing research paper summarization by introducing a novel section-wise relevance matrix. To address the token size limitations of Large Language Models (LLMs) , such as GPT-Neo, we developed a two-fold approach. First, we employed extractive summarization to condense lengthy texts into key sentences, followed by the application of abstractive summarization to generate coherent and concise summaries from these extracts. Our approach, combining both extractive and abstractive techniques, leverages section-wise involvement ratios, with particular attention to the abstract section, improving the accuracy and quality of generated summaries. We introduced a pioneering dataset of research papers organized into sections, which plays a crucial role in this summarization process. Experimental results demonstrated that our method produces high-quality summaries while effectively overcoming token limitations, offering significant potential for summarizing long documents in low-resource and cost-effective environments. However, challenges arise when section-wise segmentation is unclear, impacting the accuracy of summaries. This research underscores the need for further refinements and offers a promising framework for enhancing summarization techniques, benefiting researchers, educators, and information seekers alike.
dc.identifier.accnoTH6011
dc.identifier.citationKumarasinghe, R.P.D. (2025). Automatic generation of research paper abstracts using deep-hybrid models [Master's theses, University of Moratuwa]. Institutional Repository University of Moratuwa. https://dl.lib.uom.lk/handle/123/24847
dc.identifier.degreeMSc in Computer Science
dc.identifier.departmentDepartment of Computer Science & Engineering
dc.identifier.facultyEngineering
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/24847
dc.language.isoen
dc.subjectNATUARL LANGUAGE PROCESSING
dc.subjectAUTOMATIC TEXT SUMMARIZATION
dc.subjectTEXT GENERATION
dc.subjectLARGE LANGUAGE MODELS
dc.subjectDOCUMENTATION-Abstracts
dc.subjectRESEARCH DISSEMINATION
dc.subjectCOMPUTER SCIENCE-Dissertation
dc.subjectCOMPUTER SCIENCE AND ENGINEERING-Dissertation
dc.subjectMSc in Computer Science
dc.titleAutomatic generation of research paper abstracts using deep-hybrid models
dc.typeThesis-Full-text

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
TH6011-1.pdf
Size:
78.89 KB
Format:
Adobe Portable Document Format
Description:
Pre-text
Loading...
Thumbnail Image
Name:
TH6011-2.pdf
Size:
50.19 KB
Format:
Adobe Portable Document Format
Description:
Post-text
Loading...
Thumbnail Image
Name:
TH6011.pdf
Size:
786.03 KB
Format:
Adobe Portable Document Format
Description:
Full-thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: