Document analysis based automatic concept map generation for enterprises

Karannagoda, EL; Herath, HMTC; Fernando, KNJ; Karunarathne, MWID; De Silva, NHND; Perera, AS

UoM IR
→
Research Publications
→
Conference Proceedings
→
Workshops, Seminars, Symposiums & Conferences
→
Workshops, Seminars, Symposiums & Conferences
→
View Item

dc.contributor.author	Karannagoda, EL
dc.contributor.author	Herath, HMTC
dc.contributor.author	Fernando, KNJ
dc.contributor.author	Karunarathne, MWID
dc.contributor.author	De Silva, NHND
dc.contributor.author	Perera, AS
dc.date.accessioned	2014-06-25T15:56:27Z
dc.date.available	2014-06-25T15:56:27Z
dc.date.issued	2014-06-25
dc.identifier.uri	http://dl.lib.mrt.ac.lk/handle/123/10091
dc.description.abstract	Ever growing knowledge bases of enterprises present the demanding challenge of proper organization of information that would enable fast retrieval of related and intended information. Document repositories of enterprises consist of large collections of documents of varying size, format and writing styles. This diversified and unstructured nature of documents restrict the possibilities of developing uniform techniques for extracting important concepts and relationships for summarization, structured representation and fast retrieval. The documented textual content is used as the input for the construction ofa concept map. Here a rule based approach is used to extract concepts and relationships among them. Sentence level breakdown enables these rules to identify those concepts and relationships. These rules are based on elements in a phase structure tree of a sentence. For improving accuracy and the relevance of the extracted concepts and relationships, the special features such as titles, bold and upper case texts are used. This paper discusses how to overcome the above mentioned challenges by utilizing high level natural language processing techniques, document pre-processing techniques and developing easily understandable and extractable compact representation of concept maps. Each document in the repository is converted to a concept map representation to capture concepts and relationships among concepts described in the said document. This organization would represent a summary of the document. These individual concept maps are utilized to generate concept maps that represent sections of the repository or the entire document repository. This paper discusses how statistical techniques are used to calculate certain metrics which are used to facilitate certain requirements of the solution. Principle component analysis is used in ranking the documents by importance. The concept map is visualized using force directed type graphs which represent concepts by nodes and relationships by edges.	en_US
dc.language.iso	en	en_US
dc.source.uri	http://www.icter.org/conference/ICTer2013-home	en_US
dc.title	Document analysis based automatic concept map generation for enterprises	en_US
dc.type	Conference-Abstract	en_US
dc.identifier.faculty	Engineering	en_US
dc.identifier.department	Department of Computer Science and Engineering	en_US
dc.identifier.year	2013	en_US
dc.identifier.conference	International Conference on Advances in ICT for Emerging Regions, ICTer 2013	en_US
dc.identifier.place	Colombo	en_US
dc.identifier.pgnos	pp. 154-159	en_US
dc.identifier.email	shehan@uom.lk	en_US

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Workshops, Seminars, Symposiums & Conferences [1498]

Show simple item record

Search UoM-IR

Browse

All of UoM-IR
This Collection
- Authors
- Titles
- Subjects
- Faculty
- Acc. No.
- Document Type
- Year
- Conference Proceedings

Document analysis based automatic concept map generation for enterprises

Files in this item

This item appears in the following Collection(s)

Search UoM-IR

Browse

All of UoM-IR

This Collection

My Account