Document analysis based automatic concept map generation for enterprises

dc.contributor.authorHerath, HMTC
dc.contributor.authorFernando, KNJ
dc.contributor.authorKarannagoda, EL
dc.contributor.authorKarunarathne, WMID
dc.contributor.authorDe Silva, NHN
dc.contributor.authorPerera, AS
dc.contributor.editorRodrigo, R
dc.date.accessioned2022-12-22T06:36:16Z
dc.date.available2022-12-22T06:36:16Z
dc.date.issued2013-02
dc.description.abstractEver growing knowledge bases of enterprises present the demanding challenge of proper organization of information that would enable fast retrieval of related and intended information. Document repositories of enterprises consist of large collections of documents of varying size, format and writing styles. This diversified and unstructured nature of documents restrict the possibilities of developing uniform techniques for extracting important concepts and relationships for summarization, structured representation and fast retrieval. The documented textual content is used as the input for the construction of this concept map. Here a rule based approach is used to extract concepts and relationships among them. Sentence level breakdown enables these ndes to identify’ those concepts and relationships. These rules are based on elements in a phase structure tree of a sentence. For improving accuracy and the relevance of the extracted concepts and relationships, the special features such as titles, bold and upper case texts are used. This paper discusses how to overcome these challenges by utilizing high level natural language processing techniques, document preprocessing techniques and developing easily understandable and extractable compact representation of concept maps. Each document in the repository is converted to a concept map representation to capture concepts and relationships among concepts described in the said document. This organization would represent a summary> of the document. These individual concept maps are utilized to generate concept maps that represent sections of the repository> or the entire document repository’. This paper discusses how the statistical techniques used to calculate certain metrics which facilitate certain requirements of the solution. Principle component analysis is used in ranking the documents by importance. The concept map is visualized using force directed type graphs which represent concepts by nodes and relationships by edges.en_US
dc.identifier.citation*****en_US
dc.identifier.conferenceNational Engineering Conference Engineering Research For Nation Buildingen_US
dc.identifier.departmentEngineering Research Unit, University of Moratuwaen_US
dc.identifier.facultyEngineeringen_US
dc.identifier.pgnospp. 68-73en_US
dc.identifier.placeKatubeddaen_US
dc.identifier.proceedingProceedings of the 18th Annual Research Symposiumen_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/19866
dc.identifier.year2013en_US
dc.language.isoenen_US
dc.publisherThe Engineering Research Unit, University of Moratuwaen_US
dc.subjectNatural Language Processingen_US
dc.subjectConcept Mapen_US
dc.subjectConcepts/Relationships Extractionen_US
dc.titleDocument analysis based automatic concept map generation for enterprisesen_US
dc.typeConference-Full-texten_US

Files

Collections