An Adapter Architecture for heterogeneous data processing in bioinformatics pipelines

dc.contributor.authorLenadora, D
dc.contributor.authorWickramarachchi, A
dc.contributor.authorMeedeniya, D
dc.contributor.authorMallawaarachchi, V
dc.contributor.authorPerera, L
dc.date.accessioned2019-09-03T04:03:21Z
dc.date.available2019-09-03T04:03:21Z
dc.description.abstractBioinformatics is a growing field focused on both the domains of computer science and biology. A range of bioinformatics data processing tools exists at present, which takes inputs and produces outputs in varying formats depending on the algorithms and processes being used. The undesirable situation where such processes would produce outputs that may not allow the pipelining of other processes, calls for a generic bioinformatics data format converter. Though such converters currently exist, most of them are limited to text conversions and provide limited functionality. In addition, such functions have the potential capability of supporting parallelism to increase the overall throughput. A solution that can provide the said conversion functions as well as utility functions, while processing with a high throughput via parallelism is proposed through this paper. A utility function of this system requires storing bioinformatics data locally. In addition to facilitating this, an average compression rate of 26% achieved in data storage. Evaluation of the system using a set of 7,000,000 gene data showed the maximum time consumption for retrieval as 400ms.en_US
dc.identifier.conferenceMoratuwa Engineering Research Conference - MERCon 2019en_US
dc.identifier.departmentDepartment of Computer Science and Engineeringen_US
dc.identifier.facultyEngineeringen_US
dc.identifier.placeMoraruwa, Sri Lankaen_US
dc.identifier.urihttp://dl.lib.mrt.ac.lk/handle/123/14933
dc.identifier.year2019en_US
dc.language.isoenen_US
dc.subjectBioinformaticsen_US
dc.subjectData format conversionen_US
dc.subjectPipelinesen_US
dc.subjectAdapter architectureen_US
dc.titleAn Adapter Architecture for heterogeneous data processing in bioinformatics pipelinesen_US
dc.typeConference-Abstracten_US

Files

Collections