Mathematical modelling of hidden layer architecture in artificial neural networks

dc.contributor.advisor: Karunananda A
dc.contributor.author: Wagarachchi NM
dc.date.accept: 2019
dc.date.accessioned: 2019
dc.date.available: 2019
dc.date.issued: 2019
dc.description.abstract: The performance of an Artificial Neural Network (ANN) strongly depends on its hidden layer architecture. A solution generated by an ANN carries no guarantee that it has been devised with the simplest network architecture suitable for modelling the particular problem. This results in computational complexity in training the ANN and in deploying and using the trained network. Therefore, modelling the hidden layer architecture of an ANN remains a research challenge. This thesis presents a theoretically based approach to pruning the hidden layers of trained artificial neural networks, ensuring that the simpler network performs as well as or better than the original network. The method described in the thesis is inspired by the finding from neuroscience that, although the human brain has a neural network of nearly 100 billion neurons, our activities are performed by much simpler neural networks with far fewer neurons. Furthermore, in biological neural networks, neurons that do not contribute significantly to the performance of the network are naturally disregarded. According to neuroplasticity, biological neural networks can also recruit neurons in the proximity of the active neural network to improve its performance. By the same token, it is hypothesized that, for a given complex trained ANN, we can discover an ANN that is much simpler than the original architecture. This research has developed a theory to remove a certain number of hidden layers and to eliminate non-contributing neurons from the remaining hidden layers of a given ANN architecture. The procedure begins with a complex neural network architecture trained with the backpropagation algorithm and reaches the optimum solution in two phases. First, the number of hidden layers is determined using a peak search algorithm developed in this research. The resulting simpler network, with fewer hidden layers and the highest generalization power, is then considered for pruning of its hidden neurons. The pruning of neurons in the hidden layers has been theorized by identifying the neurons that contribute least to the network's performance. These neurons are identified by detecting their correlations with the minimization of the training error. Experiments have shown that the simplified network architecture generated by this approach exhibits the same or better performance compared with the original large network architecture. In general, it removes more than 80% of the neurons while increasing generalization by about 30%. As such, the proposed approach can be used to discover a simple network architecture corresponding to a given complex ANN architecture. Due to its architectural simplicity, the new architecture is computationally efficient in training, usage, and further training.
dc.identifier.accno: TH4062
dc.identifier.citation: Wagarachchi, N.M. (2019). Mathematical modelling of hidden layer architecture in artificial neural networks [Doctoral dissertation, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.mrt.ac.lk/handle/123/15864
dc.identifier.degree: Doctor of Philosophy
dc.identifier.department: Department of Computational Mathematics
dc.identifier.faculty: IT
dc.identifier.uri: http://dl.lib.mrt.ac.lk/handle/123/15864
dc.language.iso: en
dc.subject: COMPUTATIONAL MATHEMATICS-Dissertations
dc.subject: ARTIFICIAL INTELLIGENCE
dc.subject: NEURAL NETWORKS
dc.subject: ARTIFICIAL NEURAL NETWORKS
dc.subject: NEUROPLASTICITY
dc.title: Mathematical modelling of hidden layer architecture in artificial neural networks
dc.type: Thesis-Full-text
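
The abstract above outlines a two-phase procedure: a peak search over the number of hidden layers, followed by pruning of hidden neurons whose activations correlate least with the minimization of the training error. The sketch below is a minimal, hypothetical illustration of that idea in Python/NumPy; the function names, the train_and_score callback, and the keep_ratio parameter are assumptions made for illustration, not the thesis's actual implementation.

import numpy as np

def peak_search_depth(train_and_score, max_layers):
    # Phase 1 (illustrative): try increasing hidden-layer counts and keep the
    # depth whose validation score peaks. train_and_score(depth) is assumed to
    # train a network with `depth` hidden layers via backpropagation and return
    # its validation accuracy.
    best_depth, best_score = 1, float("-inf")
    for depth in range(1, max_layers + 1):
        score = train_and_score(depth)
        if score > best_score:
            best_depth, best_score = depth, score
        elif score < best_score:
            break  # score has passed its peak; deeper candidates are skipped
    return best_depth

def prune_low_correlation_neurons(activations, errors, keep_ratio=0.2):
    # Phase 2 (illustrative): rank hidden neurons by the absolute correlation
    # between their activations and the per-sample training error, keeping only
    # the strongest fraction (keep_ratio ~ 0.2 mirrors the reported removal of
    # more than 80% of neurons).
    # activations: (n_samples, n_neurons) hidden-layer outputs
    # errors:      (n_samples,) per-sample training errors
    n_neurons = activations.shape[1]
    corrs = np.array([
        abs(np.corrcoef(activations[:, j], errors)[0, 1])
        for j in range(n_neurons)
    ])
    corrs = np.nan_to_num(corrs)                  # constant neurons score 0
    n_keep = max(1, int(keep_ratio * n_neurons))  # e.g. keep roughly 20%
    return np.argsort(corrs)[::-1][:n_keep]       # indices of neurons to keep

In this sketch the surviving neuron indices would be used to rebuild a smaller network for retraining or fine-tuning; the thesis's actual peak search algorithm and pruning criterion are defined formally in the full text (TH4062.pdf).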

Files

Original bundle

Name: TH4062-1.pdf
Size: 651.88 KB
Format: Adobe Portable Document Format
Description: Pre-text

Name: TH4062-2.pdf
Size: 290.27 KB
Format: Adobe Portable Document Format
Description: Post-text

Name: TH4062.pdf
Size: 3.92 MB
Format: Adobe Portable Document Format
Description: Full-thesis

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission