Comparative Study of Surrogate Techniques for CNN Hyperparameter Optimization

Mohd Aszemi, Nurshazlyn and M. Zakaria, Nordin and Paneer Selvam, Dhanapal Durai Dominic (2022) Comparative Study of Surrogate Techniques for CNN Hyperparameter Optimization. In: New Frontiers in Communication and Intelligent Systems. Computing & Intelligent Systems, SCRS, India, pp. 463-473. ISBN 978-81-95502-00-4

[thumbnail of Comparative Study of Surrogate Techniques for CNN Hyperparameter Optimization.pdf]
Preview
Text
Comparative Study of Surrogate Techniques for CNN Hyperparameter Optimization.pdf - Published Version

Download (575kB) | Preview
Official URL: https://www.publications.scrs.in/chapter/978-81-95...

Abstract

Optimizing hyper parameters in Convolutional Neural networks is a tedious process for many researchers and practitioners. It requires a high degree of expertise or experience to optimise the hyper parameters, and manual optimisation is likely to be biased. To date, methods or approaches to automate hyper parameter optimization include grid search, random search, and Genetic Algorithms (GAs). However, evaluating large number of sample points in the hyperparameter configuration space, as is typically required by these methods, is computationally expensive process. Hence, the objective of this paper is to explore regression as a surrogate technique in CNN hyperparameter optimisation. Performance in terms of accuracy, error rate, training time and coefficient of determination (R2) are evaluated and recorded. Although there is no significant performance difference between the resulting optimized Deep Learning and state-of-the-art on CIFAR-10 datasets, using
regression as a surrogate technique for CNN hyperparameter optimization contributes to minimising the time taken for the optimization process, a benefit which has not been fully explored in the literature to the best of the author’s knowledge.

Item Type: Book Section
Subjects: T Technology > T Technology (General)
Departments / MOR / COE: Sciences and Information Technology > Computer and Information Sciences
Depositing User: Ms Nurshazlyn Mohd Aszemi
Date Deposited: 15 May 2023 07:44
Last Modified: 15 May 2023 07:44
URI: http://utpedia.utp.edu.my/id/eprint/24082

Actions (login required)

View Item
View Item