HyperEstimator: Evolving Computationally Efficient CNN Models with Grammatical Evolution

Gauri Vaidya, Luise Ilg, Meghana Kshirsagar, Enrique Naredo, Conor Ryan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Deep learning (DL) networks have the dual benefits due to over parameterization and regularization rendering them more accurate than conventional Machine Learning (ML) models. However, they consume massive amounts of resources in training and thus are computationally expensive. A single experimental run consumes a lot of computational resources, in such a way that it could cost millions of dollars thereby dramatically leading to massive project costs. Some of the factors for vast expenses for DL models can be attributed to the computational costs incurred during training, massive storage requirements, along with specialized hardware such as Graphical Processing Unit (GPUs). This research seeks to address some of the challenges mentioned above. Our approach, HyperEstimator, estimates the optimal values of hyperparameters for a given Convolutional Neural Networks (CNN) model and dataset using a suite of Machine Learning algorithms. Our approach consists of three stages: (i) obtaining candidate values for hyperparameters with Grammatical Evolution; (ii) prediction of optimal values of hyperparameters with supervised ML techniques; (iii) training CNN model for object detection. As a case study, the CNN models are validated by using a real-time video dataset representing road traffic captured in some Indian cities. The results are also compared against CIFAR10 and CIFAR100 benchmark datasets.

Original languageEnglish
Title of host publicationProceedings of the 19th International Conference on Smart Business Technologies, ICSBT 2022
EditorsFons Wijnhoven, Slimane Hammoudi, Marten van Sinderen
PublisherScience and Technology Publications, Lda
Pages57-68
Number of pages12
ISBN (Electronic)9789897585876
DOIs
Publication statusPublished - 2022
Event19th International Conference on Smart Business Technologies, ICSBT 2022 - Lisbon, Portugal
Duration: 14 Jul 202216 Jul 2022

Publication series

NameICSBT International Conference on Smart Business Technologies
Volume2022-July
ISSN (Print)2184-772X

Conference

Conference19th International Conference on Smart Business Technologies, ICSBT 2022
Country/TerritoryPortugal
CityLisbon
Period14/07/2216/07/22

Keywords

  • Business Modelling
  • Convolutional Neural Networks
  • GPU
  • Grammatical Evolution
  • Hyperparameters
  • Machine Learning
  • Smart City

Fingerprint

Dive into the research topics of 'HyperEstimator: Evolving Computationally Efficient CNN Models with Grammatical Evolution'. Together they form a unique fingerprint.

Cite this