Multi-objective classification and feature selection of covid-19 proteins sequences using NSGA-II and MAP-Elites

Vijay Sambhe, Shanmukha Rajesh, Enrique Naredo, Douglas Mota Dias, Meghana Kshirsagar, Conor Ryan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The advent of the Covid-19 pandemic has resulted in a global crisis making the health systems vulnerable, challenging the research community to find novel approaches to facilitate early detection of infections. This open-up a window of opportunity to exploit machine learning and artificial intelligence techniques to address some of the issues related to this disease. In this work, we address the classification of ten SARS-CoV-2 protein sequences related to Covid-19 using k-mer frequency as features and considering two objectives; classification performance and feature selection. The first set of experiments considered the objectives one at the time, four techniques were used for the feature selection and twelve well known machine learning methods, where three are neural network based for the classification. The second set of experiments considered a multiobjective approach where we tested a well known multi-objective approach Non-dominated Sorting Genetic Algorithm II (NSGA-II), and the Multi-dimensional Archive of Phenotypic Elites (MAP-Elites), which considers quality+diversity containers to guide the search through elite solutions. The experimental results shows that ResNet and PCA is the best combination using single objectives. Whereas, for the mulit-classification, NSGA-II outperforms ME with two out of three classifiers, while ME gets competitive results bringing more diverse set of solutions.

Original languageEnglish
Title of host publicationICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence
EditorsAna Paula Rocha, Luc Steels, Jaap van den Herik
PublisherSciTePress
Pages1241-1248
Number of pages8
ISBN (Electronic)9789897584848
Publication statusPublished - 2021
Event13th International Conference on Agents and Artificial Intelligence, ICAART 2021 - Virtual, Online
Duration: 4 Feb 20216 Feb 2021

Publication series

NameICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence
Volume2

Conference

Conference13th International Conference on Agents and Artificial Intelligence, ICAART 2021
CityVirtual, Online
Period4/02/216/02/21

Keywords

  • DNA Sequences
  • Feature selection
  • Genetic algorithms
  • K-mer
  • MAP-Elites
  • NSGA-II

Fingerprint

Dive into the research topics of 'Multi-objective classification and feature selection of covid-19 proteins sequences using NSGA-II and MAP-Elites'. Together they form a unique fingerprint.

Cite this