Grammatical Feature Construction for Enhanced Interpretability in Breast Cancer Classification

  • Yumnah Hasan
  • , Allan de Lima
  • , Darian Reyes Fernández de Bulnes
  • , Douglas Mota Dias
  • , Conor Ryan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advances in Artificial Intelligence have yielded significant progress in developing medical and clinical diagnosis techniques. Machine learning algorithms are among the most promising methods for detection and classification problems. Despite their inherent robustness, the primary challenge in employing these approaches lies in their opaque behaviour, a critical factor in medical diagnosis. Establishing trust between clinicians and patients requires an explainable model. This paper presents a two-stage approach to improving explainability: the first stage, Grammatical Feature Construction (GFC), uses Grammatical Evolution (GE) to perform feature construction. These features are interpretable as they are generated from the original features by applying simple arithmetic operations to the original data. These features are independent of the model/algorithm that will be used for classification, so any classification algorithm could be used in the second stage; we focus here on Linear Discriminant Analysis (LDA) to create GFC/LDA, which provides greater explainability than using LDA alone while maintaining comparable performance. To evaluate the effectiveness of GFC/LDA, we conducted a comprehensive comparative analysis against methods including GE as a classifier and LDA using all original features in two Breast Cancer datasets, the Digital Database for Screening Mammography and the Wisconsin Breast Cancer dataset. The results demonstrate that the GFC/LDA approach yields comparable with the other methods but produces more interpretable models.

Original languageEnglish
Title of host publicationApplications of Evolutionary Computation - 28th European Conference, EvoApplications 2025, Held as Part of EvoStar 2025, Proceedings
EditorsPablo García-Sánchez, Emma Hart, Sarah L. Thomson
PublisherSpringer Science and Business Media Deutschland GmbH
Pages204-220
Number of pages17
ISBN (Print)9783031900617
DOIs
Publication statusPublished - 2025
Event28th European Conference on Applications of Evolutionary Computation, EvoApplications 2025, held as part of EvoStar 2025 - Trieste, Italy
Duration: 23 Apr 202525 Apr 2025

Publication series

NameLecture Notes in Computer Science
Volume15612 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference28th European Conference on Applications of Evolutionary Computation, EvoApplications 2025, held as part of EvoStar 2025
Country/TerritoryItaly
CityTrieste
Period23/04/2525/04/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Breast Cancer
  • Feature Construction
  • Grammatical Evolution
  • Interpretability
  • Machine Learning

Fingerprint

Dive into the research topics of 'Grammatical Feature Construction for Enhanced Interpretability in Breast Cancer Classification'. Together they form a unique fingerprint.

Cite this