TY - JOUR
T1 - On fitting numerical features into probabilistic distributions to represent data for fuzzy pattern trees
AU - de Lima, Allan
AU - Albarracín, Juan F.H.
AU - Dias, Douglas Mota
AU - Amaral, Jorge
AU - Ryan, Conor
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Fuzzy Pattern Trees (FPTs) are symbolic tree-based structures whose internal nodes are fuzzy operators, and the leaves are fuzzy features, which enhance interpretability by representing data with meaningful fuzzy terms. However, conventional FPT approaches typically employ uniformly distributed membership functions, which often fail to accurately represent features in real-world datasets. In this work, we propose an automatic method to adapt the bounds of fuzzy features based on their data distributions, with a focus on a simple triangular membership scheme. We evaluate our approach across 11 benchmark classification problems, incorporating six parsimony pressure methods to promote more compact solutions. Our results demonstrate that the adapted fuzzification scheme, beyond improving interpretability, consistently yields models that better balance accuracy and size when compared to uniform representations, appearing on the Pareto front 20 times, while the second-best scheme appeared only 15 times.
AB - Fuzzy Pattern Trees (FPTs) are symbolic tree-based structures whose internal nodes are fuzzy operators, and the leaves are fuzzy features, which enhance interpretability by representing data with meaningful fuzzy terms. However, conventional FPT approaches typically employ uniformly distributed membership functions, which often fail to accurately represent features in real-world datasets. In this work, we propose an automatic method to adapt the bounds of fuzzy features based on their data distributions, with a focus on a simple triangular membership scheme. We evaluate our approach across 11 benchmark classification problems, incorporating six parsimony pressure methods to promote more compact solutions. Our results demonstrate that the adapted fuzzification scheme, beyond improving interpretability, consistently yields models that better balance accuracy and size when compared to uniform representations, appearing on the Pareto front 20 times, while the second-best scheme appeared only 15 times.
KW - Bloat control
KW - Fuzzy pattern trees
KW - Genetic programming
KW - Lexicase selection
UR - https://www.scopus.com/pages/publications/105014889804
U2 - 10.1007/s10710-025-09522-9
DO - 10.1007/s10710-025-09522-9
M3 - Article
AN - SCOPUS:105014889804
SN - 1389-2576
VL - 26
JO - Genetic Programming and Evolvable Machines
JF - Genetic Programming and Evolvable Machines
IS - 2
M1 - 25
ER -