TY - JOUR
T1 - Leveraging EfficientNetV2 for Litter Detection
AU - McMahon, Niamh
AU - Grua, Eoin M.
AU - Eising, Ciarán
N1 - Publisher Copyright:
© This is an open access article published by the IET under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/)
PY - 2024
Y1 - 2024
N2 - Litter pollution is a significant environmental challenge globally, demanding advanced methods for detection and management. Recent advancements in computer vision offer promising alternatives to traditional approaches, yet they often rely on narrowly focused datasets with a lack of segmentation data, which is required for applications such as robotic grasping. This research paper addresses these limitations while utilising the TACO (Trash Annotations in Context) dataset, a small but growing dataset, created to encompass diverse environmental contexts and detailed annotation. EfficientNetV2, a state-of-the-art convolutional neural network architecture, is leveraged for both image classification and object detection/instance segmentation models. The image classification model achieves an F1-score of 0.854 in categorizing an image containing ‘Litter' or ‘No Litter'. For object detection and instance segmentation, EfficientNetV2 is integrated with the Mask R-CNN architecture implemented in Detectron2. This hybrid model achieves a box and segmentation mAP of 0.65 and 0.52 respectively.
AB - Litter pollution is a significant environmental challenge globally, demanding advanced methods for detection and management. Recent advancements in computer vision offer promising alternatives to traditional approaches, yet they often rely on narrowly focused datasets with a lack of segmentation data, which is required for applications such as robotic grasping. This research paper addresses these limitations while utilising the TACO (Trash Annotations in Context) dataset, a small but growing dataset, created to encompass diverse environmental contexts and detailed annotation. EfficientNetV2, a state-of-the-art convolutional neural network architecture, is leveraged for both image classification and object detection/instance segmentation models. The image classification model achieves an F1-score of 0.854 in categorizing an image containing ‘Litter' or ‘No Litter'. For object detection and instance segmentation, EfficientNetV2 is integrated with the Mask R-CNN architecture implemented in Detectron2. This hybrid model achieves a box and segmentation mAP of 0.65 and 0.52 respectively.
KW - convolutional neural networks
KW - image classification
KW - object detection
KW - Waste detection
UR - http://www.scopus.com/inward/record.url?scp=85216779016&partnerID=8YFLogxK
U2 - 10.1049/icp.2024.3322
DO - 10.1049/icp.2024.3322
M3 - Conference article
AN - SCOPUS:85216779016
SN - 2732-4494
VL - 2024
SP - 315
EP - 318
JO - IET Conference Proceedings
JF - IET Conference Proceedings
IS - 10
T2 - 26th Irish Machine Vision and Image Processing Conference, IMVIP 2024
Y2 - 21 August 2024 through 23 August 2024
ER -