TY - JOUR
T1 - Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving
AU - Manzoor, Anam
AU - Singh, Aryan
AU - Sistu, Ganesh
AU - Mohandas, Reenu
AU - Grua, Eoin
AU - Scanlan, Anthony
AU - Eising, Ciarán
N1 - Publisher Copyright:
© This is an open access article published by the IET under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/)
PY - 2024
Y1 - 2024
N2 - This study investigates the effectiveness of modern Deformable Convolutional Neural Networks (DCNNs) for semantic segmentation tasks, particularly in autonomous driving scenarios with fisheye images. These images, providing a wide field of view, pose unique challenges for extracting spatial and geometric information due to dynamic changes in object attributes. Our experiments focus on segmenting the WoodScape fisheye image dataset into ten distinct classes, assessing the Deformable Networks' ability to capture intricate spatial relationships and improve segmentation accuracy. Additionally, we explore different loss functions to address class imbalance issues and compare the performance of conventional CNN architectures with Deformable Convolution-based CNNs, including Vanilla U-Net and Residual U-Net architectures. The significant improvement in mIoU score resulting from integrating Deformable CNNs demonstrates their effectiveness in handling the geometric distortions present in fisheye imagery, exceeding the performance of traditional CNN architectures. This underscores the significant role of Deformable convolution in enhancing semantic segmentation performance for fisheye imagery.
AB - This study investigates the effectiveness of modern Deformable Convolutional Neural Networks (DCNNs) for semantic segmentation tasks, particularly in autonomous driving scenarios with fisheye images. These images, providing a wide field of view, pose unique challenges for extracting spatial and geometric information due to dynamic changes in object attributes. Our experiments focus on segmenting the WoodScape fisheye image dataset into ten distinct classes, assessing the Deformable Networks' ability to capture intricate spatial relationships and improve segmentation accuracy. Additionally, we explore different loss functions to address class imbalance issues and compare the performance of conventional CNN architectures with Deformable Convolution-based CNNs, including Vanilla U-Net and Residual U-Net architectures. The significant improvement in mIoU score resulting from integrating Deformable CNNs demonstrates their effectiveness in handling the geometric distortions present in fisheye imagery, exceeding the performance of traditional CNN architectures. This underscores the significant role of Deformable convolution in enhancing semantic segmentation performance for fisheye imagery.
KW - Deformable Convolution
KW - Fisheye Images
KW - Semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85211779881&partnerID=8YFLogxK
U2 - 10.1049/icp.2024.3270
DO - 10.1049/icp.2024.3270
M3 - Conference article
AN - SCOPUS:85211779881
SN - 2732-4494
VL - 2024
SP - 7
EP - 14
JO - IET Conference Proceedings
JF - IET Conference Proceedings
IS - 10
T2 - 26th Irish Machine Vision and Image Processing Conference, IMVIP 2024
Y2 - 21 August 2024 through 23 August 2024
ER -