Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure

Saheli Hazra, Sudip Das, Rohit Choudhary, Arindam Das, Ganesh Sistu, Ciarán Eising, Ujjwal Bhattacharya

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Applying pseudo labeling techniques has been found to be advantageous in semi-supervised 3D object detection (SSO D) in Bird' s-Eye-View (BEV) for autonomous driving, particularly where labeled data is limited. In the literature, Exponential Moving Average (EMA) has been used for adjustments of the weights of teacher network by the student network. However, the same induces catastrophic forgetting in the teacher network. In this work, we address this issue by introducing a novel concept of Reflective Teacher where the student is trained by both labeled and pseudo labeled data while its knowledge is progressively passed to the teacher through a regularizer to ensure retention of previous knowledge. Additionally, we propose Geometry Aware BEV Fusion (GA-BEVFusion) for efficient alignment of multi-modal BEV features, thus reducing the disparity between the modalities-camera and LiDAR. This helps to map the precise geometric information embedded among LiDAR points reliably with the spatial priors for extraction of semantic information from camera images. Our experiments on the nuScenes and Waymo datasets demonstrate: 1) improved performance over state-of-the-art methods in both fully supervised and semi-supervised settings; 2) Reflective Teacher achieves equivalent performance with only 25% and 22% of labeled datafor nuScenes and Waymo datasets respectively, in contrast to other fully supervised methods that utilize the full labeled dataset.

    Original languageEnglish
    Title of host publicationProceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages1649-1659
    Number of pages11
    ISBN (Electronic)9798331510831
    DOIs
    Publication statusPublished - 2025
    Event2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 - Tucson, United States
    Duration: 28 Feb 20254 Mar 2025

    Publication series

    NameProceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025

    Conference

    Conference2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025
    Country/TerritoryUnited States
    CityTucson
    Period28/02/254/03/25

    Keywords

    • birds-eye-view
    • lidar
    • multimodal learning
    • semi supervised learning

    Fingerprint

    Dive into the research topics of 'Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure'. Together they form a unique fingerprint.

    Cite this