

This content will become publicly available on November 15, 2025

Title: Camera-view supervision for bird's-eye-view semantic segmentation
Bird's-eye-view Semantic Segmentation (BEVSS) is a powerful and crucial component of planning and control systems in many autonomous vehicles. Current methods rely on end-to-end learning to train models, leading to indirectly supervised and inaccurate camera-to-BEV projections. We propose a novel method of supervising feature extraction with camera-view depth and segmentation information, which improves the quality of feature extraction and projection in the BEVSS pipeline. Our model, evaluated on the nuScenes dataset, shows a 3.8% improvement in Intersection-over-Union (IoU) for vehicle segmentation and a 30-fold reduction in depth error compared to baselines, while maintaining a competitive inference speed of 32 FPS. This method offers more accurate and reliable BEVSS for real-time autonomous driving systems. The code and implementation details can be found at https://github.com/bluffish/sucam.
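As a rough illustration of the camera-view supervision idea described above, the following is a minimal PyTorch-style sketch of auxiliary depth and segmentation heads attached to the per-camera image features before BEV projection; the module and variable names are hypothetical, and this is not the authors' released implementation:

    import torch.nn as nn
    import torch.nn.functional as F

    class CameraViewSupervision(nn.Module):
        """Auxiliary heads that supervise per-camera features with depth and
        segmentation targets before the features are lifted to the BEV plane."""

        def __init__(self, feat_dim, num_depth_bins, num_classes):
            super().__init__()
            self.depth_head = nn.Conv2d(feat_dim, num_depth_bins, kernel_size=1)
            self.seg_head = nn.Conv2d(feat_dim, num_classes, kernel_size=1)

        def forward(self, cam_feats, depth_gt, seg_gt):
            # cam_feats: (B * n_cams, C, H, W) image-plane feature maps
            depth_logits = self.depth_head(cam_feats)  # per-pixel depth-bin logits
            seg_logits = self.seg_head(cam_feats)      # per-pixel class logits
            loss_depth = F.cross_entropy(depth_logits, depth_gt)  # depth_gt: bin indices
            loss_seg = F.cross_entropy(seg_logits, seg_gt)        # seg_gt: class indices
            return loss_depth + loss_seg

    # In training, such an auxiliary loss would be added to the main BEV segmentation
    # loss, e.g. total_loss = bev_loss + lambda_aux * aux_loss.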
Award ID(s): 2107449
PAR ID: 10639139
Author(s) / Creator(s):
Publisher / Repository: Frontiers Media S.A.
Date Published:
Journal Name: Frontiers in Big Data
Volume: 7
ISSN: 2624-909X
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Abstract: Background: Plant architecture can influence crop yield and quality. Manual extraction of architectural traits is, however, time-consuming, tedious, and error-prone. Trait estimation from 3D data addresses occlusion issues through the availability of depth information, while deep learning approaches enable learning features without manual design. The goal of this study was to develop a data processing workflow by leveraging 3D deep learning models and a novel 3D data annotation tool to segment cotton plant parts and derive important architectural traits. Results: The Point Voxel Convolutional Neural Network (PVCNN), combining both point- and voxel-based representations of 3D data, shows lower time consumption and better segmentation performance than point-based networks. Results indicate that the best mIoU (89.12%) and accuracy (96.19%), with an average inference time of 0.88 s, were achieved with PVCNN, compared to PointNet and PointNet++. On the seven architectural traits derived from segmented parts, an R² value of more than 0.8 and a mean absolute percentage error of less than 10% were attained. Conclusion: This plant part segmentation method based on 3D deep learning enables effective and efficient architectural trait measurement from point clouds, which could be useful to advance plant breeding programs and characterization of in-season developmental traits. The plant part segmentation code is available at https://github.com/UGA-BSAIL/plant_3d_deep_learning.
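    As a small aside on the evaluation metric reported above, the following is a generic NumPy sketch of per-class IoU and mIoU over point-wise part labels (illustrative only, not the authors' PVCNN pipeline):

      import numpy as np

      def point_cloud_miou(pred, gt, num_parts):
          """Mean IoU over part classes for per-point integer label arrays."""
          ious = []
          for c in range(num_parts):
              inter = np.logical_and(pred == c, gt == c).sum()
              union = np.logical_or(pred == c, gt == c).sum()
              if union > 0:  # skip classes absent from both prediction and ground truth
                  ious.append(inter / union)
          return float(np.mean(ious))

      # pred and gt would be (n_points,) part-label arrays for a segmented plant point cloud.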
  2. Abstract: Background: Recent developments to segment and characterize regions of interest (ROI) within medical images have led to promising shape analysis studies. However, the procedures to analyze the ROI are arbitrary and vary by study. A tool to translate the ROI into analyzable shape representations and features is greatly needed. Results: We developed SAFARI (shape analysis for AI-segmented images), an open-source package with a user-friendly online toolkit for ROI labelling and shape feature extraction of segmented maps, provided by AI algorithms or manual segmentation. We demonstrated that half of the shape features extracted by SAFARI were significantly associated with survival outcomes in a case study on 143 consecutive patients with stage I–IV lung cancer and another case study on 61 glioblastoma patients. Conclusions: SAFARI is an efficient and easy-to-use toolkit for segmenting and analyzing ROI in medical images. It can be downloaded from the Comprehensive R Archive Network (CRAN) and accessed at https://lce.biohpc.swmed.edu/safari/.
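    For readers unfamiliar with shape analysis of segmented masks, the snippet below sketches the kind of descriptors such a toolkit computes, using generic Python/scikit-image code; SAFARI itself is an R/CRAN package, so this is not its API:

      import numpy as np
      from skimage.measure import label, regionprops

      def roi_shape_features(mask):
          """A few common shape descriptors for the largest region in a binary ROI mask."""
          region = max(regionprops(label(mask.astype(int))), key=lambda r: r.area)
          return {
              "area": region.area,
              "perimeter": region.perimeter,
              "eccentricity": region.eccentricity,
              "solidity": region.solidity,
              "roundness": 4 * np.pi * region.area / (region.perimeter ** 2),
          }

      # mask could be an AI-generated or manually drawn tumor segmentation map.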
  3. Hedden, Abigail S.; Mazzaro, Gregory J.; Raynal, Ann Marie (Ed.)
    Research into autonomous vehicles has focused on purpose-built vehicles with Lidar, camera, and radar systems. Many vehicles on the road today have sensors built into them to provide advanced driver assistance systems. In this paper, we assess the ability of low-end automotive radar coupled with lightweight algorithms to perform scene segmentation. Results from a variety of scenes demonstrate the viability of this approach as a complement to existing autonomous driving systems.
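    One lightweight way to segment sparse radar returns is to cluster detections by position and Doppler; the abstract does not specify the algorithm used, so the sketch below (DBSCAN on hypothetical detections) is purely illustrative:

      import numpy as np
      from sklearn.cluster import DBSCAN

      # Each detection: (x [m], y [m], radial velocity [m/s]); the values are made up.
      detections = np.array([
          [ 2.1, 10.3,  0.0],
          [ 2.3, 10.5,  0.1],
          [-4.0, 25.2, -8.7],
          [-4.2, 25.0, -8.9],
      ])

      # Group nearby detections with similar Doppler into scene segments; -1 marks noise.
      labels = DBSCAN(eps=1.0, min_samples=2).fit_predict(detections)
      print(labels)  # e.g. [0 0 1 1]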
  4. Summary: Single-cell analysis has transformed our understanding of cellular diversity, offering insights into complex biological systems. Yet, manual data processing in single-cell studies poses challenges, including inefficiency, human error, and limited scalability. To address these issues, we propose the automated workflow cellSight, which integrates high-throughput sequencing in a user-friendly platform. By automating tasks like cell type clustering, feature extraction, and data normalization, cellSight reduces researcher workload, promoting focus on data interpretation and hypothesis generation. Its standardized analysis pipelines and quality control metrics enhance reproducibility, enabling collaboration across studies. Moreover, cellSight's adaptability supports integration with emerging technologies, keeping pace with advancements in single-cell genomics. cellSight accelerates discoveries in single-cell biology, driving impactful insights and clinical translation. It is available with documentation and tutorials at https://github.com/omicsEye/cellSight.
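    As an illustration of the steps such a workflow automates, here is a short sketch using the scanpy Python library (cellSight's own interface and defaults may differ; the input file name is hypothetical):

      import scanpy as sc

      adata = sc.read_h5ad("counts.h5ad")  # hypothetical single-cell count matrix

      # Normalization and feature selection
      sc.pp.normalize_total(adata, target_sum=1e4)
      sc.pp.log1p(adata)
      sc.pp.highly_variable_genes(adata, n_top_genes=2000)

      # Dimensionality reduction and cell-type clustering
      sc.pp.pca(adata, n_comps=50)
      sc.pp.neighbors(adata)
      sc.tl.leiden(adata, resolution=1.0)
      sc.tl.umap(adata)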
  5. Single image depth estimation is a critical issue for robot vision, augmented reality, and many other applications when an image sequence is not available. Self-supervised single image depth estimation models aim to predict an accurate disparity map from a single image, without ground-truth supervision or a stereo image pair at inference time. Compared with direct single image depth estimation, a single image stereo algorithm can generate depth from a different camera perspective. In this paper, we propose a novel architecture to infer accurate disparity by leveraging both a spectral-consistency based learning model and a view-prediction based stereo reconstruction algorithm. The direct spectral-consistency based method avoids false positive matching in smooth regions, while single image stereo preserves more distinct boundaries from another camera perspective. By learning confidence maps and designing a fusion strategy, the two disparities from the two approaches can be effectively fused to produce a refined disparity. Extensive experiments and ablations indicate that our method exploits the advantages of both spectral consistency and view prediction, especially in constraining object boundaries and correcting wrongly predicted regions.
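    A minimal sketch of the confidence-weighted fusion idea, assuming the two disparity maps and their per-pixel confidence maps are already computed (names are illustrative, not the paper's code):

      import numpy as np

      def fuse_disparities(d_spectral, d_stereo, conf_spectral, conf_stereo, eps=1e-6):
          """Blend two (H, W) disparity estimates per pixel according to their confidences."""
          w = conf_spectral / (conf_spectral + conf_stereo + eps)
          return w * d_spectral + (1.0 - w) * d_stereo

      # conf_spectral and conf_stereo are non-negative (H, W) confidence maps, e.g. produced
      # by small networks, as in the fusion strategy described above.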