My Account Log in

1 option

A Unified-BEV Network for Joint 3D Object Detection and Map Segmentation in Complex Traffic Scenario Xihua University

SAE Technical Papers (1906-current) Available online

View online
Format:
Book
Conference/Event
Author/Creator:
Li, Mohan, author.
Contributor:
Liu, Xulei
Song, Tao
Xu, Yanhai
Zhou, Guofeng
Zhou, Zhisong
Conference Name:
2024 International Conference on Smart Transportation Interdisciplinary Studies (2024-12-13 : Nanjing, China)
Language:
English
Physical Description:
1 online resource cm
Place of Publication:
Warrendale, PA SAE International 2025
Summary:
Recently, the multi-view image-based Bird's Eye View (BEV) perception for autonomous driving has gained considerable attention due to its cost-effectiveness and capacity for rich semantic information. However, the majority of existing studies focus primarily on improving the performance of single task, neglect to utilize the dense and robust BEV representation that is beneficial for various downstream tasks such as 3D object detection, semantic map segmentation. These approaches inherently add extra computational burden due to repeated feature extraction and propagation for different tasks. To this end, we develop a network that simultaneously performs 3D object detection and map segmentation in a unified BEV representation space with multi-camera perspective view (PV) image inputs. Firstly, a shared network includes image feature extractor and PV-BEV transformation is employed to generate a unified BEV feature. The BEV feature serves as the input for the decoders of various tasks. Additionally, a temporal encoder and a perspective supervision head are employed in the model to enhance the performance for specific tasks. Finally, specific task decoders utilize the unified BEV representation to predict dynamic or static objects and semantic map surround ego car. Comprehensive experiments are conducted on the nuScenes dataset and the results demonstrate that our multi-task framework outperforms existing state-of-the-art approaches on 3D object detection and semantic map construction
Notes:
Vendor supplied data
Publisher Number:
2025-01-7202
Access Restriction:
Restricted for use by site license

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account