Autonomous driving paper index

Monocular 3D Perception and Lane-Aware Bird’s-Eye-View Mapping for Autonomous Driving

2026-04-07 · SAE technical paper series

autonomous driving systemautonomous drivingbev3d object detectiontrajectory predictionpath planninglane segmentationobject detectionmonocular cameraperceptionpredictionplanning

One-line summary

Accurate perception of the surrounding environment is fundamental and essential to safe and reliable autonomous driving.

Engineering notes

Key topics: autonomous driving system, autonomous driving, bev, 3d object detection, trajectory prediction, path planning, lane segmentation, object detection, monocular camera, perception, prediction, planning. See the paper for implementation details and experimental results.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Accurate perception of the surrounding environment is fundamental and essential to safe and reliable autonomous driving. This work presents an integrated vision-based framework that com bines object detection, 3D spatial localization, and lane segmentation to construct a unified bird’s-eye-view (BEV) representation of the driving scene. The pipeline provides geometric information on object position and orientation by employing Omni3D to infer 3D bounding boxes of objects from monocular camera frames. Detections are subsequently projected onto a 2D BEV canvas, where object instances are represented with respect to the ground plane for enhanced interpretability. To complement the object-level perception, we utilized YOLOPv2 to perform lane segmentation, producing both lane masks and lane line masks in the image domain for future coordinate transformation. By adopting a pinhole camera model, the coordinate transformation of these masks from the perspective image plane into the BEV canvas can be performed. The fusion of 3D object detections and geometrically transformed lane representations yields a coherent and structured spatial map of the vehicle’s surroundings. In addition, the BEV space is integrated into a local 2D map generated from Mapbox tool. This unified environment model enables explicit reasoning about drivable space and surrounding obstacles, facilitating its integration into downstream modules such as path planning and trajectory prediction. The framework demonstrates the feasibility of leveraging recent advances in monocular 3D perception and deep learning-based lane segmentation to construct a computationally efficient and semantically rich BEV representation, which is a potential core perception component in real-time autonomous driving systems.

5.0Engineering value
7.0Research novelty
5.0Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment