Autonomous Driving Papers with Code, Datasets and Engineering Notes

Browse curated autonomous driving papers on end-to-end driving, BEV perception, 3D object detection, motion prediction, path planning, ADAS, Tesla FSD, Waymo, and self-driving foundation models.

2025-04-01

Frustum consistency augmentation for 3D object detection in LiDAR

In this paper, we innovatively implement a combination of 2D detectors and raw points within the RoI (region of interest) to filter virtual points to resolve the challenges previously outlined.

3D Object Detection LiDAR Perception

Engineering 5.0 · Research 7.0 · Business 5.0

2025-04-01

Safety-Certified Receding-Horizon Motion Planning and Containment Control of Autonomous Surface Vehicles via Neurodynamic Optimization

This paper addresses the safety-certified motion planning and containment control of under-actuated autonomous surface vehicles subject to model uncertainties, external disturbances, and input constraints in the presence of stationary and moving obstacles.

Path Planning

Engineering 5.0 · Research 7.0 · Business 5.0

2025-04-01

An Improved Multi-Modal Fusion 3D Object Detection Algorithm Based on Point Density

In this study, we introduce a novel multi-modal, density-aware 3D object detection framework, PVNet, which leverages virtual point clouds generated through depth completion to overcome fusion difficulties.

3D Object Detection LiDAR Perception Sensor Fusion

Engineering 5.0 · Research 8.0 · Business 5.0

2025-03-31

Video-Based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving

We present ViTLR, a novel video-based end-to-end neural network that processes multiple consecutive frames to achieve robust traffic light detection and state classification.

End-to-End Autonomous Driving

Engineering 6.0 · Research 8.0 · Business 6.5

2025-03-31

Uniocc: a Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

We introduce UniOcc, a comprehensive, unified benchmark and toolkit for occupancy forecasting (i.e., predicting future occupancies based on historical information) and occupancy prediction (i.e., predicting current-frame occupancy from camera images.

Occupancy Prediction Autonomous Driving Simulation

Engineering 7.5 · Research 8.0 · Business 6.5

2025-03-31

MDFusion: Multi-Dimension Semantic–Spatial Feature Fusion for LiDAR–Camera 3D Object Detection

To mitigate these issues, this paper proposes a novel multi-dimension semantic–spatial feature fusion (MDFusion) method that combines LiDAR and image features in 2D and 3D spaces.

BEV Perception 3D Object Detection LiDAR Perception Sensor Fusion

Engineering 5.0 · Research 8.0 · Business 5.0

2025-03-30

GuardianLane – An Intelligent Road Safety System using Lane Detection, Road Sign Recognition, and SOS Response

Important tasks in Intelligent Transportation Systems (ITS) for autonomous driving include lane detection, traffic sign detection, and vehicle collision prediction.

Autonomous Driving Research

Engineering 5.0 · Research 7.0 · Business 5.0

2025-03-30

OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model

We present OpenDriveVLA, a Vision-Language Action (VLA) model designed for end-to-end autonomous driving, built upon open-source large language models.

End-to-End Autonomous Driving Path Planning

Engineering 7.5 · Research 8.5 · Business 5.0

2025-03-28

Stream and Query-guided Feature Aggregation for Efficient and Effective 3D Occupancy Prediction

To mitigate this trade-off, we introduce DuOcc, which employs a dual aggregation strategy that retains dense voxel representations to preserve spatial fidelity while maintaining high efficiency.

Occupancy Prediction

Engineering 5.5 · Research 8.0 · Business 5.0

2025-03-26

Design and Assessment of Reinforcement Learning Algorithms for End-to-End Autonomous Driving Learning

End-to-end autonomous driving learning refers to the process of mapping the original sensor data (such as camera images, radar signals, etc.) directly to driving decisions or control instructions, without the need to manually design complex feature extraction and rule making.

End-to-End Autonomous Driving Autonomous Driving Simulation

Engineering 5.5 · Research 7.0 · Business 5.0

2025-03-26

GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving

We introduce GAIA-2, Generative AI for Autonomy, a latent diffusion world model that unifies these capabilities within a single generative framework.

Autonomous Driving Simulation

Engineering 5.0 · Research 7.0 · Business 5.0

2025-03-26

Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving

Based on the depth information, we propose a Sketch-Coloring framework OmniDepth-Occ.

Occupancy Prediction

Engineering 5.0 · Research 7.0 · Business 5.0

2025-03-25

End-to-End Lane Detection: A Two-Branch Instance Segmentation Approach

To address the challenges of lane line recognition failure and insufficient segmentation accuracy in complex autonomous driving scenarios, this paper proposes a dual-branch instance segmentation method that integrates multi-scale modeling and dynamic feature enhancement.

End-to-End Autonomous Driving

Engineering 5.5 · Research 7.0 · Business 5.0

2025-03-25

Robust Multi-Sensor Fusion for Localization in Hazardous Environments Using Thermal, LiDAR, and GNSS Data

We propose a robust sensor fusion algorithm that integrates data from a thermal camera, a LiDAR sensor, and a GNSS to provide reliable localization, even in environments where individual sensor data may be compromised.

LiDAR Perception Sensor Fusion

Engineering 5.5 · Research 7.0 · Business 5.5

2025-03-25

Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving

In this paper, we introduce Semi-SMD, a novel metric depth estimation framework tailored for surrounding cameras equipment in autonomous driving.

Autonomous Driving Research

Engineering 7.0 · Research 8.0 · Business 5.0

2025-03-25

Orion: A Holistic End-To-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

To tackle this issue, we propose ORION, a hOlistic E2E autonomous dRiving framework by vIsion-language instructed actiON generation.

End-to-End Autonomous Driving Motion Prediction

Engineering 5.5 · Research 8.5 · Business 5.0

2025-03-25

Multimodal vehicle trajectory prediction method based on visual perception information

This paper proposes a multimodal vehicle trajectory prediction model based on visual perception information (VP-MTP).

BEV Perception Motion Prediction

Engineering 5.5 · Research 7.0 · Business 6.0

2025-03-24

Applications of Large Language Models and Multimodal Large Models in Autonomous Driving: A Comprehensive Review

In this paper, we present a systematic review on the integration of LLMs and MLMs in autonomous driving systems.

Autonomous Driving Research

Engineering 5.0 · Research 7.5 · Business 5.0

2025-03-23

A Framework for Autonomous UAV Navigation Based on Monocular Depth Estimation

With this work, we propose a framework and scenarios in three different open-source virtual environments, varying in complexity, to test and compare autonomous UAV navigation methods based on vision.

Path Planning LiDAR Perception Occupancy Prediction Sensor Fusion

Engineering 6.5 · Research 7.0 · Business 5.0

2025-03-23

M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving

In this paper, we introduce M3Net, a novel multimodal and multi-task network that simultaneously tackles detection, segmentation, and 3D occupancy prediction for autonomous driving and achieves superior performance than single task model.

BEV Perception 3D Object Detection Occupancy Prediction

Engineering 5.5 · Research 8.0 · Business 5.0