Autonomous Driving Papers with Code, Datasets and Engineering Notes

Browse curated autonomous driving papers on end-to-end driving, BEV perception, 3D object detection, motion prediction, path planning, ADAS, Tesla FSD, Waymo, and self-driving foundation models.

2025-02-25

InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer

We propose InVDriver, a novel vectorized query-based system that systematically models intra-instance spatial dependencies through masked self-attention layers, thereby enhancing planning accuracy and trajectory smoothness.

End-to-End Autonomous Driving

Engineering 6.5 · Research 8.0 · Business 6.0

2025-02-24

How Lightweight Deep Learning Enhances Performance in DPU-Accelerated Autonomous Driving on Zynq SoC

This study presents a lightweight deep learning model developed for DPU-accelerated systems.

Autonomous Driving Research

Engineering 5.5 · Research 7.0 · Business 6.5

2025-02-23

Learning from Rendering: Realistic and Controllable Extreme Rainy Image Synthesis for Autonomous Driving Simulation

We propose a learning-from-rendering rainy image synthesizer that combines the realism of rendering-based methods with the controllability of learning-based methods.

Autonomous Driving Simulation

Engineering 7.0 · Research 7.0 · Business 5.5

2025-02-20

OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images

We propose OrchardDepth, which fills the gap in monocular metric depth estimation for orchard and vineyard environments.

Autonomous Driving Research

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-20

OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving

We propose OG-Gaussian, a novel approach that replaces LiDAR point clouds with Occupancy Grids (OGs) generated from surround-view camera images using an Occupancy Prediction Network (ONet).

LiDAR Perception · Occupancy Prediction · Sensor Fusion · Autonomous Driving Simulation

Engineering 5.5 · Research 8.0 · Business 6.0

2025-02-20

Deep Reinforcement Learning for a Self-Driving Vehicle Operating Solely on Visual Information

We developed a ViT-based DRL model and evaluated it through extensive training in the MetaDrive simulator and testing in the high-fidelity AirSim simulator.

Autonomous Driving Research

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-19

Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning

We propose Sce2DriveX, a human-like chain-of-thought (CoT) driving reasoning MLLM framework designed to learn progressively across the driving process, from multi-view scene understanding to behavior analysis, motion planning, and vehicle control.

End-to-End Autonomous Driving · BEV Perception · Path Planning · Autonomous Driving Simulation

Engineering 5.5 · Research 8.5 · Business 5.0

2025-02-18

BiLSTM-Based VAE-GAN for Predicting Future Road States in Autonomous Driving

The ability to accurately predict future road conditions is essential for the advancement of autonomous driving systems.

Autonomous Driving Simulation

Engineering 5.5 · Research 7.0 · Business 5.5

2025-02-17

A Neural Network Model for Autonomous Vehicles Safety Check Using Camera-only/Camera-Lidar Images

Camera and LiDAR sensors play a crucial role in vehicle perception systems, enabling accurate detection of obstacles and other vehicles in autonomous driving technologies.

LiDAR Perception · Sensor Fusion

Engineering 5.0 · Research 8.0 · Business 5.0

2025-02-14

A Center Point-based Deep Learning Method for Monocular Depth Estimation

We develop a virtual dataset, the Intersection Dataset, which includes extensive annotations of vehicles and pedestrians in various traffic scenarios.

BEV Perception · Autonomous Driving Simulation

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-13

Advancing Road Lane Detection in Autonomous Driving through Multistage Attention Network

Developing reliable autonomous systems requires road lane segmentation models that can mimic human perception without the associated errors.

Autonomous Driving Research

Engineering 5.5 · Research 7.0 · Business 5.5

2025-02-04

SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset

We introduce SimBEV, a synthetic multi-task multi-sensor driving data generation tool and dataset.

BEV Perception · 3D Object Detection · Sensor Fusion

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-01

Quality Evaluation for Colored Point Clouds Produced by Autonomous Vehicle Sensor Fusion Systems

This paper presents an evaluation method for comparing colored point clouds, a common fused data type, across two LiDAR–camera fusion systems and a stereo camera setup.

LiDAR Perception · Sensor Fusion

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-01

LinkOcc: 3D Semantic Occupancy Prediction With Temporal Association

In this paper, we introduce LinkOcc, a sparse-queries approach incorporating an efficient temporal association mechanism for 3D semantic occupancy prediction.

Occupancy Prediction

Engineering 5.5 · Research 8.0 · Business 5.5

2025-02-01

Point-Level Fusion and Channel Attention for 3D Object Detection in Autonomous Driving

As autonomous driving technology progresses, LiDAR-based 3D object detection has emerged as a fundamental element of environmental perception systems.

3D Object Detection · LiDAR Perception

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-01

STFNet: Sparse Temporal Fusion for 3D Object Detection in LiDAR Point Cloud

We propose the sparse temporal fusion network (STFNet), which leverages multiframe historical information to improve 3D object detection accuracy.

3D Object Detection · LiDAR Perception

Engineering 5.5 · Research 8.0 · Business 5.0

2025-02-01

CFPC: The Curbed Fake Point Collector to Pseudo-LiDAR-Based 3D Object Detection for Autonomous Vehicles

This paper proposes a curbed fake point collector (CFPC), which addresses three issues caused by pseudo points, to support 3D object detection for autonomous vehicles.

3D Object Detection · LiDAR Perception

Engineering 5.0 · Research 7.0 · Business 5.0

2025-02-01

Multimodal 3D Object Detection Based on Sparse Interaction in Internet of Vehicles

We propose dense fusion with multi-scale masked attention (DFMMA), which uses multi-scale feature masks from bird's-eye-view (BEV)-level multimodal features to improve feature perception for small objects.

BEV Perception · 3D Object Detection · LiDAR Perception · Sensor Fusion

Engineering 5.0 · Research 7.0 · Business 5.0

2025-01-29

BSM-NET: multi-bandwidth, multi-scale and multi-modal fusion network for 3D object detection of 4D radar and LiDAR

To overcome the limitations of single-sensor perception, this paper proposes the BSM-NET method, a multi-bandwidth, multi-scale, multi-modal fusion approach for 4D radar and LiDAR.

BEV Perception · 3D Object Detection · LiDAR Perception

Engineering 5.0 · Research 8.0 · Business 5.0

2025-01-28

AFLaneNet: an attention-fused instance segmentation network for real-time lane detection

AFLaneNet is an attention-fused instance segmentation network for real-time lane detection.

Autonomous Driving Research

Engineering 5.0 · Research 7.0 · Business 5.0