Autonomous Driving Papers with Code, Datasets and Engineering Notes

Browse curated autonomous driving papers on end-to-end driving, BEV perception, 3D object detection, motion prediction, path planning, ADAS, Tesla FSD, Waymo, and self-driving foundation models.

2026-05-30

Semantic-decoupled spatial partition guided point-supervised oriented object detection

An autonomous driving research paper: Semantic-decoupled spatial partition guided point-supervised oriented object detection.

Autonomous Driving Research

Engineering 5.0 · Research 7.0 · Business 5.0

2026-05-30

SkyShield: Occupancy as a Safety Interface for Low-Altitude UAV Autonomy

To bridge this gap, we introduce SkyShield, to the best of our knowledge the first front-view monocular semantic occupancy benchmark for urban UAV flight below 20 meters.

Occupancy Prediction Autonomous Driving Simulation

Engineering 5.0 · Research 7.0 · Business 5.0

2026-05-30

Occlusion-aware multi-modal 3D object detection via multi-stage cross-modal fusion

An autonomous driving research paper: Occlusion-aware multi-modal 3D object detection via multi-stage cross-modal fusion.

3D Object Detection

Engineering 5.0 · Research 7.0 · Business 5.0

2026-05-29

Integrating ego-conditioned prediction and gap-driven motion planning for safe autonomous driving

This paper proposes a coupled prediction-planning framework that deeply integrates intention-aware multi-agent prediction with gap-driven trajectory optimization.

Path Planning Autonomous Driving Simulation

Engineering 5.5 · Research 8.0 · Business 5.0

2026-05-29

4D Radar Meets LiDAR and Camera: Cooperative Perception under Adverse Weather

Our approach extends two representative backbones: a radar-camera pipeline where radar substitutes LiDAR, and a LiDAR-radar pipeline where radar complements LiDAR.

LiDAR Perception Sensor Fusion Autonomous Driving Simulation

Engineering 5.0 · Research 7.0 · Business 5.0

2026-05-29

Modeling Robotics Dataset Construction as an Artifact-Based Build Process

Robotic systems generate large volumes of multimodal sensor data, but converting ROS bag recordings into machine learning datasets is often handled by ad hoc sequential scripts, creating engineering overhead and slow iteration cycles.

Autonomous Driving Research

Engineering 7.0 · Research 7.0 · Business 5.0

2026-05-29

IDOL: Inverse-Dynamics-Guided Future Prediction for End-to-End Autonomous Driving

To address this limitation, we propose \mathbf{IDOL}, an inverse-dynamics-guided future prediction framework for world-model-based end-to-end planning in latent BEV space, where inverse dynamics serves as the key bridge between future prediction and trajectory optimization.

End-to-End Autonomous Driving BEV Perception

Engineering 5.5 · Research 8.0 · Business 5.0

2026-05-29

Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration

We introduce TouchSafeBench, a physics-grounded benchmark for evaluating collision grounding in vision-language models (VLMs).

Autonomous Driving Research

Engineering 5.5 · Research 7.0 · Business 6.0

2026-05-29

NTR: Neural Token Reconstruction for Scene Token Bottleneck in End-to-End Driving

To address this limitation, we introduce Neural Token Reconstruction (NTR), a representation learning framework to directly constrain the compact scene-token bottleneck in perception-free driving.

End-to-End Autonomous Driving

Engineering 6.0 · Research 8.0 · Business 6.0

2026-05-29

Can BEV Perception Gracefully Degrade under Sensor Failures?

To this end, we present Grace-BEV, a lightweight and plug-and-play framework that enforces active reliability awareness during multi-modal fusion.

BEV Perception LiDAR Perception

Engineering 5.5 · Research 7.0 · Business 5.0

2026-05-29

Does Visual Information Play a Decisive Role in Vision-Language-Action Model Driving Behavior?

In this work, we introduce a structured multi-level visual perturbation framework to analyze visual-behavior dependency in VLA-based driving models systematically.

Motion Prediction

Engineering 5.0 · Research 7.0 · Business 5.0

2026-05-28

World Models: A Comprehensive Survey of Architectures, Methodologies, Reasoning Paradigms, and Applications

World models, internal simulators that learn the structure and dynamics of an environment, have emerged as a central paradigm in the pursuit of artificial general intelligence, enabling agents to predict, plan, and reason within learned representations.

Autonomous Driving Research

Engineering 5.5 · Research 7.0 · Business 6.0

2026-05-28

CityGen: Structure-Guided City-Style Synthesis for Cross-City Autonomous Driving

Autonomous driving systems are commonly trained and evaluated within limited geographic regions, which hinders their scalability when deployed in new cities.

Autonomous Driving Research

Engineering 5.5 · Research 7.0 · Business 6.0

2026-05-28

ReasonLight: A Multimodal Foundation Model-Enhanced Reinforcement Learning Framework for Zero-Shot Traffic Signal Control

To this end, we propose ReasonLight, a multimodal foundation model-enhanced RL framework for zero-shot TSC.

Autonomous Driving Research

Engineering 5.0 · Research 7.5 · Business 5.0

2026-05-28

ACF4D: Alignment and Consistency Guided Temporal Fusion for Multi-View 3D Object Detection

To overcome these challenges, we propose ACF4D, a novel temporal fusion framework designed for multi-view 3D object detection.

BEV Perception 3D Object Detection

Engineering 5.5 · Research 8.0 · Business 5.0

2026-05-27

Pose-aware BEV feature refinement for robust cooperative perception under pose uncertainty

To address this issue, we propose a pose-aware BEV feature refinement method for post-fusion BEV representations.

BEV Perception 3D Object Detection

Engineering 5.5 · Research 7.0 · Business 6.5

2026-05-27

DriveWAM: Video Generative Priors Enable Scalable World-Action Modeling for Autonomous Driving

We present DriveWAM, a driving world-action model that adapts a pretrained video diffusion transformer into an autoregressive video-action policy.

End-to-End Autonomous Driving

Engineering 5.5 · Research 7.5 · Business 5.0

2026-05-27

DRIFT: Driving Risk Inference via Field Transmission for Human-like Autonomous Driving

We present DRIFT, a spatiotemporal risk field governed by an advection-diffusion-reaction partial differential equation (PDE), with an optional telegrapher term.

Autonomous Driving Research

Engineering 5.5 · Research 7.0 · Business 5.5

2026-05-27

Modeling Vehicle-Type-Specific Pedestrian Crash Avoidance Behavior in Safety-Critical Interactions Using Smooth-Mamba Deep Reinforcement Learning

To model vehicle-type-specific pedestrian crash avoidance behavior, we develop a Smooth-Mamba Deep Deterministic Policy Gradient framework, termed SMamba-DDPG, which integrates smooth action constraints with efficient temporal representation learning.

Autonomous Driving Simulation

Engineering 5.5 · Research 8.0 · Business 6.5

2026-05-26

SDEF-BEV: spatial-aware dual-expert radar-camera fusion for robust BEV 3D object detection

To address these issues, we propose SDEF-BEV, a novel spatial-aware dual-expert fusion network.

BEV Perception 3D Object Detection

Engineering 5.5 · Research 8.0 · Business 5.0