Autonomous driving paper index

Robust Pothole Avoidance in Autonomous Driving Using Multi-Task Transformer Perception and Ensemble Reinforcement Learning (2026-06-12)

2026-06-14 · Zenodo (CERN European Organization for Nuclear Research)

autonomous drivingautonomous vehicleend-to-endreinforcement learningperceptioncontrol

One-line summary

Autonomous Vehicles (AVs) lack considerable reliability in unstructured contexts with road surface deterioration, particularly potholes.

Engineering notes

A comprehensive evaluation on a unified test set of 1,180 frames indicates that the system achieves an average lateral deviation of 0.289m, reflecting a 26.3% increase in trajectory efficiency over baseline RL approaches.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为端到端自动驾驶、BEV感知、3D目标检测、轨迹预测、路径规划、LiDAR感知等高价值论文补充中文说明。

Original abstract

Autonomous Vehicles (AVs) lack considerable reliability in unstructured contexts with road surface deterioration, particularly potholes. Existing solutions are often divided into two categories: pure perception tasks that require downstream control actionable integration, and end-to-end Reinforcement Learning (RL), which suffers from sample inefficiencies and safety hazards due to the “limited explainability” nature of neural rules. To address this gap, this paper introduces a new Hybrid Perception-Decision Framework that tightly integrates uncertainty-aware vision with an enhanced ensemble control approach. A Transformer-U-Net (TransUNet) architecture capable of estimating pothole segmentation, depth, and perceptual uncertainty all at the same time is presented. These outputs are then transformed into a compact 117-dimensional risk vector, which serves as basis of state space for a “Dual-layer Ensemble” RL controller comprising PPO, A2C, TRPO, and Recurrent PPO agents. Crucially, a deterministic Safety Refinement Layer (SRL) that serves as a kinematic guardrail, superseding RL ideas based on a Confidence Shield and Neighbour Mini-Planner is designed. A comprehensive evaluation on a unified test set of 1,180 frames indicates that the system achieves an average lateral deviation of 0.289m, reflecting a 26.3% increase in trajectory efficiency over baseline RL approaches. While RL agents demonstrated efficiency during inference, the Safety Layer interfered in 55.9% of test sequences to avoid risky manoeuvres in high-uncertainty zones, while preserving computational efficiency acceptable for near-real-time operations.

5.5Engineering value
7.0Research novelty
5.0Business relevance

Links and sources

Need this topic turned into a technical roadmap?

Full Self Driving can prepare a custom autonomous driving literature review, code map, dataset map, and B2B technology assessment.

Request B2B research

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment