ATN3D: Density-Aware LiDAR-Radar Early 3D Object Detection Under Extreme Sparsity
We propose ``Ask The Neighbor'' (ATN3D), a LiDAR-Radar framework tailored for sparse-range conditions.
Engineering 5.5 · Research 7.0 · Business 5.5
Occupancy grid and occupancy network methods for autonomous driving — volumetric 3D environment representation for obstacle avoidance and free-space estimation.
We propose ``Ask The Neighbor'' (ATN3D), a LiDAR-Radar framework tailored for sparse-range conditions.
Engineering 5.5 · Research 7.0 · Business 5.5
An autonomous driving research paper: 3D Occupancy Prediction for Motion Forecasting in Autonomous Driving.
Engineering 5.0 · Research 7.0 · Business 5.0
To validate the effectiveness of our approach, we construct a dedicated dataset of unstructured scenes collected from open-pit mines.
Engineering 5.5 · Research 8.0 · Business 5.0
An autonomous driving research paper: A dual-stream foreground-aware enhancement network with spiralscan-Mamba for vision-based occupancy prediction in autonomous driving.
Engineering 5.0 · Research 7.0 · Business 5.0
To bridge this gap, we introduce SkyShield, to the best of our knowledge the first front-view monocular semantic occupancy benchmark for urban UAV flight below 20 meters.
Engineering 5.0 · Research 7.0 · Business 5.0
To address these challenges, we propose a class-distribution guided active learning framework for selecting training samples to annotate in autonomous driving datasets.
Engineering 5.5 · Research 7.0 · Business 5.0
To address these challenges, this paper presents an instance-centric benchmark for the 3D panoptic occupancy prediction task.
Engineering 5.0 · Research 7.0 · Business 5.0
This paper presents a framework leveraging SParse representation and SCalable feature interaction to address the aforementioned challenges, called SPSC.
Engineering 5.5 · Research 8.0 · Business 5.0
We introduce a systematic missing-view evaluation protocol on the nuScenes-based SurroundOcc benchmark, encompassing both deterministic single-view failures and stochastic multi-view dropout scenarios.
Engineering 7.5 · Research 7.0 · Business 6.5
To address these challenges, we propose Dr.
Engineering 5.5 · Research 7.0 · Business 5.0
In this paper, we propose DrivePI, a novel spatial-aware 4D MLLM that serves as a unified Vision-Language-Action (VLA) framework that is also compatible with vision-action (VA) models.
Engineering 7.5 · Research 8.5 · Business 5.0
This paper proposes a real-time 3D occupancy prediction method for autonomous driving.
Engineering 5.5 · Research 7.0 · Business 5.0
In addition, we design an Alignment-Aware Fusion Module that performs global alignment between the two modalities via bidirectional dynamic offsets.
Engineering 6.0 · Research 8.0 · Business 5.5
We propose R<inline-formula> <tex-math notation="LaTeX">$M^{2}$ </tex-math></inline-formula>Occ, the first 3D occupancy perception network that integrates multi-sensor fusion based on different sensor principles and achieves multi-task learning.
Engineering 5.5 · Research 8.0 · Business 5.0
An autonomous driving research paper: Gaussian-Based World Model: Gaussian Priors for Voxel-Based Occupancy Prediction and Future Motion Prediction.
Engineering 5.0 · Research 7.0 · Business 5.0
To address these limitations, we present PG-Occ, an innovative Progressive Gaussian Transformer Framework that enables open-vocabulary 3D occupancy prediction.
Engineering 7.0 · Research 8.0 · Business 5.5
An autonomous driving research paper: HMambaOcc: Hierarchical Mamba for occupancy flow field prediction in autonomous driving under mixed traffic environments.
Engineering 5.0 · Research 7.0 · Business 5.0
In this paper, we conduct a comprehensive evaluation of existing semantic occupancy prediction models from a reliability perspective for the first time.
Engineering 5.0 · Research 7.0 · Business 5.0
Efficient and high-accuracy 3D occupancy prediction is vital for the performance of autonomous driving systems.
Engineering 6.0 · Research 7.0 · Business 6.0
To address this limitation, we propose VFCAnet (Voxel-wise Fusion and Channel-wise Attention Network), a novel multimodal fusion framework for 3D semantic occupancy prediction.
Engineering 5.0 · Research 8.0 · Business 5.0