Collaborative Perception

This repository is a paper digest of recent advances in collaborative / cooperative / multi-agent perception for V2I / V2V / V2X autonomous driving scenarios. Papers are listed in alphabetical order by first character.

🔗Jump to: [Method and Framework] [Dataset and Simulator]

Note: I find it hard to fairly compare all methods on each benchmark, since some published results are obtained without specified training and testing settings, or even with modified model architectures. In fact, many works evaluate all baselines under their own settings and report those numbers. Inconsistencies between papers are therefore likely. For this reason, I discarded the collection and reproduction of all the benchmarks in a previous update. If you are interested, you can find a bunch of results in this archived version.

🌟Recommendation

Helpful Learning Resource:thumbsup::thumbsup::thumbsup:

(Position) Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition [paper], Automated Vehicles Should be Connected with Natural Language [paper], Collaborative Perception Datasets for Autonomous Driving: A Review [paper], Collaborative Perception for Connected and Autonomous Driving: Challenges, Possible Solutions and Opportunities [paper], V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges [paper], Towards Vehicle-to-Everything Autonomous Driving: A Survey on Collaborative Perception [paper], Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges [paper], A Survey and Framework of Cooperative Perception: From Heterogeneous Singleton to Hierarchical Cooperation [paper]
(Talk) Vehicle-to-Vehicle (V2V) Communication (Waabi CVPR 24 Tutorial on Self-Driving Cars) [video], Vehicle-to-Vehicle (V2V) Communication (Waabi CVPR 23 Tutorial on Self-Driving Cars) [video], The Ultimate Solution for L4 Autonomous Driving [video], When Vision Transformers Meet Cooperative Perception [video], Scene Understanding beyond the Visible [video], Robust Collaborative Perception against Communication Interruption [video], Collaborative and Adversarial 3D Perception for Autonomous Driving [video], Vehicle-to-Vehicle Communication for Self-Driving [video], Adversarial Robustness for Self-Driving [video], The Ultimate Form of L4 Perception Systems: Cooperative Driving [video], CoBEVFlow: Solving Temporal Asynchrony in Vehicle-to-Vehicle/Infrastructure Collaborative Perception [video], Next-Generation Collaborative Perception: Where2comm Reduces Communication Bandwidth by 100,000 Times [video], Multi-Robot Collaborative Perception: From Task-Specific to Task-Agnostic [video], Cooperative Autonomous Driving: Simulation and Perception [video], Beyond-Line-of-Sight Situational Awareness Based on Group Collaboration [video]
(Library) V2Xverse: A Codebase for V2X-Based Collaborative End2End Autonomous Driving [code] [doc], HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception [code] [doc], OpenCOOD: Open Cooperative Detection Framework for Autonomous Driving [code] [doc], CoPerception: SDK for Collaborative Perception [code] [doc], OpenCDA: Simulation Tool Integrated with Prototype Cooperative Driving Automation [code] [doc]
(Workshop) Co-Intelligence@ECCV'24 [web], CoPerception@ICRA'23 [web], ScalableAD@ICRA'23 [web]
(Background) Current Approaches and Future Directions for Point Cloud Object Detection in Intelligent Agents [video], 3D Object Detection for Autonomous Driving: A Review and New Outlooks [paper], DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning [video], A Survey of Multi-Agent Reinforcement Learning with Communication [paper]

Typical Collaboration Modes:handshake::handshake::handshake:

Possible Optimization Directions:fire::fire::fire:

🔖Method and Framework

Note: {Related} denotes that it is not a pure collaborative perception paper but has related content.

Selected Preprint

ACCO (Is Discretization Fusion All You Need for Collaborative Perception?) [paper] [code]
AR2VP (Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision) [paper] [code]
CPPC (Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception) [paper&review] [~~code~~]
CP-FREEZER (CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception) [paper] [code]
CMP (CMP: Cooperative Motion Prediction with Multi-Agent Communication) [paper] [~~code~~]
CoBEVFusion (CoBEVFusion: Cooperative Perception with LiDAR-Camera Bird's-Eye View Fusion) [paper] [~~code~~]
CoBEVGlue (Self-Localized Collaborative Perception) [paper] [code]
CoCMT (CoCMT: Towards Communication-Efficient Cross-Modal Transformer For Collaborative Perception) [paper&review] [code]
CoDiff (CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection) [paper] [~~code~~]
CoDriving (Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System) [paper] [code]
CoDrivingLLM (Towards Interactive and Learnable Cooperative Driving Automation: A Large Language Model-Driven Decision-making Framework) [paper] [code]
CollaMamba (CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model) [paper] [~~code~~]
CoLMDriver (CoLMDriver: LLM-Based Negotiation Benefits Cooperative Autonomous Driving) [paper] [code]
CoMamba (CoMamba: Real-Time Cooperative Perception Unlocked with State Space Models) [paper] [~~code~~]
CoPLOT (Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception) [paper] [code]
CP-Guard+ (CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception) [paper&review] [~~code~~]
CTCE (Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception) [paper] [~~code~~]
Debrief (Talking Vehicles: Cooperative Driving via Natural Language) [paper&review] [~~code~~]
DeepFleet (DeepFleet: Multi-Agent Foundation Models for Mobile Robots) [paper] [~~code~~]
DiffCP (DiffCP: Ultra-Low Bit Collaborative Perception via Diffusion Model) [paper] [~~code~~]
HyComm (Communication-Efficient Multi-Agent 3D Detection via Hybrid Collaboration) [paper] [~~code~~]
InSPE (InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement) [paper] [~~code~~]
I2XTraj (Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything) [paper] [~~code~~]
LangCoop (LangCoop: Collaborative Driving with Language) [paper] [code]
LCV2I (LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR) [paper] [~~code~~]
LMMCoDrive (LMMCoDrive: Cooperative Driving with Large Multimodal Model) [paper] [code]
mmCooper (mmCooper: A Multi-Agent Multi-Stage Communication-Efficient and Collaboration-Robust Cooperative Perception Framework) [paper] [~~code~~]
MOT-CUP (Collaborative Multi-Object Tracking with Conformal Uncertainty Propagation) [paper] [code]
RopeBEV (RopeBEV: A Multi-Camera Roadside Perception Network in Bird’s-Eye-View)
ParCon (ParCon: Noise-Robust Collaborative Perception via Multi-Module Parallel Connection) [paper] [~~code~~]
PragComm (Pragmatic Communication in Multi-Agent Collaborative Perception) [paper] [code]
QUEST (QUEST: Query Stream for Vehicle-Infrastructure Cooperative Perception) [paper] [~~code~~]
RCDN (RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-Based 3D Neural Modeling) [paper] [~~code~~]
RG-Attn (RG-Attn: Radian Glue Attention for Multi-Modality Multi-Agent Cooperative Perception) [paper] [~~code~~]
RiskMM (Risk Map As Middleware: Towards Interpretable Cooperative End-to-End Autonomous Driving for Risk-Aware Planning) [paper] [~~code~~]
RoCo-Sim (RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation) [paper] [code]
SiCP (SiCP: Simultaneous Individual and Cooperative Perception for 3D Object Detection in Connected and Automated Vehicles) [paper] [code]
SparseAlign (SparseAlign: A Fully Sparse Framework for Cooperative Object Detection) [paper] [~~code~~]
Talking Vehicles (Towards Natural Language Communication for Cooperative Autonomous Driving via Self-Play) [paper] [code]
TOCOM-V2I (Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception) [paper] [~~code~~]
{Related} TYP (Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene) [paper] [~~code~~]
VIMI (VIMI: Vehicle-Infrastructure Multi-View Intermediate Fusion for Camera-Based 3D Object Detection) [paper] [code]
VLIF (Is Intermediate Fusion All You Need for UAV-Based Collaborative Perception?) [paper] [code]
V2V-LLM (V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models) [paper] [code]
V2XPnP (V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction) [paper] [code]
V2X-DGPE (V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection) [paper] [code]
V2X-DGW (V2X-DGW: Domain Generalization for Multi-Agent Perception under Adverse Weather Conditions) [paper] [~~code~~]
V2X-M2C (V2X-M2C: Efficient Multi-Module Collaborative Perception with Two Connections) [paper] [~~code~~]
V2X-PC (V2X-PC: Vehicle-to-Everything Collaborative Perception via Point Cluster) [paper] [~~code~~]
V2X-REALM (V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling) [paper] [~~code~~]
V2X-ReaLO (V2X-ReaLO: An Open Online Framework and Dataset for Cooperative Perception in Reality) [paper] [~~code~~]
V2X-UniPool (V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving) [paper] [code]
V2X-VLM (V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models) [paper] [~~code~~]

CVPR 2025

CoGMP (Generative Map Priors for Collaborative BEV Semantic Segmentation) [paper] [~~code~~]
CoSDH (CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization) [paper] [code]
HeCoFuse (HeCoFuse: Cross-Modal Complementary V2X Cooperative Perception with Heterogeneous Sensors) [paper] [code]
PolyInter (One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception) [paper] [code]
SparseAlign (SparseAlign: A Fully Sparse Framework for Cooperative Object Detection) [paper] [~~code~~]
TraF-Align (TraF-Align: Trajectory-aware Feature Alignment for Asynchronous Multi-agent Perception) [paper] [code]
V2X-R (V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion) [paper] [code]

ICCV 2025

CoopTrack (CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception) [paper] [code]
CoST (CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective) [paper] [code]
MamV2XCalib (MamV2XCalib: V2X-Based Target-Less Infrastructure Camera Calibration with State Space Model) [paper] [code]
SlimComm (SlimComm: Doppler-Guided Sparse Queries for Bandwidth-Efficient Cooperative 3-D Perception) [paper] [~~code~~]
TurboTrain (TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction) [paper] [code]

ICLR 2025

CPPC (Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception) [paper&review] [~~code~~]
R&B-POP (Learning 3D Perception from Others' Predictions) [paper&review] [code]
STAMP (STAMP: Scalable Task- And Model-Agnostic Collaborative Perception) [paper&review] [code]

AAAI 2025

CP-Guard (CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception) [paper] [~~code~~]
DSRC (DSRC: Learning Density-Insensitive and Semantic-Aware Collaborative Representation against Corruptions) [paper] [code]
UniV2X (End-to-End Autonomous Driving through V2X Cooperation) [paper] [code]

ICRA 2025

CoDynTrust (CoDynTrust: Robust Asynchronous Collaborative Perception via Dynamic Feature Trust Modulus) [paper] [code]
CoopDETR (CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query) [paper] [~~code~~]
Co-MTP (Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous Driving) [paper] [code]
Direct-CP (Direct-CP: Directed Collaborative Perception for Connected and Autonomous Vehicles via Proactive Attention) [paper] [~~code~~]
V2X-DG (V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception) [paper] [~~code~~]

IROS 2025

CooPre (CooPre: Cooperative Pretraining for V2X Cooperative Perception) [paper] [code]
CRUISE (CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting) [paper] [code]

CVPR 2024

CoHFF (Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles) [paper] [~~code~~]
CoopDet3D (TUMTraf V2X Cooperative Perception Dataset) [paper] [code]
CodeFilling (Communication-Efficient Collaborative Perception via Information Filling with Codebook) [paper] [code]
ERMVP (ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments) [paper] [code]
MRCNet (Multi-Agent Collaborative Perception via Motion-Aware Robust Communication Network) [paper] [code]

NeurIPS 2024

V2X-Graph (Learning Cooperative Trajectory Representations for Motion Forecasting) [paper] [code]

ECCV 2024

Hetecooper (Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception) [paper] [~~code~~]
Infra-Centric CP (Rethinking the Role of Infrastructure in Collaborative Perception) [paper] [~~code~~]

ICLR 2024

HEAL (An Extensible Framework for Open Heterogeneous Collaborative Perception) [paper&review] [code]

AAAI 2024

CMiMC (What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception) [paper] [code]
DI-V2X (DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection) [paper] [code]
V2XFormer (DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving) [paper] [code]

WACV 2024

MACP (MACP: Efficient Model Adaptation for Cooperative Perception) [paper] [code]

ICRA 2024

DMSTrack (Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter) [paper] [code]
FreeAlign (Robust Collaborative Perception without External Localization and Clock Devices) [paper] [code]

CVPR 2023

{Related} BEVHeight (BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection) [paper] [code]
CoCa3D (Collaboration Helps Camera Overtake LiDAR in 3D Detection) [paper] [code]
FF-Tracking (V2X-Seq: The Large-Scale Sequential Dataset for the Vehicle-Infrastructure Cooperative Perception and Forecasting) [paper] [code]

NeurIPS 2023

CoBEVFlow (Robust Asynchronous Collaborative 3D Detection via Bird's Eye View Flow) [paper&review] [code]
FFNet (Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection) [paper&review] [code]
How2comm (How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception) [paper&review] [code]

ICCV 2023

CORE (CORE: Cooperative Reconstruction for Multi-Agent Perception) [paper] [code]
HM-ViT (HM-ViT: Hetero-Modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer) [paper] [code]
ROBOSAC (Among Us: Adversarially Robust Collaborative Perception by Consensus) [paper] [code]
SCOPE (Spatio-Temporal Domain Awareness for Multi-Agent Collaborative Perception) [paper] [code]
TransIFF (TransIFF: An Instance-Level Feature Fusion Framework for Vehicle-Infrastructure Cooperative 3D Detection with Transformers) [paper] [~~code~~]
UMC (UMC: A Unified Bandwidth-Efficient and Multi-Resolution Based Collaborative Perception Framework) [paper] [code]

ICLR 2023

{Related} CO3 (CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving) [paper&review] [code]

CoRL 2023

BM2CP (BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities) [paper&review] [code]

MM 2023

DUSA (DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception) [paper] [code]
FeaCo (FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions) [paper] [code]
What2comm (What2comm: Towards Communication-Efficient Collaborative Perception via Feature Decoupling) [paper] [~~code~~]

WACV 2023

AdaFusion (Adaptive Feature Fusion for Cooperative Perception Using LiDAR Point Clouds) [paper] [code]

ICRA 2023

CoAlign (Robust Collaborative 3D Object Detection in Presence of Pose Errors) [paper] [code]
{Related} DMGM (Deep Masked Graph Matching for Correspondence Identification in Collaborative Perception) [paper] [code]
Double-M Quantification (Uncertainty Quantification of Collaborative Detection for Self-Driving) [paper] [code]
MAMP (Model-Agnostic Multi-Agent Perception Framework) [paper] [code]
MATE (Communication-Critical Planning via Multi-Agent Trajectory Exchange) [paper] [~~code~~]
MPDA (Bridging the Domain Gap for Multi-Agent Perception) [paper] [code]
WNT (We Need to Talk: Identifying and Overcoming Communication-Critical Scenarios for Self-Driving) [paper] [~~code~~]

CVPR 2022

Coopernaut (COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles) [paper] [code]
{Related} LAV (Learning from All Vehicles) [paper] [code]
TCLF (DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection) [paper] [code]

NeurIPS 2022

Where2comm (Where2comm: Efficient Collaborative Perception via Spatial Confidence Maps) [paper&review] [code]

ECCV 2022

SyncNet (Latency-Aware Collaborative Perception) [paper] [code]
V2X-ViT (V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer) [paper] [code]

CoRL 2022

CoBEVT (CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers) [paper&review] [code]
STAR (Multi-Robot Scene Completion: Towards Task-Agnostic Collaborative Perception) [paper&review] [code]

IJCAI 2022

IA-RCP (Robust Collaborative Perception against Communication Interruption) [paper] [~~code~~]

MM 2022

CRCNet (Complementarity-Enhanced and Redundancy-Minimized Collaboration Network for Multi-agent Perception) [paper] [~~code~~]

ICRA 2022

AttFuse (OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication) [paper] [code]
MP-Pose (Multi-Robot Collaborative Perception with Graph Neural Networks) [paper] [~~code~~]

NeurIPS 2021

DiscoNet (Learning Distilled Collaboration Graph for Multi-Agent Perception) [paper&review] [code]

ICCV 2021

Adversarial V2V (Adversarial Attacks On Multi-Agent Communication) [paper] [~~code~~]

IROS 2021

MASH (Overcoming Obstructions via Bandwidth-Limited Multi-Agent Spatial Handshaking) [paper] [code]

CVPR 2020

When2com (When2com: Multi-Agent Perception via Communication Graph Grouping) [paper] [code]

ECCV 2020

DSDNet (DSDNet: Deep Structured Self-Driving Network) [paper] [~~code~~]
V2VNet (V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction) [paper] [code]

CoRL 2020

Robust V2V (Learning to Communicate and Correct Pose Errors) [paper] [code]

ICRA 2020

Who2com (Who2com: Collaborative Perception via Learnable Handshake Communication) [paper] [code]
MAIN (Enhancing Multi-Robot Perception via Learned Data Association) [paper] [~~code~~]

🔖Dataset and Simulator

Note: {Real} denotes that the sensor data is obtained by real-world collection instead of simulation.

Selected Preprint

Adver-City (Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions) [paper] [code] [project]
{Real} AGC-Drive (AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios) [paper] [code] [~~project~~]
AirV2X (AirV2X: Unified Air-Ground Vehicle-to-Everything Collaboration) [paper] [code] [project]
CP-GuardBench (CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception) [paper&review] [~~code~~] [~~project~~]
Griffin (Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark) [paper] [code] [project]
{Real} InScope (InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios) [paper] [code] [~~project~~]
{Real} Mixed Signals (Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration) [paper] [code] [project]
MobiVerse (MobiVerse: Scaling Urban Mobility Simulation with Hybrid Lightweight Domain-Specific Generator and Large Language Models) [paper] [code] [~~project~~]
Multi-V2X (Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception) [paper] [code] [~~project~~]
M3CAD (M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark) [paper] [code] [project]
OPV2V-N (RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-Based 3D Neural Modeling) [paper] [~~code~~] [~~project~~]
TalkingVehiclesGym (Towards Natural Language Communication for Cooperative Autonomous Driving via Self-Play) [paper] [code] [project]
TruckV2X (TruckV2X: A Truck-Centered Perception Dataset) [paper] [~~code~~] [project]
V2V-QA (V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models) [paper] [code] [project]
{Real} V2XPnP-Seq (V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction) [paper] [code] [project]
{Real} V2X-Radar (V2X-Radar: A Multi-Modal Dataset with 4D Radar for Cooperative Perception) [paper] [code] [project]
{Real} V2X-Real (V2X-Real: A Large-Scale Dataset for Vehicle-to-Everything Cooperative Perception) [paper] [~~code~~] [project]
{Real} V2X-ReaLO (V2X-ReaLO: An Open Online Framework and Dataset for Cooperative Perception in Reality) [paper] [~~code~~] [~~project~~]
WHALES (WHALES: A Multi-Agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving) [paper] [code] [project]

CVPR 2025

Mono3DVLT-V2X (Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking) [paper] [~~code~~] [~~project~~]
RCP-Bench (RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions) [paper] [code] [~~project~~]
V2X-R (V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion) [paper] [code] [~~project~~]

CVPR 2024

{Real} HoloVIC (HoloVIC: Large-Scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative) [paper] [~~code~~] [project]
{Real} Open MARS Dataset (Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset) [paper] [code] [project]
{Real} RCooper (RCooper: A Real-World Large-Scale Dataset for Roadside Cooperative Perception) [paper] [code] [project]
{Real} TUMTraf-V2X (TUMTraf V2X Cooperative Perception Dataset) [paper] [code] [project]

NeurIPS 2024

{Real} DAIR-V2X-Traj (Learning Cooperative Trajectory Representations for Motion Forecasting) [paper] [code] [project]

ECCV 2024

{Real} H-V2X (H-V2X: A Large Scale Highway Dataset for BEV Perception) [paper] [~~code~~] [~~project~~]

ICLR 2024

OPV2V-H (An Extensible Framework for Open Heterogeneous Collaborative Perception) [paper&review] [code] [project]

AAAI 2024

DeepAccident (DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving) [paper] [code] [project]

CVPR 2023

CoPerception-UAV+ (Collaboration Helps Camera Overtake LiDAR in 3D Detection) [paper] [code] [project]
OPV2V+ (Collaboration Helps Camera Overtake LiDAR in 3D Detection) [paper] [code] [project]
{Real} V2V4Real (V2V4Real: A Large-Scale Real-World Dataset for Vehicle-to-Vehicle Cooperative Perception) [paper] [code] [project]
{Real} DAIR-V2X-Seq (V2X-Seq: The Large-Scale Sequential Dataset for the Vehicle-Infrastructure Cooperative Perception and Forecasting) [paper] [code] [project]

NeurIPS 2023

IRV2V (Robust Asynchronous Collaborative 3D Detection via Bird's Eye View Flow) [paper&review] [~~code~~] [~~project~~]

ICCV 2023

Roadside-Opt (Optimizing the Placement of Roadside LiDARs for Autonomous Driving) [paper] [~~code~~] [~~project~~]

ICRA 2023

{Real} DAIR-V2X-C Complemented (Robust Collaborative 3D Object Detection in Presence of Pose Errors) [paper] [code] [project]
RLS (Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library) [paper] [code] [~~project~~]
V2XP-ASG (V2XP-ASG: Generating Adversarial Scenes for Vehicle-to-Everything Perception) [paper] [code] [~~project~~]

CVPR 2022

AutoCastSim (COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles) [paper] [code] [project]
{Real} DAIR-V2X (DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection) [paper] [code] [project]

NeurIPS 2022

CoPerception-UAV (Where2comm: Efficient Collaborative Perception via Spatial Confidence Maps) [paper&review] [code] [project]

ECCV 2022

V2XSet (V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer) [paper] [code] [project]

ICRA 2022

OPV2V (OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication) [paper] [code] [project]

ACCV 2022

DOLPHINS (DOLPHINS: Dataset for Collaborative Perception Enabled Harmonious and Interconnected Self-Driving) [paper] [code] [project]

ICCV 2021

V2X-Sim (V2X-Sim: Multi-Agent Collaborative Perception Dataset and Benchmark for Autonomous Driving) [paper] [code] [project]

CoRL 2017

CARLA (CARLA: An Open Urban Driving Simulator) [paper] [code] [project]


Little-Podi/Collaborative_Perception
