General Geometry-aware Weakly Supervised 3D Object Detection

This repo is the official implementation of ECCV24 paper General Geometry-aware (GGA) Weakly Supervised 3D Object Detection. Our GGA exhibits promising generalization capabilites, allowing it to be easily extend to various novel scenarios and classes. GGA achieves state-of-the-art performance on 2D bbox-supervised Monocular 3D object Detection. GGA is built on the codebase of MMDetection3D.

🔥News

-[24-07-04] Our GGA is accepted by ECCV'24 🎉🎉🎉, if you find it helpful, please give it a star.
-[24-07-18] Code of KITTI is released.

👀Overview

📘TODO

Release the code of KITTI.
Release the arxiv version.
Release the pseudo labels.
Release more detail results.

Notice

We are currently updating this repository due to a code reorganization. There may be some issues. Please feel free to report any problems in the issues section.

🏆Main Results

Outdoor Monocular 3D Object Detection (on KITTI test)

	AP_BEV			AP_3D
Model	Easy	Mod.	Hard	Easy	Mod.	Hard
PGD+GGA	17.42	10.21	8.09	10.42	6.08	4.65

Outdoor Monocular 3D Object Detection (on KITTI validation)

	AP_BEV			AP_3D
Model	Easy	Mod.	Hard	Easy	Mod.	Hard
MonoDETR+GGA	30.07	21.49	18.23	21.18	14.96	10.89

Indoor Point Cloud 3D Object Detection (on SUN-RGBD)

Model	bathtub	bed	bkshelf	chair	desk	dresser	nstand	sofa	table	toilet	mAP
FCAF3D+GGA	55.4	69.9	22.4	59.1	22.5	31.3	59.3	58.9	34.8	71.4	48.5

🚀Quick Start

Installation

conda create --name gga python=3.8 -y  
conda activate gga  
conda install pytorch torchvision -c pytorch  
pip install openmim  
mim install mmcv-full  
mim install mmdet  
mim install mmsegmentation  
git clone https://github.com/gwenzhang/GGA.git  
cd GGA  
pip install -e .

Data Preparation

KITTI

mmdetection3d
├── data
│   ├── kitti
│   │   ├── ImageSets
│   │   ├── testing
│   │   │   ├── calib
│   │   │   ├── image_2
│   │   │   ├── velodyne
│   │   ├── training
│   │   │   ├── calib
│   │   │   ├── image_2
│   │   │   ├── label_2
│   │   │   ├── velodyne

Generate the data infos by running the following command (it may take several hours):

cd GGA  
python ./tools/create_data_gga.py kitti --root_path ./data/kitti --out_dir ./data/kitti  
# Create dataset info file, and lidar pseudo database

The format of the generated data is as follows:

mmdetection3d
├── data
│   ├── kitti
│   │   ├── ImageSets
│   │   ├── testing
│   │   ├── training
│   │   ├── kitti_gt_database_GGA
│   │   ├── kitti_infos_train_GGA.pkl
│   │   ├── kitti_infos_val_GGA.pkl
│   │   ├── kitti_infos_trainval_GGA.pkl
│   │   ├── kitti_infos_test.pkl
│   │   ├── kitti_dbinfos_train_GGA.pkl
│   ├── kitti_GGA_split_file

Training GGA

./tools/dist_train.sh configs/gga/gga_kitti_config.py 8

Generate Pseudo 3D Labels

./tools/dist_pseudo.sh configs/gga/gga_kitti_matching_config.py {checkpoints} 8  --eval mAP  
python create_data_gga_retrain_mono.py kitti --root_path ./data/kitti --out_dir ./data/kitti

The format of the generated data is as follows:

mmdetection3d
├── data
│   ├── kitti
│   │   ├── ImageSets
│   │   ├── testing
│   │   ├── training
│   │   ├── kitti_gt_database_GGA
│   │   ├── kitti_infos_train_GGA.pkl
│   │   ├── kitti_infos_val_GGA.pkl
│   │   ├── kitti_infos_trainval_GGA.pkl
│   │   ├── kitti_infos_test.pkl
│   │   ├── kitti_dbinfos_train_GGA.pkl
│   │   ├── kitti_infos_trainval_GGA_mono3d.coco.json
│   │   ├── kitti_infos_test_mono3d.coco.json
│   ├── kitti_GGA_split_file

Retraining

./tools/dist_train.sh configs/gga/gga_pgd.py 8

Testing (Generate submission files)

./tools/dist_test.sh configs/gga/gga_pgd.py {checkpoint_dir} 8  --format-only --eval-options 'pklfile_prefix=./gga_results' 'submission_prefix=./gga_results'

Citation

Please consider citing our work as follows if it is helpful.

@article{zhang2024general,
  title={General Geometry-aware Weakly Supervised 3D Object Detection},
  author={Zhang, Guowen and Fan, Junsong and Chen, Liyi and Zhang, Zhaoxiang and Lei, Zhen and Zhang, Lei},
  booktitle={European Conference on Computer Vision},
  organization={Springer},
  year={2024},
}

Acknowledgments

GGA is based on MMDetection3D.
We also thank the FGR, MonoDETR, PDG, CenterPoint and FCAF3D authors for their efforts.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.dev_scripts		.dev_scripts
configs		configs
demo		demo
docker		docker
docs		docs
mmdet3d		mmdet3d
projects/example_project		projects/example_project
requirements		requirements
resources		resources
tests		tests
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
model-index.yml		model-index.yml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
temp.py		temp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

General Geometry-aware Weakly Supervised 3D Object Detection

🔥News

👀Overview

📘TODO

Notice

🏆Main Results

Outdoor Monocular 3D Object Detection (on KITTI test)

Outdoor Monocular 3D Object Detection (on KITTI validation)

Indoor Point Cloud 3D Object Detection (on SUN-RGBD)

🚀Quick Start

Installation

Data Preparation

KITTI

Training GGA

Generate Pseudo 3D Labels

Retraining

Testing (Generate submission files)

Citation

Acknowledgments

About

Releases

Packages

Languages

License

gwenzhang/GGA

Folders and files

Latest commit

History

Repository files navigation

General Geometry-aware Weakly Supervised 3D Object Detection

🔥News

👀Overview

📘TODO

Notice

🏆Main Results

Outdoor Monocular 3D Object Detection (on KITTI test)

Outdoor Monocular 3D Object Detection (on KITTI validation)

Indoor Point Cloud 3D Object Detection (on SUN-RGBD)

🚀Quick Start

Installation

Data Preparation

KITTI

Training GGA

Generate Pseudo 3D Labels

Retraining

Testing (Generate submission files)

Citation

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages