Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
imvoxelnet_2xb4_sunrgbd-3d-10class.py		imvoxelnet_2xb4_sunrgbd-3d-10class.py
imvoxelnet_8xb4_kitti-3d-car.py		imvoxelnet_8xb4_kitti-3d-car.py
metafile.yml		metafile.yml

README.md

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Abstract

In this paper, we introduce the task of multi-view RGB-based 3D object detection as an end-to-end optimization problem. To address this problem, we propose ImVoxelNet, a novel fully convolutional method of 3D object detection based on posed monocular or multi-view RGB images. The number of monocular images in each multiview input can variate during training and inference; actually, this number might be unique for each multi-view input. ImVoxelNet successfully handles both indoor and outdoor scenes, which makes it general-purpose. Specifically, it achieves state-of-the-art results in car detection on KITTI (monocular) and nuScenes (multi-view) benchmarks among all methods that accept RGB images. Moreover, it surpasses existing RGB-based 3D object detection methods on the SUN RGB-D dataset. On ScanNet, ImVoxelNet sets a new benchmark for multi-view 3D object detection.

Introduction

We implement a monocular 3D detector ImVoxelNet and provide its results and checkpoints on KITTI dataset. Results for SUN RGB-D, ScanNet and nuScenes are currently available in ImVoxelNet authors repo (based on mmdetection3d).

Results and models

KITTI

Backbone	Class	Lr schd	Mem (GB)	Inf time (fps)	mAP	Download
ResNet-50	Car	3x			17.26	model \| log

SUN RGB-D

Backbone	Lr schd	Mem (GB)	Inf time (fps)	mAP@0.25	mAP@0.5	Download
ResNet-50	2x	7.2	22.5	40.96	13.50	model \| log

Citation

@article{rukhovich2021imvoxelnet,
  title={ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection},
  author={Danila Rukhovich, Anna Vorontsova, Anton Konushin},
  journal={arXiv preprint arXiv:2106.01178},
  year={2021}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

imvoxelnet

imvoxelnet

README.md

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Abstract

Introduction

Results and models

KITTI

SUN RGB-D

Citation

Files

imvoxelnet

Directory actions

More options

Directory actions

More options

Latest commit

History

imvoxelnet

Folders and files

parent directory

README.md

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Abstract

Introduction

Results and models

KITTI

SUN RGB-D

Citation