CVPR 2023
MarS3D is a plug-and-play motion-aware module for semantic segmentation on multi-scan 3D point clouds. Extensive experiments show that MarS3D improves the performance of baseline models by a large margin.
- A plug-and-play module that can be flexibly integrated with mainstream single-scan segmentation models (see the illustrative sketch after this list).
- A Cross-Frame Feature Embedding for temporal information preservation.
- A BEV-based Motion-Aware Feature Learning module to exploit temporal information and enhance the model’s motion awareness.
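For intuition, here is a minimal, hypothetical PyTorch sketch of how a plug-and-play motion-aware wrapper could sit on top of a single-scan backbone. All class, method, and dimension names are illustrative assumptions, not the repository's actual API; the real Cross-Frame Feature Embedding and BEV-based Motion-Aware Feature Learning modules are more involved.

# Hypothetical sketch only -- names and shapes are illustrative, not the repo's API.
import torch
import torch.nn as nn

class MotionAwareWrapper(nn.Module):
    # Wraps an arbitrary single-scan backbone (e.g. SPVCNN / MinkUNet / SparseUNet).
    def __init__(self, backbone, feat_dim, num_classes, num_frames=8):
        super().__init__()
        self.backbone = backbone
        # Stand-in for the Cross-Frame Feature Embedding: a learned per-frame tag.
        self.temporal_embed = nn.Embedding(num_frames, feat_dim)
        self.head = nn.Linear(2 * feat_dim, num_classes)

    def forward(self, points_per_frame, frame_ids):
        # Per-frame point features from the single-scan backbone: list of (N_i, feat_dim).
        feats = [self.backbone(p) for p in points_per_frame]
        # Tag each frame's features with its temporal embedding.
        feats = [f + self.temporal_embed(torch.tensor(i)) for f, i in zip(feats, frame_ids)]
        # Stand-in for the motion-aware branch: pooled cross-frame context.
        context = torch.stack([f.mean(dim=0) for f in feats]).mean(dim=0)
        ref = feats[0]  # reference frame's point features
        fused = torch.cat([ref, context.expand_as(ref)], dim=-1)
        return self.head(fused)  # per-point class logits for the reference frame

# Usage with a dummy backbone: 4 input channels (x, y, z, remission) -> 32-d features.
model = MotionAwareWrapper(nn.Linear(4, 32), feat_dim=32, num_classes=26)
logits = model([torch.randn(100, 4), torch.randn(120, 4)], frame_ids=[0, 1])

The key design point this sketch mirrors is that the backbone itself is untouched, which is what makes the module plug-and-play across different single-scan architectures.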
Performance improvement over baselines on the public SemanticKITTI multi-scan validation set.
| Method | mIoU | #Params |
| --- | --- | --- |
| SPVCNN | 49.70% | 21.8M |
| SPVCNN+MarS3D | 54.66% | 21.9M |
| SparseUNet | 48.99% | 39.2M |
| SparseUNet+MarS3D | 54.64% | 39.3M |
| MinkUNet | 48.47% | 37.9M |
| MinkUNet+MarS3D | 54.71% | 38.0M |
Comparison with state-of-the-art models on the SemanticKITTI multi-scan benchmark.
| Method | mIoU |
| --- | --- |
| SpSequenceNet | 43.1% |
| TemporalLidarSeg | 47.0% |
| TemporalLatticeNet | 47.1% |
| Meta-RangeSeg | 49.5% |
| KPConv | 51.2% |
| SPVCNN | 49.2% |
| SPVCNN+MarS3D | 52.7% |
- Ubuntu: 18.04+
- CUDA: 11.x
conda create -n mars3d python=3.9 -y
conda activate mars3d
conda install ninja -y
conda install pytorch torchvision torchaudio cudatoolkit -c pytorch -y # follow the instructions on the official PyTorch website to match your CUDA version
conda install -c anaconda h5py pyyaml -y
conda install -c conda-forge sharedarray tensorboardx yapf addict einops scipy plyfile -y
pip install tqdm
pip install numpy==1.23 # pin NumPy below 1.25 to avoid a version conflict
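Optionally, sanity-check the environment (a generic PyTorch check, not a project script):
python -c "import torch; print(torch.__version__, torch.version.cuda, torch.cuda.is_available())"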
# Please follow the official installation instructions of each backbone's repo:
# Minkowski Engine: https://github.com/NVIDIA/MinkowskiEngine
# spconv: https://github.com/traveller59/spconv
# torchsparse: https://github.com/mit-han-lab/torchsparse
# torchsparse: an alternative installation that avoids sudo apt install
conda install google-sparsehash -c bioconda
export C_INCLUDE_PATH=${CONDA_PREFIX}/include:$C_INCLUDE_PATH
export CPLUS_INCLUDE_PATH=${CONDA_PREFIX}/include:$CPLUS_INCLUDE_PATH
pip install --upgrade git+https://github.com/mit-han-lab/torchsparse.git
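After the build finishes, a plain import is enough to verify it (generic check, not a project script):
python -c "import torchsparse; print('torchsparse OK')"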
SemanticKITTI
- Download the SemanticKITTI dataset from the official website (http://semantic-kitti.org/dataset.html).
- The dataset file structure is listed as follows:
./
├── ...
└── path_to_semantic_kitti/
    └── sequences/
        ├── 00/
        │   ├── velodyne/
        │   │   ├── 000000.bin
        │   │   ├── 000001.bin
        │   │   └── ...
        │   └── labels/
        │       ├── 000000.label
        │       ├── 000001.label
        │       └── ...
        ├── 08/ # 08 is the validation set
        ├── 11/ # 11-21 are the test set
        ├── ...
        └── 21/
- Symlink the dataset path into the repository as follows:
mkdir data
ln -s /path_to_semantic_kitti data/semantic_kitti
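For reference, each scan/label pair can be read with NumPy as follows. This is a minimal sketch based on the published SemanticKITTI format (float32 x/y/z/remission scans, uint32 labels whose lower 16 bits encode the semantic class); the path is just the example from the tree above.

import numpy as np

# Each velodyne scan is a flat float32 array of (x, y, z, remission) tuples.
scan = np.fromfile("data/semantic_kitti/sequences/00/velodyne/000000.bin",
                   dtype=np.float32).reshape(-1, 4)

# Each label is a uint32: lower 16 bits = semantic class, upper 16 bits = instance id.
label = np.fromfile("data/semantic_kitti/sequences/00/labels/000000.label",
                    dtype=np.uint32)
semantic = label & 0xFFFF
instance = label >> 16
assert scan.shape[0] == label.shape[0]  # one label per point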
Start training with:
sh scripts/train.sh -p INTERPRETER_PATH -d DATASET -c CONFIG_NAME -n EXP_NAME
For example:
sh scripts/train.sh -p python -d semantic_kitti -c spvcnn-mh -n mars3d-spvcnn
Running the test script requires the exp folder generated by the training script.
In addition, the checkpoint path currently has to be specified in test.py.
sh scripts/test.sh -p INTERPRETER_PATH -d DATASET -c CONFIG_NAME -n EXP_NAME
For example:
sh scripts/test.sh -p python -d semantic_kitti -c spvcnn-mh -n mars3d-spvcnn
If you find this project useful in your research, please consider citing:
@inproceedings{liu2023mars3d,
title={MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds},
author={Liu, Jiahui and Chang, Chirui and Liu, Jianhui and Wu, Xiaoyang and Ma, Lan and Qi, Xiaojuan},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={9372--9381},
year={2023}
}