This repository contains several CNNs for semantic segmentation (U-Net, SegNet, ResNet, FractalNet) using Keras library. The code was developed assuming the use of depth data (e.g. Kinect, Asus Xtion Pro Live).
Implementation details for "Convolutional Networks for Semantic Heads Segmentation using Top-View RGB-D Data in Crowded Environment" accepted in Internation Conference on Pattern Recognition (ICPR), 2018.
Tested on: Ubuntu 16.04, Keras 2.2.2, Tensorflow 1.7.0, Python 3.5.2
This repo just contain some implementation details for Reproducible Label in Pattern Recognition, for the complete project please visit the original repository by Daniele Liciotti.
Dataset:
Provided data is processed by data.py
script. This script just loads the images and saves them into NumPy binary format files .npy
for faster loading later.
python3 data.py
The provided models are basically a convolutional auto-encoders.
python3 train_fractal_unet.py
python3 train_resnet.py
python3 train_segnet.py
python3 train_unet.py
python3 train_unet2.py
python3 train_unet3_conv.py
These deep neural network is implemented with Keras functional API.
Output from the networks is a 96 x 128 which represents mask that should be learned. Sigmoid activation function makes sure that mask pixels are in [0, 1] range.
You can test the online prediction with an OpenNI registration (.oni
file).
python3 online_prediction.py --v <oni_video_path>
Requirement for this is OpenNI2 installation: https://github.com/occipital/OpenNI2, then link the libOpenNI2.so and the OpenNI2 directory in the script path. Before launching the script create a folder predicted_images
.
sudo apt-get install python3-pip python3-dev python-virtualenv # for Python 3.n
virtualenv -p python3 deepseg
. deepseg/bin/activate
The preceding command should change your prompt to the following:
(deepseg)$
Install TensorFlow in the active virtualenv environment:
pip3 install --upgrade tensorflow-gpu # for Python 3.n and GPU
Install the others library:
pip3 install --upgrade keras scikit-learn scikit-image h5py opencv-python primesense
- Create a folder
raw
in the same filesystem level of the above python scripts - Download the dataset and extract all the images in a folder
raw/train
. - Run
python3 data.py
a folder "npy" will be created containig Numpy binary format npy files with traning and validation dataset. - Run the above python training and testing scripts, for example
python3 train_unet3_conv.py
- Log files with final results
log_conv_8.csv
andlog_conv_16.csv
will be created. - Predicted images for the test data will be stored in folders
preds_16
andpreds_8
- Daniele Liciotti | GitHub
- This work is partially inspired by the work of jocicmarko.