Getting Started

Mask RCNN for Human Pose Estimation

This repository includes the codes for evaluation, with some modifications to make most of the functions in the original codes work well for Keypoint Detection task.

The original code is from "https://github.com/Superlee506/Mask_RCNN_Humanpose" and "https://github.com/matterport/Mask_RCNN" on Python 3, Keras, and TensorFlow. The code reproduce the work of "https://arxiv.org/abs/1703.06870" for human pose estimation.

Problems

Low performance. The visualization of the keypoint detection seems okay but the evaluation results are much lower than the paper shows. I have tried trainning several times and the results were almost the same.
NOT supporting Multi-GPUs.

However RodrigoGantier's project has the following problems:

It's codes have few comments and still use the oringal names from @Matterport's project, which make the project hard to understand.
When I trained this model, I found it's hard to converge as described in issue#3.

Requirements

Python 3.5+
TensorFlow 1.4+
Keras 2.0.8+
Jupyter Notebook
Numpy, skimage, scipy, Pillow, cython, h5py

Getting Started

Please search "/home" to change the paths before you run any codes.
inference_humanpose.ipynb shows how to predict the keypoint of human using my trained model. It randomly chooses a image from the validation set. You can download pre-trained COCO weights for human pose estimation (mask_rcnn_coco_humanpose.h5) from the releases page (https://github.com/Superlee506/Mask_RCNN_Humanpose/releases).
train_humanpose.ipynb shows how to train the model step by step. You can also use "python train_humanpose.py" to start training.
inspect_humanpose.ipynb visulizes the proposal target keypoints to check it's validity. It also outputs some innner layers to help us debug the model.
demo_human_pose.ipynb A new demo for images input from the "images" folder. [04-11-2018]
video_demo.py A new demo for video input from camera.[04-11-2018]

Evaluation

Discussion

I convert the joint coordinates into an integer label ([0, 56*56)), and use tf.nn.sparse_softmax_cross_entropy_with_logits as the loss function. This refers to the original Detectron code which is key reason why my loss can converge quickly.
If you still want to use the keypoint mask as output, you'd better adopt the modified loss function proposed by @QtSignalProcessing in issue#2. Because after crop and resize, the keypoint masks may hava more than one 1 values, and this will make the original soft_cross entropy_loss hard to converge.
Althougth the loss converge quickly, the prediction results isn't as good as the oringal papers, especially for right or left shoulder, right or left knee, etc. I'm confused with it, so I release the code and any contribution or suggestion to this repository is welcome.

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
assets		assets
images		images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
coco.py		coco.py
config.py		config.py
demo_human_pose.ipynb		demo_human_pose.ipynb
human_pose.py		human_pose.py
inference_humanpose.ipynb		inference_humanpose.ipynb
inspect_humanpose.ipynb		inspect_humanpose.ipynb
model.py		model.py
parallel_model.py		parallel_model.py
train_human_pose.ipynb		train_human_pose.ipynb
train_shapes.ipynb		train_shapes.ipynb
utils.py		utils.py
video_demo.py		video_demo.py
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mask RCNN for Human Pose Estimation

Problems

However RodrigoGantier's project has the following problems:

Requirements

Getting Started

Evaluation

Discussion

About

Releases

Packages

Languages

License

BernieZhu/Mask_RCNN_Humanpose

Folders and files

Latest commit

History

Repository files navigation

Mask RCNN for Human Pose Estimation

Problems

However RodrigoGantier's project has the following problems:

Requirements

Getting Started

Evaluation

Discussion

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages