Relation Regularized Scene Graph Generation

This repository contains data and code for the paper “Relation Regularized Scene Graph Generation”. This code is based on the neural-motifs.

Framework

Abstract

Scene graph generation (SGG) is built on top of detected objects to predict object pairwise visual relations for describing the image content abstraction. Existing works have revealed that if the links between objects are given as prior knowledge, the performance of SGG is significantly improved. Inspired by this observation, in this paper, we propose a Relation Regularized Network (R2-Net), which can predict whether there is a relationship between two objects and encode this relation into object feature refinement and better SGG. Specifically, we first construct an affinity matrix among detected objects to represent the probability of a relationship between two objects. Graph Convolution Networks (GCNs) over this relation affinity matrix are then used as object encoders, producing relation-regularized representations of objects. With these relation-regularized features, our R2-Net can effectively refine object labels and generate scene graphs. Extensive experiments are conducted on the Visual Genome dataset for three SGG tasks (\ie PREDCLS, SGSLS, and SGDET), demonstrating the effectiveness of our proposed method. Ablation studies also verify the key roles of our proposed components in performance improvement.

Setup

Install python3.6 and pytorch 3. I recommend the Anaconda distribution. To install PyTorch if you haven't already, use conda install pytorch torchvision cuda90 -c pytorch.
Update the config file with the dataset paths. Specifically:
- Visual Genome (the VG_100K folder, image_data.json, VG-SGG.h5, and VG-SGG-dicts.json). See data/stanford_filtered/README.md for the steps I used to download these.
- You'll also need to fix your PYTHONPATH: export PYTHONPATH=/home/guoyuyu/code/scene-graph
Compile everything. run make in the main directory: this compiles the Bilinear Interpolation operation for the RoIs as well as the Highway LSTM.
Pretrain VG detection. The old version involved pretraining COCO as well, but we got rid of that for simplicity. Run ./scripts/pretrain_detector.sh Note: You might have to modify the learning rate and batch size, particularly if you don't have 3 Titan X GPUs (which is what I used). You can also download the pretrained detector checkpoint here.
Train VG scene graph classification: run ./scripts/train_models_sgcls.sh.
Refine for detection: run ./scripts/refine_for_detection.sh.
Evaluate: Refer to the scripts ./scripts/eval_models_sg[cls/det].sh.

Help

Feel free to ping me if you encounter trouble getting it to work!

Bibtex

@ARTICLE{9376912,
  author={Guo, Yuyu and Gao, Lianli and Song, Jingkuan and Wang, Peng and Sebe, Nicu and Shen, Heng Tao and Li, Xuelong},
  journal={IEEE Transactions on Cybernetics}, 
  title={Relation Regularized Scene Graph Generation}, 
  year={2021},
  volume={},
  number={},
  pages={1-12},
  doi={10.1109/TCYB.2021.3052522}}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
__pycache__		__pycache__
data/stanford_filtered		data/stanford_filtered
dataloaders		dataloaders
docs		docs
experiments		experiments
lib		lib
log		log
misc		misc
models		models
output		output
scripts		scripts
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
__background__		__background__
config.py		config.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Relation Regularized Scene Graph Generation

Framework

Abstract

Setup

Help

Bibtex

About

Releases

Packages

Languages

License

gyy8426/R2-Net

Folders and files

Latest commit

History

Repository files navigation

Relation Regularized Scene Graph Generation

Framework

Abstract

Setup

Help

Bibtex

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages