TRN

简介

@article{zhou2017temporalrelation,
    title = {Temporal Relational Reasoning in Videos},
    author = {Zhou, Bolei and Andonian, Alex and Oliva, Aude and Torralba, Antonio},
    journal={European Conference on Computer Vision},
    year={2018}
}

模型库

Something-Something V1

配置文件	分辨率	GPU 数量	主干网络	预训练	top1 准确率 (efficient/accurate)	top5 准确率 (efficient/accurate)	GPU 显存占用 (M)	ckpt	log	json
trn_r50_1x1x8_50e_sthv1_rgb	高 100	8	ResNet50	ImageNet	31.62 / 33.88	60.01 / 62.12	11010	ckpt	log	json

Something-Something V2

配置文件	分辨率	GPU 数量	主干网络	预训练	top1 准确率 (efficient/accurate)	top5 准确率 (efficient/accurate)	GPU 显存占用 (M)	ckpt	log	json
trn_r50_1x1x8_50e_sthv2_rgb	高 256	8	ResNet50	ImageNet	48.39 / 51.28	76.58 / 78.65	11010	ckpt	log	json

注：

这里的 GPU 数量 指的是得到模型权重文件对应的 GPU 个数。默认地，MMAction2 所提供的配置文件对应使用 8 块 GPU 进行训练的情况。依据线性缩放规则，当用户使用不同数量的 GPU 或者每块 GPU 处理不同视频个数时，需要根据批大小等比例地调节学习率。如，lr=0.01 对应 4 GPUs x 2 video/gpu，以及 lr=0.08 对应 16 GPUs x 4 video/gpu。
对于 Something-Something 数据集，有两种测试方案：efficient（对应 center crop x 1 clip）和 accurate（对应 Three crop x 2 clip）。
在原代码库中，作者在 Something-Something 数据集上使用了随机水平翻转，但这种数据增强方法有一些问题，因为 Something-Something 数据集有一些方向性的动作，比如从左往右推。所以 MMAction2 把随机水平翻转改为带标签映射的水平翻转，同时修改了测试模型的数据处理方法，即把裁剪 10 个图像块（这里面包括 5 个翻转后的图像块）修改成采帧两次 & 裁剪 3 个图像块。
MMAction2 使用 ResNet50 代替 BNInception 作为 TRN 的主干网络。使用原代码，在 sthv1 数据集上训练 TRN-ResNet50 时，实验得到的 top1 (top5) 的准确度为 30.542 (58.627)，而 MMAction2 的精度为 31.62 (60.01)。

关于数据处理的更多细节，用户可以参照

准备 sthv1
准备 sthv2

如何训练

用户可以使用以下指令进行模型训练。

python tools/train.py ${CONFIG_FILE} [optional arguments]

例如：以一个确定性的训练方式，辅以定期的验证过程进行 TRN 模型在 sthv1 数据集上的训练。

python tools/train.py configs/recognition/trn/trn_r50_1x1x8_50e_sthv1_rgb.py \
    --work-dir work_dirs/trn_r50_1x1x8_50e_sthv1_rgb \
    --validate --seed 0 --deterministic

更多训练细节，可参考基础教程中的 训练配置 部分。

如何测试

用户可以使用以下指令进行模型测试。

python tools/test.py ${CONFIG_FILE} ${CHECKPOINT_FILE} [optional arguments]

例如：在 sthv1 数据集上测试 TRN 模型，并将结果导出为一个 json 文件。

python tools/test.py configs/recognition/trn/trn_r50_1x1x8_50e_sthv1_rgb.py \
    checkpoints/SOME_CHECKPOINT.pth --eval top_k_accuracy mean_class_accuracy \
    --out result.json

更多测试细节，可参考基础教程中的 测试某个数据集 部分。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_zh-CN.md

README_zh-CN.md

TRN

简介

模型库

Something-Something V1

Something-Something V2

如何训练

如何测试

Files

README_zh-CN.md

Latest commit

History

README_zh-CN.md

File metadata and controls

TRN

简介

模型库

Something-Something V1

Something-Something V2

如何训练

如何测试