To conduct human evaluation, we need to generate various samples. We provide many prompts in assets/texts
, and defined some test setting covering different resolution, duration and aspect ratio in eval/sample.sh
. To facilitate the usage of multiple GPUs, we split sampling tasks into several parts.
# image (1)
bash eval/sample.sh /path/to/ckpt -1
# video (2a 2b 2c ...)
bash eval/sample.sh /path/to/ckpt -2a
# launch 8 jobs at once (you must read the script to understand the details)
bash eval/launch.sh /path/to/ckpt
VBench is a benchmark for short text to video generation. We provide a script for easily generating samples required by VBench.
# vbench tasks (4a 4b 4c ...)
bash eval/sample.sh /path/to/ckpt -4a
# launch 8 jobs at once (you must read the script to understand the details)
bash eval/launch.sh /path/to/ckpt
After generation, install the VBench package according to their instructions. Then, run the following commands to evaluate the generated samples.
bash eval/vbench/vbench.sh /path/to/video_folder
VBench-i2v is a benchmark for short image to video generation (beta version).
TBD