[Improvement] Add a tool to find invalid videos. #907

irvingzhang0512 · 2021-06-07T02:51:11Z

Motivation

Fix #893
Check all samples from a video dataset specified by the configuration file, and save file paths of invalid videos(corrupted or missing) in an output file.

Modification

Add a script in tools and update related usefull tool docs.

Use cases (Optional)

Use decord to decode all videos from the train set of /path/to/config.py with 5 processes, save all invalid video paths in invalid_videos.txt

python tools/check_videos.py /path/to/config.py \
    --decoder decord \
    --split train \
    --output-file invalid_videos.txt \
    --num-processes 5

Use opencv to decode all videos from the test set of /path/to/config.py with 10 processes, save all invalid video paths in invalid_videos.txt and remove all corrupted videos.

python tools/check_videos.py /path/to/config.py \
    --decoder opencv \
    --split test \
    --output-file invalid_videos.txt \
    --remove-corrupted-videos \
    --num-processes 10

TODO

Read dataset configs from config file.
Choose video decoder by --decoder
Check video by opening video file and read first, last and 3 random frames.
Multiprocessing
Generate a file list of invalid(corrupted or missing) video paths
Optional remove all corrupted videos.

codecov · 2021-06-07T03:29:01Z

Codecov Report

Merging #907 (26f8ebe) into master (f007661) will decrease coverage by 0.05%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master     #907      +/-   ##
==========================================
- Coverage   83.58%   83.53%   -0.06%     
==========================================
  Files         132      132              
  Lines        9977     9977              
  Branches     1720     1720              
==========================================
- Hits         8339     8334       -5     
- Misses       1219     1222       +3     
- Partials      419      421       +2

Flag	Coverage Δ
unittests	`83.53% <ø> (-0.06%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
mmaction/core/evaluation/accuracy.py	`92.27% <0.00%> (-0.91%)`	⬇️
mmaction/datasets/pipelines/augmentations.py	`92.41% <0.00%> (-0.35%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f007661...26f8ebe. Read the comment docs.

kennymckormick · 2021-06-07T06:09:39Z

Great tool. However, I think it can still be improved from two aspects:

Even if you open a video successfully with a backend, it doesn't necessarily mean you can read frames from it properly. Perhaps you need to read a frame from the video to validate.
Sequentially checking each video may be very slow, can you check videos in the dataset in parallel?

innerlee · 2021-06-07T07:04:55Z

Yeah try to read the first, last and random three frames

irvingzhang0512 added 7 commits June 7, 2021 09:44

add tool

9d5cabc

update docs

a6d919c

polish script

911d3e1

polish

dd60546

polish

2c98e8b

polish

ceeebf8

fix a bug

96ade26

irvingzhang0512 added 5 commits June 7, 2021 15:56

check first/last frames and 3 random frames

408b2f6

pool

2b83840

polish

1ece7ee

update docs

19b4d1e

update changelog

cf5ef0f

dreamerlin changed the title ~~[Improvment] Add a tool to find invalid videos.~~ [Improvement] Add a tool to find invalid videos. Jun 9, 2021

kennymckormick approved these changes Jun 10, 2021

View reviewed changes

Merge branch 'master' into check-video-tool

26f8ebe

kennymckormick merged commit 4368ef3 into open-mmlab:master Jun 13, 2021

irvingzhang0512 deleted the check-video-tool branch June 14, 2021 16:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Improvement] Add a tool to find invalid videos. #907

[Improvement] Add a tool to find invalid videos. #907

irvingzhang0512 commented Jun 7, 2021 •

edited

Loading

codecov bot commented Jun 7, 2021 •

edited

Loading

kennymckormick commented Jun 7, 2021

innerlee commented Jun 7, 2021

[Improvement] Add a tool to find invalid videos. #907

[Improvement] Add a tool to find invalid videos. #907

Conversation

irvingzhang0512 commented Jun 7, 2021 • edited Loading

Motivation

Modification

Use cases (Optional)

TODO

codecov bot commented Jun 7, 2021 • edited Loading

Codecov Report

kennymckormick commented Jun 7, 2021

innerlee commented Jun 7, 2021

irvingzhang0512 commented Jun 7, 2021 •

edited

Loading

codecov bot commented Jun 7, 2021 •

edited

Loading