Feature Extraction

We used the scripts in this repository to extract features from the collected data.

The feature extractors are mainly video recognition models and multimodal understanding models.

Our features fall into two main categories:

Segment features
Frame features
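
To make the distinction between the two categories concrete, here is a minimal sketch of how a video's frames could be grouped for each one. The function names, the stride, and the fixed segment length are assumptions chosen for illustration; the actual scripts may sample frames and segments differently.

```python
from typing import List, Tuple


def frame_indices(num_frames: int, stride: int = 1) -> List[int]:
    """Indices of the frames that would each get their own feature vector."""
    return list(range(0, num_frames, stride))


def segment_windows(num_frames: int, segment_len: int) -> List[Tuple[int, int]]:
    """Half-open [start, end) windows; each window would yield one feature vector.

    The last window is shorter when num_frames is not a multiple of segment_len.
    """
    return [
        (start, min(start + segment_len, num_frames))
        for start in range(0, num_frames, segment_len)
    ]


if __name__ == "__main__":
    # Hypothetical 90-frame clip split into 30-frame segments.
    print(len(frame_indices(90)))
    print(segment_windows(90, 30))
```

In this sketch a frame feature describes a single image, while a segment feature summarizes a short window of consecutive frames.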

Segment Features

We have used the following models to extract segment features:

SlowFast
3DResNet
Omnivore
X3D
Imagebind

Frame Features

We have used the following models to extract frame features:

Omnivore
Imagebind
TSM

Multimodal Understanding Features

Depth - Imagebind
Audio - Imagebind
Text - (Video --> Lavila --> Imagebind)

How to extract features

Run scripts

Download the data using the downloader script.
Place the data in folders and update the corresponding paths in the scripts.
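
The path setup described above can be sketched as follows. This is only an illustration: `plan_outputs`, the video extensions, and the `<output_dir>/<model_name>/<video name>` layout are assumptions, not the layout the scripts in this repository necessarily use.

```python
from pathlib import Path
from typing import Dict

# Assumption: the downloaded recordings are stored as ordinary video files.
VIDEO_EXTENSIONS = {".mp4", ".mkv"}


def plan_outputs(video_dir: Path, output_dir: Path, model_name: str) -> Dict[Path, Path]:
    """Map each video in video_dir to a hypothetical per-model output folder."""
    plan: Dict[Path, Path] = {}
    for video in sorted(video_dir.iterdir()):
        if video.suffix.lower() in VIDEO_EXTENSIONS:
            # One folder per video, grouped under the extractor's name.
            plan[video] = output_dir / model_name / video.stem
    return plan
```

Calling `plan_outputs(Path("data/videos"), Path("data/features"), "omnivore")` with your own folders gives a quick check that the paths you set in the scripts point at the data you expect before starting a long extraction run.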

Use extracted features

You can download all the extracted features used to train all our models from here
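
Once the features are downloaded, it can help to index them by recording before training. The sketch below assumes a hypothetical layout of `<root>/<model>/<recording_id>.<ext>`; the real archive may be organized differently, so adjust the traversal to match what you download.

```python
from collections import defaultdict
from pathlib import Path
from typing import Dict


def index_features(root: Path) -> Dict[str, Dict[str, Path]]:
    """Return {recording_id: {model_name: feature_file}} for the assumed layout."""
    index: Dict[str, Dict[str, Path]] = defaultdict(dict)
    for model_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for feature_file in sorted(model_dir.iterdir()):
            if feature_file.is_file():
                # The file stem is treated as the recording id (an assumption).
                index[feature_file.stem][model_dir.name] = feature_file
    return dict(index)
```

An index like this makes it easy to spot recordings that are missing features from one of the models.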

