Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Possible implementation in Google Colab. #46

Open
francescomarino10 opened this issue Oct 14, 2022 · 1 comment
Open

Possible implementation in Google Colab. #46

francescomarino10 opened this issue Oct 14, 2022 · 1 comment

Comments

@francescomarino10
Copy link

francescomarino10 commented Oct 14, 2022

I would need to generate a caption in a video, what would be the easiest way to use it? I don't have to train it, just use it to generate this caption.
I haven't found a working example in the documentation.
Could it possibly be possible to have a colab notebook ready to use?

@YehLi
Copy link
Owner

YehLi commented Oct 21, 2022

The project is mainly focused on training. To generate a caption from a raw video, you can refer to configs/image_caption/transformer/clip_transformer_test_raw.yaml and write a new video loader (similar to MSCoCoRawDataset). The dataloader needs to parse the video into frames and extract the features of the frames.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants