Updated Dependencies, Better Docker Support, and Segmentation Demo #480

tim-win · 2024-09-01T02:02:52Z

This PR introduces several significant improvements and updates to code surrounding the core YOLO-World project, addressing multiple issues and enhancing overall functionality and ease of use. When I finally got out of dependency hell, I decided to put down a ladder!

Key Changes

Dependency Updates:
- Updated dependencies to match the latest recommended versions from issue #364, including torch >2.0.0 (phew!).
- Upgraded to CUDA 12.1 to ensure compatibility with the latest GPU architectures and because thank god it works.
Docker Support:
- Took the existing Docker demo system under my wing and cleaned it right up: it automatically handles the mm* dependency issues everyone has run into, as well as torch and others required for the demo.
- Added a build_and_run.sh script for easy building and running of Docker containers with different model configurations, matching configs to models, so no one else needs the headache I have.
Segmentation Demo:
- Added demo/segmentation_demo.py to showcase YOLO-World's open vocabulary segmentation capabilities. The guts of which was stolen shamelessly form @onuralpszr 's excellent hugginface space, https://huggingface.co/spaces/onuralpszr/YOLO-World-Seg, which did not work but showed me enough to get this running.
- Integrated segmentation support into the Docker container, allowing for easy testing and demonstration of this feature.
Issue Resolutions:
- This PR covers much of the work done in #419, bringing it up to date as of August 2024.
- Implicitly fixes issues #279, #364, and #425.
Tested Configurations:
- Verified functionality with pretrain-x-1280ft, which performs excellently.
- Tested seg-l and seg-l-seghead configurations, which show good performance but really work well with my use case ( :/ )

Detailed Improvements

Refactored the Dockerfile for better efficiency and clarity.
Updated pyproject.toml and requirements files with pinned dependency versions.
Minor changes to configuration files, there were some local paths that needed to be removed.
Documentation the Docker-based demo workflow.

How to Use

Users can now easily run YOLO-World demos, including the new segmentation demo, using the provided Docker build system. For example:

./build_and_run.sh pretrain-x-1280ft  # For gradio object detection demo
./build_and_run.sh seg-l              # For segmentation demo

(note, while this is in MR, the fixes are not on master. So you have to replace this line in the dockerfile:

RUN git clone --recursive https://github.com/AILab-CVC/YOLO-World /yolo/

With this line:

RUN git clone --recursive https://github.com/tim-win/YOLO-World /yolo/

Hopefully this PR will save the people who come after me significant amounts of time. Feedback and further testing is welcome!

tim-win · 2024-09-03T22:45:46Z

Pinging @wondervictor as you may be able to review!

tim-win added 10 commits August 31, 2024 15:50

Basic

766dfd8

Reasonable facimile of working dependencies

d7bebb2

Use off the shelf clip

91543e4

Latest working dockerfile

76ffdc7

Experimental update to libraries

a34bae5

Fully featured build and run script

57b9244

Add basic segmentation demo support

512e9a1

Reference remote code to avoid duplicating build steps

797b293

Make it sort of work all together now

4041949

Cleanup MR so its a little more professional

bf71d2b

tim-win force-pushed the master branch from b466180 to bf71d2b Compare September 1, 2024 02:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated Dependencies, Better Docker Support, and Segmentation Demo #480

Updated Dependencies, Better Docker Support, and Segmentation Demo #480

tim-win commented Sep 1, 2024 •

edited

Loading

tim-win commented Sep 3, 2024

Updated Dependencies, Better Docker Support, and Segmentation Demo #480

Are you sure you want to change the base?

Updated Dependencies, Better Docker Support, and Segmentation Demo #480

Conversation

tim-win commented Sep 1, 2024 • edited Loading

Key Changes

Detailed Improvements

How to Use

tim-win commented Sep 3, 2024

tim-win commented Sep 1, 2024 •

edited

Loading