Official implementation of "Olympus: A Universal Task Router for Computer Vision Tasks"
- [ ] Release the training code.
- [ ] Release the Olympus datasets.
- [ ] Release the inference code of Olympus.
- [x] Release the weights of Olympus.
We introduce Olympus, a new approach that turns Multimodal Large Language Models (MLLMs) into a unified framework capable of handling a wide array of computer vision tasks. Using a controller MLLM, Olympus delegates over 20 specialized tasks across images, videos, and 3D objects to dedicated modules. This instruction-based routing enables complex workflows through chained actions without requiring the training of heavy generative models. Olympus integrates easily with existing MLLMs, expanding their capabilities while maintaining comparable performance. Experimental results show that Olympus achieves an average routing accuracy of 94.75% across 20 tasks and a precision of 91.82% in chained-action scenarios, demonstrating its effectiveness as a universal task router for a diverse range of computer vision tasks.
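To make the routing idea concrete, here is a minimal sketch of an instruction-based dispatcher. It is hypothetical and not the repository's actual API: it assumes the controller MLLM emits routing tokens such as `<image_gen>` followed by a braced prompt, and it maps each token to a registered specialist module so that several actions can be chained within a single response. All names (`MODULES`, `register`, `route`) and the token format are illustrative assumptions.

```python
import re
from typing import Callable, Dict, List

# Hypothetical registry mapping routing tokens to specialist modules.
MODULES: Dict[str, Callable[[str], str]] = {}

def register(token: str):
    """Register a specialist module under a routing token."""
    def wrap(fn: Callable[[str], str]) -> Callable[[str], str]:
        MODULES[token] = fn
        return fn
    return wrap

@register("<image_gen>")
def image_generation(prompt: str) -> str:
    # Stand-in for a dedicated generative model.
    return f"[generated image for: {prompt}]"

@register("<image_edit>")
def image_editing(prompt: str) -> str:
    # Stand-in for a dedicated editing model.
    return f"[edited image per: {prompt}]"

def route(controller_output: str) -> List[str]:
    """Dispatch each routing token emitted by the controller MLLM to its
    module, in order, so chained actions execute as one workflow."""
    results: List[str] = []
    for token, prompt in re.findall(r"(<\w+>)\s*\{(.*?)\}", controller_output):
        module = MODULES.get(token)
        if module is None:
            raise KeyError(f"no module registered for routing token {token}")
        results.append(module(prompt))
    return results

# Example of a chained action: the controller rewrites a user request
# into a sequence of token-prefixed prompts.
print(route("<image_gen> {a cat on a skateboard} <image_edit> {make the scene snowy}"))
```

Because the controller only predicts routing tokens and prompts, adding a new capability under this scheme amounts to registering another module rather than retraining a heavy generative model.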
If you find our work useful in your research or applications, please consider citing our paper using the following BibTeX:
@article{lin2024olympus,
  title={Olympus: A Universal Task Router for Computer Vision Tasks},
  author={Lin, Yuanze and Li, Yunsheng and Chen, Dongdong and Xu, Weijian and Clark, Ronald and Torr, Philip HS},
  journal={arXiv preprint arXiv:2412.09612},
  year={2024}
}