Skip to content
@om-ai-lab

Om AI Lab

Open Multimodal AGI Research

Popular repositories Loading

  1. VLM-R1 VLM-R1 Public

    Solve Visual Understanding with Reinforced VLMs

    Python 5.1k 310

  2. OmAgent OmAgent Public

    Build multimodal language agents for fast prototype and production

    Python 2.5k 272

  3. OmDet OmDet Public

    Real-time and accurate open-vocabulary end-to-end object detection

    Python 1.3k 110

  4. RS5M RS5M Public

    RS5M: a large-scale vision language dataset for remote sensing [TGRS]

    Python 260 11

  5. awesome-RSVLM awesome-RSVLM Public

    Collection of Remote Sensing Vision-Language Models

    137 4

  6. VL-CheckList VL-CheckList Public

    Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]

    Python 130 5

Repositories

Showing 10 of 17 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…