Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

[New Model]: Cohere2 (Command R7B) #11181

Closed
1 task done
fizzAI opened this issue Dec 13, 2024 · 1 comment
Closed
1 task done

[New Model]: Cohere2 (Command R7B) #11181

fizzAI opened this issue Dec 13, 2024 · 1 comment
Labels
new model Requests to new models

Comments

@fizzAI
Copy link

fizzAI commented Dec 13, 2024

The model to consider.

https://huggingface.co/CohereForAI/c4ai-command-r7b-12-2024

The closest model vllm already supports.

Likely either the original Cohere (for. obvious reasons) or Gemma2 (as it also has a funky SWA architecture)

What's your difficulty of supporting the model you want?

It uses SWA, but this can likely be ditched to get MVP inference working ala how gemma 2 was done
For some reason every 4th layer uses global attention without positional embeddings? Not sure how or why that one works tbh

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
@fizzAI fizzAI added the new model Requests to new models label Dec 13, 2024
@fizzAI fizzAI changed the title [New Model]: Cohere2 (Command R7B [New Model]: Cohere2 (Command R7B) Dec 13, 2024
@Isotr0py
Copy link
Collaborator

Closed as #11203 merged.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
new model Requests to new models
Projects
None yet
Development

No branches or pull requests

2 participants