docs/source/models/supported_models.rst (+53 -26)
@@ -3,7 +3,7 @@
 Supported Models
 ================
 
-vLLM supports a variety of generative Transformer models in `HuggingFace Transformers <https://huggingface.co/models>`_.
+vLLM supports a variety of generative Transformer models in `HuggingFace (HF) Transformers <https://huggingface.co/models>`_.
 The following is the list of model architectures that are currently supported by vLLM.
 Alongside each architecture, we include some popular models that use it.
 
@@ -19,7 +19,7 @@ Text Generation
 
 * - Architecture
 - Models
-- Example HuggingFace Models
+- Example HF Models
 - :ref:`LoRA <lora>`
 - :ref:`PP <distributed_serving>`
 * - :code:`AquilaForCausalLM`
@@ -280,7 +280,7 @@ Text Embedding
 
 * - Architecture
 - Models
-- Example HuggingFace Models
+- Example HF Models
 - :ref:`LoRA <lora>`
 - :ref:`PP <distributed_serving>`
 * - :code:`Gemma2Model`
@@ -303,7 +303,7 @@ Reward Modeling
 
 * - Architecture
 - Models
-- Example HuggingFace Models
+- Example HF Models
 - :ref:`LoRA <lora>`
 - :ref:`PP <distributed_serving>`
 * - :code:`Qwen2ForRewardModel`
@@ -316,86 +316,93 @@ Reward Modeling
 As an interim measure, these models are supported via Embeddings API. See `this RFC <https://github.com/vllm-project/vllm/issues/8967>`_ for upcoming changes.
 
 Multimodal Language Models
-^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+The following modalities are supported depending on the model:
+
+- **T**\ ext
+- **I**\ mage
+- **V**\ ideo
+- **A**\ udio
 
 .. _supported_vlms:
 
 Text Generation
 ---------------
 
 .. list-table::
-:widths: 25 25 25 25 5 5
+:widths: 25 25 15 25 5 5
 :header-rows: 1
 
 * - Architecture
 - Models
-- Modalities
-- Example HuggingFace Models
+- Inputs
+- Example HF Models
 - :ref:`LoRA <lora>`
 - :ref:`PP <distributed_serving>`
 * - :code:`Blip2ForConditionalGeneration`
 - BLIP-2
-- Image\ :sup:`E`
+- T + I\ :sup:`E`
 - :code:`Salesforce/blip2-opt-2.7b`, :code:`Salesforce/blip2-opt-6.7b`, etc.
 -
 - ✅︎
 * - :code:`ChameleonForConditionalGeneration`
 - Chameleon
-- Image
+- T + I
 - :code:`facebook/chameleon-7b` etc.
 -
 - ✅︎
 * - :code:`FuyuForCausalLM`
 - Fuyu
-- Image
+- T + I
 - :code:`adept/fuyu-8b` etc.
 -
 - ✅︎
 * - :code:`ChatGLMModel`
 - GLM-4V
-- Image
+- T + I
 - :code:`THUDM/glm-4v-9b` etc.
 -
 - ✅︎
 * - :code:`InternVLChatModel`
 - InternVL2
-- Image\ :sup:`E+`
+- T + I\ :sup:`E+`
 - :code:`OpenGVLab/InternVL2-4B`, :code:`OpenGVLab/InternVL2-8B`, etc.
 -
 - ✅︎
 * - :code:`LlavaForConditionalGeneration`
 - LLaVA-1.5
-- Image\ :sup:`E+`
+- T + I\ :sup:`E+`
 - :code:`llava-hf/llava-1.5-7b-hf`, :code:`llava-hf/llava-1.5-13b-hf`, etc.
 -
 - ✅︎
 * - :code:`LlavaNextForConditionalGeneration`
 - LLaVA-NeXT
-- Image\ :sup:`E+`
+- T + I\ :sup:`E+`
 - :code:`llava-hf/llava-v1.6-mistral-7b-hf`, :code:`llava-hf/llava-v1.6-vicuna-7b-hf`, etc.