Skip to content

Commit 17aeab2

Browse files
committed
Revert "[Bugfix] Fix block size validation (#10938)"
This reverts commit 69ba344. Signed-off-by: Konrad Zawora <kzawora@habana.ai>
1 parent f9ecbb1 commit 17aeab2

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/engine/arg_utils.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -423,7 +423,7 @@ def add_cli_args(parser: FlexibleArgumentParser) -> FlexibleArgumentParser:
423423
parser.add_argument('--block-size',
424424
type=int,
425425
default=EngineArgs.block_size,
426-
choices=[8, 16, 32],
426+
choices=[8, 16, 32, 64, 128],
427427
help='Token block size for contiguous chunks of '
428428
'tokens. This is ignored on neuron devices and '
429429
'set to max-model-len')

0 commit comments

Comments
 (0)