Skip to content

Commit 1621381

Browse files
Update vllm/model_executor/model_loader/loader.py
Co-authored-by: Simon Mo <simon.mo@hey.com> Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
1 parent 9829fae commit 1621381

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/model_loader/loader.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -644,7 +644,7 @@ def load_model(self, vllm_config: VllmConfig) -> nn.Module:
644644
hasattr(module, "process_weights_after_loading"):
645645
# When attention modules need to process weights after
646646
# currently only used by MLA
647-
module.process_weights_after_loading()
647+
module.process_weights_after_loading(model_config.dtype)
648648
rank = get_tensor_model_parallel_rank()
649649
pattern = os.path.join(
650650
local_model_path,

0 commit comments

Comments
 (0)