python

Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs #11468

Re-run triggered January 21, 2025 08:31

delock

#6964

gyou2021:autoTP_Qwen2Moe_DeepSeekv2

Status Success

Total duration 2m 57s

Artifacts –

python.yml

on: pull_request

Matrix: unit-tests