python

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11461

# to view logs

Re-run triggered January 21, 2025 03:34

delock

#6553

gyou2021:configurable_autoTP

Status Success

Total duration 2m 9s

Artifacts –

python.yml

on: pull_request

Matrix: unit-tests