Added missing gpu support for relu op and also disabled padding for gemms #2806

pemeliya · 2025-01-17T10:38:26Z

Note that CublasPadForGemms was only enabled for the parity with NV, but in fact it does not bring any better performance on ROCM platform

…emms

i-chaochen · 2025-01-17T13:50:31Z

third_party/xla/xla/service/gpu/amdgpu_compiler.cc

-    pre_pipeline.AddPass<CublasPadForGemms>(rocm_compute_capability,
-                                            req.data_type, req.multiple_of);
-  }
+  // for (const auto& req : HipblasPaddingRequirements) {


Could you add comment here to indicate this is CUDA passs so we disable it.

yep, did it

i-chaochen

LGTM

added missing gpu support for relu op and also disabled padding for g…

e3e0a38

…emms

pemeliya requested review from i-chaochen and jayfurmanek January 17, 2025 10:38

i-chaochen reviewed Jan 17, 2025

View reviewed changes

i-chaochen approved these changes Jan 17, 2025

View reviewed changes

i-chaochen mentioned this pull request Jan 17, 2025

Enabled gpu support for several ops: including ReLu for bfloat16 #2801

Merged

added comment

f5bfcfd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added missing gpu support for relu op and also disabled padding for gemms #2806

Added missing gpu support for relu op and also disabled padding for gemms #2806

Uh oh!

pemeliya commented Jan 17, 2025

Uh oh!

i-chaochen Jan 17, 2025

Uh oh!

pemeliya Jan 17, 2025

Uh oh!

i-chaochen left a comment

Uh oh!

Uh oh!

Added missing gpu support for relu op and also disabled padding for gemms #2806

Are you sure you want to change the base?

Added missing gpu support for relu op and also disabled padding for gemms #2806

Uh oh!

Conversation

pemeliya commented Jan 17, 2025

Uh oh!

i-chaochen Jan 17, 2025

Choose a reason for hiding this comment

Uh oh!

pemeliya Jan 17, 2025

Choose a reason for hiding this comment

Uh oh!

i-chaochen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!