Use compute queue for AMD devices #102

PatriceVignola · 2020-11-18T23:52:30Z

All the evidence we've seen so far points to compute queues being better at preventing TDRs, but also being more performant on AMD. We can always revert the change later if it turns out to not be stable enough, but we should at least have this change be tested by Autopilot and ai-benchmark tests.

jstoecker · 2020-11-18T23:57:40Z

tensorflow/core/common_runtime/dml/dml_device_state.cc

+  D3D12_COMMAND_LIST_TYPE queue_type = D3D12_COMMAND_LIST_TYPE_DIRECT;
+
+  if (adapter.VendorID() == VendorID::kAmd) {
+    queue_type = D3D12_COMMAND_LIST_TYPE_COMPUTE;


Let's keep an environment variable to force the queue on/off as well, but default to on for AMD. Might be useful for experimentation.

Merges some of the recent changes from the directml branch: * Use compute queue for AMD devices (#102) * Register List Kernels for DML (#95) * Update DirectMLX to latest (#104) * Remove extra rows from test email (#106) * Fix DML's Select kernel for int64 (#113) * Fix list kernels and tensor array ops registration (#114) * Simplify CI scripts (#112) * Fix StridedSlice's input size coalescing (#115) * Disable int64 image test (#116) * Fix network share copy path (#117) * Pipeline should continue if a test job fails (#118) * Switch network share path to use build number instead of build ID * Add missing HostMemory int32 registrations for _Arg and _RetVal (#122) * Implement all the arithmetic Scatter and ResourceScatter operators (#121) * Register emulated kernel implementations for RandomStandardNormal and TruncatedNormal (#120)

WIP

cb6b2c9

PatriceVignola requested review from jstoecker and adtsai November 18, 2020 23:52

jstoecker approved these changes Nov 18, 2020

View reviewed changes

jstoecker reviewed Nov 18, 2020

View reviewed changes

PatriceVignola added 2 commits November 18, 2020 16:22

Add env var

82af559

asdf

7ef5d2b

PatriceVignola merged commit 1838dc3 into directml Nov 19, 2020

PatriceVignola deleted the user/pavignol/use-compute-queue-amd branch November 19, 2020 02:20

jstoecker pushed a commit that referenced this pull request Dec 15, 2020

Use compute queue for AMD devices (#102)

48cc8d1

jstoecker mentioned this pull request Jan 4, 2021

New update doesn't show GPU Usage #134

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use compute queue for AMD devices #102

Use compute queue for AMD devices #102

PatriceVignola commented Nov 18, 2020

jstoecker Nov 18, 2020

Use compute queue for AMD devices #102

Use compute queue for AMD devices #102

Conversation

PatriceVignola commented Nov 18, 2020

jstoecker Nov 18, 2020

Choose a reason for hiding this comment