Support scatter for CUDA gradient #13

yuehhua · 2021-06-16T07:50:34Z

No description provided.

CarloLucibello · 2021-06-16T08:09:02Z

Looks like reverse_indices is not GPU-friendly:
https://github.com/FluxML/NNlib.jl/blob/a6516f498c25fee821cf64751dba0ecb5e005b8d/src/utils.jl#L18

CarloLucibello · 2021-06-21T00:01:15Z

What's the blocker here?

yuehhua · 2021-06-21T00:12:20Z

The gradients of scatter with * and / on CUDA need a CUDA kernel. I have written the kernel and am debugging it. It's almost there.
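For readers following along, here is a minimal CPU sketch of why the * case needs its own kernel: the gradient with respect to each source element is the product of every other source element scattered to the same destination. The function name scatter_mul_grad_src and its argument layout are hypothetical and only illustrate the math, assuming 1-D arrays and integer indices; this is not the NNlib or CUDA implementation.

```julia
# Hypothetical reference for the gradient of dst[idx[i]] *= src[i] w.r.t. src.
# Assumes 1-D src, integer idx, and dst0 holding the destination values
# before the scatter was applied.
function scatter_mul_grad_src(Δ, dst0, src, idx)
    T = float(eltype(src))
    grad = similar(src, T)
    for i in eachindex(src)
        d = idx[i]
        # multiply all values aggregated into slot d, except src[i] itself
        x = one(T)
        for k in eachindex(src)
            if idx[k] == d && k != i
                x *= src[k]
            end
        end
        grad[i] = Δ[d] * dst0[d] * x
    end
    return grad
end

g = scatter_mul_grad_src([1.0, 1.0], [1.0, 1.0], [2.0, 3.0, 4.0], [1, 1, 2])
# g == [3.0, 2.0, 1.0]
```

The inner loop depends on all other elements sharing the same index, which is why a plain broadcast is not enough and a dedicated kernel is used on the GPU.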


yuehhua · 2021-07-01T08:47:14Z

Finally!!

yuehhua · 2021-07-01T09:59:44Z

src/scatter.jl

+        # multiply all values to be aggregated but not itself
+        x = one(T)
+        for k in inds
+            jk = Base._to_linear_index(src, Tuple(cart_j)..., Tuple(k)...)


Base._to_linear_index is introduced here to convert an index of any form (integer, tuple, or CartesianIndex) into an integer linear index. An integer index is required to index a CuArray inside the kernel.
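A small CPU illustration of that conversion, as a sketch only: the array shape and the values of cart_j and k are made up for the example. Base._to_linear_index is an internal Base function, so its behavior may change between Julia versions; LinearIndices is the public equivalent used here as a cross-check.

```julia
# Illustrative values only: a 2×3×4 array, one leading index and a 2-D trailing index.
src = reshape(collect(1:24), 2, 3, 4)
cart_j = CartesianIndex(2)      # index over the leading dimension
k = CartesianIndex(3, 1)        # index over the trailing dimensions

# Splat both parts into integers and collapse them into one linear index.
jk = Base._to_linear_index(src, Tuple(cart_j)..., Tuple(k)...)

# Cross-check against the public API.
@assert jk == LinearIndices(src)[2, 3, 1]
@assert src[jk] == src[2, 3, 1]

# Tuple(...) also handles a plain integer or a tuple index for k.
@assert Base._to_linear_index(src, 2, Tuple(3)..., 1) == jk
@assert Base._to_linear_index(src, 2, Tuple((3, 1))...) == jk
```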

yuehhua requested a review from CarloLucibello June 16, 2021 07:50

This was referenced Jun 17, 2021

Make reverse_indices gpu compatible FluxML/NNlib.jl#326

Merged

Support reverse_indices for integer, tuple, CartesianIndex index FluxML/NNlib.jl#327

Merged

yuehhua mentioned this pull request Jun 21, 2021

Fix the output type of reverse_indices FluxML/NNlib.jl#328

Merged

yuehhua force-pushed the scatter branch 3 times, most recently from 6a692dc to 5b53a86 Compare June 30, 2021 15:46

support scatter for cuda gradient

60585e2

add count_indices for cuarray
add CUDA kernel for divide_by_counts!
add NNlib.∇scatter_src for cuda gradient
support scatter mean AD for CUDA
support scatter *,/ AD for CUDA

yuehhua force-pushed the scatter branch from bc1004e to 60585e2 Compare July 1, 2021 08:33


CarloLucibello merged commit c3e1331 into FluxML:master Jul 2, 2021

yuehhua mentioned this pull request Jul 2, 2021

merge into NNlib and CUDA? yuehhua/ScatterNNlib.jl#32

Closed

yuehhua deleted the scatter branch July 2, 2021 14:40
