Include MultiheadAttention module in C# API #320
Conversation
It seems TorchSharp TorchText is now making critical progress. Thanks!!
@fwaris is there a plan to include unit tests and examples for MultiHeadAttention?
I have a unit test that is passing. It exercises the basic functionality. I am working on porting the Temporal Graph Network model, which requires MHA. Once done, I can create an example as well.
attn_mask?.Handle ?? IntPtr.Zero,
out var res1,
out var res2);
if (res1 == IntPtr.Zero) { torch.CheckForErrors(); }
I think it should probably check both res1 and res2, just to be safe.
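For what it's worth, a minimal sketch of the suggested change, reusing only the names visible in the quoted snippet (`res1`, `res2`, `torch.CheckForErrors`); everything around it is assumed:

```csharp
// Treat a null handle from either native output as a failure and
// surface the pending native error before wrapping the results.
if (res1 == IntPtr.Zero || res2 == IntPtr.Zero) { torch.CheckForErrors(); }
```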
/// <param name="kdim">total number of features in key</param>
/// <param name="vdim">total number of features in value</param>
/// <returns></returns>
static public MultiheadAttention MultiheadAttention(long embeded_dim, long num_heads, double dropout = 0.0, bool bias = true, bool add_bias_kv = false, bool add_zero_attn = false, long? kdim=null, long? vdim=null)
'embeded_dim' -> 'embedded_dim'
will fix. thanks
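For reference (not part of the PR), a sketch of how this factory might be called; the argument values are illustrative, and the behavior of omitted kdim/vdim defaulting to the embedding dimension is assumed from the PyTorch module this mirrors:

```csharp
// Illustrative only: 512-dim attention with 8 heads; key/value inputs
// are projected from 256-dim features via kdim/vdim.
var mha = MultiheadAttention(512, 8, dropout: 0.1, kdim: 256, vdim: 256);
```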
This is a good addition to TorchSharp, and we really value your contribution. We're in the middle of moving this repository to a different organization (Xamarin is not the right long-term place for it), where we will have a proper Contributor License Agreement for all contributions, internal and external. Therefore, we will hold off on merging this PR for a little bit.
An update: this repo will be moved to the .NET Foundation organization, after which we will have the CLA and can accept the contribution.
@fwaris, if you resubmit this PR, GitHub will take you through the signing of a CLA, and we can accept your PR. Also, the text for the copyright header has changed from 'Microsoft Corp.' to '.NET Foundation.'
Minor changes required before the PR will be accepted.
@@ -0,0 +1,79 @@
// Copyright (c) Microsoft Corporation and contributors. All Rights Reserved. See License.txt in the project root for license information.
This should now say 'Copyright (c) .NET Foundation and Contributors. All Rights Reserved. See LICENSE in the project root for license information.'
/// <returns>attn_output, attn_ouput_weights</returns>
public Tuple<Tensor,Tensor> forward(Tensor query, Tensor key, Tensor value, Tensor? key_padding_mask = null, bool need_weights = true, Tensor? attn_mask = null)
//const NNModule module, const Tensor query, const Tensor key, const Tensor value, const Tensor key_padding_mask, const bool need_weights, const Tensor attn_mask, Tensor res1, Tensor res2 )
There's commented code here.
public Tuple<Tensor,Tensor> forward(Tensor query, Tensor key, Tensor value, Tensor? key_padding_mask = null, bool need_weights = true, Tensor? attn_mask = null)
//const NNModule module, const Tensor query, const Tensor key, const Tensor value, const Tensor key_padding_mask, const bool need_weights, const Tensor attn_mask, Tensor res1, Tensor res2 )
{
//var res1 = IntPtr.Zero;
More commented code.
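As an aside, given the `forward` signature quoted above, a call site would look roughly like this; the `torch.rand` factory and the `(seq_len, batch, embed_dim)` shape convention are assumptions carried over from the PyTorch module:

```csharp
// Illustrative self-attention call: query = key = value.
var x = torch.rand(10, 2, 512);    // (seq_len=10, batch=2, embed_dim=512)
var result = mha.forward(x, x, x); // mha from the factory sketched earlier
var attnOutput = result.Item1;     // (10, 2, 512)
var attnWeights = result.Item2;    // attention weights (need_weights=true)
```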
@@ -268,3 +268,4 @@ packages/
.ionide
*.bin
/*.png
/src/Native/out/build/x64-Debug
Odd. Was this necessary?
I signed the CLA, but I'm not sure if the PR is still valid. If not, I'll create a new one later this week.
I closed and resubmitted the PR, so now the checks are running. I had some review comments.
@fwaris, I would like to get this in and then do a NuGet release with it, so I'm going to merge the PR and address the requested changes in another PR.
thanks. I will rebase and make the requested changes as another PR
@fwaris, there's no need. I already made the changes and they are in main now.
@fwaris We are attempting to visualize the MultiheadAttention module within the transformer architecture. Your input is appreciated. :-) I'm still looking forward to your tests and samples on MultiheadAttention :-)
day job is keeping me very busy, but I intend to complete the TGN model port (see above) in the next few weeks. Only a few modules are left to port.
@fwaris take your time. Thx for the valuable voluntary contribution :-)
We need the MultiheadAttention module by itself for some types of models.
Although MultiheadAttention is part of the transformer modules, it is also needed separately (i.e., outside of a transformer) for some models.