Script MultiheadAttention (#1524) #681

cndn · 2020-01-10T01:53:57Z

Summary:
Pull Request resolved: facebookresearch/fairseq#1524

Make fairseq MultiheadAttention scriptable with TorchScript. Feedback is welcome.

1. Add types.
2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dicts, so each module that contains multihead attention assigns itself a fairseq_instance_id in its initializer.
3. There may be opportunities to make the assertions and annotations cleaner.
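
The second point can be illustrated with a minimal sketch. This is plain Python rather than the code in this PR: the class name `IncrementalStateMixin`, the helper `_full_key`, and the list-valued buffers are assumptions made for illustration, and only the attribute name `fairseq_instance_id` comes from the description above. The idea shown is that each module picks its own id once in the initializer and uses it to namespace its entries in a shared state dict, so no module-level (global) dict is needed to hand out ids.

```python
import uuid
from typing import Dict, List, Optional

# Hypothetical buffer type; real code would hold tensors, not lists.
Buffer = Dict[str, Optional[List[float]]]


class IncrementalStateMixin:
    """Hypothetical mixin: the instance id is assigned in the initializer."""

    def __init__(self) -> None:
        # Assigned once at construction time, instead of being looked up
        # lazily in a global registry of instance ids.
        self.fairseq_instance_id: str = str(uuid.uuid4())

    def _full_key(self, key: str) -> str:
        # Prefix the key with this instance's id so that several modules
        # can share one incremental_state dict without colliding.
        return f"{self.fairseq_instance_id}.{key}"

    def get_incremental_state(
        self, incremental_state: Optional[Dict[str, Buffer]], key: str
    ) -> Optional[Buffer]:
        if incremental_state is None:
            return None
        return incremental_state.get(self._full_key(key))

    def set_incremental_state(
        self,
        incremental_state: Optional[Dict[str, Buffer]],
        key: str,
        value: Buffer,
    ) -> None:
        if incremental_state is not None:
            incremental_state[self._full_key(key)] = value


# Usage: two modules write to the same dict under the same logical key.
layer_a = IncrementalStateMixin()
layer_b = IncrementalStateMixin()
shared_state: Dict[str, Buffer] = {}
layer_a.set_incremental_state(shared_state, "attn_state", {"prev_key": [1.0]})
layer_b.set_incremental_state(shared_state, "attn_state", {"prev_key": [2.0]})
```

Because the id lives on the instance, all state access goes through ordinary attributes and explicitly typed arguments, which is the kind of structure TorchScript can generally compile.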

Differential Revision: D18772594

facebook-github-bot · 2020-01-10T01:54:19Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Summary:
Pull Request resolved: pytorch#681
Pull Request resolved: facebookresearch/fairseq#1524

Make fairseq MultiheadAttention scriptable. Feedback is welcome.

1. Add types.
2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dicts, so each module that contains multihead attention assigns itself a fairseq_instance_id in its initializer.
3. There may be opportunities to make the assertions and annotations cleaner.

Differential Revision: D18772594

fbshipit-source-id: 4353d522d244b1508190d33ca5be6f2299e8442c

facebook-github-bot · 2020-01-14T23:39:48Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Summary:
Pull Request resolved: pytorch/translate#681
Pull Request resolved: facebookresearch#1524

Make fairseq MultiheadAttention scriptable. Feedback is welcome.

1. Add types.
2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dicts, so each module that contains multihead attention assigns itself a fairseq_instance_id in its initializer.
3. There may be opportunities to make the assertions and annotations cleaner.

Differential Revision: D18772594

fbshipit-source-id: 8b8b87f0e74f4afb863b15fc4172482b640f6197

Summary:
Pull Request resolved: pytorch#681
Pull Request resolved: facebookresearch/fairseq#1524

Make fairseq MultiheadAttention scriptable. Feedback is welcome.

1. Add types.
2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dicts, so each module that contains multihead attention assigns itself a fairseq_instance_id in its initializer.
3. There may be opportunities to make the assertions and annotations cleaner.

Differential Revision: D18772594

fbshipit-source-id: 5c21d7d84db1320201f486015bb91469006ffd95

facebook-github-bot · 2020-01-16T18:11:25Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Summary:
Pull Request resolved: fairinternal/fairseq-py#1002
Pull Request resolved: pytorch/translate#681
Pull Request resolved: #1524

Make fairseq MultiheadAttention scriptable. Feedback is welcome.

1. Add types.
2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dicts, so each module that contains multihead attention assigns itself a fairseq_instance_id in its initializer.
3. There may be opportunities to make the assertions and annotations cleaner.

Reviewed By: myleott

Differential Revision: D18772594

fbshipit-source-id: 377aef4bbb7ef51da5b6bac9a87a6f7b03b16fe1

facebook-github-bot · 2020-01-22T03:10:16Z

This pull request has been merged in 23cedf7.

Summary:
Pull Request resolved: fairinternal/fairseq-py#1002
Pull Request resolved: pytorch/translate#681
Pull Request resolved: facebookresearch/fairseq#1524

Make fairseq MultiheadAttention scriptable. Feedback is welcome.

1. Add types.
2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dicts, so each module that contains multihead attention assigns itself a fairseq_instance_id in its initializer.
3. There may be opportunities to make the assertions and annotations cleaner.

Reviewed By: myleott

Differential Revision: D18772594

fbshipit-source-id: 377aef4bbb7ef51da5b6bac9a87a6f7b03b16fe1

facebook-github-bot added the fb-exported label Jan 10, 2020

cndn force-pushed the export-D18772594 branch from 9c65372 to 0c86ec3 on January 14, 2020 23:39

cndn force-pushed the export-D18772594 branch from 0c86ec3 to b673ecd on January 16, 2020 18:11

facebook-github-bot closed this in 23cedf7 Jan 22, 2020

facebook-github-bot added the Merged label Jan 22, 2020
