Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

recompute core attn #635

Merged
merged 4 commits into from
Aug 22, 2022
Merged

Conversation

FeixLiu
Copy link
Contributor

@FeixLiu FeixLiu commented Aug 16, 2022

b65e5dd2dc6fa37dea8b5dbdbd58f7d8

option gpu memory occupy speed
full_attn + tensor fusion 16604 240946
core_attn + tensor fusion 18012 230387

@FeixLiu FeixLiu force-pushed the recompute_core_attn branch from 3268f32 to 1794910 Compare August 22, 2022 00:44
@FeixLiu FeixLiu force-pushed the recompute_core_attn branch from bee465e to a4569a6 Compare August 22, 2022 03:46
Copy link
Member

@ForFishes ForFishes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@ForFishes ForFishes merged commit aeda695 into PaddlePaddle:develop Aug 22, 2022
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants