Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

[transformer] fix sdpa u2pp training nan #2419

Merged
merged 1 commit into from
Mar 19, 2024
Merged

[transformer] fix sdpa u2pp training nan #2419

merged 1 commit into from
Mar 19, 2024

Conversation

Mddct
Copy link
Collaborator

@Mddct Mddct commented Mar 19, 2024

sdpa 导致dynamic left的时候 nan

@Mddct Mddct requested a review from xingchensong March 19, 2024 06:44
@xingchensong xingchensong merged commit d51d1bc into main Mar 19, 2024
6 checks passed
@xingchensong xingchensong deleted the Mddct-sdpa-fix branch March 19, 2024 07:22
@Mddct
Copy link
Collaborator Author

Mddct commented Mar 21, 2024

训练性能也保持一致了

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants