You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Do you use a 2080Ti GPU but find that it doesn't support flash attention? Are you getting any error logs? Do you mean to use the non-flash attention version for inference?
Motivation
as title
Related resources
No response
The text was updated successfully, but these errors were encountered: