Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Added AdamW fused. Made fused logic more generic. #1

Conversation

stepfunction83
Copy link

I attempted to make the logic fit for additional optimizers and added in AdamW while I was at it. Unfortunately, I can't test it as is due to it not fitting in 24GB of VRAM with AdamW. This is likely an issue with my implementation, so if you wouldn't mind taking a look and seeing if you could debug it, it would be appreciated.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant