Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Minimum learning rate #27

Closed
artnoage opened this issue Sep 15, 2023 · 1 comment
Closed

Minimum learning rate #27

artnoage opened this issue Sep 15, 2023 · 1 comment

Comments

@artnoage
Copy link

The minimum learning rate is the same as the "max". Is this intentional or a mistake? If yes, why (you can skip explanation if it is too bothersome)?

@jzhang38
Copy link
Owner

jzhang38 commented Sep 16, 2023

This is a mistake made by us and thanks a million for spotting that out!(by right it should be 4e-5)
Fortunately, we are still at the very early stage of training and there is still room to correct it. I have corrected this mistake, and the TinyLlama's lr curve will look like below:
image
I will update the readme pointing to this issue together with the release of the 500B-token checkpoint later.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants