samples_per_second_per_gpu or tokens_per_second_per_gpu? #262

Open
Muennighoff opened this issue Apr 29, 2024 · 1 comment

Comments

@Muennighoff
Contributor

I'm probably missing something, but isn't this tokens per second per GPU?

samples_per_second_per_gpu = inputs.numel() / batch_time_m.val

inputs.numel() gives all tokens; for samples it would be inputs.shape[0], no?
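To make the difference concrete, here is a minimal, dependency-free sketch. It assumes `inputs` is a 2-D batch of token IDs with shape `(batch_size, seq_len)`; the helper `throughput_per_gpu` and the example numbers are hypothetical, not code from the repo. `math.prod` over the shape stands in for `inputs.numel()`, and the first shape entry stands in for `inputs.shape[0]`.

```python
from math import prod


def throughput_per_gpu(batch_shape, batch_time_s):
    """Return (samples/s, tokens/s) for one GPU's batch.

    batch_shape is assumed to be (batch_size, seq_len), i.e. the shape of
    the token-ID tensor; batch_time_s is the wall time for that batch.
    """
    num_samples = batch_shape[0]    # what inputs.shape[0] would give
    num_tokens = prod(batch_shape)  # what inputs.numel() would give
    return num_samples / batch_time_s, num_tokens / batch_time_s


# Hypothetical batch: 8 sequences of 2048 tokens, processed in 0.5 s.
samples_per_s, tokens_per_s = throughput_per_gpu((8, 2048), batch_time_s=0.5)
print(samples_per_s)  # 16.0
print(tokens_per_s)   # 32768.0
```

Under that assumption the two numbers differ by exactly a factor of `seq_len`, so a value computed from `numel()` but logged as samples per second would be inflated by the sequence length.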

@achalddave
Collaborator

achalddave commented Apr 29, 2024 via email


2 participants