Multi-gpu slower than single-gpu #269

Open
weiyx15 opened this issue Jul 6, 2020 · 1 comment

weiyx15 commented Jul 6, 2020

Hi,
I found that with the same hyper-parameters but a different num_core_per_host (num_core_per_host=1 for single-gpu and num_core_per_host=6 for multi-gpu), the global_step/sec of multi-gpu is slightly lower than that of single-gpu.
num_core_per_host=6:

INFO:tensorflow:global_step/sec: 1.09456
INFO:tensorflow:loss = 1.490116e-08, step = 401200 (91.361 sec)

num_core_per_host=1:

INFO:tensorflow:global_step/sec: 1.21364
INFO:tensorflow:loss = 0.053051353, step = 62400 (82.396 sec)

Is this expected, and if so, why?

System Information:
cuda V10.0.130
cudnn 7.4.1
nccl 2.6.4
tensorflow-gpu 1.13.1 (from pip in conda virtual environment)

Best Regards


guotong1988 commented Sep 28, 2020

I guess multi-gpu's loss decreases faster than single-gpu's.
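
One way to check this guess is to compare examples processed per second instead of global_step/sec, since a global step may consume more data when several GPUs are used. The sketch below is only illustrative: `examples_per_sec`, `ASSUMED_BATCH_SIZE`, and the `batch_is_per_device` switch are hypothetical names, and the batch size is an assumption because the issue does not state it. Only the two global_step/sec values come from the logs above.

```python
def examples_per_sec(steps_per_sec, batch_size, num_devices, batch_is_per_device):
    """Convert a global_step/sec reading into examples processed per second."""
    # If batch_size is per device, one global step consumes
    # batch_size * num_devices examples; if it is already the global batch,
    # the device count does not change how much data a step consumes.
    global_batch = batch_size * num_devices if batch_is_per_device else batch_size
    return steps_per_sec * global_batch


# global_step/sec values copied from the logs in the issue.
MULTI_GPU_STEPS_PER_SEC = 1.09456   # num_core_per_host=6
SINGLE_GPU_STEPS_PER_SEC = 1.21364  # num_core_per_host=1

# Hypothetical value: the issue does not state the batch size.
ASSUMED_BATCH_SIZE = 8

for per_device in (True, False):
    multi = examples_per_sec(MULTI_GPU_STEPS_PER_SEC, ASSUMED_BATCH_SIZE, 6, per_device)
    single = examples_per_sec(SINGLE_GPU_STEPS_PER_SEC, ASSUMED_BATCH_SIZE, 1, per_device)
    print(f"batch_is_per_device={per_device}: "
          f"multi/single throughput ratio = {multi / single:.2f}")
```

If the batch size is applied per GPU, the 6-GPU run would be processing roughly 5.4 times as many examples per second despite the lower step rate. If the batch size is the global batch split across GPUs, the ratio is roughly 0.9, which would mean the multi-GPU run really is slower per example; that could point at overhead such as gradient synchronization, though the logs alone do not confirm it.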
