Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Some errors when using multi-gpus #62

Open
lcf000000 opened this issue Sep 21, 2018 · 1 comment
Open

Some errors when using multi-gpus #62

lcf000000 opened this issue Sep 21, 2018 · 1 comment

Comments

@lcf000000
Copy link

lcf000000 commented Sep 21, 2018

Training is too slow, and I want using a 4xTian xp to speed up it.
when I set
opts.gpus=[1,2,3,4]
then I get some errors as:

`Warning: Function labindex has the same name as a MATLAB builtin. We suggest you rename the function to avoid a potential name
conflict.

In path at 109
In addpath at 86
In cnn_train_dag at 11
In trainig_siameseFC at 97
In run_increasingData at 6
Warning: Function numlabs has the same name as a MATLAB builtin. We suggest you rename the function to avoid a potential name
conflict.
In path at 109
In addpath at 86
In cnn_train_dag at 11
In trainig_siameseFC at 97
In run_increasingData at 6
Starting parallel pool (parpool) using the 'local' profile ... connected to 1 workers.
cnn_train_dag: resetting GPU
Lab 1:
ans =
Error using cnn_train_dag>(spmd) (line 492)
The parallel pool that SPMD was using has been shut down.

Error in cnn_train_dag>prepareGPUs (line 492)
spmd

Error in cnn_train_dag (line 92)
prepareGPUs(opts, epoch == start+1) ;

Error in trainig_siameseFC (line 97)
[net, stats] = cnn_train_dag(net, imdb, batch_fn, opts.train);

Error in run_increasingData (line 6)
trainig_siameseFC(imdb_data, 86);

The client lost connection to worker 1. This might be due to network problems, or the interactive communicating job might have
errored.
`

@lcf000000 lcf000000 changed the title I get some errors when I use multi-gpus Some errors when using multi-gpus Sep 21, 2018
@jvlmdr
Copy link

jvlmdr commented Sep 21, 2018

Sorry, we haven't tested support for training with multiple GPUs. These look like errors coming from matconvnet and/or the parallel toolbox.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants