
fairseq/examples/roberta/README.pretraining.md #1073

Closed
yucoian opened this issue Aug 27, 2019 · 2 comments

Comments

@yucoian

yucoian commented Aug 27, 2019

When I followed this tutorial to pretrain RoBERTa on a 16-GPU machine, it didn't output any step results (loss, ppl, etc.), even though GPU and CPU utilization were both at 100%. Are there any solutions?

@ngoyal2707
Contributor

Can you please share the logs?
GPUs stuck at exactly 100% utilization while making no visible progress can mean several different things; for example, one or more GPUs may have hit an out-of-memory (OOM) error during distributed training.
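One quick first check along these lines is to grep the training logs for OOM messages. This is just a sketch: the `train.log` filename is an assumption, so point it at wherever your fairseq job actually writes its per-worker logs.

```shell
# Scan a training log for CUDA out-of-memory errors.
# A run hung at 100% GPU utilization with no step output often hides
# an OOM on one rank of a distributed job.
grep -in "out of memory" train.log 2>/dev/null \
  && echo "found OOM messages above" \
  || echo "no OOM messages found in train.log"
```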

@lematt1991
Contributor

Closing due to inactivity. Please open a new issue if you are still having problems.

facebook-github-bot pushed a commit that referenced this issue Mar 8, 2020
…nt (#1073)

Summary:
[This commit](dd1298e) made it so that duplicate entries in a dictionary are ignored. Unfortunately, the Camembert model depends on overwriting `<unk>`, `<s>`, and `</s>`.

The proposed solution here is to allow the dictionary to have entries like:
```
<unk> 999 #fairseq:overwrite
<s> 999 #fairseq:overwrite
</s> 999 #fairseq:overwrite
, 999
▁de 999
. 999
(...)
```

These entries preserve the old overwriting behavior, so we can release a new `camembert.v0.tar.gz` with a dictionary like the one above and it will work.
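As an illustration of the behavior described above (a hedged sketch only, not fairseq's actual `Dictionary` code), a loader honoring such a flag might look like this:

```python
# Minimal sketch of a dictionary loader that honors a "#fairseq:overwrite"
# suffix on "<token> <count>" lines. Illustrative only; the function name
# and the initial special symbols are assumptions for this example.

def load_dictionary(lines, initial=("<s>", "<pad>", "</s>", "<unk>")):
    """Parse `<token> <count>` lines; duplicates raise unless flagged."""
    symbols = list(initial)
    counts = {sym: 0 for sym in initial}
    for line in lines:
        parts = line.split()
        overwrite = parts[-1] == "#fairseq:overwrite"
        if overwrite:
            parts = parts[:-1]
        token, count = parts[0], int(parts[1])
        if token in counts and not overwrite:
            # Duplicate without the flag: reject, matching the stricter
            # behavior introduced by the commit referenced above.
            raise ValueError(
                f"Duplicate word '{token}'; append '#fairseq:overwrite' to allow it"
            )
        if token not in counts:
            symbols.append(token)
        counts[token] = count
    return symbols, counts

entries = [
    "<unk> 999 #fairseq:overwrite",
    "<s> 999 #fairseq:overwrite",
    "</s> 999 #fairseq:overwrite",
    ", 999",
]
symbols, counts = load_dictionary(entries)
```

With the flag, the special tokens keep a single slot in `symbols` but get their counts overwritten; without it, the same lines would raise.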
Pull Request resolved: fairinternal/fairseq-py#1073

Reviewed By: kahne

Differential Revision: D20284569

Pulled By: myleott

fbshipit-source-id: bf78fbff13c94bf8a6485cbdda62305ddc30c056
louismartin pushed a commit to louismartin/fairseq that referenced this issue Mar 24, 2020
…nt (facebookresearch#1073)

moussaKam pushed a commit to moussaKam/language-adaptive-pretraining that referenced this issue Sep 29, 2020
…nt (facebookresearch#1073)
