Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

[Feature] Add multi machine dist_train #1303

Merged
merged 1 commit into from
Mar 15, 2022

Conversation

ZCMax
Copy link
Collaborator

@ZCMax ZCMax commented Mar 11, 2022

Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers.

Motivation

support multi machine dist_train doc and script

Modification

support multi machine dist_train doc and script

BC-breaking (Optional)

Does the modification introduce changes that break the back-compatibility of the downstream repos?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit test to ensure the correctness.
  3. If the modification has potential influence on downstream projects, this PR should be tested with downstream projects.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

@ZwwWayne ZwwWayne merged commit 9c7270d into open-mmlab:master Mar 15, 2022
ZwwWayne added a commit that referenced this pull request Apr 13, 2022
* fix README typo (#1292)

* [Fix]fix init_model to support 'device=cpu' (#1275)

* fix init_model

* Refine warning message

Co-authored-by: Tai-Wang <tab_wang@outlook.com>

* fixed docs/zh_cn.getting_started (#1298)

* [Doc] Add documentation for multi-node train with pytorch original ddp (#1296)

* update mn_train

* update

* Fix typos

Co-authored-by: Tai-Wang <tab_wang@outlook.com>

* add multi-machine dist_train (#1303)

* Update Chinese document for speed benchmark

* Fix inappropriate expressions

Co-authored-by: ChaimZhu <zhuchenming@pjlab.org.cn>
Co-authored-by: VVsssssk <88368822+VVsssssk@users.noreply.github.com>
Co-authored-by: Tai-Wang <tab_wang@outlook.com>
Co-authored-by: Subjectivist <xuejiapeng_upc@163.com>
Co-authored-by: Enze Xie <Johnny_ez@163.com>
Co-authored-by: Wenwei Zhang <40779233+ZwwWayne@users.noreply.github.com>
ZwwWayne pushed a commit that referenced this pull request Apr 13, 2022
* fix README typo (#1292)

* [Fix]fix init_model to support 'device=cpu' (#1275)

* fix init_model

* Refine warning message

Co-authored-by: Tai-Wang <tab_wang@outlook.com>

* fixed docs/zh_cn.getting_started (#1298)

* [Doc] Add documentation for multi-node train with pytorch original ddp (#1296)

* update mn_train

* update

* Fix typos

Co-authored-by: Tai-Wang <tab_wang@outlook.com>

* add multi-machine dist_train (#1303)

* translate lidar_det3d.md into corresponding Chinese version

* fixed typos and embellished some expressions

* [doc] modify some translation errors

Co-authored-by: ChaimZhu <zhuchenming@pjlab.org.cn>
Co-authored-by: VVsssssk <88368822+VVsssssk@users.noreply.github.com>
Co-authored-by: Tai-Wang <tab_wang@outlook.com>
Co-authored-by: Subjectivist <xuejiapeng_upc@163.com>
Co-authored-by: Enze Xie <Johnny_ez@163.com>
ZwwWayne added a commit that referenced this pull request Apr 13, 2022
* fix README typo (#1292)

* [Fix]fix init_model to support 'device=cpu' (#1275)

* fix init_model

* Refine warning message

Co-authored-by: Tai-Wang <tab_wang@outlook.com>

* fixed docs/zh_cn.getting_started (#1298)

* [Doc] Add documentation for multi-node train with pytorch original ddp (#1296)

* update mn_train

* update

* Fix typos

Co-authored-by: Tai-Wang <tab_wang@outlook.com>

* add multi-machine dist_train (#1303)

* Update README.md

fix the link error

Co-authored-by: ChaimZhu <zhuchenming@pjlab.org.cn>
Co-authored-by: VVsssssk <88368822+VVsssssk@users.noreply.github.com>
Co-authored-by: Tai-Wang <tab_wang@outlook.com>
Co-authored-by: Subjectivist <xuejiapeng_upc@163.com>
Co-authored-by: Enze Xie <Johnny_ez@163.com>
Co-authored-by: Wenwei Zhang <40779233+ZwwWayne@users.noreply.github.com>
deleomike pushed a commit to deleomike/mmdetection3d that referenced this pull request Apr 14, 2022
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants