Run inference using checkpoints from registered models #509

Shruthi42 · 2021-06-24T08:59:31Z

Adds the ability to run inference on registered models using the parameter model_id.

dumbledad

Checked through and all looks good. The test_model_inference_on_single_run test may prove useful for the PR I'm working on now!

ant0nsc

Overall, looking good. Some parameters could find better homes/classes.
Can you please add documentation around how this new flag should be used to run inference? Also, ensure that the documentation is cleared of any references to functionality that no longer exists.

ant0nsc · 2021-07-13T21:15:19Z

InnerEye/Azure/azure_config.py

+    run_recovery_id: str = param.String(doc="A run recovery id string in the form 'experiment name:run id' "
+                                            "to use for inference, recovering a model training run or to register "
+                                            "a model.")


Unrelated to your PR, but why are these living in AzureConfig?

InnerEye/Azure/azure_config.py

InnerEye/ML/model_testing.py

ant0nsc · 2021-07-13T21:23:49Z

InnerEye/ML/run_ml.py

+            self.container.extra_downloaded_run_id = run_recovery_object
+        else:
+            self.container.extra_downloaded_run_id = None


The field name reads a bit strange - it seems to indicate that this is a run ID, but it's a RunRecovery object (which in turn is nothing but a list of Paths).

Renamed, see my comment below.

ant0nsc · 2021-07-13T21:25:28Z

InnerEye/ML/run_ml.py

+                                                                                run_to_recover,
+                                                                                EXTRA_RUN_SUBFOLDER,
+                                                                                only_return_path=not is_global_rank_zero())
+            self.container.extra_downloaded_run_id = run_recovery_object


This field "extra_downloaded_run_id" could do with some documentation. Also, we define that twice in DeepLearningConfig and in the container. Maybe it is better places in WorkflowParams?

I've renamed extra_downloaded_run_id to pretraining_run_checkpoints and initialized it once in WorkflowParams. I've also moved pretraining_run_recovery_id from AzureConfig to WorkflowParams.

Moving pretraining_run_checkpoints to WorkflowParams causes issues with initialization, I'm reverting this change for now.

InnerEye/ML/run_ml.py

ant0nsc · 2021-07-13T21:38:12Z

azure-pipelines/build.yaml

@@ -49,7 +49,7 @@ steps:
  # hence don't set PYTHONPATH
  - bash: |
      source activate InnerEye
-      pytest ./Tests/ -m "not (gpu or azureml or after_training_single_run or after_training_ensemble_run or inference or after_training_2node or after_training_glaucoma_cv_run)" --doctest-modules --junitxml=junit/test-results.xml --cov=. --cov-config=.coveragerc --cov-report=xml -n 2 --dist=loadscope --verbose
+      pytest ./Tests/ -m "not (gpu or azureml or after_training_single_run or after_training_ensemble_run or inference or after_training_2node or after_training_glaucoma_cv_run or after_training_hello_container)" --doctest-modules --junitxml=junit/test-results.xml --cov=. --cov-config=.coveragerc --cov-report=xml -n 2 --dist=loadscope --verbose


See this SO question: https://stackoverflow.com/a/55921954 - effectively we are running everything that does not have a mark, right?

This is a bit complicated - there does not seem to be an easy way to check for custom markers only and ignore markers such as "skipif" and "parametrize".

ant0nsc · 2021-07-15T12:48:06Z

InnerEye/ML/deep_learning_config.py

@@ -246,8 +256,13 @@ class WorkflowParams(param.Parameterized):
                                "be relative to the repository root directory.")

    def validate(self) -> None:
-        if self.weights_url and self.local_weights_path:
-            raise ValueError("Cannot specify both local_weights_path and weights_url.")
+        if sum([bool(param) for param in [self.weights_url, self.local_weights_path, self.model_id]]) > 1:


Not worth an extra push, but I think any([bool(param)...]) would have been clearer

The code throws an error if 2 or more options are set, but needs to allow the case were zero or one option is set.

Shruthi42 added 24 commits June 23, 2021 11:28

Use registered model for inference

1250de0

Merge branch 'main' into shbannur/load_registered_models

efc34fb

Bug fix

f94b2e9

Fix tests

7430785

Fix tests

229b7ea

mypy

16a09c3

Fix tests

ee957d8

Add tests

596efee

Fix tests

2c2160f

Fix tests

fe6fa93

Add test

45895e7

Fix tests

41f1b48

Fix test

f6bb7a2

Remove unnecessary function

3e3b069

Update tests

c0de1e6

Flake8

0889a88

Fix tests

f98b22e

mypy

d73f769

Merge branch 'main' into shbannur/load_registered_models

f4dfbe7

Merge branch 'main' into shbannur/load_registered_models

b204f08

Change docstring

dd17d78

Update CHANGELOG.md

388e0a8

Rename

d727fe1

Fix test

fa7a6e4

dumbledad self-requested a review July 13, 2021 18:26

dumbledad previously approved these changes Jul 13, 2021

View reviewed changes

ant0nsc suggested changes Jul 13, 2021

View reviewed changes

Address PR comments

e35db5b

Shruthi42 dismissed dumbledad’s stale review via e35db5b July 14, 2021 08:18

Use list of pytest markers

9093b7e

Shruthi42 added 6 commits July 14, 2021 10:32

Move model_id to WorkflowParams

bf072c0

Refactor extra_downloaded_run_id

e064483

Update documentation and argparser

838bb48

Revert changes to generic_parsing

9c7f6b4

Update documentation

d612c6e

Flake8 and mypy

6861178

dumbledad previously approved these changes Jul 14, 2021

View reviewed changes

Revert changes to pytest

df0caa2

Shruthi42 dismissed dumbledad’s stale review via df0caa2 July 14, 2021 16:23

Shruthi42 added 3 commits July 14, 2021 17:27

Update CHANGELOG.md

abd11dd

Update docstring

ae9a4ac

Fix comment line

d9a2686

dumbledad previously approved these changes Jul 14, 2021

View reviewed changes

ant0nsc mentioned this pull request Jul 15, 2021

Documentation for the end-to-end workflow of evaluating a segmentation model #540

Closed

Merge branch 'main' into shbannur/load_registered_models

894b44d

Shruthi42 dismissed dumbledad’s stale review via 894b44d July 15, 2021 12:35

dumbledad approved these changes Jul 15, 2021

View reviewed changes

ant0nsc approved these changes Jul 15, 2021

View reviewed changes

ant0nsc enabled auto-merge (squash) July 15, 2021 12:49

ant0nsc merged commit 9fcc08f into main Jul 15, 2021

ant0nsc deleted the shbannur/load_registered_models branch July 15, 2021 14:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run inference using checkpoints from registered models #509

Run inference using checkpoints from registered models #509

Shruthi42 commented Jun 24, 2021 •

edited

Loading

dumbledad left a comment

ant0nsc left a comment

ant0nsc Jul 13, 2021

ant0nsc Jul 13, 2021

Shruthi42 Jul 14, 2021

ant0nsc Jul 13, 2021

Shruthi42 Jul 14, 2021

Shruthi42 Jul 14, 2021

ant0nsc Jul 13, 2021 •

edited

Loading

Shruthi42 Jul 14, 2021

ant0nsc Jul 15, 2021

Shruthi42 Jul 15, 2021

Run inference using checkpoints from registered models #509

Run inference using checkpoints from registered models #509

Conversation

Shruthi42 commented Jun 24, 2021 • edited Loading

dumbledad left a comment

Choose a reason for hiding this comment

ant0nsc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ant0nsc Jul 13, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Shruthi42 commented Jun 24, 2021 •

edited

Loading

ant0nsc Jul 13, 2021 •

edited

Loading