
Included PyTorch model tests for CFRL #799

Merged

merged 16 commits from fix/models-pytorch-coverage into SeldonIO:master on Nov 18, 2022

Conversation

RobertSamoilescu
Collaborator

This PR includes the PyTorch model tests for CFRL to increase coverage (i.e. it addresses #760).

@codecov

codecov bot commented Oct 14, 2022

Codecov Report

Merging #799 (4605790) into master (8fd13c8) will increase coverage by 2.98%.
The diff coverage is n/a.

Additional details and impacted files


@@            Coverage Diff             @@
##           master     #799      +/-   ##
==========================================
+ Coverage   76.53%   79.52%   +2.98%     
==========================================
  Files          72       73       +1     
  Lines        8224     8477     +253     
==========================================
+ Hits         6294     6741     +447     
+ Misses       1930     1736     -194     
| Flag | Coverage Δ |
|------|------------|
| macos-latest-3.10 | 79.48% <ø> (+2.97%) ⬆️ |
| ubuntu-latest-3.10 | 79.39% <ø> (+2.88%) ⬆️ |
| ubuntu-latest-3.7 | 79.36% <ø> (+3.01%) ⬆️ |
| ubuntu-latest-3.8 | 79.30% <ø> (+2.91%) ⬆️ |
| ubuntu-latest-3.9 | 79.30% <ø> (+2.91%) ⬆️ |
| windows-latest-3.9 | 77.06% <ø> (+2.57%) ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
|----------------|------------|
| alibi/models/pytorch/metrics.py | 92.98% <ø> (+52.63%) ⬆️ |
| alibi/models/pytorch/model.py | 94.20% <ø> (+76.81%) ⬆️ |
| alibi/utils/discretizer.py | 92.50% <0.00%> (-5.00%) ⬇️ |
| alibi/api/defaults.py | 100.00% <0.00%> (ø) |
| alibi/explainers/__init__.py | 100.00% <0.00%> (ø) |
| alibi/explainers/pd_variance.py | 40.72% <0.00%> (ø) |
| alibi/explainers/partial_dependence.py | 48.16% <0.00%> (+0.21%) ⬆️ |
| alibi/datasets/default.py | 93.47% <0.00%> (+23.91%) ⬆️ |
| alibi/explainers/ale.py | 98.26% <0.00%> (+32.60%) ⬆️ |
| alibi/datasets/tensorflow.py | 100.00% <0.00%> (+50.00%) ⬆️ |

... and 5 more

@RobertSamoilescu force-pushed the fix/models-pytorch-coverage branch from 14d13ee to f5def51 on October 17, 2022, 12:28
@jklaise
Contributor

jklaise commented Oct 21, 2022

@RobertSamoilescu there are two binary files called .pymon in this PR which should be removed.

def __init__(self, input_dim: int, output_dim: int):
    super().__init__()
    self.fc1 = nn.Linear(input_dim, output_dim, bias=False)
    self.to(self.device)
Contributor

What's the reason for doing this here (in contrast with other tests, where cpu() was called explicitly after model instance creation)?
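
For context, the two patterns being contrasted look roughly like this (a minimal sketch; the explicit `device` attribute and the exact `UnimodalModel` layout are assumptions based on the snippet above, not the library's actual base class):

import torch
import torch.nn as nn


class UnimodalModel(nn.Module):
    """Minimal single-layer model used only for illustration."""

    def __init__(self, input_dim: int, output_dim: int):
        super().__init__()
        # device attribute assumed for this sketch
        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
        self.fc1 = nn.Linear(input_dim, output_dim, bias=False)
        # pattern questioned here: move the parameters to the device inside __init__
        self.to(self.device)


# the other tests instead place the model explicitly after instantiation,
# which makes the in-constructor call above redundant
model = UnimodalModel(input_dim=3, output_dim=2).cpu()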

Collaborator Author

See the comment above. It can probably be deleted; it was just added for some consistency. I guess I will just leave it there?

Contributor

I would say it's better to remove it unless there's an explicit reason to have it here, otherwise it's another potential line for dev confusion :)

Comment on lines 42 to 48
@contextlib.contextmanager
def reset_model(model: Model):
    model._reset_loss()
    model._reset_metrics()
    yield
    model._reset_loss()
    model._reset_metrics()
Contributor

Just to confirm my understanding - when used in a with statement this would basically reset losses and metrics, hand over control to the user inside the with block, and after exiting reset everything again?

Collaborator Author

Yes. That's correct.
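
As a usage illustration (a sketch; `unimodal_model`, `x` and `y` are placeholders for the fixture and data used in the actual tests, and the `train_step` call is just an example of a test body):

with reset_model(unimodal_model):
    # losses and metrics were reset on entry, so the test body runs against a clean state
    stats = unimodal_model.train_step(x, y)
    # ... assertions on `stats` go here ...
# on exit, the loss and metrics are reset again, so nothing leaks into the next test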

Comment on lines 373 to 376
""" Test if the train step return the appropriate statistics. """
# copy the model `state_dict` as it will be modified by the `train_step`. Not really necessary now,
# but to avoid any errors in the future if it is the case.
state_dict_cpy = deepcopy(unimodal_model.state_dict())
Contributor

Is the concern here because of fixture sharing between test functions? Could it be done in a more idiomatic way using "yield" fixtures and/or scoping the model fixture at a test-function level instead of module level?

Collaborator Author

Yes, the concern is that the model is shared between tests. To be honest, I am not sure how I would rewrite it with "yield". Changing the scope to function level would do the job, but I was trying to avoid creating a new object over and over for each test. If we decide to move the fixture to function scope, then there will be no need for the context manager to reset the metrics and the loss. Not entirely sure what the best approach is ... any suggestions?

Contributor

I think just initializing a model isn't a huge overhead, so function scope makes sense and gives us better guarantees of test isolation.
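
A function-scoped yield fixture along those lines could look like this (a sketch; it assumes `UnimodalModel` is importable in the test module, and the dimensions are arbitrary):

import pytest


@pytest.fixture  # function scope is the default
def unimodal_model():
    # a fresh model per test gives isolation without manual loss/metric resets
    model = UnimodalModel(input_dim=5, output_dim=1)
    yield model
    # optional teardown would go here; nothing to clean up for a plain model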

assert (param.grad is None) or (torch.allclose(param, torch.zeros_like(param)))

# compute prediction
unimodal_model.eval()
Contributor

Should this be reverted before exiting the test (also see previous comment on fixtures)?

Collaborator Author

Doesn't really matter for these tests since the model has the same behavior in train and eval mode. Now that I think about it, it probably makes sense to set the scope to function level ...

model1 = UnimodalModel(input_dim=input_dim, output_dim=output_dim)
model2 = UnimodalModel(input_dim=input_dim, output_dim=output_dim)

with tempfile.TemporaryDirectory() as temp_dir:
Contributor

pytest actually ships with a tmp_path fixture which can be used, saving some code and imports :) https://docs.pytest.org/en/7.1.x/how-to/tmp_path.html
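
For reference, the tempfile block above could be written with the fixture along these lines (a sketch; `save_weights`/`load_weights` are hypothetical helpers standing in for whatever the test actually saves and loads, and the dimensions are arbitrary):

def test_save_load(tmp_path):
    # tmp_path is a per-test pathlib.Path created and cleaned up by pytest
    model1 = UnimodalModel(input_dim=5, output_dim=1)
    model2 = UnimodalModel(input_dim=5, output_dim=1)

    weights_path = tmp_path / "weights.pt"
    model1.save_weights(str(weights_path))  # hypothetical helper
    model2.load_weights(str(weights_path))  # hypothetical helper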

Contributor

@jklaise left a comment

Thanks, nice tests! Left a few comments, particularly around fixture management.

Contributor

@jklaise left a comment

Looks good, just one more comment.

@jklaise merged commit 13198a9 into SeldonIO:master on Nov 18, 2022
RobertSamoilescu added a commit to RobertSamoilescu/alibi that referenced this pull request Jan 4, 2023
* Wrote actor-critic tests.

* Included autoencoder tests.

* Implemented cfrl_models tests.

* Included metrics tests.

* validate prediction labels tests -- in progress

* Finalized all tests. Before refactoring.

* isort, included docs, consistent use of quotes.

* pytest error checks, removed docstrings.

* Refactored test_model

* Minor refactoring

* solve flake8 issues.

* Included pytest-mock in the requirements/dev.txt

* Improved tests for train_step, test_step, fit, and evaluate.

* Addressed comments. TODO: decide fixture scope

* Moved models' scope to function.

* Removed global variables.