`ModelComparisonSimulator`: handle different outputs from individual simulators #452

Kucharssim · 2025-04-29T08:07:54Z

fixes #441

As per #441 (comment), this PR implements a "smarter" concatenation function (for example, one that pads with NaN if a parameter is not available for a given model).

By default, however, the simulator simply drops keys that are not common to all simulators (and logs an info message), since in most situations those outputs are not needed in the first place.
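A minimal sketch of the behaviour described above may help reviewers. It is not the code from this PR: the function names `combine_outputs` and `_fill_missing`, the mode strings `"drop"` and `"fill"`, and the choice of `ValueError` are assumptions made for illustration (only the `key_conflicts` name and the `"error"` value appear in the tests in this PR).

```python
import logging

import numpy as np

logger = logging.getLogger(__name__)


def _fill_missing(out, outputs, conflicting):
    """Return a copy of `out` with NaN arrays for every key it lacks (hypothetical helper)."""
    n = len(next(iter(out.values())))  # batch size produced by this simulator
    filled = dict(out)
    for key in conflicting:
        if key not in filled:
            # Borrow the trailing shape from any simulator that does provide the key.
            template = next(o[key] for o in outputs if key in o)
            filled[key] = np.full((n, *template.shape[1:]), np.nan)
    return filled


def combine_outputs(outputs, key_conflicts="drop"):
    """Concatenate per-simulator output dicts along the batch axis (illustrative only)."""
    key_sets = [set(out) for out in outputs]
    common = set.intersection(*key_sets)
    conflicting = set.union(*key_sets) - common

    if conflicting:
        if key_conflicts == "error":
            raise ValueError(f"Simulators return different keys: {sorted(conflicting)}")
        elif key_conflicts == "drop":
            logger.info("Dropping keys not shared by all simulators: %s", sorted(conflicting))
            outputs = [{k: out[k] for k in common} for out in outputs]
        elif key_conflicts == "fill":
            outputs = [_fill_missing(out, outputs, conflicting) for out in outputs]
        else:
            raise ValueError(f"Unknown key_conflicts mode: {key_conflicts!r}")

    return {k: np.concatenate([out[k] for out in outputs], axis=0) for k in outputs[0]}
```

With two simulators that share `x` but return `c` and `w` respectively, the `"drop"` mode in this sketch keeps only `x`, while `"fill"` keeps all three keys and pads the rows of the simulator that lacks a key with NaN.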

codecov · 2025-04-29T08:16:09Z

Codecov Report

Attention: Patch coverage is 97.56098% with 1 line in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| bayesflow/simulators/model_comparison_simulator.py | 97.56% | 1 Missing ⚠️ |

| Files with missing lines | Coverage Δ | |
|---|---|---|
| bayesflow/simulators/model_comparison_simulator.py | `84.26% <97.56%> (+57.18%)` | ⬆️ |

... and 22 files with indirect coverage changes

vpratz · 2025-04-29T15:32:12Z

Thanks for the PR, I skimmed it and like the idea behind the changes. I'll try to conduct a proper review some time this week.

LarsKue

Looks good from my side. See individual comments.

Can we also add tests for the (few) missed edge-case lines?

bayesflow/simulators/model_comparison_simulator.py

add newlines to correctly render lists, make reference to other class a link

vpratz

See my comment on one edge case; I'm not sure whether it is relevant. What do you think?
Apart from that, the PR looks good to me; I only added minor formatting fixes to the docstring.

vpratz · 2025-04-30T15:20:46Z

bayesflow/simulators/model_comparison_simulator.py

+    def _determine_key_conflicts(self, sims):
+        # determine only once
+        if self._keys is not None:
+            return self._keys


Can this return "wrong" results if some simulators had n=0 in line 120 when the function first runs, and n>0 later on? Is this something we want to safeguard against?

I imagine this function is quite cheap to compute. Would it make sense to run it in full every time, but log the info message only once?

ok, will do that
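The suggestion in this thread could look roughly like the sketch below: the key sets are recomputed on every call, and only the log message is guarded by a flag. The class name `KeyConflictTracker`, the `_logged` attribute, and the convention that a simulator with `n == 0` contributes an empty dict are all assumptions for illustration, not the code in this PR.

```python
import logging

logger = logging.getLogger(__name__)


class KeyConflictTracker:
    """Hypothetical stand-in for the relevant part of the simulator."""

    def __init__(self):
        self._logged = False  # whether the info message has been emitted already

    def determine_key_conflicts(self, sims):
        # Recompute on every call. Simulators that produced no samples
        # (assumed here to be empty dicts) are skipped, so the result can
        # differ between calls depending on which simulators were drawn.
        key_sets = [set(sim) for sim in sims if sim]
        if not key_sets:
            return set(), set()

        common = set.intersection(*key_sets)
        missing = set.union(*key_sets) - common

        # Only the logging is done once, not the computation.
        if missing and not self._logged:
            logger.info("Keys not shared by all simulators: %s", sorted(missing))
            self._logged = True

        return common, missing
```

In this sketch, a first call in which one simulator has `n == 0` reports no conflicts, and a later call in which both simulators produce output still detects them, which is the case the cached version could get wrong.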

LarsKue

The changes are looking great. Can we address Valentin's comments before we merge? I also left a few more minor comments.

LarsKue · 2025-05-03T17:09:07Z

bayesflow/simulators/model_comparison_simulator.py

+    def _determine_key_conflicts(self, sims):
+        # determine only once
+        if self._keys is not None:
+            return self._keys


LarsKue · 2025-05-03T17:09:44Z

tests/test_simulators/test_simulators.py

        assert set(samples) == {"x", "model_indices", "c", "w"}
        assert np.sum(np.isnan(samples["c"])) + np.sum(np.isnan(samples["w"])) == batch_size
+    elif multimodel_key_conflicts.key_conflicts == "error":
+        with pytest.raises(Exception):


This check is too broad. Use a specific exception type.
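As a sketch of what a narrower check might look like: `pytest.raises` accepts a concrete exception class and an optional `match` pattern for the message. The function `sample_with_conflicts` below is a hypothetical stand-in for the simulator call, and `ValueError` is an assumption; the exception type actually raised by the simulator should be used instead.

```python
import pytest


def sample_with_conflicts():
    # Hypothetical stand-in for the simulator call under test.
    # The ValueError type and the message are assumptions for illustration.
    raise ValueError("Simulators return different keys: ['c', 'w']")


def test_key_conflicts_raise_value_error():
    # Passes only for a ValueError whose message matches the pattern;
    # any other exception type propagates and fails the test.
    with pytest.raises(ValueError, match="different keys"):
        sample_with_conflicts()
```

Unlike `pytest.raises(Exception)`, this form would not silently accept an unrelated failure such as a `KeyError` or `TypeError` raised elsewhere in the sampling code.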

Kucharssim added 3 commits April 28, 2025 13:55

drop or fill missing keys from the output

86f6f41

fix typo

a717600

add test

3c93679

Kucharssim requested a review from vpratz April 29, 2025 08:07

LarsKue self-requested a review April 29, 2025 18:24

LarsKue reviewed Apr 29, 2025

address code review from Lars

3e4813a

Kucharssim requested a review from LarsKue April 30, 2025 07:58

formatting in the docstring

e77fb51

add newlines to correctly render lists, make reference to other class a link

vpratz reviewed Apr 30, 2025

LarsKue requested changes May 3, 2025

Kucharssim May 5, 2025
