Document a Bayesian approach to automated V&V #1382

zmbc · 2023-10-30T16:54:38Z

Uses Bayesian instead of frequentist hypothesis testing.

Code implementing the statistics and applying this method to domestic migration and immigration is here: ihmeuw/vivarium_census_prl_synth_pop#333

aflaxman · 2023-10-30T22:10:46Z

docs/source/model_design/vivarium_features/automated_v_and_v/index.rst

 one for each of the values we want to check in the simulation.
-In these hypothesis tests, the null hypothesis is that the simulation value matches the V&V target;
+In these hypothesis tests, the null hypothesis is that the simulation value comes from our V&V target distribution
+and the alternative hypothesis is that it comes from a prior distribution of bugs/errors;


There is something philosophically interesting here... The alternative hypothesis is that the prior has bugs/errors and they matter. It is possible that there is a bug that is not caught by this test. But then is it really a bug?

zmbc · 2023-10-31

Abie, that might (arguably) be the alternative hypothesis we want, but here I am describing the alternative hypothesis we are actually testing. As currently implemented, there is a distribution of rates if there is no bug (specified by the V&V target) and a distribution of rates if there is a bug (currently this prior is always the same). The latter can have mass at or around the correct values, which represents the situation you are describing: a bug that is accidentally right. We still include that as part of the alternative hypothesis.
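To make the two-distribution comparison above concrete, here is a minimal sketch of how such a check could be scored. It is not the code from ihmeuw/vivarium_census_prl_synth_pop#333. It assumes, purely for illustration, that the simulation reports a count of events out of a number of simulants, that the V&V target and the bug prior are both Beta distributions over the rate, and that the comparison is summarized as a log Bayes factor. All function names and parameter values are hypothetical.

```python
from math import exp, lgamma


def log_beta(a: float, b: float) -> float:
    """Log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_beta_binomial_pmf(k: int, n: int, a: float, b: float) -> float:
    """Log-probability of k events in n trials when the rate ~ Beta(a, b)."""
    log_choose = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
    return log_choose + log_beta(k + a, n - k + b) - log_beta(a, b)


def log_bayes_factor_bug(k: int, n: int, target_ab, bug_ab) -> float:
    """log[ P(data | bug) / P(data | no bug) ]; positive values favor a bug."""
    log_p_bug = log_beta_binomial_pmf(k, n, *bug_ab)
    log_p_no_bug = log_beta_binomial_pmf(k, n, *target_ab)
    return log_p_bug - log_p_no_bug


# Hypothetical inputs (assumptions, not values from the PR):
TARGET = (100.0, 900.0)  # "no bug" rates concentrated near 0.10
BUG_PRIOR = (1.0, 1.0)   # "bug" rates uniform on [0, 1], same for every check

# Observed rate of 0.10 matches the target; observed rate of 0.30 does not.
log_bf_close = log_bayes_factor_bug(k=100, n=1000, target_ab=TARGET, bug_ab=BUG_PRIOR)
log_bf_far = log_bayes_factor_bug(k=300, n=1000, target_ab=TARGET, bug_ab=BUG_PRIOR)

print(f"log Bayes factor, observed 100/1000: {log_bf_close:.2f}")
print(f"log Bayes factor, observed 300/1000: {log_bf_far:.2f}")
```

In this sketch the uniform bug prior puts some mass on the correct rate, which corresponds to the "accidentally right" bug: an observation near the target is still possible under the bug hypothesis, but the evidence favors "no bug" because the target distribution concentrates far more mass there.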

SylLutze · 2023-10-31T13:15:10Z

Hmm, I would say that a bug is still a bug because it's something we don't want in the code, even if it doesn't impact the results. For example, if I used a GBD 2018 value instead of a 2019 value, it's wrong, but it might not appear wrong in the outputs. But then yes, we're only testing for bugs that matter.

Also moves away from the null/alternative language.

Document a Bayesian approach to automated V&V

edd62ec

zmbc added the "meta modeling strategy" label (Docs not related to a single project in particular) Oct 30, 2023

zmbc requested review from aflaxman, NathanielBlairStahn, pletale, alibow and SylLutze October 30, 2023 16:54

zmbc mentioned this pull request Oct 30, 2023

Fuzzy checking: Bayesian version ihmeuw/vivarium_census_prl_synth_pop#333

Merged

aflaxman approved these changes Oct 30, 2023

SylLutze approved these changes Oct 31, 2023

zmbc and others added 5 commits November 3, 2023 10:41

Add additional details

16fe379

Expand on sensitivity/specificity ideas

76012b1

Add a TODO about more efficient runs

776b052

Add example of verification

c8746a0

Merge branch 'main' into automated_v_and_v_bayesian

a26d4d1

zmbc mentioned this pull request Nov 6, 2023

Add more about how to interpret hypotheses #1389

Merged

alibow approved these changes Nov 6, 2023

Merge branch 'main' into automated_v_and_v_bayesian

560147d

zmbc merged commit a0fcf17 into main Nov 7, 2023
2 checks passed

zmbc deleted the automated_v_and_v_bayesian branch November 7, 2023 16:54
