Add group-level weights argument `pw` to `gr()` and `mm()`, for #1719 #1727

bschneidr · 2025-01-20T19:37:23Z

This is a PR with draft implementation and tests of a weights argument in gr(), for the feature request in #1719. Would you mind reviewing Paul with any suggestions?

In the Stan code and data, the group-level weights are denoted GMW_{id} (for "group model weights")- I wasn't sure what the best way to name them in a way that fits the notation used elsewhere. The weights name in the Stan code is already used by the ad term resp | weights(w), and the mm() weights use the W_ syntax. So I wanted to pick a name for the group-level weights that was clearly different from the other existing names for different types of weights in the Stan code.

along with unit tests.

paul-buerkner · 2025-01-23T15:01:22Z

Thanks! Looks pretty good already upon first glance. I quickly went over it and added some comments/questions.

bschneidr · 2025-01-23T15:20:49Z

Thanks Paul- can you share a link to the added comments/questions? I don't see them anywhere (for example at the "Changed files" list ) after some searching in different places

R/data-predictor.R

R/formula-re.R

R/stan-predictor.R

R/data-predictor.R

paul-buerkner · 2025-01-23T15:28:51Z

Sorry forgot to press "submit review" :-D

…corresponding tests.

…n `.stan_re()` uses `mm()` and not `gr()`.

bschneidr · 2025-01-30T22:18:53Z

Thank you Paul- I've gone through and addressed each of those comments with some new commits and a statistical explanation in one place. I think that should resolve all the points you raised, but I'm happy to make further changes.

paul-buerkner · 2025-01-31T17:34:06Z

Thank you! Looks good for the most parts. Just two more comments remain.

…else()` function with `if else` control flow.

bschneidr · 2025-01-31T18:29:11Z

Thanks Paul! I've added a new commit to incorporate those changes.

paul-buerkner · 2025-01-31T20:44:28Z

thank you! now that I think more of it, these kind of weights should be possible also in mm terms right? for completeness it would be good to have these kind of weights already there. which brings us back to the naming question. could we perhaps come up with a name of two letters, e.g "pw" for prior weights or something like that? this could then be implemented for both gr and mm and would be clearly different naming wise than the weights argument name used in other places. what would you think about it? sorry for coming up with new stuff again. Ben Schneider ***@***.***> schrieb am Fr., 31. Jan. 2025, 19:29:

…

Thanks Paul! I've added a new commit to incorporate those changes. — Reply to this email directly, view it on GitHub <#1727 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADCW2AARLG6ZJSCEVUBI7LT2NO6I5AVCNFSM6AAAAABVRATBDOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMMRYGA2DAOJXG4> . You are receiving this because you commented.Message ID: ***@***.***>

bschneidr · 2025-01-31T21:12:47Z

I think that's a good idea to let mm() have weights too, and to disambiguate the names in that case: the name pw for prior or weights seems good to me in terms of clarifying the weights' purpose.

Do you think this kind of call makes sense?

y ~  x1 + ( 1 | mm(g1, g2, weights = cbind(w1, w2), pw = cbind(grpw1, grpw2))

I'm not sure if it would be necessary to do pw = cbind(grpw1, grpw2) instead of just pw = grpw; that is, if there are some groups that will only show up in g2 and not g1. Do you know if that's the case?

paul-buerkner · 2025-01-31T22:22:30Z

yeah I think we need cbind. if you think this is too much for this PR you can also just add the pw argument to mm but then error upon specification saying that it is not yet implemented. I don't want to burden you with too much new stuff necessarily Ben Schneider ***@***.***> schrieb am Fr., 31. Jan. 2025, 22:13:

…

I think that's a good idea to let mm() have weights too, and to disambiguate the names in that case: the name pw for prior or weights seems good to me in terms of clarifying the weights' purpose. Do you think this kind of call makes sense? y ~ x1 + ( 1 | mm(g1, g2, weights = cbind(w1, w2), pw = cbind(grpw1, grpw2)) I'm not sure if it would be necessary to do pw = cbind(grpw1, grpw2) instead of just pw = grpw; that is, if there are some groups that will only show up in g2 and not g1. Do you know if that's the case? — Reply to this email directly, view it on GitHub <#1727 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADCW2ACUFU5PAKNGERP2FHL2NPROLAVCNFSM6AAAAABVRATBDOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMMRYGQYDOMJRHE> . You are receiving this because you commented.Message ID: ***@***.***>

bschneidr · 2025-02-23T21:48:18Z

Got it, thanks for clarifying Paul. I've added a couple commits that do the following:

469f3eb: Consistently uses the argument name pw in both gr() and mm(), and in the generated Stan code represents these weights as PW (previous commits used GMW).
37763af Adds the pw argument to mm(), along with accompanying unit tests.

Can you take a look and let me know if there's anything else you'd like me to add?

paul-buerkner · 2025-02-24T13:19:23Z

Thank you! I will take another look again make few remaining edits before merging. :-)

paul-buerkner · 2025-02-24T15:30:44Z

I have now cleaned up the PR a bit, especially the mm part. The inconsitency induced by the matrix weights required a bit too much code duplication for my taste, so I changed things back a bit. pw in mm is not yet super safe to use so I labled it as experimental. The chances of people actually using these features together is very small and things will not fail silently so I guess people can complain if they need it and it doesn't run.

Can you double check that the code is still correct even after my edits? Once you give the okay, I will merge this PR.

bschneidr · 2025-02-24T17:02:50Z

Thanks, Paul. I think it's a good idea to clean it up. Unfortunately I think the mm() processing may be too clean right now. It errors out on this small reprex.

# Load the package
devtools::load_all()

# Generate multimembership data with prior weights
dat <- data.frame(
  y = rnorm(100), x1 = rnorm(100), x2 = rnorm(100),
  mw1 = rep(0.7, times = 100), mw2 = rep(0.3, times = 100),
  g1 = sample(1:10, 100, TRUE), g2 = sample(1:10, 100, TRUE)
)
pw_of_groups <- runif(n = 10, min = 0.9, max = 1.1) 
dat[['g1w']] <- pw_of_groups[dat[['g1']]]
dat[['g2w']] <- pw_of_groups[dat[['g2']]]
 
# multi-membership model with two members per group and equal weights
mm_standata <- brms::make_standata(
  y ~ x1 + (1|mm(g1, g2, weights = cbind(mw1, mw2), pw = cbind(g1w, g2w))), 
  data = dat
)
#> Warning: Support for prior weights in multimembership terms is experimental.
#> Error in tapply(X = group_prior_weights, INDEX = J, FUN = function(x) length(unique(x)) == : arguments must have same length

I think instead of trying to depend on the last grouping vector J to work (which I think it won't if it's missing one of the groups), it would be better to just reconstruct J so that it meets the requirements for the subsequent data processing. Here's a proposed update that simply reconstructs J by retrieving the output that was previously stored in out.

Here's a proposed update:

    # prepare data for group prior weights if specified
    if (nzchar(id_reframe$gcall[[1]]$pw)) {      
      if (id_reframe$gtype[1] == "mm") {
        warning2("Support for prior weights in multimembership terms is experimental.")
        J <- numeric()
        for (i in seq_along(gs)) {
          J <- c(J, out[[paste0("J_", idresp, "_", i)]])
        }
      }

Would you be open to me adding a commit with this change, and then updating the unit tests accordingly?

paul-buerkner · 2025-02-24T17:08:29Z

I think that's great! Would a simple J <- unlist(out[paste0("J_", idresp, "_", seq_along(gs))]) also work?

bschneidr · 2025-02-24T17:25:26Z

Thanks! That's cleaner still.

I just put that update into the latest commit and updated the unit test for the data. I tweaked the unit test so that the multimembership group data is a little more complex: some levels only appear in g1 or g2 and not both. So now the unit test confirms that the code is correctly picking up groups and group weights from all the grouping variables.

My preference would be to leave the warning message out since there's a working unit test for it, but I think it's OK to leave it in.

Now that I've seen the unit tests are passing and I've tried this out with example data, I'm comfortable with the PR being merged whenever you choose.

paul-buerkner · 2025-02-24T22:58:18Z

Perfect! I made some final edits and the PR is now ready for merging. Thank you for contributing to brms! I highly appreciate it!

Add weights argument to gr(), for paul-buerkner#1719, along

ac823b5

along with unit tests.

paul-buerkner requested changes Jan 23, 2025

View reviewed changes

bschneidr added 3 commits January 29, 2025 17:56

Throw informative errors or warning for bad group-level weights. Add …

6c49c23

…corresponding tests.

Refactoring suggested in code review and to catch potential issue whe…

1001ce6

…n `.stan_re()` uses `mm()` and not `gr()`.

In newdata, fill in group-level weights if not supplied

bd0bc55

Replace gr_weights with weights whenever unambiguous. Replace `if…

cdc69ec

…else()` function with `if else` control flow.

paul-buerkner added the feature label Feb 3, 2025

paul-buerkner added this to the brms 2.23.0 milestone Feb 3, 2025

bschneidr added 2 commits February 23, 2025 16:06

Consistent naming of prior weights for groups

469f3eb

Added pw argument to mm().

37763af

bschneidr changed the title ~~Add weights argument to gr(), for #1719~~ Add weights argument to gr() and mm(), for #1719 Feb 23, 2025

bschneidr changed the title ~~Add weights argument to gr() and mm(), for #1719~~ Add group-level weights argument pw to gr() and mm(), for #1719 Feb 23, 2025

clean up code related to prior weights

f5c03b1

Prior weights in mm() working. More robust unit test.

15fba28

minor cleaning

060eb9c

paul-buerkner added 2 commits February 24, 2025 23:53

prepare for merge

dbbd752

Merge branch 'master' into group-weights

6246d1c

paul-buerkner merged commit c741df7 into paul-buerkner:master Feb 24, 2025

paul-buerkner mentioned this pull request Feb 24, 2025

Feature request: Allow weights to also be specified at group level #1719

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add group-level weights argument `pw` to `gr()` and `mm()`, for #1719 #1727

Add group-level weights argument `pw` to `gr()` and `mm()`, for #1719 #1727

bschneidr commented Jan 20, 2025

paul-buerkner commented Jan 23, 2025

bschneidr commented Jan 23, 2025

paul-buerkner commented Jan 23, 2025

bschneidr commented Jan 30, 2025

paul-buerkner commented Jan 31, 2025

bschneidr commented Jan 31, 2025

paul-buerkner commented Jan 31, 2025 via email

bschneidr commented Jan 31, 2025

paul-buerkner commented Jan 31, 2025 via email

bschneidr commented Feb 23, 2025

paul-buerkner commented Feb 24, 2025

paul-buerkner commented Feb 24, 2025 •

edited

Loading

bschneidr commented Feb 24, 2025

paul-buerkner commented Feb 24, 2025

bschneidr commented Feb 24, 2025

paul-buerkner commented Feb 24, 2025

Add group-level weights argument pw to gr() and mm(), for #1719 #1727

Add group-level weights argument pw to gr() and mm(), for #1719 #1727

Conversation

bschneidr commented Jan 20, 2025

paul-buerkner commented Jan 23, 2025

bschneidr commented Jan 23, 2025

paul-buerkner commented Jan 23, 2025

bschneidr commented Jan 30, 2025

paul-buerkner commented Jan 31, 2025

bschneidr commented Jan 31, 2025

paul-buerkner commented Jan 31, 2025 via email

bschneidr commented Jan 31, 2025

paul-buerkner commented Jan 31, 2025 via email

bschneidr commented Feb 23, 2025

paul-buerkner commented Feb 24, 2025

paul-buerkner commented Feb 24, 2025 • edited Loading

bschneidr commented Feb 24, 2025

paul-buerkner commented Feb 24, 2025

bschneidr commented Feb 24, 2025

paul-buerkner commented Feb 24, 2025

Add group-level weights argument `pw` to `gr()` and `mm()`, for #1719 #1727

Add group-level weights argument `pw` to `gr()` and `mm()`, for #1719 #1727

paul-buerkner commented Feb 24, 2025 •

edited

Loading