Enable fantasy models for multitask GPs Reborn #2317
Conversation
```
@@ -285,7 +302,11 @@ def exact_predictive_mean(self, test_mean: Tensor, test_train_covar: LinearOperator):
# NOTE TO FUTURE SELF:
# You **cannot* use addmv here, because test_train_covar may not actually be a non lazy tensor even for an exact
# GP, and using addmv requires you to to_dense test_train_covar, which is obviously a huge no-no!
res = (test_train_covar @ self.mean_cache.unsqueeze(-1)).squeeze(-1)

if len(self.mean_cache.shape) == 4:
```
The main issue I have with this PR is this line. While working with a simple BO loop using BoTorch to test my code changes and observe the shapes of everything going through the code, I found that sometimes `test_train_covar` would be of size [5, 1, 4, 24] and `self.mean_cache` would be of size [5, 1, 1, 24]. This is why I have an if-statement here; I'm not sure whether there's a better way to check if we're working with a derivative-enabled GP in this function.

These observed shapes are also why my unit test has `new_x` and `new_y` with shapes (1, 1, dim) and (num_fantasies, 1, 1, 1 + dim) respectively, since that's what I observed from my test code.
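To make the shape issue concrete, here is a small repro of what the plain matmul does with the shapes reported above. The tensors are random stand-ins, and the squeeze-based variant at the end is only an illustration of one way the batch dims could be aligned, not necessarily what this PR's branch does:

```python
import torch

# Shapes taken from the comment above; values are random stand-ins.
test_train_covar = torch.randn(5, 1, 4, 24)   # [5, 1, 4, 24]
mean_cache = torch.randn(5, 1, 1, 24)         # [5, 1, 1, 24]

# The existing line: unsqueeze(-1) turns mean_cache into [5, 1, 1, 24, 1], and
# batched matmul then broadcasts batch dims (5, 1) against (5, 1, 1),
# producing a spurious extra batch dimension.
res = (test_train_covar @ mean_cache.unsqueeze(-1)).squeeze(-1)
print(res.shape)  # torch.Size([5, 5, 1, 4])

# Dropping the singleton dim first keeps the batch dims aligned:
# [5, 1, 4, 24] @ [5, 1, 24, 1] -> [5, 1, 4, 1].
res_aligned = (test_train_covar @ mean_cache.squeeze(-2).unsqueeze(-1)).squeeze(-1)
print(res_aligned.shape)  # torch.Size([5, 1, 4])
```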
I'm curious if anyone has any thoughts about this!
> I found that sometimes test_train_covar would be of size [5,1,4,24] and self.mean_cache would be of size [5,1,1,24]

Do you have a sense for why this is? Is this some insufficient invalidation of `self.mean_cache`? Figured I'd ask first before getting into a rabbit hole here...
Honestly I'm not 100% sure, but `self.mean_cache` appears to be set to size [5, 1, 1, 24] from a call to exact_gp.py:239 when the new `prediction_strategy` is created.

This is called from `get_fantasy_model(self, inputs, targets)` where

```
# The below are torch tensors, I just show their dimensions
inputs = [torch.Size([1, 1, 3])]
targets = torch.Size([5, 1, 1, 4])
```

And `old_pred_strat.get_fantasy_strategy(inputs, targets, full_inputs, full_targets, full_output)` is called with:

```
# The below are torch tensors, I just show their dimensions
inputs = [torch.Size([1, 1, 3])]
targets = torch.Size([5, 1, 1, 4])
full_inputs = [torch.Size([1, 6, 3])]
full_targets = torch.Size([5, 1, 6, 4])
full_output = MultitaskMultivariateNormal(loc: torch.Size([1, 24]))
```
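As a side note on where the 24 comes from: with 6 full training inputs and 4 tasks, the interleaved mean of a `MultitaskMultivariateNormal` flattens to 6 × 4 = 24 entries, which matches the `loc: torch.Size([1, 24])` above. A tiny illustration with stand-in values (the identity covariance is just a placeholder):

```python
import torch
from gpytorch.distributions import MultitaskMultivariateNormal

# Stand-in values: 6 points, 4 tasks, identity covariance as a placeholder.
mean = torch.zeros(6, 4)     # (n, num_tasks)
covar = torch.eye(6 * 4)     # interleaved (n * num_tasks) x (n * num_tasks)
full_output = MultitaskMultivariateNormal(mean, covar)

print(full_output.mean.shape)  # torch.Size([6, 4])  -- per-point, per-task mean
print(full_output.loc.shape)   # torch.Size([24])    -- flattened, interleaved mean
```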
The `targets` come from model.py:335 in BoTorch, from a call to `sampler`. I don't specify a sampler in my code; I just use whatever default sampler is in `qKnowledgeGradient`:
```python
scal_transf = ScalarizedPosteriorTransform(weights=torch.tensor([1.0] + [0.0] * dim, dtype=torch.double))

# Define qKG acquisition function
qKG = qKnowledgeGradient(model, posterior_transform=scal_transf, num_fantasies=5)
```
Hopefully this helps! I'm not sure what the expected behavior should be, but please let me know how I can help.
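For extra context, this is roughly the kind of call that ends up exercising `get_fantasy_model` / `get_fantasy_strategy` through qKG's forward pass. It is only a sketch: `model` stands for an already-fitted derivative-enabled multitask GP wrapped as a BoTorch model, and `dim` and `bounds` are placeholders rather than the exact values from the script discussed here.

```python
import torch
from botorch.acquisition import qKnowledgeGradient
from botorch.acquisition.objective import ScalarizedPosteriorTransform
from botorch.optim import optimize_acqf

dim = 3                                                    # placeholder input dimension
bounds = torch.stack([torch.zeros(dim), torch.ones(dim)])  # 2 x dim box bounds
# `model` is assumed: a fitted derivative-enabled multitask GP exposed as a BoTorch model.

# Scalarize the (value, gradient) outputs down to the function value only.
scal_transf = ScalarizedPosteriorTransform(
    weights=torch.tensor([1.0] + [0.0] * dim, dtype=torch.double)
)
qKG = qKnowledgeGradient(model, posterior_transform=scal_transf, num_fantasies=5)

# Optimizing the acquisition evaluates qKG, which fantasizes the model internally;
# that is the path that reaches get_fantasy_model / get_fantasy_strategy.
candidate, acq_value = optimize_acqf(
    acq_function=qKG,
    bounds=bounds,
    q=1,
    num_restarts=10,
    raw_samples=128,
)
```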
I wonder if the extra 1 appearing is just a soft incompatibility we never noticed, where BoTorch requires an explicit task dim for the labels but we don't in gpytorch. Indeed, my default is usually to have a single-dim label vector, so when writing the code something like this could have slipped by me.
Thanks for taking on this effort!
```
@@ -209,6 +223,9 @@ def get_fantasy_strategy(self, inputs, targets, full_inputs, full_targets, full_output):
new_root = BatchRepeatLinearOperator(DenseLinearOperator(new_root), repeat_shape)
# no need to repeat the covar cache, broadcasting will do the right thing

if isinstance(full_output, MultitaskMultivariateNormal):
    full_mean = full_mean.view(*target_batch_shape, -1, num_tasks).contiguous()
```
Instead of `.view().contiguous()`, you can also just use `reshape()` here.
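A quick check of the suggestion, with stand-in shapes matching the [5, 1, 6, 4] full targets discussed above (the tensors here are dummies, not the library's own variables):

```python
import torch

# Dummy stand-ins for the variables in the diff above.
target_batch_shape, num_tasks = torch.Size([5, 1]), 4
full_mean = torch.randn(5, 1, 24)  # batch x (n * num_tasks), here 24 = 6 * 4

a = full_mean.view(*target_batch_shape, -1, num_tasks).contiguous()
b = full_mean.reshape(*target_batch_shape, -1, num_tasks)

assert a.shape == b.shape == torch.Size([5, 1, 6, 4])
assert torch.equal(a, b)
# reshape() gives the same result and also handles layouts that view() cannot
# express without an explicit copy.
```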
Hey! Wondering if there is anything else I should do to try to get this through? I'm graduating soon and will have limited time to work on this in the fall, with some time over the summer, but I'll be interning full time. Thanks!

We should definitely try to get this in, thanks for putting this up! Things mostly lgtm, other than Max's comment.
Went through more carefully, lgtm!
This pull request (PR) takes this previous pull request and completes it. These code changes allow `get_fantasy_model` to work with multitask Gaussian processes (GPs). This, in turn, allows derivative-enabled GPs to work with the qKG (Knowledge Gradient) acquisition function from the BoTorch package. This PR is part of my effort to get derivative-enabled GPs to work in BoTorch.

I have verified that these changes do not break existing unit tests, and I have added a new unit test which exercises the added code.

Please take a look and let me know what I should change!