KernelSHAP / Lime Improvements #619
Conversation
vivekmig
commented
Feb 18, 2021
- Adds support for generators as the perturb function for Lime, with corresponding tests
- Modifies KernelSHAP to first sample the number of selected features from a categorical distribution and then randomly sample binary vectors with that many selected features. This is theoretically equivalent to the previous approach of weighting randomly selected vectors, but it scales better computationally with larger numbers of features, since the weights for large feature counts lead to arithmetic underflow.
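The two-stage sampling described above can be sketched as a generator (a simplified illustration with hypothetical names, not Captum's exact implementation):

```python
import torch

def kernel_shap_perturb_sketch(num_features: int):
    """Generator yielding binary perturbation vectors for KernelSHAP.

    Stage 1: draw k (number of selected features) from the categorical
    distribution p(k) proportional to (M - 1) / (k * (M - k)), the Shapley
    kernel weight aggregated over all (M choose k) vectors with k ones.
    Stage 2: pick one of those (M choose k) vectors uniformly, by ranking
    i.i.d. Gaussian noise and keeping the k largest entries.
    """
    ks = torch.arange(1, num_features)  # k = 1 .. M - 1
    probs = (num_features - 1) / (ks * (num_features - ks))
    probs = probs / probs.sum()
    while True:
        k = int(torch.multinomial(probs, 1).item()) + 1
        rand_vals = torch.randn(num_features)
        threshold = torch.kthvalue(rand_vals, num_features - k).values.item()
        yield (rand_vals > threshold).long()
```

Because only k is drawn from the skewed distribution and each vector is then sampled directly, no per-sample weights with extreme magnitudes are needed, which avoids the underflow issue mentioned above.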
@vivekmig has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
First pass - mostly nits
captum/attr/_core/kernel_shap.py
Outdated
```python
    return torch.tensor([similarities])


def kernel_shap_perturb_generator(
    original_inp, **kwargs
```
Type hint? I assume original_inp is Union[Tensor, Tuple[Tensor, ...]].
captum/attr/_core/kernel_shap.py
Outdated
```diff
-    # weight to 100 (all other weights are < 1).
-    similarities = 100.0
+    # weight to 1000000 (all other weights are 1).
+    similarities = 1000000.0
```
I doubt this would be a concern, but just in case, we could add this as a default parameter to this method. With this, users can use functools.partial to change the value in case it is not sufficient.
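The functools.partial suggestion could look like the following sketch (the function name, signature, and weighting logic are hypothetical simplifications, not Captum's actual code):

```python
import torch
from functools import partial

def similarity_kernel(original_inp, perturbed_inp, interpretable_sample,
                      max_weight: float = 1_000_000.0, **kwargs):
    # Hypothetical Shapley-kernel-style weight: the all-ones / all-zeros
    # samples get the large max_weight; everything else gets weight 1.
    if bool((interpretable_sample == 1).all()) or bool((interpretable_sample == 0).all()):
        return torch.tensor([max_weight])
    return torch.tensor([1.0])

# Users can override the default without changing any call sites:
custom_kernel = partial(similarity_kernel, max_weight=1e9)
```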
Yes, it's a good point that this could need to be customized. For now, to avoid having too many parameters, I can make this an instance variable that advanced users can override on the object after creation, but we can make it a parameter later if necessary.
captum/attr/_core/lime.py
Outdated
```diff
@@ -72,7 +73,7 @@ def __init__(
     forward_func: Callable,
     interpretable_model: Model,
     similarity_func: Callable,
-    perturb_func: Callable,
+    perturb_func: Union[Callable],
```
Missing type hint in the Union?
Good catch, thanks! Forgot to revert this change.
Thanks for the review, @miguelmartin75! Addressed comments.
Thank you for working on this PR, @vivekmig!
I left a couple of nits. I think it would be good to describe this trick a bit in the code, since the original approach in the paper is somewhat different in terms of the kernel similarity function.
```python
threshold = torch.kthvalue(
    rand_vals, num_features - num_selected_features
).values.item()
yield (rand_vals > threshold).to(device=device).long()
```
nit: Can we please describe a bit why we follow this logic instead of the default behavior in default_perturb_func:

captum/attr/_core/lime.py, line 612 (commit 6b71d66):

```python
def default_perturb_func(original_inp, **kwargs):
```

I think default_perturb_func is missing type hints too.
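For contrast, a default perturbation function like the one referenced above would typically sample each interpretable feature independently (a hedged sketch under that assumption, not Captum's exact implementation):

```python
import torch

def default_perturb_sketch(original_inp, **kwargs):
    # Assumed behavior: keep or drop each interpretable feature
    # independently with probability 0.5, rather than first drawing
    # the number of kept features as the KernelSHAP generator does.
    num_features = kwargs["num_interp_features"]
    return torch.bernoulli(torch.full((1, num_features), 0.5)).long()
```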
Sure, will add more documentation on this. There are a few other helper methods in Lime without type hints, so will add them together in a separate PR.
Thanks for the review, @NarineK! Addressed comments.
@vivekmig has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
captum/attr/_core/kernel_shap.py
Outdated
```
Perturbations are sampled by the following process:
- Choose k (number of selected features), based on the distribution
      p(k) = (M - 1) / (k * (M - k))
  where M is the total number of features
```
nit: total number of features in the interpretable space?
Thank you for the explanation! Looks great! Maybe you could add to the description that each of the (M choose k) samples has equal probability of being chosen, which is why we do:

```python
rand_vals = torch.randn(1, num_features)
threshold = torch.kthvalue(...)
```

If I remember your explanation correctly.
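The equal-probability claim can be checked empirically: since the Gaussian draws are exchangeable, ranking them and keeping the top k selects every size-k subset with the same probability. A small self-contained check (names are illustrative):

```python
import torch
from collections import Counter

def sample_mask(num_features: int, k: int) -> tuple:
    # Keep the k largest of M i.i.d. Gaussian values; exchangeability
    # makes each of the (M choose k) masks equally likely.
    rand_vals = torch.randn(num_features)
    threshold = torch.kthvalue(rand_vals, num_features - k).values.item()
    return tuple((rand_vals > threshold).long().tolist())

torch.manual_seed(0)
counts = Counter(sample_mask(4, 2) for _ in range(12000))
assert len(counts) == 6  # 4 choose 2 = 6 distinct masks
# Each mask should appear roughly 12000 / 6 = 2000 times.
assert all(abs(c - 2000) < 300 for c in counts.values())
```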
@vivekmig has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.