
Reduce unnecessary reliance and usage of numpy #746

Closed
aobo-y opened this issue Aug 31, 2021 · 4 comments

@aobo-y
Contributor

aobo-y commented Aug 31, 2021

This search gives us all the usages of NumPy in Captum: https://github.com/pytorch/captum/search?p=1&q=numpy
Many of them are not necessary:

  • either unused imports
  • or functionality that can be replaced with PyTorch equivalents

In order to reduce the unneeded dependency, we can investigate the current usages and remove or replace them when possible.
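
For illustration, here is a minimal sketch of the kind of replacement the second bullet covers (the function and variable names are hypothetical examples, not actual Captum code):

import torch

def mean_attribution(attributions: torch.Tensor) -> torch.Tensor:
    # Before: a pattern like np.mean(attributions.detach().cpu().numpy(), axis=0),
    # which forces a device-to-host copy and leaves the tensor world.
    # After: the equivalent reduction stays on-device as a torch operation.
    return attributions.mean(dim=0)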

aobo-y self-assigned this Aug 31, 2021
@bilalsal
Contributor

Excellent find, @aobo-y!
Indeed, we noticed a significant performance gain when we avoided .numpy() conversions while working on CCA and CKA.
You might need to pay attention to numerical-precision issues, especially to ensure that the unit tests continue to work meaningfully.

Thank you for bringing up this good proposal!

@aobo-y
Contributor Author

aobo-y commented Sep 1, 2021

Thank you for the heads-up, @bilalsal!
I originally noticed this issue when helping @NarineK remove numpy for an internal use case (#714).
I will create a series of small PRs, each containing only tightly related changes, instead of one PR for everything, so that we can review and discuss the removals/replacements case by case.

facebook-github-bot pushed a commit that referenced this issue Sep 10, 2021
Summary:
as title, for issue #746

Pull Request resolved: #755

Reviewed By: bilalsal

Differential Revision: D30857228

Pulled By: aobo-y

fbshipit-source-id: fbf258bc0af2ab1d39557678fd31ccbe37594669
@NarineK
Contributor

NarineK commented Sep 23, 2021

@aobo-y can we close this issue since it has been addressed?

@aobo-y
Contributor Author

aobo-y commented Sep 23, 2021

@NarineK sure, I think we have cleared almost all of the unnecessary cases.

However, there are still some tricky ones left. For example,

rand_coefficient = torch.tensor(
    np.random.uniform(0.0, 1.0, inputs[0].shape[0]),
    device=inputs[0].device,
    dtype=inputs[0].dtype,
)

is a perfect scenario for applying torch.rand(...) directly. But the change will require quite a bit of extra effort to fix many tests, because even with a fixed seed, torch's random number generator gives different values than numpy's. As you can imagine, this means updating the expected values and relaxing many "almost-equal" asserts.
I will create separate issues dedicated to them.
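
For reference, a minimal sketch of the torch-native replacement described above (the wrapper function name is hypothetical and for illustration only; this is not the actual change that landed):

import torch
from typing import Tuple

def draw_rand_coefficients(inputs: Tuple[torch.Tensor, ...]) -> torch.Tensor:
    # torch.rand samples uniformly from [0, 1), matching np.random.uniform(0.0, 1.0),
    # and creates the tensor directly on the inputs' device with the inputs' dtype,
    # avoiding the intermediate numpy array and the extra copy in torch.tensor(...).
    return torch.rand(
        inputs[0].shape[0],
        device=inputs[0].device,
        dtype=inputs[0].dtype,
    )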

aobo-y closed this as completed Sep 23, 2021