Resources we like for Explainable Artificial Intelligence: sites, papers, implementations, and more!
This page was set up as part of the Interpretable Learning Spring 2022 curriculum at UiT The Arctic University of Norway, but it is open to everyone who wants to contribute - send a pull request if you feel something is missing!
Briefly, explainable artificial intelligence (XAI) methods try to create machine learning models capable of explaining how they arrive at predictions. We can use these methods to verify that the models are using features relevant to prediction, to identify areas where the models need improvement, and to extend our own understanding of the problems the models solve.
We distinguish between:
- Intrinsic and post-hoc explanations: intrinsically interpretable models rely on the model design being inherently understandable, while post-hoc methods try to infer conclusions about the model after training
- Model-specific and model-agnostic explanations: model-specific methods rely on certain properties of the model design to provide explanations, while model-agnostic methods can be adapted for all models
- Local and global explanations: local explanations attempt to explain specific predictions for a limited set of data points, while global explanations describe patterns and rules that apply across all inputs
For a longer introduction, Nirmal Sobha Kartha's article for The Gradient, Explain Yourself - A Primer on ML Interpretability & Explainability, is highly recommended.
- pair-code/what-if-tool
- Tool for exploring black-box classification/regression models, including partial dependence plots and counterfactual analysis
- EthicalML/xai
- Python library with utilities for showing per-group statistical metrics, plotting feature importance, and upsampling/downsampling to balance a dataset against specific attributes
- Quantus
- Toolkit for quantitative evaluation of explanation methods
- scikit-learn
- Python library for data analysis and machine learning, with implementations of nearest-neighbors classification, Lasso regression, decision trees, and more; see the decision-tree sketch after this list
- glmnet
- R library for generalized linear models with Lasso or ElasticNet regression
- Alibi
- Python library for model inspection and interpretation with a scikit-learn-inspired interface
- marcotcr/lime
- Python library implementing the LIME (Local Interpretable Model-agnostic Explanations) method; see the usage sketch after this list
- slundberg/shap
- Python library implementing the SHAP (SHapley Additive exPlanations) method; see the usage sketch after this list
- marcotcr/anchor
- Python library implementing the scoped rules (anchors) from the paper Anchors: High-Precision Model-Agnostic Explanations
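
As a quick taste of an intrinsically interpretable model in scikit-learn, here is a minimal sketch that fits a shallow decision tree on the bundled iris dataset and prints the learned rules; the dataset, depth, and random seed are illustrative choices, not recommendations.

```python
# Minimal sketch: an intrinsically interpretable model with scikit-learn.
# The iris dataset and max_depth=3 are illustrative choices.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

data = load_iris()

# A shallow tree keeps the learned rules small enough to read directly.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(data.data, data.target)

# export_text renders the decision rules as nested if/else conditions.
print(export_text(tree, feature_names=data.feature_names))
```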
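
For marcotcr/lime, a minimal, hedged sketch of explaining a single tabular prediction looks roughly like this; the random-forest model, the iris data, and the parameter values are illustrative stand-ins rather than the library's own examples.

```python
# Minimal sketch of a local explanation with LIME (package: lime).
# Model, dataset, and parameters are illustrative stand-ins.
from lime.lime_tabular import LimeTabularExplainer
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

data = load_iris()
model = RandomForestClassifier(random_state=0).fit(data.data, data.target)

explainer = LimeTabularExplainer(
    data.data,
    feature_names=data.feature_names,
    class_names=data.target_names,
)

# LIME fits a simple surrogate model around this one data point and reports
# which features pushed the prediction up or down locally.
explanation = explainer.explain_instance(data.data[0], model.predict_proba, num_features=4)
print(explanation.as_list())
```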
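
For slundberg/shap, a similarly hedged sketch computing SHAP values for a tree ensemble (here a random-forest regressor on the bundled diabetes data, both illustrative choices) might look like:

```python
# Minimal sketch of SHAP values for a tree ensemble (package: shap).
# The regressor and dataset are illustrative; see the repository for the full API.
import shap
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor

data = load_diabetes()
model = RandomForestRegressor(random_state=0).fit(data.data, data.target)

# TreeExplainer computes Shapley values efficiently for tree-based models.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(data.data)

# Global summary plot: features ranked by their overall impact on predictions.
shap.summary_plot(shap_values, data.data, feature_names=data.feature_names)
```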
- Interpretable Machine Learning (2022) by Christoph Molnar
- Available for free online, for purchase in PDF/ebook format on Leanpub, and in print on lulu.com
- Interpretable AI by Ajay Thampi
- Part of Manning's MEAP (early access) series, to be published in May 2022
- CS335: Fair, Accountable, and Transparent (FAccT) Deep Learning at Stanford University
- Lectures 3 through 7 discuss interpretable models and explainability methods. Slides and recorded lectures are available.
- Explain Yourself - A Primer on ML Interpretability & Explainability (2021) by Kartha
- Summarizes the goals of interpretability/explainability methods, sets out the taxonomy mentioned in the introduction, discusses limitations and the future of explainable AI
- Towards A Rigorous Science of Interpretable Machine Learning (2017) by Doshi-Velez and Kim
- "Why Should I Trust You?": Explaining the Predictions of Any Classifier (2016) by Ribeiro et al.
- Visualizing and Understanding Convolutional Networks (2014) by Zeiler and Fergus
- Sanity Checks for Saliency Maps (2018) by Adebayo et al.
- Argues that visualization-based explainability methods are vulnerable to confirmation bias
- Proposes testing these methods on models with arbitrarily assigned weights or labels, to check whether the explanations reflect the model's actual behavior rather than simply highlighting shapes in the input; see the sketch after this list
- Counterfactual Explanations Without Opening The Black Box: Automated Decisions and the GDPR (2017) by Wachter, Mittelstadt and Russell
- Examines the legal grounding for "the right to explanation" in the GDPR as it is popularly interpreted
- Poses counterfactual explanation as an optimization problem; the objective is written out after this list
- Poly-CAM: High Resolution Class Activation Map for Convolutional Neural Networks (2022) by Englebert et al.
- Combines the activations of different layers, upsampling the later layers, to produce high-resolution class activation maps
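
To make the model-randomization test from Sanity Checks for Saliency Maps concrete, here is a minimal sketch under several assumptions of our own (a tiny PyTorch MLP on synthetic data, plain gradient saliency, and Spearman rank correlation as the similarity score - none of these specifics come from the paper):

```python
# Minimal sketch of a model-randomization sanity check.
# Assumptions: tiny MLP, synthetic data, plain gradient saliency,
# Spearman rank correlation as the similarity score.
import torch
import torch.nn as nn
from scipy.stats import spearmanr

def make_model():
    return nn.Sequential(nn.Linear(20, 32), nn.ReLU(), nn.Linear(32, 2))

def gradient_saliency(model, x):
    # Saliency = absolute gradient of the top-class score w.r.t. the input.
    x = x.clone().requires_grad_(True)
    out = model(x)
    out[0, out.argmax()].backward()
    return x.grad.abs().squeeze()

torch.manual_seed(0)
X = torch.randn(256, 20)
y = (X[:, 0] > 0).long()

# Briefly train one copy of the model; leave a second copy with random weights.
trained = make_model()
optimizer = torch.optim.Adam(trained.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()
for _ in range(200):
    optimizer.zero_grad()
    loss_fn(trained(X), y).backward()
    optimizer.step()
randomized = make_model()

sample = X[:1]
saliency_trained = gradient_saliency(trained, sample)
saliency_random = gradient_saliency(randomized, sample)

# A method that is actually sensitive to the model should give very different
# maps for the trained and the randomly initialized network (low correlation).
rho, _ = spearmanr(saliency_trained.numpy(), saliency_random.numpy())
print(f"rank correlation between trained and randomized saliency: {rho:.2f}")
```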
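
For reference, the optimization problem posed by Wachter et al. (written here in slightly simplified notation: x is the original input, x' the counterfactual, f_w the model, y' the desired output, d a distance measure, and lambda a weighting term) is roughly

$$\arg\min_{x'} \max_{\lambda} \; \lambda \left(f_w(x') - y'\right)^2 + d(x, x')$$

i.e. find the point x' closest to the original input for which the model produces the desired output y'.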