SHAP demystified: understand what Shapley values are and how they work

Question 1

What are Shapley values in machine learning?

Accepted Answer

Shapley values are a result from game theory that apply to ML by treating input features as players that cooperate to generate the model's output. For a black-box ML model that makes predictions based on a set of features, Shapley values help understand the influence each feature had on the model's predictions. The SHAP scores represent each feature's contribution, and when added together, they equal the model's output.

Question 2

How do SHAP values work for explaining ML model predictions?

Accepted Answer

SHAP values work by computing a weighted average of the marginal contributions each feature made to the prediction in different scenarios. The formula calculates how much each feature added to the output by evaluating the model across all possible subsets of features. This allows SHAP to fairly attribute the model's prediction to each input feature based on their individual contributions.

Question 3

What are the main challenges with implementing SHAP in practice?

Accepted Answer

Two main challenges exist for implementing SHAP: First, evaluating the model's output for any subset of input features when the model was trained using all features. SHAP overcomes this by using conditional expectations and assuming feature independence. Second, the combinatorial explosion problem where computing Shapley values requires summing over 2^(d-1) terms for d features. SHAP addresses this by cleverly using sampling methods.

Question 4

How does SHAP relate to LIME for model explainability?

Accepted Answer

LIME and SHAP are very connected methods for model explainability. They are both solutions to the same optimization problem, but with slightly different choices made along the way. Both fall into the category of additive feature attribution methods, as shown in the paper 'A Unified Approach to Interpreting Model Predictions' by Scott Lundberg and Su-In Lee.

Question 5

What is the practical value of using SHAP for ML model debugging?

Accepted Answer

SHAP provides practical value by helping identify which features contributed most to mispredictions. For example, in a churn prediction model, SHAP can reveal that Age was the feature that contributed most to a false positive prediction. This enables what-if analysis where you can modify feature values to see how the model's prediction would change, helping teams understand and debug model behavior.

Stop guessing. Ship with confidence.

The AI governance and observability platform

We value your privacy