Explainers

The GSNN library provides a comprehensive suite of explainability methods designed to interpret predictions from Graph Structured Neural Networks. These methods help researchers understand which edges contribute to predictions, how perturbations affect outcomes, and what minimal changes would lead to different results. Each explainer is tailored to different types of questions and provides unique insights into model behavior.

Note

Unlike traditional neural network explainers that focus on input features, GSNN explainers operate on the edge space or node space of the graph. This allows direct interpretation of how specific molecular interactions or pathway connections contribute to predictions. Many explainers support both edge-level (target='edge') and node-level (target='node') attributions.

Edge Attribution Methods

Integrated Gradients (IG) Explainer

The Integrated Gradients explainer computes per-edge attributions by integrating gradients along a straight-line path in feature space from a baseline input to the target observation. This method satisfies the completeness axiom, ensuring that the sum of all edge attributions equals the difference between the prediction and baseline prediction.

What the results indicate:

Positive scores indicate edges that contribute positively to the prediction
Negative scores indicate edges that contribute negatively
Near-zero scores indicate edges with minimal impact

When to use:

Understanding how individual edges contribute to a single prediction
When baseline comparisons are meaningful (e.g., comparing to “no activity” state)
For generating faithful attributions with theoretical guarantees

Batch processing:

The explainer supports multiple observations via the reduction parameter:

reduction='mean' — average attributions across all samples
reduction='sum' — sum attributions across all samples
reduction='none' — return attributions for each sample separately

Strengths:

Mathematically principled with completeness guarantees
Relatively stable across different baseline choices
Computationally efficient for single predictions
Supports batch processing for multiple observations

Weaknesses:

Requires careful baseline selection
May struggle with highly nonlinear interactions

Contrastive Integrated Gradients Explainer

The Contrastive IG explainer extends Integrated Gradients to contrastive questions by attributing the prediction difference Δf = f(x₁) - f(x₂) to individual edges or nodes. This method integrates along mask paths while keeping both inputs fixed, making it ideal for understanding why the model predicts differently for two related observations.

What the results indicate:

Positive scores indicate elements that increase the absolute prediction difference |Δf|
Negative scores indicate elements that decrease the absolute prediction difference
Near-zero scores indicate elements irrelevant to the contrast

Attribution targets:

target='edge' — attribute to individual edges (default)
target='node' — attribute to individual nodes

When to use:

Comparing predictions between related samples (e.g., diseased vs. healthy)
Understanding differential pathway activation
When interested in relative rather than absolute importance

Batch processing:

The explainer supports multiple observation pairs via the reduction parameter:

reduction='mean' — average attributions across all pairs
reduction='sum' — sum attributions across all pairs
reduction='none' — return attributions for each pair separately

Strengths:

Principled approach to contrastive explanations
Maintains completeness axiom for prediction differences
Excellent for comparative analysis
Supports both edge and node level attributions
Supports batch processing for multiple observation pairs

Weaknesses:

Requires paired observations for meaningful interpretation
More computationally expensive than single-input methods
May be sensitive to input selection

Direct Perturbation Methods

Occlusion Explainer

The Occlusion explainer provides a direct, model-agnostic measure of edge importance by systematically removing each edge and measuring the resulting change in prediction. This “knock-out” approach offers intuitive interpretability by directly quantifying the effect of completely removing each edge.

What the results indicate:

Positive scores indicate edges that contribute positively (removal decreases prediction)
Negative scores indicate edges that inhibit the prediction (removal increases prediction)
Near-zero scores indicate edges with no measurable impact

When to use:

When direct causal interpretation is needed
For validating other explanation methods
When computational budget allows exhaustive perturbation testing
For non-differentiable models or when gradients are unreliable

Batch processing:

The explainer supports multiple observations via the reduction parameter:

reduction='mean' — average attributions across all samples
reduction='sum' — sum attributions across all samples
reduction='none' — return attributions for each sample separately

Strengths:

Intuitive and direct interpretation
Model-agnostic (works with any architecture)
Provides clear causal insights
No baseline selection required
Supports batch processing for multiple observations

Weaknesses:

Computationally expensive (scales linearly with number of edges)
May miss interaction effects between edges
Can be unstable for models with sharp decision boundaries

Contrastive Occlusion Explainer

The Contrastive Occlusion explainer extends the occlusion approach to contrastive scenarios by measuring how edge or node removal affects the absolute prediction difference between two inputs. This method identifies elements that specifically contribute to differential predictions.

What the results indicate:

Positive scores indicate elements whose removal decreases |Δf| (element contributes to difference)
Negative scores indicate elements whose removal increases |Δf| (element reduces difference)
Near-zero scores indicate elements with no impact on the prediction difference

Attribution targets:

target='edge' — attribute to individual edges (default)
target='node' — attribute to individual nodes

When to use:

Identifying pathway differences between conditions
Understanding mechanism-specific effects
When gradient-based contrastive methods are not applicable

Batch processing:

The explainer supports multiple observation pairs via the reduction parameter:

reduction='mean' — average attributions across all pairs
reduction='sum' — sum attributions across all pairs
reduction='none' — return attributions for each pair separately

Element masking:

Use the element_mask parameter to restrict occlusion to a subset of edges or nodes, which can significantly reduce computation time when only certain elements are of interest.

Strengths:

Direct measurement of differential importance
Model-agnostic approach
Clear interpretation for comparative studies
Supports both edge and node level attributions
Supports batch processing and element masking

Weaknesses:

Computationally intensive (scales with number of elements)
Limited to pairwise comparisons
May miss subtle interaction effects

Optimization-Based Methods

GSNN Explainer

The GSNN explainer learns a sparse binary edge mask that maximizes fidelity to the original prediction while minimizing the number of active edges. Using a differentiable Gumbel-Softmax relaxation, this method identifies the minimal set of edges necessary to reproduce the target prediction.

What the results indicate:

Scores near 1 indicate edges essential for reproducing the original prediction
Scores near 0 indicate edges that can be removed with minimal impact
The overall mask reveals the minimal sufficient subgraph for the prediction

When to use:

Identifying core pathways or mechanisms
When sparsity is desired (e.g., for downstream analysis or intervention)
Understanding model redundancy and robustness
Generating simplified explanatory models

Strengths:

Produces inherently sparse explanations
Balances fidelity with simplicity
Differentiable optimization allows flexible objective functions
Can incorporate domain knowledge through constraints

Weaknesses:

Requires careful hyperparameter tuning (sparsity vs. fidelity trade-off)
May converge to local optima
Computational overhead for iterative optimization
Binary masks may miss nuanced importance gradients

Contrastive GSNN Explainer

The Contrastive GSNN explainer learns a sparse binary mask (edge or node level) that maximizes fidelity to the prediction difference Δf = f(x₁) - f(x₂) while minimizing the number of active elements. This method identifies the minimal set of elements necessary to reproduce the differential prediction between two inputs.

When given multiple input pairs (batch), the explainer learns a single mask that works well across all pairs simultaneously by treating the differences as a multivariate MSE objective. This is much more efficient than per-sample optimization.

What the results indicate:

Scores near 1 indicate elements essential for reproducing the prediction difference
Scores near 0 indicate elements that can be removed without affecting the difference
The overall mask reveals the minimal contrastive subgraph

Attribution targets:

target='edge' — learn edge-level mask (default)
target='node' — learn node-level mask

When to use:

Identifying edges/nodes that drive differential predictions between conditions
When sparse contrastive explanations are needed
Understanding which pathway components are responsible for prediction differences
Comparing multiple pairs efficiently with a single mask

Hyperparameter tuning:

The tune() method automatically finds the optimal beta parameter to balance sparsity and fidelity. It searches for the maximum sparsity while maintaining a minimum fidelity threshold (default 0.9).

Strengths:

Produces sparse contrastive explanations
Efficient batch processing with shared mask optimization
Supports both edge and node level attributions
Automatic hyperparameter tuning via tune() method
Multivariate MSE objective preserves sign and target dimensions

Weaknesses:

Requires paired observations
May converge to local optima
Hyperparameter sensitivity (beta, learning rate)
Single mask may not capture sample-specific differences

Counterfactual Explainer

The Counterfactual explainer learns minimal perturbations to input features that achieve a target prediction. Using gradient descent with L2 regularization, this method answers “what is the smallest change needed to reach a desired outcome?” This approach is particularly valuable for understanding model decision boundaries and generating actionable insights.

What the results indicate:

Positive perturbations indicate features that need to be increased to reach the target
Negative perturbations indicate features that need to be decreased
Near-zero perturbations indicate features irrelevant for achieving the target
The magnitude indicates how much change is needed

When to use:

Understanding how to achieve desired outcomes (e.g., therapeutic targets)
Identifying minimal interventions
Exploring model decision boundaries
Generating “what-if” scenarios for intervention planning

Strengths:

Directly actionable insights for intervention
Incorporates minimality constraint naturally
Flexible targeting (specific outputs or full prediction vectors)
Supports feature masking for constrained optimization

Weaknesses:

May find local rather than global minima
Requires differentiable models
Sensitive to hyperparameter choice (learning rate, weight decay)
Limited to continuous perturbations

Robustness and Stability Methods

Noise Tunnel

The Noise Tunnel method enhances the stability and robustness of gradient-based explainers by running them multiple times with Gaussian noise injected into the edge-mask space, then aggregating the results. This approach is inspired by SmoothGrad but is adapted specifically for GSNN’s edge-based architecture.

What the results indicate:

Smoothed attribution scores that are more robust to model sensitivity
Confidence intervals through multiple noisy samples
Stable rankings of edge importance less susceptible to noise

When to use:

When base explainer results are noisy or unstable
For more reliable feature selection based on explanations
When model has sharp gradients or discontinuities
For producing confidence estimates on attributions

Strengths:

Significantly improves stability of gradient-based methods
Provides uncertainty quantification for explanations
Can be applied to any gradient-based explainer
Helps identify robust vs. artifact attributions

Weaknesses:

Computationally expensive (multiple runs required)
May over-smooth important sharp transitions
Requires careful noise level selection
Limited to methods that accept noise injection

Choosing the Right Explainer

For Single Predictions:

Use IG Explainer for theoretically grounded attributions with completeness guarantees
Use Occlusion Explainer for direct, model-agnostic importance measures
Use GSNN Explainer when you need sparse, minimal explanations
Use Counterfactual Explainer for actionable intervention insights

For Comparative Analysis:

Use Contrastive IG Explainer for principled differential attribution
Use Contrastive Occlusion Explainer for model-agnostic comparative analysis
Use Contrastive GSNN Explainer when you need sparse contrastive explanations

For Batch Processing:

All explainers support batch processing via the reduction parameter
Contrastive GSNN Explainer is particularly efficient for batches (learns single shared mask)

For Robust Explanations:

Wrap gradient-based methods with Noise Tunnel for stability
Compare results across multiple explainers for validation

Computational Considerations:

Fastest: IG Explainer
Moderate: GSNN Explainer, Contrastive GSNN Explainer (batch)
Slowest: Occlusion-based methods (scale with graph size)

Note

Best Practice: For critical applications, we recommend using multiple complementary explainers and comparing their results. Convergent findings across different methods provide stronger evidence for interpretation validity.