Explainers
The GSNN library provides a comprehensive suite of explainability methods designed to interpret predictions from Graph Structured Neural Networks. These methods help researchers understand which edges contribute to predictions, how perturbations affect outcomes, and what minimal changes would lead to different results. Each explainer is tailored to different types of questions and provides unique insights into model behavior.
Note
Unlike traditional neural network explainers that focus on input features, GSNN explainers operate on the edge space or node space of the graph. This allows direct interpretation of how specific molecular interactions or pathway connections contribute to predictions. Many explainers support both edge-level (target='edge') and node-level (target='node') attributions.
Edge Attribution Methods
Integrated Gradients (IG) Explainer
The Integrated Gradients explainer computes per-edge attributions by integrating gradients along a straight-line path in feature space from a baseline input to the target observation. This method satisfies the completeness axiom, ensuring that the sum of all edge attributions equals the difference between the prediction and baseline prediction.
- What the results indicate:
Positive scores indicate edges that contribute positively to the prediction
Negative scores indicate edges that contribute negatively
Near-zero scores indicate edges with minimal impact
- When to use:
Understanding how individual edges contribute to a single prediction
When baseline comparisons are meaningful (e.g., comparing to “no activity” state)
For generating faithful attributions with theoretical guarantees
- Batch processing:
The explainer supports multiple observations via the
reductionparameter:reduction='mean'— average attributions across all samplesreduction='sum'— sum attributions across all samplesreduction='none'— return attributions for each sample separately
- Strengths:
Mathematically principled with completeness guarantees
Relatively stable across different baseline choices
Computationally efficient for single predictions
Supports batch processing for multiple observations
- Weaknesses:
Requires careful baseline selection
May struggle with highly nonlinear interactions
Contrastive Integrated Gradients Explainer
The Contrastive IG explainer extends Integrated Gradients to contrastive questions by attributing the prediction difference Δf = f(x₁) - f(x₂) to individual edges or nodes. This method integrates along mask paths while keeping both inputs fixed, making it ideal for understanding why the model predicts differently for two related observations.
- What the results indicate:
Positive scores indicate elements that increase the absolute prediction difference |Δf|
Negative scores indicate elements that decrease the absolute prediction difference
Near-zero scores indicate elements irrelevant to the contrast
- Attribution targets:
target='edge'— attribute to individual edges (default)target='node'— attribute to individual nodes
- When to use:
Comparing predictions between related samples (e.g., diseased vs. healthy)
Understanding differential pathway activation
When interested in relative rather than absolute importance
- Batch processing:
The explainer supports multiple observation pairs via the
reductionparameter:reduction='mean'— average attributions across all pairsreduction='sum'— sum attributions across all pairsreduction='none'— return attributions for each pair separately
- Strengths:
Principled approach to contrastive explanations
Maintains completeness axiom for prediction differences
Excellent for comparative analysis
Supports both edge and node level attributions
Supports batch processing for multiple observation pairs
- Weaknesses:
Requires paired observations for meaningful interpretation
More computationally expensive than single-input methods
May be sensitive to input selection
Direct Perturbation Methods
Occlusion Explainer
The Occlusion explainer provides a direct, model-agnostic measure of edge importance by systematically removing each edge and measuring the resulting change in prediction. This “knock-out” approach offers intuitive interpretability by directly quantifying the effect of completely removing each edge.
- What the results indicate:
Positive scores indicate edges that contribute positively (removal decreases prediction)
Negative scores indicate edges that inhibit the prediction (removal increases prediction)
Near-zero scores indicate edges with no measurable impact
- When to use:
When direct causal interpretation is needed
For validating other explanation methods
When computational budget allows exhaustive perturbation testing
For non-differentiable models or when gradients are unreliable
- Batch processing:
The explainer supports multiple observations via the
reductionparameter:reduction='mean'— average attributions across all samplesreduction='sum'— sum attributions across all samplesreduction='none'— return attributions for each sample separately
- Strengths:
Intuitive and direct interpretation
Model-agnostic (works with any architecture)
Provides clear causal insights
No baseline selection required
Supports batch processing for multiple observations
- Weaknesses:
Computationally expensive (scales linearly with number of edges)
May miss interaction effects between edges
Can be unstable for models with sharp decision boundaries
Contrastive Occlusion Explainer
The Contrastive Occlusion explainer extends the occlusion approach to contrastive scenarios by measuring how edge or node removal affects the absolute prediction difference between two inputs. This method identifies elements that specifically contribute to differential predictions.
- What the results indicate:
- Attribution targets:
target='edge'— attribute to individual edges (default)target='node'— attribute to individual nodes
- When to use:
Identifying pathway differences between conditions
Understanding mechanism-specific effects
When gradient-based contrastive methods are not applicable
- Batch processing:
The explainer supports multiple observation pairs via the
reductionparameter:reduction='mean'— average attributions across all pairsreduction='sum'— sum attributions across all pairsreduction='none'— return attributions for each pair separately
- Element masking:
Use the
element_maskparameter to restrict occlusion to a subset of edges or nodes, which can significantly reduce computation time when only certain elements are of interest.- Strengths:
Direct measurement of differential importance
Model-agnostic approach
Clear interpretation for comparative studies
Supports both edge and node level attributions
Supports batch processing and element masking
- Weaknesses:
Computationally intensive (scales with number of elements)
Limited to pairwise comparisons
May miss subtle interaction effects
Optimization-Based Methods
GSNN Explainer
The GSNN explainer learns a sparse binary edge mask that maximizes fidelity to the original prediction while minimizing the number of active edges. Using a differentiable Gumbel-Softmax relaxation, this method identifies the minimal set of edges necessary to reproduce the target prediction.
- What the results indicate:
Scores near 1 indicate edges essential for reproducing the original prediction
Scores near 0 indicate edges that can be removed with minimal impact
The overall mask reveals the minimal sufficient subgraph for the prediction
- When to use:
Identifying core pathways or mechanisms
When sparsity is desired (e.g., for downstream analysis or intervention)
Understanding model redundancy and robustness
Generating simplified explanatory models
- Strengths:
Produces inherently sparse explanations
Balances fidelity with simplicity
Differentiable optimization allows flexible objective functions
Can incorporate domain knowledge through constraints
- Weaknesses:
Requires careful hyperparameter tuning (sparsity vs. fidelity trade-off)
May converge to local optima
Computational overhead for iterative optimization
Binary masks may miss nuanced importance gradients
Contrastive GSNN Explainer
The Contrastive GSNN explainer learns a sparse binary mask (edge or node level) that maximizes fidelity to the prediction difference Δf = f(x₁) - f(x₂) while minimizing the number of active elements. This method identifies the minimal set of elements necessary to reproduce the differential prediction between two inputs.
When given multiple input pairs (batch), the explainer learns a single mask that works well across all pairs simultaneously by treating the differences as a multivariate MSE objective. This is much more efficient than per-sample optimization.
- What the results indicate:
Scores near 1 indicate elements essential for reproducing the prediction difference
Scores near 0 indicate elements that can be removed without affecting the difference
The overall mask reveals the minimal contrastive subgraph
- Attribution targets:
target='edge'— learn edge-level mask (default)target='node'— learn node-level mask
- When to use:
Identifying edges/nodes that drive differential predictions between conditions
When sparse contrastive explanations are needed
Understanding which pathway components are responsible for prediction differences
Comparing multiple pairs efficiently with a single mask
- Hyperparameter tuning:
The
tune()method automatically finds the optimalbetaparameter to balance sparsity and fidelity. It searches for the maximum sparsity while maintaining a minimum fidelity threshold (default 0.9).- Strengths:
Produces sparse contrastive explanations
Efficient batch processing with shared mask optimization
Supports both edge and node level attributions
Automatic hyperparameter tuning via
tune()methodMultivariate MSE objective preserves sign and target dimensions
- Weaknesses:
Requires paired observations
May converge to local optima
Hyperparameter sensitivity (beta, learning rate)
Single mask may not capture sample-specific differences
Counterfactual Explainer
The Counterfactual explainer learns minimal perturbations to input features that achieve a target prediction. Using gradient descent with L2 regularization, this method answers “what is the smallest change needed to reach a desired outcome?” This approach is particularly valuable for understanding model decision boundaries and generating actionable insights.
- What the results indicate:
Positive perturbations indicate features that need to be increased to reach the target
Negative perturbations indicate features that need to be decreased
Near-zero perturbations indicate features irrelevant for achieving the target
The magnitude indicates how much change is needed
- When to use:
Understanding how to achieve desired outcomes (e.g., therapeutic targets)
Identifying minimal interventions
Exploring model decision boundaries
Generating “what-if” scenarios for intervention planning
- Strengths:
Directly actionable insights for intervention
Incorporates minimality constraint naturally
Flexible targeting (specific outputs or full prediction vectors)
Supports feature masking for constrained optimization
- Weaknesses:
May find local rather than global minima
Requires differentiable models
Sensitive to hyperparameter choice (learning rate, weight decay)
Limited to continuous perturbations
Robustness and Stability Methods
Noise Tunnel
The Noise Tunnel method enhances the stability and robustness of gradient-based explainers by running them multiple times with Gaussian noise injected into the edge-mask space, then aggregating the results. This approach is inspired by SmoothGrad but is adapted specifically for GSNN’s edge-based architecture.
- What the results indicate:
Smoothed attribution scores that are more robust to model sensitivity
Confidence intervals through multiple noisy samples
Stable rankings of edge importance less susceptible to noise
- When to use:
When base explainer results are noisy or unstable
For more reliable feature selection based on explanations
When model has sharp gradients or discontinuities
For producing confidence estimates on attributions
- Strengths:
Significantly improves stability of gradient-based methods
Provides uncertainty quantification for explanations
Can be applied to any gradient-based explainer
Helps identify robust vs. artifact attributions
- Weaknesses:
Computationally expensive (multiple runs required)
May over-smooth important sharp transitions
Requires careful noise level selection
Limited to methods that accept noise injection
Choosing the Right Explainer
- For Single Predictions:
Use IG Explainer for theoretically grounded attributions with completeness guarantees
Use Occlusion Explainer for direct, model-agnostic importance measures
Use GSNN Explainer when you need sparse, minimal explanations
Use Counterfactual Explainer for actionable intervention insights
- For Comparative Analysis:
Use Contrastive IG Explainer for principled differential attribution
Use Contrastive Occlusion Explainer for model-agnostic comparative analysis
Use Contrastive GSNN Explainer when you need sparse contrastive explanations
- For Batch Processing:
All explainers support batch processing via the
reductionparameterContrastive GSNN Explainer is particularly efficient for batches (learns single shared mask)
- For Robust Explanations:
Wrap gradient-based methods with Noise Tunnel for stability
Compare results across multiple explainers for validation
- Computational Considerations:
Fastest: IG Explainer
Moderate: GSNN Explainer, Contrastive GSNN Explainer (batch)
Slowest: Occlusion-based methods (scale with graph size)
Note
Best Practice: For critical applications, we recommend using multiple complementary explainers and comparing their results. Convergent findings across different methods provide stronger evidence for interpretation validity.