OpenXAI

Faithfulness

Method	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.42	0.68
SHAP	0.44	0.64
Vanilla Gradient	0.39	0.68
SmoothGrad	0.49	0.66
Integrated Gradient	0.40	0.68
Gradient x Input	0.45	0.62

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
Integrated Gradient	0.62	2.88	6.83
SHAP	1.28	4.46	11.46
Vanilla Gradient	1.35	4.44	11.34
Gradient x Input	-1.90	1.32	8.10
SmoothGrad	2.97	6.17	13.10
LIME	6.35	9.57	16.50

Fairness

Method	Fairness@PGI Computes the absolute difference in PGI scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGU Computes the absolute difference in PGU scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RIS Computes the absolute difference in RIS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@ROS Computes the absolute difference in ROS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RRS Computes the absolute difference in RRS scores between the majority and the minority subgroups. Lower disparities are better.
Gradient x Input	0.001	0.056	0.270	0.093	0.245
Integrated Gradient	0.009	0.056	0.070	0.214	0.137
SmoothGrad	0.035	0.085	0.011	0.321	0.026
Vanilla Gradient	0.045	0.079	0.273	0.093	0.255
SHAP	0.102	0.009	0.021	0.361	0.032
LIME	0.138	0.095	0.013	0.32	0.062

Faithfulness

Method	FA FA computes the fraction of top-K features that are common between a given post hoc explanation and the corresponding ground truth explanation. Higher values are better.	RA RA measures the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also have the same position in the respective rank orders. Higher values are better.	SA SA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same sign in both the explanations. Higher values are better	SRA SRA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same feature attribution sign and rank in both the explanations. Higher values the better.	RC RC computes the Spearman’s rank correlation coefficient to measure the agreement between feature rankings provided by a given explanation and the corresponding ground truth explanation. Higher values the better.	PRA PRA captures if the relative ordering of every pair of features is the same for a given explanation as well as the corresponding ground truth explanation and computes the fraction of feature pairs for which the relative ordering is the same between the two explanations. Higher values the better.	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.938	0.810	0.938	0.814	0.998	0.989	0.156	0.169
SHAP	0.130	0.007	0.112	0.006	-0.053	0.488	0.135	0.180
Vanilla Gradient	0.950	0.950	0.846	0.846	1.000	1.000	0.149	0.174
SmoothGrad	0.950	0.950	0.606	0.606	1.000	1.000	0.202	0.124
Integrated Gradient	0.950	0.950	0.846	0.846	1.000	1.000	0.148	0.173
Gradient x Input	0.785	0.161	0.382	0.071	0.890	0.875	0.171	0.161

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
SHAP	-0.230	10.056	10.056
LIME	-0.698	9.397	9.397
Gradient x Input	-0.906	9.437	9.437
Vanilla Gradient	-1.384	5.241	5.241
Integrated Gradient	-2.004	4.560	4.560
SmoothGrad	-4.780	4.931	4.931

Fairness

Method	Fairness@FA Computes the absolute difference in FA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RA Computes the absolute difference in RA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@SA Computes the absolute difference in SA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@SRA Computes the absolute difference in SRA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RC Computes the absolute difference in RC scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PRA Computes the absolute difference in PRA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGI Computes the absolute difference in PGI scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGU Computes the absolute difference in PGU scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RIS Computes the absolute difference in RIS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@ROS Computes the absolute difference in ROS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RRS Computes the absolute difference in RRS scores between the majority and the minority subgroups. Lower disparities are better.
SmoothGrad	0.018	0.014	0.011	0.014	0.005	0.006	0.045	0.028	0.200	0.104	0.104
Vanilla Gradient	0.019	0.014	0.142	0.081	0.053	0.064	0.016	0.029	0.172	0.014	0.014
Integrated Gradient	0.021	0.016	0.141	0.080	0.053	0.064	0.010	0.032	0.363	0.140	0.140
LIME	0.027	0.017	0.014	0.016	0.011	0.012	0.056	0.051	0.238	0.050	0.050
SHAP	0.050	0.039	0.038	0.039	0.016	0.020	0.012	0.017	0.279	0.067	0.067
Gradient x Input	0.057	0.044	0.042	0.044	0.017	0.022	0.012	0.028	0.219	0.015	0.015

Faithfulness

Method	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.26	0.24
SHAP	0.26	0.23
Vanilla Gradient	0.24	0.26
SmoothGrad	0.33	0.13
Integrated Gradient	0.24	0.26
Gradient x Input	0.25	0.23

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
LIME	1.19	3.33	13.25
SHAP	1.57	3.80	9.40
Gradient x Input	2.46	4.28	9.56
SmoothGrad	-3.11	-0.95	4.60
Integrated Gradient	3.16	5.17	9.53
Vanilla Gradient	4.06	5.86	10.80

Fairness

Fairness evaluation is not applicable for the HELOC dataset as it does not include any protected attributes.

Faithfulness

Method	FA FA computes the fraction of top-K features that are common between a given post hoc explanation and the corresponding ground truth explanation. Higher values are better.	RA RA measures the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also have the same position in the respective rank orders. Higher values are better.	SA SA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same sign in both the explanations. Higher values are better	SRA SRA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same feature attribution sign and rank in both the explanations. Higher values the better.	RC RC computes the Spearman’s rank correlation coefficient to measure the agreement between feature rankings provided by a given explanation and the corresponding ground truth explanation. Higher values the better.	PRA PRA captures if the relative ordering of every pair of features is the same for a given explanation as well as the corresponding ground truth explanation and computes the fraction of feature pairs for which the relative ordering is the same between the two explanations. Higher values the better.	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.932	0.671	0.929	0.670	0.994	0.982	0.156	0.104
SHAP	0.586	0.054	0.269	0.024	0.384	0.645	0.121	0.136
Vanilla Gradient	0.957	0.957	0.469	0.469	1.000	1.000	0.129	0.131
SmoothGrad	0.957	0.957	0.274	0.274	1.000	1.000	0.054	0.177
Integrated Gradient	0.957	0.957	0.469	0.469	1.000	1.000	0.127	0.132
Gradient x Input	0.582	0.049	0.255	0.022	0.390	0.641	0.123	0.135

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
Gradient x Input	1.036	8.964	8.964
Integrated Gradient	1.403	5.776	5.776
SHAP	1.458	9.331	9.331
Vanilla Gradient	2.124	6.336	6.336
SmoothGrad	-3.724	3.507	3.507
LIME	4.534	8.833	8.833

Fairness

Fairness evaluation is not applicable for the HELOC dataset as it does not include any protected attributes.

Faithfulness

Method	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.67	0.26
SHAP	0.53	0.33
Vanilla Gradient	0.29	0.64
SmoothGrad	0.70	0.12
Integrated Gradient	0.29	0.64
Gradient x Input	0.58	0.26

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
SmoothGrad	-1.70	-0.95	6.22
LIME	1.79	2.45	9.77
SHAP	1.98	2.68	9.78
Gradient x Input	3.25	3.08	8.25
Integrated Gradient	4.32	4.33	11.74
Vanilla Gradient	5.01	4.58	9.86

Fairness

Method	Fairness@PGI Computes the absolute difference in PGI scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGU Computes the absolute difference in PGU scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RIS Computes the absolute difference in RIS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@ROS Computes the absolute difference in ROS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RRS Computes the absolute difference in RRS scores between the majority and the minority subgroups. Lower disparities are better.
SmoothGrad	0.006	0.040	0.024	TBD	0.017
LIME	0.024	0.006	0.014	TBD	0.012
Gradient x Input	0.034	0.016	0.021	TBD	0.042
Integrated Gradient	0.036	0.018	0.012	TBD	0.005
Vanilla Gradient	0.037	0.013	0.015	TBD	0.037
SHAP	0.058	0.017	0.011	TBD	0.005

Faithfulness

Method	FA FA computes the fraction of top-K features that are common between a given post hoc explanation and the corresponding ground truth explanation. Higher values are better.	RA RA measures the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also have the same position in the respective rank orders. Higher values are better.	SA SA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same sign in both the explanations. Higher values are better.	SRA SRA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same feature attribution sign and rank in both the explanations. Higher values the better.	RC RC computes the Spearman’s rank correlation coefficient to measure the agreement between feature rankings provided by a given explanation and the corresponding ground truth explanation. Higher values the better.	PRA PRA captures if the relative ordering of every pair of features is the same for a given explanation as well as the corresponding ground truth explanation and computes the fraction of feature pairs for which the relative ordering is the same between the two explanations. Higher values the better.	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.869	0.697	0.858	0.689	0.921	0.913	0.420	0.269
SHAP	0.601	0.105	0.133	0.009	0.379	0.655	0.391	0.205
Vanilla Gradient	0.923	0.921	0.138	0.136	1.000	1.000	0.297	0.391
SmoothGrad	0.923	0.923	0.741	0.741	1.000	1.000	0.485	0.082
Integrated Gradient	0.923	0.923	0.138	0.138	1.000	1.000	0.297	0.392
Gradient x Input	0.567	0.075	0.070	0.003	0.281	0.580	0.395	0.193

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
LIME	1.715	7.899	7.899
SHAP	1.857	7.883	7.883
Gradient x Input	2.307	7.543	7.543
SmoothGrad	-3.007	2.941	2.941
Integrated Gradient	3.492	6.843	6.843
Vanilla Gradient	4.092	6.928	6.928

Fairness

Method	Fairness@FA Computes the absolute difference in FA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RA Computes the absolute difference in RA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@SA Computes the absolute difference in SA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@SRA Computes the absolute difference in SRA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RC Computes the absolute difference in RC scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PRA Computes the absolute difference in PRA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGI Computes the absolute difference in PGI scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGU Computes the absolute difference in PGU scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RIS Computes the absolute difference in RIS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@ROS Computes the absolute difference in ROS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RRS Computes the absolute difference in RRS scores between the majority and the minority subgroups. Lower disparities are better.
Gradient x Input	0.001	0.001	0.002	0.001	0.001	0.001	0.001	0.002	0.020	0.399	0.399
SHAP	0.001	0.000	0.001	0.000	0.000	0.001	0.004	0.001	0.106	0.200	0.200
SmoothGrad	0.002	0.002	0.001	0.001	0.001	0.001	0.003	0.004	0.018	0.102	0.102
Integrated Gradient	0.002	0.002	0.007	0.005	0.004	0.005	0.009	0.004	0.140	0.010	0.010
Vanilla Gradient	0.004	0.003	0.009	0.006	0.005	0.006	0.009	0.006	0.079	0.167	0.167
LIME	0.005	0.003	0.002	0.003	0.001	0.002	0.003	0.006	0.055	0.113	0.113

Faithfulness

Method	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.25	0.35
SHAP	0.32	0.27
Vanilla Gradient	0.31	0.29
SmoothGrad	0.41	0.13
Integrated Gradient	0.31	0.29
Gradient x Input	0.31	0.29

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
SmoothGrad	1.11	-1.07	5.31
LIME	5.53	3.40	9.86
SHAP	5.96	3.95	10.39
Gradient x Input	6.80	3.46	7.22
Vanilla Gradient	7.58	4.24	8.02
Integrated Gradient	8.14	5.53	11.60

Fairness

Fairness evaluation is not applicable for the Synthetic dataset as it does not include any protected attributes.

Faithfulness

Method	FA FA computes the fraction of top-K features that are common between a given post hoc explanation and the corresponding ground truth explanation. Higher values are better.	RA RA measures the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also have the same position in the respective rank orders. Higher values are better.	SA SA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same sign in both the explanations. Higher values are better.	SRA SRA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same feature attribution sign and rank in both the explanations. Higher values the better.	RC RC computes the Spearman’s rank correlation coefficient to measure the agreement between feature rankings provided by a given explanation and the corresponding ground truth explanation. Higher values the better.	PRA PRA captures if the relative ordering of every pair of features is the same for a given explanation as well as the corresponding ground truth explanation and computes the fraction of feature pairs for which the relative ordering is the same between the two explanations. Higher values the better.	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.930	0.763	0.928	0.762	0.995	0.986	0.154	0.180
SHAP	0.775	0.145	0.377	0.068	0.873	0.860	0.171	0.160
Vanilla Gradient	0.950	0.950	0.464	0.464	1.000	1.000	0.171	0.162
SmoothGrad	0.950	0.950	0.456	0.456	1.000	1.000	0.227	0.063
Integrated Gradient	0.950	0.950	0.464	0.464	1.000	1.000	0.171	0.163
Gradient x Input	0.785	0.161	0.382	0.071	0.890	0.875	0.171	0.161

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
Gradient x Input	0.405	3.422	3.422
SmoothGrad	5.249	9.419	9.419
SHAP	5.673	8.751	8.751
Integrated Gradient	5.957	9.022	9.022
Vanilla Gradient	6.133	6.144	6.144
LIME	9.355	13.564	13.564

Fairness

Fairness evaluation is not applicable for the Synthetic dataset as it does not include any protected attributes.

Faithfulness

Method	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.232	0.247
SHAP	0.274	0.194
Vanilla Gradient	0.24	0.24
SmoothGrad	0.324	0.106
Integrated Gradient	0.24	0.238
Gradient x Input	0.254	0.216

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
SmoothGrad	-0.59	-1.39	3.15
Gradient x Input	3.81	3.19	7.16
SHAP	4.24	3.72	8.27
LIME	4.49	3.54	8.36
Integrated Gradient	5.91	5.29	9.27
Vanilla Gradient	6.29	5.54	9.02

Fairness

Method	Fairness@PGI Computes the absolute difference in PGI scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGU Computes the absolute difference in PGU scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RIS Computes the absolute difference in RIS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@ROS Computes the absolute difference in ROS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RRS Computes the absolute difference in RRS scores between the majority and the minority subgroups. Lower disparities are better.
Gradient x Input	0.052	0.082	0.647	0.786	0.392
LIME	0.054	0.083	0.275	0.279	0.062
SHAP	0.066	0.084	0.229	0.182	0.023
Vanilla Gradient	0.076	0.086	0.399	0.663	0.175
Integrated Gradient	0.087	0.085	0.177	0.225	0.090
SmoothGrad	0.119	0.096	0.094	0.090	0.169

Faithfulness

Method	FA FA computes the fraction of top-K features that are common between a given post hoc explanation and the corresponding ground truth explanation. Higher values are better.	RA RA measures the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also have the same position in the respective rank orders. Higher values are better.	SA SA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same sign in both the explanations. Higher values are better.	SRA SRA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same feature attribution sign and rank in both the explanations. Higher values the better.	RC RC computes the Spearman’s rank correlation coefficient to measure the agreement between feature rankings provided by a given explanation and the corresponding ground truth explanation. Higher values the better.	PRA PRA captures if the relative ordering of every pair of features is the same for a given explanation as well as the corresponding ground truth explanation and computes the fraction of feature pairs for which the relative ordering is the same between the two explanations. Higher values the better.	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.838	0.738	0.836	0.735	0.986	0.983	0.108	0.107
SHAP	0.619	0.281	0.527	0.267	0.469	0.692	0.128	0.087
Vanilla Gradient	0.857	0.857	0.790	0.791	1.	1.	0.109	0.105
SmoothGrad	0.857	0.857	0.299	0.299	1.	1.	0.148	0.054
Integrated Gradient	0.857	0.857	0.790	0.791	1.	1.	0.109	0.104
Gradient x Input	0.639	0.286	0.507	0.264	0.535	0.690	0.129	0.086

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
SmoothGrad	-1.10	-2.93	-2.93
Gradient x Input	2.98	1.78	1.78
SHAP	3.12	2.35	2.35
LIME	4.24	2.28	2.28
Integrated Gradient	4.33	1.41	1.41
Vanilla Gradient	4.77	1.82	1.82

Fairness

Method	Fairness@FA Computes the absolute difference in FA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RA Computes the absolute difference in RA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@SA Computes the absolute difference in SA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@SRA Computes the absolute difference in SRA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RC Computes the absolute difference in RC scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PRA Computes the absolute difference in PRA scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGI Computes the absolute difference in PGI scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@PGU Computes the absolute difference in PGU scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RIS Computes the absolute difference in RIS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@ROS Computes the absolute difference in ROS scores between the majority and the minority subgroups. Lower disparities are better.	Fairness@RRS Computes the absolute difference in RRS scores between the majority and the minority subgroups. Lower disparities are better.
SmoothGrad	0.048	0.036	0.024	0.029	0.019	0.023	0.088	0.073	0.036	0.046	0.046
Vanilla Gradient	0.046	0.034	0.011	0.006	0.001	0.002	0.069	0.069	0.068	0.023	0.023
Integrated Gradient	0.046	0.034	0.011	0.006	0.001	0.001	0.067	0.069	0.056	0.008	0.008
LIME	0.045	0.048	0.043	0.052	0.037	0.037	0.063	0.067	0.028	0.011	0.011
SHAP	0.029	0.016	0.060	0.045	0.032	0.026	0.070	0.072	0.058	0.036	0.036
Gradient x Input	0.026	0.019	0.063	0.048	0.035	0.029	0.071	0.069	0.079	0.058	0.058

Faithfulness

Method	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.083	0.181
SHAP	0.113	0.153
Vanilla Gradient	0.079	0.183
SmoothGrad	0.184	0.084
Integrated Gradient	0.073	0.185
Gradient x Input	0.107	0.154

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
LIME	1.25	3.31	7.82
SHAP	1.27	3.06	6.93
Gradient x Input	1.89	3.54	6.67
SmoothGrad	-3.03	-1.02	3.20
Integrated Gradient	3.07	4.76	7.82
Vanilla Gradient	3.91	5.63	8.46

Fairness

Fairness evaluation is not applicable for the Give me some credit dataset as it does not include any protected attributes.

Faithfulness

Method	FA FA computes the fraction of top-K features that are common between a given post hoc explanation and the corresponding ground truth explanation. Higher values are better.	RA RA measures the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also have the same position in the respective rank orders. Higher values are better.	SA SA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same sign in both the explanations. Higher values are better.	SRA SRA computes the fraction of top-K features that are not only common between a given explanation and the corresponding ground truth explanation, but also share the same feature attribution sign and rank in both the explanations. Higher values the better.	RC RC computes the Spearman’s rank correlation coefficient to measure the agreement between feature rankings provided by a given explanation and the corresponding ground truth explanation. Higher values the better.	PRA PRA captures if the relative ordering of every pair of features is the same for a given explanation as well as the corresponding ground truth explanation and computes the fraction of feature pairs for which the relative ordering is the same between the two explanations. Higher values the better.	PGI PGI measures the difference in prediction probability that results from perturbing the features deemed as influential by a given post hoc explanation. Higher values are better.	PGU PGU measures the difference in prediction probability that results from perturbing the features deemed as unimportant by a given post hoc explanation. Lower values are better.
LIME	0.828	0.585	0.828	0.576	0.926	0.911	0.060	0.264
SHAP	0.530	0.155	0.462	0.143	0.156	0.556	0.113	0.225
Vanilla Gradient	0.900	0.900	0.886	0.886	1.000	1.000	0.066	0.259
SmoothGrad	0.900	0.900	0.369	0.369	1.000	1.000	0.277	0.056
Integrated Gradient	0.900	0.900	0.886	0.886	1.000	1.000	0.066	0.259
Gradient x Input	0.553	0.144	0.455	0.138	0.233	0.562	0.098	0.232

Stability

Method	RIS RIS measures the maximum change in explanation relative to changes in the inputs. Values closer to zero are better (log 1 ~ 0).	ROS ROS measures the maximum change in explanation relative to changes in the output prediction probabilities. Values closer to zero are better (log 1 ~ 0).	RRS RRS measures the maximum change in explanation relative to changes in the internal representations learned by the model. Values closer to zero are better (log 1 ~ 0).
SHAP	1.44	0.09	0.09
Gradient x Input	2.06	0.15	0.15
LIME	2.56	1.81	1.81
Integrated Gradient	3.12	0.59	0.59
SmoothGrad	-3.20	-3.56	-3.56
Vanilla Gradient	3.96	1.38	1.38

Fairness

Fairness evaluation is not applicable for the Give me some credit dataset as it does not include any protected attributes.

Leaderboards

Step-by-Step instructions

Explore Leaderboards

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness

Faithfulness

Stability

Fairness