Treatment Equality (TE)

The treatment equality (TE) is the difference in the ratio of false negatives to false positives between facets a and d. The main idea of this metric is to assess whether, even if the accuracy across groups is the same, is it the case that errors are more harmful to one group than another? Error rate comes from the total of false positives and false negatives, but the breakdown of these two maybe very different across facets. TE measures whether errors are compensating in the similar or different ways across facets.

The formula for the treatment equality:

TE = FN_d/FP_d - FN_a/FP_a

Where:

FN_d are the false negatives predicted for facet d.
FP_d are the false positives predicted for facet d.
FN_a are the false negatives predicted for facet a.
FP_a are the false positives predicted for facet a.

Note the metric becomes unbounded if FP_a or FP_d is zero.

For example, suppose that there are 100 loan applicants from facet a and 50 from facet d. For facet a, 8 were wrongly denied a loan (FN_a) and another 6 were wrongly approved (FP_a). The remaining predictions were true, so TP_a + TN_a = 86. For facet d, 5 were wrongly denied (FN_d) and 2 were wrongly approved (FP_d). The remaining predictions were true, so TP_d + TN_d = 43. The ratio of false negatives to false positives equals 8/6 = 1.33 for facet a and 5/2 = 2.5 for facet d. Hence TE = 2.5 - 1.33 = 1.167, even though both facets have the same accuracy:

ACC_a = (86)/(86+ 8 + 6) = 0.86

ACC_d = (43)/(43 + 5 + 2) = 0.86

The range of values for differences in conditional rejection for binary and multicategory facet labels is (-∞, +∞). The TE metric is not defined for continuous labels. The interpretation of this metric depends on the relative important of false positives (Type I error) and false negatives (Type II error).

Positive values occur when the ratio of false negatives to false positives for facet d is greater than that for facet a.
Values near zero occur when the ratio of false negatives to false positives for facet a is similar to that for facet d.
Negative values occur when the ratio of false negatives to false positives for facet d is less than that for facet a.

Note

A previous version stated that the Treatment Equality metric is computed as FP_a / FN_a - FP_d / FN_d instead of FN_d / FP_d - FN_a / FP_a. While either of the versions can be used. For more information, see Fairness measures for Machine Learning in Finance.

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Accuracy Difference (AD)

Conditional Demographic Disparity in Predicted Labels (CDDPL)