Sum
Checks the sum of all the values in a column against a given expression.
Syntax
Sum
<COL_NAME>
<EXPRESSION>
COL_NAME – The name of the column that you want to evaluate the data quality rule against.
Supported column types: Byte, Decimal, Double, Float, Integer, Long, Short
EXPRESSION – An expression to run against the rule type response in order to produce a Boolean value. For more information, see Expressions.
Example: Sum
The following example rule checks whether the sum of all of the values in a column exceeds a given threshold.
Sum "transaction_total" > 500000 Sum "Salary" < 55600 where "Customer_ID < 10"
Sample dynamic rules
Sum "ColA" > avg(last(10))
Sum "colA" between min(last(10)) - 1 and max(last(10)) + 1
Null behavior
The Sum
rule will ignore rows with NULL
values in the calculation of sum. For example:
+---+-----------+ |id |units | +---+-----------+ |100|0 | |101|null | |102|20 | |103|null | |104|40 | +---+-----------+
The sum of column units
will not consider rows 101 and 103 and result to (0 + 20 + 40) = 60.