Checks

This is a list of the checks in DQO, broken down by category, with a brief description of what each check does.

Table checks

accuracy

Compares the tested table with another (reference) table.

Check name Check type Description
profile_total_row_count_match_percent profiling Verifies that the total row count of the tested table matches the total row count of another (reference) table.
daily_total_row_count_match_percent recurring Verifies the total row count of a tested table and compares it to a row count of a reference table. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_total_row_count_match_percent recurring Verifies the total row count of a tested table and compares it to a row count of a reference table. Stores the most recent row count for each month when the data quality check was evaluated.
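
To make the idea of a row count "match percent" concrete, here is a minimal sketch. The function name and the formula are assumptions chosen for illustration; the source does not state how DQO computes the value.

```python
def row_count_match_percent(tested_count: int, reference_count: int) -> float:
    """Hypothetical helper: how closely the tested row count matches the
    reference row count, expressed as a percentage between 0 and 100."""
    if reference_count == 0:
        # Assumption: two empty tables match fully, otherwise there is no match.
        return 100.0 if tested_count == 0 else 0.0
    difference = abs(tested_count - reference_count)
    # Clamp at zero so a very large difference does not yield a negative percent.
    return max(0.0, 100.0 * (reference_count - difference) / reference_count)


if __name__ == "__main__":
    print(row_count_match_percent(950, 1000))
```

A check built on such a value would then compare it against a configured minimum percentage.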

availability

Checks whether the table is accessible and available for use.

Check name Check type Description
profile_table_availability profiling Verifies availability of the table in a database using a simple row count.
daily_table_availability recurring Verifies the availability of the table in the database using a simple row count. Stores the most recent table availability status for each day when the data quality check was evaluated.
monthly_table_availability recurring Verifies the availability of the table in the database using a simple row count. Stores the most recent table availability status for each month when the data quality check was evaluated.

comparisons

Check name Check type Description
profile_row_count_match profiling Verifies that the row count of the tested (parent) table matches the row count of the reference table. Compares each group of data with a GROUP BY clause.
daily_row_count_match recurring Verifies that the row count of the tested (parent) table matches the row count of the reference table. Compares each group of data with a GROUP BY clause. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_row_count_match recurring Verifies that the row count of the tested (parent) table matches the row count of the reference table. Compares each group of data with a GROUP BY clause. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_row_count_match partitioned Verifies that the row count of the tested (parent) table matches the row count of the reference table. Compares each group of data with a GROUP BY clause on the time period (the daily partition) and all other data grouping columns. Stores the most recent captured value for each daily partition that was analyzed.
monthly_partition_row_count_match partitioned Verifies that the row count of the tested (parent) table matches the row count of the reference table, for each monthly partition (grouping rows by the time period, truncated to the month). Compares each group of data with a GROUP BY clause. Stores the most recent captured value for each monthly partition and optionally data groups.
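
The group-by-group comparison described above can be sketched in plain Python. This is only an illustration that works on in-memory rows; the function name, the row shape and the `key` parameter are hypothetical, and in practice the grouping would be done by the database with a GROUP BY clause.

```python
from collections import Counter


def mismatched_groups(tested_rows, reference_rows, key):
    """Count rows per group in both tables and return the groups whose
    row counts differ, as {group: (tested_count, reference_count)}."""
    tested = Counter(key(row) for row in tested_rows)
    reference = Counter(key(row) for row in reference_rows)
    result = {}
    # Look at every group that appears in either table.
    for group in sorted(set(tested) | set(reference)):
        if tested[group] != reference[group]:
            result[group] = (tested[group], reference[group])
    return result


if __name__ == "__main__":
    tested = [{"country": "US"}, {"country": "US"}, {"country": "DE"}]
    reference = [{"country": "US"}, {"country": "US"},
                 {"country": "DE"}, {"country": "DE"}]
    print(mismatched_groups(tested, reference, key=lambda r: r["country"]))
```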

schema

Detects schema drift, such as columns that were added, removed or reordered, or column data types that have changed.

Check name Check type Description
profile_column_count profiling Detects if the number of columns matches an expected number. Retrieves the metadata of the monitored table, counts the number of columns and compares it to an expected value (an expected number of columns).
daily_column_count recurring Detects if the number of columns matches an expected number. Retrieves the metadata of the monitored table, counts the number of columns and compares it to an expected value (an expected number of columns). Stores the most recent column count for each day when the data quality check was evaluated.
monthly_column_count recurring Detects if the number of columns matches an expected number. Retrieves the metadata of the monitored table, counts the number of columns and compares it to an expected value (an expected number of columns). Stores the most recent column count for each month when the data quality check was evaluated.
Check name Check type Description
profile_column_count_changed profiling Detects if the count of columns has changed. Retrieves the metadata of the monitored table, counts the number of columns and compares it to the last known column count that was captured when this data quality check was last executed.
daily_column_count_changed recurring Detects if the count of columns has changed since the most recent day. Retrieves the metadata of the monitored table, counts the number of columns and compares it to the last known column count that was captured when this data quality check was last executed. Stores the most recent column count for each day when the data quality check was evaluated.
monthly_column_count_changed recurring Detects if the count of columns has changed since the last month. Retrieves the metadata of the monitored table, counts the number of columns and compares it to the last known column count that was captured when this data quality check was last executed. Stores the most recent column count for each month when the data quality check was evaluated.
Check name Check type Description
profile_column_list_changed profiling Detects if new columns were added or existing columns were removed. Retrieves the metadata of the monitored table and calculates an unordered hash of the column names. Compares the current hash to the previously known hash to detect any changes to the list of columns.
daily_column_list_changed recurring Detects if new columns were added or existing columns were removed since the most recent day. Retrieves the metadata of the monitored table and calculates an unordered hash of the column names. Compares the current hash to the previously known hash to detect any changes to the list of columns.
monthly_column_list_changed recurring Detects if new columns were added or existing columns were removed since the last month. Retrieves the metadata of the monitored table and calculates an unordered hash of the column names. Compares the current hash to the previously known hash to detect any changes to the list of columns.
Check name Check type Description
profile_column_list_or_order_changed profiling Detects if new columns were added, existing columns were removed or the columns were reordered. Retrieves the metadata of the monitored table and calculates an ordered hash of the column names. Compares the current hash to the previously known hash to detect any changes to the list of columns or their order.
daily_column_list_or_order_changed recurring Detects if new columns were added, existing columns were removed or the columns were reordered since the most recent day. Retrieves the metadata of the monitored table and calculates an ordered hash of the column names. Compares the current hash to the previously known hash to detect any changes to the list of columns or their order.
monthly_column_list_or_order_changed recurring Detects if new columns were added, existing columns were removed or the columns were reordered since the last month. Retrieves the metadata of the monitored table and calculates an ordered hash of the column names. Compares the current hash to the previously known hash to detect any changes to the list of columns or their order.
Check name Check type Description
profile_column_types_changed profiling Detects if new columns were added, removed or their data types have changed. Retrieves the metadata of the monitored table and calculates an unordered hash of the column names and the data types (including the length, scale, precision, nullability). Compares the current hash to the previously known hash to detect any changes to the list of columns or their types.
daily_column_types_changed recurring Detects if new columns were added, removed or their data types have changed since the most recent day. Retrieves the metadata of the monitored table and calculates an unordered hash of the column names and the data types (including the length, scale, precision, nullability). Compares the current hash to the previously known hash to detect any changes to the list of columns or their types.
monthly_column_types_changed recurring Detects if new columns were added, removed or their data types have changed since the last month. Retrieves the metadata of the monitored table and calculates an unordered hash of the column names and the data types (including the length, scale, precision, nullability). Compares the current hash to the previously known hash to detect any changes to the list of columns or their types.
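
The difference between the unordered hash, the ordered hash and the hash that includes data types can be illustrated with a short sketch. The hashing scheme below (SHA-256 over newline-joined names) and all function names are assumptions made for illustration; the source does not say which hash function DQO uses.

```python
import hashlib


def _digest(parts):
    # Join with a separator that is unlikely to appear in column names.
    return hashlib.sha256("\n".join(parts).encode("utf-8")).hexdigest()


def unordered_column_hash(column_names):
    """Sorting first makes the hash insensitive to column order."""
    return _digest(sorted(column_names))


def ordered_column_hash(column_names):
    """Keeping the original order makes the hash change when columns are reordered."""
    return _digest(list(column_names))


def column_types_hash(columns):
    """`columns` is a list of (name, data_type) pairs; order is ignored."""
    return _digest(sorted(f"{name}:{data_type}" for name, data_type in columns))


if __name__ == "__main__":
    before = ["id", "name", "created_at"]
    reordered = ["name", "id", "created_at"]
    print(unordered_column_hash(before) == unordered_column_hash(reordered))
    print(ordered_column_hash(before) == ordered_column_hash(reordered))
```

Comparing the current hash with the previously stored one is then a simple equality test: reordering changes only the ordered hash, while adding or removing a column changes both.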

sql

Validates data against user-defined SQL queries at the table level. Checks in this group verify that a set percentage of rows passed a custom SQL expression or that the result of a custom SQL expression is within a set range.

Check name Check type Description
profile_sql_condition_passed_percent_on_table profiling Verifies that a set percentage of rows passed a custom SQL condition (expression).
daily_sql_condition_passed_percent_on_table recurring Verifies that a set percentage of rows passed a custom SQL condition (expression). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sql_condition_passed_percent_on_table recurring Verifies that a set percentage of rows passed a custom SQL condition (expression). Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sql_condition_passed_percent_on_table partitioned Verifies that a set percentage of rows passed a custom SQL condition (expression). Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sql_condition_passed_percent_on_table partitioned Verifies that a set percentage of rows passed a custom SQL condition (expression). Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_sql_condition_failed_count_on_table profiling Verifies that a set number of rows failed a custom SQL condition (expression).
daily_sql_condition_failed_count_on_table recurring Verifies that a set number of rows failed a custom SQL condition (expression). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sql_condition_failed_count_on_table recurring Verifies that a set number of rows failed a custom SQL condition (expression). Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sql_condition_failed_count_on_table partitioned Verifies that a set number of rows failed a custom SQL condition (expression). Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sql_condition_failed_count_on_table partitioned Verifies that a set number of rows failed a custom SQL condition (expression). Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_sql_aggregate_expr_table profiling Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range.
daily_sql_aggregate_expr_table recurring Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sql_aggregate_expr_table recurring Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sql_aggregate_expr_table partitioned Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sql_aggregate_expr_table partitioned Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
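
As a rough sketch of what the SQL condition checks measure, the example below computes a passed percentage and a failed count with SQLite from the Python standard library. The function names, the `orders` table and the condition are hypothetical, and the queries are an assumption about the general shape of such a measurement, not the SQL that DQO generates.

```python
import sqlite3


def passed_percent(conn, table, condition):
    """Percentage of rows in `table` for which `condition` is true.

    `table` and `condition` are interpolated into the SQL text, so they must
    come from a trusted source. Returns None for an empty table.
    """
    query = (
        f"SELECT 100.0 * SUM(CASE WHEN {condition} THEN 1 ELSE 0 END) / COUNT(*) "
        f"FROM {table}"
    )
    (value,) = conn.execute(query).fetchone()
    return value


def failed_count(conn, table, condition):
    """Number of rows for which `condition` is not true (NULL counts as failed)."""
    query = (
        f"SELECT COALESCE(SUM(CASE WHEN {condition} THEN 0 ELSE 1 END), 0) "
        f"FROM {table}"
    )
    (value,) = conn.execute(query).fetchone()
    return value


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
    conn.executemany("INSERT INTO orders VALUES (?, ?)",
                     [(1, 10.0), (2, 25.5), (3, -4.0), (4, 7.0)])
    print(passed_percent(conn, "orders", "amount >= 0"))
    print(failed_count(conn, "orders", "amount >= 0"))
```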

timeliness

Assesses the freshness and staleness of data, as well as data ingestion delay and reload lag for partitioned data.

Check name Check type Description
profile_data_freshness profiling Calculates the number of days since the most recent event timestamp (freshness).
daily_data_freshness recurring Daily recurring check calculating the number of days since the most recent event timestamp (freshness).
monthly_data_freshness recurring Monthly recurring check calculating the number of days since the most recent event timestamp (freshness).
daily_partition_data_freshness partitioned Daily partitioned check calculating the number of days since the most recent event timestamp (freshness).
monthly_partition_data_freshness partitioned Monthly partitioned check calculating the number of days since the most recent event timestamp (freshness).
Check name Check type Description
profile_data_staleness profiling Calculates the time difference in days between the current date and the most recent data ingestion timestamp (staleness).
daily_data_staleness recurring Daily recurring check calculating the time difference in days between the current date and the most recent data ingestion timestamp (staleness).
monthly_data_staleness recurring Monthly recurring check calculating the time difference in days between the current date and the most recent data ingestion timestamp (staleness).
daily_partition_data_staleness partitioned Daily partitioned check calculating the time difference in days between the current date and the most recent data ingestion timestamp (staleness).
monthly_partition_data_staleness partitioned Monthly partitioned check calculating the time difference in days between the current date and the most recent data ingestion timestamp (staleness).
Check name Check type Description
profile_data_ingestion_delay profiling Calculates the time difference in days between the most recent event timestamp and the most recent ingestion timestamp.
daily_data_ingestion_delay recurring Daily recurring check calculating the time difference in days between the most recent event timestamp and the most recent ingestion timestamp.
monthly_data_ingestion_delay recurring Monthly recurring check calculating the time difference in days between the most recent event timestamp and the most recent ingestion timestamp.
daily_partition_data_ingestion_delay partitioned Daily partitioned check calculating the time difference in days between the most recent event timestamp and the most recent ingestion timestamp.
monthly_partition_data_ingestion_delay partitioned Monthly partitioned check calculating the time difference in days between the most recent event timestamp and the most recent ingestion timestamp.
Check name Check type Description
daily_partition_reload_lag partitioned Daily partitioned check calculating the longest time a row waited to be loaded.
monthly_partition_reload_lag partitioned Monthly partitioned check calculating the longest time a row waited to be loaded.
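
The four timeliness measures can be compared side by side in a small sketch. It assumes each row carries an event timestamp and an ingestion timestamp; the function names are hypothetical, and reading reload lag as the largest per-row gap between ingestion and event time is an interpretation of the description above rather than a confirmed definition.

```python
from datetime import datetime

SECONDS_PER_DAY = 86400.0


def days_between(earlier, later):
    """Time difference in (fractional) days."""
    return (later - earlier).total_seconds() / SECONDS_PER_DAY


def timeliness(rows, now):
    """`rows` is a non-empty list of (event_timestamp, ingestion_timestamp) pairs."""
    latest_event = max(event for event, _ in rows)
    latest_ingestion = max(ingested for _, ingested in rows)
    return {
        # Days since the most recent event.
        "freshness": days_between(latest_event, now),
        # Days since the most recent load.
        "staleness": days_between(latest_ingestion, now),
        # Gap between the most recent event and the most recent load.
        "ingestion_delay": days_between(latest_event, latest_ingestion),
        # Assumed meaning: the longest any single row waited to be loaded.
        "reload_lag": max(days_between(event, ingested) for event, ingested in rows),
    }


if __name__ == "__main__":
    rows = [
        (datetime(2023, 1, 1), datetime(2023, 1, 2)),
        (datetime(2023, 1, 3), datetime(2023, 1, 6)),
    ]
    print(timeliness(rows, now=datetime(2023, 1, 8)))
```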

volume

Evaluates the overall quality of the table by verifying the number of rows.

Check name Check type Description
profile_row_count profiling Verifies that the number of rows in a table is not lower than the minimum accepted count.
daily_row_count recurring Verifies that the number of rows in a table is not lower than the minimum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_row_count recurring Verifies that the number of rows in a table is not lower than the minimum accepted count. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_row_count partitioned Verifies that the number of rows in a table is not lower than the minimum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_min_row_count partitioned Verifies that the number of rows in a table is not lower than the minimum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_row_count_anomaly_differencing_30_days profiling Verifies that the total row count of the tested table changes at a rate within a percentile boundary during the last 30 days.
daily_row_count_anomaly_differencing_30_days recurring Verifies that the total row count of the tested table changes at a rate within a percentile boundary during the last 30 days.
Check name Check type Description
profile_row_count_anomaly_differencing profiling Verifies that the total row count of the tested table changes at a rate within a percentile boundary during the last 90 days.
daily_row_count_anomaly_differencing recurring Verifies that the total row count of the tested table changes at a rate within a percentile boundary during the last 90 days.
Check name Check type Description
profile_row_count_change profiling Verifies that the total row count of the tested table has changed by a fixed rate since the last readout.
daily_row_count_change recurring Verifies that the total row count of the tested table has changed by a fixed rate since the last readout.
monthly_row_count_change recurring Verifies that the total row count of the tested table has changed by a fixed rate since the last readout.
daily_partition_row_count_change partitioned Verifies that the total row count of the tested table has changed by a fixed rate since the last readout.
monthly_partition_row_count_change partitioned Verifies that the total row count of the tested table has changed by a fixed rate since the last readout.
Check name Check type Description
profile_row_count_change_yesterday profiling Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from yesterday. Supports either an exact match to the readout from yesterday or a lookup of earlier readouts.
daily_row_count_change_yesterday recurring Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from yesterday. Supports either an exact match to the readout from yesterday or a lookup of earlier readouts.
daily_partition_row_count_change_yesterday partitioned Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from yesterday. Supports either an exact match to the readout from yesterday or a lookup of earlier readouts.
Check name Check type Description
profile_row_count_change_7_days profiling Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from last week. Supports either an exact match to the readout from 7 days ago or a lookup of earlier readouts.
daily_row_count_change_7_days recurring Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from last week. Supports either an exact match to the readout from 7 days ago or a lookup of earlier readouts.
daily_partition_row_count_change_7_days partitioned Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from last week. Supports either an exact match to the readout from 7 days ago or a lookup of earlier readouts.
Check name Check type Description
profile_row_count_change_30_days profiling Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from last month. Supports either an exact match to the readout from 30 days ago or a lookup of earlier readouts.
daily_row_count_change_30_days recurring Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from last month. Supports either an exact match to the readout from 30 days ago or a lookup of earlier readouts.
daily_partition_row_count_change_30_days partitioned Verifies that the total row count of the tested table has changed by a fixed rate since the last readout from last month. Supports either an exact match to the readout from 30 days ago or a lookup of earlier readouts.
Check name Check type Description
daily_partition_row_count_anomaly_stationary_30_days partitioned Verifies that the total row count of the tested table is within a percentile from measurements made during the last 30 days.
Check name Check type Description
daily_partition_row_count_anomaly_stationary partitioned Verifies that the total row count of the tested table is within a percentile from measurements made during the last 90 days.
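
The row count change checks compare the current readout with an earlier one. A minimal sketch of that comparison is shown below; the function names and the `max_percent` threshold parameter are hypothetical, and the relative-change formula is an assumption about what "changed by a fixed rate" means.

```python
def change_percent(previous, current):
    """Relative change between two readouts, as a non-negative percentage.

    Returns None when the previous readout is zero, because the relative
    change is undefined in that case.
    """
    if previous == 0:
        return None
    return 100.0 * abs(current - previous) / abs(previous)


def change_within_limit(previous, current, max_percent):
    """True when the change since the previous readout stays within `max_percent`."""
    percent = change_percent(previous, current)
    return percent is not None and percent <= max_percent


if __name__ == "__main__":
    print(change_percent(1000, 1100))
    print(change_within_limit(1000, 1100, max_percent=5.0))
```

The `_yesterday`, `_7_days` and `_30_days` variants would differ only in which earlier readout is passed as `previous`.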

Column checks

accuracy

Check name Check type Description
profile_total_sum_match_percent profiling Verifies that the percentage of difference in total sum of a column in a table and total sum of a column of another table does not exceed the set number.
daily_total_sum_match_percent recurring Verifies that the percentage of difference in total sum of a column in a table and total sum of a column of another table does not exceed the set number. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_total_sum_match_percent recurring Verifies that the percentage of difference in total sum of a column in a table and total sum of a column of another table does not exceed the set number. Stores the most recent captured value for each month when the data quality check was evaluated.
Check name Check type Description
profile_total_min_match_percent profiling Verifies that the percentage of difference in total min of a column in a table and total min of a column of another table does not exceed the set number.
daily_total_min_match_percent recurring Verifies that the percentage of difference in total min of a column in a table and total min of a column of another table does not exceed the set number. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_total_min_match_percent recurring Verifies that the percentage of difference in total min of a column in a table and total min of a column of another table does not exceed the set number. Stores the most recent captured value for each month when the data quality check was evaluated.
Check name Check type Description
profile_total_max_match_percent profiling Verifies that the percentage of difference in total max of a column in a table and total max of a column of another table does not exceed the set number.
daily_total_max_match_percent recurring Verifies that the percentage of difference in total max of a column in a table and total max of a column of another table does not exceed the set number. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_total_max_match_percent recurring Verifies that the percentage of difference in total max of a column in a table and total max of a column of another table does not exceed the set number. Stores the most recent captured value for each month when the data quality check was evaluated.
Check name Check type Description
profile_total_average_match_percent profiling Verifies that the percentage of difference in total average of a column in a table and total average of a column of another table does not exceed the set number.
daily_total_average_match_percent recurring Verifies that the percentage of difference in total average of a column in a table and total average of a column of another table does not exceed the set number. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_total_average_match_percent recurring Verifies that the percentage of difference in total average of a column in a table and total average of a column of another table does not exceed the set number. Stores the most recent captured value for each month when the data quality check was evaluated.
Check name Check type Description
profile_total_not_null_count_match_percent profiling Verifies that the percentage of difference in total not null count of a column in a table and total not null count of a column of another table does not exceed the set number.
daily_total_not_null_count_match_percent recurring Verifies that the percentage of difference in total not null count of a column in a table and total not null count of a column of another table does not exceed the set number. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_total_not_null_count_match_percent recurring Verifies that the percentage of difference in total not null count of a column in a table and total not null count of a column of another table does not exceed the set number. Stores the most recent captured value for each month when the data quality check was evaluated.

anomaly

Detects anomalous (unexpected) changes and outliers in the time series of data quality results collected over a period of time.

Check name Check type Description
profile_mean_anomaly_stationary_30_days profiling Verifies that the mean value in a column changes at a rate within a percentile boundary during the last 30 days.
daily_mean_anomaly_stationary_30_days recurring Verifies that the mean value in a column changes at a rate within a percentile boundary during the last 30 days.
daily_partition_mean_anomaly_stationary_30_days partitioned Verifies that the mean value in a column is within a percentile from measurements made during the last 30 days.
Check name Check type Description
profile_mean_anomaly_stationary profiling Verifies that the mean value in a column changes at a rate within a percentile boundary during the last 90 days.
daily_mean_anomaly_stationary recurring Verifies that the mean value in a column changes at a rate within a percentile boundary during the last 90 days.
daily_partition_mean_anomaly_stationary partitioned Verifies that the mean value in a column is within a percentile from measurements made during the last 90 days.
Check name Check type Description
profile_median_anomaly_stationary_30_days profiling Verifies that the median in a column changes at a rate within a percentile boundary during the last 30 days.
daily_median_anomaly_stationary_30_days recurring Verifies that the median in a column changes at a rate within a percentile boundary during the last 30 days.
daily_partition_median_anomaly_stationary_30_days partitioned Verifies that the median in a column is within a percentile from measurements made during the last 30 days.
Check name Check type Description
profile_median_anomaly_stationary profiling Verifies that the median in a column changes at a rate within a percentile boundary during the last 90 days.
daily_median_anomaly_stationary recurring Verifies that the median in a column changes at a rate within a percentile boundary during the last 90 days.
daily_partition_median_anomaly_stationary partitioned Verifies that the median in a column is within a percentile from measurements made during the last 90 days.
Check name Check type Description
profile_sum_anomaly_differencing_30_days profiling Verifies that the sum in a column changes at a rate within a percentile boundary during the last 30 days.
daily_sum_anomaly_differencing_30_days recurring Verifies that the sum in a column changes at a rate within a percentile boundary during the last 30 days.
Check name Check type Description
profile_sum_anomaly_differencing profiling Verifies that the sum in a column changes at a rate within a percentile boundary during the last 90 days.
daily_sum_anomaly_differencing recurring Verifies that the sum in a column changes at a rate within a percentile boundary during the last 90 days.
Check name Check type Description
profile_mean_change profiling Verifies that the mean value in a column changed at a fixed rate since the last readout.
daily_mean_change recurring Verifies that the mean value in a column changed at a fixed rate since the last readout.
monthly_mean_change recurring Verifies that the mean value in a column changed at a fixed rate since the last readout.
daily_partition_mean_change partitioned Verifies that the mean value in a column changed at a fixed rate since the last readout.
monthly_partition_mean_change partitioned Verifies that the mean value in a column changed at a fixed rate since the last readout.
Check name Check type Description
profile_mean_change_yesterday profiling Verifies that the mean value in a column changed at a fixed rate since the last readout from yesterday.
daily_mean_change_yesterday recurring Verifies that the mean value in a column changed at a fixed rate since the last readout from yesterday.
daily_partition_mean_change_yesterday partitioned Verifies that the mean value in a column changed at a fixed rate since the last readout from yesterday.
Check name Check type Description
profile_mean_change_7_days profiling Verifies that the mean value in a column changed at a fixed rate since the last readout from last week.
daily_mean_change_7_days recurring Verifies that the mean value in a column changed at a fixed rate since the last readout from last week.
daily_partition_mean_change_7_days partitioned Verifies that the mean value in a column changed at a fixed rate since the last readout from last week.
Check name Check type Description
profile_mean_change_30_days profiling Verifies that the mean value in a column changed at a fixed rate since the last readout from last month.
daily_mean_change_30_days recurring Verifies that the mean value in a column changed at a fixed rate since the last readout from last month.
daily_partition_mean_change_30_days partitioned Verifies that the mean value in a column changed at a fixed rate since the last readout from last month.
Check name Check type Description
profile_median_change profiling Verifies that the median in a column changed at a fixed rate since the last readout.
daily_median_change recurring Verifies that the median in a column changed at a fixed rate since the last readout.
monthly_median_change recurring Verifies that the median in a column changed at a fixed rate since the last readout.
daily_partition_median_change partitioned Verifies that the median in a column changed at a fixed rate since the last readout.
monthly_partition_median_change partitioned Verifies that the median in a column changed at a fixed rate since the last readout.
Check name Check type Description
profile_median_change_yesterday profiling Verifies that the median in a column changed at a fixed rate since the last readout from yesterday.
daily_median_change_yesterday recurring Verifies that the median in a column changed at a fixed rate since the last readout from yesterday.
daily_partition_median_change_yesterday partitioned Verifies that the median in a column changed at a fixed rate since the last readout from yesterday.
Check name Check type Description
profile_median_change_7_days profiling Verifies that the median in a column changed at a fixed rate since the last readout from last week.
daily_median_change_7_days recurring Verifies that the median in a column changed at a fixed rate since the last readout from last week.
daily_partition_median_change_7_days partitioned Verifies that the median in a column changed at a fixed rate since the last readout from last week.
Check name Check type Description
profile_median_change_30_days profiling Verifies that the median in a column changed at a fixed rate since the last readout from last month.
daily_median_change_30_days recurring Verifies that the median in a column changed at a fixed rate since the last readout from last month.
daily_partition_median_change_30_days partitioned Verifies that the median in a column changed at a fixed rate since the last readout from last month.
Check name Check type Description
profile_sum_change profiling Verifies that the sum in a column changed at a fixed rate since the last readout.
daily_sum_change recurring Verifies that the sum in a column changed at a fixed rate since the last readout.
monthly_sum_change recurring Verifies that the sum in a column changed at a fixed rate since the last readout.
daily_partition_sum_change partitioned Verifies that the sum in a column changed at a fixed rate since the last readout.
monthly_partition_sum_change partitioned Verifies that the sum in a column changed at a fixed rate since the last readout.
Check name Check type Description
profile_sum_change_yesterday profiling Verifies that the sum in a column changed at a fixed rate since the last readout from yesterday.
daily_sum_change_yesterday recurring Verifies that the sum in a column changed at a fixed rate since the last readout from yesterday.
daily_partition_sum_change_yesterday partitioned Verifies that the sum in a column changed at a fixed rate since the last readout from yesterday.
Check name Check type Description
profile_sum_change_7_days profiling Verifies that the sum in a column changed at a fixed rate since the last readout from last week.
daily_sum_change_7_days recurring Verifies that the sum in a column changed at a fixed rate since the last readout from last week.
daily_partition_sum_change_7_days partitioned Verifies that the sum in a column changed at a fixed rate since the last readout from last week.
Check name Check type Description
profile_sum_change_30_days profiling Verifies that the sum in a column changed at a fixed rate since the last readout from last month.
daily_sum_change_30_days recurring Verifies that the sum in a column changed at a fixed rate since the last readout from last month.
daily_partition_sum_change_30_days partitioned Verifies that the sum in a column changed at a fixed rate since the last readout from last month.
Check name Check type Description
daily_partition_sum_anomaly_stationary_30_days partitioned Verifies that the sum in a column is within a percentile from measurements made during the last 30 days.
Check name Check type Description
daily_partition_sum_anomaly_stationary partitioned Verifies that the sum in a column is within a percentile from measurements made during the last 90 days.
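
The change checks listed above compare the current readout of an aggregate (such as the median or the sum) with an earlier readout. The following is a minimal sketch of that comparison, assuming the rule passes when the relative change stays at or below a configured limit. The function names and the `max_percent` parameter are hypothetical and are not taken from DQO.

```python
def percent_change(previous: float, current: float) -> float:
    """Return the absolute change from previous to current, in percent."""
    if previous == 0:
        raise ValueError("relative change is undefined when the previous readout is 0")
    return abs(current - previous) * 100.0 / abs(previous)


def change_within_limit(previous: float, current: float, max_percent: float) -> bool:
    """Hypothetical rule: pass when the change does not exceed max_percent."""
    return percent_change(previous, current) <= max_percent


# Example: a sum that moved from 200 to 210 changed by 5 percent.
print(percent_change(200.0, 210.0))
print(change_within_limit(200.0, 210.0, max_percent=10.0))
```

For the `_yesterday`, `_7_days` and `_30_days` variants, `previous` would be the readout captured that long ago instead of the most recent one.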

bool

Calculates the percentage of data in a Boolean format.

Check name Check type Description
profile_true_percent profiling Verifies that the percentage of true values in a column does not fall below the minimum accepted percentage.
daily_true_percent recurring Verifies that the percentage of true values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_true_percent recurring Verifies that the percentage of true values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_true_percent partitioned Verifies that the percentage of true values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_true_percent partitioned Verifies that the percentage of true values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_false_percent profiling Verifies that the percentage of false values in a column does not fall below the minimum accepted percentage.
daily_false_percent recurring Verifies that the percentage of false values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_false_percent recurring Verifies that the percentage of false values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_false_percent partitioned Verifies that the percentage of false values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_false_percent partitioned Verifies that the percentage of false values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
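
As an illustration of what the bool checks measure, here is a minimal sketch of a true-percent calculation and a minimum-percent rule. It assumes that null values count toward the total number of rows; whether DQO treats nulls this way is not stated in this list, and the function names are hypothetical.

```python
from typing import Iterable, Optional


def true_percent(values: Iterable[Optional[bool]]) -> float:
    """Percentage of rows whose value is True (assumption: nulls count as rows)."""
    rows = list(values)
    if not rows:
        return 0.0
    true_count = sum(1 for value in rows if value is True)
    return 100.0 * true_count / len(rows)


def passes_min_percent(actual_percent: float, min_percent: float) -> bool:
    """Hypothetical rule: pass when the measured percentage reaches the minimum."""
    return actual_percent >= min_percent


column = [True, False, True, None]
print(true_percent(column))
print(passes_min_percent(true_percent(column), min_percent=40.0))
```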

comparisons

Check name Check type Description
profile_sum_match profiling Verifies that the percentage difference between the sum of values in a tested column in the parent table and the sum of values in a column in the reference table is below the defined percentage thresholds.
daily_sum_match recurring Verifies that the percentage difference between the sum of values in a tested column in the parent table and the sum of values in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sum_match recurring Verifies that the percentage difference between the sum of values in a tested column in the parent table and the sum of values in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sum_match partitioned Verifies that the percentage difference between the sum of values in a tested column in the parent table and the sum of values in a column in the reference table is below the defined percentage thresholds. Compares each daily partition (each day of data) between the compared table and the reference table (the source of truth).
monthly_partition_sum_match partitioned Verifies that the percentage difference between the sum of values in a tested column in the parent table and the sum of values in a column in the reference table is below the defined percentage thresholds. Compares each monthly partition (each month of data) between the compared table and the reference table (the source of truth).
Check name Check type Description
profile_min_match profiling Verifies that the percentage difference between the minimum value in a tested column in the parent table and the minimum value in a column in the reference table is below the defined percentage thresholds.
daily_min_match recurring Verifies that the percentage difference between the minimum value in a tested column in the parent table and the minimum value in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_min_match recurring Verifies that the percentage difference between the minimum value in a tested column in the parent table and the minimum value in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_min_match partitioned Verifies that the percentage difference between the minimum value in a tested column in the parent table and the minimum value in a column in the reference table is below the defined percentage thresholds. Compares each daily partition (each day of data) between the compared table and the reference table (the source of truth).
monthly_partition_min_match partitioned Verifies that the percentage difference between the minimum value in a tested column in the parent table and the minimum value in a column in the reference table is below the defined percentage thresholds. Compares each monthly partition (each month of data) between the compared table and the reference table (the source of truth).
Check name Check type Description
profile_max_match profiling Verifies that the percentage difference between the maximum value in a tested column in the parent table and the maximum value in a column in the reference table is below the defined percentage thresholds.
daily_max_match recurring Verifies that the percentage difference between the maximum value in a tested column in the parent table and the maximum value in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_max_match recurring Verifies that the percentage difference between the maximum value in a tested column in the parent table and the maximum value in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_max_match partitioned Verifies that the percentage difference between the maximum value in a tested column in the parent table and the maximum value in a column in the reference table is below the defined percentage thresholds. Compares each daily partition (each day of data) between the compared table and the reference table (the source of truth).
monthly_partition_max_match partitioned Verifies that the percentage difference between the maximum value in a tested column in the parent table and the maximum value in a column in the reference table is below the defined percentage thresholds. Compares each monthly partition (each month of data) between the compared table and the reference table (the source of truth).
Check name Check type Description
profile_mean_match profiling Verifies that the percentage difference between the mean (average) value in a tested column in the parent table and the mean (average) value in a column in the reference table is below the defined percentage thresholds.
daily_mean_match recurring Verifies that the percentage difference between the mean (average) value in a tested column in the parent table and the mean (average) value in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_mean_match recurring Verifies that the percentage difference between the mean (average) value in a tested column in the parent table and the mean (average) value in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_mean_match partitioned Verifies that the percentage difference between the mean (average) value in a tested column in the parent table and the mean (average) value in a column in the reference table is below the defined percentage thresholds. Compares each daily partition (each day of data) between the compared table and the reference table (the source of truth).
monthly_partition_mean_match partitioned Verifies that the percentage difference between the mean (average) value in a tested column in the parent table and the mean (average) value in a column in the reference table is below the defined percentage thresholds. Compares each monthly partition (each month of data) between the compared table and the reference table (the source of truth).
Check name Check type Description
profile_not_null_count_match profiling Verifies that the percentage difference between the count of not null values in a tested column in the parent table and the count of not null values in a column in the reference table is below the defined percentage thresholds.
daily_not_null_count_match recurring Verifies that the percentage difference between the count of not null values in a tested column in the parent table and the count of not null values in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_not_null_count_match recurring Verifies that the percentage difference between the count of not null values in a tested column in the parent table and the count of not null values in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_not_null_count_match partitioned Verifies that the percentage difference between the count of not null values in a tested column in the parent table and the count of not null values in a column in the reference table is below the defined percentage thresholds. Compares each daily partition (each day of data) between the compared table and the reference table (the source of truth).
monthly_partition_not_null_count_match partitioned Verifies that the percentage difference between the count of not null values in a tested column in the parent table and the count of not null values in a column in the reference table is below the defined percentage thresholds. Compares each monthly partition (each month of data) between the compared table and the reference table (the source of truth).
Check name Check type Description
profile_null_count_match profiling Verifies that the percentage difference between the count of null values in a tested column in the parent table and the count of null values in a column in the reference table is below the defined percentage thresholds.
daily_null_count_match recurring Verifies that the percentage difference between the count of null values in a tested column in the parent table and the count of null values in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_null_count_match recurring Verifies that the percentage difference between the count of null values in a tested column in the parent table and the count of null values in a column in the reference table is below the defined percentage thresholds. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_null_count_match partitioned Verifies that the percentage difference between the count of null values in a tested column in the parent table and the count of null values in a column in the reference table is below the defined percentage thresholds. Compares each daily partition (each day of data) between the compared table and the reference table (the source of truth).
monthly_partition_null_count_match partitioned Verifies that the percentage difference between the count of null values in a tested column in the parent table and the count of null values in a column in the reference table is below the defined percentage thresholds. Compares each monthly partition (each month of data) between the compared table and the reference table (the source of truth).
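
The comparison checks above all follow the same pattern: compute an aggregate on the tested column and on the reference column, then compare the percentage difference against a threshold. A minimal sketch of that pattern, assuming the difference is measured relative to the reference value; the function names, the zero-reference handling, and the `max_diff_percent` parameter are hypothetical:

```python
def difference_percent(tested: float, reference: float) -> float:
    """Absolute difference between two aggregates, as a percentage of the reference."""
    if reference == 0:
        # Assumption: two zero aggregates match; any other value is a full mismatch.
        return 0.0 if tested == 0 else 100.0
    return abs(tested - reference) * 100.0 / abs(reference)


def aggregates_match(tested: float, reference: float, max_diff_percent: float) -> bool:
    """Hypothetical rule: pass when the difference stays within the threshold."""
    return difference_percent(tested, reference) <= max_diff_percent


# Example: a tested sum of 95 against a reference sum of 100 differs by 5 percent.
print(difference_percent(95.0, 100.0))
print(aggregates_match(95.0, 100.0, max_diff_percent=1.0))
```

The same two functions apply to sums, minimums, maximums, means and null counts; only the aggregate that produces `tested` and `reference` changes.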

datatype

Check name Check type Description
profile_date_match_format_percent profiling Verifies that the percentage of date values matching the given format in a column does not fall below the minimum accepted percentage.
daily_date_match_format_percent recurring Verifies that the percentage of date values matching the given format in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_date_match_format_percent recurring Verifies that the percentage of date values matching the given format in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_date_match_format_percent partitioned Verifies that the percentage of date values matching the given format in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_date_match_format_percent partitioned Verifies that the percentage of date values matching the given format in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_datatype_changed profiling Detects that the data type of texts stored in a text column has changed since the last verification. The sensor returns the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types.
daily_string_datatype_changed recurring Detects that the data type of texts stored in a text column has changed since the last verification. The sensor returns the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_datatype_changed recurring Detects that the data type of texts stored in a text column has changed since the last verification. The sensor returns the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_datatype_changed partitioned Detects that the data type of texts stored in a text column has changed when compared to an earlier not empty partition. The sensor returns the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_datatype_changed partitioned Detects that the data type of texts stored in a text column has changed when compared to an earlier not empty partition. The sensor returns the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Creates a separate data quality check (and an alert) for each monthly partition.
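
The numeric codes returned by the string datatype sensor can be illustrated with a small classifier. This is only a sketch under stated assumptions: it recognizes ISO 8601 dates and timestamps and the literals `true`/`false`, and it reports any combination of different types (including integers with floats) as mixed. The actual detection rules used by DQO are not described in this list, and all names below are hypothetical.

```python
from datetime import date, datetime
from typing import Iterable

# Codes as listed in the check descriptions above.
INTEGERS, FLOATS, DATES, TIMESTAMPS, BOOLEANS, STRINGS, MIXED = range(1, 8)


def classify_text(text: str) -> int:
    """Return the code of the narrowest type that can parse a single text value."""
    value = text.strip()
    # int is tried before float because float() also accepts integer literals.
    parsers = (
        (INTEGERS, int),
        (FLOATS, float),
        (DATES, date.fromisoformat),
        (TIMESTAMPS, datetime.fromisoformat),
    )
    for code, parser in parsers:
        try:
            parser(value)
            return code
        except ValueError:
            continue
    if value.lower() in ("true", "false"):
        return BOOLEANS
    return STRINGS


def detect_datatype(values: Iterable[str]) -> int:
    """Return one code for the whole column, or MIXED when values disagree."""
    codes = {classify_text(value) for value in values}
    if not codes:
        raise ValueError("no values to classify")
    return codes.pop() if len(codes) == 1 else MIXED


print(detect_datatype(["2023-01-01", "2023-02-15"]))
print(detect_datatype(["1", "abc"]))
```

A "datatype changed" check would then compare the code detected now with the code stored from the previous run or from an earlier partition.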

datetime

Validates that the data in a date or time column is in the expected format and within predefined ranges.

Check name Check type Description
profile_date_values_in_future_percent profiling Verifies that the percentage of date values in future in a column does not exceed the maximum accepted percentage.
daily_date_values_in_future_percent recurring Verifies that the percentage of date values in future in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_date_values_in_future_percent recurring Verifies that the percentage of date values in future in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_date_values_in_future_percent partitioned Verifies that the percentage of date values in future in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_date_values_in_future_percent partitioned Verifies that the percentage of date values in future in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_datetime_value_in_range_date_percent profiling Verifies that the percentage of date values in the range defined by the user in a column does not exceed the maximum accepted percentage.
daily_datetime_value_in_range_date_percent recurring Verifies that the percentage of date values in the range defined by the user in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_datetime_value_in_range_date_percent recurring Verifies that the percentage of date values in the range defined by the user in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_datetime_value_in_range_date_percent partitioned Verifies that the percentage of date values in the range defined by the user in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_datetime_value_in_range_date_percent partitioned Verifies that the percentage of date values in the range defined by the user in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.

integrity

Checks the referential integrity of a column against a column in another table.

Check name Check type Description
profile_foreign_key_not_match_count profiling Verifies that the number of values in a column that do not match values in another table column does not exceed the set count.
daily_foreign_key_not_match_count recurring Verifies that the number of values in a column that do not match values in another table column does not exceed the set count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_foreign_key_not_match_count recurring Verifies that the number of values in a column that do not match values in another table column does not exceed the set count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_foreign_key_not_match_count partitioned Verifies that the number of values in a column that do not match values in another table column does not exceed the set count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_foreign_key_not_match_count partitioned Verifies that the number of values in a column that do not match values in another table column does not exceed the set count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_foreign_key_match_percent profiling Verifies that the percentage of values in a column that match values in another table column does not fall below the minimum accepted percentage.
daily_foreign_key_match_percent recurring Verifies that the percentage of values in a column that match values in another table column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_foreign_key_match_percent recurring Verifies that the percentage of values in a column that match values in another table column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_foreign_key_match_percent partitioned Verifies that the percentage of values in a column that match values in another table column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_foreign_key_match_percent partitioned Verifies that the percentage of values in a column that match values in another table column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
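
The two integrity measures can be sketched in a few lines. This example assumes that null values in the tested column are skipped rather than counted as mismatches, which this list does not specify; the function name is hypothetical.

```python
from typing import Hashable, Iterable, Optional, Tuple


def foreign_key_stats(
    values: Iterable[Optional[Hashable]],
    reference_values: Iterable[Hashable],
) -> Tuple[int, float]:
    """Return (not_match_count, match_percent) for a column against a reference column."""
    reference = set(reference_values)
    # Assumption: nulls are ignored, as a foreign key lookup has nothing to match.
    checked = [value for value in values if value is not None]
    not_match_count = sum(1 for value in checked if value not in reference)
    if not checked:
        return 0, 100.0
    match_percent = 100.0 * (len(checked) - not_match_count) / len(checked)
    return not_match_count, match_percent


# Example: the value 9 has no counterpart in the reference column.
print(foreign_key_stats([1, 2, 3, None, 9], [1, 2, 3, 4]))
```

The not-match count would be compared against a maximum count, and the match percentage against a minimum percentage.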

nulls

Checks for the presence of null or missing values in a column.

Check name Check type Description
profile_nulls_count profiling Verifies that the number of null values in a column does not exceed the maximum accepted count.
daily_nulls_count recurring Verifies that the number of null values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_nulls_count recurring Verifies that the number of null values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_nulls_count partitioned Verifies that the number of null values in a column does not exceed the set count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_nulls_count partitioned Verifies that the number of null values in a column does not exceed the set count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_nulls_percent profiling Verifies that the percentage of null values in a column does not exceed the maximum accepted percentage.
daily_nulls_percent recurring Verifies that the percentage of null values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_nulls_percent recurring Verifies that the percentage of null values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_nulls_percent partitioned Verifies that the percentage of null values in a column does not exceed the set percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_nulls_percent partitioned Verifies that the percentage of null values in a column does not exceed the set percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_nulls_percent_anomaly_stationary_30_days profiling Verifies that the percentage of null values in a column changes at a rate within a percentile boundary over the last 30 days.
daily_nulls_percent_anomaly_stationary_30_days recurring Verifies that the percentage of null values in a column changes at a rate within a percentile boundary over the last 30 days.
daily_partition_nulls_percent_anomaly_stationary_30_days partitioned Verifies that the percentage of null values in a column changes at a rate within a percentile boundary over the last 30 days.
Check name Check type Description
profile_nulls_percent_anomaly_stationary profiling Verifies that the percentage of null values in a column changes at a rate within a percentile boundary over the last 90 days.
daily_nulls_percent_anomaly_stationary recurring Verifies that the percentage of null values in a column changes at a rate within a percentile boundary over the last 90 days.
daily_partition_nulls_percent_anomaly_stationary partitioned Verifies that the percentage of null values in a column changes at a rate within a percentile boundary over the last 90 days.
Check name Check type Description
profile_nulls_percent_change profiling Verifies that the percentage of null values in a column changed at a fixed rate since the last readout.
daily_nulls_percent_change recurring Verifies that the percentage of null values in a column changed at a fixed rate since the last readout.
daily_partition_nulls_percent_change partitioned Verifies that the percentage of null values in a column changed at a fixed rate since the last readout.
Check name Check type Description
profile_nulls_percent_change_yesterday profiling Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from yesterday.
daily_nulls_percent_change_yesterday recurring Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from yesterday.
daily_partition_nulls_percent_change_yesterday partitioned Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from yesterday.
Check name Check type Description
profile_nulls_percent_change_7_days profiling Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from last week.
daily_nulls_percent_change_7_days recurring Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from last week.
daily_partition_nulls_percent_change_7_days partitioned Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from last week.
Check name Check type Description
profile_nulls_percent_change_30_days profiling Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from last month.
daily_nulls_percent_change_30_days recurring Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from last month.
daily_partition_nulls_percent_change_30_days partitioned Verifies that the percentage of null values in a column changed at a fixed rate since the last readout from last month.
Check name Check type Description
profile_not_nulls_count profiling Verifies that the number of not null values in a column does not fall below the minimum accepted count.
daily_not_nulls_count recurring Verifies that the number of not null values in a column does not fall below the minimum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_not_nulls_count recurring Verifies that the number of not null values in a column does not fall below the minimum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_not_nulls_count partitioned Verifies that the number of not null values in a column does not fall below the minimum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_not_nulls_count partitioned Verifies that the number of not null values in a column does not fall below the minimum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_not_nulls_percent profiling Verifies that the percentage of not null values in a column does not fall below the minimum accepted percentage.
daily_not_nulls_percent recurring Verifies that the percentage of not null values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_not_nulls_percent recurring Verifies that the percentage of not null values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_not_nulls_percent partitioned Verifies that the percentage of not null values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_not_nulls_percent partitioned Verifies that the percentage of not null values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.

numeric

Validates that the data in a numeric column is in the expected format or within predefined ranges.

Check name Check type Description
profile_negative_count profiling Verifies that the number of negative values in a column does not exceed the maximum accepted count.
daily_negative_count recurring Verifies that the number of negative values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_negative_count recurring Verifies that the number of negative values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_negative_count partitioned Verifies that the number of negative values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_negative_count partitioned Verifies that the number of negative values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_negative_percent profiling Verifies that the percentage of negative values in a column does not exceed the maximum accepted percentage.
daily_negative_percent recurring Verifies that the percentage of negative values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_negative_percent recurring Verifies that the percentage of negative values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_negative_percent partitioned Verifies that the percentage of negative values in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_negative_percent partitioned Verifies that the percentage of negative values in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
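
The negative value checks above pair a measured value (a count or a percentage) with a maximum threshold. The following is a minimal Python sketch of that idea, not DQO's implementation. The function names are hypothetical, and two behaviors are assumptions: nulls are never counted as negative, and the percentage is computed over all rows.

```python
from typing import Optional, Sequence


def negative_count(values: Sequence[Optional[float]]) -> int:
    """Count values strictly below zero. None (NULL) is never negative."""
    return sum(1 for v in values if v is not None and v < 0)


def negative_percent(values: Sequence[Optional[float]]) -> float:
    """Percentage of negative values among all rows.

    Assumption: the denominator is the total row count, and an empty
    column yields 0.0 rather than an error.
    """
    if not values:
        return 0.0
    return 100.0 * negative_count(values) / len(values)


def passes_max_rule(measured: float, max_allowed: float) -> bool:
    """A 'does not exceed the maximum' rule passes when measured <= max."""
    return measured <= max_allowed


if __name__ == "__main__":
    column = [-1.0, 2.0, None, -3.0]
    print(negative_count(column), negative_percent(column))
    print(passes_max_rule(negative_count(column), max_allowed=1))
```

The count variant suits columns where any negative value is an error, while the percent variant tolerates a fixed share of negative values as the table grows.
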
Check name Check type Description
profile_non_negative_count profiling Verifies that the number of non-negative values in a column does not exceed the maximum accepted count.
daily_non_negative_count recurring Verifies that the number of non-negative values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_non_negative_count recurring Verifies that the number of non-negative values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_non_negative_count partitioned Verifies that the number of non-negative values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_non_negative_count partitioned Verifies that the number of non-negative values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_non_negative_percent profiling Verifies that the percentage of non-negative values in a column does not exceed the maximum accepted percentage.
daily_non_negative_percent recurring Verifies that the percentage of non-negative values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_non_negative_percent recurring Verifies that the percentage of non-negative values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_non_negative_percent partitioned Verifies that the percentage of non-negative values in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_non_negative_percent partitioned Verifies that the percentage of non-negative values in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_expected_numbers_in_use_count profiling Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing).
daily_expected_numbers_in_use_count recurring Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_expected_numbers_in_use_count recurring Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_expected_numbers_in_use_count partitioned Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_expected_numbers_in_use_count partitioned Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_number_value_in_set_percent profiling Measures the percentage of rows whose value in the tested column is either one of the values in a list of expected values or null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage.
daily_number_value_in_set_percent recurring Measures the percentage of rows whose value in the tested column is either one of the values in a list of expected values or null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_number_value_in_set_percent recurring Measures the percentage of rows whose value in the tested column is either one of the values in a list of expected values or null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_number_value_in_set_percent partitioned Measures the percentage of rows whose value in the tested column is either one of the values in a list of expected values or null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_number_value_in_set_percent partitioned Measures the percentage of rows whose value in the tested column is either one of the values in a list of expected values or null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_values_in_range_numeric_percent profiling Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage.
daily_values_in_range_numeric_percent recurring Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_values_in_range_numeric_percent recurring Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_values_in_range_numeric_percent partitioned Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_values_in_range_numeric_percent partitioned Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_values_in_range_integers_percent profiling Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage.
daily_values_in_range_integers_percent recurring Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_values_in_range_integers_percent recurring Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_values_in_range_integers_percent partitioned Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_values_in_range_integers_percent partitioned Verifies that the percentage of values within the set range in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
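
The values-in-range checks above pair a measured percentage with a minimum threshold. A minimal Python sketch of that logic follows, under stated assumptions: the function names are hypothetical, both range bounds are treated as inclusive, nulls are excluded from the denominator, and an all-null column is reported as 100%. DQO's own sensors may handle these edge cases differently.

```python
from typing import Optional, Sequence


def values_in_range_percent(
    values: Sequence[Optional[float]],
    min_value: float,
    max_value: float,
) -> float:
    """Percentage of non-null values that lie in [min_value, max_value].

    Assumptions: inclusive bounds, nulls ignored, and 100.0 returned
    when there are no non-null values to test.
    """
    non_null = [v for v in values if v is not None]
    if not non_null:
        return 100.0
    in_range = sum(1 for v in non_null if min_value <= v <= max_value)
    return 100.0 * in_range / len(non_null)


def passes_min_rule(measured: float, min_allowed: float) -> bool:
    """A 'does not fall below the minimum' rule passes when measured >= min."""
    return measured >= min_allowed


if __name__ == "__main__":
    column = [1, 5, 10, None, 20]
    percent = values_in_range_percent(column, min_value=1, max_value=10)
    print(percent, passes_min_rule(percent, min_allowed=80.0))
```
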
Check name Check type Description
profile_value_below_min_value_count profiling Counts the values in the column that are below the minimum value defined by the user as a parameter.
daily_value_below_min_value_count recurring Counts the values in the column that are below the minimum value defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_value_below_min_value_count recurring Counts the values in the column that are below the minimum value defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_value_below_min_value_count partitioned Counts the values in the column that are below the minimum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_value_below_min_value_count partitioned Counts the values in the column that are below the minimum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_value_below_min_value_percent profiling Measures the percentage of values in the column that are below the minimum value defined by the user as a parameter.
daily_value_below_min_value_percent recurring Measures the percentage of values in the column that are below the minimum value defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_value_below_min_value_percent recurring Measures the percentage of values in the column that are below the minimum value defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_value_below_min_value_percent partitioned Measures the percentage of values in the column that are below the minimum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_value_below_min_value_percent partitioned Measures the percentage of values in the column that are below the minimum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_value_above_max_value_count profiling Counts the values in the column that are above the maximum value defined by the user as a parameter.
daily_value_above_max_value_count recurring Counts the values in the column that are above the maximum value defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_value_above_max_value_count recurring Counts the values in the column that are above the maximum value defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_value_above_max_value_count partitioned Counts the values in the column that are above the maximum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_value_above_max_value_count partitioned Counts the values in the column that are above the maximum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_value_above_max_value_percent profiling Measures the percentage of values in the column that are above the maximum value defined by the user as a parameter.
daily_value_above_max_value_percent recurring Measures the percentage of values in the column that are above the maximum value defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_value_above_max_value_percent recurring Measures the percentage of values in the column that are above the maximum value defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_value_above_max_value_percent partitioned Measures the percentage of values in the column that are above the maximum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_value_above_max_value_percent partitioned Measures the percentage of values in the column that are above the maximum value defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_max_in_range profiling Verifies that the maximal value in a column is not outside the set range.
daily_max_in_range recurring Verifies that the maximal value in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_max_in_range recurring Verifies that the maximal value in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_max_in_range partitioned Verifies that the maximal value in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_max_in_range partitioned Verifies that the maximal value in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_min_in_range profiling Verifies that the minimal value in a column is not outside the set range.
daily_min_in_range recurring Verifies that the minimal value in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_min_in_range recurring Verifies that the minimal value in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_min_in_range partitioned Verifies that the minimal value in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_min_in_range partitioned Verifies that the minimal value in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_mean_in_range profiling Verifies that the average (mean) of all values in a column is not outside the set range.
daily_mean_in_range recurring Verifies that the average (mean) of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_mean_in_range recurring Verifies that the average (mean) of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_mean_in_range partitioned Verifies that the average (mean) of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_mean_in_range partitioned Verifies that the average (mean) of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_percentile_in_range profiling Verifies that the percentile of all values in a column is not outside the set range.
profile_median_in_range profiling Verifies that the median of all values in a column is not outside the set range.
profile_percentile_10_in_range profiling Verifies that the 10th percentile of all values in a column is not outside the set range.
profile_percentile_25_in_range profiling Verifies that the 25th percentile of all values in a column is not outside the set range.
profile_percentile_75_in_range profiling Verifies that the 75th percentile of all values in a column is not outside the set range.
profile_percentile_90_in_range profiling Verifies that the 90th percentile of all values in a column is not outside the set range.
daily_percentile_in_range recurring Verifies that the percentile of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
daily_median_in_range recurring Verifies that the median of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
daily_percentile_10_in_range recurring Verifies that the 10th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
daily_percentile_25_in_range recurring Verifies that the 25th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
daily_percentile_75_in_range recurring Verifies that the 75th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
daily_percentile_90_in_range recurring Verifies that the 90th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_percentile_in_range recurring Verifies that the percentile of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
monthly_median_in_range recurring Verifies that the median of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
monthly_percentile_10_in_range recurring Verifies that the 10th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
monthly_percentile_25_in_range recurring Verifies that the 25th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
monthly_percentile_75_in_range recurring Verifies that the 75th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
monthly_percentile_90_in_range recurring Verifies that the 90th percentile of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_percentile_in_range partitioned Verifies that the percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
daily_partition_median_in_range partitioned Verifies that the median of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
daily_partition_percentile_10_in_range partitioned Verifies that the 10th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
daily_partition_percentile_25_in_range partitioned Verifies that the 25th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
daily_partition_percentile_75_in_range partitioned Verifies that the 75th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
daily_partition_percentile_90_in_range partitioned Verifies that the 90th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_percentile_in_range partitioned Verifies that the percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
monthly_partition_median_in_range partitioned Verifies that the median of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
monthly_partition_percentile_10_in_range partitioned Verifies that the 10th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
monthly_partition_percentile_25_in_range partitioned Verifies that the 25th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
monthly_partition_percentile_75_in_range partitioned Verifies that the 75th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
monthly_partition_percentile_90_in_range partitioned Verifies that the 90th percentile of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
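
The percentile and median checks above compare a single percentile of the column against a range. The sketch below illustrates one common definition, a continuous percentile with linear interpolation between neighboring values. It is an assumption that this matches any particular database, since percentile functions vary between engines and some use approximations. The function names are hypothetical.

```python
from typing import Sequence


def percentile_cont(values: Sequence[float], fraction: float) -> float:
    """Continuous percentile with linear interpolation.

    `fraction` is in [0, 1], for example 0.5 for the median and 0.9 for
    the 90th percentile. Nulls are assumed to be filtered out beforehand.
    """
    if not values:
        raise ValueError("percentile of an empty column is undefined")
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    ordered = sorted(values)
    position = fraction * (len(ordered) - 1)
    lower = int(position)  # floor, because position is never negative
    upper = min(lower + 1, len(ordered) - 1)
    weight = position - lower
    return ordered[lower] + weight * (ordered[upper] - ordered[lower])


def in_range(value: float, lower_bound: float, upper_bound: float) -> bool:
    """An 'is not outside the set range' rule with inclusive bounds."""
    return lower_bound <= value <= upper_bound


if __name__ == "__main__":
    column = [10, 20, 30, 40, 50]
    median = percentile_cont(column, 0.5)
    print(median, in_range(median, 25, 35))
```
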
Check name Check type Description
profile_sample_stddev_in_range profiling Verifies that the sample standard deviation of all values in a column is not outside the set range.
daily_sample_stddev_in_range recurring Verifies that the sample standard deviation of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sample_stddev_in_range recurring Verifies that the sample standard deviation of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sample_stddev_in_range partitioned Verifies that the sample standard deviation of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sample_stddev_in_range partitioned Verifies that the sample standard deviation of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_population_stddev_in_range profiling Verifies that the population standard deviation of all values in a column is not outside the set range.
daily_population_stddev_in_range recurring Verifies that the population standard deviation of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_population_stddev_in_range recurring Verifies that the population standard deviation of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_population_stddev_in_range partitioned Verifies that the population standard deviation of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_population_stddev_in_range partitioned Verifies that the population standard deviation of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_sample_variance_in_range profiling Verifies that the sample variance of all values in a column is not outside the set range.
daily_sample_variance_in_range recurring Verifies that the sample variance of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sample_variance_in_range recurring Verifies that the sample variance of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sample_variance_in_range partitioned Verifies that the sample variance of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sample_variance_in_range partitioned Verifies that the sample variance of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_population_variance_in_range profiling Verifies that the population variance of all values in a column is not outside the set range.
daily_population_variance_in_range recurring Verifies that the population variance of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_population_variance_in_range recurring Verifies that the population variance of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_population_variance_in_range partitioned Verifies that the population variance of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_population_variance_in_range partitioned Verifies that the population variance of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
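
The four groups of checks above differ only in which spread statistic they measure. The sample variants divide by n - 1 and the population variants divide by n, so they give different results on the same column. The sketch below shows the difference using Python's standard `statistics` module. The helper name `spread_statistics` is hypothetical and this is not DQO's implementation.

```python
import statistics
from typing import Dict, Sequence


def spread_statistics(values: Sequence[float]) -> Dict[str, float]:
    """Compute the four spread measures named in the checks above.

    Sample statistics need at least two values; population statistics
    need at least one. Nulls are assumed to be filtered out beforehand.
    """
    return {
        "sample_stddev": statistics.stdev(values),
        "population_stddev": statistics.pstdev(values),
        "sample_variance": statistics.variance(values),
        "population_variance": statistics.pvariance(values),
    }


def in_range(value: float, lower_bound: float, upper_bound: float) -> bool:
    """An 'is not outside the set range' rule with inclusive bounds."""
    return lower_bound <= value <= upper_bound


if __name__ == "__main__":
    column = [2, 4, 4, 4, 5, 5, 7, 9]
    for name, value in spread_statistics(column).items():
        print(f"{name}: {value:.4f} in [1.5, 4.5]: {in_range(value, 1.5, 4.5)}")
```

For this column the population variance is 4 while the sample variance is 32/7 (about 4.57), so a range that accepts one statistic can reject the other.
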
Check name Check type Description
profile_sum_in_range profiling Verifies that the sum of all values in a column is not outside the set range.
daily_sum_in_range recurring Verifies that the sum of all values in a column is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sum_in_range recurring Verifies that the sum of all values in a column is not outside the set range. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_sum_in_range partitioned Verifies that the sum of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sum_in_range partitioned Verifies that the sum of all values in a column is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_invalid_latitude_count profiling Verifies that the number of invalid latitude values in a column does not exceed the maximum accepted count.
daily_invalid_latitude_count recurring Verifies that the number of invalid latitude values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_invalid_latitude_count recurring Verifies that the number of invalid latitude values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_invalid_latitude_count partitioned Verifies that the number of invalid latitude values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_invalid_latitude_count partitioned Verifies that the number of invalid latitude values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_valid_latitude_percent profiling Verifies that the percentage of valid latitude values in a column does not fall below the minimum accepted percentage.
daily_valid_latitude_percent recurring Verifies that the percentage of valid latitude values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_latitude_percent recurring Verifies that the percentage of valid latitude values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_valid_latitude_percent partitioned Verifies that the percentage of valid latitude values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_latitude_percent partitioned Verifies that the percentage of valid latitude values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_invalid_longitude_count profiling Verifies that the number of invalid longitude values in a column does not exceed the maximum accepted count.
daily_invalid_longitude_count recurring Verifies that the number of invalid longitude values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_invalid_longitude_count recurring Verifies that the number of invalid longitude values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_invalid_longitude_count partitioned Verifies that the number of invalid longitude values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_invalid_longitude_count partitioned Verifies that the number of invalid longitude values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_valid_longitude_percent profiling Verifies that the percentage of valid longitude values in a column does not fall below the minimum accepted percentage.
daily_valid_longitude_percent recurring Verifies that the percentage of valid longitude values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_longitude_percent recurring Verifies that the percentage of valid longitude values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_valid_longitude_percent partitioned Verifies that the percentage of valid longitude values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_longitude_percent partitioned Verifies that the percentage of valid longitude values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
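
The two longitude check families above reduce to one count and one percentage compared against a threshold. The sketch below is illustrative only. It assumes that a valid longitude lies in the range [-180, 180] and that NULL values are skipped; the function name and the thresholds are hypothetical and are not taken from DQO.

```python
from typing import Iterable, Optional, Tuple


def longitude_stats(values: Iterable[Optional[float]]) -> Tuple[int, float]:
    """Return (invalid_count, valid_percent) for a column of longitudes.

    Assumption: a longitude is valid when it lies in [-180, 180].
    Assumption: None (NULL) values are skipped entirely.
    """
    non_null = [v for v in values if v is not None]
    invalid = sum(1 for v in non_null if not -180.0 <= v <= 180.0)
    if not non_null:
        # Assumption: a column without values is treated as fully valid.
        return 0, 100.0
    valid_percent = 100.0 * (len(non_null) - invalid) / len(non_null)
    return invalid, valid_percent


column = [21.01, -122.42, 181.5, None, -200.0]
invalid_count, valid_percent = longitude_stats(column)

max_count = 0       # hypothetical "maximum accepted count"
min_percent = 95.0  # hypothetical "minimum accepted percentage"
print("invalid_longitude_count passed:", invalid_count <= max_count)
print("valid_longitude_percent passed:", valid_percent >= min_percent)
```

With this sample column both checks would fail, because two of the four non-NULL values fall outside the accepted range.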

pii

Checks for the presence of sensitive or personally identifiable information (PII) in a column, such as email addresses, phone numbers, zip codes, and IP4 and IP6 addresses.

Check name Check type Description
profile_valid_usa_phone_percent profiling Verifies that the percentage of valid USA phone values in a column does not fall below the minimum accepted percentage.
daily_valid_usa_phone_percent recurring Verifies that the percentage of valid USA phone values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_usa_phone_percent recurring Verifies that the percentage of valid USA phone values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_valid_usa_phone_percent partitioned Verifies that the percentage of valid USA phone values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_usa_phone_percent partitioned Verifies that the percentage of valid USA phone values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_contains_usa_phone_percent profiling Verifies that the percentage of rows that contain a USA phone number in a column does not exceed the maximum accepted percentage.
daily_contains_usa_phone_percent recurring Verifies that the percentage of rows that contain a USA phone number in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_contains_usa_phone_percent recurring Verifies that the percentage of rows that contain a USA phone number in a column does not exceed the maximum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_contains_usa_phone_percent partitioned Verifies that the percentage of rows that contain a USA phone number in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_contains_usa_phone_percent partitioned Verifies that the percentage of rows that contain a USA phone number in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_valid_usa_zipcode_percent profiling Verifies that the percentage of valid USA zip code values in a column does not fall below the minimum accepted percentage.
daily_valid_usa_zipcode_percent recurring Verifies that the percentage of valid USA zip code values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_usa_zipcode_percent recurring Verifies that the percentage of valid USA zip code values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_valid_usa_zipcode_percent partitioned Verifies that the percentage of valid USA zip code values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_usa_zipcode_percent partitioned Verifies that the percentage of valid USA zip code values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_contains_usa_zipcode_percent profiling Verifies that the percentage of rows that contain a USA zip code in a column does not exceed the maximum accepted percentage.
daily_contains_usa_zipcode_percent recurring Verifies that the percentage of rows that contain a USA zip code in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_contains_usa_zipcode_percent recurring Verifies that the percentage of rows that contain a USA zip code in a column does not exceed the maximum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_contains_usa_zipcode_percent partitioned Verifies that the percentage of rows that contain a USA zip code in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_contains_usa_zipcode_percent partitioned Verifies that the percentage of rows that contain a USA zip code in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_valid_email_percent profiling Verifies that the percentage of valid email values in a column does not fall below the minimum accepted percentage.
daily_valid_email_percent recurring Verifies that the percentage of valid email values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_email_percent recurring Verifies that the percentage of valid email values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_valid_email_percent partitioned Verifies that the percentage of valid email values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_email_percent partitioned Verifies that the percentage of valid email values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_contains_email_percent profiling Verifies that the percentage of rows that contain valid emails in a column does not exceed the maximum accepted percentage.
daily_contains_email_percent recurring Verifies that the percentage of rows that contain emails in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_contains_email_percent recurring Verifies that the percentage of rows that contain emails in a column does not exceed the maximum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_contains_email_percent partitioned Verifies that the percentage of rows that contain emails in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_contains_email_percent partitioned Verifies that the percentage of rows that contain emails in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
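
The "valid" and "contains" variants in this category differ in how a value is matched: a "valid" check treats the whole value as the candidate, while a "contains" check looks for a match anywhere inside the text. A minimal sketch of that difference for emails follows. The regular expression is a deliberately simple, hypothetical pattern; the pattern that DQO actually uses is not given here.

```python
import re
from typing import List

# Hypothetical, simplified email pattern used only for illustration.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def valid_email_percent(values: List[str]) -> float:
    """Percentage of values that are an email from start to end."""
    matches = sum(1 for v in values if EMAIL.fullmatch(v))
    return 100.0 * matches / len(values)


def contains_email_percent(values: List[str]) -> float:
    """Percentage of values with an email anywhere in the text."""
    matches = sum(1 for v in values if EMAIL.search(v))
    return 100.0 * matches / len(values)


column = ["anna@example.com", "contact: bob@example.org", "no email here", "x@y"]
valid_pct = valid_email_percent(column)
contains_pct = contains_email_percent(column)
print(valid_pct, contains_pct)
```

The second value counts toward the "contains" percentage only, which is why a free-text column can pass a "valid" threshold poorly yet still trip a "contains" alert.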
Check name Check type Description
profile_valid_ip4_address_percent profiling Verifies that the percentage of valid IP4 address values in a column does not fall below the minimum accepted percentage.
daily_valid_ip4_address_percent recurring Verifies that the percentage of valid IP4 address values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_ip4_address_percent recurring Verifies that the percentage of valid IP4 address values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_valid_ip4_address_percent partitioned Verifies that the percentage of valid IP4 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_ip4_address_percent partitioned Verifies that the percentage of valid IP4 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_contains_ip4_percent profiling Verifies that the percentage of rows that contain valid IP4 address values in a column does not fall below the minimum accepted percentage.
daily_contains_ip4_percent recurring Verifies that the percentage of rows that contain IP4 address values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_contains_ip4_percent recurring Verifies that the percentage of rows that contain IP4 address values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_contains_ip4_percent partitioned Verifies that the percentage of rows that contain IP4 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_contains_ip4_percent partitioned Verifies that the percentage of rows that contain IP4 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_valid_ip6_address_percent profiling Verifies that the percentage of valid IP6 address values in a column does not fall below the minimum accepted percentage.
daily_valid_ip6_address_percent recurring Verifies that the percentage of valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_valid_ip6_address_percent recurring Verifies that the percentage of valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_valid_ip6_address_percent partitioned Verifies that the percentage of valid IP6 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_ip6_address_percent partitioned Verifies that the percentage of valid IP6 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_contains_ip6_percent profiling Verifies that the percentage of rows that contain valid IP6 address values in a column does not fall below the minimum accepted percentage.
daily_contains_ip6_percent recurring Verifies that the percentage of rows that contain valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_contains_ip6_percent recurring Verifies that the percentage of rows that contain valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_contains_ip6_percent partitioned Verifies that the percentage of rows that contain valid IP6 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_contains_ip6_percent partitioned Verifies that the percentage of rows that contain valid IP6 address values in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
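
For the IP4 and IP6 address checks, the percentage of valid values can be approximated outside the database with Python's standard `ipaddress` module. This is only a sketch of the idea: DQO evaluates its checks in the monitored database, and its matching rules may differ from those of `ipaddress`. The helper names are hypothetical.

```python
import ipaddress
from typing import List, Optional


def is_valid_ip(text: str, version: int) -> bool:
    """True when text parses as an IP address of the given version (4 or 6)."""
    try:
        return ipaddress.ip_address(text.strip()).version == version
    except ValueError:
        return False


def valid_ip_percent(values: List[Optional[str]], version: int) -> float:
    """Percentage of non-NULL values that are valid addresses of that version."""
    non_null = [v for v in values if v is not None]
    if not non_null:
        return 100.0  # assumption: nothing to validate counts as fully valid
    valid = sum(1 for v in non_null if is_valid_ip(v, version))
    return 100.0 * valid / len(non_null)


column = ["192.168.0.1", "256.1.1.1", "::1", "2001:db8::ff00:42:8329"]
ip4_percent = valid_ip_percent(column, 4)
ip6_percent = valid_ip_percent(column, 6)
print(ip4_percent, ip6_percent)
```

Here `256.1.1.1` is rejected because an octet exceeds 255, so only one of the four values counts as a valid IP4 address.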

schema

Detects schema drift, such as a missing column or a changed data type.

Check name Check type Description
profile_column_exists profiling Checks the metadata of the monitored table and verifies if the column exists.
daily_column_exists recurring Checks the metadata of the monitored table and verifies if the column exists. Stores the most recent value for each day when the data quality check was evaluated.
monthly_column_exists recurring Checks the metadata of the monitored table and verifies if the column exists. Stores the most recent value for each month when the data quality check was evaluated.
Check name Check type Description
profile_column_type_changed profiling Checks the metadata of the monitored column and detects if the data type (including the length, precision, scale, nullability) has changed.
daily_column_type_changed recurring Checks the metadata of the monitored column and detects if the data type (including the length, precision, scale, nullability) has changed since the last day. Stores the most recent hash for each day when the data quality check was evaluated.
monthly_column_type_changed recurring Checks the metadata of the monitored column and detects if the data type (including the length, precision, scale, nullability) has changed since the last month. Stores the most recent hash for each month when the data quality check was evaluated.
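
The column_type_changed checks store a hash and compare it with the previously stored one. One way to picture this is to hash the column's type metadata and compare the hashes from two evaluations. The sketch below assumes SHA-256 and a pipe-separated payload; both are illustrative choices, and the hash function and field layout that DQO uses are not specified here.

```python
import hashlib
from typing import Optional


def column_type_hash(
    data_type: str,
    length: Optional[int],
    precision: Optional[int],
    scale: Optional[int],
    nullable: bool,
) -> str:
    """Hash the type metadata of a column (hypothetical layout)."""
    parts = [data_type, length, precision, scale, nullable]
    payload = "|".join("" if p is None else str(p) for p in parts)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


# Metadata captured on two consecutive days; the length grew from 50 to 100.
yesterday = column_type_hash("VARCHAR", 50, None, None, True)
today = column_type_hash("VARCHAR", 100, None, None, True)

changed = yesterday != today
print("column type changed:", changed)
```

Storing only the hash keeps each day's or month's result small while still detecting a change in any of the hashed attributes.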

sql

Validates data against user-defined SQL queries at the column level. Checks in this group verify that a set percentage of rows passes a custom SQL expression or that a custom SQL expression is not outside a set range.

Check name Check type Description
profile_sql_condition_passed_percent_on_column profiling Verifies that a minimum percentage of rows passed a custom SQL condition (expression).
daily_sql_condition_passed_percent_on_column recurring Verifies that a minimum percentage of rows passed a custom SQL condition (expression). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sql_condition_passed_percent_on_column recurring Verifies that a minimum percentage of rows passed a custom SQL condition (expression). Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_sql_condition_passed_percent_on_column partitioned Verifies that a minimum percentage of rows passed a custom SQL condition (expression). Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sql_condition_passed_percent_on_column partitioned Verifies that a minimum percentage of rows passed a custom SQL condition (expression). Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_sql_condition_failed_count_on_column profiling Verifies that the number of rows that failed a custom SQL condition (expression) does not exceed the maximum accepted count.
daily_sql_condition_failed_count_on_column recurring Verifies that the number of rows that failed a custom SQL condition (expression) does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sql_condition_failed_count_on_column recurring Verifies that the number of rows that failed a custom SQL condition (expression) does not exceed the maximum accepted count. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_sql_condition_failed_count_on_column partitioned Verifies that the number of rows that failed a custom SQL condition (expression) does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sql_condition_failed_count_on_column partitioned Verifies that the number of rows that failed a custom SQL condition (expression) does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_sql_aggregate_expr_column profiling Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range.
daily_sql_aggregate_expr_column recurring Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_sql_aggregate_expr_column recurring Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_sql_aggregate_expr_column partitioned Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_sql_aggregate_expr_column partitioned Verifies that a custom aggregated SQL expression (MIN, MAX, etc.) is not outside the set range. Creates a separate data quality check (and an alert) for each monthly partition.
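
The three measurements in this category can be tried out with an in-memory SQLite database. The sketch below is an approximation for illustration: the table, the condition, and the accepted range are hypothetical, and the queries are not the SQL that DQO generates.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (price REAL)")
conn.executemany(
    "INSERT INTO orders (price) VALUES (?)",
    [(10.0,), (25.5,), (-3.0,), (40.0,)],
)

# Hypothetical user-defined condition. It is interpolated into the query,
# so it must come from a trusted configuration, never from untrusted input.
condition = "price >= 0"

passed_percent, failed_count = conn.execute(
    f"""
    SELECT 100.0 * SUM(CASE WHEN {condition} THEN 1 ELSE 0 END) / COUNT(*),
           SUM(CASE WHEN NOT ({condition}) THEN 1 ELSE 0 END)
    FROM orders
    """
).fetchone()

# Hypothetical aggregate expression checked against an accepted range.
max_price = conn.execute("SELECT MAX(price) FROM orders").fetchone()[0]
in_range = 0.0 <= max_price <= 100.0

print(passed_percent, failed_count, max_price, in_range)
conn.close()
```

The passed percentage would be compared with a minimum accepted percentage, the failed count with a maximum accepted count, and the aggregate with the bounds of the set range.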

strings

Validates that the data in a string column matches the expected format or pattern.

Check name Check type Description
profile_string_max_length profiling Verifies that the length of string in a column does not exceed the maximum accepted length.
daily_string_max_length recurring Verifies that the length of string in a column does not exceed the maximum accepted length. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_max_length recurring Verifies that the length of string in a column does not exceed the maximum accepted length. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_max_length partitioned Verifies that the length of string in a column does not exceed the maximum accepted length. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_max_length partitioned Verifies that the length of string in a column does not exceed the maximum accepted length. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_min_length profiling Verifies that the length of string in a column does not fall below the minimum accepted length.
daily_string_min_length recurring Verifies that the length of string in a column does not fall below the minimum accepted length. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_min_length recurring Verifies that the length of string in a column does not fall below the minimum accepted length. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_min_length partitioned Verifies that the length of string in a column does not fall below the minimum accepted length. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_min_length partitioned Verifies that the length of string in a column does not fall below the minimum accepted length. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_mean_length profiling Verifies that the length of string in a column does not exceed the mean accepted length.
daily_string_mean_length recurring Verifies that the length of string in a column does not exceed the mean accepted length. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_mean_length recurring Verifies that the length of string in a column does not exceed the mean accepted length. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_mean_length partitioned Verifies that the length of string in a column does not exceed the mean accepted length. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_mean_length partitioned Verifies that the length of string in a column does not exceed the mean accepted length. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_length_below_min_length_count profiling The check counts the strings in the column that are shorter than the length defined by the user as a parameter.
daily_string_length_below_min_length_count recurring The check counts the strings in the column that are shorter than the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_length_below_min_length_count recurring The check counts the strings in the column that are shorter than the length defined by the user as a parameter. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_length_below_min_length_count partitioned The check counts the strings in the column that are shorter than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_length_below_min_length_count partitioned The check counts the strings in the column that are shorter than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_length_below_min_length_percent profiling The check calculates the percentage of strings in the column that are shorter than the length defined by the user as a parameter.
daily_string_length_below_min_length_percent recurring The check calculates the percentage of strings in the column that are shorter than the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_length_below_min_length_percent recurring The check calculates the percentage of strings in the column that are shorter than the length defined by the user as a parameter. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_length_below_min_length_percent partitioned The check calculates the percentage of strings in the column that are shorter than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_length_below_min_length_percent partitioned The check calculates the percentage of strings in the column that are shorter than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_length_above_max_length_count profiling The check counts the strings in the column that are longer than the length defined by the user as a parameter.
daily_string_length_above_max_length_count recurring The check counts the strings in the column that are longer than the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_length_above_max_length_count recurring The check counts the strings in the column that are longer than the length defined by the user as a parameter. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_length_above_max_length_count partitioned The check counts the strings in the column that are longer than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_length_above_max_length_count partitioned The check counts the strings in the column that are longer than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_length_above_max_length_percent profiling The check calculates the percentage of strings in the column that are longer than the length defined by the user as a parameter.
daily_string_length_above_max_length_percent recurring The check calculates the percentage of strings in the column that are longer than the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_length_above_max_length_percent recurring The check calculates the percentage of strings in the column that are longer than the length defined by the user as a parameter. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_length_above_max_length_percent partitioned The check calculates the percentage of strings in the column that are longer than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_length_above_max_length_percent partitioned The check calculates the percentage of strings in the column that are longer than the length defined by the user as a parameter. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_length_in_range_percent profiling The check calculates the percentage of strings in the column whose length is within the range provided by the user.
daily_string_length_in_range_percent recurring The check calculates the percentage of strings in the column whose length is within the range provided by the user. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_length_in_range_percent recurring The check calculates the percentage of strings in the column whose length is within the range provided by the user. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_length_in_range_percent partitioned The check calculates the percentage of strings in the column whose length is within the range provided by the user. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_length_in_range_percent partitioned The check calculates the percentage of strings in the column whose length is within the range provided by the user. Creates a separate data quality check (and an alert) for each monthly partition.
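
The string length checks listed so far all derive from the lengths of the non-NULL strings in a column. The sketch below computes those measurements in one pass. It assumes that NULL values are skipped and that the range bounds are inclusive; the function and key names are hypothetical.

```python
from typing import Dict, Optional, Sequence


def string_length_stats(
    values: Sequence[Optional[str]], min_length: int, max_length: int
) -> Dict[str, float]:
    """Length measurements for a string column.

    Assumptions: None (NULL) values are skipped, min_length <= max_length,
    and a length equal to either bound counts as inside the range.
    """
    lengths = [len(v) for v in values if v is not None]
    total = len(lengths)
    if total == 0:
        raise ValueError("column has no non-NULL strings")
    below = sum(1 for n in lengths if n < min_length)
    above = sum(1 for n in lengths if n > max_length)
    in_range = total - below - above
    return {
        "min_length": min(lengths),
        "max_length": max(lengths),
        "mean_length": sum(lengths) / total,
        "below_min_length_count": below,
        "below_min_length_percent": 100.0 * below / total,
        "above_max_length_count": above,
        "above_max_length_percent": 100.0 * above / total,
        "length_in_range_percent": 100.0 * in_range / total,
    }


column = ["a", "abc", "abcdef", "abcdefghijk", None]
stats = string_length_stats(column, min_length=2, max_length=10)
for name, value in stats.items():
    print(name, value)
```

Each check then compares one of these values with its own accepted threshold, for example a maximum accepted count for the count checks.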
Check name Check type Description
profile_string_empty_count profiling Verifies that the number of empty strings in a column does not exceed the maximum accepted count.
daily_string_empty_count recurring Verifies that the number of empty strings in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_empty_count recurring Verifies that the number of empty strings in a column does not exceed the maximum accepted count. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_empty_count partitioned Verifies that the number of empty strings in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_empty_count partitioned Verifies that the number of empty strings in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_empty_percent profiling Verifies that the percentage of empty strings in a column does not exceed the maximum accepted percentage.
daily_string_empty_percent recurring Verifies that the percentage of empty strings in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_empty_percent recurring Verifies that the percentage of empty strings in a column does not exceed the maximum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_empty_percent partitioned Verifies that the percentage of empty strings in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_empty_percent partitioned Verifies that the percentage of empty strings in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_whitespace_count profiling Verifies that the number of whitespace strings in a column does not exceed the maximum accepted count.
daily_string_whitespace_count recurring Verifies that the number of whitespace strings in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_whitespace_count recurring Verifies that the number of whitespace strings in a column does not exceed the maximum accepted count. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_whitespace_count partitioned Verifies that the number of whitespace strings in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_whitespace_count partitioned Verifies that the number of whitespace strings in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_whitespace_percent profiling Verifies that the percentage of whitespace strings in a column does not exceed the maximum accepted percentage.
daily_string_whitespace_percent recurring Verifies that the percentage of whitespace strings in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_whitespace_percent recurring Verifies that the percentage of whitespace strings in a column does not exceed the maximum accepted percentage. Stores the most recent row count for each month when the data quality check was evaluated.
daily_partition_string_whitespace_percent partitioned Verifies that the percentage of whitespace strings in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_whitespace_percent partitioned Verifies that the percentage of whitespace strings in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_surrounded_by_whitespace_count profiling Verifies that the number of strings surrounded by whitespace in a column does not exceed the maximum accepted count.
daily_string_surrounded_by_whitespace_count recurring Verifies that the number of strings surrounded by whitespace in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_surrounded_by_whitespace_count recurring Verifies that the number of strings surrounded by whitespace in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_surrounded_by_whitespace_count partitioned Verifies that the number of strings surrounded by whitespace in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_surrounded_by_whitespace_count partitioned Verifies that the number of strings surrounded by whitespace in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_surrounded_by_whitespace_percent profiling Verifies that the percentage of strings surrounded by whitespace in a column does not exceed the maximum accepted percentage.
daily_string_surrounded_by_whitespace_percent recurring Verifies that the percentage of strings surrounded by whitespace in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_surrounded_by_whitespace_percent recurring Verifies that the percentage of strings surrounded by whitespace in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_surrounded_by_whitespace_percent partitioned Verifies that the percentage of strings surrounded by whitespace in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_surrounded_by_whitespace_percent partitioned Verifies that the percentage of strings surrounded by whitespace in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
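The surrounded-by-whitespace count and percentage described above can be illustrated with a minimal Python sketch. It is not DQO's sensor implementation: the function name `surrounded_by_whitespace_stats` is hypothetical, and the sketch assumes that whitespace-only strings are left to the whitespace checks and that the percentage is taken over all rows, including nulls.

```python
from typing import Iterable, Optional, Tuple


def surrounded_by_whitespace_stats(values: Iterable[Optional[str]]) -> Tuple[int, float]:
    """Return (count, percent) of strings with leading or trailing whitespace.

    Assumptions (not taken from the DQO documentation):
    - whitespace-only strings are not counted here;
    - the percentage denominator is the total number of rows.
    """
    rows = list(values)
    count = sum(
        1
        for v in rows
        if v is not None and v.strip() != "" and v != v.strip()
    )
    percent = 100.0 * count / len(rows) if rows else 0.0
    return count, percent


count, percent = surrounded_by_whitespace_stats(["  a", "b", "c ", None, "   "])
print(count, percent)
```

A count check would then compare `count` with the maximum accepted count, and a percent check would compare `percent` with the maximum accepted percentage.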
Check name Check type Description
profile_string_null_placeholder_count profiling Verifies that the number of null placeholders in a column does not exceed the maximum accepted count.
daily_string_null_placeholder_count recurring Verifies that the number of null placeholders in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_null_placeholder_count recurring Verifies that the number of null placeholders in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_null_placeholder_count partitioned Verifies that the number of null placeholders in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_null_placeholder_count partitioned Verifies that the number of null placeholders in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_null_placeholder_percent profiling Verifies that the percentage of null placeholders in a column does not exceed the maximum accepted percentage.
daily_string_null_placeholder_percent recurring Verifies that the percentage of null placeholders in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_null_placeholder_percent recurring Verifies that the percentage of null placeholders in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_null_placeholder_percent partitioned Verifies that the percentage of null placeholders in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_null_placeholder_percent partitioned Verifies that the percentage of null placeholders in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_boolean_placeholder_percent profiling Verifies that the percentage of boolean placeholder strings in a column does not fall below the minimum accepted percentage.
daily_string_boolean_placeholder_percent recurring Verifies that the percentage of boolean placeholder strings in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_boolean_placeholder_percent recurring Verifies that the percentage of boolean placeholder strings in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_boolean_placeholder_percent partitioned Verifies that the percentage of boolean placeholder strings in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_boolean_placeholder_percent partitioned Verifies that the percentage of boolean placeholder strings in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_parsable_to_integer_percent profiling Verifies that the percentage of strings in a column that can be parsed to an integer does not fall below the minimum accepted percentage.
daily_string_parsable_to_integer_percent recurring Verifies that the percentage of strings in a column that can be parsed to an integer does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_parsable_to_integer_percent recurring Verifies that the percentage of strings in a column that can be parsed to an integer does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_parsable_to_integer_percent partitioned Verifies that the percentage of strings in a column that can be parsed to an integer does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_parsable_to_integer_percent partitioned Verifies that the percentage of strings in a column that can be parsed to an integer does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_parsable_to_float_percent profiling Verifies that the percentage of strings in a column that can be parsed to a float does not fall below the minimum accepted percentage.
daily_string_parsable_to_float_percent recurring Verifies that the percentage of strings in a column that can be parsed to a float does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_parsable_to_float_percent recurring Verifies that the percentage of strings in a column that can be parsed to a float does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_parsable_to_float_percent partitioned Verifies that the percentage of strings in a column that can be parsed to a float does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_parsable_to_float_percent partitioned Verifies that the percentage of strings in a column that can be parsed to a float does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_expected_strings_in_use_count profiling Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing).
daily_expected_strings_in_use_count recurring Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_expected_strings_in_use_count recurring Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_expected_strings_in_use_count partitioned Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_expected_strings_in_use_count partitioned Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_value_in_set_percent profiling Measures the percentage of rows whose value in the tested column is one of the values from a list of expected values, or is null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage.
daily_string_value_in_set_percent recurring Measures the percentage of rows whose value in the tested column is one of the values from a list of expected values, or is null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_value_in_set_percent recurring Measures the percentage of rows whose value in the tested column is one of the values from a list of expected values, or is null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_value_in_set_percent partitioned Measures the percentage of rows whose value in the tested column is one of the values from a list of expected values, or is null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_value_in_set_percent partitioned Measures the percentage of rows whose value in the tested column is one of the values from a list of expected values, or is null. Verifies that the percentage of rows with a valid column value does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
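As an illustration of the value-in-set measure, the following Python sketch treats a row as valid when its value is null or appears in the expected list, then compares the result with a minimum percentage. The function names and the handling of an empty column are assumptions made for this example, not DQO's implementation.

```python
from typing import Iterable, Optional, Set


def value_in_set_percent(values: Iterable[Optional[str]], expected: Set[str]) -> float:
    """Percentage of rows whose value is null or is one of the expected values."""
    rows = list(values)
    if not rows:
        # Assumption: an empty column has nothing invalid, so report 100%.
        return 100.0
    valid = sum(1 for v in rows if v is None or v in expected)
    return 100.0 * valid / len(rows)


def passes_min_percent(percent: float, min_percent: float) -> bool:
    """The check passes when the measured percentage is not below the minimum."""
    return percent >= min_percent


percent = value_in_set_percent(["USD", "EUR", None, "XXX"], {"USD", "EUR"})
print(percent, passes_min_percent(percent, 80.0))
```

Because nulls count as valid here, a column made up only of nulls and expected values scores 100%.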
Check name Check type Description
profile_string_valid_dates_percent profiling Verifies that the percentage of valid dates in a column does not fall below the minimum accepted percentage.
daily_string_valid_dates_percent recurring Verifies that the percentage of valid dates in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_valid_dates_percent recurring Verifies that the percentage of valid dates in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_valid_dates_percent partitioned Verifies that the percentage of valid dates in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_valid_dates_percent partitioned Verifies that the percentage of valid dates in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_valid_country_code_percent profiling Verifies that the percentage of valid country codes in a column does not fall below the minimum accepted percentage.
daily_string_valid_country_code_percent recurring Verifies that the percentage of valid country codes in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_valid_country_code_percent recurring Verifies that the percentage of valid country codes in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_valid_country_code_percent partitioned Verifies that the percentage of valid country codes in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_valid_country_code_percent partitioned Verifies that the percentage of valid country codes in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_valid_currency_code_percent profiling Verifies that the percentage of valid currency codes in a column does not fall below the minimum accepted percentage.
daily_string_valid_currency_code_percent recurring Verifies that the percentage of valid currency codes in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_valid_currency_code_percent recurring Verifies that the percentage of valid currency codes in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_valid_currency_code_percent partitioned Verifies that the percentage of valid currency codes in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_valid_currency_code_percent partitioned Verifies that the percentage of valid currency codes in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_invalid_email_count profiling Verifies that the number of invalid emails in a column does not exceed the maximum accepted count.
daily_string_invalid_email_count recurring Verifies that the number of invalid emails in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_invalid_email_count recurring Verifies that the number of invalid emails in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_invalid_email_count partitioned Verifies that the number of invalid emails in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_invalid_email_count partitioned Verifies that the number of invalid emails in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_invalid_uuid_count profiling Verifies that the number of invalid UUIDs in a column does not exceed the maximum accepted count.
daily_string_invalid_uuid_count recurring Verifies that the number of invalid UUIDs in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_invalid_uuid_count recurring Verifies that the number of invalid UUIDs in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_invalid_uuid_count partitioned Verifies that the number of invalid UUIDs in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_invalid_uuid_count partitioned Verifies that the number of invalid UUIDs in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_valid_uuid_percent profiling Verifies that the percentage of valid UUIDs in a column does not fall below the minimum accepted percentage.
daily_string_valid_uuid_percent recurring Verifies that the percentage of valid UUIDs in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_valid_uuid_percent recurring Verifies that the percentage of valid UUIDs in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_valid_uuid_percent partitioned Verifies that the percentage of valid UUIDs in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_valid_uuid_percent partitioned Verifies that the percentage of valid UUIDs in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_invalid_ip4_address_count profiling Verifies that the number of invalid IPv4 addresses in a column does not exceed the maximum accepted count.
daily_string_invalid_ip4_address_count recurring Verifies that the number of invalid IPv4 addresses in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_invalid_ip4_address_count recurring Verifies that the number of invalid IPv4 addresses in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_invalid_ip4_address_count partitioned Verifies that the number of invalid IPv4 addresses in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_invalid_ip4_address_count partitioned Verifies that the number of invalid IPv4 addresses in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_invalid_ip6_address_count profiling Verifies that the number of invalid IPv6 addresses in a column does not exceed the maximum accepted count.
daily_string_invalid_ip6_address_count recurring Verifies that the number of invalid IPv6 addresses in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_invalid_ip6_address_count recurring Verifies that the number of invalid IPv6 addresses in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_invalid_ip6_address_count partitioned Verifies that the number of invalid IPv6 addresses in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_invalid_ip6_address_count partitioned Verifies that the number of invalid IPv6 addresses in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_not_match_regex_count profiling Verifies that the number of strings not matching the custom regex in a column does not exceed the maximum accepted count.
daily_string_not_match_regex_count recurring Verifies that the number of strings not matching the custom regex in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_not_match_regex_count recurring Verifies that the number of strings not matching the custom regex in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_not_match_regex_count partitioned Verifies that the number of strings not matching the custom regex in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_not_match_regex_count partitioned Verifies that the number of strings not matching the custom regex in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_match_regex_percent profiling Verifies that the percentage of strings matching the custom regex in a column does not fall below the minimum accepted percentage.
daily_string_match_regex_percent recurring Verifies that the percentage of strings matching the custom regex in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_match_regex_percent recurring Verifies that the percentage of strings matching the custom regex in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_match_regex_percent partitioned Verifies that the percentage of strings matching the custom regex in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_match_regex_percent partitioned Verifies that the percentage of strings matching the custom regex in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_not_match_date_regex_count profiling Verifies that the number of strings not matching the date format regex in a column does not exceed the maximum accepted count.
daily_string_not_match_date_regex_count recurring Verifies that the number of strings not matching the date format regex in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_not_match_date_regex_count recurring Verifies that the number of strings not matching the date format regex in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_not_match_date_regex_count partitioned Verifies that the number of strings not matching the date format regex in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_not_match_date_regex_count partitioned Verifies that the number of strings not matching the date format regex in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_match_date_regex_percent profiling Verifies that the percentage of strings matching the date format regex in a column does not fall below the minimum accepted percentage.
daily_string_match_date_regex_percent recurring Verifies that the percentage of strings matching the date format regex in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_match_date_regex_percent recurring Verifies that the percentage of strings matching the date format regex in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_match_date_regex_percent partitioned Verifies that the percentage of strings matching the date format regex in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_match_date_regex_percent partitioned Verifies that the percentage of strings matching the date format regex in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_string_match_name_regex_percent profiling Verifies that the percentage of strings matching the name regex in a column does not fall below the minimum accepted percentage.
daily_string_match_name_regex_percent recurring Verifies that the percentage of strings matching the name format regex in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_match_name_regex_percent recurring Verifies that the percentage of strings matching the name regex in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_match_name_regex_percent partitioned Verifies that the percentage of strings matching the name format regex in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_match_name_regex_percent partitioned Verifies that the percentage of strings matching the name format regex in a column does not fall below the minimum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_expected_strings_in_top_values_count profiling Verifies that the top X most popular column values contain all values from a list of expected values.
daily_expected_strings_in_top_values_count recurring Verifies that the top X most popular column values contain all values from a list of expected values. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_expected_strings_in_top_values_count recurring Verifies that the top X most popular column values contain all values from a list of expected values. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_expected_strings_in_top_values_count partitioned Verifies that the top X most popular column values contain all values from a list of expected values. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_expected_strings_in_top_values_count partitioned Verifies that the top X most popular column values contain all values from a list of expected values. Creates a separate data quality check (and an alert) for each monthly partition.
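The "top X most popular values" idea can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `expected_in_top_values_count` is hypothetical, nulls are ignored when ranking, and ties between equally frequent values are resolved by first occurrence, which is how `collections.Counter.most_common` orders them.

```python
from collections import Counter
from typing import Iterable, Optional, Sequence


def expected_in_top_values_count(
    values: Iterable[Optional[str]], expected: Sequence[str], top: int
) -> int:
    """Count how many expected values are among the `top` most frequent values."""
    counts = Counter(v for v in values if v is not None)  # assumption: nulls are ignored
    top_values = {value for value, _ in counts.most_common(top)}
    return sum(1 for e in set(expected) if e in top_values)


column = ["a"] * 5 + ["b"] * 3 + ["c"] * 2 + ["d"]
found = expected_in_top_values_count(column, ["a", "b"], top=2)
# Under this reading, the check passes when every expected value was found.
print(found == len({"a", "b"}))
```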
Check name Check type Description
profile_string_datatype_detected profiling Detects the data type of text values stored in the column. The sensor returns the code of the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Raises a data quality issue when the detected data type does not match the expected data type.
daily_string_datatype_detected recurring Detects the data type of text values stored in the column. The sensor returns the code of the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Raises a data quality issue when the detected data type does not match the expected data type. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_string_datatype_detected recurring Detects the data type of text values stored in the column. The sensor returns the code of the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Raises a data quality issue when the detected data type does not match the expected data type. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_string_datatype_detected partitioned Detects the data type of text values stored in the column. The sensor returns the code of the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Raises a data quality issue when the detected data type does not match the expected data type. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_string_datatype_detected partitioned Detects the data type of text values stored in the column. The sensor returns the code of the detected data type of a column: 1 - integers, 2 - floats, 3 - dates, 4 - timestamps, 5 - booleans, 6 - strings, 7 - mixed data types. Raises a data quality issue when the detected data type does not match the expected data type. Creates a separate data quality check (and an alert) for each monthly partition.
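The data type codes listed above can be pictured with a small Python sketch. The numeric codes come from the descriptions; everything else is an assumption made for illustration: the names `DetectedType` and `detect_column_type`, the accepted date and timestamp formats, the boolean literals, and the rule that any combination of different types (including integers with floats) is reported as mixed. DQO's actual sensor may classify values differently.

```python
from datetime import datetime
from enum import IntEnum
from typing import Iterable, Optional


class DetectedType(IntEnum):
    # Codes as listed in the check descriptions.
    INTEGERS = 1
    FLOATS = 2
    DATES = 3
    TIMESTAMPS = 4
    BOOLEANS = 5
    STRINGS = 6
    MIXED = 7


def _classify(text: str) -> DetectedType:
    """Classify one text value using simple, assumed heuristics."""
    s = text.strip()
    if s.lower() in {"true", "false"}:  # assumed boolean literals
        return DetectedType.BOOLEANS
    try:
        int(s)
        return DetectedType.INTEGERS
    except ValueError:
        pass
    try:
        float(s)
        return DetectedType.FLOATS
    except ValueError:
        pass
    # Assumed formats: ISO-like dates and timestamps only.
    for fmt, kind in (
        ("%Y-%m-%d", DetectedType.DATES),
        ("%Y-%m-%d %H:%M:%S", DetectedType.TIMESTAMPS),
    ):
        try:
            datetime.strptime(s, fmt)
            return kind
        except ValueError:
            pass
    return DetectedType.STRINGS


def detect_column_type(values: Iterable[Optional[str]]) -> Optional[DetectedType]:
    """Return the single detected type, MIXED if several occur, None if no values."""
    kinds = {_classify(v) for v in values if v is not None and v.strip() != ""}
    if not kinds:
        return None
    return kinds.pop() if len(kinds) == 1 else DetectedType.MIXED


detected = detect_column_type(["2023-01-05", "2023-02-10"])
expected = DetectedType.DATES
print(int(detected), detected == expected)
```

In this sketch a data quality issue would be raised whenever `detected` differs from the expected type.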

uniqueness

Counts the number or percent of duplicate or unique values in a column.

Check name Check type Description
profile_distinct_count profiling Verifies that the number of distinct values in a column does not fall below the minimum accepted count.
daily_distinct_count recurring Verifies that the number of distinct values in a column does not fall below the minimum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_distinct_count recurring Verifies that the number of distinct values in a column does not fall below the minimum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_distinct_count partitioned Verifies that the number of distinct values in a column does not fall below the minimum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_distinct_count partitioned Verifies that the number of distinct values in a column does not fall below the minimum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_distinct_percent profiling Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent.
daily_distinct_percent recurring Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_distinct_percent recurring Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_distinct_percent partitioned Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_distinct_percent partitioned Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Creates a separate data quality check (and an alert) for each monthly partition.
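
As a rough illustration of the distinct count and distinct percent checks, the sketch below computes both measures for an in-memory list and compares them with a minimum threshold. It assumes that null values are ignored and that the percentage is taken over non-null values; the actual DQO sensors run as SQL and may define these details differently. The function names are hypothetical.

```python
def distinct_count(values):
    """Number of distinct non-null values."""
    return len({v for v in values if v is not None})


def distinct_percent(values):
    """Distinct non-null values as a percentage of all non-null values."""
    non_null = [v for v in values if v is not None]
    if not non_null:
        return None  # nothing to measure
    return 100.0 * len(set(non_null)) / len(non_null)


def meets_minimum(actual, minimum):
    """Hypothetical rule: pass when the measure does not fall below the minimum."""
    return actual is not None and actual >= minimum


column = ["a", "b", "b", None, "c"]
count = distinct_count(column)      # 3 distinct values: a, b, c
percent = distinct_percent(column)  # 3 of 4 non-null values
```

With this sample column, `meets_minimum(count, 3)` passes, while `meets_minimum(percent, 90.0)` fails because only 75% of the non-null values are distinct.
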
Check name Check type Description
profile_duplicate_count profiling Verifies that the number of duplicate values in a column does not exceed the maximum accepted count.
daily_duplicate_count recurring Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_duplicate_count recurring Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_duplicate_count partitioned Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_duplicate_count partitioned Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Creates a separate data quality check (and an alert) for each monthly partition.
Check name Check type Description
profile_duplicate_percent profiling Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage.
daily_duplicate_percent recurring Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_duplicate_percent recurring Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_duplicate_percent partitioned Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each daily partition.
monthly_partition_duplicate_percent partitioned Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage. Creates a separate data quality check (and an alert) for each monthly partition.
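
The duplicate count and duplicate percent checks can be pictured with a similar minimal sketch. It assumes that a "duplicate" is every non-null occurrence of a value beyond its first, so the count equals the non-null values minus the distinct values, and that nulls are excluded from the denominator. These definitions and the function names are assumptions for illustration; the DQO sensors run as SQL and may differ.

```python
def duplicate_count(values):
    """Non-null values minus distinct non-null values."""
    non_null = [v for v in values if v is not None]
    return len(non_null) - len(set(non_null))


def duplicate_percent(values):
    """Duplicates as a percentage of all non-null values."""
    non_null = [v for v in values if v is not None]
    if not non_null:
        return None  # nothing to measure
    return 100.0 * (len(non_null) - len(set(non_null))) / len(non_null)


def within_maximum(actual, maximum):
    """Hypothetical rule: pass when the measure does not exceed the maximum."""
    return actual is not None and actual <= maximum


column = ["a", "b", "b", None, "c"]
dup_count = duplicate_count(column)      # one repeated "b"
dup_percent = duplicate_percent(column)  # 1 of 4 non-null values
```

Here `within_maximum(dup_count, 0)` fails because one duplicate exists, while `within_maximum(dup_percent, 30.0)` passes because the duplicate share is 25%.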