Skip to content

Last updated: July 22, 2025

List of column level uniqueness data quality checks

This is a list of uniqueness column data quality checks supported by DQOps and a brief description of what data quality issued they detect.

column-level uniqueness checks

Counts the number or percent of duplicate or unique values in a column.

distinct count

This check counts distinct values and verifies if the distinct count is within an accepted range. It raises a data quality issue when the distinct count is below or above the accepted range.

Data quality check name Friendly name Check type Description Standard
profile_distinct_count Verify that the count of distinct values is in the expected range profiling Verifies that the number of distinct values stays within an accepted range.
daily_distinct_count Verify that the count of distinct values is in the expected range monitoring Verifies that the number of distinct values stays within an accepted range. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_distinct_count Verify that the count of distinct values is in the expected range monitoring Verifies that the number of distinct values stays within an accepted range. Stores the most recent check result for each month when the data quality check was evaluated.
daily_partition_distinct_count Verify that the count of distinct values is in the expected range partitioned Verifies that the number of distinct values stays within an accepted range. Stores a separate data quality check result for each daily partition.
monthly_partition_distinct_count Verify that the count of distinct values is in the expected range partitioned Verifies that the number of distinct values stays within an accepted range. Stores a separate data quality check result for each monthly partition.

distinct percent

This check measures the percentage of distinct values in all non-null values. It verifies that the percentage of distinct values meets a minimum and maximum values. The default value of 100% distinct values ensures the column has no duplicate values.

Data quality check name Friendly name Check type Description Standard
profile_distinct_percent The minimum ratio of distinct values to the count of non null values (detect duplicate values) profiling Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent.
daily_distinct_percent The minimum ratio of distinct values to the count of non null values (detect duplicate values) monitoring Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_distinct_percent The minimum ratio of distinct values to the count of non null values (detect duplicate values) monitoring Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Stores the most recent check result for each month when the data quality check was evaluated.
daily_partition_distinct_percent The minimum ratio of distinct values to the count of non null values (detect duplicate values) partitioned Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Stores a separate data quality check result for each daily partition.
monthly_partition_distinct_percent The minimum ratio of distinct values to the count of non null values (detect duplicate values) partitioned Verifies that the percentage of distinct values in a column does not fall below the minimum accepted percent. Stores a separate data quality check result for each monthly partition.

duplicate count

This check counts duplicate values. It raises a data quality issue when the number of duplicates is above a minimum accepted value. The default configuration detects duplicate values by enforcing that the min_count of duplicates is zero.

Data quality check name Friendly name Check type Description Standard
profile_duplicate_count Maximum count of duplicate values profiling Verifies that the number of duplicate values in a column does not exceed the maximum accepted count.
daily_duplicate_count Maximum count of duplicate values monitoring Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_duplicate_count Maximum count of duplicate values monitoring Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Stores the most recent check result for each month when the data quality check was evaluated.
daily_partition_duplicate_count Maximum count of duplicate values partitioned Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Stores a separate data quality check result for each daily partition.
monthly_partition_duplicate_count Maximum count of duplicate values partitioned Verifies that the number of duplicate values in a column does not exceed the maximum accepted count. Stores a separate data quality check result for each monthly partition.

duplicate percent

This check measures the percentage of duplicate values in all non-null values. It raises a data quality issue when the percentage of duplicates is above an accepted threshold. The default threshold is 0% duplicate values.

Data quality check name Friendly name Check type Description Standard
profile_duplicate_percent Maximum percentage of duplicate values profiling Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage.
daily_duplicate_percent Maximum percentage of duplicate values monitoring Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_duplicate_percent Maximum percentage of duplicate values monitoring Verifies that the percentage of duplicate values in a column does not exceed the maximum accepted percentage. Stores the most recent check result for each month when the data quality check was evaluated.
daily_partition_duplicate_percent Maximum percentage of duplicate values partitioned Verifies that the percent of duplicate values in a column does not exceed the maximum accepted percent. Stores a separate data quality check result for each daily partition.
monthly_partition_duplicate_percent Maximum percentage of duplicate values partitioned Verifies that the percent of duplicate values in a column does not exceed the maximum accepted percent. Stores a separate data quality check result for each monthly partition.

distinct count anomaly

This check monitors the count of distinct values and detects anomalies in the changes of the distinct count. It monitors a 90-day time window. The check is configured by setting a desired percentage of anomalies to identify as data quality issues.

Data quality check name Friendly name Check type Description Standard
profile_distinct_count_anomaly Abnormal change in the number of distinct values. Measured as a percentile of anomalous measures. profiling Verifies that the distinct count in a monitored column is within a two-tailed percentile from measurements made during the last 90 days.
daily_distinct_count_anomaly Abnormal change in the number of distinct values. Measured as a percentile of anomalous measures. monitoring Verifies that the distinct count in a monitored column is within a two-tailed percentile from measurements made during the last 90 days.
daily_partition_distinct_count_anomaly Abnormal change in the number of distinct values. Measured as a percentile of anomalous measures. partitioned Verifies that the distinct count in a monitored column is within a two-tailed percentile from measurements made during the last 90 days.

distinct percent anomaly

This check monitors the percentage of distinct values and detects anomalies in the changes in this percentage. It monitors a 90-day time window. The check is configured by setting a desired percentage of anomalies to identify as data quality issues.

Data quality check name Friendly name Check type Description Standard
profile_distinct_percent_anomaly Abnormal change in the percentage of distinct values. Measured as a percentile of anomalous measures. profiling Verifies that the distinct percent in a monitored column is within a two-tailed percentile from measurements made during the last 90 days.
daily_distinct_percent_anomaly Abnormal change in the percentage of distinct values. Measured as a percentile of anomalous measures. monitoring Verifies that the distinct percent in a monitored column is within a two-tailed percentile from measurements made during the last 90 days.
daily_partition_distinct_percent_anomaly Abnormal change in the percentage of distinct values. Measured as a percentile of anomalous measures. partitioned Verifies that the distinct percent in a monitored column is within a two-tailed percentile from measurements made during the last 90 days.

distinct count change

This check monitors the count of distinct values and compares it to the last known value. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_count_change Maximum relative change in the count of distinct values since the last known value profiling Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout.
daily_distinct_count_change Maximum relative change in the count of distinct values since the last known value monitoring Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout.
monthly_distinct_count_change Maximum relative change in the count of distinct values since the last known value monitoring Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout.
daily_partition_distinct_count_change Maximum relative change in the count of distinct values since the last known value partitioned Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout.
monthly_partition_distinct_count_change Maximum relative change in the count of distinct values since the last known value partitioned Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout.

distinct count change 1 day

This check monitors the count of distinct values and compares it to the measure from the previous day. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_count_change_1_day Maximum relative change in the count of distinct values vs 1 day ago profiling Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from yesterday.
daily_distinct_count_change_1_day Maximum relative change in the count of distinct values vs 1 day ago monitoring Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from yesterday.
daily_partition_distinct_count_change_1_day Maximum relative change in the count of distinct values vs 1 day ago partitioned Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from yesterday.

distinct count change 7 days

This check monitors the count of distinct values and compares it to the measure seven days ago to overcome the weekly seasonability impact. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_count_change_7_days Maximum relative change in the count of distinct values vs 7 days ago profiling Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from last week.
daily_distinct_count_change_7_days Maximum relative change in the count of distinct values vs 7 days ago monitoring Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from last week.
daily_partition_distinct_count_change_7_days Maximum relative change in the count of distinct values vs 7 days ago partitioned Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from the last week.

distinct count change 30 days

This check monitors the count of distinct values and compares it to the measure thirty days ago to overcome the monthly seasonability impact. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_count_change_30_days Maximum relative change in the count of distinct values vs 30 days ago profiling Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from last month.
daily_distinct_count_change_30_days Maximum relative change in the count of distinct values vs 30 days ago monitoring Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from last month.
daily_partition_distinct_count_change_30_days Maximum relative change in the count of distinct values vs 30 days ago partitioned Verifies that the distinct count in a monitored column has changed by a fixed rate since the last readout from the last month.

distinct percent change

This check monitors the percentage of distinct values and compares it to the last known value. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_percent_change Maximum relative change in the percentage of distinct values since the last known value profiling Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout.
daily_distinct_percent_change Maximum relative change in the percentage of distinct values since the last known value monitoring Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout.
monthly_distinct_percent_change Maximum relative change in the percentage of distinct values since the last known value monitoring Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout.
daily_partition_distinct_percent_change Maximum relative change in the percentage of distinct values since the last known value partitioned Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout.
monthly_partition_distinct_percent_change Maximum relative change in the percentage of distinct values since the last known value partitioned Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout.

distinct percent change 1 day

This check monitors the percentage of distinct values and compares it to the measure from the previous day. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_percent_change_1_day Maximum relative change in the percentage of distinct values vs 1 day ago profiling Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from yesterday.
daily_distinct_percent_change_1_day Maximum relative change in the percentage of distinct values vs 1 day ago monitoring Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from yesterday.
daily_partition_distinct_percent_change_1_day Maximum relative change in the percentage of distinct values vs 1 day ago partitioned Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from yesterday.

distinct percent change 7 days

This check monitors the percentage of distinct values and compares it to the measure seven days ago to overcome the weekly seasonability impact. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_percent_change_7_days Maximum relative change in the percentage of distinct values vs 7 days sago profiling Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from last week.
daily_distinct_percent_change_7_days Maximum relative change in the percentage of distinct values vs 7 days sago monitoring Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from last week.
daily_partition_distinct_percent_change_7_days Maximum relative change in the percentage of distinct values vs 7 days sago partitioned Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from the last week.

distinct percent change 30 days

This check monitors the percentage of distinct values and compares it to the measure thirty days ago to overcome the monthly seasonability impact. It raises a data quality issue when the change exceeds an accepted threshold.

Data quality check name Friendly name Check type Description Standard
profile_distinct_percent_change_30_days Maximum relative change in the percentage of distinct values vs 30 days ago profiling Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from last month.
daily_distinct_percent_change_30_days Maximum relative change in the percentage of distinct values vs 30 days ago monitoring Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from last month.
daily_partition_distinct_percent_change_30_days Maximum relative change in the percentage of distinct values vs 30 days ago partitioned Verifies that the distinct percent in a monitored column has changed by a fixed rate since the last readout from the last month.