Skip to content

Last updated: July 22, 2025

List of column level accepted values data quality checks

This is a list of accepted_values column data quality checks supported by DQOps and a brief description of what data quality issued they detect.

column-level accepted_values checks

Verifies if all values in the column are from a set of known values, such as country codes.

text found in set percent

A column-level check that calculates the percentage of rows for which the tested text column contains a value from a set of expected values. Columns with null values are also counted as a passing value (the sensor assumes that a 'null' is also an expected and accepted value). The check raises a data quality issue when the percentage of rows with a not null column value that is not expected (not one of the values in the expected_values set) is below the expected threshold. For example, 99% of rows should have values from the defined domain. This data quality check is useful for checking text columns that have a small number of unique values, and all the values should come from a set of expected values. For example, testing country, state, currency, gender, type, and department columns whose expected values are known.

Data quality check name Friendly name Check type Description Standard
profile_text_found_in_set_percent Minimum percentage of rows having text values outside the list of expected values profiling The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage.
daily_text_found_in_set_percent Minimum percentage of rows having text values outside the list of expected values monitoring The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_text_found_in_set_percent Minimum percentage of rows having text values outside the list of expected values monitoring The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_text_found_in_set_percent Minimum percentage of rows having text values outside the list of expected values partitioned The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each daily partition.
monthly_partition_text_found_in_set_percent Minimum percentage of rows having text values outside the list of expected values partitioned The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each monthly partition.

number found in set percent

A column-level check that calculates the percentage of rows for which the tested numeric column contains a value from a set of expected values. Columns with null values are also counted as a passing value (the sensor assumes that a 'null' is also an expected and accepted value). The check raises a data quality issue when the percentage of rows with a not null column value that is not expected (not one of the values in the expected_values set) is below the expected threshold. For example, 99% of rows should have values from the defined domain. This data quality check is useful for checking numeric columns that store numeric codes (such as status codes) to see if the only values found in the column are from the set of expected values.

Data quality check name Friendly name Check type Description Standard
profile_number_found_in_set_percent Minimum percentage of rows having numeric values outside the list of expected values profiling The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage.
daily_number_found_in_set_percent Minimum percentage of rows having numeric values outside the list of expected values monitoring The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_number_found_in_set_percent Minimum percentage of rows having numeric values outside the list of expected values monitoring The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_number_found_in_set_percent Minimum percentage of rows having numeric values outside the list of expected values partitioned The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each daily partition.
monthly_partition_number_found_in_set_percent Minimum percentage of rows having numeric values outside the list of expected values partitioned The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each monthly partition.

expected text values in use count

A column-level check that counts unique values in a text column and counts how many values out of a list of expected string values were found in the column. The check raises a data quality issue when the threshold for the maximum number of missing has been exceeded (too many expected values were not found in the column). This check is useful for analysing columns with a low number of unique values, such as status codes, to detect whether all status codes are used in any row.

Data quality check name Friendly name Check type Description Standard
profile_expected_text_values_in_use_count Maximum number of expected text values that are not in use profiling Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing).
daily_expected_text_values_in_use_count Maximum number of expected text values that are not in use monitoring Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_expected_text_values_in_use_count Maximum number of expected text values that are not in use monitoring Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_expected_text_values_in_use_count Maximum number of expected text values that are not in use partitioned Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each daily partition.
monthly_partition_expected_text_values_in_use_count Maximum number of expected text values that are not in use partitioned Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each monthly partition.

expected texts in top values count

A column-level check that counts how many expected text values are among the TOP most popular values in the column. The check will first count the number of occurrences of each column's value and will pick the TOP X most popular values (configurable by the 'top' parameter). Then, it will compare the list of most popular values to the given list of expected values that should be most popular. This check will verify how many supposed most popular values (provided in the 'expected_values' list) were not found in the top X most popular values in the column. This check is helpful in analyzing string columns with frequently occurring values, such as country codes for countries with the most customers.

Data quality check name Friendly name Check type Description Standard
profile_expected_texts_in_top_values_count Verify that the most popular text values match the list of expected values profiling Verifies that the top X most popular column values contain all values from a list of expected values.
daily_expected_texts_in_top_values_count Verify that the most popular text values match the list of expected values monitoring Verifies that the top X most popular column values contain all values from a list of expected values. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_expected_texts_in_top_values_count Verify that the most popular text values match the list of expected values monitoring Verifies that the top X most popular column values contain all values from a list of expected values. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_expected_texts_in_top_values_count Verify that the most popular text values match the list of expected values partitioned Verifies that the top X most popular column values contain all values from a list of expected values. Stores a separate data quality check result for each daily partition.
monthly_partition_expected_texts_in_top_values_count Verify that the most popular text values match the list of expected values partitioned Verifies that the top X most popular column values contain all values from a list of expected values. Stores a separate data quality check result for each monthly partition.

expected numbers in use count

A column-level check that counts unique values in a numeric column and counts how many values out of a list of expected numeric values were found in the column. The check raises a data quality issue when the threshold for the maximum number of missing has been exceeded (too many expected values were not found in the column). This check is useful for analysing columns with a low number of unique values, such as status codes, to detect whether all status codes are used in any row.

Data quality check name Friendly name Check type Description Standard
profile_expected_numbers_in_use_count Maximum number of expected numeric values that are not in use profiling Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing).
daily_expected_numbers_in_use_count Maximum number of expected numeric values that are not in use monitoring Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_expected_numbers_in_use_count Maximum number of expected numeric values that are not in use monitoring Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_expected_numbers_in_use_count Maximum number of expected numeric values that are not in use partitioned Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each daily partition.
monthly_partition_expected_numbers_in_use_count Maximum number of expected numeric values that are not in use partitioned Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each monthly partition.

text valid country code percent

This check measures the percentage of text values that are valid two-letter country codes. It raises a data quality issue when the percentage of valid country codes (excluding null values) falls below a minimum accepted rate.

Data quality check name Friendly name Check type Description Standard
profile_text_valid_country_code_percent Minimum percentage of rows containing valid country codes profiling Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage
daily_text_valid_country_code_percent Minimum percentage of rows containing valid country codes monitoring Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_text_valid_country_code_percent Minimum percentage of rows containing valid country codes monitoring Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_text_valid_country_code_percent Minimum percentage of rows containing valid country codes partitioned Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition.
monthly_partition_text_valid_country_code_percent Minimum percentage of rows containing valid country codes partitioned Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition.

text valid currency code percent

This check measures the percentage of text values that are valid currency names. It raises a data quality issue when the percentage of valid currency names (excluding null values) falls below a minimum accepted rate.

Data quality check name Friendly name Check type Description Standard
profile_text_valid_currency_code_percent Minimum percentage of rows containing valid currency codes profiling Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage
daily_text_valid_currency_code_percent Minimum percentage of rows containing valid currency codes monitoring Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated.
monthly_text_valid_currency_code_percent Minimum percentage of rows containing valid currency codes monitoring Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated.
daily_partition_text_valid_currency_code_percent Minimum percentage of rows containing valid currency codes partitioned Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition.
monthly_partition_text_valid_currency_code_percent Minimum percentage of rows containing valid currency codes partitioned Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition.