Last updated: July 22, 2025
List of column level accepted values data quality checks
This is a list of accepted_values column data quality checks supported by DQOps and a brief description of what data quality issued they detect.
column-level accepted_values checks
Verifies if all values in the column are from a set of known values, such as country codes.
text found in set percent
A column-level check that calculates the percentage of rows for which the tested text column contains a value from a set of expected values. Columns with null values are also counted as a passing value (the sensor assumes that a 'null' is also an expected and accepted value). The check raises a data quality issue when the percentage of rows with a not null column value that is not expected (not one of the values in the expected_values set) is below the expected threshold. For example, 99% of rows should have values from the defined domain. This data quality check is useful for checking text columns that have a small number of unique values, and all the values should come from a set of expected values. For example, testing country, state, currency, gender, type, and department columns whose expected values are known.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_found_in_set_percent |
Minimum percentage of rows having text values outside the list of expected values | profiling | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. | |
daily_text_found_in_set_percent |
Minimum percentage of rows having text values outside the list of expected values | monitoring | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_found_in_set_percent |
Minimum percentage of rows having text values outside the list of expected values | monitoring | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_found_in_set_percent |
Minimum percentage of rows having text values outside the list of expected values | partitioned | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_text_found_in_set_percent |
Minimum percentage of rows having text values outside the list of expected values | partitioned | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each monthly partition. |
number found in set percent
A column-level check that calculates the percentage of rows for which the tested numeric column contains a value from a set of expected values. Columns with null values are also counted as a passing value (the sensor assumes that a 'null' is also an expected and accepted value). The check raises a data quality issue when the percentage of rows with a not null column value that is not expected (not one of the values in the expected_values set) is below the expected threshold. For example, 99% of rows should have values from the defined domain. This data quality check is useful for checking numeric columns that store numeric codes (such as status codes) to see if the only values found in the column are from the set of expected values.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_number_found_in_set_percent |
Minimum percentage of rows having numeric values outside the list of expected values | profiling | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. | |
daily_number_found_in_set_percent |
Minimum percentage of rows having numeric values outside the list of expected values | monitoring | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_number_found_in_set_percent |
Minimum percentage of rows having numeric values outside the list of expected values | monitoring | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_number_found_in_set_percent |
Minimum percentage of rows having numeric values outside the list of expected values | partitioned | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_number_found_in_set_percent |
Minimum percentage of rows having numeric values outside the list of expected values | partitioned | The check measures the percentage of rows whose value in a tested column is one of values from a list of expected values or the column value is null. Verifies that the percentage of rows having a valid column value does not exceed the minimum accepted percentage. Stores a separate data quality check result for each monthly partition. |
expected text values in use count
A column-level check that counts unique values in a text column and counts how many values out of a list of expected string values were found in the column. The check raises a data quality issue when the threshold for the maximum number of missing has been exceeded (too many expected values were not found in the column). This check is useful for analysing columns with a low number of unique values, such as status codes, to detect whether all status codes are used in any row.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_expected_text_values_in_use_count |
Maximum number of expected text values that are not in use | profiling | Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). | |
daily_expected_text_values_in_use_count |
Maximum number of expected text values that are not in use | monitoring | Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_expected_text_values_in_use_count |
Maximum number of expected text values that are not in use | monitoring | Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_expected_text_values_in_use_count |
Maximum number of expected text values that are not in use | partitioned | Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each daily partition. | |
monthly_partition_expected_text_values_in_use_count |
Maximum number of expected text values that are not in use | partitioned | Verifies that the expected string values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each monthly partition. |
expected texts in top values count
A column-level check that counts how many expected text values are among the TOP most popular values in the column. The check will first count the number of occurrences of each column's value and will pick the TOP X most popular values (configurable by the 'top' parameter). Then, it will compare the list of most popular values to the given list of expected values that should be most popular. This check will verify how many supposed most popular values (provided in the 'expected_values' list) were not found in the top X most popular values in the column. This check is helpful in analyzing string columns with frequently occurring values, such as country codes for countries with the most customers.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_expected_texts_in_top_values_count |
Verify that the most popular text values match the list of expected values | profiling | Verifies that the top X most popular column values contain all values from a list of expected values. | |
daily_expected_texts_in_top_values_count |
Verify that the most popular text values match the list of expected values | monitoring | Verifies that the top X most popular column values contain all values from a list of expected values. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_expected_texts_in_top_values_count |
Verify that the most popular text values match the list of expected values | monitoring | Verifies that the top X most popular column values contain all values from a list of expected values. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_expected_texts_in_top_values_count |
Verify that the most popular text values match the list of expected values | partitioned | Verifies that the top X most popular column values contain all values from a list of expected values. Stores a separate data quality check result for each daily partition. | |
monthly_partition_expected_texts_in_top_values_count |
Verify that the most popular text values match the list of expected values | partitioned | Verifies that the top X most popular column values contain all values from a list of expected values. Stores a separate data quality check result for each monthly partition. |
expected numbers in use count
A column-level check that counts unique values in a numeric column and counts how many values out of a list of expected numeric values were found in the column. The check raises a data quality issue when the threshold for the maximum number of missing has been exceeded (too many expected values were not found in the column). This check is useful for analysing columns with a low number of unique values, such as status codes, to detect whether all status codes are used in any row.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_expected_numbers_in_use_count |
Maximum number of expected numeric values that are not in use | profiling | Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). | |
daily_expected_numbers_in_use_count |
Maximum number of expected numeric values that are not in use | monitoring | Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_expected_numbers_in_use_count |
Maximum number of expected numeric values that are not in use | monitoring | Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_expected_numbers_in_use_count |
Maximum number of expected numeric values that are not in use | partitioned | Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each daily partition. | |
monthly_partition_expected_numbers_in_use_count |
Maximum number of expected numeric values that are not in use | partitioned | Verifies that the expected numeric values were found in the column. Raises a data quality issue when too many expected values were not found (were missing). Stores a separate data quality check result for each monthly partition. |
text valid country code percent
This check measures the percentage of text values that are valid two-letter country codes. It raises a data quality issue when the percentage of valid country codes (excluding null values) falls below a minimum accepted rate.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_valid_country_code_percent |
Minimum percentage of rows containing valid country codes | profiling | Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage | |
daily_text_valid_country_code_percent |
Minimum percentage of rows containing valid country codes | monitoring | Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_valid_country_code_percent |
Minimum percentage of rows containing valid country codes | monitoring | Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_valid_country_code_percent |
Minimum percentage of rows containing valid country codes | partitioned | Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_valid_country_code_percent |
Minimum percentage of rows containing valid country codes | partitioned | Verifies that the percentage of valid country codes in a text column does not fall below the minimum accepted percentage. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text valid currency code percent
This check measures the percentage of text values that are valid currency names. It raises a data quality issue when the percentage of valid currency names (excluding null values) falls below a minimum accepted rate.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_valid_currency_code_percent |
Minimum percentage of rows containing valid currency codes | profiling | Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage | |
daily_text_valid_currency_code_percent |
Minimum percentage of rows containing valid currency codes | monitoring | Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_valid_currency_code_percent |
Minimum percentage of rows containing valid currency codes | monitoring | Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_valid_currency_code_percent |
Minimum percentage of rows containing valid currency codes | partitioned | Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_valid_currency_code_percent |
Minimum percentage of rows containing valid currency codes | partitioned | Verifies that the percentage of valid currency codes in a text column does not fall below the minimum accepted percentage. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |