Last updated: July 22, 2025
List of column level patterns data quality checks
This is a list of patterns column data quality checks supported by DQOps and a brief description of what data quality issued they detect.
column-level patterns checks
Validates if a text column matches predefined patterns (such as an email address) or a custom regular expression.
text not matching regex found
This check validates text values using a pattern defined as a regular expression. It counts the number of invalid values and raises a data quality issue when the number exceeds a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_not_matching_regex_found |
Maximum count of rows containing texts values not matching regex | profiling | Verifies that the number of text values not matching the custom regular expression pattern does not exceed the maximum accepted count. | |
daily_text_not_matching_regex_found |
Maximum count of rows containing texts values not matching regex | monitoring | Verifies that the number of text values not matching the custom regular expression pattern does not exceed the maximum accepted count. | |
monthly_text_not_matching_regex_found |
Maximum count of rows containing texts values not matching regex | monitoring | Verifies that the number of text values not matching the custom regular expression pattern does not exceed the maximum accepted count. | |
daily_partition_text_not_matching_regex_found |
Maximum count of rows containing texts values not matching regex | partitioned | Verifies that the number of text values not matching the custom regular expression pattern does not exceed the maximum accepted count. | |
monthly_partition_text_not_matching_regex_found |
Maximum count of rows containing texts values not matching regex | partitioned | Verifies that the number of text values not matching the custom regular expression pattern does not exceed the maximum accepted count. |
texts not matching regex percent
This check validates text values using a pattern defined as a regular expression. It measures the percentage of invalid values and raises a data quality issue when the rate is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_texts_not_matching_regex_percent |
Maximum percent of rows containing texts values not matching regex | profiling | Verifies that the percentage of strings not matching the custom regular expression pattern does not exceed the maximum accepted percentage. | |
daily_texts_not_matching_regex_percent |
Maximum percent of rows containing texts values not matching regex | monitoring | Verifies that the percentage of strings not matching the custom regular expression pattern does not exceed the maximum accepted percentage. | |
monthly_texts_not_matching_regex_percent |
Maximum percent of rows containing texts values not matching regex | monitoring | Verifies that the percentage of strings not matching the custom regular expression pattern does not exceed the maximum accepted percentage. | |
daily_partition_texts_not_matching_regex_percent |
Maximum percent of rows containing texts values not matching regex | partitioned | Verifies that the percentage of strings matching the custom regular expression pattern does not exceed the maximum accepted percentage. | |
monthly_partition_texts_not_matching_regex_percent |
Maximum percent of rows containing texts values not matching regex | partitioned | Verifies that the percentage of strings matching the custom regular expression pattern does not exceed the maximum accepted percentage. |
invalid email format found
This check detects invalid email addresses in text columns using a regular expression. It counts the number of invalid emails and raises a data quality issue when the number is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_email_format_found |
Maximum count of rows containing emails in invalid format | profiling | Verifies that the number of invalid emails in a text column does not exceed the maximum accepted count. | |
daily_invalid_email_format_found |
Maximum count of rows containing emails in invalid format | monitoring | Verifies that the number of invalid emails in a text column does not exceed the maximum accepted count. | |
monthly_invalid_email_format_found |
Maximum count of rows containing emails in invalid format | monitoring | Verifies that the number of invalid emails in a text column does not exceed the maximum accepted count. | |
daily_partition_invalid_email_format_found |
Maximum count of rows containing emails in invalid format | partitioned | Verifies that the number of invalid emails in a text column does not exceed the maximum accepted count. | |
monthly_partition_invalid_email_format_found |
Maximum count of rows containing emails in invalid format | partitioned | Verifies that the number of invalid emails in a text column does not exceed the maximum accepted count. |
invalid email format percent
This check detects invalid email addresses in text columns using a regular expression. It calculated the percentage of invalid emails and raises a data quality issue when the percentage is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_email_format_percent |
Minimum percent of rows containing emails in invalid format | profiling | Verifies that the percentage of invalid emails in a text column does not exceed the maximum accepted percentage. | |
daily_invalid_email_format_percent |
Minimum percent of rows containing emails in invalid format | monitoring | Verifies that the percentage of invalid emails in a text column does not exceed the maximum accepted percentage. | |
monthly_invalid_email_format_percent |
Minimum percent of rows containing emails in invalid format | monitoring | Verifies that the percentage of invalid emails in a text column does not exceed the maximum accepted percentage. | |
daily_partition_invalid_email_format_percent |
Minimum percent of rows containing emails in invalid format | partitioned | Verifies that the percentage of invalid emails in a text column does not exceed the maximum accepted percentage. | |
monthly_partition_invalid_email_format_percent |
Minimum percent of rows containing emails in invalid format | partitioned | Verifies that the percentage of invalid emails in a text column does not exceed the maximum accepted percentage. |
text not matching date pattern found
This check detects dates in the wrong format inside text columns using a regular expression. It counts the number of incorrectly formatted dates and raises a data quality issue when the number exceeds a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_not_matching_date_pattern_found |
Maximum count of rows containing texts not matching an expected date pattern | profiling | Verifies that the number of texts not matching the date format regular expression does not exceed the maximum accepted count. | |
daily_text_not_matching_date_pattern_found |
Maximum count of rows containing texts not matching an expected date pattern | monitoring | Verifies that the number of texts not matching the date format regular expression does not exceed the maximum accepted count. | |
monthly_text_not_matching_date_pattern_found |
Maximum count of rows containing texts not matching an expected date pattern | monitoring | Verifies that the number of texts not matching the date format regular expression does not exceed the maximum accepted count. | |
daily_partition_text_not_matching_date_pattern_found |
Maximum count of rows containing texts not matching an expected date pattern | partitioned | Verifies that the number of texts not matching the date format regular expression does not exceed the maximum accepted count. | |
monthly_partition_text_not_matching_date_pattern_found |
Maximum count of rows containing texts not matching an expected date pattern | partitioned | Verifies that the number of texts not matching the date format regular expression does not exceed the maximum accepted count. |
text not matching date pattern percent
This check validates the date format of dates stored in text columns. It measures the percentage of incorrectly formatted dates and raises a data quality issue when the rate is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_not_matching_date_pattern_percent |
Maximum percentage of rows containing texts not matching an expected date pattern | profiling | Verifies that the percentage of texts not matching the date format regular expression in a column does not exceed the maximum accepted percentage. | |
daily_text_not_matching_date_pattern_percent |
Maximum percentage of rows containing texts not matching an expected date pattern | monitoring | Verifies that the percentage of texts not matching the date format regular expression in a column does not exceed the maximum accepted percentage. | |
monthly_text_not_matching_date_pattern_percent |
Maximum percentage of rows containing texts not matching an expected date pattern | monitoring | Verifies that the percentage of texts not matching the date format regular expression in a column does not exceed the maximum accepted percentage. | |
daily_partition_text_not_matching_date_pattern_percent |
Maximum percentage of rows containing texts not matching an expected date pattern | partitioned | Verifies that the percentage of texts matching the date format regular expression in a column does not exceed the maximum accepted percentage. | |
monthly_partition_text_not_matching_date_pattern_percent |
Maximum percentage of rows containing texts not matching an expected date pattern | partitioned | Verifies that the percentage of texts matching the date format regular expression in a column does not exceed the maximum accepted percentage. |
text not matching name pattern percent
This check verifies if values stored in a text column contain only letters and are usable as literal identifiers. It measures the percentage of invalid literal identifiers and raises a data quality issue when the rate is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_not_matching_name_pattern_percent |
Maximum percentage of rows not containing texts that are names (e.g. numeric) | profiling | Verifies that the percentage of texts not matching the name regular expression does not exceed the maximum accepted percentage. | |
daily_text_not_matching_name_pattern_percent |
Maximum percentage of rows not containing texts that are names (e.g. numeric) | monitoring | Verifies that the percentage of texts not matching the name regular expression does not exceed the maximum accepted percentage. | |
monthly_text_not_matching_name_pattern_percent |
Maximum percentage of rows not containing texts that are names (e.g. numeric) | monitoring | Verifies that the percentage of texts not matching the name regular expression does not exceed the maximum accepted percentage. | |
daily_partition_text_not_matching_name_pattern_percent |
Maximum percentage of rows not containing texts that are names (e.g. numeric) | partitioned | Verifies that the percentage of texts matching the name regular expression does not exceed the maximum accepted percentage. | |
monthly_partition_text_not_matching_name_pattern_percent |
Maximum percentage of rows not containing texts that are names (e.g. numeric) | partitioned | Verifies that the percentage of texts matching the name regular expression does not exceed the maximum accepted percentage. |
invalid uuid format found
This check detects invalid UUID identifiers in text columns using a regular expression. It counts the number of invalid UUIDs and raises a data quality issue when the number is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_uuid_format_found |
Maximum count of rows containing invalid UUID values | profiling | Verifies that the number of invalid UUIDs in a text column does not exceed the maximum accepted count. | |
daily_invalid_uuid_format_found |
Maximum count of rows containing invalid UUID values | monitoring | Verifies that the number of invalid UUIDs in a text column does not exceed the maximum accepted count. | |
monthly_invalid_uuid_format_found |
Maximum count of rows containing invalid UUID values | monitoring | Verifies that the number of invalid UUIDs in a text column does not exceed the maximum accepted count. | |
daily_partition_invalid_uuid_format_found |
Maximum count of rows containing invalid UUID values | partitioned | Verifies that the number of invalid UUIDs in a text column does not exceed the maximum accepted count. | |
monthly_partition_invalid_uuid_format_found |
Maximum count of rows containing invalid UUID values | partitioned | Verifies that the number of invalid UUIDs in a text column does not exceed the maximum accepted count. |
invalid uuid format percent
This check validates the format of UUID values in text columns. It measures the percentage of invalid UUIDs and raises a data quality issue when the rate is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_uuid_format_percent |
Maximum percentage of rows containing invalid UUID values | profiling | Verifies that the percentage of invalid UUID in a text column does not exceed the maximum accepted percentage. | |
daily_invalid_uuid_format_percent |
Maximum percentage of rows containing invalid UUID values | monitoring | Verifies that the percentage of invalid UUID in a text column does not exceed the maximum accepted percentage. | |
monthly_invalid_uuid_format_percent |
Maximum percentage of rows containing invalid UUID values | monitoring | Verifies that the percentage of invalid UUID in a text column does not exceed the maximum accepted percentage. | |
daily_partition_invalid_uuid_format_percent |
Maximum percentage of rows containing invalid UUID values | partitioned | Verifies that the percentage of invalid UUID in a text column does not exceed the maximum accepted percentage. | |
monthly_partition_invalid_uuid_format_percent |
Maximum percentage of rows containing invalid UUID values | partitioned | Verifies that the percentage of invalid UUID in a text column does not exceed the maximum accepted percentage. |
invalid ip4 address format found
This check detects invalid IP4 internet addresses in text columns using a regular expression. It counts the number of invalid addresses and raises a data quality issue when the number is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_ip4_address_format_found |
Maximum count of rows containing invalid IP4 address values | profiling | Verifies that the number of invalid IP4 addresses in a text column does not exceed the maximum accepted count. | |
daily_invalid_ip4_address_format_found |
Maximum count of rows containing invalid IP4 address values | monitoring | Verifies that the number of invalid IP4 addresses in a text column does not exceed the maximum accepted count. | |
monthly_invalid_ip4_address_format_found |
Maximum count of rows containing invalid IP4 address values | monitoring | Verifies that the number of invalid IP4 addresses in a text column does not exceed the maximum accepted count. | |
daily_partition_invalid_ip4_address_format_found |
Maximum count of rows containing invalid IP4 address values | partitioned | Verifies that the number of invalid IP4 addresses in a text column does not exceed the maximum accepted count. | |
monthly_partition_invalid_ip4_address_format_found |
Maximum count of rows containing invalid IP4 address values | partitioned | Verifies that the number of invalid IP4 addresses in a text column does not exceed the maximum accepted count. |
invalid ip6 address format found
This check detects invalid IP6 internet addresses in text columns using a regular expression. It counts the number of invalid addresses and raises a data quality issue when the number is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_ip6_address_format_found |
Maximum count of rows containing invalid IP6 address values | profiling | Verifies that the number of invalid IP6 addresses in a text column does not exceed the maximum accepted count. | |
daily_invalid_ip6_address_format_found |
Maximum count of rows containing invalid IP6 address values | monitoring | Verifies that the number of invalid IP6 addresses in a text column does not exceed the maximum accepted count. | |
monthly_invalid_ip6_address_format_found |
Maximum count of rows containing invalid IP6 address values | monitoring | Verifies that the number of invalid IP6 addresses in a text column does not exceed the maximum accepted count. | |
daily_partition_invalid_ip6_address_format_found |
Maximum count of rows containing invalid IP6 address values | partitioned | Verifies that the number of invalid IP6 addresses in a text column does not exceed the maximum accepted count. | |
monthly_partition_invalid_ip6_address_format_found |
Maximum count of rows containing invalid IP6 address values | partitioned | Verifies that the number of invalid IP6 addresses in a text column does not exceed the maximum accepted count. |
invalid usa phone format found
This check validates the format of USA phone numbers inside text columns. It counts the number of invalid phone number and raises a data quality issue when too many rows contain phone numbers.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_usa_phone_format_found |
Maximum count of rows containing invalid USA phone number values | profiling | Verifies that the number of invalid USA phone numbers in a text column does not exceed the maximum accepted count. | |
daily_invalid_usa_phone_format_found |
Maximum count of rows containing invalid USA phone number values | monitoring | Verifies that the number of invalid USA phone numbers in a text column does not exceed the maximum accepted count. | |
monthly_invalid_usa_phone_format_found |
Maximum count of rows containing invalid USA phone number values | monitoring | Verifies that the number of invalid USA phone numbers in a text column does not exceed the maximum accepted count. | |
daily_partition_invalid_usa_phone_format_found |
Maximum count of rows containing invalid USA phone number values | partitioned | Verifies that the number of invalid USA phone numbers in a text column does not exceed the maximum accepted count. | |
monthly_partition_invalid_usa_phone_format_found |
Maximum count of rows containing invalid USA phone number values | partitioned | Verifies that the number of invalid USA phone numbers in a text column does not exceed the maximum accepted count. |
invalid usa zipcode format found
This check validates the format of a USA zip code inside text columns. It counts the number of invalid zip code and raises a data quality issue when the rate is below a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_usa_zipcode_format_found |
Maximum count of rows containing invalid USA zip code values | profiling | Verifies that the number of invalid zip codes in a text column does not exceed the maximum accepted count. | |
daily_invalid_usa_zipcode_format_found |
Maximum count of rows containing invalid USA zip code values | monitoring | Verifies that the number of invalid zip codes in a text column does not exceed the maximum accepted count. | |
monthly_invalid_usa_zipcode_format_found |
Maximum count of rows containing invalid USA zip code values | monitoring | Verifies that the number of invalid zip codes in a text column does not exceed the maximum accepted count. | |
daily_partition_invalid_usa_zipcode_format_found |
Maximum count of rows containing invalid USA zip code values | partitioned | Verifies that the number of invalid zip codes in a text column does not exceed the maximum accepted count. | |
monthly_partition_invalid_usa_zipcode_format_found |
Maximum count of rows containing invalid USA zip code values | partitioned | Verifies that the number of invalid zip codes in a text column does not exceed the maximum accepted count. |
invalid usa phone format percent
This check validates the format of USA phone numbers inside text columns. It measures the percentage of columns containing invalid phone numbers and raises a data quality issue when the rate is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_usa_phone_format_percent |
Maximum percentage of rows containing invalid USA phone number values | profiling | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. | |
daily_invalid_usa_phone_format_percent |
Maximum percentage of rows containing invalid USA phone number values | monitoring | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. | |
monthly_invalid_usa_phone_format_percent |
Maximum percentage of rows containing invalid USA phone number values | monitoring | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. | |
daily_partition_invalid_usa_phone_format_percent |
Maximum percentage of rows containing invalid USA phone number values | partitioned | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. | |
monthly_partition_invalid_usa_phone_format_percent |
Maximum percentage of rows containing invalid USA phone number values | partitioned | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. |
invalid usa zipcode format percent
This check validates the format of a USA zip code inside text columns. It measures the percentage of columns containing invalid zip codes and raises a data quality issue when the rate is above a threshold.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_invalid_usa_zipcode_format_percent |
Maximum percentage of rows containing invalid USA zip code values | profiling | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. | |
daily_invalid_usa_zipcode_format_percent |
Maximum percentage of rows containing invalid USA zip code values | monitoring | Verifies that the percentage of invalid USA zip code in a text column does not exceed the maximum accepted percentage. | |
monthly_invalid_usa_zipcode_format_percent |
Maximum percentage of rows containing invalid USA zip code values | monitoring | Verifies that the percentage of invalid USA zip code in a text column does not exceed the maximum accepted percentage. | |
daily_partition_invalid_usa_zipcode_format_percent |
Maximum percentage of rows containing invalid USA zip code values | partitioned | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. | |
monthly_partition_invalid_usa_zipcode_format_percent |
Maximum percentage of rows containing invalid USA zip code values | partitioned | Verifies that the percentage of invalid USA phones number in a text column does not exceed the maximum accepted percentage. |