Last updated: July 22, 2025
List of column level pii data quality checks
This is a list of pii column data quality checks supported by DQOps and a brief description of what data quality issued they detect.
column-level pii checks
Checks for the presence of sensitive or personally identifiable information (PII) in a column such as an email, phone, zip code, IP4, and IP6 addresses.
contains usa phone percent
This check detects USA phone numbers inside text columns. It measures the percentage of columns containing a phone number and raises a data quality issue when too many rows contain phone numbers.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_contains_usa_phone_percent |
Detect USA phone numbers inside text columns | profiling | Detects USA phone numbers in text columns. Verifies that the percentage of rows that contains USA phone number in a column does not exceed the maximum accepted percentage. | |
daily_contains_usa_phone_percent |
Detect USA phone numbers inside text columns | monitoring | Detects USA phone numbers in text columns. Verifies that the percentage of rows that contains a USA phone number in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_contains_usa_phone_percent |
Detect USA phone numbers inside text columns | monitoring | Detects USA phone numbers in text columns. Verifies that the percentage of rows that contains a USA phone number in a column does not exceed the maximum accepted percentage. Stores the most recent check result for each month when the data quality check was evaluated. | |
daily_partition_contains_usa_phone_percent |
Detect USA phone numbers inside text columns | partitioned | Detects USA phone numbers in text columns. Verifies that the percentage of rows that contains USA phone number in a column does not exceed the maximum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_contains_usa_phone_percent |
Detect USA phone numbers inside text columns | partitioned | Detects USA phone numbers in text columns. Verifies that the percentage of rows that contains USA phone number in a column does not exceed the maximum accepted percentage. Stores a separate data quality check result for each monthly partition. |
contains email percent
This check detects emails inside text columns. It measures the percentage of columns containing an email and raises a data quality issue when too many rows contain emails.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_contains_email_percent |
Detect emails inside text columns | profiling | Detects emails in text columns. Verifies that the percentage of rows that contains valid emails in a column does not exceed the minimum accepted percentage. | |
daily_contains_email_percent |
Detect emails inside text columns | monitoring | Detects emails in text columns. Verifies that the percentage of rows that contains emails in a column does not exceed the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_contains_email_percent |
Detect emails inside text columns | monitoring | Detects emails in text columns. Verifies that the percentage of rows that contains emails in a column does not exceed the minimum accepted percentage. Stores the most recent check result for each month when the data quality check was evaluated. | |
daily_partition_contains_email_percent |
Detect emails inside text columns | partitioned | Detects emails in text columns. Verifies that the percentage of rows that contains emails in a column does not exceed the minimum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_contains_email_percent |
Detect emails inside text columns | partitioned | Detects emails in text columns. Verifies that the percentage of rows that contains emails in a column does not exceed the minimum accepted percentage. Stores a separate data quality check result for each monthly partition. |
contains usa zipcode percent
This check detects USA zip code inside text columns. It measures the percentage of columns containing a zip code and raises a data quality issue when too many rows contain zip codes.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_contains_usa_zipcode_percent |
Detect USA zip codes inside text columns | profiling | Detects USA zip codes in text columns. Verifies that the percentage of rows that contains USA zip code in a column does not exceed the maximum accepted percentage. | |
daily_contains_usa_zipcode_percent |
Detect USA zip codes inside text columns | monitoring | Detects USA zip codes in text columns. Verifies that the percentage of rows that contains a USA zip code in a column does not exceed the maximum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_contains_usa_zipcode_percent |
Detect USA zip codes inside text columns | monitoring | Detects USA zip codes in text columns. Verifies that the percentage of rows that contains a USA zip code in a column does not exceed the maximum accepted percentage. Stores the most recent check result for each month when the data quality check was evaluated. | |
daily_partition_contains_usa_zipcode_percent |
Detect USA zip codes inside text columns | partitioned | Detects USA zip codes in text columns. Verifies that the percentage of rows that contains USA zip code in a column does not exceed the maximum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_contains_usa_zipcode_percent |
Detect USA zip codes inside text columns | partitioned | Detects USA zip codes in text columns. Verifies that the percentage of rows that contains USA zip code in a column does not exceed the maximum accepted percentage. Stores a separate data quality check result for each monthly partition. |
contains ip4 percent
This check detects IP4 addresses inside text columns. It measures the percentage of columns containing an IP4 address and raises a data quality issue when too many rows contain IP4 addresses.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_contains_ip4_percent |
Detect IP4 addresses inside text columns | profiling | Detects IP4 addresses in text columns. Verifies that the percentage of rows that contains valid IP4 address values in a column does not fall below the minimum accepted percentage. | |
daily_contains_ip4_percent |
Detect IP4 addresses inside text columns | monitoring | Detects IP4 addresses in text columns. Verifies that the percentage of rows that contains IP4 address values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_contains_ip4_percent |
Detect IP4 addresses inside text columns | monitoring | Detects IP4 addresses in text columns. Verifies that the percentage of rows that contains IP4 address values in a column does not fall below the minimum accepted percentage. Stores the most recent check result for each month when the data quality check was evaluated. | |
daily_partition_contains_ip4_percent |
Detect IP4 addresses inside text columns | partitioned | Detects IP4 addresses in text columns. Verifies that the percentage of rows that contains IP4 address values in a column does not fall below the minimum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_contains_ip4_percent |
Detect IP4 addresses inside text columns | partitioned | Detects IP4 addresses in text columns. Verifies that the percentage of rows that contains IP4 address values in a column does not fall below the minimum accepted percentage. Stores a separate data quality check result for each monthly partition. |
contains ip6 percent
This check detects IP6 addresses inside text columns. It measures the percentage of columns containing an IP6 address and raises a data quality issue when too many rows contain IP6 addresses.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_contains_ip6_percent |
Detect IP6 addresses inside text columns | profiling | Detects IP6 addresses in text columns. Verifies that the percentage of rows that contains valid IP6 address values in a column does not fall below the minimum accepted percentage. | |
daily_contains_ip6_percent |
Detect IP6 addresses inside text columns | monitoring | Detects IP6 addresses in text columns. Verifies that the percentage of rows that contains valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_contains_ip6_percent |
Detect IP6 addresses inside text columns | monitoring | Detects IP6 addresses in text columns. Verifies that the percentage of rows that contains valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores the most recent check result for each month when the data quality check was evaluated. | |
daily_partition_contains_ip6_percent |
Detect IP6 addresses inside text columns | partitioned | Detects IP6 addresses in text columns. Verifies that the percentage of rows that contains valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores a separate data quality check result for each daily partition. | |
monthly_partition_contains_ip6_percent |
Detect IP6 addresses inside text columns | partitioned | Detects IP6 addresses in text columns. Verifies that the percentage of rows that contains valid IP6 address values in a column does not fall below the minimum accepted percentage. Stores a separate data quality check result for each monthly partition. |