Last updated: July 22, 2025
List of column level text data quality checks
This is a list of text column data quality checks supported by DQOps and a brief description of what data quality issued they detect.
column-level text checks
Validates that the data in a text column has a valid range.
text min length
This check finds the length of the shortest text in a column. DQOps validates the shortest length using a range rule. DQOps raises an issue when the minimum text length is outside a range of accepted values.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_min_length |
Verify that the minimum length of the text column is in the range | profiling | This check finds the length of the shortest text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the shortest text is too short. | |
daily_text_min_length |
Verify that the minimum length of the text column is in the range | monitoring | This check finds the length of the shortest text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the shortest text is too short. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_min_length |
Verify that the minimum length of the text column is in the range | monitoring | This check finds the length of the shortest text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the shortest text is too short. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_min_length |
Verify that the minimum length of the text column is in the range | partitioned | This check finds the length of the shortest text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the shortest text is too short. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_min_length |
Verify that the minimum length of the text column is in the range | partitioned | This check finds the length of the shortest text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the shortest text is too short. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text max length
This check finds the length of the longest text in a column. DQOps validates the maximum length using a range rule. DQOps raises an issue when the maximum text length is outside a range of accepted values.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_max_length |
Verify that the maximum length of the text is in the range | profiling | This check finds the length of the longest text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the texts are too long or not long enough. | |
daily_text_max_length |
Verify that the maximum length of the text is in the range | monitoring | This check finds the length of the longest text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the texts are too long or not long enough. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_max_length |
Verify that the maximum length of the text is in the range | monitoring | This check finds the length of the longest text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the texts are too long or not long enough. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_max_length |
Verify that the maximum length of the text is in the range | partitioned | This check finds the length of the longest text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the texts are too long or not long enough. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_max_length |
Verify that the maximum length of the text is in the range | partitioned | This check finds the length of the longest text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the texts are too long or not long enough. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text mean length
This check calculates the average text length in a column. DQOps validates the mean length using a range rule. DQOps raises an issue when the mean text length is outside a range of accepted values.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_mean_length |
Verify that the mean length of the text is in the range | profiling | Verifies that the mean (average) length of texts in a column is within an accepted range. | |
daily_text_mean_length |
Verify that the mean length of the text is in the range | monitoring | Verifies that the mean (average) length of texts in a column is within an accepted range. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_mean_length |
Verify that the mean length of the text is in the range | monitoring | Verifies that the mean (average) length of texts in a column is within an accepted range. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_mean_length |
Verify that the mean length of the text is in the range | partitioned | Verifies that the mean (average) length of texts in a column is within an accepted range. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_mean_length |
Verify that the mean length of the text is in the range | partitioned | Verifies that the mean (average) length of texts in a column is within an accepted range. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text length below min length
This check finds texts that are shorter than the minimum accepted text length. It counts the number of texts that are too short and raises a data quality issue when too many invalid texts are found.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_length_below_min_length |
Find text values shorter than the minimum accepted length | profiling | The check counts the number of text values in the column that is below the length defined by the user as a parameter. | |
daily_text_length_below_min_length |
Find text values shorter than the minimum accepted length | monitoring | The check counts the number of text values in the column that is below the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_length_below_min_length |
Find text values shorter than the minimum accepted length | monitoring | The check counts the number of text values in the column that is below the length defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_length_below_min_length |
Find text values shorter than the minimum accepted length | partitioned | The check counts the number of text values in the column that is below the length defined by the user as a parameter. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_length_below_min_length |
Find text values shorter than the minimum accepted length | partitioned | The check counts the number of text values in the column that is below the length defined by the user as a parameter. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text length below min length percent
This check finds texts that are shorter than the minimum accepted text length. It measures the percentage of too short texts and raises a data quality issue when too many invalid texts are found.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_length_below_min_length_percent |
Measure the percentage of rows containing text values shorter than the minimum accepted length | profiling | The check measures the percentage of text values in the column that is below the length defined by the user as a parameter. | |
daily_text_length_below_min_length_percent |
Measure the percentage of rows containing text values shorter than the minimum accepted length | monitoring | The check measures the percentage of text values in the column that is below the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_length_below_min_length_percent |
Measure the percentage of rows containing text values shorter than the minimum accepted length | monitoring | The check measures the percentage of text values in the column that is below the length defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_length_below_min_length_percent |
Measure the percentage of rows containing text values shorter than the minimum accepted length | partitioned | The check measures the percentage of text values in the column that is below the length defined by the user as a parameter. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_length_below_min_length_percent |
Measure the percentage of rows containing text values shorter than the minimum accepted length | partitioned | The check measures the percentage of text values in the column that is below the length defined by the user as a parameter. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text length above max length
This check finds texts that are longer than the maximum accepted text length. It counts the number of texts that are too long and raises a data quality issue when too many invalid texts are found.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_length_above_max_length |
Find text values longer than the maximum accepted length | profiling | The check counts the number of text values in the column that is above the length defined by the user as a parameter. | |
daily_text_length_above_max_length |
Find text values longer than the maximum accepted length | monitoring | The check counts the number of text values in the column that is above the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_length_above_max_length |
Find text values longer than the maximum accepted length | monitoring | The check counts the number of text values in the column that is above the length defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_length_above_max_length |
Find text values longer than the maximum accepted length | partitioned | The check counts the number of text values in the column that is above the length defined by the user as a parameter. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_length_above_max_length |
Find text values longer than the maximum accepted length | partitioned | The check counts the number of text values in the column that is above the length defined by the user as a parameter. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text length above max length percent
This check finds texts that are longer than the maximum accepted text length. It measures the percentage of texts that are too long and raises a data quality issue when too many invalid texts are found.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_length_above_max_length_percent |
Measure the percentage of rows containing text values longer than the maximum accepted length | profiling | The check measures the percentage of text values in the column that is above the length defined by the user as a parameter. | |
daily_text_length_above_max_length_percent |
Measure the percentage of rows containing text values longer than the maximum accepted length | monitoring | The check measures the percentage of text values in the column that is above the length defined by the user as a parameter. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_length_above_max_length_percent |
Measure the percentage of rows containing text values longer than the maximum accepted length | monitoring | The check measures the percentage of text values in the column that is above the length defined by the user as a parameter. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_length_above_max_length_percent |
Measure the percentage of rows containing text values longer than the maximum accepted length | partitioned | The check measures the percentage of text values in the column that is above the length defined by the user as a parameter. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_length_above_max_length_percent |
Measure the percentage of rows containing text values longer than the maximum accepted length | partitioned | The check measures the percentage of text values in the column that is above the length defined by the user as a parameter. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
text length in range percent
This check verifies that the minimum and maximum lengths of text values are in the range of accepted values. It measures the percentage of texts with a valid length and raises a data quality issue when an insufficient number of texts have a valid length.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_text_length_in_range_percent |
Measure the percentage of rows containing text values in the expected range | profiling | The check measures the percentage of those text values with length in the range provided by the user in the column. | |
daily_text_length_in_range_percent |
Measure the percentage of rows containing text values in the expected range | monitoring | The check measures the percentage of those text values with length in the range provided by the user in the column. Stores the most recent captured value for each day when the data quality check was evaluated. | |
monthly_text_length_in_range_percent |
Measure the percentage of rows containing text values in the expected range | monitoring | The check measures the percentage of those text values with length in the range provided by the user in the column. Stores the most recent captured value for each month when the data quality check was evaluated. | |
daily_partition_text_length_in_range_percent |
Measure the percentage of rows containing text values in the expected range | partitioned | The check measures the percentage of those text values with length in the range provided by the user in the column. Analyzes every daily partition and creates a separate data quality check result with the time period value that identifies the daily partition. | |
monthly_partition_text_length_in_range_percent |
Measure the percentage of rows containing text values in the expected range | partitioned | The check measures the percentage of those text values with length in the range provided by the user in the column. Analyzes every monthly partition and creates a separate data quality check result with the time period value that identifies the monthly partition. |
min word count
This check finds the lowest count of words in text in a column. DQOps validates the shortest length using a range rule. DQOps raises an issue when the minimum word count is outside a range of accepted values.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_min_word_count |
Verify that the minimum word count of the text column is in the range | profiling | This check finds the lowest word count of text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the text contains too less words. | |
daily_min_word_count |
Verify that the minimum word count of the text column is in the range | monitoring | This check finds the lowest word count of text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the text contains too less words. | |
monthly_min_word_count |
Verify that the minimum word count of the text column is in the range | monitoring | This check finds the lowest word count of text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the text contains too less words. | |
daily_partition_min_word_count |
Verify that the minimum word count of the text column is in the range | partitioned | This check finds the lowest word count of text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the text contains too less words. | |
monthly_partition_min_word_count |
Verify that the minimum word count of the text column is in the range | partitioned | This check finds the lowest word count of text in a column. Then, it verifies that the minimum length is within an accepted range. It detects that the text contains too less words. |
max word count
This check finds the highest count of words in text in a column. DQOps validates the maximum length using a range rule. DQOps raises an issue when the maximum word count is outside a range of accepted values.
Data quality check name | Friendly name | Check type | Description | Standard |
---|---|---|---|---|
profile_max_word_count |
Verify that the maximum word count of the text is in the range | profiling | This check finds the highest word count of text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the text contains too many words. | |
daily_max_word_count |
Verify that the maximum word count of the text is in the range | monitoring | This check finds the highest word count of text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the text contains too many words. | |
monthly_max_word_count |
Verify that the maximum word count of the text is in the range | monitoring | This check finds the highest word count of text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the text contains too many words. | |
daily_partition_max_word_count |
Verify that the maximum word count of the text is in the range | partitioned | This check finds the highest word count of text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the text contains too many words. | |
monthly_partition_max_word_count |
Verify that the maximum word count of the text is in the range | partitioned | This check finds the highest word count of text in a column. Then, it verifies that the maximum length is within an accepted range. It detects that the text contains too many words. |