Skip to content

Last updated: July 05, 2025

How to Detect Data Quality Issues with Data Observability

Read this guide to learn how to automatically activate data quality checks to profile data and monitor the most common data quality issues or anomalies.

Purpose of data observability

Data observability is a critical practice within data management aimed at ensuring that data pipelines and processes are transparent, reliable, and understandable. At its core, data observability provides organizations with the ability to monitor, understand, and troubleshoot their data infrastructure in real-time.

This includes monitoring data ingestion, transformation, and storage processes to detect anomalies, errors, or deviations from expected behavior. Moreover, data observability empowers data teams to proactively address potential issues before they escalate, thus minimizing the risk of data downtime, inaccuracies, or disruptions to business operations. By implementing robust data observability practices, companies can gain insights into the health and performance of their data pipelines, identify and resolve issues promptly, and maintain data integrity throughout the entire data lifecycle.

DQOps enables data observability by automatically activating selected data quality checks on data sources.

How DQOps activates data observability?

DQOps applies data observability by automatically activating data quality checks on monitored data sources.

Upon installation, DQOps creates Data quality policy configuration files. These policies automatically activate checks on all tables and columns in DQOps.

You can enable, disable or customize the existing policies, and also add a new policies. This customization can be done using the user interface or through the policies YAML files located in the patterns folder.

Automatic activation of checks

DQOps makes it easy to distinguish between automatically activated and manually activated data quality checks within the data quality check editor.

  • Automatically activated checks are pre-enabled by policies and appear with a light green toggle button.
  • Manually activated checks that have been manually activated are indicated by a darker green toggle button. When a check is manually activated on a table or column, DQOps uses the check's parameters from the .dqotable.yaml file instead of the configuration from a check policy file.

Deactivating a check on the table or column is also possible to prevent it from running.

Default data observability checks activated on table in DQOps

Data quality policies

The data quality policies, also called patterns, are configurations of data quality checks. These policies are configured in YAML files in the patterns folder. Every configuration is identified by a policy name and has a set of filters for target tables and columns on which DQOps applies these checks.

The default content of the patterns folder is shown below.

$DQO_USER_HOME
├───...
├───patterns
   ├───Detect big day-to-day data volume changes.dqotablepattern.yaml
   ├───Detect data freshness anomalies daily.dqotablepattern.yaml
   ├───Detect data volume anomalies.dqotablepattern.yaml
   ├───Detect empty tables daily.dqotablepattern.yaml
   ├───Detect table availability issues daily.dqotablepattern.yaml
   ├───Detect table schema changes.dqotablepattern.yaml
   ├───Detect anomalies in the count of distinct values.dqocolumnpattern.yaml
   ├───Detect anomalies in the percentage of null values.dqocolumnpattern.yaml
   ├───Detect anomalies in the sum and average of numeric values.dqocolumnpattern.yaml
   ├───Detect anomalies in the sum and average of numeric values at a partition level.dqocolumnpattern.yaml
   ├───Detect change of the data type of values in text columns.dqocolumnpattern.yaml
   ├───Detect column schema changes.dqocolumnpattern.yaml
   ├───Detect columns containing any null values.dqocolumnpattern.yaml
   ├───Detect empty columns.dqocolumnpattern.yaml
   ├───Detect outliers in numeric values.dqocolumnpattern.yaml
   ├───Detect outliers in numeric values across daily partitions.dqocolumnpattern.yaml
   ├───Detect significant changes in the percentage of null values.dqocolumnpattern.yaml
   ├───Profile text columns to detect PII values (sensitive data).dqocolumnpattern.yaml
   ├───Track the count and percentage of null values.dqocolumnpattern.yaml
   ├───Track volume of daily partitions.dqotablepattern.yaml
└───...   

Policy file format

The structure of the check policies YAML files is very similar to the structure of the .dqotable.yaml YAML files. The definition of the default table-level checks also has the profiling_checks, monitoring_checks, and partitioned_checks nodes.

The structure of the table-level <policy_name>.dqotablespattern.yaml file is shown below.

# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/TableDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_table_checks
spec:
  priority: 1000 #(1)!
  description: ...
  target: #(2)!
    connection: ...
    schema: ...
    table: ...
  profiling_checks: #(3)!
    ...
  monitoring_checks: #(4)!
    daily:
      ...
    monthly:
      ... 
  partitioned_checks: #(5)!
    daily:
      ...
    monthly:
      ... 
  1. The priority (order) of the pattern file. DQOps sorts the pattern files by the priority value is ascending order and activates the configuration from the files with the lowest priority first.
  2. An optional filter to select target tables or columns.
  3. The configuration of profiling checks.
  4. The configuration of monitoring checks.
  5. The configuration of partition checks.

The structure of the column-level <policy_name>.dqocolumnspattern.yaml file is very similar. The main differences are highlighted.

# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/ColumnDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_column_checks
spec:
  priority: 1000
  description: ...
  target:
    connection: ...
    schema: ...
    table: ...
  profiling_checks: ...
  monitoring_checks: ...
  partitioned_checks: ...

Policies priority

DQOps activates the policies on tables and columns according to priority. The configuration files with a lower priority are activated first. If an earlier priority file already had a configuration for some data quality check, it is not reconfigured by configuration from lower priority pattern files.

The priority field is shown below.

1
2
3
4
5
# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/TableDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_table_checks
spec:
  priority: 1000 #(1)!
  1. The priority (order) of activating the pattern files.

Observability status in UI

DQOps automatically activates monitoring checks to track various aspects of your data health. The Observability status screen provides a quick overview of any data quality problems for the selected table or column. This allows you to proactively identify potential issues before they impact downstream processes.

With that screen, you can analyze trends in schema changes, volume, and freshness anomalies at table level and distinct count and null percent anomalies at the column level. The screen also displays a summary of data quality issues for the table or columns of choice.

Table-level observability status

To access Observability status at the table level, click on the Monitoring checks or Partition checks sections, select the table of interest and select the Observability status tab.

The screen below provides an example of the Observability status screen at the table level in the Monitoring section.

Table-level Observability status

The number of elements on the screen depends on which checks have been activated. The 5 blocks at the top of the screen show the last 15 results from 5 checks in the schema category:

The results are displayed as squares with colors indicating the result of the run check:

  • Green for a correct result
  • Yellow for a warning
  • Orange for an error
  • Red for a fatal error
  • Black for execution error

Below are line graphs that display table freshness and volume anomalies:

Towards the bottom of the screen, there is a column chart that provides a summary of data quality issues for the selected table.

The Observability status screen at the table level in the Partition checks section displays a line graph showing Partition row count anomaly with data from the daily_partition_row_count_anomaly check and a column chart showing data quality issues.

Column-level observability status

To access Observability status at the column level, click on the Monitoring checks or Partition checks sections. Then, select the column of interest and select the Observability status tab.

The screen below provides an example of the Observability status screen at the column level in the Monitoring section.

Column-level Observability status

The two blocks at the top of the screen display the last 15 results from two checks in the schema category:

Below, there are line graphs that show anomalies in distinct count and nulls percent:

At the bottom of the screen, there is a column chart summarizing data quality issues for the selected column.

The Observability status screen at the column level in the Partition checks section displays two line graphs for Null percent anomaly (daily_partition_nulls_percent_anomaly check) and Distinct count anomaly (daily_partition_distinct_count_anomaly check), as well as a column chart showing data quality issues.

Target tables and columns

DQOps supports creating multiple data quality check configurations, named check policies or patterns. Each policy file contains an optional target table or target column filter for columns or tables on which the checks are activated.

The filters for the target table or the target column are defined in an optional target node. The filters support simple prefix and suffix filters, such as *, *_id, fact_*.

Default data quality check policy

The default policy files <policy_name>.dqocolumnpattern.yaml and <policy_name>.dqotablespattern.yaml do not have the configuration of the target table and column. They are applied to all tables and columns.

Target table

All supported filters for table-level check patterns are shown below. When a filter parameter is not configured, or the value is *, DQOps skips the filter parameter.

# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/TableDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_table_checks
spec:
  priority: 1000
  description:
  target:
    connection: "dwh_*"
    schema: "public"
    table: "fact_*"
    label: "prod"
    stage: "landing"
    table_priority: 3

The target table parameters are listed in the following table.

Filter parameter Description
connection The target connection name (data source name).
schema The target schema name.
table The target table name.
label This filter matches any label applied on the data source or the table.
stage This filter matches the stage field of a table.
table_priority This filter matches tables at a given priority or with a higher priority (which means a lower value of the priority field).

Target column

The configuration of target columns for column-level check patterns is similar. All table-level filters are supported. The additional filters are specific to columns.

# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/ColumnDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_column_checks
spec:
  priority: 1000
  description:
  target:
    connection: "dwh_*"
    schema: "public"
    table: "fact_*"
    label: "prod" #(1)!
    stage: "landing"
    table_priority: 3
    column: "*_id"
    data_type: "VARCHAR"
    data_type_category: string
  1. The label fields supports labels defined on the data source, table or column.

The target column parameters are listed in the following table.

Filter parameter Description
column The target column name.
data_type The physical data type of the target column, if it is stored in the type_snapshot.column_type field.
data_type_category The category of the data type detected by DQOps. DQOps detects a database independent category of the data type.

Targeting multiple data assets

All filters support targeting multiple objects, except the data_type_category parameter, which uses well-known values from an enumeration. Targeting multiple data assets, such as multiple connections, schemas, tables, columns, labels, or data types, is supported by providing all the target data names separated by a comma.

The following example shows how to target multiple tables.

1
2
3
4
5
6
7
8
# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/TableDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_table_checks
spec:
  priority: 1000
  description:
  target:
    table: "fact_sales,dim_pro*"

The following example shows how to target multiple columns.

1
2
3
4
5
6
7
8
# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/ColumnDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_column_checks
spec:
  priority: 1000
  description:
  target:
    column: "customer_id,product_id"

Deactivating the policy

The configurations of data quality policies can be disabled. DQOps does not apply the disabled policies. Each checks configuration file has a disabled boolean flag. The following examples show how to turn off a policy.

The following example shows how to disable a table-level policy.

1
2
3
4
5
6
7
8
9
# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/TableDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_table_checks
spec:
  priority: 1000
  disabled: true
  description:
  target:
    table: "fact_sales,dim_pro*"

The following example shows how to disable a column-level policy.

1
2
3
4
5
6
7
8
9
# yaml-language-server: $schema=https://cloud.dqops.com/dqo-yaml-schema/ColumnDefaultChecksPatternYaml-schema.json
apiVersion: dqo/v1
kind: default_column_checks
spec:
  priority: 1000
  disabled: true
  description:
  target:
    column: "customer_id,product_id"

Configuring check policies in UI

The configuration of the data quality check policies (patterns) in DQOps is found in the Data quality policies node of the Configuration section.

Table-level check policies

The list of data quality policies at the table level is located under the Table-level quality policies' node. The screen displays the table-level policies. The toggle buttons on the left side of the Quality policy name are used to activate or deactivate the policy. The list also shows connection, schema, and table search filters if they were set. You can add a new policy, by clicking on the Add data quality policy button.

List of table-level default data quality check policies in DQOps

Clicking on the name of the policy or the Modify action button allows to modify the policy configuration. The search filter for the target patterns is configured on the Target table tab.

Table-level policy configuration in DQOps

The Profiling, Monitoring, and Partition tabs show a data quality check configuration editor. The following screenshot shows the configuration of the Detect big day-to-day data volume changes policy with activated daily_row_count_change check.

Detect big day-to-day data volume changes policy in DQOps

Column-level check policies

The list of data quality check policies at the column level is located under the Column-level checks patterns node. The screen displays the column-level policies. The toggle buttons on the left side of the Quality policy name are used to activate or deactivate the policy. The list also shows connection, schema, and table search filters if they were set. You can add a new policy, by clicking on the Add data quality policy button.

List of column-level default data quality check policies in DQOps

Clicking on the name of the policy or the Modify action button allows to modify the policy configuration. The search filter for the target patterns is configured on the Target table tab.

Target column filters pattern for activating data quality checks in DQOps Detect anomalies in the count of distinct values

The Profiling, Monitoring, and Partition tabs show a data quality check configuration editor. The following screenshot shows the default configuration of the Detect anomalies in the count of distinct values policy with activated daily_distinct_count_anomaly check.

Detect anomalies in the count of distinct values policy in DQOps

Table-level policies

The policies that DQOps can activate on all tables are described below.

The configuration can be changed using a policy editor screen in the Configuration section of the DQOps user interface. or by editing the patterns/<policy_name>.dqotablepattern.yaml file directly or

Detect big day-to-day data volume changes

Monitors data volume of the whole table daily and raises an issue when the volume has increased of decreased significantly.

This policy is disabled by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
volume daily_row_count_change Detects significant changes in the table volume measured as a percentage increase or decrease in the row count. Raises a warning severity issue when the increase or decrease in row count since yesterday (or the last known row count) is above 10%.

Detect data freshness anomalies daily

Monitors data freshness anomalies daily.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
timeliness daily_data_freshness_anomaly Measures the freshness of the table and detects anomalous (outstanding) delays in data freshness. The table must be properly configured to support timeliness checks. Raises a warning severity issue when the data is outdated and the delay is in the top 1% of the longest delays.

Detect data volume anomalies

Monitors data volume of the whole table (using daily monitoring checks) and for each daily partition, using daily partition checks.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
volume daily_row_count_anomaly Detects anomalies in table volume (table's row count). Identifies the most significant volume increases or decreases since the previous day or the last known row count. Raises a warning severity issue when the increase or decrease in row count is in the top 1% of the biggest day-to-day changes.
volume daily_partition_row_count_anomaly Detects anomalies in volume of daily partitions (table's row count). Identifies the most significant volume increases or decreases since the previous day or the last known row count. Raises a warning severity issue when the increase or decrease in row count is in the top 1% of the biggest day-to-day changes.

Detect empty tables daily

Detects empty tables using daily monitoring checks.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
volume daily_row_count Captures the daily row count of the table and identifies empty tables. Raises a warning severity issue when an empty table is detected.

Detect table availability issues daily

Monitors table availability issues daily

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
availability daily_table_availability Verifies the availability of the table by running a simple query. Detects when the table is missing or not accessible due to invalid credentials or missing permission issues Raises a warning severity issue when the table is not available, the table is missing, or the credentials are outdated.

Detect table schema changes

Monitors the table schema and raises issues when the schema of the table was changed.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
schema daily_column_count_changed Monitors the table's schema and detects when the column count has changed since the previous day or the last known column count. Raises a warning severity issue when the column count has changed since the last known column count value.
schema daily_column_list_changed Monitors the table's schema and detects when the list of columns has changed. Detects that new columns were added or removed since the previous day or the last known column list but does not care about the order of columns. Raises a warning severity issue when any columns are added or removed.
schema daily_column_list_or_order_changed Monitors the table's schema and detects when the list or order of columns has changed. Detects that new columns were added, reordered or removed since the previous day or the last known column list. Raises a warning severity issue when any columns are added or removed or any column changes the position in the table.
schema daily_column_types_changed Monitors the table's schema and detects when the physical data types of columns have changed. It also detects whether columns were added or removed since the previous day or the last known column list. This check does not care about the order of columns. Raises a warning severity issue when any columns are added or removed or the physical data type of any column is changed.

Track volume of daily partitions

Monitors volume (row count) of daily partitions.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
volume daily_partition_row_count Captures the volume (row count) of daily partitions. The partition volume can be reviewed on the DQOps data quality dashboards. no rules (use the dashboards to review the results)

Column-level policies

The policies that DQOps can activate on all columns are described below.

The configuration can be changed using a policy editor screen in the Configuration section of the DQOps user interface or by editing the patterns/<policy_name>.dqocolumnpattern.yaml files directly.

Detect anomalies in the count of distinct values

Monitors the count of distinct values in a column and raises an issue when an anomaly is detected.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
uniqueness daily_distinct_count_anomaly Detects anomalies in the count of distinct (unique) values. Identifies the most significant increases or decreases in the count of distinct values since the previous day or the last known value. Raises a warning severity issue when the increase or decrease in the count of distinct values is in the top 1% of the most significant day-to-day changes.
uniqueness daily_partition_distinct_count_anomaly Detects anomalies in the count of distinct (unique) values of daily partitions. Identifies the most significant increases or decreases in the count of distinct values since the previous day or the last known value. Raises a warning severity issue when the increase or decrease in the count of distinct values is in the top 1% of the most significant day-to-day changes.

Detect anomalies in the percentage of null values

Monitors the scale of null values in columns and raises an issue when the day-to-day change is significant.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
nulls daily_nulls_percent_anomaly Detects anomalies in the percentage of null values. Identifies the most significant increases or decreases in the rate of null values since the previous day or the last known value. Raises a warning severity issue when the increase or decrease in the percentage of nulls is in the top 1% of the biggest day-to-day changes.
nulls daily_partition_nulls_percent_anomaly Detects anomalies in the percentage of null values in daily partitions. Identifies the most significant increases or decreases in the rate of null values since the previous day or the last known value. Raises a warning severity issue when the increase or decrease in the percentage of nulls is in the top 1% of the biggest day-to-day changes.

Detect anomalies in the sum and average of numeric values

Monitors the sum and average (mean) aggregated values of numeric values and raises a data quality issue when the value changes too much day-to-day.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
anomaly daily_sum_anomaly Detects anomalies in the sum of numeric values. Identifies the most significant increases or decreases in the sum of values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the sum of numeric values is in the top 1% of the most significant day-to-day changes.
anomaly daily_mean_anomaly Detects anomalies in the mean (average) of numeric values. Identifies the most significant increases or decreases in the mean of values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the mean of numeric values is in the top 1% of the most significant day-to-day changes.

Detect anomalies in the sum and average of numeric values at a partition level

Monitors the sum and average (mean) aggregated values of numeric values and raises a data quality issue when the value changes too much between daily partitions.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
anomaly daily_partition_sum_anomaly Detects anomalies in the sum of numeric values in daily partitions. Identifies the most significant increases or decreases in the sum of values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the sum of numeric values in daily partition is in the top 1% of the most significant day-to-day changes.
anomaly daily_partition_mean_anomaly Detects anomalies in the mean (average) of numeric values in daily partitions. Identifies the most significant increases or decreases in the mean of values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the mean of numeric values in daily partition is in the top 1% of the most significant day-to-day changes.

Detect change of the data type of values in text columns

Detects when the values stored in a text column change their type. This policy should be activated on raw tables in the landing zones for table that store all values (also numeric an dates) in text columns.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
datatype daily_detected_datatype_in_text_changed Analyzes values in text columns to detect if all values are convertible to the same data type (boolean, numeric, date, etc). DQOps activates this check only on text columns. Raises a warning severity issue when the values found in a text column are in a different format or a new value that is not convertible to the previously detected data type is found. For example, the column customer_id in the landing zone table always contained integer values, and a non-numeric value appeared.
datatype daily_partition_detected_datatype_in_text_changed Analyzes values in text columns to detect if all values are convertible to the same data type (boolean, numeric, date, etc). DQOps activates this check only on text columns. Raises a warning severity issue when the values found in a text column are in a different format or a new value that is not convertible to the previously detected data type is found. For example, the column customer_id in the landing zone table always contained integer values, and a non-numeric value appeared.

Detect column schema changes

Monitors the schema of columns registered in DQOps. Raises a data quality issue when the column is missing, or its data has changed.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
schema daily_column_exists Verifies that the column exists in the monitored table. Raises a warning severity issue when the column is missing.
schema daily_column_type_changed Detects changes to the physical data type of the column since the last known data type. DQOps detects a change in the data type, length, precision, scale, and nullability. Raises a warning severity issue when the physical data type of the column is changed since the last known data type.

Detect columns containing any null values

Detects columns containing any null values using both monitoring checks and daily partitioned checks.

This policy is disabled by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
nulls daily_nulls_count Counts null values in a monitored column. Detects partially incomplete columns that contain any null values. Raises a warning severity issue when an empty row is detected in a table.
nulls daily_partition_nulls_count Counts null values in a monitored column in daily partition. Detects partially incomplete columns that contain any null values. Raises a warning severity issue when an empty row is detected in a table.

Detect empty columns

Detects empty columns using both monitoring checks an daily partitioned checks.

This policy is disabled by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
nulls daily_empty_column_found Detects empty columns that contain only null values. Raises a warning severity issue when an empty column is found.
nulls daily_partition_empty_column_found Detects empty columns that contain only null values. Raises a warning severity issue when an empty column is found.

Detect outliers in numeric values

Monitors numeric columns to detect new smallest (min) or biggest (max) value, which must be an anomaly.

This policy is disabled by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
anomaly daily_min_anomaly Detects anomalies in the minimum numeric values. Identifies the most significant increases or decreases in the min values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the minimum of numeric values is in the top 1% of the most significant day-to-day changes.
anomaly daily_max_anomaly Detects anomalies in the maximum numeric values. Identifies the most significant increases or decreases in the max values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the maximum of numeric values is in the top 1% of the most significant day-to-day changes.

Detect outliers in numeric values across daily partitions

Monitors numeric columns to detect new smallest (min) or biggest (max) value for each daily partition. Raises a data quality issue when the partition contains a big or small value that exceeds regular ranges.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
anomaly daily_partition_min_anomaly Detects anomalies in the minimum numeric values. Identifies the most significant increases or decreases in the min values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the minimum of numeric values is in the top 1% of the most significant day-to-day changes.
anomaly daily_partition_max_anomaly Detects anomalies in the maximum numeric values. Identifies the most significant increases or decreases in the max values since the previous day or the last known value. DQOps activates this check only on numeric columns. Raises a warning severity issue when the increase or decrease in the maximum of numeric values is in the top 1% of the most significant day-to-day changes.

Detect significant changes in the percentage of null values

Monitors the percentage of null values in columns and raises an issue when the day-to-day change is above a threshold.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
nulls daily_nulls_percent_change Detects significant changes in the percentage of null values measured as a percentage increase or decrease in the rate of null values compared to the previous day. Raises a warning severity issue when the increase or decrease in the percentage of nulls since yesterday (or the last known nulls percentage) is above 10%.
nulls daily_partition_nulls_percent_change Detects significant changes in the percentage of null values in daily partitions measured as a percentage increase or decrease in the rate of null values compared to the previous day. Raises a warning severity issue when the increase or decrease in the percentage of nulls since yesterday (or the last known nulls percentage) is above 10%.

Profile text columns to detect PII values (sensitive data)

Activates data profiling checks on all text columns to detect if they contain sensitive data (emails, phone numbers). Enabling this policy allows the data quality rule miner to set up PII checks when sensitive values are identified.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
pii profile_contains_usa_phone_percent Detects USA phone numbers inside text columns. no rules
pii profile_contains_email_percent Detects emails inside text columns. no rules

Track the count and percentage of null values

Monitors the count and the percentage of null values without raising data quality issues.

This policy is activated by default.

The table below provides a description of the checks activated by this policy.

Category Data quality check Description Data quality rule
nulls daily_nulls_count Counts null values in a monitored column. Detects partially incomplete columns that contain any null values. no rules (use the dashboards to review the results)
nulls daily_nulls_percent Measures the percentage of null values in a column. no rules (use the dashboards to review the results)
nulls daily_partition_nulls_count Counts null values in a monitored column in daily partitions. Detects partially incomplete columns that contain any null values. no rules (use the dashboards to review the results)
nulls daily_partition_nulls_percent Measures the percentage of null values in a column in daily partitions. no rules (use the dashboards to review the results)

Next steps