Skip to content

incidents

incidents

The data quality incidents table that tracks open incidents. Incidents are grouping multiple failed data quality checks (stored in the check_results table). The check results that are part of an incident could be matched to incidents by the incident_hash column. The incidents are stored in the errors table is located in the $DQO_USER_HOME/.data/incidents folder that contains uncompressed parquet files. The table is partitioned using a Hive compatible partitioning folder structure. When the $DQO_USER_HOME is not configured, it is the folder where DQO was started (the DQO user's home folder).

The folder partitioning structure for this table is: c=[connection_name]/m=[first_day_of_month]/, for example: c=myconnection/m=2023-01-01/.

The columns of this table is described below

Column name Description Data type
id The incident id (primary key), it is a UUID created from a hash of target affected by the incident (target_hash) and a first_seen_utc. This value identifies a single row. text
incident_hash The hash of the incident. long
schema_name The table schema. text
table_name The table name. text
table_priority The table priority. integer
data_group_name The data group name, it is a concatenated name of the data group dimension values, created from [grouping_level_1] / [grouping_level_2] / ... text
quality_dimension The data quality dimension. text
check_category The check category. text
check_type The check type (profiling, checkpoint, partitioned). text
check_name The check name. text
highest_severity The highest data quality check result severity detected as part of this incident. The values are 0, 1, 2, 3 for none, warning, error and fatal severity alerts. integer
minimum_severity Minimum severity of data quality issues (data quality check results) that are included in the incident. It is copied from the incident configuration at a connection or table level at the time when the incident is first seen. The values are 0, 1, 2, 3 for none, warning, error and fatal severity alerts. integer
first_seen Stores the exact time when the incident was raised (seen) for the first time, as a UTC timestamp: first_seen. instant
last_seen Stores the exact time when the incident was raised (seen) for the last time, as a UTC timestamp: last_seen. instant
incident_until Stores the timestamp of the end of the incident when new issues will not be appended to this incident, as a UTC timestamp: incident_until. instant
failed_checks_count Stores the number of checks that failed. integer
issue_url Stores the user provided url to an external ticket management platform that is tracking this incident. text
resolved_by Stores the login of the user who resolved the incident. text
status Stores the current status of the incident. The statuses are described in the {@link IncidentStatus IncidentStatus} enumeration. text
created_at The timestamp when the row was created at. instant
updated_at The timestamp when the row was updated at. instant
created_by Stores the login of the user who created the incident by running a check. text
updated_by The login of the user that updated the row. text