Delete data quality results
In DQO, there are two ways to delete stored data quality results:
- using the graphical interface
- using DQO Shell
In DQO, sensor readouts and check results are stored as Apache Parquet files following the Apache Hive compatible folder tree, partitioned by connection name, table name, and month.
For more information where and how the sensor readouts and check results are stored, see DQO concepts section.
You can learn how to run data quality checks here.
Delete data quality results using the graphical interface
Using graphical interface you can delete data quality results at the connection, table, column, group of checks or individual check level.
To delete the data quality results at the connection, table or column level follow the steps below
-
In DQO graphical interface, click on the three-dot icon next to the name of a connection, table or column in the tree view.
-
From the drop-down menu select the Delete data command.
-
In the modal window, you can specify whether you want to delete all data, data for a specific time range or a specific category.
-
Click the Delete button to delete the selected data.
Delete data quality results using the DQO Shell
To delete data quality results using the DQO Shell, use the data delete command.
To delete all the data for a connection run the following command
Type the name of the connection you want to delete e.g.
A summary of deleted data similar to the following table will be displayed.
3 affected partitions.
+--------------------+--------------+------------------+----------+-------------+-----------------+
|Data type |Connection |Table |Month |Affected rows|Partition deleted|
+--------------------+--------------+------------------+----------+-------------+-----------------+
|data_sensor_readouts|testconnection|austin_crime.crime|2023-05-01|2 |true |
+--------------------+--------------+------------------+----------+-------------+-----------------+
|data_check_results |testconnection|austin_crime.crime|2023-05-01|2 |true |
+--------------------+--------------+------------------+----------+-------------+-----------------+
|data_statistics |testconnection|austin_crime.crime|2023-05-01|131 |true |
+--------------------+--------------+------------------+----------+-------------+-----------------+
Using various parameters, you can limit the data that will be deleted to a specific table, column, time period, data type,
check name, check category type, and more. For a full description of the data delete
command and its parameters, see the Command-line interface section.