Skip to content

Last updated: July 22, 2025

DQOps YAML file definitions

The definition of YAML files used by DQOps to configure the data sources, monitored tables, and the configuration of activated data quality checks.

RuleDefinitionYaml

Custom rule specification that describes the configuration of a python module with the rule code (additional parameters).

The structure of this object is described below

 Property name   Description                       Data type   Enum values   Default value   Sample values 
api_version DQOps YAML schema version string dqo/v1
kind File type enum source
table
sensor
provider_sensor
rule
check
settings
file_index
connection_similarity_index
dashboards
default_schedules
default_checks
default_table_checks
default_column_checks
default_notifications
rule
spec Custom data quality rule specification object with definition of a custom rule RuleDefinitionSpec

RuleDefinitionSpec

Custom data quality rule specification. Provides the custom rule configuration. For example, rules that require a range of historic values will have this configuration.

The structure of this object is described below

 Property name   Description                       Data type   Enum values   Default value   Sample values 
type Rule runner type enum python
java_class
java_class_name Java class name for a rule runner that will execute the sensor. The "type" must be "java_class". string
mode Rule historic (past) values mode. A rule may require just the current sensor readout or use sensor readouts from past periods to perform prediction. The number of time windows is configured in the time_window setting. enum current_value
previous_readouts
time_window Rule time window configuration when the mode is previous_readouts. Configures the number of past time windows (sensor readouts) that are passes as a parameter to the rule. For example, to calculate the average or perform prediction on historic data. RuleTimeWindowSettingsSpec
fields List of fields that are parameters of a custom rule. Those fields are used by the DQOps UI to display the data quality check editing screens with proper UI controls for all required fields. ParameterDefinitionsListSpec
parameters Additional rule parameters Dict[string, string]

RuleTimeWindowSettingsSpec

Rule historic data configuration. Specifies the number of past values for rules that are analyzing historic data.

The structure of this object is described below

 Property name   Description                       Data type   Enum values   Default value   Sample values 
prediction_time_window Number of historic time periods to look back for results. Returns results from previous time periods before the sensor readout timestamp to be used in a rule. Time periods are used in rules that need historic data to calculate an average to detect anomalies. e.g. when the sensor is configured to use a 'day' time period, the rule will receive results from the time_periods number of days before the time period in the sensor readout. The default is 14 (days). integer
min_periods_with_readouts Minimum number of past time periods with a sensor readout that must be present in the data in order to call the rule. The rule is not called and the sensor readout is discarded as not analyzable (not enough historic data to perform prediction) when the number of past sensor readouts is not met. The default is 7. integer
historic_data_point_grouping Time period grouping for collecting previous data quality sensor results for the data quality rules that use historic data for prediction. For example, when the default time period grouping 'day' is used, DQOps will find the most recent data quality sensor readout for each day and pass an array of most recent days per day in an array of historic sensor readout data points to a data quality rule for prediction. enum year
quarter
month
week
day
hour
last_n_readouts