Last updated: July 22, 2025
DQOps YAML file definitions
The definition of YAML files used by DQOps to configure the data sources, monitored tables, and the configuration of activated data quality checks.
RuleDefinitionYaml
Custom rule specification that describes the configuration of a python module with the rule code (additional parameters).
The structure of this object is described below
Property name | Description | Data type | Enum values | Default value | Sample values |
---|---|---|---|---|---|
api_version |
DQOps YAML schema version | string | dqo/v1 | ||
kind |
File type | enum | source table sensor provider_sensor rule check settings file_index connection_similarity_index dashboards default_schedules default_checks default_table_checks default_column_checks default_notifications |
rule | |
spec |
Custom data quality rule specification object with definition of a custom rule | RuleDefinitionSpec |
RuleDefinitionSpec
Custom data quality rule specification. Provides the custom rule configuration. For example, rules that require a range of historic values will have this configuration.
The structure of this object is described below
Property name | Description | Data type | Enum values | Default value | Sample values |
---|---|---|---|---|---|
type |
Rule runner type | enum | python java_class |
||
java_class_name |
Java class name for a rule runner that will execute the sensor. The "type" must be "java_class". | string | |||
mode |
Rule historic (past) values mode. A rule may require just the current sensor readout or use sensor readouts from past periods to perform prediction. The number of time windows is configured in the time_window setting. | enum | current_value previous_readouts |
||
time_window |
Rule time window configuration when the mode is previous_readouts. Configures the number of past time windows (sensor readouts) that are passes as a parameter to the rule. For example, to calculate the average or perform prediction on historic data. | RuleTimeWindowSettingsSpec | |||
fields |
List of fields that are parameters of a custom rule. Those fields are used by the DQOps UI to display the data quality check editing screens with proper UI controls for all required fields. | ParameterDefinitionsListSpec | |||
parameters |
Additional rule parameters | Dict[string, string] |
RuleTimeWindowSettingsSpec
Rule historic data configuration. Specifies the number of past values for rules that are analyzing historic data.
The structure of this object is described below
Property name | Description | Data type | Enum values | Default value | Sample values |
---|---|---|---|---|---|
prediction_time_window |
Number of historic time periods to look back for results. Returns results from previous time periods before the sensor readout timestamp to be used in a rule. Time periods are used in rules that need historic data to calculate an average to detect anomalies. e.g. when the sensor is configured to use a 'day' time period, the rule will receive results from the time_periods number of days before the time period in the sensor readout. The default is 14 (days). | integer | |||
min_periods_with_readouts |
Minimum number of past time periods with a sensor readout that must be present in the data in order to call the rule. The rule is not called and the sensor readout is discarded as not analyzable (not enough historic data to perform prediction) when the number of past sensor readouts is not met. The default is 7. | integer | |||
historic_data_point_grouping |
Time period grouping for collecting previous data quality sensor results for the data quality rules that use historic data for prediction. For example, when the default time period grouping 'day' is used, DQOps will find the most recent data quality sensor readout for each day and pass an array of most recent days per day in an array of historic sensor readout data points to a data quality rule for prediction. | enum | year quarter month week day hour last_n_readouts |